; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G019860 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G019860
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionYqaJ domain-containing protein
Genome locationchr02:25787764..25810695
RNA-Seq ExpressionLsi02G019860
SyntenyLsi02G019860
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR011335 - Restriction endonuclease type II-like
IPR011604 - Exonuclease, phage-type/RecB, C-terminal
IPR019080 - YqaJ viral recombinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456487.1 PREDICTED: uncharacterized protein LOC103496427 [Cucumis melo]6.6e-9488.36Show/hide
Query:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW
        MFN KKI FAC+QAIGN S+RNF+S S SSL+FETGNH SVLQSSSFQHWFKNW+ELRK+KLTASTFAGAIGFWPRRRAQLWLEKLGAI+ FCGNLATCW
Subjt:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW

Query:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        SNMKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GDM+ ASPWS+VP YCIPQAQG
Subjt:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

XP_011659114.2 uncharacterized protein LOC101215512 [Cucumis sativus]1.2e-9588.89Show/hide
Query:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW
        MFNCKKI FAC+QAIGN S+RNF+S SSSSL+FET NH SVLQS+SFQHWFKNW+ELRK+KLTASTFAGAIGFWPRRR QLWLEKLGAIDQFCGNLATCW
Subjt:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW

Query:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        SNMKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKM+YGLPS+GVLEIKCPFF+GDM+ ASPWSRVPLYCIPQAQG
Subjt:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

XP_022981568.1 uncharacterized protein LOC111480647 [Cucurbita maxima]9.5e-9388.42Show/hide
Query:  MFNCKKIFACNQAIGNWSLRNFHSA--SSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATC
        MFNCK IFA  Q I NWS  NF+SA  SSSS KFETGN+ SVLQSSSFQHWFKNWKELRK+KLTASTFAGAIGFWPRRRAQLWLEKLGAIDQF GNLATC
Subjt:  MFNCKKIFACNQAIGNWSLRNFHSA--SSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATC

Query:  WSNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        WSNMKEEEALERYKLITGNSVLFP+FQVYGK NS  DWLAASPDG IDKMVYGLPSRGVLEIKCPFFDGDM+KASPWSRVPLYCIPQ QG
Subjt:  WSNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

XP_031742491.1 uncharacterized protein LOC101207616 [Cucumis sativus]3.0e-9488.89Show/hide
Query:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW
        MFN KKI FAC+QAIGN SLRNF+S S SSL+FETGNH SVLQSSSFQHWFKNW+ELRK+KLTASTFAGAIGFWPRRR QLWLEKLGAIDQFCGNLATCW
Subjt:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW

Query:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        SNMKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GD++ A PWSRVP YCIPQAQG
Subjt:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

XP_038900094.1 uncharacterized protein LOC120087241 [Benincasa hispida]9.5e-10192.06Show/hide
Query:  MFNCKKIFACNQAIGNWSLRNFHS-ASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW
        MFNCKKIFAC+QAIGNWS+RNF+S ASSS+LKFETGNH S+LQSSSFQHWFKNW ELRK+KLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW
Subjt:  MFNCKKIFACNQAIGNWSLRNFHS-ASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW

Query:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        SNMKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKM+YGLPSRGVLEIKCPFFDGDM+KASPWSR+PLYCIPQAQG
Subjt:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

TrEMBL top hitse value%identityAlignment
A0A0A0K4T5 YqaJ domain-containing protein3.1e-9789.06Show/hide
Query:  KPTMFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLA
        K +MFNCKKI FAC+QAIGN S+RNF+S SSSSL+FET NH SVLQS+SFQHWFKNW+ELRK+KLTASTFAGAIGFWPRRR QLWLEKLGAIDQFCGNLA
Subjt:  KPTMFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLA

Query:  TCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        TCWSNMKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GDM+ ASPWSRVPLYCIPQAQG
Subjt:  TCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

A0A0A0KG47 YqaJ domain-containing protein1.4e-9488.89Show/hide
Query:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW
        MFN KKI FAC+QAIGN SLRNF+S S SSL+FETGNH SVLQSSSFQHWFKNW+ELRK+KLTASTFAGAIGFWPRRR QLWLEKLGAIDQFCGNLATCW
Subjt:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW

Query:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        SNMKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GD++ A PWSRVP YCIPQAQG
Subjt:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

A0A1S3C4M6 uncharacterized protein LOC1034964273.2e-9488.36Show/hide
Query:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW
        MFN KKI FAC+QAIGN S+RNF+S S SSL+FETGNH SVLQSSSFQHWFKNW+ELRK+KLTASTFAGAIGFWPRRRAQLWLEKLGAI+ FCGNLATCW
Subjt:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW

Query:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        SNMKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GDM+ ASPWS+VP YCIPQAQG
Subjt:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

A0A5A7SZ18 DNA-directed RNA polymerase subunit beta3.2e-9488.36Show/hide
Query:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW
        MFN KKI FAC+QAIGN S+RNF+S S SSL+FETGNH SVLQSSSFQHWFKNW+ELRK+KLTASTFAGAIGFWPRRRAQLWLEKLGAI+ FCGNLATCW
Subjt:  MFNCKKI-FACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCW

Query:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        SNMKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GDM+ ASPWS+VP YCIPQAQG
Subjt:  SNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

A0A6J1J2F9 uncharacterized protein LOC1114806474.6e-9388.42Show/hide
Query:  MFNCKKIFACNQAIGNWSLRNFHSA--SSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATC
        MFNCK IFA  Q I NWS  NF+SA  SSSS KFETGN+ SVLQSSSFQHWFKNWKELRK+KLTASTFAGAIGFWPRRRAQLWLEKLGAIDQF GNLATC
Subjt:  MFNCKKIFACNQAIGNWSLRNFHSA--SSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATC

Query:  WSNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
        WSNMKEEEALERYKLITGNSVLFP+FQVYGK NS  DWLAASPDG IDKMVYGLPSRGVLEIKCPFFDGDM+KASPWSRVPLYCIPQ QG
Subjt:  WSNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

SwissProt top hitse value%identityAlignment
P06599 Extensin1.7e-0446.6Show/hide
Query:  PREVFTSRSPPLP---PPKKDKYDHSPPPPPTK-YEYKSPPPP-----PPKKDKYDLPPLPP--------TKYKYKSPPPPPPKKDKYDSPLTPPTKYDY
        P  V+  +SPP P   P     Y +  PPPPT  Y+YKSPPPP     P    KY  PP P          KYKYKSPPPP P   KY SP  P   Y Y
Subjt:  PREVFTSRSPPLP---PPKKDKYDHSPPPPPTK-YEYKSPPPP-----PPKKDKYDLPPLPP--------TKYKYKSPPPPPPKKDKYDSPLTPPTKYDY

Query:  KSP
        KSP
Subjt:  KSP

Q9FLQ7 Formin-like protein 203.0e-0450.7Show/hide
Query:  SPPLPPPKKDKYDHSPPPPPTKYEYKSPPPPPPKKDKY-DLPPLPPTKYKYKSPPPPPPKKDKYDSPLTPP
        SPP PPP    Y   PPPPP    Y SPPPPPP    Y   PP PP    Y SPPPPPP    + S + PP
Subjt:  SPPLPPPKKDKYDHSPPPPPTKYEYKSPPPPPPKKDKY-DLPPLPPTKYKYKSPPPPPPKKDKYDSPLTPP

Arabidopsis top hitse value%identityAlignment
AT1G13810.1 Restriction endonuclease, type II-like superfamily protein3.1e-4152.32Show/hide
Query:  SVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDD-WL
        +V+ +    HW KNW++LRK +LTAS FA AIGF P  R  LWLEK+GA   F GN AT W    E EALERY  +TGN +L P+F VY    S ++ WL
Subjt:  SVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSGDD-WL

Query:  AASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
         ASPDG I+ +  G+ S GVLE+KCPF + D  K  PW +VP  C+PQ QG
Subjt:  AASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

AT1G26250.1 Proline-rich extensin-like family protein1.2e-0550Show/hide
Query:  STIPREVFTSRSPPLPP-----PKKDKYDHSPPPPPTKYEYKSPPPPPPKKDKYDLPPLPPTKYKYKSPPPPPPKKDKYDSPLTPPTKYDYKSP
        S+ P   +   SPP PP     P    Y +SPPPPP  Y Y+SPPPPP     Y  PP PP  Y YKSPPPPP     Y SP  PP  Y YKSP
Subjt:  STIPREVFTSRSPPLPP-----PKKDKYDHSPPPPPTKYEYKSPPPPPPKKDKYDLPPLPPTKYKYKSPPPPPPKKDKYDSPLTPPTKYDYKSP

AT1G67660.1 Restriction endonuclease, type II-like superfamily protein1.7e-2339.35Show/hide
Query:  SVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWP-RRRAQLWLEKL----GAIDQFCGNLATCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSG
        S+L  S      + W  LRK KLT STF+ A+GFW   RRA+LW EK+      + +     A  W    E  A+ERYK I G  V    F ++  +N  
Subjt:  SVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWP-RRRAQLWLEKL----GAIDQFCGNLATCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSG

Query:  DDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
          WL ASPDG +D         G+LE+KCP+  G  +   PW +VP Y +PQ QG
Subjt:  DDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

AT1G67660.2 Restriction endonuclease, type II-like superfamily protein1.7e-2339.35Show/hide
Query:  SVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWP-RRRAQLWLEKL----GAIDQFCGNLATCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSG
        S+L  S      + W  LRK KLT STF+ A+GFW   RRA+LW EK+      + +     A  W    E  A+ERYK I G  V    F ++  +N  
Subjt:  SVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWP-RRRAQLWLEKL----GAIDQFCGNLATCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSG

Query:  DDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
          WL ASPDG +D         G+LE+KCP+  G  +   PW +VP Y +PQ QG
Subjt:  DDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG

AT1G67660.3 Restriction endonuclease, type II-like superfamily protein1.7e-2339.35Show/hide
Query:  SVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWP-RRRAQLWLEKL----GAIDQFCGNLATCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSG
        S+L  S      + W  LRK KLT STF+ A+GFW   RRA+LW EK+      + +     A  W    E  A+ERYK I G  V    F ++  +N  
Subjt:  SVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWP-RRRAQLWLEKL----GAIDQFCGNLATCWSNMKEEEALERYKLITGNSVLFPKFQVYGKANSG

Query:  DDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG
          WL ASPDG +D         G+LE+KCP+  G  +   PW +VP Y +PQ QG
Subjt:  DDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAAACCAACAATGTTCAATTGCAAAAAGATATTTGCCTGTAATCAAGCTATTGGGAACTGGTCTTTGCGTAATTTCCATTCTGCTTCTTCTTCTTCTTTAAAATT
TGAAACGGGCAATCACTGTTCTGTTCTTCAGTCCAGTAGTTTTCAGCATTGGTTCAAAAATTGGAAAGAGCTTCGAAAGTATAAGTTAACAGCAAGTACTTTTGCTGGGG
CAATTGGGTTTTGGCCTCGTCGAAGGGCTCAATTGTGGTTAGAGAAACTTGGGGCAATTGACCAATTTTGTGGTAATCTTGCTACTTGTTGGAGTAACATGAAAGAAGAA
GAAGCACTTGAAAGATACAAGCTTATTACTGGAAACTCTGTTTTGTTTCCTAAATTTCAAGTCTATGGGAAAGCAAACTCTGGAGATGATTGGTTGGCTGCTTCACCCGA
TGGTACAATTGATAAGATGGTTTATGGATTGCCTTCACGAGGTGTATTGGAGATTAAGTGCCCATTTTTTGATGGTGATATGAAAAAGGCTTCACCATGGTCACGAGTTC
CTCTTTACTGTATTCCTCAGGCTCAAGGTTCAGAGCTTGAAGGAGTTTATCTTAACATGAAAACTGGGGAATCTAACAACGATTTGAATGTTGTCAAACCAAGCACATCT
CGAGACATGGTAGCAGCGGTTAGCTTCAGTGTGCAGCCTGAAGGCCAAAGGGCCAGGGGAGTTATGGGTACCATGGAAAAGCAAGGATTATTTGGATCAATATTGAACGA
AGAAGAGAAAGAGAGGAACAGAGGATCTTTGATAGCAACAGTTCCCAATCTCCTCATGCGTCATAATGCTCTCGAGAAGACTAGACAACCCAACATTGTCTCACATTGCG
GGGTTTTTATAAAGTCAGTTTCTACGATCCCACGTGAAGTTTTTACTTCTAGGTCACCCCCTCTACCACCACCAAAAAAAGACAAATATGATCACTCGCCTCCACCGCCA
CCAACTAAGTACGAATACAAGTCACCCCCTCCACCGCCACCAAAGAAAGACAAATATGACTTGCCTCCACTACCACCAACTAAGTACAAGTACAAGTCACCCCCTCCACC
CCCACCAAAAAAAGACAAATATGATTCGCCTCTAACACCACCAACTAAGTACGATTACAAGTCACCCCTCCAGTACCATGGTCAAAGAAATCGCCTCCTCCACTACTGCC
AAAAAATTGGCCTTCTCCACCACCGTCACATAAGAAGTCTCCTCCACCACCACCGAAGAGAGACAGATACAAGTCGTCTCCTCCACCACAATCCCCTTACAACTATTAGG
TCAACAAGGGCTTCTCATCAACTATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGTAAACCAACAATGTTCAATTGCAAAAAGATATTTGCCTGTAATCAAGCTATTGGGAACTGGTCTTTGCGTAATTTCCATTCTGCTTCTTCTTCTTCTTTAAAATT
TGAAACGGGCAATCACTGTTCTGTTCTTCAGTCCAGTAGTTTTCAGCATTGGTTCAAAAATTGGAAAGAGCTTCGAAAGTATAAGTTAACAGCAAGTACTTTTGCTGGGG
CAATTGGGTTTTGGCCTCGTCGAAGGGCTCAATTGTGGTTAGAGAAACTTGGGGCAATTGACCAATTTTGTGGTAATCTTGCTACTTGTTGGAGTAACATGAAAGAAGAA
GAAGCACTTGAAAGATACAAGCTTATTACTGGAAACTCTGTTTTGTTTCCTAAATTTCAAGTCTATGGGAAAGCAAACTCTGGAGATGATTGGTTGGCTGCTTCACCCGA
TGGTACAATTGATAAGATGGTTTATGGATTGCCTTCACGAGGTGTATTGGAGATTAAGTGCCCATTTTTTGATGGTGATATGAAAAAGGCTTCACCATGGTCACGAGTTC
CTCTTTACTGTATTCCTCAGGCTCAAGGTTCAGAGCTTGAAGGAGTTTATCTTAACATGAAAACTGGGGAATCTAACAACGATTTGAATGTTGTCAAACCAAGCACATCT
CGAGACATGGTAGCAGCGGTTAGCTTCAGTGTGCAGCCTGAAGGCCAAAGGGCCAGGGGAGTTATGGGTACCATGGAAAAGCAAGGATTATTTGGATCAATATTGAACGA
AGAAGAGAAAGAGAGGAACAGAGGATCTTTGATAGCAACAGTTCCCAATCTCCTCATGCGTCATAATGCTCTCGAGAAGACTAGACAACCCAACATTGTCTCACATTGCG
GGGTTTTTATAAAGTCAGTTTCTACGATCCCACGTGAAGTTTTTACTTCTAGGTCACCCCCTCTACCACCACCAAAAAAAGACAAATATGATCACTCGCCTCCACCGCCA
CCAACTAAGTACGAATACAAGTCACCCCCTCCACCGCCACCAAAGAAAGACAAATATGACTTGCCTCCACTACCACCAACTAAGTACAAGTACAAGTCACCCCCTCCACC
CCCACCAAAAAAAGACAAATATGATTCGCCTCTAACACCACCAACTAAGTACGATTACAAGTCACCCCTCCAGTACCATGGTCAAAGAAATCGCCTCCTCCACTACTGCC
AAAAAATTGGCCTTCTCCACCACCGTCACATAAGAAGTCTCCTCCACCACCACCGAAGAGAGACAGATACAAGTCGTCTCCTCCACCACAATCCCCTTACAACTATTAGG
TCAACAAGGGCTTCTCATCAACTATAA
Protein sequenceShow/hide protein sequence
MRKPTMFNCKKIFACNQAIGNWSLRNFHSASSSSLKFETGNHCSVLQSSSFQHWFKNWKELRKYKLTASTFAGAIGFWPRRRAQLWLEKLGAIDQFCGNLATCWSNMKEE
EALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGSELEGVYLNMKTGESNNDLNVVKPSTS
RDMVAAVSFSVQPEGQRARGVMGTMEKQGLFGSILNEEEKERNRGSLIATVPNLLMRHNALEKTRQPNIVSHCGVFIKSVSTIPREVFTSRSPPLPPPKKDKYDHSPPPP
PTKYEYKSPPPPPPKKDKYDLPPLPPTKYKYKSPPPPPPKKDKYDSPLTPPTKYDYKSPLQYHGQRNRLLHYCQKIGLLHHRHIRSLLHHHRRETDTSRLLHHNPLTTIR
STRASHQL