; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0012896 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0012896
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr07:10545064..10547451
RNA-Seq ExpressionPay0012896
SyntenyPay0012896
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035966.1 F5J5.1 [Cucumis melo var. makuwa]4.0e-16348.28Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        +I VNGVSV KPEVDWT+AEEQAS+GN RALNAIFNGV LNVFKLINSCST KEA KTLEVAYEGTSKVKISRLQLITSKFEALRMTEDE VSDYNKRVL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        +IANESL+LGEKIPDSKIVRKVLRS+ RKFD+KV +IEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGI+FKSTHV++E   DT+ANM+ESI LL 
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC
        KQF+N ++  KN      NA     + RK  +N    +++  +  + D                        +D N+NAF + I++ ++ D SECS +  
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC

Query:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT
        +   + E+L+ LWKED EAR IQKE IQDL+EENE LMS                   LKSVKMLNSG +NLD IL +G NG  ++GLGF +SA     T
Subjt:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT

Query:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHR------------------------------------------------
        +EIKFVPAS+  + DT+     +  S K+     +YCGQKGHIR  CYKL +                                                
Subjt:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHR------------------------------------------------

Query:  -----------------------------------------------DILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCY
                                                       D+ YVDGLK NLIS++QLCDQGY V+F                  K  +DNCY
Subjt:  -----------------------------------------------DILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCY

Query:  HWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYD
        HW SN S+ C L + DQTWLWHRKL    ++ +++ +KN+ V+G+ ++DVN       +    +C         GK++V V  +D++R+ WV FLK K D
Subjt:  HWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYD

Query:  TPKVCISLCLILQREKGVKIVRIRSDH
          ++C +LCL LQREK  KI RIRSDH
Subjt:  TPKVCISLCLILQREKGVKIVRIRSDH

KAA0045252.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]3.9e-21986.13Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        MIIVNGVSVLKPEVD TD EEQASIGNARALNAIFNGV LNVFKLINSCST KEALKTLEVAYEGTSKVKISRLQLITSKFEAL MTEDEPVSDYNKRVL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        EIANESLML EKIPDSKIVRKVL+SLPR F LKVI+IEEAHDITTLKLDELFGSLLTFEMAT DRESKKGKGIAFK THVSEE VSDT+ANMNESIDLLI
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREY---------------------------KDTGDGEE
        KQFSNV+KKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCRE                            KDTGDGEE
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREY---------------------------KDTGDGEE

Query:  DNNMNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLM------------------STLKSVKMLNSGTENLD
        DN+MNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKEDFEARAIQKERIQDLMEENE+LM                   TLKSVKMLNSGTENLD
Subjt:  DNNMNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLM------------------STLKSVKMLNSGTENLD

Query:  VILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILY
        VILNSGQNGLN+HGLGFDTSARKINTTTEI FVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQK HIRPFCYKLHRDILY
Subjt:  VILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILY

KAA0053561.1 gag-pol polyprotein [Cucumis melo var. makuwa]9.3e-15247.21Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        MI VNGVSV KPEVDWTDAEEQA +GNARALNAIFNGV LNVFK                                                      VL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        EIANESL+LGEKIPDSKIVRKVLRSLPRKFD+KV +IEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFK THV+EE V D EANM+E I LL 
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC
        KQF+N ++  KN N T  N Q    Y+R++    TRR NEN +RR+  Y +KKE        +E  D+ D  +D N+NAF + I++ ++ D+SECS +  
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC

Query:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT
        +   + E+L  LWKED EARAIQK+RIQDL+EENE+LMS                   LKSVKMLNSGT+NLD IL +  NG +++GLGF  SA     T
Subjt:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT

Query:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIR-----------------------------------PFCYKLH--------------
        +EIKFV AS+  + DT+ +   +    K+    C+YCG+KGHIR                                     C   H              
Subjt:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIR-----------------------------------PFCYKLH--------------

Query:  -----------RDILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCYHWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDR
                    D+ YVDGLK NLIS+SQLCDQGY V+F                  K  +DNCYHW SN S+ C L + DQTWLWHRKLG + ++ +++
Subjt:  -----------RDILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCYHWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDR

Query:  TVKNEVVIGVSNIDVN----------RKQTKASHKSLKECSTNR--------------TESLGGKKYVFVAEEDFSRFAWVRFLKDKYDTPKVCISLCLI
         +KN+ V+G+ ++DVN           KQT+ +HK+LKEC TNR               ESL GK+YV V  +D+SR+ WV FLK K DT + C +LCL 
Subjt:  TVKNEVVIGVSNIDVN----------RKQTKASHKSLKECSTNR--------------TESLGGKKYVFVAEEDFSRFAWVRFLKDKYDTPKVCISLCLI

Query:  LQREKGVKIVRIRSDH
        LQREKG KI RIRSDH
Subjt:  LQREKGVKIVRIRSDH

TYK30437.1 F5J5.1 [Cucumis melo var. makuwa]4.0e-16348.28Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        +I VNGVSV KPEVDWT+AEEQAS+GN RALNAIFNGV LNVFKLINSCST KEA KTLEVAYEGTSKVKISRLQLITSKFEALRMTEDE VSDYNKRVL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        +IANESL+LGEKIPDSKIVRKVLRS+ RKFD+KV +IEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGI+FKSTHV++E   DT+ANM+ESI LL 
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC
        KQF+N ++  KN      NA     + RK  +N    +++  +  + D                        +D N+NAF + I++ ++ D SECS +  
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC

Query:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT
        +   + E+L+ LWKED EAR IQKE IQDL+EENE LMS                   LKSVKMLNSG +NLD IL +G NG  ++GLGF +SA     T
Subjt:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT

Query:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHR------------------------------------------------
        +EIKFVPAS+  + DT+     +  S K+     +YCGQKGHIR  CYKL +                                                
Subjt:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHR------------------------------------------------

Query:  -----------------------------------------------DILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCY
                                                       D+ YVDGLK NLIS++QLCDQGY V+F                  K  +DNCY
Subjt:  -----------------------------------------------DILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCY

Query:  HWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYD
        HW SN S+ C L + DQTWLWHRKL    ++ +++ +KN+ V+G+ ++DVN       +    +C         GK++V V  +D++R+ WV FLK K D
Subjt:  HWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYD

Query:  TPKVCISLCLILQREKGVKIVRIRSDH
          ++C +LCL LQREK  KI RIRSDH
Subjt:  TPKVCISLCLILQREKGVKIVRIRSDH

XP_008444307.1 PREDICTED: uncharacterized protein LOC103487675 [Cucumis melo]1.0e-29185.81Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        MIIVNGVSVLKPEVD TD EEQASIGNARALNAIFNGV LNVFKLINSCST KEALKTLEVAYEGTSKVKISRLQLITSKFEAL MTEDEPVSDYNKRVL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        EIANESLML EKIPDSKIVRKVL+SLPR F LKVI+IEEAHDITTLKLDELFGSLLTFEMAT DRESKKGKGIAFK THVSEE VSDT+ANMNESIDLLI
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC
        KQFSNV+KKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDN+MNAFTVCISETDSGDESECSGQIC
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC

Query:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMSTLKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVM
        DKNFTFEELRVLWKEDFEAR                   TLKSVKMLNSGTENLDVILNSGQNGLN+HGLGFDTSARKINTTTEI FVPASVNDKTDTVM
Subjt:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMSTLKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVM

Query:  ATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILYVDGLKENLISVSQLCDQGYSVNFSKDN-----------------SDNCYHWISNDSEVCHLN
        ATKVVSPSAKTTKWICHYCGQK HIRPFCYKLHRDILYVDGLK NLISVS+LCDQGYSVNFSKDN                 +DNCYHWISNDSEVCHLN
Subjt:  ATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILYVDGLKENLISVSQLCDQGYSVNFSKDN-----------------SDNCYHWISNDSEVCHLN

Query:  KEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVN----------RKQTKASHKSLKECSTNR--------------TESLGGKKYVFVAEEDFSRF
        KEDQT LWHRKLG IDLKSID T+KNEVVIGV NIDVN           KQTKASHKSLKECSTNR              TESLGGKKYVFVAEEDFSRF
Subjt:  KEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVN----------RKQTKASHKSLKECSTNR--------------TESLGGKKYVFVAEEDFSRF

Query:  AWVRFLKDKYDTPKVCISLCLILQREK
         WVRFLKDKYDTPKVCISLCLILQREK
Subjt:  AWVRFLKDKYDTPKVCISLCLILQREK

TrEMBL top hitse value%identityAlignment
A0A1S3BAW0 uncharacterized protein LOC1034876754.9e-29285.81Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        MIIVNGVSVLKPEVD TD EEQASIGNARALNAIFNGV LNVFKLINSCST KEALKTLEVAYEGTSKVKISRLQLITSKFEAL MTEDEPVSDYNKRVL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        EIANESLML EKIPDSKIVRKVL+SLPR F LKVI+IEEAHDITTLKLDELFGSLLTFEMAT DRESKKGKGIAFK THVSEE VSDT+ANMNESIDLLI
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC
        KQFSNV+KKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDN+MNAFTVCISETDSGDESECSGQIC
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC

Query:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMSTLKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVM
        DKNFTFEELRVLWKEDFEAR                   TLKSVKMLNSGTENLDVILNSGQNGLN+HGLGFDTSARKINTTTEI FVPASVNDKTDTVM
Subjt:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMSTLKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVM

Query:  ATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILYVDGLKENLISVSQLCDQGYSVNFSKDN-----------------SDNCYHWISNDSEVCHLN
        ATKVVSPSAKTTKWICHYCGQK HIRPFCYKLHRDILYVDGLK NLISVS+LCDQGYSVNFSKDN                 +DNCYHWISNDSEVCHLN
Subjt:  ATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILYVDGLKENLISVSQLCDQGYSVNFSKDN-----------------SDNCYHWISNDSEVCHLN

Query:  KEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVN----------RKQTKASHKSLKECSTNR--------------TESLGGKKYVFVAEEDFSRF
        KEDQT LWHRKLG IDLKSID T+KNEVVIGV NIDVN           KQTKASHKSLKECSTNR              TESLGGKKYVFVAEEDFSRF
Subjt:  KEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVN----------RKQTKASHKSLKECSTNR--------------TESLGGKKYVFVAEEDFSRF

Query:  AWVRFLKDKYDTPKVCISLCLILQREK
         WVRFLKDKYDTPKVCISLCLILQREK
Subjt:  AWVRFLKDKYDTPKVCISLCLILQREK

A0A5A7T169 F5J5.11.9e-16348.28Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        +I VNGVSV KPEVDWT+AEEQAS+GN RALNAIFNGV LNVFKLINSCST KEA KTLEVAYEGTSKVKISRLQLITSKFEALRMTEDE VSDYNKRVL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        +IANESL+LGEKIPDSKIVRKVLRS+ RKFD+KV +IEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGI+FKSTHV++E   DT+ANM+ESI LL 
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC
        KQF+N ++  KN      NA     + RK  +N    +++  +  + D                        +D N+NAF + I++ ++ D SECS +  
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC

Query:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT
        +   + E+L+ LWKED EAR IQKE IQDL+EENE LMS                   LKSVKMLNSG +NLD IL +G NG  ++GLGF +SA     T
Subjt:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT

Query:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHR------------------------------------------------
        +EIKFVPAS+  + DT+     +  S K+     +YCGQKGHIR  CYKL +                                                
Subjt:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHR------------------------------------------------

Query:  -----------------------------------------------DILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCY
                                                       D+ YVDGLK NLIS++QLCDQGY V+F                  K  +DNCY
Subjt:  -----------------------------------------------DILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCY

Query:  HWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYD
        HW SN S+ C L + DQTWLWHRKL    ++ +++ +KN+ V+G+ ++DVN       +    +C         GK++V V  +D++R+ WV FLK K D
Subjt:  HWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYD

Query:  TPKVCISLCLILQREKGVKIVRIRSDH
          ++C +LCL LQREK  KI RIRSDH
Subjt:  TPKVCISLCLILQREKGVKIVRIRSDH

A0A5A7TPF7 Gag-proteinase polyprotein1.9e-21986.13Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        MIIVNGVSVLKPEVD TD EEQASIGNARALNAIFNGV LNVFKLINSCST KEALKTLEVAYEGTSKVKISRLQLITSKFEAL MTEDEPVSDYNKRVL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        EIANESLML EKIPDSKIVRKVL+SLPR F LKVI+IEEAHDITTLKLDELFGSLLTFEMAT DRESKKGKGIAFK THVSEE VSDT+ANMNESIDLLI
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREY---------------------------KDTGDGEE
        KQFSNV+KKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCRE                            KDTGDGEE
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREY---------------------------KDTGDGEE

Query:  DNNMNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLM------------------STLKSVKMLNSGTENLD
        DN+MNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKEDFEARAIQKERIQDLMEENE+LM                   TLKSVKMLNSGTENLD
Subjt:  DNNMNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLM------------------STLKSVKMLNSGTENLD

Query:  VILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILY
        VILNSGQNGLN+HGLGFDTSARKINTTTEI FVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQK HIRPFCYKLHRDILY
Subjt:  VILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILY

A0A5A7UJA4 Gag-pol polyprotein4.5e-15247.21Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        MI VNGVSV KPEVDWTDAEEQA +GNARALNAIFNGV LNVFK                                                      VL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        EIANESL+LGEKIPDSKIVRKVLRSLPRKFD+KV +IEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFK THV+EE V D EANM+E I LL 
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC
        KQF+N ++  KN N T  N Q    Y+R++    TRR NEN +RR+  Y +KKE        +E  D+ D  +D N+NAF + I++ ++ D+SECS +  
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC

Query:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT
        +   + E+L  LWKED EARAIQK+RIQDL+EENE+LMS                   LKSVKMLNSGT+NLD IL +  NG +++GLGF  SA     T
Subjt:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT

Query:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIR-----------------------------------PFCYKLH--------------
        +EIKFV AS+  + DT+ +   +    K+    C+YCG+KGHIR                                     C   H              
Subjt:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIR-----------------------------------PFCYKLH--------------

Query:  -----------RDILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCYHWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDR
                    D+ YVDGLK NLIS+SQLCDQGY V+F                  K  +DNCYHW SN S+ C L + DQTWLWHRKLG + ++ +++
Subjt:  -----------RDILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCYHWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDR

Query:  TVKNEVVIGVSNIDVN----------RKQTKASHKSLKECSTNR--------------TESLGGKKYVFVAEEDFSRFAWVRFLKDKYDTPKVCISLCLI
         +KN+ V+G+ ++DVN           KQT+ +HK+LKEC TNR               ESL GK+YV V  +D+SR+ WV FLK K DT + C +LCL 
Subjt:  TVKNEVVIGVSNIDVN----------RKQTKASHKSLKECSTNR--------------TESLGGKKYVFVAEEDFSRFAWVRFLKDKYDTPKVCISLCLI

Query:  LQREKGVKIVRIRSDH
        LQREKG KI RIRSDH
Subjt:  LQREKGVKIVRIRSDH

A0A5D3E2Y4 F5J5.11.9e-16348.28Show/hide
Query:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL
        +I VNGVSV KPEVDWT+AEEQAS+GN RALNAIFNGV LNVFKLINSCST KEA KTLEVAYEGTSKVKISRLQLITSKFEALRMTEDE VSDYNKRVL
Subjt:  MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI
        +IANESL+LGEKIPDSKIVRKVLRS+ RKFD+KV +IEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGI+FKSTHV++E   DT+ANM+ESI LL 
Subjt:  EIANESLMLGEKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLI

Query:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC
        KQF+N ++  KN      NA     + RK  +N    +++  +  + D                        +D N+NAF + I++ ++ D SECS +  
Subjt:  KQFSNVVKKFKNLNTTGSNAQNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQIC

Query:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT
        +   + E+L+ LWKED EAR IQKE IQDL+EENE LMS                   LKSVKMLNSG +NLD IL +G NG  ++GLGF +SA     T
Subjt:  DKNFTFEELRVLWKEDFEARAIQKERIQDLMEENEQLMST------------------LKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTT

Query:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHR------------------------------------------------
        +EIKFVPAS+  + DT+     +  S K+     +YCGQKGHIR  CYKL +                                                
Subjt:  TEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHR------------------------------------------------

Query:  -----------------------------------------------DILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCY
                                                       D+ YVDGLK NLIS++QLCDQGY V+F                  K  +DNCY
Subjt:  -----------------------------------------------DILYVDGLKENLISVSQLCDQGYSVNF-----------------SKDNSDNCY

Query:  HWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYD
        HW SN S+ C L + DQTWLWHRKL    ++ +++ +KN+ V+G+ ++DVN       +    +C         GK++V V  +D++R+ WV FLK K D
Subjt:  HWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYD

Query:  TPKVCISLCLILQREKGVKIVRIRSDH
          ++C +LCL LQREK  KI RIRSDH
Subjt:  TPKVCISLCLILQREKGVKIVRIRSDH

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.5e-0624.76Show/hide
Query:  CYKLHRDILYVDGLKENLISVSQLCDQGYSVNFSKDN---------------SDNCYHWISNDSEVCH-----LNKEDQTWLWHRKLGDIDLKSIDRTVK
        C  + +D+ +V  L+ NLIS   L   GY   F+                      Y     ++E+C         E    LWH+++G +  K +    K
Subjt:  CYKLHRDILYVDGLKENLISVSQLCDQGYSVNFSKDN---------------SDNCYHWISNDSEVCH-----LNKEDQTWLWHRKLGDIDLKSIDRTVK

Query:  NEVVIGVSNIDVN-------RKQTKASHKSLKE-------------CSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYDTPKVCISLCLILQREKGV
          ++       V         KQ + S ++  E             C     ES+GG KY     +D SR  WV  LK K    +V      +++RE G 
Subjt:  NEVVIGVSNIDVN-------RKQTKASHKSLKE-------------CSTNRTESLGGKKYVFVAEEDFSRFAWVRFLKDKYDTPKVCISLCLILQREKGV

Query:  KIVRIRSDHG
        K+ R+RSD+G
Subjt:  KIVRIRSDHG

Arabidopsis top hitse value%identityAlignment
AT4G05360.1 Zinc knuckle (CCHC-type) family protein3.9e-0727.83Show/hide
Query:  GEEDNNMNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKE--DFEARAIQKERI-QDLMEENEQLMSTLKSVKMLNSGTENLDVILNSGQNGLNK
        G +++  +   + IS+ +  +  +   + C K    EE  VL KE    EA+ ++  +   +  EE  QL  T K+++MLN+GT+ L  IL+ G+   +K
Subjt:  GEEDNNMNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKE--DFEARAIQKERI-QDLMEENEQLMSTLKSVKMLNSGTENLDVILNSGQNGLNK

Query:  HGLGFDTSARKIN--------------TTTEIKFVPASVND-KTDTVMATKVVSPSAKTTKW----------ICHYCGQKGHIRPFCYKLHRDILYVDGL
         GLGF  +  K +              T  E   V    +D +TD+   T+  S + K  K           +CH+CG  GHIRP C++L R+       
Subjt:  HGLGFDTSARKIN--------------TTTEIKFVPASVND-KTDTVMATKVVSPSAKTTKW----------ICHYCGQKGHIRPFCYKLHRDILYVDGL

Query:  KENLISVSQLCDQGYSVNFSKDNSDNCYHW
        K  L++        Y V F   +   CYH+
Subjt:  KENLISVSQLCDQGYSVNFSKDNSDNCYHW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTATCGTGAATGGTGTTTCGGTTCTAAAACCTGAAGTTGATTGGACTGATGCTGAAGAGCAAGCTTCTATTGGGAATGCCAGAGCACTTAACGCGATATTTAATGG
TGTTGGCCTGAACGTTTTCAAGTTAATAAATTCTTGCAGTACAGTCAAGGAAGCCTTGAAAACCTTAGAGGTAGCGTATGAAGGTACTTCCAAAGTAAAGATCTCAAGAT
TACAGTTGATAACGTCTAAGTTTGAGGCATTAAGAATGACCGAGGATGAACCAGTGTCTGATTACAATAAGAGAGTGCTTGAAATCGCAAATGAATCTTTGATGCTCGGT
GAAAAAATACCTGACTCTAAAATAGTGCGGAAAGTACTTCGATCCTTGCCCAGGAAATTTGATTTGAAAGTTATTTCCATTGAGGAAGCTCATGATATTACAACGCTCAA
ACTTGATGAATTGTTTGGTTCGTTGCTTACGTTTGAGATGGCCACTGCTGATAGAGAAAGTAAGAAAGGCAAGGGAATTGCTTTTAAATCCACACATGTAAGTGAGGAGG
TTGTAAGTGACACTGAAGCAAACATGAACGAATCAATAGACCTCCTGATCAAACAGTTTTCTAATGTGGTCAAGAAATTCAAAAACTTGAATACCACAGGATCAAACGCT
CAAAATTTGATTAACTATCAAAGAAAAGATGGTGAGAACAATACGAGAAGGATTAATGAAAATTCAAACAGGAGAAATAGTGATTATGGACGAAAAAAAGAGGGTGAAGG
AAGAGTTTTCAGATGTAGAGAATATAAAGATACTGGTGATGGTGAAGAAGATAACAACATGAATGCATTTACAGTATGTATTTCAGAAACTGACTCTGGAGATGAAAGTG
AATGTTCTGGGCAGATTTGTGATAAAAACTTTACATTTGAAGAGCTCAGAGTCCTATGGAAAGAAGATTTTGAAGCTAGAGCAATACAAAAAGAAAGAATTCAAGATCTT
ATGGAAGAAAATGAACAATTAATGTCTACATTAAAGTCTGTAAAGATGCTAAATTCAGGAACTGAGAATTTAGATGTAATATTAAATTCAGGACAGAATGGTTTAAACAA
ACATGGTCTTGGGTTTGATACTTCTGCAAGAAAGATCAATACTACAACTGAAATAAAGTTTGTACCTGCATCAGTTAATGACAAGACAGATACAGTCATGGCAACGAAAG
TAGTTAGCCCTTCAGCTAAAACTACTAAATGGATCTGTCACTACTGTGGTCAGAAAGGCCATATTAGACCTTTTTGCTATAAACTACATAGAGATATACTATATGTGGAT
GGCTTAAAAGAAAATTTGATTAGTGTAAGTCAGCTATGTGATCAAGGCTACAGTGTAAACTTTAGCAAAGACAATTCGGATAACTGCTATCACTGGATCTCCAATGATTC
AGAAGTTTGTCATTTGAATAAAGAAGATCAAACCTGGTTGTGGCACAGAAAGCTAGGAGACATCGACCTGAAAAGCATAGACAGGACTGTAAAAAATGAAGTTGTGATAG
GTGTTTCAAATATTGATGTAAATAGGAAGCAAACTAAAGCATCCCACAAAAGCCTAAAGGAATGTTCCACTAATAGAACTGAAAGTCTTGGTGGAAAGAAATATGTGTTT
GTAGCTGAAGAAGATTTTTCTCGATTTGCATGGGTTCGATTTTTGAAAGACAAGTATGATACTCCCAAAGTCTGCATCAGCTTGTGCTTGATTTTGCAGCGAGAAAAAGG
AGTGAAAATTGTTAGAATCAGAAGTGATCATGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTATCGTGAATGGTGTTTCGGTTCTAAAACCTGAAGTTGATTGGACTGATGCTGAAGAGCAAGCTTCTATTGGGAATGCCAGAGCACTTAACGCGATATTTAATGG
TGTTGGCCTGAACGTTTTCAAGTTAATAAATTCTTGCAGTACAGTCAAGGAAGCCTTGAAAACCTTAGAGGTAGCGTATGAAGGTACTTCCAAAGTAAAGATCTCAAGAT
TACAGTTGATAACGTCTAAGTTTGAGGCATTAAGAATGACCGAGGATGAACCAGTGTCTGATTACAATAAGAGAGTGCTTGAAATCGCAAATGAATCTTTGATGCTCGGT
GAAAAAATACCTGACTCTAAAATAGTGCGGAAAGTACTTCGATCCTTGCCCAGGAAATTTGATTTGAAAGTTATTTCCATTGAGGAAGCTCATGATATTACAACGCTCAA
ACTTGATGAATTGTTTGGTTCGTTGCTTACGTTTGAGATGGCCACTGCTGATAGAGAAAGTAAGAAAGGCAAGGGAATTGCTTTTAAATCCACACATGTAAGTGAGGAGG
TTGTAAGTGACACTGAAGCAAACATGAACGAATCAATAGACCTCCTGATCAAACAGTTTTCTAATGTGGTCAAGAAATTCAAAAACTTGAATACCACAGGATCAAACGCT
CAAAATTTGATTAACTATCAAAGAAAAGATGGTGAGAACAATACGAGAAGGATTAATGAAAATTCAAACAGGAGAAATAGTGATTATGGACGAAAAAAAGAGGGTGAAGG
AAGAGTTTTCAGATGTAGAGAATATAAAGATACTGGTGATGGTGAAGAAGATAACAACATGAATGCATTTACAGTATGTATTTCAGAAACTGACTCTGGAGATGAAAGTG
AATGTTCTGGGCAGATTTGTGATAAAAACTTTACATTTGAAGAGCTCAGAGTCCTATGGAAAGAAGATTTTGAAGCTAGAGCAATACAAAAAGAAAGAATTCAAGATCTT
ATGGAAGAAAATGAACAATTAATGTCTACATTAAAGTCTGTAAAGATGCTAAATTCAGGAACTGAGAATTTAGATGTAATATTAAATTCAGGACAGAATGGTTTAAACAA
ACATGGTCTTGGGTTTGATACTTCTGCAAGAAAGATCAATACTACAACTGAAATAAAGTTTGTACCTGCATCAGTTAATGACAAGACAGATACAGTCATGGCAACGAAAG
TAGTTAGCCCTTCAGCTAAAACTACTAAATGGATCTGTCACTACTGTGGTCAGAAAGGCCATATTAGACCTTTTTGCTATAAACTACATAGAGATATACTATATGTGGAT
GGCTTAAAAGAAAATTTGATTAGTGTAAGTCAGCTATGTGATCAAGGCTACAGTGTAAACTTTAGCAAAGACAATTCGGATAACTGCTATCACTGGATCTCCAATGATTC
AGAAGTTTGTCATTTGAATAAAGAAGATCAAACCTGGTTGTGGCACAGAAAGCTAGGAGACATCGACCTGAAAAGCATAGACAGGACTGTAAAAAATGAAGTTGTGATAG
GTGTTTCAAATATTGATGTAAATAGGAAGCAAACTAAAGCATCCCACAAAAGCCTAAAGGAATGTTCCACTAATAGAACTGAAAGTCTTGGTGGAAAGAAATATGTGTTT
GTAGCTGAAGAAGATTTTTCTCGATTTGCATGGGTTCGATTTTTGAAAGACAAGTATGATACTCCCAAAGTCTGCATCAGCTTGTGCTTGATTTTGCAGCGAGAAAAAGG
AGTGAAAATTGTTAGAATCAGAAGTGATCATGGCTAA
Protein sequenceShow/hide protein sequence
MIIVNGVSVLKPEVDWTDAEEQASIGNARALNAIFNGVGLNVFKLINSCSTVKEALKTLEVAYEGTSKVKISRLQLITSKFEALRMTEDEPVSDYNKRVLEIANESLMLG
EKIPDSKIVRKVLRSLPRKFDLKVISIEEAHDITTLKLDELFGSLLTFEMATADRESKKGKGIAFKSTHVSEEVVSDTEANMNESIDLLIKQFSNVVKKFKNLNTTGSNA
QNLINYQRKDGENNTRRINENSNRRNSDYGRKKEGEGRVFRCREYKDTGDGEEDNNMNAFTVCISETDSGDESECSGQICDKNFTFEELRVLWKEDFEARAIQKERIQDL
MEENEQLMSTLKSVKMLNSGTENLDVILNSGQNGLNKHGLGFDTSARKINTTTEIKFVPASVNDKTDTVMATKVVSPSAKTTKWICHYCGQKGHIRPFCYKLHRDILYVD
GLKENLISVSQLCDQGYSVNFSKDNSDNCYHWISNDSEVCHLNKEDQTWLWHRKLGDIDLKSIDRTVKNEVVIGVSNIDVNRKQTKASHKSLKECSTNRTESLGGKKYVF
VAEEDFSRFAWVRFLKDKYDTPKVCISLCLILQREKGVKIVRIRSDHG