; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0003009 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0003009
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionGag-pol polyprotein
Genome locationchr06:27215584..27218071
RNA-Seq ExpressionPay0003009
SyntenyPay0003009
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.1e-16647.61Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL
        MIIVN +S+ KPEVDWT+ +EQASVGNARALN IFNG                       VAYEGTSKVKISRLQL TSKFEALRMTEDES+SDYNK VL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESID---
        EIANESL+L                        VTAIEEAHDITTLKLDELFGSLLTFEM T +RESKKGK IAFKSTHV++E   DT+ANM+E  +   
Subjt:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESID---

Query:  ---LLIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEAR
              +  + + K+ KN   T S+ + +     +D + N   F      +N+D                 +SE S +      + E+L+ LWK+D EAR
Subjt:  ---LLIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEAR

Query:  AIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQ
        AIQKERIQD +EENERLMSNLD IL +G NG ++YGLGF AS      T+EIKFVPAS+  + D +     +    K+     YYCG+K HIR  CYKL+
Subjt:  AIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQ

Query:  R-----------------------------------------------------------------------------------------------EILY
        R                                                                                               ++ Y
Subjt:  R-----------------------------------------------------------------------------------------------EILY

Query:  VDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVN
        VDGLKANLI++SQLCDQGY V+F    CVV+NK+NQI M+G RQADNCYHW SN S+ C L + DQTWLWHRKLGH+ ++ +++ +KN+ V+G+PN+DVN
Subjt:  VDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVN

Query:  SKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLILQREKGVKIVRIRSDH
            C DC   KQT+++HKSLKEC TNRVLELLHMDLMG +QT+SLGGK                       DT ++  +LCL LQRE+  KI RIRSDH
Subjt:  SKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLILQREKGVKIVRIRSDH

Query:  DKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAK
         KEF NE  N+FC  EG HHE+SAPIT QQNGVVERKNKTLQEMA+VM+HAK
Subjt:  DKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAK

KAA0045252.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]9.9e-16068.32Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL
        MIIVN +SVLKPEVD T+ +EQAS+GNARALN IFNG                       VAYEGTSKVKISRLQLITSKFEAL MTEDE +SDYNKRVL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLI
        EIANESLML EKIPDSKIV+KVL+SLPR F LKV AIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGK IAFK THVS+E VSDTKANMNESIDLLI
Subjt:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLI

Query:  KQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREY-----------------------------------
        KQFSNV+KK KNLNTTGSNAQNLINYQRKDGENNTRR NENSNRRNSDYGRKKEGEGRVFRCRE                                    
Subjt:  KQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREY-----------------------------------

Query:  ------------------ESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLM-------------------------------SNLD
                          ESE SGQIC KNFTFEEL+VLWK+D EARAIQKERIQD MEENERLM                                NLD
Subjt:  ------------------ESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLM-------------------------------SNLD

Query:  VILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILY
        VILNSGQNGLN++GLGFD S RKINTTTEI FVPASVN+KTD V ATKVV+PSAKTTKWI +YCG+KDHIRPFCYKL R+ILY
Subjt:  VILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILY

KAA0046862.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.5e-18969.34Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNGVAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVLEIANESLMLGEKIPDSKIVQKVL
        MIIVNS+ VLKPEVDWTNAKEQASVGNARALNVIFNGV                       +MTEDESLSDYNKRVLEIANESLMLGEKIPDSKIVQKVL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNGVAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVLEIANESLMLGEKIPDSKIVQKVL

Query:  RSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLIKQFSNVVKKLKNLNTTGSNAQNL
        RSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEM TTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLIKQFSNVVKK KNLNTTGSNAQNL
Subjt:  RSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLIKQFSNVVKKLKNLNTTGSNAQNL

Query:  INYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQ
        INYQRKDGENNTRR +   NR  S +   KE       C       SG + +   T              R I K                     N  +
Subjt:  INYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQ

Query:  NGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFS
        N L                                                             P  Y    ++ YVDGLKANLISVSQLCDQGYSVNFS
Subjt:  NGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFS

Query:  KDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKEC
        KDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKEC
Subjt:  KDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKEC

Query:  STNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLIL
        STNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLIL
Subjt:  STNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLIL

KAA0054435.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.4e-16652.68Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL
        MIIVN +SV KPE+DWT+A+EQASVG ARA+N IFNG                       VAYEGTSKVKIS+L+LITSKFEAL+MTEDE++S+YNKRVL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVV--SDTKANMNESIDL
        EI N+ L+LGEKI +SKIV KVLRSLPRKFD+KVTAI+EA DITTL LDELFGSLLTFEMA +DRESKKGK IAFKS +  K+ V  S  +AN +ESI L
Subjt:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVV--SDTKANMNESIDL

Query:  LIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQK
        L KQFS + KK K LNT            R D EN+ R+ N++S RRNSD+G+K E  G                                         
Subjt:  LIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQK

Query:  ERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREIL
                       +LD IL+SGQN  +KYGLGFD ST+ +  T E K VP                                            ++I 
Subjt:  ERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREIL

Query:  YVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDV
        + DG    +I+           N  K    + +K+NQ+ M+G R++DNCYHW SN S +CHL K DQTWLWHRKLGHI L+S+D+ ++NE V+G+P++D+
Subjt:  YVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDV

Query:  NSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLILQREKGVKIVRIRSD
        N K  CGDC   KQTK+SH  LKEC T RVLELLH+DLMG +QTESLGGKKYV VV +D+SRFTWV FLK K DT K+ ISLCL LQ EKG KI+RIRSD
Subjt:  NSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLILQREKGVKIVRIRSD

Query:  HDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAK
        H+KEF NE+LNNFC  EGIHHE +APIT QQNGVVERKN+TLQEMA+VM+HAK
Subjt:  HDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAK

XP_008444307.1 PREDICTED: uncharacterized protein LOC103487675 [Cucumis melo]1.9e-26780.62Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL
        MIIVN +SVLKPEVD T+ +EQAS+GNARALN IFNG                       VAYEGTSKVKISRLQLITSKFEAL MTEDE +SDYNKRVL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLI
        EIANESLML EKIPDSKIV+KVL+SLPR F LKV AIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGK IAFK THVS+E VSDTKANMNESIDLLI
Subjt:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLI

Query:  KQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREY--------------------------ESEYSGQIC
        KQFSNV+KK KNLNTTGSNAQNLINYQRKDGENNTRR NENSNRRNSDYGRKKEGEGRVFRCREY                          ESE SGQIC
Subjt:  KQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREY--------------------------ESEYSGQIC

Query:  YKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTK
         KNFTFEEL+VLWK+D EAR ++  ++ +   E      NLDVILNSGQNGLN++GLGFD S RKINTTTEI FVPASVN+KTD V ATKVV+PSAKTTK
Subjt:  YKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTK

Query:  WIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLG
        WI +YCG+KDHIRPFCYKL R+ILYVDGLKANLISVS+LCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISN+SEVCHLNKEDQT LWHRKLG
Subjt:  WIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLG

Query:  HIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTP
        HIDLKSID T+KNEVVIGVPNIDVNSKLVC DCLTEKQTKASHKSLKECSTNRVLELLHMDLMG LQTESLGGKKYVFV EEDFSRFTWVRFLKDKYDTP
Subjt:  HIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTP

Query:  KVYISLCLILQREK
        KV ISLCLILQREK
Subjt:  KVYISLCLILQREK

TrEMBL top hitse value%identityAlignment
A0A1S3BAW0 uncharacterized protein LOC1034876759.0e-26880.62Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL
        MIIVN +SVLKPEVD T+ +EQAS+GNARALN IFNG                       VAYEGTSKVKISRLQLITSKFEAL MTEDE +SDYNKRVL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLI
        EIANESLML EKIPDSKIV+KVL+SLPR F LKV AIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGK IAFK THVS+E VSDTKANMNESIDLLI
Subjt:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLI

Query:  KQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREY--------------------------ESEYSGQIC
        KQFSNV+KK KNLNTTGSNAQNLINYQRKDGENNTRR NENSNRRNSDYGRKKEGEGRVFRCREY                          ESE SGQIC
Subjt:  KQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREY--------------------------ESEYSGQIC

Query:  YKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTK
         KNFTFEEL+VLWK+D EAR ++  ++ +   E      NLDVILNSGQNGLN++GLGFD S RKINTTTEI FVPASVN+KTD V ATKVV+PSAKTTK
Subjt:  YKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTK

Query:  WIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLG
        WI +YCG+KDHIRPFCYKL R+ILYVDGLKANLISVS+LCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISN+SEVCHLNKEDQT LWHRKLG
Subjt:  WIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLG

Query:  HIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTP
        HIDLKSID T+KNEVVIGVPNIDVNSKLVC DCLTEKQTKASHKSLKECSTNRVLELLHMDLMG LQTESLGGKKYVFV EEDFSRFTWVRFLKDKYDTP
Subjt:  HIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTP

Query:  KVYISLCLILQREK
        KV ISLCLILQREK
Subjt:  KVYISLCLILQREK

A0A5A7TNK7 Gag-pol polyprotein3.4e-16647.61Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL
        MIIVN +S+ KPEVDWT+ +EQASVGNARALN IFNG                       VAYEGTSKVKISRLQL TSKFEALRMTEDES+SDYNK VL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESID---
        EIANESL+L                        VTAIEEAHDITTLKLDELFGSLLTFEM T +RESKKGK IAFKSTHV++E   DT+ANM+E  +   
Subjt:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESID---

Query:  ---LLIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEAR
              +  + + K+ KN   T S+ + +     +D + N   F      +N+D                 +SE S +      + E+L+ LWK+D EAR
Subjt:  ---LLIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEAR

Query:  AIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQ
        AIQKERIQD +EENERLMSNLD IL +G NG ++YGLGF AS      T+EIKFVPAS+  + D +     +    K+     YYCG+K HIR  CYKL+
Subjt:  AIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQ

Query:  R-----------------------------------------------------------------------------------------------EILY
        R                                                                                               ++ Y
Subjt:  R-----------------------------------------------------------------------------------------------EILY

Query:  VDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVN
        VDGLKANLI++SQLCDQGY V+F    CVV+NK+NQI M+G RQADNCYHW SN S+ C L + DQTWLWHRKLGH+ ++ +++ +KN+ V+G+PN+DVN
Subjt:  VDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVN

Query:  SKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLILQREKGVKIVRIRSDH
            C DC   KQT+++HKSLKEC TNRVLELLHMDLMG +QT+SLGGK                       DT ++  +LCL LQRE+  KI RIRSDH
Subjt:  SKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLILQREKGVKIVRIRSDH

Query:  DKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAK
         KEF NE  N+FC  EG HHE+SAPIT QQNGVVERKNKTLQEMA+VM+HAK
Subjt:  DKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAK

A0A5A7TPF7 Gag-proteinase polyprotein4.8e-16068.32Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL
        MIIVN +SVLKPEVD T+ +EQAS+GNARALN IFNG                       VAYEGTSKVKISRLQLITSKFEAL MTEDE +SDYNKRVL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLI
        EIANESLML EKIPDSKIV+KVL+SLPR F LKV AIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGK IAFK THVS+E VSDTKANMNESIDLLI
Subjt:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLI

Query:  KQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREY-----------------------------------
        KQFSNV+KK KNLNTTGSNAQNLINYQRKDGENNTRR NENSNRRNSDYGRKKEGEGRVFRCRE                                    
Subjt:  KQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREY-----------------------------------

Query:  ------------------ESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLM-------------------------------SNLD
                          ESE SGQIC KNFTFEEL+VLWK+D EARAIQKERIQD MEENERLM                                NLD
Subjt:  ------------------ESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLM-------------------------------SNLD

Query:  VILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILY
        VILNSGQNGLN++GLGFD S RKINTTTEI FVPASVN+KTD V ATKVV+PSAKTTKWI +YCG+KDHIRPFCYKL R+ILY
Subjt:  VILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILY

A0A5A7TTT8 Gag-pol polyprotein2.2e-18969.34Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNGVAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVLEIANESLMLGEKIPDSKIVQKVL
        MIIVNS+ VLKPEVDWTNAKEQASVGNARALNVIFNGV                       +MTEDESLSDYNKRVLEIANESLMLGEKIPDSKIVQKVL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNGVAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVLEIANESLMLGEKIPDSKIVQKVL

Query:  RSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLIKQFSNVVKKLKNLNTTGSNAQNL
        RSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEM TTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLIKQFSNVVKK KNLNTTGSNAQNL
Subjt:  RSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLIKQFSNVVKKLKNLNTTGSNAQNL

Query:  INYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQ
        INYQRKDGENNTRR +   NR  S +   KE       C       SG + +   T              R I K                     N  +
Subjt:  INYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQ

Query:  NGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFS
        N L                                                             P  Y    ++ YVDGLKANLISVSQLCDQGYSVNFS
Subjt:  NGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFS

Query:  KDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKEC
        KDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKEC
Subjt:  KDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKEC

Query:  STNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLIL
        STNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLIL
Subjt:  STNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLIL

A0A5D3CS19 Gag-pol polyprotein2.6e-16652.68Show/hide
Query:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL
        MIIVN +SV KPE+DWT+A+EQASVG ARA+N IFNG                       VAYEGTSKVKIS+L+LITSKFEAL+MTEDE++S+YNKRVL
Subjt:  MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNG-----------------------VAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVL

Query:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVV--SDTKANMNESIDL
        EI N+ L+LGEKI +SKIV KVLRSLPRKFD+KVTAI+EA DITTL LDELFGSLLTFEMA +DRESKKGK IAFKS +  K+ V  S  +AN +ESI L
Subjt:  EIANESLMLGEKIPDSKIVQKVLRSLPRKFDLKVTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVV--SDTKANMNESIDL

Query:  LIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQK
        L KQFS + KK K LNT            R D EN+ R+ N++S RRNSD+G+K E  G                                         
Subjt:  LIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSNRRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQK

Query:  ERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREIL
                       +LD IL+SGQN  +KYGLGFD ST+ +  T E K VP                                            ++I 
Subjt:  ERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASVNNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREIL

Query:  YVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDV
        + DG    +I+           N  K    + +K+NQ+ M+G R++DNCYHW SN S +CHL K DQTWLWHRKLGHI L+S+D+ ++NE V+G+P++D+
Subjt:  YVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKEDQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDV

Query:  NSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLILQREKGVKIVRIRSD
        N K  CGDC   KQTK+SH  LKEC T RVLELLH+DLMG +QTESLGGKKYV VV +D+SRFTWV FLK K DT K+ ISLCL LQ EKG KI+RIRSD
Subjt:  NSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLILQREKGVKIVRIRSD

Query:  HDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAK
        H+KEF NE+LNNFC  EGIHHE +APIT QQNGVVERKN+TLQEMA+VM+HAK
Subjt:  HDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-2730.22Show/hide
Query:  CYKLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCH-----LNKEDQTWLWHRKLGHIDLKSIDRT
        C  + +++ +V  L+ NLIS   L   GY   F+      + K + ++  G  +          N+E+C         E    LWH+++GH+  K +   
Subjt:  CYKLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCH-----LNKEDQTWLWHRKLGHIDLKSIDRT

Query:  VKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLIL
         K  ++       V     C  CL  KQ + S ++  E   N +L+L++ D+ G ++ ES+GG KY     +D SR  WV  LK K    +V+     ++
Subjt:  VKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDTPKVYISLCLIL

Query:  QREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAKK
        +RE G K+ R+RSD+  E+ +     +C S GI HE + P T Q NGV ER N+T+ E  + ML   K
Subjt:  QREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAKK

P25384 Transposon Ty2-C Gag-Pol polyprotein2.2e-1624.56Show/hide
Query:  KLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWIS---------NNSEVCHLNKEDQT-----WLWHRKLGHID
        K   + L+   +  +L+S+S+L +Q  +  F+++   +   D  +L    +  D  ++W+S         +   + ++NK          L HR LGH +
Subjt:  KLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWIS---------NNSEVCHLNKEDQT-----WLWHRKLGHID

Query:  LKSIDRTVKNEVVIGVPNIDVN----SKLVCGDCLTEKQTKASH---KSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDK
         +SI +++K   V  +   D+     S   C DCL  K TK  H     LK   +    + LH D+ G +         Y     ++ +RF WV  L D+
Subjt:  LKSIDRTVKNEVVIGVPNIDVN----SKLVCGDCLTEKQTKASH---KSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDK

Query:  YDTP--KVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLH
         +     V+ S+   ++ +   +++ I+ D   E+ N+ L+ F  + GI   Y+    S+ +GV ER N+TL    + +LH
Subjt:  YDTP--KVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLH

Q03494 Transposon Ty2-DR2 Gag-Pol polyprotein2.2e-1624.56Show/hide
Query:  KLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWIS---------NNSEVCHLNKEDQT-----WLWHRKLGHID
        K   + L+   +  +L+S+S+L +Q  +  F+++   +   D  +L    +  D  ++W+S         +   + ++NK          L HR LGH +
Subjt:  KLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWIS---------NNSEVCHLNKEDQT-----WLWHRKLGHID

Query:  LKSIDRTVKNEVVIGVPNIDVN----SKLVCGDCLTEKQTKASH---KSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDK
         +SI +++K   V  +   D+     S   C DCL  K TK  H     LK   +    + LH D+ G +         Y     ++ +RF WV  L D+
Subjt:  LKSIDRTVKNEVVIGVPNIDVN----SKLVCGDCLTEKQTKASH---KSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDK

Query:  YDTP--KVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLH
         +     V+ S+   ++ +   +++ I+ D   E+ N+ L+ F  + GI   Y+    S+ +GV ER N+TL    + +LH
Subjt:  YDTP--KVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLH

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein2.2e-1624.56Show/hide
Query:  KLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWIS---------NNSEVCHLNKEDQT-----WLWHRKLGHID
        K   + L+   +  +L+S+S+L +Q  +  F+++   +   D  +L    +  D  ++W+S         +   + ++NK          L HR LGH +
Subjt:  KLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWIS---------NNSEVCHLNKEDQT-----WLWHRKLGHID

Query:  LKSIDRTVKNEVVIGVPNIDVN----SKLVCGDCLTEKQTKASH---KSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDK
         +SI +++K   V  +   D+     S   C DCL  K TK  H     LK   +    + LH D+ G +         Y     ++ +RF WV  L D+
Subjt:  LKSIDRTVKNEVVIGVPNIDVN----SKLVCGDCLTEKQTKASH---KSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDK

Query:  YDTP--KVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLH
         +     V+ S+   ++ +   +++ I+ D   E+ N+ L+ F  + GI   Y+    S+ +GV ER N+TL    + +LH
Subjt:  YDTP--KVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLH

Q12491 Transposon Ty2-B Gag-Pol polyprotein2.2e-1624.56Show/hide
Query:  KLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWIS---------NNSEVCHLNKEDQT-----WLWHRKLGHID
        K   + L+   +  +L+S+S+L +Q  +  F+++   +   D  +L    +  D  ++W+S         +   + ++NK          L HR LGH +
Subjt:  KLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWIS---------NNSEVCHLNKEDQT-----WLWHRKLGHID

Query:  LKSIDRTVKNEVVIGVPNIDVN----SKLVCGDCLTEKQTKASH---KSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDK
         +SI +++K   V  +   D+     S   C DCL  K TK  H     LK   +    + LH D+ G +         Y     ++ +RF WV  L D+
Subjt:  LKSIDRTVKNEVVIGVPNIDVN----SKLVCGDCLTEKQTKASH---KSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDK

Query:  YDTP--KVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLH
         +     V+ S+   ++ +   +++ I+ D   E+ N+ L+ F  + GI   Y+    S+ +GV ER N+TL    + +LH
Subjt:  YDTP--KVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLH

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTATCGTGAATAGTATTTCGGTTCTAAAACCTGAAGTTGATTGGACTAATGCTAAAGAGCAAGCTTCTGTTGGAAATGCCAGAGCACTTAACGTGATATTTAATGG
TGTAGCGTATGAAGGTACTTCCAAAGTAAAGATCTCAAGATTACAGTTGATAACATCTAAGTTTGAGGCATTAAGAATGACCGAGGATGAATCATTGTCTGATTACAATA
AAAGAGTGCTTGAAATCGCAAATGAATCTTTGATGCTCGGTGAAAAAATACCTGACTCTAAAATAGTGCAGAAAGTACTTCGATCCTTGCCCAGGAAATTTGATTTGAAA
GTTACTGCCATTGAGGAAGCTCATGATATTACAACGCTGAAACTTGATGAATTGTTTGGTTCGTTGCTTACGTTTGAGATGGCCACTACTGATAGAGAAAGTAAGAAAGG
CAAGAGAATTGCTTTTAAATCCACACATGTAAGTAAGGAGGTTGTAAGTGACACTAAAGCAAACATGAATGAATCAATAGACCTTCTGATCAAACAGTTTTCTAATGTGG
TCAAGAAATTAAAAAACTTGAATACCACAGGATCAAATGCTCAAAATCTGATTAACTATCAAAGAAAAGATGGTGAGAACAATACGAGAAGGTTTAATGAAAATTCAAAC
AGGAGAAATAGTGATTATGGACGTAAAAAAGAGGGCGAAGGAAGAGTTTTCAGATGTAGAGAATATGAAAGTGAATATTCTGGGCAGATTTGTTATAAAAACTTTACATT
TGAAGAGCTCAAAGTTCTATGGAAAAAAGATTCTGAAGCCAGAGCAATACAAAAAGAAAGAATTCAAGATTTTATGGAAGAAAATGAAAGATTAATGTCTAATTTAGATG
TAATATTAAATTCAGGACAAAATGGTTTAAACAAATATGGTCTTGGGTTTGATGCTTCTACAAGAAAGATCAATACTACAACCGAAATAAAGTTTGTACCTGCATCAGTT
AATAACAAGACAGATATAGTCACGGCAACGAAAGTTGTTAACCCTTCAGCTAAAACTACTAAATGGATCTGGTACTACTGTGGTAAAAAAGACCATATTAGACCTTTTTG
CTATAAACTACAGAGAGAGATATTATATGTGGATGGCTTAAAAGCAAATTTGATTAGTGTAAGTCAGCTATGTGATCAAGGCTACAGTGTAAACTTTAGCAAAGACAATT
GTGTGGTAATTAATAAAGATAATCAGATTCTTATGAATGGTAGTCGGCAAGCGGATAATTGTTATCACTGGATCTCCAATAATTCAGAAGTTTGTCATTTGAATAAAGAA
GATCAAACCTGGTTGTGGCACAGAAAGCTAGGACACATCGACCTGAAAAGCATAGACAGGACTGTAAAAAATGAAGTTGTGATAGGTGTTCCAAATATTGATGTGAATAG
CAAATTGGTTTGTGGAGATTGTCTAACTGAGAAGCAAACTAAAGCATCCCATAAAAGCCTAAAGGAATGTTCCACTAATAGAGTCCTTGAACTTCTACATATGGATCTTA
TGGGATTATTGCAGACTGAAAGTCTTGGTGGAAAGAAATATGTGTTTGTAGTTGAAGAAGATTTTTCTAGATTTACATGGGTTCGATTTTTGAAAGACAAGTATGATACT
CCCAAAGTCTACATCAGCTTGTGCTTGATTTTGCAGCGAGAAAAAGGAGTGAAAATTGTTAGAATCAGAAGTGATCATGACAAAGAATTTAAAAATGAAAATCTCAATAA
CTTCTGTGATTCTGAAGGAATACACCATGAATACTCTGCTCCTATAACTTCTCAACAAAATGGAGTTGTTGAAAGGAAAAACAAAACATTACAGGAGATGGCTCAAGTCA
TGTTACATGCCAAAAAAAAAATTACCCTTGCATTTTGTGACAGAAGCTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTATCGTGAATAGTATTTCGGTTCTAAAACCTGAAGTTGATTGGACTAATGCTAAAGAGCAAGCTTCTGTTGGAAATGCCAGAGCACTTAACGTGATATTTAATGG
TGTAGCGTATGAAGGTACTTCCAAAGTAAAGATCTCAAGATTACAGTTGATAACATCTAAGTTTGAGGCATTAAGAATGACCGAGGATGAATCATTGTCTGATTACAATA
AAAGAGTGCTTGAAATCGCAAATGAATCTTTGATGCTCGGTGAAAAAATACCTGACTCTAAAATAGTGCAGAAAGTACTTCGATCCTTGCCCAGGAAATTTGATTTGAAA
GTTACTGCCATTGAGGAAGCTCATGATATTACAACGCTGAAACTTGATGAATTGTTTGGTTCGTTGCTTACGTTTGAGATGGCCACTACTGATAGAGAAAGTAAGAAAGG
CAAGAGAATTGCTTTTAAATCCACACATGTAAGTAAGGAGGTTGTAAGTGACACTAAAGCAAACATGAATGAATCAATAGACCTTCTGATCAAACAGTTTTCTAATGTGG
TCAAGAAATTAAAAAACTTGAATACCACAGGATCAAATGCTCAAAATCTGATTAACTATCAAAGAAAAGATGGTGAGAACAATACGAGAAGGTTTAATGAAAATTCAAAC
AGGAGAAATAGTGATTATGGACGTAAAAAAGAGGGCGAAGGAAGAGTTTTCAGATGTAGAGAATATGAAAGTGAATATTCTGGGCAGATTTGTTATAAAAACTTTACATT
TGAAGAGCTCAAAGTTCTATGGAAAAAAGATTCTGAAGCCAGAGCAATACAAAAAGAAAGAATTCAAGATTTTATGGAAGAAAATGAAAGATTAATGTCTAATTTAGATG
TAATATTAAATTCAGGACAAAATGGTTTAAACAAATATGGTCTTGGGTTTGATGCTTCTACAAGAAAGATCAATACTACAACCGAAATAAAGTTTGTACCTGCATCAGTT
AATAACAAGACAGATATAGTCACGGCAACGAAAGTTGTTAACCCTTCAGCTAAAACTACTAAATGGATCTGGTACTACTGTGGTAAAAAAGACCATATTAGACCTTTTTG
CTATAAACTACAGAGAGAGATATTATATGTGGATGGCTTAAAAGCAAATTTGATTAGTGTAAGTCAGCTATGTGATCAAGGCTACAGTGTAAACTTTAGCAAAGACAATT
GTGTGGTAATTAATAAAGATAATCAGATTCTTATGAATGGTAGTCGGCAAGCGGATAATTGTTATCACTGGATCTCCAATAATTCAGAAGTTTGTCATTTGAATAAAGAA
GATCAAACCTGGTTGTGGCACAGAAAGCTAGGACACATCGACCTGAAAAGCATAGACAGGACTGTAAAAAATGAAGTTGTGATAGGTGTTCCAAATATTGATGTGAATAG
CAAATTGGTTTGTGGAGATTGTCTAACTGAGAAGCAAACTAAAGCATCCCATAAAAGCCTAAAGGAATGTTCCACTAATAGAGTCCTTGAACTTCTACATATGGATCTTA
TGGGATTATTGCAGACTGAAAGTCTTGGTGGAAAGAAATATGTGTTTGTAGTTGAAGAAGATTTTTCTAGATTTACATGGGTTCGATTTTTGAAAGACAAGTATGATACT
CCCAAAGTCTACATCAGCTTGTGCTTGATTTTGCAGCGAGAAAAAGGAGTGAAAATTGTTAGAATCAGAAGTGATCATGACAAAGAATTTAAAAATGAAAATCTCAATAA
CTTCTGTGATTCTGAAGGAATACACCATGAATACTCTGCTCCTATAACTTCTCAACAAAATGGAGTTGTTGAAAGGAAAAACAAAACATTACAGGAGATGGCTCAAGTCA
TGTTACATGCCAAAAAAAAAATTACCCTTGCATTTTGTGACAGAAGCTATTAA
Protein sequenceShow/hide protein sequence
MIIVNSISVLKPEVDWTNAKEQASVGNARALNVIFNGVAYEGTSKVKISRLQLITSKFEALRMTEDESLSDYNKRVLEIANESLMLGEKIPDSKIVQKVLRSLPRKFDLK
VTAIEEAHDITTLKLDELFGSLLTFEMATTDRESKKGKRIAFKSTHVSKEVVSDTKANMNESIDLLIKQFSNVVKKLKNLNTTGSNAQNLINYQRKDGENNTRRFNENSN
RRNSDYGRKKEGEGRVFRCREYESEYSGQICYKNFTFEELKVLWKKDSEARAIQKERIQDFMEENERLMSNLDVILNSGQNGLNKYGLGFDASTRKINTTTEIKFVPASV
NNKTDIVTATKVVNPSAKTTKWIWYYCGKKDHIRPFCYKLQREILYVDGLKANLISVSQLCDQGYSVNFSKDNCVVINKDNQILMNGSRQADNCYHWISNNSEVCHLNKE
DQTWLWHRKLGHIDLKSIDRTVKNEVVIGVPNIDVNSKLVCGDCLTEKQTKASHKSLKECSTNRVLELLHMDLMGLLQTESLGGKKYVFVVEEDFSRFTWVRFLKDKYDT
PKVYISLCLILQREKGVKIVRIRSDHDKEFKNENLNNFCDSEGIHHEYSAPITSQQNGVVERKNKTLQEMAQVMLHAKKKITLAFCDRSY