; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0026861 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0026861
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr01:7892464..7896912
RNA-Seq ExpressionIVF0026861
SyntenyIVF0026861
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006488 - dolichol-linked oligosaccharide biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR016197 - Chromo-like domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core
IPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037196.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.080.49Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRELVR+ECGLISAYD K+GHK  QTKNT +TA KEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ+VENLNIELS  S  G + P   + K + G          G   N +        G 
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK

Query:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL
           +T         + GSGTAVKGKGVC DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSL
Subjt:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL

Query:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE
        KNLMKSWGADDQGFLVECRTIECG LEEHEQDR QG  + E IA LL++FA VFEWP+ LPPQR IDHHIYLKSG DPVNVRPYRYAHHQKEEMERLVDE
Subjt:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE

Query:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM
        ML+SGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVM
Subjt:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM

Query:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP
        PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G++EH+QHLEVVLGLL+EKELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWP
Subjt:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP

Query:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK
         P NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E ETAF KLK+AMMTLPVL MPDF+LPFEIESDASGFG+GAVLTQCRKPVAYFSK
Subjt:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK

Query:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA
        TLSMR+R+RPV +   +A VLA          RK     D+  + +  +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT +LNQITA
Subjt:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA

Query:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC
        P MIDV+I KEETR DPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ S STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMKKD++RYC
Subjt:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC

Query:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV
        EECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAK VAETFV+EV+RLHGYPRSIVSDRDKV
Subjt:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV

Query:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNST
        FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W H                                 GDMETPNST
Subjt:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNST

Query:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV
        LDQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAYKLELPA+AAIHPVFHV
Subjt:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV

Query:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN
        SQ KKAVGRGETV  LNPYMNENHEWITQPEEVYGYRKNP T +WEALISWKGLPPHEATWE+C DMKYQFP+FHL     +EEESDARPPILFTY+R+N
Subjt:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN

Query:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKKHETNEGET G+E  GHETN E+ R   EESK DGDQ G P
Subjt:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

KAA0050511.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.080.48Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRE+VR+ECGLISAYD K+GHK  QTKNT +T TKEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGD---Q
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ VENLNIELS  S  G + P   + K K G   G     L      H     D   +
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGD---Q

Query:  TGI---DAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLK
         G+   +      + GSGTAVKGKGVC+DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSLK
Subjt:  TGI---DAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLK

Query:  NLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEM
        NLMKSWGADDQGFLVECRTIECG LEE+EQDR  G  + E IA LL++FA VFEWP+ LPPQR IDHHIYLKSG DPVNVRPYRYAHHQKEEMERLVDEM
Subjt:  NLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEM

Query:  LSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMP
        L+SGIIRPSKSPYSSPVLLVRK+DGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVMP
Subjt:  LSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMP

Query:  FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPT
        FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQG++EH+QHLEVVLGLL+EKELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWPT
Subjt:  FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPT

Query:  PTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKT
        P NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E E AF KLK+AMMTLPVL MPDF+LPFEIESDASGFGVGAVLTQCRKPVAYFSKT
Subjt:  PTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKT

Query:  LSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAP
        LS+R+R+RPV +   +A VLA          RK     D+  +  + +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT QLNQITAP
Subjt:  LSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAP

Query:  TMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE
         +IDV+I KEETRQDPAL+EIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ + STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMK+D++RYCE
Subjt:  TMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE

Query:  ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVF
        ECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKY HFLLLKHPFTAK VAETF++EV+RLHGYPRSIVSDRDKVF
Subjt:  ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVF

Query:  LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNSTL
        LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W H                                 GDMETPNSTL
Subjt:  LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNSTL

Query:  DQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVS
        DQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAY+LELPA+AAIHPVFHVS
Subjt:  DQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVS

Query:  QPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNK
        Q KKAVGRGETVQ L PY+NENHEWITQPEEVYGYRKNP+T +WEALISWKGLPPHEATWE+C DMKYQFP+FHLEDKVDLEEESDARPPILFTY+R+NK
Subjt:  QPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNK

Query:  KKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKHETNEGET G+E   HETN E+ R   EESKEDGDQ G P
Subjt:  KKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

TYK06572.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.080.25Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRE+VR+ECGLISAYD K+GHK  QTKNT +T TKEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGD---Q
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ VENLNIELS  S  G + P   + K K G   G     L      H     D   +
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGD---Q

Query:  TGI---DAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLK
         G+   +      + GSGTAVKGKGVC+DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSLK
Subjt:  TGI---DAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLK

Query:  NLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEM
        NLMKSWGADDQGFLVECRTIECG LEE+EQDR  G  + E IA LL++FA VFEWP+ LPPQR IDHHIY+KSG DPVNVRPYRYAHHQKEEMERLVDEM
Subjt:  NLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEM

Query:  LSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMP
        L+SGIIRPSKSPYSSPVLLVRK+DGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVMP
Subjt:  LSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMP

Query:  FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPT
        FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G++EH QHLEVVLGLL+ KELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWPT
Subjt:  FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPT

Query:  PTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKT
        P NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E E AF KLK+AMMTLPVL MPDF+LPFEIESDASGFGVGAVLTQCRKPVAYFSKT
Subjt:  PTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKT

Query:  LSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAP
        LS+R+R+RPV +   +A VLA          RK     D+  +  + +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT QLNQITAP
Subjt:  LSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAP

Query:  TMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE
         +IDV+I KEETRQDPAL+EIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ + STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMK+D++RYCE
Subjt:  TMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE

Query:  ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVF
        ECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKY HFLLLKHPFTAK VAETF++EV+RLHGYPRSIVSDRDKVF
Subjt:  ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVF

Query:  LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNSTL
        LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W H                                 GDMETPNSTL
Subjt:  LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNSTL

Query:  DQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVS
        DQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAY+LELPA+AAIHPVFHVS
Subjt:  DQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVS

Query:  QPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNK
        Q KKAVGRGETVQ L PY+NENHEWITQPEEVYGYRKNP+T +WEALISWKGLPPHEATWE+C DMKYQFP+FHLEDKVDLEEESDARPPILFTY+R+NK
Subjt:  QPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNK

Query:  KKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKHETNEGET G+E   HETN E+ R   EESKEDGDQ G P
Subjt:  KKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

TYK13876.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.080.64Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRELVR+ECGLISAYD K+GHK  QTKNT +TATKEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ+VENLNIELS  S  G + P   + K + G          G   N +        G 
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK

Query:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL
           +T         + GSGTAVKGKGVC DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSL
Subjt:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL

Query:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE
        KNLMKSWGADDQGFLVECRTIECG LEEHEQDR QG  + E IA LL++FA VFEWP+ LPPQR IDHHIYLKSG DPVNVRPYRYAHHQKEEMERLVDE
Subjt:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE

Query:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM
        ML+SGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVM
Subjt:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM

Query:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP
        PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G++EH+QHLEVVLGLL+EKELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWP
Subjt:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP

Query:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK
        TP NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E ETAF KLK+AMMTLPVL MPDF+LPFEIESDASGFG+GAVLTQCRKPVAYFSK
Subjt:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK

Query:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA
        TLSMR+R+RPV +   +A VLA          RK     D+  + +  +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT +LNQITA
Subjt:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA

Query:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC
        P MIDV+I KEETR DPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ S STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMKKD++RYC
Subjt:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC

Query:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV
        EECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAK VAETFV+EV+RLHGYPRSIVSDRDKV
Subjt:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV

Query:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNST
        FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W H                                 GDMETPNST
Subjt:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNST

Query:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV
        LDQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAYKLELPA+AAIHPVFHV
Subjt:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV

Query:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN
        SQ KKAVGRGETV  LNPYMNENHEWITQPEEVYGYRKNP T +WEALISWKGLPPHEATWE+C DMKYQFP+FHL     +EEESDARPPILFTY+R+N
Subjt:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN

Query:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKKHETNEGET G+E  GHETN E+ R   EESK DGDQ G P
Subjt:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

TYK24654.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]0.080.25Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRE+VR+ECGLISAYD K+GHK  QTKNT +T TKEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGD---Q
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ VENLNIELS  S  G + P   + K K G   G     L      H     D   +
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSG-GTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGD---Q

Query:  TGI---DAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLK
         G+   +      + GSGTAVKGKGVC+DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSLK
Subjt:  TGI---DAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLK

Query:  NLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEM
        NLMKSWGADDQGFLVECRTIECG LEE+EQDR  G  + E IA LL++FA VFEWP+ LPPQR IDHHIY+KSG DPVNVRPYRYAHHQKEEMERLVDEM
Subjt:  NLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEM

Query:  LSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMP
        L+SGIIRPSKSPYSSPVLLVRK+DGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVMP
Subjt:  LSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMP

Query:  FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPT
        FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G++EH QHLEVVLGLL+ KELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWPT
Subjt:  FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPT

Query:  PTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKT
        P NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E E AF KLK+AMMTLPVL MPDF+LPFEIESDASGFGVGAVLTQCRKPVAYFSKT
Subjt:  PTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKT

Query:  LSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAP
        LS+R+R+RPV +   +A VLA          RK     D+  +  + +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT QLNQITAP
Subjt:  LSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAP

Query:  TMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE
         +IDV+I KEETRQDPAL+EIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ + STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMK+D++RYCE
Subjt:  TMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE

Query:  ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVF
        ECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKY HFLLLKHPFTAK VAETF++EV+RLHGYPRSIVSDRDKVF
Subjt:  ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVF

Query:  LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNSTL
        LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W H                                 GDMETPNSTL
Subjt:  LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNSTL

Query:  DQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVS
        DQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAY+LELPA+AAIHPVFHVS
Subjt:  DQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVS

Query:  QPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNK
        Q KKAVGRGETVQ L PY+NENHEWITQPEEVYGYRKNP+T +WEALISWKGLPPHEATWE+C DMKYQFP+FHLEDKVDLEEESDARPPILFTY+R+NK
Subjt:  QPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNK

Query:  KKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKHETNEGET G+E   HETN E+ R   EESKEDGDQ G P
Subjt:  KKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

TrEMBL top hitse value%identityAlignment
A0A5A7T4Y0 Ty3/gypsy retrotransposon protein0.0e+0080.49Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRELVR+ECGLISAYD K+GHK  QTKNT +TA KEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ+VENLNIELS  S  G + P   + K + G          G   N +        G 
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK

Query:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL
           +T         + GSGTAVKGKGVC DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSL
Subjt:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL

Query:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE
        KNLMKSWGADDQGFLVECRTIECG LEEHEQDR QG  + E IA LL++FA VFEWP+ LPPQR IDHHIYLKSG DPVNVRPYRYAHHQKEEMERLVDE
Subjt:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE

Query:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM
        ML+SGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVM
Subjt:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM

Query:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP
        PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G++EH+QHLEVVLGLL+EKELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWP
Subjt:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP

Query:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK
         P NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E ETAF KLK+AMMTLPVL MPDF+LPFEIESDASGFG+GAVLTQCRKPVAYFSK
Subjt:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK

Query:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA
        TLSMR+R+RPV +   +A VLA          RK     D+ +  +  +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT +LNQITA
Subjt:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA

Query:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC
        P MIDV+I KEETR DPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ S STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMKKD++RYC
Subjt:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC

Query:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV
        EECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAK VAETFV+EV+RLHGYPRSIVSDRDKV
Subjt:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV

Query:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLW---------------------------------RHGDMETPNST
        FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W                                  HGDMETPNST
Subjt:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLW---------------------------------RHGDMETPNST

Query:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV
        LDQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAYKLELPA+AAIHPVFHV
Subjt:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV

Query:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN
        SQ KKAVGRGETV  LNPYMNENHEWITQPEEVYGYRKNP T +WEALISWKGLPPHEATWE+C DMKYQFP+FHL     +EEESDARPPILFTY+R+N
Subjt:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN

Query:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKKHETNEGET G+E  GHETN E+ R   EESK DGDQ G P
Subjt:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

A0A5A7UAE4 Ty3/gypsy retrotransposon protein0.0e+0080.48Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRE+VR+ECGLISAYD K+GHK  QTKNT +T TKEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGD---Q
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ VENLNIELS  S  G + P   + K K G   G     L      H     D   +
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGD---Q

Query:  TGI---DAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLK
         G+   +      + GSGTAVKGKGVC+DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSLK
Subjt:  TGI---DAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLK

Query:  NLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEM
        NLMKSWGADDQGFLVECRTIECG LEE+EQDR  G  + E IA LL++FA VFEWP+ LPPQR IDHHIYLKSG DPVNVRPYRYAHHQKEEMERLVDEM
Subjt:  NLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEM

Query:  LSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMP
        L+SGIIRPSKSPYSSPVLLVRK+DGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVMP
Subjt:  LSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMP

Query:  FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPT
        FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQG++EH+QHLEVVLGLL+EKELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWPT
Subjt:  FGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPT

Query:  PTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKT
        P NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E E AF KLK+AMMTLPVL MPDF+LPFEIESDASGFGVGAVLTQCRKPVAYFSKT
Subjt:  PTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSKT

Query:  LSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAP
        LS+R+R+RPV +   +A VLA          RK     D+ +  +  +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT QLNQITAP
Subjt:  LSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAP

Query:  TMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE
         +IDV+I KEETRQDPAL+EIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ + STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMK+D++RYCE
Subjt:  TMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE

Query:  ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVF
        ECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKY HFLLLKHPFTAK VAETF++EV+RLHGYPRSIVSDRDKVF
Subjt:  ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVF

Query:  LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNSTL
        LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W H                                 GDMETPNSTL
Subjt:  LSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNSTL

Query:  DQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVS
        DQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAY+LELPA+AAIHPVFHVS
Subjt:  DQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVS

Query:  QPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNK
        Q KKAVGRGETVQ L PY+NENHEWITQPEEVYGYRKNP+T +WEALISWKGLPPHEATWE+C DMKYQFP+FHLEDKVDLEEESDARPPILFTY+R+NK
Subjt:  QPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNK

Query:  KKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKHETNEGET G+E   HETN E+ R   EESKEDGDQ G P
Subjt:  KKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

A0A5D3C5N7 Ty3/gypsy retrotransposon protein0.0e+0080.12Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRE+VR+ECGLISAYD K+GHK  QTKNT +T TKEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ VENLNIELS  S  G + P   + K K G          G   N +        G 
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK

Query:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL
           +T         + GSGTAVKGKGVC+DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSL
Subjt:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL

Query:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE
        KNLMKSWGADDQGFLVECRTIECG LEE+EQDR  G  + E IA LL++FA VFEWP+ LPPQR IDHHIY+KSG DPVNVRPYRYAHHQKEEMERLVDE
Subjt:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE

Query:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM
        ML+SGIIRPSKSPYSSPVLLVRK+DGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVM
Subjt:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM

Query:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP
        PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G++EH QHLEVVLGLL+ KELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWP
Subjt:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP

Query:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK
        TP NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E E AF KLK+AMMTLPVL MPDF+LPFEIESDASGFGVGAVLTQCRKPVAYFSK
Subjt:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK

Query:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA
        TLS+R+R+RPV +   +A VLA          RK     D+ +  +  +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT QLNQITA
Subjt:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA

Query:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC
        P +IDV+I KEETRQDPAL+EIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ + STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMK+D++RYC
Subjt:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC

Query:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV
        EECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKY HFLLLKHPFTAK VAETF++EV+RLHGYPRSIVSDRDKV
Subjt:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV

Query:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNST
        FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W H                                 GDMETPNST
Subjt:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNST

Query:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV
        LDQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAY+LELPA+AAIHPVFHV
Subjt:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV

Query:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN
        SQ KKAVGRGETVQ L PY+NENHEWITQPEEVYGYRKNP+T +WEALISWKGLPPHEATWE+C DMKYQFP+FHLEDKVDLEEESDARPPILFTY+R+N
Subjt:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN

Query:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKKHETNEGET G+E   HETN E+ R   EESKEDGDQ G P
Subjt:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

A0A5D3CU05 Ty3/gypsy retrotransposon protein0.0e+0080.64Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRELVR+ECGLISAYD K+GHK  QTKNT +TATKEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ+VENLNIELS  S  G + P   + K + G          G   N +        G 
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK

Query:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL
           +T         + GSGTAVKGKGVC DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSL
Subjt:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL

Query:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE
        KNLMKSWGADDQGFLVECRTIECG LEEHEQDR QG  + E IA LL++FA VFEWP+ LPPQR IDHHIYLKSG DPVNVRPYRYAHHQKEEMERLVDE
Subjt:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE

Query:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM
        ML+SGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVM
Subjt:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM

Query:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP
        PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G++EH+QHLEVVLGLL+EKELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWP
Subjt:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP

Query:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK
        TP NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E ETAF KLK+AMMTLPVL MPDF+LPFEIESDASGFG+GAVLTQCRKPVAYFSK
Subjt:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK

Query:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA
        TLSMR+R+RPV +   +A VLA          RK     D+ +  +  +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT +LNQITA
Subjt:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA

Query:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC
        P MIDV+I KEETR DPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ S STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMKKD++RYC
Subjt:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC

Query:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV
        EECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAK VAETFV+EV+RLHGYPRSIVSDRDKV
Subjt:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV

Query:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLW---------------------------------RHGDMETPNST
        FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W                                  HGDMETPNST
Subjt:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLW---------------------------------RHGDMETPNST

Query:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV
        LDQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAYKLELPA+AAIHPVFHV
Subjt:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV

Query:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN
        SQ KKAVGRGETV  LNPYMNENHEWITQPEEVYGYRKNP T +WEALISWKGLPPHEATWE+C DMKYQFP+FHL     +EEESDARPPILFTY+R+N
Subjt:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN

Query:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKKHETNEGET G+E  GHETN E+ R   EESK DGDQ G P
Subjt:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

A0A5D3DM31 Ty3/gypsy retrotransposon protein0.0e+0080.12Show/hide
Query:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
        MMKLALKIENRE+VR+ECGLISAYD K+GHK  QTKNT +T TKEG T GSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG
Subjt:  MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAG

Query:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK
        HRCK  EHKEL+MLVV+EGGEELEIVEEEFFDAEAEMKQV+VQ VENLNIELS  S  G + P   + K K G          G   N +        G 
Subjt:  HRCKSNEHKELQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLS-GGTDQPRDHEGKRKGGRRR-------GGNPNRLWGYPQLHRGK

Query:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL
           +T         + GSGTAVKGKGVC+DVEV LEGWKV DSFLPLQLGGVDMILGMQWLHSLGVTEVDWK L+LTFHHQG+KVVI+GDPSLTK RVSL
Subjt:  IGDQTGIDAAGDTELWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSL

Query:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE
        KNLMKSWGADDQGFLVECRTIECG LEE+EQDR  G  + E IA LL++FA VFEWP+ LPPQR IDHHIY+KSG DPVNVRPYRYAHHQKEEMERLVDE
Subjt:  KNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGREDEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDE

Query:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM
        ML+SGIIRPSKSPYSSPVLLVRK+DGSWRFCVDYRALNNVTIPDKFPIP+IEELFDELKGASVFSKIDLKAGYHQIRMCPEDI+KTAFRTHEGHYEFLVM
Subjt:  MLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVM

Query:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP
        PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYS+G++EH QHLEVVLGLL+ KELY N+EKCSFAKPRISYLGH ISEQGIEADPEKIRAVSEWP
Subjt:  PFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWP

Query:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK
        TP NVREVRGFL LTGYYRRFVK+YG IAAPLTQLLKKGAYKW  E E AF KLK+AMMTLPVL MPDF+LPFEIESDASGFGVGAVLTQCRKPVAYFSK
Subjt:  TPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKPVAYFSK

Query:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA
        TLS+R+R+RPV +   +A VLA          RK     D+ +  +  +      PQYQ+WVAKLLGYSFEV YQPGLENKAADALSRI+PT QLNQITA
Subjt:  TLSMRERARPVTK---MAAVLA----------RKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITA

Query:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC
        P +IDV+I KEETRQDPAL+EIIRLIEEQGMEIPHYTLQQGVLKFKGRLV+ + STLLPTILHTYHDSVFGGHSGFLRTYKRLTGE+YWKGMK+D++RYC
Subjt:  PTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYC

Query:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV
        EECAICQRNKSSAL+PAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKY HFLLLKHPFTAK VAETF++EV+RLHGYPRSIVSDRDKV
Subjt:  EECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKV

Query:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNST
        FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYL C     P     W H                                 GDMETPNST
Subjt:  FLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRH---------------------------------GDMETPNST

Query:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV
        LDQQLKDRDI LGALK+H+K+ QERMKKQAD+KRREVEFQEGD+VFLKLRPYRQ S+RKKRNEKLSPKYFGPYR+LERIG+VAY+LELPA+AAIHPVFHV
Subjt:  LDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV

Query:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN
        SQ KKAVGRGETVQ L PY+NENHEWITQPEEVYGYRKNP+T +WEALISWKGLPPHEATWE+C DMKYQFP+FHLEDKVDLEEESDARPPILFTY+R+N
Subjt:  SQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENCADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRN

Query:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP
        KKKHETNEGET G+E   HETN E+ R   EESKEDGDQ G P
Subjt:  KKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.3e-11330.69Show/hide
Query:  PQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGA
        P + ++  + L      + +R Y     + + M   +++ L SGIIR SK+  + PV+ V KK+G+ R  VDY+ LN    P+ +P+P+IE+L  +++G+
Subjt:  PQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGA

Query:  SVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELY
        ++F+K+DLK+ YH IR+   D  K AFR   G +E+LVMP+G++ AP+ FQ  +N +        V+ + DDIL++S+   EH++H++ VL  LK   L 
Subjt:  SVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELY

Query:  ANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGA-YKWDVETETAFDKLKKAMMT
         N  KC F + ++ ++G+ ISE+G     E I  V +W  P N +E+R FL    Y R+F+     +  PL  LLKK   +KW      A + +K+ +++
Subjt:  ANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGA-YKWDVETETAFDKLKKAMMT

Query:  LPVLAMPDFNLPFEIESDASGFGVGAVLTQCRK-----PVAYFSKTLSMRERARPVT--KMAAVLARK--------------EVHGEDRPKVAEVSVGTT
         PVL   DF+    +E+DAS   VGAVL+Q        PV Y+S  +S  +    V+  +M A++                 ++  + R  +  ++  + 
Subjt:  LPVLAMPDFNLPFEIESDASGFGVGAVLTQCRK-----PVAYFSKTLSMRERARPVT--KMAAVLARK--------------EVHGEDRPKVAEVSVGTT

Query:  CCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQ-------------LNQITAPTMIDVDITKEETRQDPALQEIIRLIEEQGMEI-PHYTL
            +  RW   L  ++FE+ Y+PG  N  ADALSRI    +             +NQI+        +  E T       +++ L+  +   +  +  L
Subjt:  CCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQ-------------LNQITAPTMIDVDITKEETRQDPALQEIIRLIEEQGMEI-PHYTL

Query:  QQGVL-KFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDF
        + G+L   K ++++P+++ L  TI+  YH+     H G       +     WKG++K I  Y + C  CQ NKS    P G L P+   +  W  +SMDF
Subjt:  QQGVL-KFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDF

Query:  IEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKS
        I  LP+S G++ + VVVDR SK    +      TA+  A  F + VI   G P+ I++D D +F S  WK+        +  S  Y PQ+DGQTE  N++
Subjt:  IEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKS

Query:  VETYLSCLWRPTPSTGV-------------------------LWRHGDMETP------NSTLDQQLKDRDIALGALKKHMKITQERMKKQADTKRREV-E
        VE  L C+    P+T V                         + R+    +P      +   D+  ++       +K+H+     +MKK  D K +E+ E
Subjt:  VETYLSCLWRPTPSTGV-------------------------LWRHGDMETP------NSTLDQQLKDRDIALGALKKHMKITQERMKKQADTKRREV-E

Query:  FQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIH---PVFHVSQPKK
        FQ GDLV +K     +T    K N KL+P + GP+ +L++ G   Y+L+LP D+  H     FHVS  +K
Subjt:  FQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIH---PVFHVSQPKK

P0CT35 Transposon Tf2-2 polyprotein1.3e-11330.69Show/hide
Query:  PQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGA
        P + ++  + L      + +R Y     + + M   +++ L SGIIR SK+  + PV+ V KK+G+ R  VDY+ LN    P+ +P+P+IE+L  +++G+
Subjt:  PQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGA

Query:  SVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELY
        ++F+K+DLK+ YH IR+   D  K AFR   G +E+LVMP+G++ AP+ FQ  +N +        V+ + DDIL++S+   EH++H++ VL  LK   L 
Subjt:  SVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELY

Query:  ANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGA-YKWDVETETAFDKLKKAMMT
         N  KC F + ++ ++G+ ISE+G     E I  V +W  P N +E+R FL    Y R+F+     +  PL  LLKK   +KW      A + +K+ +++
Subjt:  ANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGA-YKWDVETETAFDKLKKAMMT

Query:  LPVLAMPDFNLPFEIESDASGFGVGAVLTQCRK-----PVAYFSKTLSMRERARPVT--KMAAVLARK--------------EVHGEDRPKVAEVSVGTT
         PVL   DF+    +E+DAS   VGAVL+Q        PV Y+S  +S  +    V+  +M A++                 ++  + R  +  ++  + 
Subjt:  LPVLAMPDFNLPFEIESDASGFGVGAVLTQCRK-----PVAYFSKTLSMRERARPVT--KMAAVLARK--------------EVHGEDRPKVAEVSVGTT

Query:  CCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQ-------------LNQITAPTMIDVDITKEETRQDPALQEIIRLIEEQGMEI-PHYTL
            +  RW   L  ++FE+ Y+PG  N  ADALSRI    +             +NQI+        +  E T       +++ L+  +   +  +  L
Subjt:  CCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQ-------------LNQITAPTMIDVDITKEETRQDPALQEIIRLIEEQGMEI-PHYTL

Query:  QQGVL-KFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDF
        + G+L   K ++++P+++ L  TI+  YH+     H G       +     WKG++K I  Y + C  CQ NKS    P G L P+   +  W  +SMDF
Subjt:  QQGVL-KFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDF

Query:  IEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKS
        I  LP+S G++ + VVVDR SK    +      TA+  A  F + VI   G P+ I++D D +F S  WK+        +  S  Y PQ+DGQTE  N++
Subjt:  IEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKS

Query:  VETYLSCLWRPTPSTGV-------------------------LWRHGDMETP------NSTLDQQLKDRDIALGALKKHMKITQERMKKQADTKRREV-E
        VE  L C+    P+T V                         + R+    +P      +   D+  ++       +K+H+     +MKK  D K +E+ E
Subjt:  VETYLSCLWRPTPSTGV-------------------------LWRHGDMETP------NSTLDQQLKDRDIALGALKKHMKITQERMKKQADTKRREV-E

Query:  FQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIH---PVFHVSQPKK
        FQ GDLV +K     +T    K N KL+P + GP+ +L++ G   Y+L+LP D+  H     FHVS  +K
Subjt:  FQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIH---PVFHVSQPKK

P0CT41 Transposon Tf2-12 polyprotein1.3e-11330.69Show/hide
Query:  PQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGA
        P + ++  + L      + +R Y     + + M   +++ L SGIIR SK+  + PV+ V KK+G+ R  VDY+ LN    P+ +P+P+IE+L  +++G+
Subjt:  PQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEELFDELKGA

Query:  SVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELY
        ++F+K+DLK+ YH IR+   D  K AFR   G +E+LVMP+G++ AP+ FQ  +N +        V+ + DDIL++S+   EH++H++ VL  LK   L 
Subjt:  SVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEKELY

Query:  ANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGA-YKWDVETETAFDKLKKAMMT
         N  KC F + ++ ++G+ ISE+G     E I  V +W  P N +E+R FL    Y R+F+     +  PL  LLKK   +KW      A + +K+ +++
Subjt:  ANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGA-YKWDVETETAFDKLKKAMMT

Query:  LPVLAMPDFNLPFEIESDASGFGVGAVLTQCRK-----PVAYFSKTLSMRERARPVT--KMAAVLARK--------------EVHGEDRPKVAEVSVGTT
         PVL   DF+    +E+DAS   VGAVL+Q        PV Y+S  +S  +    V+  +M A++                 ++  + R  +  ++  + 
Subjt:  LPVLAMPDFNLPFEIESDASGFGVGAVLTQCRK-----PVAYFSKTLSMRERARPVT--KMAAVLARK--------------EVHGEDRPKVAEVSVGTT

Query:  CCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQ-------------LNQITAPTMIDVDITKEETRQDPALQEIIRLIEEQGMEI-PHYTL
            +  RW   L  ++FE+ Y+PG  N  ADALSRI    +             +NQI+        +  E T       +++ L+  +   +  +  L
Subjt:  CCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQ-------------LNQITAPTMIDVDITKEETRQDPALQEIIRLIEEQGMEI-PHYTL

Query:  QQGVL-KFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDF
        + G+L   K ++++P+++ L  TI+  YH+     H G       +     WKG++K I  Y + C  CQ NKS    P G L P+   +  W  +SMDF
Subjt:  QQGVL-KFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDF

Query:  IEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKS
        I  LP+S G++ + VVVDR SK    +      TA+  A  F + VI   G P+ I++D D +F S  WK+        +  S  Y PQ+DGQTE  N++
Subjt:  IEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSYHPQSDGQTEVVNKS

Query:  VETYLSCLWRPTPSTGV-------------------------LWRHGDMETP------NSTLDQQLKDRDIALGALKKHMKITQERMKKQADTKRREV-E
        VE  L C+    P+T V                         + R+    +P      +   D+  ++       +K+H+     +MKK  D K +E+ E
Subjt:  VETYLSCLWRPTPSTGV-------------------------LWRHGDMETP------NSTLDQQLKDRDIALGALKKHMKITQERMKKQADTKRREV-E

Query:  FQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIH---PVFHVSQPKK
        FQ GDLV +K     +T    K N KL+P + GP+ +L++ G   Y+L+LP D+  H     FHVS  +K
Subjt:  FQEGDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIH---PVFHVSQPKK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein6.5e-12133.11Show/hide
Query:  LPPQRS------IDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEE
        LPP+ +      + H I +K G     ++PY      ++E+ ++V ++L +  I PSKSP SSPV+LV KKDG++R CVDYR LN  TI D FP+P I+ 
Subjt:  LPPQRS------IDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEE

Query:  LFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLG
        L   +  A +F+ +DL +GYHQI M P+D  KTAF T  G YE+ VMPFGL NAPSTF   M   F+    RFV V+ DDIL++S+  +EH +HL+ VL 
Subjt:  LFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLG

Query:  LLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDK
         LK + L    +KC FA     +LG+ I  Q I     K  A+ ++PTP  V++ + FL +  YYRRF+ +   IA P+ QL      +W  + + A +K
Subjt:  LLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDK

Query:  LKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKP------VAYFSKTLSMRERARPVTKM-------AAVLARKEVHGED----RPKVAEVSV
        LK A+   PVL   +    + + +DAS  G+GAVL +          V YFSK+L   ++  P  ++       A    R  +HG+        ++ +S+
Subjt:  LKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKP------VAYFSKTLSMRERARPVTKM-------AAVLARKEVHGED----RPKVAEVSV

Query:  -GTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAPTMIDVDITKEETRQDPALQEIIRLIEE------------------QG
              A + QRW+  L  Y F + Y  G +N  ADA+SR   T+   + + P  ID +  K   + DP    ++  ++E                  + 
Subjt:  -GTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAPTMIDVDITKEETRQDPALQEIIRLIEE------------------QG

Query:  MEI-----PHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHD-SVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLE
        +E+      +Y+L+  ++ ++ RLV+P        ++  YHD ++FGGH G   T  +++   YW  ++  II+Y   C  CQ  KS      GLL PL 
Subjt:  MEI-----PHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHD-SVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLE

Query:  IPDAIWSDISMDFIEGL-PKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSY
        I +  W DISMDF+ GL P S   ++ILVVVDR SK  HF+  +    A  + +   R +   HG+PR+I SDRD    +  ++EL +  G K   SS+ 
Subjt:  IPDAIWSDISMDFIEGL-PKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSY

Query:  HPQSDGQTE----VVNKSVETYLSC---LWR-------------PTPSTGVLWRHGDM----ETPNSTLDQQLKDRDIALGALKKHMKITQERMKKQADT
        HPQ+DGQ+E     +N+ +  Y+S     W              PT + G      D+     TP    D ++  R      L KH+K    + K+Q + 
Subjt:  HPQSDGQTE----VVNKSVETYLSC---LWR-------------PTPSTGVLWRHGDM----ETPNSTLDQQLKDRDIALGALKKHMKITQERMKKQADT

Query:  KRREVEFQE-----------GDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV
         + E+E              GD V +    +R    +K    K+   Y GP+R++++I + AY+L+L +    H V +V
Subjt:  KRREVEFQE-----------GDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHV

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.5e-12233.11Show/hide
Query:  LPPQRS------IDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEE
        LPP+ +      + H I +K G     ++PY      ++E+ ++V ++L +  I PSKSP SSPV+LV KKDG++R CVDYR LN  TI D FP+P I+ 
Subjt:  LPPQRS------IDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPIPMIEE

Query:  LFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLG
        L   +  A +F+ +DL +GYHQI M P+D  KTAF T  G YE+ VMPFGL NAPSTF   M   F+    RFV V+ DDIL++S+  +EH +HL+ VL 
Subjt:  LFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLG

Query:  LLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDK
         LK + L    +KC FA     +LG+ I  Q I     K  A+ ++PTP  V++ + FL +  YYRRF+ +   IA P+ QL      +W  + + A DK
Subjt:  LLKEKELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDK

Query:  LKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKP------VAYFSKTLSMRERARPVTKM-------AAVLARKEVHGED----RPKVAEVSV
        LK A+   PVL   +    + + +DAS  G+GAVL +          V YFSK+L   ++  P  ++       A    R  +HG+        ++ +S+
Subjt:  LKKAMMTLPVLAMPDFNLPFEIESDASGFGVGAVLTQCRKP------VAYFSKTLSMRERARPVTKM-------AAVLARKEVHGED----RPKVAEVSV

Query:  -GTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAPTMIDVDITKEETRQDPALQEIIRLIEE------------------QG
              A + QRW+  L  Y F + Y  G +N  ADA+SR   T+   + + P  ID +  K   + DP    ++  ++E                  + 
Subjt:  -GTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISPTVQLNQITAPTMIDVDITKEETRQDPALQEIIRLIEE------------------QG

Query:  MEI-----PHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHD-SVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLE
        +E+      +Y+L+  ++ ++ RLV+P        ++  YHD ++FGGH G   T  +++   YW  ++  II+Y   C  CQ  KS      GLL PL 
Subjt:  MEI-----PHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHD-SVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCEECAICQRNKSSALSPAGLLMPLE

Query:  IPDAIWSDISMDFIEGL-PKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSY
        I +  W DISMDF+ GL P S   ++ILVVVDR SK  HF+  +    A  + +   R +   HG+PR+I SDRD    +  ++EL +  G K   SS+ 
Subjt:  IPDAIWSDISMDFIEGL-PKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFRLAGTKLNRSSSY

Query:  HPQSDGQTE----VVNKSVETYLSC---LWR-------------PTPSTGVLWRHGDM----ETPNSTLDQQLKDRDIALGALKKHMKITQERMKKQADT
        HPQ+DGQ+E     +N+ +  Y S     W              PT + G      D+     TP    D ++  R      L KH+K    + K+Q + 
Subjt:  HPQSDGQTE----VVNKSVETYLSC---LWR-------------PTPSTGVLWRHGDM----ETPNSTLDQQLKDRDIALGALKKHMKITQERMKKQADT

Query:  KRREVEFQE-----------GDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVSQPKKAVGRGETVQPLNP
         + E+E              GD V +    +R    +K    K+   Y GP+R++++I + AY+L+L +    H V +V   KK V R +      P
Subjt:  KRREVEFQE-----------GDLVFLKLRPYRQTSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVSQPKKAVGRGETVQPLNP

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein1.8e-0428.72Show/hide
Query:  LWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLG--GVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLKNLMKS
        L G    ++  G C  + + ++  ++ ++FL L L    VD+ILG +WL  LG T V+W+    +F H  + + +  +    + +V+ K  MKS
Subjt:  LWGSGTAVKGKGVCRDVEVQLEGWKVKDSFLPLQLG--GVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLKNLMKS

ATMG00850.1 DNA/RNA polymerases superfamily protein1.6e-0556.41Show/hide
Query:  QKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSW
        ++  ++  + EML + II+PS SPYSSPVLLV+KKDG W
Subjt:  QKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein6.6e-3654.2Show/hide
Query:  IQHLEVVLGLLKEKELYANLEKCSFAKPRISYLG--HVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYK
        + HL +VL + ++ + YAN +KC+F +P+I+YLG  H+IS +G+ ADP K+ A+  WP P N  E+RGFL LTGYYRRFVK+YG I  PLT+LLKK + K
Subjt:  IQHLEVVLGLLKEKELYANLEKCSFAKPRISYLG--HVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYK

Query:  WDVETETAFDKLKKAMMTLPVLAMPDFNLPF
        W      AF  LK A+ TLPVLA+PD  LPF
Subjt:  WDVETETAFDKLKKAMMTLPVLAMPDFNLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAGTTGGCATTAAAGATAGAAAACAGGGAGCTGGTCCGAAAGGAATGTGGGCTGATTAGTGCCTACGACGTTAAATCCGGCCATAAAAGTCAACAGACTAAGAA
CACCGGTTCAACAGCCACGAAGGAGGGATTGACCGGGGGAAGTTGGCCAATGAGAACAATAACTCTGAGAGAGGTAGCCACGGGAGATAACCGTCGGGAAGGACCCACGA
AACGACTGTCAGATGCAGAGTTCCAAGCCCGGAGGGAAAAGGGATTATGTTTTCGTTGTGGGGAGAAGTATTTTGCAGGACACCGATGCAAGTCAAACGAACATAAAGAA
CTCCAAATGCTGGTAGTCAGAGAAGGAGGGGAGGAGCTGGAGATAGTGGAAGAGGAGTTCTTTGATGCCGAAGCTGAGATGAAACAAGTAGAAGTCCAGAATGTGGAGAA
CCTGAACATTGAACTCTCTCAACTCAGTGGTGGGACTGACCAACCCAGGGACCATGAAGGTAAAAGGAAGGGTGGGAGAAGAAGAGGTGGTAATCCTAATCGACTGTGGG
GCTACCCACAACTTCATCGCGGAAAAATTGGTGACCAAACTGGGATTGACGCTGCAGGAGACACCGAACTATGGGGGTCTGGAACGGCAGTTAAGGGGAAGGGAGTGTGC
CGAGATGTCGAAGTACAATTGGAAGGCTGGAAGGTGAAGGATAGCTTCCTACCACTGCAATTGGGAGGGGTTGACATGATCCTAGGTATGCAGTGGCTCCATTCTCTGGG
GGTGACGGAAGTCGACTGGAAAGAGCTGATGCTGACCTTCCATCACCAGGGAAGGAAGGTTGTGATAAAAGGGGATCCGAGCCTCACCAAAACACGGGTGAGTTTGAAGA
ACTTGATGAAATCCTGGGGAGCAGACGATCAGGGATTCCTTGTAGAATGTCGAACCATAGAATGTGGACTGTTAGAAGAACATGAACAAGACAGAGGGCAGGGGAGGGAA
GATGAAGAAGCAATAGCGACCCTGTTAAAGCAGTTTGCTAGCGTGTTCGAATGGCCCACAGCACTCCCACCACAGCGCAGTATTGATCATCACATCTACCTGAAGAGTGG
AATGGACCCCGTGAATGTCAGACCATACCGGTATGCGCACCATCAGAAGGAAGAGATGGAGCGATTGGTAGACGAAATGCTTTCCTCAGGGATCATACGACCGAGCAAAA
GCCCTTATTCCAGCCCGGTGCTGTTGGTAAGGAAGAAAGACGGGAGTTGGAGGTTTTGTGTAGACTACCGAGCATTGAATAACGTGACGATCCCAGACAAGTTCCCAATA
CCGATGATAGAAGAATTGTTCGACGAATTGAAGGGAGCTAGTGTATTTTCCAAAATAGATCTCAAAGCCGGATACCATCAGATTAGGATGTGCCCCGAGGACATCAAAAA
GACCGCGTTCAGAACTCATGAAGGGCACTACGAATTCTTAGTGATGCCGTTCGGATTAACGAATGCTCCATCGACCTTTCAGGCACTGATGAATCAGGTGTTTAAGCCAT
ACTTGAGACGATTTGTGTTGGTATTCTTCGATGATATTTTGGTCTACAGCCAAGGGATAGACGAGCACATCCAGCACTTAGAGGTGGTCTTAGGACTGCTGAAAGAAAAG
GAGTTATATGCGAATTTGGAGAAGTGTAGTTTTGCAAAGCCTCGGATCAGTTATTTGGGGCATGTCATTTCGGAACAGGGCATTGAAGCAGATCCGGAAAAGATAAGAGC
GGTTAGTGAATGGCCAACTCCGACCAATGTGAGGGAAGTTCGGGGATTCCTTGTGCTGACCGGCTACTACCGGCGCTTTGTCAAAGACTATGGAGCAATAGCAGCGCCAC
TCACCCAACTGTTGAAGAAGGGGGCGTACAAGTGGGATGTTGAAACTGAGACTGCTTTTGATAAGTTGAAGAAGGCCATGATGACTCTACCGGTACTTGCCATGCCCGAC
TTCAATCTGCCCTTCGAAATCGAATCAGATGCTTCAGGATTTGGGGTTGGGGCGGTATTGACTCAATGCCGAAAGCCCGTAGCTTATTTCAGTAAAACACTAAGTATGCG
AGAGAGAGCGCGGCCGGTTACAAAGATGGCGGCCGTACTTGCTAGGAAGGAAGTTCACGGTGAAGACAGACCAAAGGTCGCTGAAGTTTCTGTTGGAACAACGTGTTGTG
CACCTCAGTATCAAAGATGGGTCGCGAAGCTGTTGGGATATTCGTTCGAAGTAACCTACCAGCCAGGACTGGAGAATAAAGCCGCTGACGCCCTGTCCCGAATTTCACCA
ACCGTGCAGTTGAATCAAATCACAGCCCCCACAATGATAGATGTGGATATCACCAAGGAAGAAACAAGGCAGGACCCCGCGCTGCAAGAAATAATTAGATTGATCGAAGA
GCAAGGGATGGAGATACCCCATTACACTCTGCAGCAGGGGGTGTTGAAGTTTAAAGGACGGCTGGTGATTCCAAGCAACTCCACTCTGCTACCTACAATATTACACACCT
ACCATGATTCAGTGTTCGGGGGGCACTCGGGATTTTTAAGAACGTATAAGCGGTTGACTGGGGAGCTCTACTGGAAGGGCATGAAGAAGGACATTATAAGGTATTGTGAA
GAGTGCGCGATATGCCAGCGAAATAAGTCTTCAGCATTGTCACCGGCAGGGCTACTGATGCCATTGGAAATTCCCGACGCAATATGGAGTGACATCTCCATGGACTTCAT
TGAAGGGTTGCCAAAATCCAAGGGGTGGGATGTGATACTTGTGGTGGTGGATAGACTGAGTAAATATGGCCATTTCCTGCTTTTGAAGCATCCTTTCACAGCCAAGACGG
TGGCAGAAACTTTTGTCAGGGAAGTGATCCGACTTCATGGGTATCCGAGATCGATAGTGTCTGATAGGGACAAGGTTTTCCTAAGTCATTTCTGGAAAGAACTATTCCGT
TTAGCAGGCACTAAGCTGAACCGAAGCTCCTCCTATCACCCACAATCAGATGGTCAGACCGAGGTGGTGAACAAGAGTGTTGAGACATATTTGAGCTGTCTATGGAGGCC
TACCCCCTCCACTGGTGTACTATGGAGACATGGAGACATGGAGACACCAAATTCGACACTCGACCAGCAGTTGAAAGACAGAGATATCGCACTAGGGGCGTTGAAGAAAC
ACATGAAAATAACTCAAGAAAGGATGAAGAAACAGGCTGACACCAAGAGAAGGGAAGTTGAATTCCAAGAAGGGGATTTGGTGTTCCTCAAATTACGACCTTACCGGCAG
ACATCACTTAGAAAGAAAAGGAATGAAAAGCTATCACCAAAGTACTTCGGGCCTTATCGGATCTTAGAAAGAATCGGAGAAGTAGCATACAAACTCGAACTTCCTGCGGA
TGCTGCTATCCACCCCGTGTTCCATGTGTCACAGCCGAAGAAAGCTGTTGGGAGAGGCGAAACGGTGCAACCATTGAATCCATACATGAATGAAAACCATGAATGGATCA
CGCAGCCCGAAGAAGTCTACGGCTATCGAAAGAATCCAGCAACTAATGATTGGGAAGCATTGATCAGTTGGAAGGGACTGCCGCCACACGAGGCAACATGGGAGAATTGC
GCTGACATGAAGTACCAGTTCCCGAAGTTCCACCTTGAGGACAAGGTGGATTTGGAAGAGGAGAGTGATGCTAGGCCCCCTATCTTATTTACGTATAATAGGAGGAATAA
GAAGAAACATGAAACCAATGAGGGGGAAACAGGTGGCAGGGAAGGTCGTGGCCATGAAACCAATACCGAGGAGACACGTGGGGGATGGGAAGAAAGCAAGGAGGATGGGG
ACCAAGAAGGGGGACCCATAGTTAGTTAG
mRNA sequenceShow/hide mRNA sequence
CGGAGAAACTCACAGTGGCAGTGATCAGCTTCAATGGGCCAGCGTTGGACTGGTACCGGTCACAAAAAGAGCGAAAAGCATTTGCCTGATGGGATGACTTGAAACAGAAA
ATGTTAGTAAGGTTCCGAGAGACCAGGGAAGGAACGTTGGTGGGCCGATTCTTAACGATCAAACAGGAGACCACCGTCGAGGAATACCGGAACCGTTTCGACAAGCTACT
AGCACCGGTGGCTTCCTTGCCCACGGTGGTGTTAGAGGAAACATTTATGAATGGGCTAAACCCGTGGTTGAAGTCGGAAGTGGAAACCCTGGAGCCCAATGGGCTGGCCT
AAATGATGAAGTTGGCATTAAAGATAGAAAACAGGGAGCTGGTCCGAAAGGAATGTGGGCTGATTAGTGCCTACGACGTTAAATCCGGCCATAAAAGTCAACAGACTAAG
AACACCGGTTCAACAGCCACGAAGGAGGGATTGACCGGGGGAAGTTGGCCAATGAGAACAATAACTCTGAGAGAGGTAGCCACGGGAGATAACCGTCGGGAAGGACCCAC
GAAACGACTGTCAGATGCAGAGTTCCAAGCCCGGAGGGAAAAGGGATTATGTTTTCGTTGTGGGGAGAAGTATTTTGCAGGACACCGATGCAAGTCAAACGAACATAAAG
AACTCCAAATGCTGGTAGTCAGAGAAGGAGGGGAGGAGCTGGAGATAGTGGAAGAGGAGTTCTTTGATGCCGAAGCTGAGATGAAACAAGTAGAAGTCCAGAATGTGGAG
AACCTGAACATTGAACTCTCTCAACTCAGTGGTGGGACTGACCAACCCAGGGACCATGAAGGTAAAAGGAAGGGTGGGAGAAGAAGAGGTGGTAATCCTAATCGACTGTG
GGGCTACCCACAACTTCATCGCGGAAAAATTGGTGACCAAACTGGGATTGACGCTGCAGGAGACACCGAACTATGGGGGTCTGGAACGGCAGTTAAGGGGAAGGGAGTGT
GCCGAGATGTCGAAGTACAATTGGAAGGCTGGAAGGTGAAGGATAGCTTCCTACCACTGCAATTGGGAGGGGTTGACATGATCCTAGGTATGCAGTGGCTCCATTCTCTG
GGGGTGACGGAAGTCGACTGGAAAGAGCTGATGCTGACCTTCCATCACCAGGGAAGGAAGGTTGTGATAAAAGGGGATCCGAGCCTCACCAAAACACGGGTGAGTTTGAA
GAACTTGATGAAATCCTGGGGAGCAGACGATCAGGGATTCCTTGTAGAATGTCGAACCATAGAATGTGGACTGTTAGAAGAACATGAACAAGACAGAGGGCAGGGGAGGG
AAGATGAAGAAGCAATAGCGACCCTGTTAAAGCAGTTTGCTAGCGTGTTCGAATGGCCCACAGCACTCCCACCACAGCGCAGTATTGATCATCACATCTACCTGAAGAGT
GGAATGGACCCCGTGAATGTCAGACCATACCGGTATGCGCACCATCAGAAGGAAGAGATGGAGCGATTGGTAGACGAAATGCTTTCCTCAGGGATCATACGACCGAGCAA
AAGCCCTTATTCCAGCCCGGTGCTGTTGGTAAGGAAGAAAGACGGGAGTTGGAGGTTTTGTGTAGACTACCGAGCATTGAATAACGTGACGATCCCAGACAAGTTCCCAA
TACCGATGATAGAAGAATTGTTCGACGAATTGAAGGGAGCTAGTGTATTTTCCAAAATAGATCTCAAAGCCGGATACCATCAGATTAGGATGTGCCCCGAGGACATCAAA
AAGACCGCGTTCAGAACTCATGAAGGGCACTACGAATTCTTAGTGATGCCGTTCGGATTAACGAATGCTCCATCGACCTTTCAGGCACTGATGAATCAGGTGTTTAAGCC
ATACTTGAGACGATTTGTGTTGGTATTCTTCGATGATATTTTGGTCTACAGCCAAGGGATAGACGAGCACATCCAGCACTTAGAGGTGGTCTTAGGACTGCTGAAAGAAA
AGGAGTTATATGCGAATTTGGAGAAGTGTAGTTTTGCAAAGCCTCGGATCAGTTATTTGGGGCATGTCATTTCGGAACAGGGCATTGAAGCAGATCCGGAAAAGATAAGA
GCGGTTAGTGAATGGCCAACTCCGACCAATGTGAGGGAAGTTCGGGGATTCCTTGTGCTGACCGGCTACTACCGGCGCTTTGTCAAAGACTATGGAGCAATAGCAGCGCC
ACTCACCCAACTGTTGAAGAAGGGGGCGTACAAGTGGGATGTTGAAACTGAGACTGCTTTTGATAAGTTGAAGAAGGCCATGATGACTCTACCGGTACTTGCCATGCCCG
ACTTCAATCTGCCCTTCGAAATCGAATCAGATGCTTCAGGATTTGGGGTTGGGGCGGTATTGACTCAATGCCGAAAGCCCGTAGCTTATTTCAGTAAAACACTAAGTATG
CGAGAGAGAGCGCGGCCGGTTACAAAGATGGCGGCCGTACTTGCTAGGAAGGAAGTTCACGGTGAAGACAGACCAAAGGTCGCTGAAGTTTCTGTTGGAACAACGTGTTG
TGCACCTCAGTATCAAAGATGGGTCGCGAAGCTGTTGGGATATTCGTTCGAAGTAACCTACCAGCCAGGACTGGAGAATAAAGCCGCTGACGCCCTGTCCCGAATTTCAC
CAACCGTGCAGTTGAATCAAATCACAGCCCCCACAATGATAGATGTGGATATCACCAAGGAAGAAACAAGGCAGGACCCCGCGCTGCAAGAAATAATTAGATTGATCGAA
GAGCAAGGGATGGAGATACCCCATTACACTCTGCAGCAGGGGGTGTTGAAGTTTAAAGGACGGCTGGTGATTCCAAGCAACTCCACTCTGCTACCTACAATATTACACAC
CTACCATGATTCAGTGTTCGGGGGGCACTCGGGATTTTTAAGAACGTATAAGCGGTTGACTGGGGAGCTCTACTGGAAGGGCATGAAGAAGGACATTATAAGGTATTGTG
AAGAGTGCGCGATATGCCAGCGAAATAAGTCTTCAGCATTGTCACCGGCAGGGCTACTGATGCCATTGGAAATTCCCGACGCAATATGGAGTGACATCTCCATGGACTTC
ATTGAAGGGTTGCCAAAATCCAAGGGGTGGGATGTGATACTTGTGGTGGTGGATAGACTGAGTAAATATGGCCATTTCCTGCTTTTGAAGCATCCTTTCACAGCCAAGAC
GGTGGCAGAAACTTTTGTCAGGGAAGTGATCCGACTTCATGGGTATCCGAGATCGATAGTGTCTGATAGGGACAAGGTTTTCCTAAGTCATTTCTGGAAAGAACTATTCC
GTTTAGCAGGCACTAAGCTGAACCGAAGCTCCTCCTATCACCCACAATCAGATGGTCAGACCGAGGTGGTGAACAAGAGTGTTGAGACATATTTGAGCTGTCTATGGAGG
CCTACCCCCTCCACTGGTGTACTATGGAGACATGGAGACATGGAGACACCAAATTCGACACTCGACCAGCAGTTGAAAGACAGAGATATCGCACTAGGGGCGTTGAAGAA
ACACATGAAAATAACTCAAGAAAGGATGAAGAAACAGGCTGACACCAAGAGAAGGGAAGTTGAATTCCAAGAAGGGGATTTGGTGTTCCTCAAATTACGACCTTACCGGC
AGACATCACTTAGAAAGAAAAGGAATGAAAAGCTATCACCAAAGTACTTCGGGCCTTATCGGATCTTAGAAAGAATCGGAGAAGTAGCATACAAACTCGAACTTCCTGCG
GATGCTGCTATCCACCCCGTGTTCCATGTGTCACAGCCGAAGAAAGCTGTTGGGAGAGGCGAAACGGTGCAACCATTGAATCCATACATGAATGAAAACCATGAATGGAT
CACGCAGCCCGAAGAAGTCTACGGCTATCGAAAGAATCCAGCAACTAATGATTGGGAAGCATTGATCAGTTGGAAGGGACTGCCGCCACACGAGGCAACATGGGAGAATT
GCGCTGACATGAAGTACCAGTTCCCGAAGTTCCACCTTGAGGACAAGGTGGATTTGGAAGAGGAGAGTGATGCTAGGCCCCCTATCTTATTTACGTATAATAGGAGGAAT
AAGAAGAAACATGAAACCAATGAGGGGGAAACAGGTGGCAGGGAAGGTCGTGGCCATGAAACCAATACCGAGGAGACACGTGGGGGATGGGAAGAAAGCAAGGAGGATGG
GGACCAAGAAGGGGGACCCATAGTTAGTTAGTAGTGGGTAATAAAAAGGGGACAGCAGGCACAGGGAAGGGCAGAAAATGTTTTAGAGGAAAAGGGGGCAGTGGTAGCCA
TCTCCTG
Protein sequenceShow/hide protein sequence
MMKLALKIENRELVRKECGLISAYDVKSGHKSQQTKNTGSTATKEGLTGGSWPMRTITLREVATGDNRREGPTKRLSDAEFQARREKGLCFRCGEKYFAGHRCKSNEHKE
LQMLVVREGGEELEIVEEEFFDAEAEMKQVEVQNVENLNIELSQLSGGTDQPRDHEGKRKGGRRRGGNPNRLWGYPQLHRGKIGDQTGIDAAGDTELWGSGTAVKGKGVC
RDVEVQLEGWKVKDSFLPLQLGGVDMILGMQWLHSLGVTEVDWKELMLTFHHQGRKVVIKGDPSLTKTRVSLKNLMKSWGADDQGFLVECRTIECGLLEEHEQDRGQGRE
DEEAIATLLKQFASVFEWPTALPPQRSIDHHIYLKSGMDPVNVRPYRYAHHQKEEMERLVDEMLSSGIIRPSKSPYSSPVLLVRKKDGSWRFCVDYRALNNVTIPDKFPI
PMIEELFDELKGASVFSKIDLKAGYHQIRMCPEDIKKTAFRTHEGHYEFLVMPFGLTNAPSTFQALMNQVFKPYLRRFVLVFFDDILVYSQGIDEHIQHLEVVLGLLKEK
ELYANLEKCSFAKPRISYLGHVISEQGIEADPEKIRAVSEWPTPTNVREVRGFLVLTGYYRRFVKDYGAIAAPLTQLLKKGAYKWDVETETAFDKLKKAMMTLPVLAMPD
FNLPFEIESDASGFGVGAVLTQCRKPVAYFSKTLSMRERARPVTKMAAVLARKEVHGEDRPKVAEVSVGTTCCAPQYQRWVAKLLGYSFEVTYQPGLENKAADALSRISP
TVQLNQITAPTMIDVDITKEETRQDPALQEIIRLIEEQGMEIPHYTLQQGVLKFKGRLVIPSNSTLLPTILHTYHDSVFGGHSGFLRTYKRLTGELYWKGMKKDIIRYCE
ECAICQRNKSSALSPAGLLMPLEIPDAIWSDISMDFIEGLPKSKGWDVILVVVDRLSKYGHFLLLKHPFTAKTVAETFVREVIRLHGYPRSIVSDRDKVFLSHFWKELFR
LAGTKLNRSSSYHPQSDGQTEVVNKSVETYLSCLWRPTPSTGVLWRHGDMETPNSTLDQQLKDRDIALGALKKHMKITQERMKKQADTKRREVEFQEGDLVFLKLRPYRQ
TSLRKKRNEKLSPKYFGPYRILERIGEVAYKLELPADAAIHPVFHVSQPKKAVGRGETVQPLNPYMNENHEWITQPEEVYGYRKNPATNDWEALISWKGLPPHEATWENC
ADMKYQFPKFHLEDKVDLEEESDARPPILFTYNRRNKKKHETNEGETGGREGRGHETNTEETRGGWEESKEDGDQEGGPIVS