; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0065741 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0065741
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:7355599..7357821
RNA-Seq ExpressionCmc03g0065741
SyntenyCmc03g0065741
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035938.1 pol protein [Cucumis melo var. makuwa]1.9e-28274.56Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VT W
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------
        +RPST+SEV SFLGLA                                                                                    
Subjt:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------

Query:  ------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCL
                                GKANVV D LSRKVSHSAA ITRQAPLH+D ERAEIAVSVGAVT+QLAQLTVQPTLR++IIDAQS+DPYLVEKR L
Subjt:  ------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCL

Query:  AKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVP
        A+AGQ  EF +SSDGGLLFE RLCVPSDSA+KTELL+EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREV EFVSKCLVCQQVKAP+QKP GLLQPLS+P
Subjt:  AKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVP

Query:  EWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHL
        EWKWENVSMDFITGLPRTLRG  VIWV+VDRLT+SAHF+PGKSTYT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFWKGLQ AMGTRLDFST FH 
Subjt:  EWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHL

Query:  QTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSR
        QTDGQ E LNQVLEDMLRACALEF GSWD+HLHLMEF+YNN++QATIGM PFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR RMHTAQSR
Subjt:  QTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSR

Query:  QKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
        QKSYA+VRRKDLEF++GDKVFLKVAPMKGV+RFERRGKLSP FVG FEILERIGPVAYRLALPPSL   +DVFHVSMLRKYVPD
Subjt:  QKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

KAA0047001.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.7e-28173.94Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLM RVFREFLDTF+IVFIDDILIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VTGW
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEV-------------------------------------------------------------------------------------------
        +RPST+SE                                                                                            
Subjt:  SRPSTISEV-------------------------------------------------------------------------------------------

Query:  --------------------CSFLGLAGKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEK
                            C  L   GKANVVAD LSRKVSHSAA ITRQAPLH+D ERAEIAVSVGAVT+QLAQLTVQPTLR++IIDAQS+DPYLVEK
Subjt:  --------------------CSFLGLAGKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEK

Query:  RCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPL
        R LA+AGQAVEF +SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREV EFVS+CLVCQQVKAP+QKP GLLQPL
Subjt:  RCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPL

Query:  SVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTT
        S+PEWKWENVSMDFITGLPRTLRG  VIWV+VDRLT+SAHF+PGKS YT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFWKGLQ AMGTRLDFST 
Subjt:  SVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTT

Query:  FHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTA
        FH QTDGQ E LNQVLEDMLRACALEF GSWDSHLHLMEF+YNN++QATIGM PFEALY KCCRS VCWGEVGEQRLMGPELVQSTNE IQKIR RMHTA
Subjt:  FHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTA

Query:  QSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
        QSRQKSYANVRRKDLEF VGDKVFLKVAPM+GV+RFE++GKLSP FVG FEILERIGPVAYRLALPPSLS VHDVFHVSMLRKYVPD
Subjt:  QSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

KAA0051522.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-28478.69Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEE+LR+VLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VTGW
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISE-------------------------------------VCSFLG------------------------------LAGKANVVADVLSRKVSHS
        +RPST+SE                                     V +++                               L GKANVVAD LSRKVSHS
Subjt:  SRPSTISE-------------------------------------VCSFLG------------------------------LAGKANVVADVLSRKVSHS

Query:  AAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHS
        AA ITRQAPLH+D ERAEIAVSVGAVT+QLAQLTVQPTLR++IIDAQS+DPYLVEKR  A+A QAVEF ISSDGGLLF RRLCVPSDSAVKTELLSEAHS
Subjt:  AAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPG
        SPFSMHPGSTKMYQDLKRVYWWRNMKREV EFVS+CLVCQQVKAP+QKP GLLQPLS+PEWKWENVSMDFITGLPRTLRG  VIWV+V+RLT+SAHF+PG
Subjt:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPG

Query:  KSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNN
        KSTYT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFW+GLQ AMGTRLDFST FH QTDGQ E LNQVLEDMLRACALEF GSWDSHLHLMEF+YNN
Subjt:  KSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNN

Query:  NFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSP
        ++QATIGMTPFEALYGKCCRSPV WGEVGEQRLMGPELVQSTNEAIQKIR  MHTAQ+RQKSYA+VRRKDLEF+VGDKVFLKVAPM+GV+RFERRGKLSP
Subjt:  NFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSP

Query:  CFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
         FVG FE+LERIGPVAYRLALPPSLS VHDVFHVSMLRKYVPD
Subjt:  CFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

KAA0062141.1 pol protein [Cucumis melo var. makuwa]4.4e-28474.03Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFR+FLDTFVIVFIDDILIYSKTEAEHEE+LR+VLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VTGW
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------
        +RPST+SEV SFLGLA                                                                                    
Subjt:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------

Query:  ---------------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSD
                                         GKANVVAD LSRKVSHSAA ITRQAPLH+D ERAEIAVS+GAVT+QLAQLTVQPTLR++IIDAQS+D
Subjt:  ---------------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSD

Query:  PYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPV
        PYLVEKR LA+ GQAVEF ISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGS KMYQ+LKRVYWWRNMKREV EFVS+CLVCQQVKAP+QKP 
Subjt:  PYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPV

Query:  GLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTR
        GLLQPLS+ EWKWENVSMDFITGLPRTLRG  VIWV+VDRLT+SAHF+PGKSTYT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFWKGLQ AMGTR
Subjt:  GLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTR

Query:  LDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR
        LDFST FH QTDGQ E LNQVLEDMLRACALEF GSWDSHLHLMEF+YNN++QATIGM PFE LYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR
Subjt:  LDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR

Query:  LRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
         RMHTAQSRQKSYA+VRRKDLEF+VGDKVFLKVAPM+GV+RFERRGKLSP FVG FEILERIGPVAYRLALPPSLS VHDVFHVSMLRKYVPD
Subjt:  LRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

KAA0062245.1 pol protein [Cucumis melo var. makuwa]1.4e-28574.11Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VTGW
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------
        +RPSTISEV SFLGLA                                                                                    
Subjt:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------

Query:  ---------------------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKII
                                               GKANVVAD LSRKVSHSAA ITRQAPLH+D ERAEIAVSVGAVT+QLAQLTVQPTLR++II
Subjt:  ---------------------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKII

Query:  DAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKA
        DAQS+DPYLVEKR LA+AGQA EF +SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREV EFVSKCLVCQQVKA
Subjt:  DAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKA

Query:  PKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQ
        P QKP GLLQPLS+PEWKWENVSMDFITGLPRTLRG  VIWV+VDRLT+SAHF+ GKSTYT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFWKGLQ
Subjt:  PKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQ

Query:  AAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNE
         AMGTRLDFST FH QTDGQ E LNQVLEDMLRACALEF GSWDSHLHLMEF+YNN++QATIGM PFEALYGKCC+SPVCWGEVGEQRLMGPELVQSTNE
Subjt:  AAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNE

Query:  AIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
        AIQKIR RMHTAQSRQKSYA+VRRKDLEF+VGDKVFLKVAPM+GV+RFERRGKLSP FVG FEILERIGP+AYRLALPPSLS VHDVFHVSMLRKYVPD
Subjt:  AIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

TrEMBL top hitse value%identityAlignment
A0A5A7SZD6 Pol protein9.0e-28374.56Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VT W
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------
        +RPST+SEV SFLGLA                                                                                    
Subjt:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------

Query:  ------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCL
                                GKANVV D LSRKVSHSAA ITRQAPLH+D ERAEIAVSVGAVT+QLAQLTVQPTLR++IIDAQS+DPYLVEKR L
Subjt:  ------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCL

Query:  AKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVP
        A+AGQ  EF +SSDGGLLFE RLCVPSDSA+KTELL+EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREV EFVSKCLVCQQVKAP+QKP GLLQPLS+P
Subjt:  AKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVP

Query:  EWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHL
        EWKWENVSMDFITGLPRTLRG  VIWV+VDRLT+SAHF+PGKSTYT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFWKGLQ AMGTRLDFST FH 
Subjt:  EWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHL

Query:  QTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSR
        QTDGQ E LNQVLEDMLRACALEF GSWD+HLHLMEF+YNN++QATIGM PFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR RMHTAQSR
Subjt:  QTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSR

Query:  QKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
        QKSYA+VRRKDLEF++GDKVFLKVAPMKGV+RFERRGKLSP FVG FEILERIGPVAYRLALPPSL   +DVFHVSMLRKYVPD
Subjt:  QKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

A0A5A7U6Z4 Ty3-gypsy retrotransposon protein1.6e-28478.69Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEE+LR+VLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VTGW
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISE-------------------------------------VCSFLG------------------------------LAGKANVVADVLSRKVSHS
        +RPST+SE                                     V +++                               L GKANVVAD LSRKVSHS
Subjt:  SRPSTISE-------------------------------------VCSFLG------------------------------LAGKANVVADVLSRKVSHS

Query:  AAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHS
        AA ITRQAPLH+D ERAEIAVSVGAVT+QLAQLTVQPTLR++IIDAQS+DPYLVEKR  A+A QAVEF ISSDGGLLF RRLCVPSDSAVKTELLSEAHS
Subjt:  AAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPG
        SPFSMHPGSTKMYQDLKRVYWWRNMKREV EFVS+CLVCQQVKAP+QKP GLLQPLS+PEWKWENVSMDFITGLPRTLRG  VIWV+V+RLT+SAHF+PG
Subjt:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPG

Query:  KSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNN
        KSTYT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFW+GLQ AMGTRLDFST FH QTDGQ E LNQVLEDMLRACALEF GSWDSHLHLMEF+YNN
Subjt:  KSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNN

Query:  NFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSP
        ++QATIGMTPFEALYGKCCRSPV WGEVGEQRLMGPELVQSTNEAIQKIR  MHTAQ+RQKSYA+VRRKDLEF+VGDKVFLKVAPM+GV+RFERRGKLSP
Subjt:  NFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSP

Query:  CFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
         FVG FE+LERIGPVAYRLALPPSLS VHDVFHVSMLRKYVPD
Subjt:  CFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

A0A5A7UP94 Pol protein1.3e-28170.01Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VTGW
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------
        +RPST+SEV SFLGLA                                                                                    
Subjt:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------

Query:  -----------------------------------------------------------------------------GKANVVADVLSRKVSHSAAFITR
                                                                                     GKANVVAD LSRKVSHSAA ITR
Subjt:  -----------------------------------------------------------------------------GKANVVADVLSRKVSHSAAFITR

Query:  QAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMH
        QAPLH+D ERAEIAVSVGAVT+QLAQLTVQPTLR++IIDAQS+DPYLVEKR LA+AGQAVEF +SSDGGLLFERRLCVPSDSAVKTELL+EAHSSPFSMH
Subjt:  QAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMH

Query:  PGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTS
        PGSTKMYQDLKR+YWWRNMKREV EFVSKCLVCQQVKAP+QKP GLLQPLS+PEWKWENVSMDFI GLPRTLRG  VIWV+VDRLT+SAHF+PGKSTYT+
Subjt:  PGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTS

Query:  SKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATI
        SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFWKGLQ AMGTRLDFST FH QTDGQ E LN+VLEDMLRACALEF GSWDSHLHLMEF+YNN++QATI
Subjt:  SKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATI

Query:  GMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSF
        GM PFEALY KCCRSP+CWGEVGEQRLMGPELVQSTNEAIQKIR RMHTAQSRQKSYA+VRRKDLEF+VGDKVFLKVAPM+GVVRFERRGKLSP FVG F
Subjt:  GMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSF

Query:  EILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
        EILERIGPVAYRLALPPSLS VHDVFHVSMLRKYVPD
Subjt:  EILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

A0A5A7V8L8 Pol protein6.7e-28674.11Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEE+LRMVLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VTGW
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------
        +RPSTISEV SFLGLA                                                                                    
Subjt:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------

Query:  ---------------------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKII
                                               GKANVVAD LSRKVSHSAA ITRQAPLH+D ERAEIAVSVGAVT+QLAQLTVQPTLR++II
Subjt:  ---------------------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKII

Query:  DAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKA
        DAQS+DPYLVEKR LA+AGQA EF +SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREV EFVSKCLVCQQVKA
Subjt:  DAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKA

Query:  PKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQ
        P QKP GLLQPLS+PEWKWENVSMDFITGLPRTLRG  VIWV+VDRLT+SAHF+ GKSTYT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFWKGLQ
Subjt:  PKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQ

Query:  AAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNE
         AMGTRLDFST FH QTDGQ E LNQVLEDMLRACALEF GSWDSHLHLMEF+YNN++QATIGM PFEALYGKCC+SPVCWGEVGEQRLMGPELVQSTNE
Subjt:  AAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNE

Query:  AIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
        AIQKIR RMHTAQSRQKSYA+VRRKDLEF+VGDKVFLKVAPM+GV+RFERRGKLSP FVG FEILERIGP+AYRLALPPSLS VHDVFHVSMLRKYVPD
Subjt:  AIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

A0A5A7V8X5 Pol protein2.1e-28474.03Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M FGLTNAP VFMDLMNRVFR+FLDTFVIVFIDDILIYSKTEAEHEE+LR+VLQTLRDNKLYAKFSKCEFWLKQV FLGHVV KAGV VDP KIE VTGW
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------
        +RPST+SEV SFLGLA                                                                                    
Subjt:  SRPSTISEVCSFLGLA------------------------------------------------------------------------------------

Query:  ---------------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSD
                                         GKANVVAD LSRKVSHSAA ITRQAPLH+D ERAEIAVS+GAVT+QLAQLTVQPTLR++IIDAQS+D
Subjt:  ---------------------------------GKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSD

Query:  PYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPV
        PYLVEKR LA+ GQAVEF ISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGS KMYQ+LKRVYWWRNMKREV EFVS+CLVCQQVKAP+QKP 
Subjt:  PYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPV

Query:  GLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTR
        GLLQPLS+ EWKWENVSMDFITGLPRTLRG  VIWV+VDRLT+SAHF+PGKSTYT+SKWAQLYMSEIVRL  V VSIVSDRDA FTSKFWKGLQ AMGTR
Subjt:  GLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTR

Query:  LDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR
        LDFST FH QTDGQ E LNQVLEDMLRACALEF GSWDSHLHLMEF+YNN++QATIGM PFE LYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR
Subjt:  LDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR

Query:  LRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD
         RMHTAQSRQKSYA+VRRKDLEF+VGDKVFLKVAPM+GV+RFERRGKLSP FVG FEILERIGPVAYRLALPPSLS VHDVFHVSMLRKYVPD
Subjt:  LRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAYRLALPPSLSVVHDVFHVSMLRKYVPD

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.6e-5023.47Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ + + G       I+ V  W
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------
         +P    E+  FLG          K + +   L+  +                       ++     H DF +  +       V+VGAV  Q        
Subjt:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST
                         + Q+++    + +++   ++D  L+    L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG  
Subjt:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST

Query:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA
         +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G   ++V+VDR ++ A  +P   + T+ + A
Subjt:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA

Query:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP
        +++   ++        I++D D  FTS+ WK         + FS  +  QTDGQ E  NQ +E +LR        +W  H+ L++ SYNN   +   MTP
Subjt:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP

Query:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI
        FE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y +++ +++ EF  GD V +K     G +   +  KL+P F G F +
Subjt:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI

Query:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY
        L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein1.6e-5023.47Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ + + G       I+ V  W
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------
         +P    E+  FLG          K + +   L+  +                       ++     H DF +  +       V+VGAV  Q        
Subjt:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST
                         + Q+++    + +++   ++D  L+    L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG  
Subjt:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST

Query:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA
         +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G   ++V+VDR ++ A  +P   + T+ + A
Subjt:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA

Query:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP
        +++   ++        I++D D  FTS+ WK         + FS  +  QTDGQ E  NQ +E +LR        +W  H+ L++ SYNN   +   MTP
Subjt:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP

Query:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI
        FE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y +++ +++ EF  GD V +K     G +   +  KL+P F G F +
Subjt:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI

Query:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY
        L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein1.6e-5023.47Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ + + G       I+ V  W
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------
         +P    E+  FLG          K + +   L+  +                       ++     H DF +  +       V+VGAV  Q        
Subjt:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST
                         + Q+++    + +++   ++D  L+    L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG  
Subjt:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST

Query:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA
         +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G   ++V+VDR ++ A  +P   + T+ + A
Subjt:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA

Query:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP
        +++   ++        I++D D  FTS+ WK         + FS  +  QTDGQ E  NQ +E +LR        +W  H+ L++ SYNN   +   MTP
Subjt:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP

Query:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI
        FE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y +++ +++ EF  GD V +K     G +   +  KL+P F G F +
Subjt:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI

Query:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY
        L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY

P0CT37 Transposon Tf2-4 polyprotein1.6e-5023.47Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ + + G       I+ V  W
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------
         +P    E+  FLG          K + +   L+  +                       ++     H DF +  +       V+VGAV  Q        
Subjt:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST
                         + Q+++    + +++   ++D  L+    L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG  
Subjt:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST

Query:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA
         +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G   ++V+VDR ++ A  +P   + T+ + A
Subjt:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA

Query:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP
        +++   ++        I++D D  FTS+ WK         + FS  +  QTDGQ E  NQ +E +LR        +W  H+ L++ SYNN   +   MTP
Subjt:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP

Query:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI
        FE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y +++ +++ EF  GD V +K     G +   +  KL+P F G F +
Subjt:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI

Query:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY
        L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein1.6e-5023.47Show/hide
Query:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW
        M +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH ++++ VLQ L++  L    +KCEF   QV F+G+ + + G       I+ V  W
Subjt:  MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGW

Query:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------
         +P    E+  FLG          K + +   L+  +                       ++     H DF +  +       V+VGAV  Q        
Subjt:  SRPSTISEVCSFLGLAG-------KANVVADVLSRKVSHSA-----------------AFITRQAPLHQDFERAEIA------VSVGAVTLQ--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST
                         + Q+++    + +++   ++D  L+    L    + VE  I    GLL   +  + +P+D+ +   ++ + H     +HPG  
Subjt:  -----------------LAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGST

Query:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA
         +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G   ++V+VDR ++ A  +P   + T+ + A
Subjt:  KMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHFIPGKSTYTSSKWA

Query:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP
        +++   ++        I++D D  FTS+ WK         + FS  +  QTDGQ E  NQ +E +LR        +W  H+ L++ SYNN   +   MTP
Subjt:  QLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIGMTP

Query:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI
        FE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y +++ +++ EF  GD V +K     G +   +  KL+P F G F +
Subjt:  FEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDL-EFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEI

Query:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY
        L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LERIGPVAYRLALPPSLS-VVHDVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGATTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTGATGACATTTTGAT
ATATTCCAAGACAGAGGCCGAGCATGAGGAGTATTTACGTATGGTTCTACAAACCCTTCGAGATAATAAACTGTATGCGAAGTTCTCGAAATGTGAGTTTTGGTTGAAGC
AGGTATTCTTTCTAGGCCATGTGGTTTTTAAGGCTGGAGTTTTTGTGGATCCAACTAAGATAGAGGTAGTCACCGGTTGGTCCCGACCTTCTACGATCAGTGAGGTTTGT
AGCTTTCTGGGTTTAGCAGGCAAGGCGAATGTGGTAGCTGATGTTCTTAGTAGAAAGGTATCACATTCAGCAGCATTTATTACCCGACAGGCCCCATTGCATCAAGATTT
TGAGAGGGCTGAGATTGCAGTGTCAGTAGGGGCAGTCACTTTGCAATTAGCCCAGTTAACGGTGCAGCCGACTTTGAGGGAGAAGATCATTGATGCTCAAAGTAGCGATC
CTTACTTGGTTGAGAAGCGTTGTCTAGCAAAAGCAGGGCAAGCTGTTGAGTTTTGCATATCCTCTGATGGTGGACTTTTGTTTGAGAGGCGCCTCTGTGTGCCATCAGAT
AGTGCGGTTAAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACGAAGATGTATCAAGACCTGAAACGGGTTTACTGGTGGCGTAA
TATGAAGAGAGAGGTGGTAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAAGCACCAAAGCAGAAACCAGTAGGTTTATTACAACCCTTGAGCGTACCGGAAT
GGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGACTGCCTAGAACTCTGAGGGGTTTAATAGTGATTTGGGTTCTGGTTGACAGGCTTACCGAATCAGCACACTTC
ATTCCGGGGAAATCCACTTATACTTCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGACTACAAGAAGTGCAAGTGTCGATTGTTTCTGATAGAGATGCCCA
CTTCACTTCCAAATTCTGGAAGGGTTTGCAGGCTGCTATGGGTACGAGGTTAGACTTCAGTACGACTTTCCACCTACAGACTGATGGTCAAAATGAGCATCTTAACCAAG
TTTTAGAGGATATGTTGCGAGCTTGTGCATTAGAATTTCTAGGTAGTTGGGACTCCCACTTGCATTTGATGGAATTTTCTTATAATAACAATTTTCAGGCTACCATTGGC
ATGACACCATTTGAGGCCTTGTACGGCAAGTGTTGTAGATCCCCTGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAATGA
AGCGATACAGAAAATTAGATTACGTATGCATACTGCACAGAGTAGGCAGAAGAGTTATGCTAATGTGAGACGGAAAGATCTTGAGTTTGATGTGGGGGACAAAGTGTTCT
TGAAAGTAGCACCTATGAAAGGTGTCGTACGATTTGAAAGGAGAGGAAAGCTGAGTCCTTGTTTTGTTGGGTCGTTTGAGATTCTAGAGCGAATTGGCCCTGTAGCGTAT
CGTTTGGCGTTGCCACCATCACTCTCGGTAGTTCATGATGTGTTCCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCATCCCATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTATTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGATTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTGATGACATTTTGAT
ATATTCCAAGACAGAGGCCGAGCATGAGGAGTATTTACGTATGGTTCTACAAACCCTTCGAGATAATAAACTGTATGCGAAGTTCTCGAAATGTGAGTTTTGGTTGAAGC
AGGTATTCTTTCTAGGCCATGTGGTTTTTAAGGCTGGAGTTTTTGTGGATCCAACTAAGATAGAGGTAGTCACCGGTTGGTCCCGACCTTCTACGATCAGTGAGGTTTGT
AGCTTTCTGGGTTTAGCAGGCAAGGCGAATGTGGTAGCTGATGTTCTTAGTAGAAAGGTATCACATTCAGCAGCATTTATTACCCGACAGGCCCCATTGCATCAAGATTT
TGAGAGGGCTGAGATTGCAGTGTCAGTAGGGGCAGTCACTTTGCAATTAGCCCAGTTAACGGTGCAGCCGACTTTGAGGGAGAAGATCATTGATGCTCAAAGTAGCGATC
CTTACTTGGTTGAGAAGCGTTGTCTAGCAAAAGCAGGGCAAGCTGTTGAGTTTTGCATATCCTCTGATGGTGGACTTTTGTTTGAGAGGCGCCTCTGTGTGCCATCAGAT
AGTGCGGTTAAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACGAAGATGTATCAAGACCTGAAACGGGTTTACTGGTGGCGTAA
TATGAAGAGAGAGGTGGTAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAAGCACCAAAGCAGAAACCAGTAGGTTTATTACAACCCTTGAGCGTACCGGAAT
GGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGACTGCCTAGAACTCTGAGGGGTTTAATAGTGATTTGGGTTCTGGTTGACAGGCTTACCGAATCAGCACACTTC
ATTCCGGGGAAATCCACTTATACTTCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGACTACAAGAAGTGCAAGTGTCGATTGTTTCTGATAGAGATGCCCA
CTTCACTTCCAAATTCTGGAAGGGTTTGCAGGCTGCTATGGGTACGAGGTTAGACTTCAGTACGACTTTCCACCTACAGACTGATGGTCAAAATGAGCATCTTAACCAAG
TTTTAGAGGATATGTTGCGAGCTTGTGCATTAGAATTTCTAGGTAGTTGGGACTCCCACTTGCATTTGATGGAATTTTCTTATAATAACAATTTTCAGGCTACCATTGGC
ATGACACCATTTGAGGCCTTGTACGGCAAGTGTTGTAGATCCCCTGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAATGA
AGCGATACAGAAAATTAGATTACGTATGCATACTGCACAGAGTAGGCAGAAGAGTTATGCTAATGTGAGACGGAAAGATCTTGAGTTTGATGTGGGGGACAAAGTGTTCT
TGAAAGTAGCACCTATGAAAGGTGTCGTACGATTTGAAAGGAGAGGAAAGCTGAGTCCTTGTTTTGTTGGGTCGTTTGAGATTCTAGAGCGAATTGGCCCTGTAGCGTAT
CGTTTGGCGTTGCCACCATCACTCTCGGTAGTTCATGATGTGTTCCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCATCCCATGTAG
Protein sequenceShow/hide protein sequence
MYFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEYLRMVLQTLRDNKLYAKFSKCEFWLKQVFFLGHVVFKAGVFVDPTKIEVVTGWSRPSTISEVC
SFLGLAGKANVVADVLSRKVSHSAAFITRQAPLHQDFERAEIAVSVGAVTLQLAQLTVQPTLREKIIDAQSSDPYLVEKRCLAKAGQAVEFCISSDGGLLFERRLCVPSD
SAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVVEFVSKCLVCQQVKAPKQKPVGLLQPLSVPEWKWENVSMDFITGLPRTLRGLIVIWVLVDRLTESAHF
IPGKSTYTSSKWAQLYMSEIVRLQEVQVSIVSDRDAHFTSKFWKGLQAAMGTRLDFSTTFHLQTDGQNEHLNQVLEDMLRACALEFLGSWDSHLHLMEFSYNNNFQATIG
MTPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRLRMHTAQSRQKSYANVRRKDLEFDVGDKVFLKVAPMKGVVRFERRGKLSPCFVGSFEILERIGPVAY
RLALPPSLSVVHDVFHVSMLRKYVPDHPM