; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0023871 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0023871
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Genome locationchr10:8146107..8150435
RNA-Seq ExpressionIVF0023871
SyntenyIVF0023871
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025344.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.083.06Show/hide
Query:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT
        QPGQESIASTVRR PCT CGRNHRGQCLVG GVCYQCGQPGHFKKDCPQLNMTVQRDQGVG QT+EQSRVSVVPTEGTSGARQKG VGRPRQQGKVYAMT
Subjt:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT

Query:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
        QQE EDAPDVIT TILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPL E LAIYT VGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
Subjt:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM

Query:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------
        DFLFAHYASMDC+RK+VVF+KPGFA+VVFRGM+KAVSRSLIS LKAEKLLRKGCT FL+HIVVVQREKLK EDVPVVKEFL                   
Subjt:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------

Query:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
                    APYRMAPSELKELKMQLQELVDKG IRPSVS WGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
Subjt:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL

Query:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---
        +SGY QLKVRES+IA TAFR RYGHYEFRV+PFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK   
Subjt:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---

Query:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV
                                 +VVNWE+PISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSF ELKKRLVTAPILALPV
Subjt:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV

Query:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD
        TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHE                                  HKSLKYIFDQKELNLRQRRWLELIKDYD
Subjt:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD

Query:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA
        CTIEYHP KANVVADALSRK RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQS++SNLQKKFEKSKKGLEVEFELRTDGA
Subjt:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA

Query:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
        IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
Subjt:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP

Query:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM
        RTSSGHD                                                              ++ G GLKFSTSFHPQTDGQSERTIQTL+DM
Subjt:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM

Query:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV
        LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG+APY ALYGRPCRT VCWNEVGERKLVGPELVQITTNNIKLIRENLR  QDRQKSYADKRRRNLEFQV
Subjt:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV
        GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQI ERVGP AYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQ+QPVELKEDLSYVEE VQILDRKEQV
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV

Query:  LRNKMIPLIK
        LRNK IPLIK
Subjt:  LRNKMIPLIK

KAA0032277.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.083.06Show/hide
Query:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT
        QPGQESIASTVRR PCT CGRNHRGQCLVG GVCYQCGQPGHFKKDCPQLNMTVQRDQGVG QT+EQSRVSVVPTEGTSGARQKG VGRPRQQGKVYAMT
Subjt:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT

Query:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
        QQE EDAPDVIT TILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPL E LAIYT VGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
Subjt:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM

Query:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------
        DFLFAHYASMDC+RK+VVF+KPGFA+VVFRGM+KAVSRSLIS LKAEKLLRKGCT FL+HIVVVQREKLK EDVPVVKEFL                   
Subjt:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------

Query:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
                    APYRMAPSELKELKMQLQELVDKG IRPSVS WGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
Subjt:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL

Query:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---
        +SGY QLKVRES+IA TAFR RYGHYEFRV+PFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK   
Subjt:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---

Query:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV
                                 +VVNWE+PISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSF ELKKRLVTAPILALPV
Subjt:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV

Query:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD
        TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHE                                  HKSLKYIFDQKELNLRQRRWLELIKDYD
Subjt:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD

Query:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA
        CTIEYHP KANVVADALSRK RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQS++SNLQKKFEKSKKGLEVEFELRTDGA
Subjt:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA

Query:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
        IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
Subjt:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP

Query:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM
        RTSSGHD                                                              ++ G GLKFSTSFHPQTDGQSERTIQTL+DM
Subjt:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM

Query:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV
        LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG+APY ALYGRPCRT VCWNEVGERKLVGPELVQITTNNIKLIRENLR  QDRQKSYADKRRRNLEFQV
Subjt:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV
        GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQI ERVGP AYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQ+QPVELKEDLSYVEE VQILDRKEQV
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV

Query:  LRNKMIPLIK
        LRNK IPLIK
Subjt:  LRNKMIPLIK

KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.083.06Show/hide
Query:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT
        QPGQESIASTVRR PCT CGRNHRGQCLVG GVCYQCGQPGHFKKDCPQLNMTVQRDQGVG QT+EQSRVSVVPTEGTSGARQKG VGRPRQQGKVYAMT
Subjt:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT

Query:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
        QQE EDAPDVIT TILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPL E LAIYT VGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
Subjt:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM

Query:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------
        DFLFAHYASMDC+RK+VVF+KPGFA+VVFRGM+KAVSRSLIS LKAEKLLRKGCT FL+HIVVVQREKLK EDVPVVKEFL                   
Subjt:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------

Query:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
                    APYRMAPSELKELKMQLQELVDKG IRPSVS WGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
Subjt:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL

Query:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---
        +SGY QLKVRES+IA TAFR RYGHYEFRV+PFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK   
Subjt:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---

Query:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV
                                 +VVNWE+PISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSF ELKKRLVTAPILALPV
Subjt:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV

Query:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD
        TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHE                                  HKSLKYIFDQKELNLRQRRWLELIKDYD
Subjt:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD

Query:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA
        CTIEYHP KANVVADALSRK RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQS++SNLQKKFEKSKKGLEVEFELRTDGA
Subjt:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA

Query:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
        IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
Subjt:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP

Query:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM
        RTSSGHD                                                              ++ G GLKFSTSFHPQTDGQSERTIQTL+DM
Subjt:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM

Query:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV
        LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG+APY ALYGRPCRT VCWNEVGERKLVGPELVQITTNNIKLIRENLR  QDRQKSYADKRRRNLEFQV
Subjt:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV
        GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQI ERVGP AYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQ+QPVELKEDLSYVEE VQILDRKEQV
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV

Query:  LRNKMIPLIK
        LRNK IPLIK
Subjt:  LRNKMIPLIK

KAA0051482.1 putative retrotransposon protein, identical [Cucumis melo var. makuwa]0.092.38Show/hide
Query:  MQPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAM
        MQPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAM
Subjt:  MQPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAM

Query:  TQQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILG
        TQQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILG
Subjt:  TQQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILG

Query:  MDFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFLH-----------------
        MDFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFLH                 
Subjt:  MDFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFLH-----------------

Query:  -------------APYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKID
                     APYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKID
Subjt:  -------------APYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKID

Query:  LKSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTL-REKQLYAKFSKS
        LKSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTL +   +  K  ++
Subjt:  LKSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTL-REKQLYAKFSKS

Query:  VVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHEY-HKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQF
        IAYASRQLKEHE  + +       KELNLRQRRWLELIKDYDCTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQF
Subjt:  IAYASRQLKEHEY-HKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQF

Query:  QVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYV
        QVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYV
Subjt:  QVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYV

Query:  DRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSSGHD-------------------------------------ESNGNGLKFSTSFH
        DRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSSGHD                                     ESNGNGLKFSTSFH
Subjt:  DRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSSGHD-------------------------------------ESNGNGLKFSTSFH

Query:  PQTDGQSERTIQTLKDMLRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQD
        PQTDGQSERTIQTLKDMLRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQD
Subjt:  PQTDGQSERTIQTLKDMLRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQD

Query:  RQKSYADKRRRNLEFQVGDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKED
        RQKSYADKRRRNLEFQVGDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKED
Subjt:  RQKSYADKRRRNLEFQVGDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKED

Query:  LSYVEEQVQILDRKEQVLRNKMIPLIKSSGSVEHTYQSSAPEGCTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGCT
        LSYVEEQVQILDRKEQVLRNKMIPLIKSSGSVEHTYQSSAPEGCTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGCT
Subjt:  LSYVEEQVQILDRKEQVLRNKMIPLIKSSGSVEHTYQSSAPEGCTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGCT

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.083.06Show/hide
Query:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT
        QPGQESIASTVRR PCT CGRNHRGQCLVG GVCYQCGQPGHFKKDCPQLNMTVQRDQGVG QT+EQSRVSVVPTEGTSGARQKG VGRPRQQGKVYAMT
Subjt:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT

Query:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
        QQE EDAPDVIT TILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPL E LAIYT VGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
Subjt:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM

Query:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------
        DFLFAHYASMDC+RK+VVF+KPGFA+VVFRGM+KAVSRSLIS LKAEKLLRKGCT FL+HIVVVQREKLK EDVPVVKEFL                   
Subjt:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------

Query:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
                    APYRMAPSELKELKMQLQELVDKG IRPSVS WGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
Subjt:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL

Query:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---
        +SGY QLKVRES+IA TAFR RYGHYEFRV+PFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK   
Subjt:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---

Query:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV
                                 +VVNWE+PISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSF ELKKRLVTAPILALPV
Subjt:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV

Query:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD
        TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHE                                  HKSLKYIFDQKELNLRQRRWLELIKDYD
Subjt:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD

Query:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA
        CTIEYHP KANVVADALSRK RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQS++SNLQKKFEKSKKGLEVEFELRTDGA
Subjt:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA

Query:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
        IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
Subjt:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP

Query:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM
        RTSSGHD                                                              ++ G GLKFSTSFHPQTDGQSERTIQTL+DM
Subjt:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM

Query:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV
        LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG+APY ALYGRPCRT VCWNEVGERKLVGPELVQITTNNIKLIRENLR  QDRQKSYADKRRRNLEFQV
Subjt:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV
        GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQI ERVGP AYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQ+QPVELKEDLSYVEE VQILDRKEQV
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV

Query:  LRNKMIPLIK
        LRNK IPLIK
Subjt:  LRNKMIPLIK

TrEMBL top hitse value%identityAlignment
A0A5A7T1Y5 Reverse transcriptase0.0e+0083.06Show/hide
Query:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT
        QPGQESIASTVRR PCT CGRNHRGQCLVG GVCYQCGQPGHFKKDCPQLNMTVQRDQGVG QT+EQSRVSVVPTEGTSGARQKG VGRPRQQGKVYAMT
Subjt:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT

Query:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
        QQE EDAPDVIT TILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPL E LAIYT VGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
Subjt:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM

Query:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------
        DFLFAHYASMDC+RK+VVF+KPGFA+VVFRGM+KAVSRSLIS LKAEKLLRKGCT FL+HIVVVQREKLK EDVPVVKEFL                   
Subjt:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------

Query:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
                    APYRMAPSELKELKMQLQELVDKG IRPSVS WGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
Subjt:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL

Query:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---
        +SGY QLKVRES+IA TAFR RYGHYEFRV+PFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK   
Subjt:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---

Query:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV
                                 +VVNWE+PISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSF ELKKRLVTAPILALPV
Subjt:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV

Query:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD
        TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHE                                  HKSLKYIFDQKELNLRQRRWLELIKDYD
Subjt:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD

Query:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA
        CTIEYHP KANVVADALSRK RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQS++SNLQKKFEKSKKGLEVEFELRTDGA
Subjt:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA

Query:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
        IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
Subjt:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP

Query:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM
        RTSSGHD                                                              ++ G GLKFSTSFHPQTDGQSERTIQTL+DM
Subjt:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM

Query:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV
        LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG+APY ALYGRPCRT VCWNEVGERKLVGPELVQITTNNIKLIRENLR  QDRQKSYADKRRRNLEFQV
Subjt:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV
        GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQI ERVGP AYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQ+QPVELKEDLSYVEE VQILDRKEQV
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV

Query:  LRNKMIPLIK
        LRNK IPLIK
Subjt:  LRNKMIPLIK

A0A5A7U2V7 Reverse transcriptase0.0e+0083.06Show/hide
Query:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT
        QPGQESIASTVRR PCT CGRNHRGQCLVG GVCYQCGQPGHFKKDCPQLNMTVQRDQGVG QT+EQSRVSVVPTEGTSGARQKG VGRPRQQGKVYAMT
Subjt:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT

Query:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
        QQE EDAPDVIT TILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPL E LAIYT VGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
Subjt:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM

Query:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------
        DFLFAHYASMDC+RK+VVF+KPGFA+VVFRGM+KAVSRSLIS LKAEKLLRKGCT FL+HIVVVQREKLK EDVPVVKEFL                   
Subjt:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------

Query:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
                    APYRMAPSELKELKMQLQELVDKG IRPSVS WGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
Subjt:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL

Query:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---
        +SGY QLKVRES+IA TAFR RYGHYEFRV+PFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK   
Subjt:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---

Query:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV
                                 +VVNWE+PISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSF ELKKRLVTAPILALPV
Subjt:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV

Query:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD
        TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHE                                  HKSLKYIFDQKELNLRQRRWLELIKDYD
Subjt:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD

Query:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA
        CTIEYHP KANVVADALSRK RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQS++SNLQKKFEKSKKGLEVEFELRTDGA
Subjt:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA

Query:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
        IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
Subjt:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP

Query:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM
        RTSSGHD                                                              ++ G GLKFSTSFHPQTDGQSERTIQTL+DM
Subjt:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM

Query:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV
        LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG+APY ALYGRPCRT VCWNEVGERKLVGPELVQITTNNIKLIRENLR  QDRQKSYADKRRRNLEFQV
Subjt:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV
        GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQI ERVGP AYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQ+QPVELKEDLSYVEE VQILDRKEQV
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV

Query:  LRNKMIPLIK
        LRNK IPLIK
Subjt:  LRNKMIPLIK

A0A5A7U6V2 Putative retrotransposon protein, identical0.0e+0092.38Show/hide
Query:  MQPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAM
        MQPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAM
Subjt:  MQPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAM

Query:  TQQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILG
        TQQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILG
Subjt:  TQQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILG

Query:  MDFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFLH-----------------
        MDFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFLH                 
Subjt:  MDFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFLH-----------------

Query:  -------------APYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKID
                     APYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKID
Subjt:  -------------APYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKID

Query:  LKSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTL-REKQLYAKFSKS
        LKSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTL +   +  K  ++
Subjt:  LKSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTL-REKQLYAKFSKS

Query:  VVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHE-YHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQF
        IAYASRQLKEHE  + +       KELNLRQRRWLELIKDYDCTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQF
Subjt:  IAYASRQLKEHE-YHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQF

Query:  QVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYV
        QVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYV
Subjt:  QVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYV

Query:  DRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSSGHD-------------------------------------ESNGNGLKFSTSFH
        DRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSSGHD                                     ESNGNGLKFSTSFH
Subjt:  DRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSSGHD-------------------------------------ESNGNGLKFSTSFH

Query:  PQTDGQSERTIQTLKDMLRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQD
        PQTDGQSERTIQTLKDMLRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQD
Subjt:  PQTDGQSERTIQTLKDMLRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQD

Query:  RQKSYADKRRRNLEFQVGDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKED
        RQKSYADKRRRNLEFQVGDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKED
Subjt:  RQKSYADKRRRNLEFQVGDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKED

Query:  LSYVEEQVQILDRKEQVLRNKMIPLIKSSGSVEHTYQSSAPEGCTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGCT
        LSYVEEQVQILDRKEQVLRNKMIPLIKSSGSVEHTYQSSAPEGCTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGCT
Subjt:  LSYVEEQVQILDRKEQVLRNKMIPLIKSSGSVEHTYQSSAPEGCTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGYTYQSSAPEGCT

A0A5A7UNA3 Reverse transcriptase0.0e+0083.06Show/hide
Query:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT
        QPGQESIASTVRR PCT CGRNHRGQCLVG GVCYQCGQPGHFKKDCPQLNMTVQRDQGVG QT+EQSRVSVVPTEGTSGARQKG VGRPRQQGKVYAMT
Subjt:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT

Query:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
        QQE EDAPDVIT TILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPL E LAIYT VGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
Subjt:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM

Query:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------
        DFLFAHYASMDC+RK+VVF+KPGFA+VVFRGM+KAVSRSLIS LKAEKLLRKGCT FL+HIVVVQREKLK EDVPVVKEFL                   
Subjt:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------

Query:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
                    APYRMAPSELKELKMQLQELVDKG IRPSVS WGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
Subjt:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL

Query:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---
        +SGY QLKVRES+IA TAFR RYGHYEFRV+PFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK   
Subjt:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---

Query:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV
                                 +VVNWE+PISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSF ELKKRLVTAPILALPV
Subjt:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV

Query:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD
        TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHE                                  HKSLKYIFDQKELNLRQRRWLELIKDYD
Subjt:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD

Query:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA
        CTIEYHP KANVVADALSRK RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQS++SNLQKKFEKSKKGLEVEFELRTDGA
Subjt:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA

Query:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
        IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
Subjt:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP

Query:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM
        RTSSGHD                                                              ++ G GLKFSTSFHPQTDGQSERTIQTL+DM
Subjt:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM

Query:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV
        LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG+APY ALYGRPCRT VCWNEVGERKLVGPELVQITTNNIKLIRENLR  QDRQKSYADKRRRNLEFQV
Subjt:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV
        GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQI ERVGP AYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQ+QPVELKEDLSYVEE VQILDRKEQV
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV

Query:  LRNKMIPLIK
        LRNK IPLIK
Subjt:  LRNKMIPLIK

A0A5D3BHI1 Reverse transcriptase0.0e+0083.06Show/hide
Query:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT
        QPGQESIASTVRR PCT CGRNHRGQCLVG GVCYQCGQPGHFKKDCPQLNMTVQRDQGVG QT+EQSRVSVVPTEGTSGARQKG VGRPRQQGKVYAMT
Subjt:  QPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMT

Query:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
        QQE EDAPDVIT TILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPL E LAIYT VGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM
Subjt:  QQEAEDAPDVITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGM

Query:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------
        DFLFAHYASMDC+RK+VVF+KPGFA+VVFRGM+KAVSRSLIS LKAEKLLRKGCT FL+HIVVVQREKLK EDVPVVKEFL                   
Subjt:  DFLFAHYASMDCNRKKVVFKKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFL-------------------

Query:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
                    APYRMAPSELKELKMQLQELVDKG IRPSVS WGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL
Subjt:  -----------HAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDL

Query:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---
        +SGY QLKVRES+IA TAFR RYGHYEFRV+PFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK   
Subjt:  KSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK---

Query:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV
                                 +VVNWE+PISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSF ELKKRLVTAPILALPV
Subjt:  -------------------------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPV

Query:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD
        TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHE                                  HKSLKYIFDQKELNLRQRRWLELIKDYD
Subjt:  TGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHEY---------------------------------HKSLKYIFDQKELNLRQRRWLELIKDYD

Query:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA
        CTIEYHP KANVVADALSRK RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQS++SNLQKKFEKSKKGLEVEFELRTDGA
Subjt:  CTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGA

Query:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
        IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP
Subjt:  IVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLP

Query:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM
        RTSSGHD                                                              ++ G GLKFSTSFHPQTDGQSERTIQTL+DM
Subjt:  RTSSGHD--------------------------------------------------------------ESNGNGLKFSTSFHPQTDGQSERTIQTLKDM

Query:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV
        LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG+APY ALYGRPCRT VCWNEVGERKLVGPELVQITTNNIKLIRENLR  QDRQKSYADKRRRNLEFQV
Subjt:  LRACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV
        GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQI ERVGP AYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQ+QPVELKEDLSYVEE VQILDRKEQV
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQV

Query:  LRNKMIPLIK
        LRNK IPLIK
Subjt:  LRNKMIPLIK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.4e-8726.56Show/hide
Query:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI
        Y + P +++ +  ++ + +  G IR S +    PV+FV KK+GTLR+ +DY+ LNK    N YPLP I+ L  +++G+ +F+K+DLKS Y  ++VR+ + 
Subjt:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI

Query:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------
           AFR   G +E+ V+P+G++ APA F   +N I     +  V+ ++DDIL++S     H +H++ VLQ L+   L    +K                 
Subjt:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------

Query:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL
                    V+ W++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+++W+    Q+   +K+ LV+ P+L      K  ++  DAS +
Subjt:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL

Query:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY
         +G VL Q  +      + Y S ++ + + +            KSLK+              + D + L           N R  RW   ++D++  I Y
Subjt:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY

Query:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI
         P  AN +ADALSR +     +PK                     + ++S + + Q  +      ++V   + ++ L        K +E   +L+    I
Subjt:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI

Query:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR
          + ++ +PN ++L   I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+P  E  WE ++MDF+  LP 
Subjt:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR

Query:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML
         SSG++                                       GN                           +KFS  + PQTDGQ+ERT QT++ +L
Subjt:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML

Query:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV
        R        +W  H+ L++ +YNN   S+  + P+  ++      S    E+        E  Q T    + ++E+L     + K Y D + + + EFQ 
Subjt:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY
        GD V +K +   G +   +  KL+P + GP+ ++++ GP  Y L+LP  +  +    FHVS L KY
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein1.4e-8726.56Show/hide
Query:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI
        Y + P +++ +  ++ + +  G IR S +    PV+FV KK+GTLR+ +DY+ LNK    N YPLP I+ L  +++G+ +F+K+DLKS Y  ++VR+ + 
Subjt:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI

Query:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------
           AFR   G +E+ V+P+G++ APA F   +N I     +  V+ ++DDIL++S     H +H++ VLQ L+   L    +K                 
Subjt:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------

Query:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL
                    V+ W++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+++W+    Q+   +K+ LV+ P+L      K  ++  DAS +
Subjt:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL

Query:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY
         +G VL Q  +      + Y S ++ + + +            KSLK+              + D + L           N R  RW   ++D++  I Y
Subjt:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY

Query:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI
         P  AN +ADALSR +     +PK                     + ++S + + Q  +      ++V   + ++ L        K +E   +L+    I
Subjt:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI

Query:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR
          + ++ +PN ++L   I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+P  E  WE ++MDF+  LP 
Subjt:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR

Query:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML
         SSG++                                       GN                           +KFS  + PQTDGQ+ERT QT++ +L
Subjt:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML

Query:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV
        R        +W  H+ L++ +YNN   S+  + P+  ++      S    E+        E  Q T    + ++E+L     + K Y D + + + EFQ 
Subjt:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY
        GD V +K +   G +   +  KL+P + GP+ ++++ GP  Y L+LP  +  +    FHVS L KY
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein1.4e-8726.56Show/hide
Query:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI
        Y + P +++ +  ++ + +  G IR S +    PV+FV KK+GTLR+ +DY+ LNK    N YPLP I+ L  +++G+ +F+K+DLKS Y  ++VR+ + 
Subjt:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI

Query:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------
           AFR   G +E+ V+P+G++ APA F   +N I     +  V+ ++DDIL++S     H +H++ VLQ L+   L    +K                 
Subjt:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------

Query:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL
                    V+ W++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+++W+    Q+   +K+ LV+ P+L      K  ++  DAS +
Subjt:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL

Query:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY
         +G VL Q  +      + Y S ++ + + +            KSLK+              + D + L           N R  RW   ++D++  I Y
Subjt:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY

Query:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI
         P  AN +ADALSR +     +PK                     + ++S + + Q  +      ++V   + ++ L        K +E   +L+    I
Subjt:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI

Query:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR
          + ++ +PN ++L   I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+P  E  WE ++MDF+  LP 
Subjt:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR

Query:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML
         SSG++                                       GN                           +KFS  + PQTDGQ+ERT QT++ +L
Subjt:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML

Query:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV
        R        +W  H+ L++ +YNN   S+  + P+  ++      S    E+        E  Q T    + ++E+L     + K Y D + + + EFQ 
Subjt:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY
        GD V +K +   G +   +  KL+P + GP+ ++++ GP  Y L+LP  +  +    FHVS L KY
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY

P0CT37 Transposon Tf2-4 polyprotein1.4e-8726.56Show/hide
Query:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI
        Y + P +++ +  ++ + +  G IR S +    PV+FV KK+GTLR+ +DY+ LNK    N YPLP I+ L  +++G+ +F+K+DLKS Y  ++VR+ + 
Subjt:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI

Query:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------
           AFR   G +E+ V+P+G++ APA F   +N I     +  V+ ++DDIL++S     H +H++ VLQ L+   L    +K                 
Subjt:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------

Query:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL
                    V+ W++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+++W+    Q+   +K+ LV+ P+L      K  ++  DAS +
Subjt:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL

Query:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY
         +G VL Q  +      + Y S ++ + + +            KSLK+              + D + L           N R  RW   ++D++  I Y
Subjt:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY

Query:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI
         P  AN +ADALSR +     +PK                     + ++S + + Q  +      ++V   + ++ L        K +E   +L+    I
Subjt:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI

Query:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR
          + ++ +PN ++L   I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+P  E  WE ++MDF+  LP 
Subjt:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR

Query:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML
         SSG++                                       GN                           +KFS  + PQTDGQ+ERT QT++ +L
Subjt:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML

Query:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV
        R        +W  H+ L++ +YNN   S+  + P+  ++      S    E+        E  Q T    + ++E+L     + K Y D + + + EFQ 
Subjt:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY
        GD V +K +   G +   +  KL+P + GP+ ++++ GP  Y L+LP  +  +    FHVS L KY
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein1.4e-8726.56Show/hide
Query:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI
        Y + P +++ +  ++ + +  G IR S +    PV+FV KK+GTLR+ +DY+ LNK    N YPLP I+ L  +++G+ +F+K+DLKS Y  ++VR+ + 
Subjt:  YRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNI

Query:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------
           AFR   G +E+ V+P+G++ APA F   +N I     +  V+ ++DDIL++S     H +H++ VLQ L+   L    +K                 
Subjt:  ANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSK-----------------

Query:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL
                    V+ W++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+++W+    Q+   +K+ LV+ P+L      K  ++  DAS +
Subjt:  -----------SVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVIYCDASRL

Query:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY
         +G VL Q  +      + Y S ++ + + +            KSLK+              + D + L           N R  RW   ++D++  I Y
Subjt:  GLGCVLMQDGN-----VIAYASRQLKEHEYH------------KSLKY--------------IFDQKEL-----------NLRQRRWLELIKDYDCTIEY

Query:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI
         P  AN +ADALSR +     +PK                     + ++S + + Q  +      ++V   + ++ L        K +E   +L+    I
Subjt:  HPIKANVVADALSRKL----RLPKSALCGIRVALLNELRGSKAVVTTEDSGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAI

Query:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR
          + ++ +PN ++L   I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+P  E  WE ++MDF+  LP 
Subjt:  VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPR

Query:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML
         SSG++                                       GN                           +KFS  + PQTDGQ+ERT QT++ +L
Subjt:  TSSGHD------------------------------------ESNGNG--------------------------LKFSTSFHPQTDGQSERTIQTLKDML

Query:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV
        R        +W  H+ L++ +YNN   S+  + P+  ++      S    E+        E  Q T    + ++E+L     + K Y D + + + EFQ 
Subjt:  RACVLQLKGSWDTHLPLMEFAYNNNYQSSIGLAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNL-EFQV

Query:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY
        GD V +K +   G +   +  KL+P + GP+ ++++ GP  Y L+LP  +  +    FHVS L KY
Subjt:  GDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAYRLELPIELARI-HDVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.4e-1342.67Show/hide
Query:  KSVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALP
        +++V W +P + TE+R FLGL GYYRRF++++ ++  PLT L +KN   +W++    +F  LK  + T P+LALP
Subjt:  KSVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCAGGTCAGGAGTCCATTGCTAGTACCGTCAGGCGAACACCATGCACGTGTTGTGGTAGGAACCATCGGGGTCAGTGTTTGGTAGGTGTCGGTGTATGTTACCA
GTGCGGACAGCCAGGACATTTCAAGAAAGATTGTCCGCAGTTGAACATGACAGTTCAGAGAGATCAGGGAGTTGGGTTCCAGACAGTTGAGCAATCGAGAGTTTCAGTGG
TTCCAACAGAGGGCACCAGTGGTGCAAGGCAAAAGGGATTTGTTGGAAGACCGAGGCAACAGGGAAAAGTCTATGCTATGACTCAACAAGAAGCGGAGGACGCACCAGAC
GTTATTACTGCTACGATTCTTATTTGTAATGTACCTGCAGATGTTTTATTTGATCCAGGTGCTACGCATTCCTTTGTTTCTAGTATATTTTTGACTAAGTTGAATAGGAT
GTTAGAGCCTTTACCTGAGAGGTTAGCTATATACACTACAGTTGGTGACGTTTTACTTGTTAATGAGGTGTTACGTAATTGTGAAGTTTTAGTAGAAGGTATCAGTTTGC
TAGTGGACTTGCTACCACTAGAGTTGCAGAGGTTAGATGTAATTTTGGGAATGGATTTCTTATTTGCTCACTATGCATCTATGGATTGCAATAGGAAGAAAGTGGTTTTC
AAAAAACCAGGCTTTGCTAAAGTGGTTTTTAGAGGTATGAAGAAGGCCGTTTCTAGAAGTTTAATCTCAGCTTTGAAAGCTGAGAAATTACTGAGGAAGGGTTGCACAAC
GTTTCTTTCACACATCGTAGTAGTGCAAAGAGAAAAGCTAAAGCTAGAAGATGTTCCTGTGGTGAAAGAGTTTCTTCATGCCCCGTATAGAATGGCTCCAAGCGAGCTAA
AAGAATTGAAGATGCAGTTACAAGAACTAGTTGACAAGGGATGCATCAGGCCTAGTGTTTCGGCGTGGGGAGCACCAGTGCTTTTTGTGAAAAAGAAAGATGGTACCCTC
AGATTATGTATTGACTATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCACGCATCGATGACTTATTTGATCAACTAAGGGGAGCAGCGTTGTTCTC
TAAGATTGACTTAAAATCAGGATACCCCCAATTGAAGGTTAGAGAATCAAATATTGCTAATACAGCATTTAGAATGAGGTATGGGCATTATGAGTTTCGAGTTATCCCAT
TCGGTTTAACGAATGCACCAGCGGTTTTCATGGATCTCATGAACAGGATCTTCCATCGGTATTTAGATCAGTTTGTGATTGTGTTCATTGATGATATATTAGTTTACTCG
GTTGACAGAGAATCTCATGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGAAAAACAGTTATACGCTAAGTTCAGCAAATCGGTTGTCAATTGGGAAAAACCAAT
TAGTGCGACAGAAGTACGTAGTTTCCTGGGTTTGGCAGGATACTATAGGCGTTTTATTGAGGATTTCTCTCGATTGGCATTGCCTTTGACCGCTTTGACAAGGAAGAATG
TTAAGTTTGAGTGGTCAGATAAATGCGAGCAAAGTTTTCATGAATTGAAGAAAAGACTAGTTACAGCACCTATTTTGGCACTTCCTGTAACAGGGAAGGACTATGTGATT
TATTGTGATGCTTCAAGGCTAGGATTAGGTTGTGTGCTTATGCAAGATGGGAATGTAATAGCTTATGCTTCAAGGCAGTTGAAGGAGCATGAGTATCATAAAAGTCTGAA
GTATATTTTTGATCAGAAAGAGCTAAATCTGAGACAAAGGCGATGGCTAGAACTGATTAAAGATTATGATTGTACTATAGAGTATCATCCAATTAAGGCCAACGTAGTAG
CAGATGCATTAAGTAGGAAGTTAAGACTTCCGAAGAGTGCCTTGTGTGGTATTCGAGTAGCTTTGTTGAATGAGTTAAGAGGTTCCAAGGCAGTAGTAACTACAGAGGAT
TCAGGAAGTCTCTTAGCTCAATTTCAGGTTCGGTCTTCTCTAGTAACTGAGATTGTAAGAAGACAGTCAAAAGAGAGTAATTTACAGAAGAAGTTTGAGAAATCCAAGAA
AGGCTTGGAGGTGGAGTTTGAGCTGAGAACAGATGGAGCCATTGTTAAACAGGGAAGATTATGTGTTCCAAATATCAGTGAGCTTAAGAATGCTATTCTAGAAGAAGCTC
ACAGTTCAGCTTACGCTATGCATCCAGGTAGCACCAAGATGTACAGAACTTTAAAGAAGACTTATTGGTGGTCTGGAATGAAGCAAGAGATAGCTGAATATGTTGATAGA
TGTTTGATTTGTCAACAGGTTAAACCAGTAAGACAGAGGCCAGGAGGATTTCTTAATCCTTTGCCAGTGCCAGAGTGGAAATGGGAGCATATTACTATGGATTTTCTATT
TGGATTACCTCGTACATCTAGTGGACATGATGAAAGCAATGGGAACGGACTAAAGTTTAGTACATCATTTCATCCCCAAACAGATGGTCAGTCCGAGAGGACCATCCAAA
CTTTAAAGGACATGTTGAGAGCATGTGTCCTACAACTTAAAGGAAGTTGGGATACCCACTTGCCACTTATGGAGTTTGCTTATAATAATAACTATCAGTCTAGTATCGGT
TTGGCACCATATGGAGCCTTATACGGGAGACCATGCAGAACTTCTGTGTGCTGGAATGAAGTGGGAGAGCGGAAGTTAGTAGGTCCTGAGTTGGTTCAGATTACGACAAA
CAATATTAAGTTGATCAGAGAAAATCTAAGGATAGGCCAAGATCGACAGAAAAGTTATGCGGATAAGCGACGAAGAAACCTAGAATTCCAAGTTGGAGATCAAGTTTTCT
TGAAATTATCTCCATGGCGAGGTGTTATTCGTTTCGGAAGAAAAGGTAAGTTAAGTCCTAGATATATTGGGCCATATCAGATAATGGAACGAGTGGGACCAACAGCGTAT
AGACTTGAGTTGCCAATAGAACTTGCACGAATACATGATGTTTTCCATGTATCCATGTTAAGAAAATATATACCAGATCCATCACATGTGTTGCAAGAGCAACCCGTTGA
ATTAAAAGAAGATTTGAGTTATGTTGAAGAACAAGTTCAGATTCTCGACAGAAAGGAACAAGTTTTGAGAAACAAAATGATTCCACTCATAAAGTCTAGTGGTTCCGTAG
AACACACATATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATACACATATCAGTCTAGTGCTCCCGAAGGATACACATATCAGTCTAGT
GCTCCCGAAGGATACACATATCAGTCTAGTGCTCCCGAAGGATACACATATCAGTCTAGTGCTCCCGAAGGATGCACATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCAGGTCAGGAGTCCATTGCTAGTACCGTCAGGCGAACACCATGCACGTGTTGTGGTAGGAACCATCGGGGTCAGTGTTTGGTAGGTGTCGGTGTATGTTACCA
GTGCGGACAGCCAGGACATTTCAAGAAAGATTGTCCGCAGTTGAACATGACAGTTCAGAGAGATCAGGGAGTTGGGTTCCAGACAGTTGAGCAATCGAGAGTTTCAGTGG
TTCCAACAGAGGGCACCAGTGGTGCAAGGCAAAAGGGATTTGTTGGAAGACCGAGGCAACAGGGAAAAGTCTATGCTATGACTCAACAAGAAGCGGAGGACGCACCAGAC
GTTATTACTGCTACGATTCTTATTTGTAATGTACCTGCAGATGTTTTATTTGATCCAGGTGCTACGCATTCCTTTGTTTCTAGTATATTTTTGACTAAGTTGAATAGGAT
GTTAGAGCCTTTACCTGAGAGGTTAGCTATATACACTACAGTTGGTGACGTTTTACTTGTTAATGAGGTGTTACGTAATTGTGAAGTTTTAGTAGAAGGTATCAGTTTGC
TAGTGGACTTGCTACCACTAGAGTTGCAGAGGTTAGATGTAATTTTGGGAATGGATTTCTTATTTGCTCACTATGCATCTATGGATTGCAATAGGAAGAAAGTGGTTTTC
AAAAAACCAGGCTTTGCTAAAGTGGTTTTTAGAGGTATGAAGAAGGCCGTTTCTAGAAGTTTAATCTCAGCTTTGAAAGCTGAGAAATTACTGAGGAAGGGTTGCACAAC
GTTTCTTTCACACATCGTAGTAGTGCAAAGAGAAAAGCTAAAGCTAGAAGATGTTCCTGTGGTGAAAGAGTTTCTTCATGCCCCGTATAGAATGGCTCCAAGCGAGCTAA
AAGAATTGAAGATGCAGTTACAAGAACTAGTTGACAAGGGATGCATCAGGCCTAGTGTTTCGGCGTGGGGAGCACCAGTGCTTTTTGTGAAAAAGAAAGATGGTACCCTC
AGATTATGTATTGACTATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCACGCATCGATGACTTATTTGATCAACTAAGGGGAGCAGCGTTGTTCTC
TAAGATTGACTTAAAATCAGGATACCCCCAATTGAAGGTTAGAGAATCAAATATTGCTAATACAGCATTTAGAATGAGGTATGGGCATTATGAGTTTCGAGTTATCCCAT
TCGGTTTAACGAATGCACCAGCGGTTTTCATGGATCTCATGAACAGGATCTTCCATCGGTATTTAGATCAGTTTGTGATTGTGTTCATTGATGATATATTAGTTTACTCG
GTTGACAGAGAATCTCATGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGAAAAACAGTTATACGCTAAGTTCAGCAAATCGGTTGTCAATTGGGAAAAACCAAT
TAGTGCGACAGAAGTACGTAGTTTCCTGGGTTTGGCAGGATACTATAGGCGTTTTATTGAGGATTTCTCTCGATTGGCATTGCCTTTGACCGCTTTGACAAGGAAGAATG
TTAAGTTTGAGTGGTCAGATAAATGCGAGCAAAGTTTTCATGAATTGAAGAAAAGACTAGTTACAGCACCTATTTTGGCACTTCCTGTAACAGGGAAGGACTATGTGATT
TATTGTGATGCTTCAAGGCTAGGATTAGGTTGTGTGCTTATGCAAGATGGGAATGTAATAGCTTATGCTTCAAGGCAGTTGAAGGAGCATGAGTATCATAAAAGTCTGAA
GTATATTTTTGATCAGAAAGAGCTAAATCTGAGACAAAGGCGATGGCTAGAACTGATTAAAGATTATGATTGTACTATAGAGTATCATCCAATTAAGGCCAACGTAGTAG
CAGATGCATTAAGTAGGAAGTTAAGACTTCCGAAGAGTGCCTTGTGTGGTATTCGAGTAGCTTTGTTGAATGAGTTAAGAGGTTCCAAGGCAGTAGTAACTACAGAGGAT
TCAGGAAGTCTCTTAGCTCAATTTCAGGTTCGGTCTTCTCTAGTAACTGAGATTGTAAGAAGACAGTCAAAAGAGAGTAATTTACAGAAGAAGTTTGAGAAATCCAAGAA
AGGCTTGGAGGTGGAGTTTGAGCTGAGAACAGATGGAGCCATTGTTAAACAGGGAAGATTATGTGTTCCAAATATCAGTGAGCTTAAGAATGCTATTCTAGAAGAAGCTC
ACAGTTCAGCTTACGCTATGCATCCAGGTAGCACCAAGATGTACAGAACTTTAAAGAAGACTTATTGGTGGTCTGGAATGAAGCAAGAGATAGCTGAATATGTTGATAGA
TGTTTGATTTGTCAACAGGTTAAACCAGTAAGACAGAGGCCAGGAGGATTTCTTAATCCTTTGCCAGTGCCAGAGTGGAAATGGGAGCATATTACTATGGATTTTCTATT
TGGATTACCTCGTACATCTAGTGGACATGATGAAAGCAATGGGAACGGACTAAAGTTTAGTACATCATTTCATCCCCAAACAGATGGTCAGTCCGAGAGGACCATCCAAA
CTTTAAAGGACATGTTGAGAGCATGTGTCCTACAACTTAAAGGAAGTTGGGATACCCACTTGCCACTTATGGAGTTTGCTTATAATAATAACTATCAGTCTAGTATCGGT
TTGGCACCATATGGAGCCTTATACGGGAGACCATGCAGAACTTCTGTGTGCTGGAATGAAGTGGGAGAGCGGAAGTTAGTAGGTCCTGAGTTGGTTCAGATTACGACAAA
CAATATTAAGTTGATCAGAGAAAATCTAAGGATAGGCCAAGATCGACAGAAAAGTTATGCGGATAAGCGACGAAGAAACCTAGAATTCCAAGTTGGAGATCAAGTTTTCT
TGAAATTATCTCCATGGCGAGGTGTTATTCGTTTCGGAAGAAAAGGTAAGTTAAGTCCTAGATATATTGGGCCATATCAGATAATGGAACGAGTGGGACCAACAGCGTAT
AGACTTGAGTTGCCAATAGAACTTGCACGAATACATGATGTTTTCCATGTATCCATGTTAAGAAAATATATACCAGATCCATCACATGTGTTGCAAGAGCAACCCGTTGA
ATTAAAAGAAGATTTGAGTTATGTTGAAGAACAAGTTCAGATTCTCGACAGAAAGGAACAAGTTTTGAGAAACAAAATGATTCCACTCATAAAGTCTAGTGGTTCCGTAG
AACACACATATCAGTCTAGTGCTCCCGAAGGATGCACATATCAGTCTAGTGCTCCCGAAGGATACACATATCAGTCTAGTGCTCCCGAAGGATACACATATCAGTCTAGT
GCTCCCGAAGGATACACATATCAGTCTAGTGCTCCCGAAGGATACACATATCAGTCTAGTGCTCCCGAAGGATGCACATGA
Protein sequenceShow/hide protein sequence
MQPGQESIASTVRRTPCTCCGRNHRGQCLVGVGVCYQCGQPGHFKKDCPQLNMTVQRDQGVGFQTVEQSRVSVVPTEGTSGARQKGFVGRPRQQGKVYAMTQQEAEDAPD
VITATILICNVPADVLFDPGATHSFVSSIFLTKLNRMLEPLPERLAIYTTVGDVLLVNEVLRNCEVLVEGISLLVDLLPLELQRLDVILGMDFLFAHYASMDCNRKKVVF
KKPGFAKVVFRGMKKAVSRSLISALKAEKLLRKGCTTFLSHIVVVQREKLKLEDVPVVKEFLHAPYRMAPSELKELKMQLQELVDKGCIRPSVSAWGAPVLFVKKKDGTL
RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLKSGYPQLKVRESNIANTAFRMRYGHYEFRVIPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYS
VDRESHEEHLRIVLQTLREKQLYAKFSKSVVNWEKPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFHELKKRLVTAPILALPVTGKDYVI
YCDASRLGLGCVLMQDGNVIAYASRQLKEHEYHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPIKANVVADALSRKLRLPKSALCGIRVALLNELRGSKAVVTTED
SGSLLAQFQVRSSLVTEIVRRQSKESNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDR
CLICQQVKPVRQRPGGFLNPLPVPEWKWEHITMDFLFGLPRTSSGHDESNGNGLKFSTSFHPQTDGQSERTIQTLKDMLRACVLQLKGSWDTHLPLMEFAYNNNYQSSIG
LAPYGALYGRPCRTSVCWNEVGERKLVGPELVQITTNNIKLIRENLRIGQDRQKSYADKRRRNLEFQVGDQVFLKLSPWRGVIRFGRKGKLSPRYIGPYQIMERVGPTAY
RLELPIELARIHDVFHVSMLRKYIPDPSHVLQEQPVELKEDLSYVEEQVQILDRKEQVLRNKMIPLIKSSGSVEHTYQSSAPEGCTYQSSAPEGYTYQSSAPEGYTYQSS
APEGYTYQSSAPEGYTYQSSAPEGCT