; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0094551 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0094551
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:7411504..7413448
RNA-Seq ExpressionCmc04g0094551
SyntenyCmc04g0094551
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

KAA0050493.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

KAA0050527.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

KAA0062270.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

TrEMBL top hitse value%identityAlignment
A0A5A7T1Y5 Reverse transcriptase0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

A0A5A7U2V7 Reverse transcriptase0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

A0A5A7UNA3 Reverse transcriptase0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

A0A5A7UUL6 Reverse transcriptase0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

A0A5D3BHI1 Reverse transcriptase0.0e+0092.53Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MPFGLTNAPAVFMDLMNRIFHRYLDQF+IVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA
        ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK                                         LGLGCVLMQDGNVIAYA
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGNVIAYA

Query:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG
        SRQLKEHECNYPTHDLE AAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKS LCG
Subjt:  SRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCG

Query:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
        IRVALLNELRGSKAVVT EDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM
Subjt:  IRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKNAILEEAHSSAYAM

Query:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
        HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPL VPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST
Subjt:  HPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLTKTTRFIPIKMTST

Query:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS
        LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQ KGSWDTHLPLMEFAYNNNYQSS
Subjt:  LDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEFAYNNNYQSS

Query:  IGMAPYEALYGRPCRTPVCWNEVGERKLV
        IGMAPYEALYGRPCRTPVCWNEVGERKLV
Subjt:  IGMAPYEALYGRPCRTPVCWNEVGERKLV

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.4e-8529.21Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MP+G++ APA F   +N I     +  ++ ++DDIL++S     H +H++ VLQ L+   L    +KCEF   QV F+G+ +S KG +   + ++ V+ W
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----
        ++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+                                         + +G VL Q  +     
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----

Query:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--
         + Y S ++ + + NY   D E  A++ +LK WRHYL    E   I TDH++L  +   + +  N R  RW   ++D++  I Y PG AN +ADALSR  
Subjt:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--

Query:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN
             +PK                     + ++S + + Q  +      ++V   + D+ L        K +E   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN

Query:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+   E  WE ++MDF+  LP  SSG++ ++V+VDR +
Subjt:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT

Query:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP
        K    +P   + T +Q AR++  ++++ +G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+ 
Subjt:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP

Query:  LMEFAYNNNYQSSIGMAPYEALY
        L++ +YNN   S+  M P+E ++
Subjt:  LMEFAYNNNYQSSIGMAPYEALY

P0CT35 Transposon Tf2-2 polyprotein3.4e-8529.21Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MP+G++ APA F   +N I     +  ++ ++DDIL++S     H +H++ VLQ L+   L    +KCEF   QV F+G+ +S KG +   + ++ V+ W
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----
        ++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+                                         + +G VL Q  +     
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----

Query:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--
         + Y S ++ + + NY   D E  A++ +LK WRHYL    E   I TDH++L  +   + +  N R  RW   ++D++  I Y PG AN +ADALSR  
Subjt:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--

Query:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN
             +PK                     + ++S + + Q  +      ++V   + D+ L        K +E   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN

Query:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+   E  WE ++MDF+  LP  SSG++ ++V+VDR +
Subjt:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT

Query:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP
        K    +P   + T +Q AR++  ++++ +G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+ 
Subjt:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP

Query:  LMEFAYNNNYQSSIGMAPYEALY
        L++ +YNN   S+  M P+E ++
Subjt:  LMEFAYNNNYQSSIGMAPYEALY

P0CT36 Transposon Tf2-3 polyprotein3.4e-8529.21Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MP+G++ APA F   +N I     +  ++ ++DDIL++S     H +H++ VLQ L+   L    +KCEF   QV F+G+ +S KG +   + ++ V+ W
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----
        ++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+                                         + +G VL Q  +     
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----

Query:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--
         + Y S ++ + + NY   D E  A++ +LK WRHYL    E   I TDH++L  +   + +  N R  RW   ++D++  I Y PG AN +ADALSR  
Subjt:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--

Query:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN
             +PK                     + ++S + + Q  +      ++V   + D+ L        K +E   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN

Query:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+   E  WE ++MDF+  LP  SSG++ ++V+VDR +
Subjt:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT

Query:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP
        K    +P   + T +Q AR++  ++++ +G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+ 
Subjt:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP

Query:  LMEFAYNNNYQSSIGMAPYEALY
        L++ +YNN   S+  M P+E ++
Subjt:  LMEFAYNNNYQSSIGMAPYEALY

P0CT37 Transposon Tf2-4 polyprotein3.4e-8529.21Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MP+G++ APA F   +N I     +  ++ ++DDIL++S     H +H++ VLQ L+   L    +KCEF   QV F+G+ +S KG +   + ++ V+ W
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----
        ++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+                                         + +G VL Q  +     
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----

Query:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--
         + Y S ++ + + NY   D E  A++ +LK WRHYL    E   I TDH++L  +   + +  N R  RW   ++D++  I Y PG AN +ADALSR  
Subjt:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--

Query:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN
             +PK                     + ++S + + Q  +      ++V   + D+ L        K +E   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN

Query:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+   E  WE ++MDF+  LP  SSG++ ++V+VDR +
Subjt:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT

Query:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP
        K    +P   + T +Q AR++  ++++ +G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+ 
Subjt:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP

Query:  LMEFAYNNNYQSSIGMAPYEALY
        L++ +YNN   S+  M P+E ++
Subjt:  LMEFAYNNNYQSSIGMAPYEALY

P0CT41 Transposon Tf2-12 polyprotein3.4e-8529.21Show/hide
Query:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW
        MP+G++ APA F   +N I     +  ++ ++DDIL++S     H +H++ VLQ L+   L    +KCEF   QV F+G+ +S KG +   + ++ V+ W
Subjt:  MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNW

Query:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----
        ++P +  E+R FLG   Y R+FI   S+L  PL  L +K+V+                                         + +G VL Q  +     
Subjt:  ERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVK-----------------------------------------LGLGCVLMQDGN-----

Query:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--
         + Y S ++ + + NY   D E  A++ +LK WRHYL    E   I TDH++L  +   + +  N R  RW   ++D++  I Y PG AN +ADALSR  
Subjt:  VIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFG--EKCHIFTDHKSL--KYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSR--

Query:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN
             +PK                     + ++S + + Q  +      ++V   + D+ L        K +E   +L+    I  + ++ +PN ++L  
Subjt:  --KSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAIVKQGRLCVPNISELKN

Query:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT
         I+++ H     +HPG   +   + + + W G++++I EYV  C  CQ  K    +P G L P+   E  WE ++MDF+  LP  SSG++ ++V+VDR +
Subjt:  AILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWVIVDRLT

Query:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP
        K    +P   + T +Q AR++  ++++ +G P  I++D D  FTS+ W          +KFS  + PQTDGQ+ERT QT+E +LR        +W  H+ 
Subjt:  KTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLP

Query:  LMEFAYNNNYQSSIGMAPYEALY
        L++ +YNN   S+  M P+E ++
Subjt:  LMEFAYNNNYQSSIGMAPYEALY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.8e-2148.42Show/hide
Query:  HLRIVLQTLREKQLYAKFSKCEFWLEQVVFLG--HVVSAKGVSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKN
        HL +VLQ   + Q YA   KC F   Q+ +LG  H++S +GVS DP K+EA+V W  P + TE+R FLGL GYYRRF++++ ++  PLT L +KN
Subjt:  HLRIVLQTLREKQLYAKFSKCEFWLEQVVFLG--HVVSAKGVSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATTCGGTTTAACGAATGCGCCAGCGGTTTTCATGGATCTCATGAACAGGATCTTCCATCGGTATTTAGATCAGTTTTTGATTGTGTTCATTGATGATATATTAGT
TTACTCAGTTGACAGAGAATCTCATGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGAAAAACAGTTATACGCTAAGTTCAGCAAATGTGAGTTCTGGTTGGAAC
AAGTAGTATTTTTGGGGCATGTAGTTTCAGCAAAAGGAGTTAGTGTCGATCCACAAAAAGTAGAAGCGGTTGTCAATTGGGAAAGACCAATTAGTGCGACAGAAGTACGT
AGTTTCCTTGGTTTGGCAGGATACTATAGGCGTTTTATTGAAGATTTCTCTCGATTGGCATTGCCTTTGACCGCTTTGACAAGGAAGAATGTTAAGCTAGGATTAGGTTG
TGTGCTTATGCAAGATGGGAATGTAATAGCTTATGCTTCAAGGCAGTTGAAGGAGCATGAGTGTAATTACCCTACCCATGATCTTGAGCCAGCAGCAGTTGTTTTAGCAC
TAAAAATCTGGAGACACTATTTGTTCGGGGAAAAGTGCCATATTTTCACAGATCATAAAAGTCTGAAGTATATTTTTGATCAAAAAGAGCTAAATCTGAGACAAAGGCGA
TGGCTAGAACTGATTAAAGATTATGATTGTACTATAGAGTATCATCCAGGTAAGGCCAACGTAGTAGCAGATGCATTAAGTAGGAAGTCAAGACTTCCGAAGAGTGTCTT
GTGTGGTATTCGAGTAGCTTTGTTGAATGAGTTAAGAGGTTCCAAGGCAGTAGTAACTATAGAGGATTCAGGAAGTCTCTTAGCTCAATTTCAGGTTCGGTCTTCTCTAG
TAACTGAGATTGTAAGAAGACAGTCAGAAGACAGTAATTTACAGAAGAAGTTTGAGAAATCCAAGAAAGGCTTAGAGGTGGAGTTTGAGCTGAGAACAGATGGAGCCATT
GTTAAACAAGGAAGATTATGTGTTCCGAATATCAGTGAGCTTAAGAATGCTATTCTAGAAGAAGCTCACAGTTCAGCTTACGCTATGCATCCAGGTAGCACCAAGATGTA
CAGAACTTTAAAGAAGACTTATTGGTGGTCTGGAATGAAGCAAGAGATAGCTGAATATGTTGATAGATGTTTGATTTGTCAACAGGTTAAACCAGTAAGACAGAGGCCAG
GAGGATTTCTTAATCCTTTGTCAGTGCCTGAGTGGAAATGGGAGCATATTACTATGGATTTTCTATTTGGATTACCTCGTACATCCAGTGGACATGATGGTATATGGGTA
ATAGTAGACAGACTCACCAAGACGACACGATTTATACCGATTAAAATGACATCTACGTTAGACCAGCTAGCGAGATTATATGTTGATAAGATTGTGAGTCAGTATGGAGT
ACCAGTGTCCATAGTTTCAGATAGGGATCCGAGGTTTACTTCTAAATTTTGGCCTAGTTTACAGAAAGCAATGGGAACAGGGCTAAAGTTTAGTACATCATTTCATCCCC
AAACAGATGGTCAGTCCGAGAGGACCATCCAAACTTTAGAGGACATGTTGAGAGCATGTGTCCTACAATTTAAAGGAAGTTGGGATACCCACTTGCCACTTATGGAGTTT
GCTTATAATAATAACTATCAGTCTAGTATCGGTATGGCACCATATGAAGCCTTATACGGGAGACCATGCAGAACTCCTGTGTGCTGGAATGAAGTGGGAGAGCGGAAGTT
AGTAGAGAAAATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCATTCGGTTTAACGAATGCGCCAGCGGTTTTCATGGATCTCATGAACAGGATCTTCCATCGGTATTTAGATCAGTTTTTGATTGTGTTCATTGATGATATATTAGT
TTACTCAGTTGACAGAGAATCTCATGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGAAAAACAGTTATACGCTAAGTTCAGCAAATGTGAGTTCTGGTTGGAAC
AAGTAGTATTTTTGGGGCATGTAGTTTCAGCAAAAGGAGTTAGTGTCGATCCACAAAAAGTAGAAGCGGTTGTCAATTGGGAAAGACCAATTAGTGCGACAGAAGTACGT
AGTTTCCTTGGTTTGGCAGGATACTATAGGCGTTTTATTGAAGATTTCTCTCGATTGGCATTGCCTTTGACCGCTTTGACAAGGAAGAATGTTAAGCTAGGATTAGGTTG
TGTGCTTATGCAAGATGGGAATGTAATAGCTTATGCTTCAAGGCAGTTGAAGGAGCATGAGTGTAATTACCCTACCCATGATCTTGAGCCAGCAGCAGTTGTTTTAGCAC
TAAAAATCTGGAGACACTATTTGTTCGGGGAAAAGTGCCATATTTTCACAGATCATAAAAGTCTGAAGTATATTTTTGATCAAAAAGAGCTAAATCTGAGACAAAGGCGA
TGGCTAGAACTGATTAAAGATTATGATTGTACTATAGAGTATCATCCAGGTAAGGCCAACGTAGTAGCAGATGCATTAAGTAGGAAGTCAAGACTTCCGAAGAGTGTCTT
GTGTGGTATTCGAGTAGCTTTGTTGAATGAGTTAAGAGGTTCCAAGGCAGTAGTAACTATAGAGGATTCAGGAAGTCTCTTAGCTCAATTTCAGGTTCGGTCTTCTCTAG
TAACTGAGATTGTAAGAAGACAGTCAGAAGACAGTAATTTACAGAAGAAGTTTGAGAAATCCAAGAAAGGCTTAGAGGTGGAGTTTGAGCTGAGAACAGATGGAGCCATT
GTTAAACAAGGAAGATTATGTGTTCCGAATATCAGTGAGCTTAAGAATGCTATTCTAGAAGAAGCTCACAGTTCAGCTTACGCTATGCATCCAGGTAGCACCAAGATGTA
CAGAACTTTAAAGAAGACTTATTGGTGGTCTGGAATGAAGCAAGAGATAGCTGAATATGTTGATAGATGTTTGATTTGTCAACAGGTTAAACCAGTAAGACAGAGGCCAG
GAGGATTTCTTAATCCTTTGTCAGTGCCTGAGTGGAAATGGGAGCATATTACTATGGATTTTCTATTTGGATTACCTCGTACATCCAGTGGACATGATGGTATATGGGTA
ATAGTAGACAGACTCACCAAGACGACACGATTTATACCGATTAAAATGACATCTACGTTAGACCAGCTAGCGAGATTATATGTTGATAAGATTGTGAGTCAGTATGGAGT
ACCAGTGTCCATAGTTTCAGATAGGGATCCGAGGTTTACTTCTAAATTTTGGCCTAGTTTACAGAAAGCAATGGGAACAGGGCTAAAGTTTAGTACATCATTTCATCCCC
AAACAGATGGTCAGTCCGAGAGGACCATCCAAACTTTAGAGGACATGTTGAGAGCATGTGTCCTACAATTTAAAGGAAGTTGGGATACCCACTTGCCACTTATGGAGTTT
GCTTATAATAATAACTATCAGTCTAGTATCGGTATGGCACCATATGAAGCCTTATACGGGAGACCATGCAGAACTCCTGTGTGCTGGAATGAAGTGGGAGAGCGGAAGTT
AGTAGAGAAAATCTGA
Protein sequenceShow/hide protein sequence
MPFGLTNAPAVFMDLMNRIFHRYLDQFLIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNWERPISATEVR
SFLGLAGYYRRFIEDFSRLALPLTALTRKNVKLGLGCVLMQDGNVIAYASRQLKEHECNYPTHDLEPAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRR
WLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSVLCGIRVALLNELRGSKAVVTIEDSGSLLAQFQVRSSLVTEIVRRQSEDSNLQKKFEKSKKGLEVEFELRTDGAI
VKQGRLCVPNISELKNAILEEAHSSAYAMHPGSTKMYRTLKKTYWWSGMKQEIAEYVDRCLICQQVKPVRQRPGGFLNPLSVPEWKWEHITMDFLFGLPRTSSGHDGIWV
IVDRLTKTTRFIPIKMTSTLDQLARLYVDKIVSQYGVPVSIVSDRDPRFTSKFWPSLQKAMGTGLKFSTSFHPQTDGQSERTIQTLEDMLRACVLQFKGSWDTHLPLMEF
AYNNNYQSSIGMAPYEALYGRPCRTPVCWNEVGERKLVEKI