CuGenDBv2

Cmc03g0063541 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0063541
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr03:4060916..4062232
RNA-Seq Expression: Cmc03g0063541
Synteny: Cmc03g0063541
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
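
The genome location field above uses a `seqid:start..end` layout. As a minimal sketch, the snippet below splits such a string into its parts and reports the span length. The `Location` type and `parse_location` function are hypothetical names introduced here for illustration, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr03:4060916..4062232")
print(loc.seqid, loc.start, loc.end, loc.length)
# CMiso1.1chr03 4060916 4062232 1317
```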


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e-value: 3.9e-248; % identity: 98.84)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGA LFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
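
The % identity reported for each hit can be recovered from the alignment's middle line, where a letter marks an identical residue, `+` marks a conservative substitution and a space marks a mismatch or gap. The sketch below is one way to do that count, under the assumption that identity is the number of identical columns divided by the alignment length (gap columns included). `percent_identity` is a hypothetical helper, not part of any BLAST tool. In the alignment above, the middle lines contain five non-letter columns out of 431, which is consistent with the reported 98.84.

```python
from typing import Iterable, Tuple


def percent_identity(rows: Iterable[Tuple[str, str]]) -> float:
    """Compute % identity from (query_row, midline_row) pairs.

    Alignment length is taken from the query rows (gap characters count as
    columns); identical columns are the letters present in the midline rows.
    """
    aligned = 0
    identical = 0
    for query_row, midline_row in rows:
        aligned += len(query_row)
        identical += sum(1 for ch in midline_row if ch.isalpha())
    if aligned == 0:
        raise ValueError("no aligned columns supplied")
    return 100.0 * identical / aligned


# First 20 columns of the alignment above: one '+' column, 19 identities.
rows = [("MAPSELKELKMQLQELVNKG", "MAPSELKELKMQLQELV+KG")]
print(f"{percent_identity(rows):.2f}")
# 95.00
```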

KAA0036676.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e-value: 1.0e-248; % identity: 99.07)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

KAA0050527.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e-value: 1.0e-248; % identity: 99.07)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

KAA0062270.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e-value: 1.0e-248; % identity: 99.07)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa] (e-value: 1.0e-248; % identity: 99.07)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T3K4 Reverse transcriptase (e-value: 4.9e-249; % identity: 99.07)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

A0A5A7U2V7 Reverse transcriptase (e-value: 1.9e-248; % identity: 98.84)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGA LFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

A0A5A7U819 Reverse transcriptase (e-value: 4.9e-249; % identity: 99.07)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

A0A5A7V4W9 Reverse transcriptase (e-value: 4.9e-249; % identity: 99.07)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

A0A5A7VNK4 Reverse transcriptase (e-value: 4.9e-249; % identity: 99.07)
Query:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
        MAPSELKELKMQLQELV+KGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK
Subjt:  MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAK

Query:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG
        TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKF+KCEFWLEQVVF GHVVSAKG
Subjt:  TAFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKG

Query:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
        VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKK+LVTAPILALPVTGKDYVIYCDASRLGL
Subjt:  VSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT
        LSRKSRLPKSALCGIRVALLNELRGSKAVVT
Subjt:  LSRKSRLPKSALCGIRVALLNELRGSKAVVT

SwissProt top hits (e-value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e-value: 2.0e-90; % identity: 39.45)
Query:  KELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGT-----LRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKT
        +E++ Q+Q+++N+G IR S SP+ +P+  V KK         R+ IDYR+LN++T+ +++P+P +D++  +L     F+ IDL  G+HQ+++    ++KT
Subjt:  KELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGT-----LRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKT

Query:  AFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGV
        AF T++GHYE+  MPFGL NAPA F   MN I    L++  +V++DDI+V+S   + H + L +V + L +  L  + +KCEF  ++  F GHV++  G+
Subjt:  AFRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGV

Query:  SVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSD-KCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
          +P+K+EA+  +  P    E+++FLGL GYYR+FI +F+ +A P+T   +KN+K + ++ + + +F++LK  +   PIL +P   K + +  DAS + L
Subjt:  SVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSD-KCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        G VL QDG+ ++Y SR L EHE NY T + EL A+V A K +RHYL G    I +DH+ L +++  K+ N +  RW   + ++D  I+Y  GK N VADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e-value: 8.5e-89; % identity: 40.69)
Query:  ELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKD-----GTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTA
        E++ Q+QE++N+G IR S SP+ +P   V KK         R+ IDYR+LN++TI ++YP+P +D++  +L     F+ IDL  G+HQ+++ E  I+KTA
Subjt:  ELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKD-----GTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTA

Query:  FRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVS
        F T+ GHYE+  MPFGL NAPA F   MN I    L++  +V++DDI+++S     H   +++V   L +  L  + +KCEF  ++  F GH+V+  G+ 
Subjt:  FRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVS

Query:  VDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCE--QSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL
         +P KV+A+V++  P    E+R+FLGL GYYR+FI +++ +A P+T+  +K  K + + K E  ++F++LK  ++  PIL LP   K +V+  DAS L L
Subjt:  VDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCE--QSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGL

Query:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA
        G VL Q+G+ I++ SR L +HE NY   + EL A+V A K +RHYL G +  I +DH+ L+++ + KE   +  RW   + +Y   I+Y  GK N VADA
Subjt:  GCVLMQDGNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e-value: 1.0e-78; % identity: 39.6)
Query:  KELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTAFRTR
        +E+   +Q+L++  +I PS SP  +PV+ V KKDGT RLC+DYR LNK TI + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF T 
Subjt:  KELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTAFRTR

Query:  YGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVSVDPQ
         G YE+ VMPFGL NAP+ F   M   F     +FV V++DDIL++S   E H +HL  VL+ L+ + L  K  KC+F  E+  F G+ +  + ++    
Subjt:  YGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVSVDPQ

Query:  KVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGK-DYVIYCDASRLGLGCVLM
        K  A+ ++  P +  + + FLG+  YYRRFI + S++A P+        K +W++K +++ ++LK  L  +P+L +P   K +Y +  DAS+ G+G VL 
Subjt:  KVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGK-DYVIYCDASRLGLGCVLM

Query:  QDGN------VIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVAD
        +  N      V+ Y S+ L+  + NYP  +LEL  ++ AL  +R+ L G+   + TDH SL  + ++ E   R +RWL+ +  YD T+EY  G  NVVAD
Subjt:  QDGN------VIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVAD

Query:  ALSR
        A+SR
Subjt:  ALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e-value: 9.7e-85; % identity: 40.53)
Query:  ELKMQLQELVNKGYIRPSVSPWGAPVLFVKKK-----DGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTA
        E++ Q+ EL+  G IRPS SP+ +P+  V KK     +   R+ +D+++LN VTI + YP+P I+     L  A  F+ +DL SG+HQ+ ++ESDI KTA
Subjt:  ELKMQLQELVNKGYIRPSVSPWGAPVLFVKKK-----DGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTA

Query:  FRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVS
        F T  G YEF  +PFGL NAPA+F  +++ I   ++ +   V+IDDI+V+S D ++H ++LR+VL +L +  L     K  F   QV F G++V+A G+ 
Subjt:  FRTRYGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVS

Query:  VDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTR---KNVKFEWSDKCE--------QSFQELKKKLVTAPILALPVTGKDYVI
         DP+KV A+     P S  E++ FLG+  YYR+FI+D++++A PLT LTR    N+K   S K          QSF +LK  L ++ ILA P   K + +
Subjt:  VDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTR---KNVKFEWSDKCE--------QSFQELKKKLVTAPILALPVTGKDYVI

Query:  YCDASRLGLGCVLMQD----GNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGE-KCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCT
          DAS   +G VL QD       IAY SR L + E NY T + E+ A++ +L   R YL+G     ++TDH+ L +    +  N + +RW   I++Y+C 
Subjt:  YCDASRLGLGCVLMQD----GNVIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGE-KCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCT

Query:  IEYHPGKANVVADALSR
        + Y PGK+NVVADALSR
Subjt:  IEYHPGKANVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e-value: 1.8e-78; % identity: 39.6)
Query:  KELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTAFRTR
        +E+   +Q+L++  +I PS SP  +PV+ V KKDGT RLC+DYR LNK TI + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF T 
Subjt:  KELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTAFRTR

Query:  YGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVSVDPQ
         G YE+ VMPFGL NAP+ F   M   F     +FV V++DDIL++S   E H +HL  VL+ L+ + L  K  KC+F  E+  F G+ +  + ++    
Subjt:  YGHYEFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVSVDPQ

Query:  KVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGK-DYVIYCDASRLGLGCVLM
        K  A+ ++  P +  + + FLG+  YYRRFI + S++A P+        K +W++K +++  +LK  L  +P+L +P   K +Y +  DAS+ G+G VL 
Subjt:  KVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGK-DYVIYCDASRLGLGCVLM

Query:  QDGN------VIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVAD
        +  N      V+ Y S+ L+  + NYP  +LEL  ++ AL  +R+ L G+   + TDH SL  + ++ E   R +RWL+ +  YD T+EY  G  NVVAD
Subjt:  QDGN------VIAYASRQLKEHECNYPTHDLELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVAD

Query:  ALSR
        A+SR
Subjt:  ALSR

Arabidopsis top hits (e-value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e-value: 2.1e-26; % identity: 44.35)
Query:  HLRIVLQTLREKQLYAKFNKCEFWLEQVVFFG--HVVSAKGVSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEW
        HL +VLQ   + Q YA   KC F   Q+ + G  H++S +GVS DP K+EA+V W  P + TE+R FLGL GYYRRF++++ ++  PLT L +KN   +W
Subjt:  HLRIVLQTLREKQLYAKFNKCEFWLEQVVFFG--HVVSAKGVSVDPQKVEAVVNWERPISATEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEW

Query:  SDKCEQSFQELKKKLVTAPILALP
        ++    +F+ LK  + T P+LALP
Subjt:  SDKCEQSFQELKKKLVTAPILALP


Sequences
CDS sequence
ATGGCTCCAAGCGAGCTAAAAGAATTGAAGATGCAGTTACAGGAACTAGTTAACAAGGGATACATCAGGCCTAGTGTTTCGCCGTGGGGAGCACCAGTGCTTTTTGTGAA
GAAGAAAGATGGTACCCTCAGATTATGTATTGACTATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCACGCATCGATGACTTATTTGATCAACTAA
GGGGAGCAACGTTGTTCTCTAAGATTGACTTAAGGTCAGGATACCACCAGTTGAAGGTTAGAGAATCAGATATTGCTAAGACAGCATTTAGAACGAGGTATGGGCATTAT
GAGTTTCGAGTTATGCCATTCGGTTTAACGAATGCGCCAGCGGTTTTCATGGATCTCATGAACAGGATCTTCCATCGGTATTTAGATCAGTTTGTGATTGTGTTCATTGA
TGATATATTAGTTTACTCAGTTGACAGAGAATCTCATGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGAAAAACAGTTATACGCTAAGTTCAACAAATGTGAGT
TCTGGTTGGAACAGGTAGTATTTTTTGGGCATGTAGTTTCAGCAAAAGGAGTTAGTGTCGATCCACAAAAAGTAGAAGCGGTTGTCAATTGGGAAAGACCAATTAGTGCG
ACAGAAGTACGTAGTTTCCTGGGTTTGGCAGGATACTATAGGCGTTTTATTGAGGATTTCTCTCGATTGGCATTGCCTTTGACCGCTTTGACAAGGAAGAATGTTAAGTT
TGAGTGGTCAGATAAATGCGAGCAAAGTTTTCAGGAATTGAAGAAAAAACTAGTTACAGCACCTATTTTGGCACTTCCTGTAACAGGGAAGGACTATGTGATTTATTGTG
ATGCTTCAAGGCTAGGATTAGGTTGTGTGCTTATGCAAGATGGGAATGTAATAGCTTATGCTTCAAGGCAGTTGAAGGAGCATGAGTGTAATTACCCTACCCATGATCTT
GAGCTAGCAGCAGTTGTTTTAGCACTAAAAATCTGGAGACACTATTTGTTCGGGGAAAAGTGCCATATTTTCACAGATCATAAAAGTCTGAAGTATATTTTTGATCAAAA
AGAGCTAAATCTGAGACAAAGGCGATGGCTAGAACTGATTAAAGATTATGATTGTACTATAGAGTATCATCCAGGTAAGGCCAACGTAGTAGCAGATGCATTAAGTAGGA
AGTCAAGACTTCCGAAGAGTGCCTTGTGTGGTATTCGAGTAGCTTTGTTGAATGAGTTAAGAGGTTCCAAGGCAGTAGTAACTAAGAGGATTCAGGAAGTCTCTTAG
mRNA sequence
ATGGCTCCAAGCGAGCTAAAAGAATTGAAGATGCAGTTACAGGAACTAGTTAACAAGGGATACATCAGGCCTAGTGTTTCGCCGTGGGGAGCACCAGTGCTTTTTGTGAA
GAAGAAAGATGGTACCCTCAGATTATGTATTGACTATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCACGCATCGATGACTTATTTGATCAACTAA
GGGGAGCAACGTTGTTCTCTAAGATTGACTTAAGGTCAGGATACCACCAGTTGAAGGTTAGAGAATCAGATATTGCTAAGACAGCATTTAGAACGAGGTATGGGCATTAT
GAGTTTCGAGTTATGCCATTCGGTTTAACGAATGCGCCAGCGGTTTTCATGGATCTCATGAACAGGATCTTCCATCGGTATTTAGATCAGTTTGTGATTGTGTTCATTGA
TGATATATTAGTTTACTCAGTTGACAGAGAATCTCATGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGAAAAACAGTTATACGCTAAGTTCAACAAATGTGAGT
TCTGGTTGGAACAGGTAGTATTTTTTGGGCATGTAGTTTCAGCAAAAGGAGTTAGTGTCGATCCACAAAAAGTAGAAGCGGTTGTCAATTGGGAAAGACCAATTAGTGCG
ACAGAAGTACGTAGTTTCCTGGGTTTGGCAGGATACTATAGGCGTTTTATTGAGGATTTCTCTCGATTGGCATTGCCTTTGACCGCTTTGACAAGGAAGAATGTTAAGTT
TGAGTGGTCAGATAAATGCGAGCAAAGTTTTCAGGAATTGAAGAAAAAACTAGTTACAGCACCTATTTTGGCACTTCCTGTAACAGGGAAGGACTATGTGATTTATTGTG
ATGCTTCAAGGCTAGGATTAGGTTGTGTGCTTATGCAAGATGGGAATGTAATAGCTTATGCTTCAAGGCAGTTGAAGGAGCATGAGTGTAATTACCCTACCCATGATCTT
GAGCTAGCAGCAGTTGTTTTAGCACTAAAAATCTGGAGACACTATTTGTTCGGGGAAAAGTGCCATATTTTCACAGATCATAAAAGTCTGAAGTATATTTTTGATCAAAA
AGAGCTAAATCTGAGACAAAGGCGATGGCTAGAACTGATTAAAGATTATGATTGTACTATAGAGTATCATCCAGGTAAGGCCAACGTAGTAGCAGATGCATTAAGTAGGA
AGTCAAGACTTCCGAAGAGTGCCTTGTGTGGTATTCGAGTAGCTTTGTTGAATGAGTTAAGAGGTTCCAAGGCAGTAGTAACTAAGAGGATTCAGGAAGTCTCTTAG
Protein sequence
MAPSELKELKMQLQELVNKGYIRPSVSPWGAPVLFVKKKDGTLRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGATLFSKIDLRSGYHQLKVRESDIAKTAFRTRYGHY
EFRVMPFGLTNAPAVFMDLMNRIFHRYLDQFVIVFIDDILVYSVDRESHEEHLRIVLQTLREKQLYAKFNKCEFWLEQVVFFGHVVSAKGVSVDPQKVEAVVNWERPISA
TEVRSFLGLAGYYRRFIEDFSRLALPLTALTRKNVKFEWSDKCEQSFQELKKKLVTAPILALPVTGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHECNYPTHDL
ELAAVVLALKIWRHYLFGEKCHIFTDHKSLKYIFDQKELNLRQRRWLELIKDYDCTIEYHPGKANVVADALSRKSRLPKSALCGIRVALLNELRGSKAVVTKRIQEVS
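
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal, dependency-free sketch that assumes the standard nuclear genetic code; `CODON_TABLE` and `translate` are illustrative names, not functions provided by the database. It is shown on the first and last few codons of the CDS listed above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = False) -> str:
    """Translate a DNA coding sequence into one-letter amino acid codes.

    Whitespace and line breaks are ignored. Codons containing characters
    other than A, C, G, T are reported as 'X'. With to_stop=True the
    translation ends before the first stop codon.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS above.
print(translate("ATGGCTCCAAGCGAGCTA"))
# MAPSEL

# Last four codons of the CDS above, including the TAG stop.
print(translate("GAAGTCTCTTAG", to_stop=True))
# EVS
```

Pasting the full CDS lines into `translate(..., to_stop=True)` and comparing the result with the joined protein lines would be the complete version of this check.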