; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0167821 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0167821
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr06:21171919..21173259
RNA-Seq ExpressionCmc06g0167821
SyntenyCmc06g0167821
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-23994.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

KAA0037582.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-23994.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

KAA0041108.1 reverse transcriptase [Cucumis melo var. makuwa]1.1e-23994.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

KAA0050493.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-23994.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.1e-23994.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

TrEMBL top hitse value%identityAlignment
A0A5A7T1Y5 Reverse transcriptase5.6e-24094.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

A0A5A7U2V7 Reverse transcriptase5.6e-24094.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

A0A5A7UNA3 Reverse transcriptase5.6e-24094.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

A0A5A7UUL6 Reverse transcriptase5.6e-24094.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

A0A5D3BHI1 Reverse transcriptase5.6e-24094.17Show/hide
Query:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY
        MQLQELVDKGYIRPSVSPWG PVLFVKKKDGT RLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFR RYGHY
Subjt:  MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHY

Query:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
        EFRVMPFGLTNAPAVFMDLMN IFH+YLDQFVIVFIDDILVYS+DR+S EEHLRIVLQTLR+KQLYA+FSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA
Subjt:  EFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEA

Query:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
        VVNWERP SATEVRSFLGLAGYYRRFI+DFSRLALPLTALTRKN KFEWS KCEQSFQ+LKKRLVT PILALPVTGKDYVIYCDASRLGLGCVLMQDGNV
Subjt:  VVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNV

Query:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS
        IAYASRQLKEHECNY T+DLELAAVVLALKIWRHYLFGEK HIFTD KSLKYIFDQKELNLRQRR LELIKDYDCTIEYHP KANVVADALS+KSRLPKS
Subjt:  IAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKS

Query:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ
        ALCGIRVALLNELRGSKAVVTTEDS SLLAQFQV SSLV EIVRRQ
Subjt:  ALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVIEIVRRQ

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.7e-8438.94Show/hide
Query:  QLQELVDKGYIRPSVSPWGVPVLFVKKKDGT-----FRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMR
        Q+Q+++++G IR S SP+  P+  V KK        FR+ IDYR+LN++T+ +++P+P +D++  +L     F+ IDL  G+HQ+++    ++KTAF  +
Subjt:  QLQELVDKGYIRPSVSPWGVPVLFVKKKDGT-----FRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMR

Query:  YGHYEFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQ
        +GHYE+  MPFGL NAPA F   MN I    L++  +V++DDI+V+S       + L +V + L    L  +  KCEF  ++  FLGHV++  G+  +P+
Subjt:  YGHYEFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQ

Query:  KVEAVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSH-KCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLM
        K+EA+  +  PT   E+++FLGL GYYR+FI +F+ +A P+T   +KN K + ++ + + +F+KLK  +   PIL +P   K + +  DAS + LG VL 
Subjt:  KVEAVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSH-KCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLM

Query:  QDGNVIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSK
        QDG+ ++Y SR L EHE NY T + EL A+V A K +RHYL G  + I +D + L +++  K+ N +  R    + ++D  I+Y   K N VADALS+
Subjt:  QDGNVIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSK

P20825 Retrovirus-related Pol polyprotein from transposon 2974.9e-8440.1Show/hide
Query:  QLQELVDKGYIRPSVSPWGVPVLFVKKKD-----GTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMR
        Q+QE++++G IR S SP+  P   V KK        +R+ IDYR+LN++TI ++YP+P +D++  +L     F+ IDL  G+HQ+++ E  I+KTAF  +
Subjt:  QLQELVDKGYIRPSVSPWGVPVLFVKKKD-----GTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMR

Query:  YGHYEFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQ
         GHYE+  MPFGL NAPA F   MN I    L++  +V++DDI+++S         +++V   L D  L  +  KCEF  ++  FLGH+V+  G+  +P 
Subjt:  YGHYEFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQ

Query:  KVEAVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCE--QSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVL
        KV+A+V++  PT   E+R+FLGL GYYR+FI +++ +A P+T+  +K  K + + K E  ++F+KLK  ++  PIL LP   K +V+  DAS L LG VL
Subjt:  KVEAVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCE--QSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVL

Query:  MQDGNVIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSK
         Q+G+ I++ SR L +HE NY   + EL A+V A K +RHYL G ++ I +D + L+++ + KE   +  R    + +Y   I+Y   K N VADALS+
Subjt:  MQDGNVIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein6.6e-7339.4Show/hide
Query:  LQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHYEF
        +Q+L+D  +I PS SP   PV+ V KKDGTFRLC+DYR LNK TI + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF    G YE+
Subjt:  LQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHYEF

Query:  RVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEE---HLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVE
         VMPFGL NAP+ F   M   F     +FV V++DDIL++S   +SPEE   HL  VL+ L+++ L  +  KC+F  E+  FLG+ +  + ++    K  
Subjt:  RVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEE---HLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVE

Query:  AVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGK-DYVIYCDASRLGLGCVLMQDG
        A+ ++  P +  + + FLG+  YYRRFI + S++A P+        K +W+ K +++ +KLK  L  +P+L +P   K +Y +  DAS+ G+G VL +  
Subjt:  AVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGK-DYVIYCDASRLGLGCVLMQDG

Query:  N------VIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALS
        N      V+ Y S+ L+  + NY   +LEL  ++ AL  +R+ L G+ + + TD  SL  + ++ E   R +R L+ +  YD T+EY     NVVADA+S
Subjt:  N------VIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALS

Query:  K
        +
Subjt:  K

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.2e-7739.71Show/hide
Query:  QLQELVDKGYIRPSVSPWGVPVLFVKKK-----DGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMR
        Q+ EL+  G IRPS SP+  P+  V KK     +  +R+ +D+++LN VTI + YP+P I+     L  A  F+ +DL SG+HQ+ ++ESDI KTAF   
Subjt:  QLQELVDKGYIRPSVSPWGVPVLFVKKK-----DGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMR

Query:  YGHYEFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQ
         G YEF  +PFGL NAPA+F  +++ I  +++ +   V+IDDI+V+S D  +  ++LR+VL +L    L     K  F   QV FLG++V+A G+  DP+
Subjt:  YGHYEFRVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQ

Query:  KVEAVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTR---KNAKFEWSHKCE--------QSFQKLKKRLVTTPILALPVTGKDYVIYCDA
        KV A+     PTS  E++ FLG+  YYR+FI+D++++A PLT LTR    N K   S K          QSF  LK  L ++ ILA P   K + +  DA
Subjt:  KVEAVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTR---KNAKFEWSHKCE--------QSFQKLKKRLVTTPILALPVTGKDYVIYCDA

Query:  SRLGLGCVLMQD----GNVIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGE-KYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYH
        S   +G VL QD       IAY SR L + E NY T + E+ A++ +L   R YL+G     ++TD + L +    +  N + +R    I++Y+C + Y 
Subjt:  SRLGLGCVLMQD----GNVIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGE-KYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYH

Query:  PDKANVVADALSK
        P K+NVVADALS+
Subjt:  PDKANVVADALSK

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.1e-7239.4Show/hide
Query:  LQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHYEF
        +Q+L+D  +I PS SP   PV+ V KKDGTFRLC+DYR LNK TI + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF    G YE+
Subjt:  LQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHYEF

Query:  RVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEE---HLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVE
         VMPFGL NAP+ F   M   F     +FV V++DDIL++S   +SPEE   HL  VL+ L+++ L  +  KC+F  E+  FLG+ +  + ++    K  
Subjt:  RVMPFGLTNAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEE---HLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVE

Query:  AVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGK-DYVIYCDASRLGLGCVLMQDG
        A+ ++  P +  + + FLG+  YYRRFI + S++A P+        K +W+ K +++  KLK  L  +P+L +P   K +Y +  DAS+ G+G VL +  
Subjt:  AVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGK-DYVIYCDASRLGLGCVLMQDG

Query:  N------VIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALS
        N      V+ Y S+ L+  + NY   +LEL  ++ AL  +R+ L G+ + + TD  SL  + ++ E   R +R L+ +  YD T+EY     NVVADA+S
Subjt:  N------VIAYASRQLKEHECNYRTYDLELAAVVLALKIWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALS

Query:  K
        +
Subjt:  K

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.3e-2745.97Show/hide
Query:  HLRIVLQTLRDKQLYARFSKCEFWLEQVVFLG--HVVSAKGVSVDPQKVEAVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEW
        HL +VLQ     Q YA   KC F   Q+ +LG  H++S +GVS DP K+EA+V W  P + TE+R FLGL GYYRRF+K++ ++  PLT L +KN+  +W
Subjt:  HLRIVLQTLRDKQLYARFSKCEFWLEQVVFLG--HVVSAKGVSVDPQKVEAVVNWERPTSATEVRSFLGLAGYYRRFIKDFSRLALPLTALTRKNAKFEW

Query:  SHKCEQSFQKLKKRLVTTPILALP
        +     +F+ LK  + T P+LALP
Subjt:  SHKCEQSFQKLKKRLVTTPILALP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTTACAAGAACTAGTTGACAAGGGATACATCAGGCCTAGTGTTTCGCCGTGGGGAGTACCAGTGCTATTTGTGAAAAAGAAAGATGGTACCTTCAGATTATGTAT
TGACTATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCACGCATCGATGACTTATTTGATCAACTAAGGGGAGCAGCGTTGTTCTCTAAGATTGACT
TAAGGTCAGGATACCACCAGTTGAAGGTTAGAGAATCAGATATTGCTAAGACAGCATTTAGAATGAGGTATGGGCATTATGAGTTTCGAGTTATGCCATTCGGTTTAACG
AATGCGCCGGCGGTTTTCATGGATCTCATGAACATGATCTTCCATCAGTATTTAGATCAGTTTGTGATTGTATTCATTGATGATATATTAGTTTACTCGATTGACAGACA
ATCCCCTGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGATAAACAGTTGTACGCTAGGTTCAGCAAATGTGAGTTCTGGTTGGAACAAGTAGTATTTTTGGGGC
ATGTAGTTTCCGCAAAAGGAGTTAGTGTCGATCCACAAAAAGTAGAAGCGGTTGTCAATTGGGAAAGACCAACTAGTGCGACAGAAGTACGTAGTTTCCTGGGTTTGGCA
GGATACTATAGGCGTTTTATTAAGGATTTCTCTCGATTGGCATTGCCTTTGACCGCTTTGACAAGGAAGAATGCTAAGTTTGAGTGGTCACATAAATGCGAGCAAAGTTT
TCAGAAATTGAAGAAAAGACTAGTTACAACACCTATTTTGGCACTTCCTGTAACAGGGAAGGACTATGTGATTTATTGTGATGCTTCAAGGCTAGGATTAGGTTGTGTGC
TTATGCAAGATGGGAATGTAATAGCTTATGCTTCAAGGCAGTTGAAGGAGCATGAGTGTAATTACCGTACCTATGATCTTGAGCTAGCAGCAGTTGTTTTAGCACTAAAA
ATCTGGAGACACTATTTGTTCGGGGAAAAGTACCATATTTTCACAGATCAGAAAAGTCTAAAGTATATTTTTGATCAGAAAGAGCTAAATCTGAGACAAAGGCGATTGCT
AGAACTGATTAAAGATTATGATTGTACTATAGAGTATCATCCAGATAAGGCCAACGTAGTAGCGGATGCATTAAGTAAGAAGTCAAGACTTCCGAAGAGTGCCTTGTGTG
GTATTCGAGTAGCTTTATTGAATGAGTTAAGAGGTTCCAAGGCAGTAGTAACTACAGAGGATTCAAGAAGTCTTTTAGCTCAATTTCAGGTTTGGTCTTCTCTAGTAATT
GAGATTGTAAGAAGACAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTTACAAGAACTAGTTGACAAGGGATACATCAGGCCTAGTGTTTCGCCGTGGGGAGTACCAGTGCTATTTGTGAAAAAGAAAGATGGTACCTTCAGATTATGTAT
TGACTATAGACAGTTAAACAAGGTTACAATACGTAACAAGTATCCTTTACCACGCATCGATGACTTATTTGATCAACTAAGGGGAGCAGCGTTGTTCTCTAAGATTGACT
TAAGGTCAGGATACCACCAGTTGAAGGTTAGAGAATCAGATATTGCTAAGACAGCATTTAGAATGAGGTATGGGCATTATGAGTTTCGAGTTATGCCATTCGGTTTAACG
AATGCGCCGGCGGTTTTCATGGATCTCATGAACATGATCTTCCATCAGTATTTAGATCAGTTTGTGATTGTATTCATTGATGATATATTAGTTTACTCGATTGACAGACA
ATCCCCTGAGGAACATCTGAGGATTGTTCTACAGACTCTACGTGATAAACAGTTGTACGCTAGGTTCAGCAAATGTGAGTTCTGGTTGGAACAAGTAGTATTTTTGGGGC
ATGTAGTTTCCGCAAAAGGAGTTAGTGTCGATCCACAAAAAGTAGAAGCGGTTGTCAATTGGGAAAGACCAACTAGTGCGACAGAAGTACGTAGTTTCCTGGGTTTGGCA
GGATACTATAGGCGTTTTATTAAGGATTTCTCTCGATTGGCATTGCCTTTGACCGCTTTGACAAGGAAGAATGCTAAGTTTGAGTGGTCACATAAATGCGAGCAAAGTTT
TCAGAAATTGAAGAAAAGACTAGTTACAACACCTATTTTGGCACTTCCTGTAACAGGGAAGGACTATGTGATTTATTGTGATGCTTCAAGGCTAGGATTAGGTTGTGTGC
TTATGCAAGATGGGAATGTAATAGCTTATGCTTCAAGGCAGTTGAAGGAGCATGAGTGTAATTACCGTACCTATGATCTTGAGCTAGCAGCAGTTGTTTTAGCACTAAAA
ATCTGGAGACACTATTTGTTCGGGGAAAAGTACCATATTTTCACAGATCAGAAAAGTCTAAAGTATATTTTTGATCAGAAAGAGCTAAATCTGAGACAAAGGCGATTGCT
AGAACTGATTAAAGATTATGATTGTACTATAGAGTATCATCCAGATAAGGCCAACGTAGTAGCGGATGCATTAAGTAAGAAGTCAAGACTTCCGAAGAGTGCCTTGTGTG
GTATTCGAGTAGCTTTATTGAATGAGTTAAGAGGTTCCAAGGCAGTAGTAACTACAGAGGATTCAAGAAGTCTTTTAGCTCAATTTCAGGTTTGGTCTTCTCTAGTAATT
GAGATTGTAAGAAGACAGTAA
Protein sequenceShow/hide protein sequence
MQLQELVDKGYIRPSVSPWGVPVLFVKKKDGTFRLCIDYRQLNKVTIRNKYPLPRIDDLFDQLRGAALFSKIDLRSGYHQLKVRESDIAKTAFRMRYGHYEFRVMPFGLT
NAPAVFMDLMNMIFHQYLDQFVIVFIDDILVYSIDRQSPEEHLRIVLQTLRDKQLYARFSKCEFWLEQVVFLGHVVSAKGVSVDPQKVEAVVNWERPTSATEVRSFLGLA
GYYRRFIKDFSRLALPLTALTRKNAKFEWSHKCEQSFQKLKKRLVTTPILALPVTGKDYVIYCDASRLGLGCVLMQDGNVIAYASRQLKEHECNYRTYDLELAAVVLALK
IWRHYLFGEKYHIFTDQKSLKYIFDQKELNLRQRRLLELIKDYDCTIEYHPDKANVVADALSKKSRLPKSALCGIRVALLNELRGSKAVVTTEDSRSLLAQFQVWSSLVI
EIVRRQ