; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032994 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032994
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr11:39667549..39673565
RNA-Seq ExpressionLag0032994
SyntenyLag0032994
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN79190.1 hypothetical protein VITISV_000232 [Vitis vinifera]2.3e-8547.93Show/hide
Query:  DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS
        D +E++  +  + L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI VLE + G ++   D I+ EIL +F KLY   S
Subjt:  DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS

Query:  SPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVK
           + +EG+DW+ I  +S+  LE  F+E+EI KAI  +   K+PGPDG      ++ W+++K DLV VF EF ++ I+N+ TN ++I L+PKK  A K+ 
Subjt:  SPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVK

Query:  DYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNW
        DYRPI+L+TSLYK+IAKVL  RL+ VL  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD +++ KGF  RWR W
Subjt:  DYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNW

Query:  IRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        IRGCL + +F+++VNG ++  + A+RGLRQGD LSP L
Subjt:  IRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

CAN79190.1 hypothetical protein VITISV_000232 [Vitis vinifera]3.8e-0329.35Show/hide
Query:  ARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGHSEGWITGVYDPCGYQEISQLLQELYDLQGLCQGVWCLADNFNLI
        AR+     L A  ++   L+MW   ++   + V+G+FSVS++    G  + W++ VY P          +EL D+  L    WC+  +FN+I
Subjt:  ARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGHSEGWITGVYDPCGYQEISQLLQELYDLQGLCQGVWCLADNFNLI

CAN79190.1 hypothetical protein VITISV_000232 [Vitis vinifera]4.4e-8446.45Show/hide
Query:  DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS
        D LE++  +  + L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI  LE ++G+++   + I+ EIL +F KLY   S
Subjt:  DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS

Query:  SPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVK
           + +EG+DW+ I  +S+  LE  F+E+EI KAI  +   K+PGPDG      ++ W ++K DLV+VF EF ++ I+N+ TN ++I L+PKK  + ++ 
Subjt:  SPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVK

Query:  DYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNW
        D+RPI+L+TSLYK+IAKVL  R+++VL  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD ++++KGF  RWR W
Subjt:  DYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNW

Query:  IRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        +RGCL + +F+V+VNG ++  + A+RGLRQGD LSP L
Subjt:  IRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

RVW90400.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.3e-8547.41Show/hide
Query:  KSFVALLT--DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILS
        K  +A+L   D LE++  +  ++L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI  LE +SG ++   + I+ EIL 
Subjt:  KSFVALLT--DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILS

Query:  FFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLI
        +F KLY   S   + +EG+DW+ ID +S+  LE  F+E+EI KAI  +   K+PGPDG      ++ W+++K DLV VF EF ++ I+N+ TN ++I LI
Subjt:  FFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLI

Query:  PKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKL
        PKK  + ++ DYRPI+L+TSLYK+IAKVL  RL+ VL  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD +L++
Subjt:  PKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKL

Query:  KGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        KGF  RWR W+RGCL + +++V+VNG ++  + A+RGLRQGD LSP L
Subjt:  KGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

RVW98505.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.8e-8547.13Show/hide
Query:  KSFVALLT--DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILS
        K  +A+L   D LE++  +  + L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI  LE +SG ++   + I+ EIL 
Subjt:  KSFVALLT--DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILS

Query:  FFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLI
        +F KLY   S   + +EG+DW+ ID +S+  LE  F+E+EI KAI  +   K+PGPDG      ++ W+++K DLV VF EF ++ I+N+ TN ++I L+
Subjt:  FFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLI

Query:  PKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKL
        PKK ++ ++ D+RPI+L+TSLYK+IAKVL ERL+ VL  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD +L++
Subjt:  PKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKL

Query:  KGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        KGF  RWR W+RGCL + ++ V+VNG ++  + A+RGLRQGD LSP L
Subjt:  KGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

RVX04736.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]1.8e-8547.01Show/hide
Query:  EEIANKSFVALLTDRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGE
        E+IAN        D +E++  +  + L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI VLE + G ++   D I+ E
Subjt:  EEIANKSFVALLTDRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGE

Query:  ILSFFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYI
        IL +F KLY   S   + +EG+DW+ I  +S+  LE  F E+EI KAI  +   K+PGPDG      ++ W+++K DLV VF EF ++ I+N+ TN ++I
Subjt:  ILSFFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYI

Query:  CLIPKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFI
         L+PKK  A K+ DYRPI+L+TSLYK+IAKVLV RL+ +L  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD +
Subjt:  CLIPKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFI

Query:  LKLKGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        ++ KGF  +WR WIRGCL + +F+++VNG ++  + A+RGLRQGD LSP L
Subjt:  LKLKGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.7e-0329.66Show/hide
Query:  EDVDINSFLERADEVSHEDYLLERERARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGHSEGWITGVYDPCGYQEISQLLQELYD
        E  D+  F E   E     ++     AR+     L A  ++   LI+W   ++   + ++G+FSVSI+ T  G    W++ VY P        L  EL D
Subjt:  EDVDINSFLERADEVSHEDYLLERERARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGHSEGWITGVYDPCGYQEISQLLQELYD

Query:  LQGLCQGVWCLADNFNLI
        + GL    WC+  +FN+I
Subjt:  LQGLCQGVWCLADNFNLI

TrEMBL top hitse value%identityAlignment
A0A438IPG2 Transposon TX1 uncharacterized 149 kDa protein8.6e-8647.13Show/hide
Query:  KSFVALLT--DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILS
        K  +A+L   D LE++  +  + L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI  LE +SG ++   + I+ EIL 
Subjt:  KSFVALLT--DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILS

Query:  FFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLI
        +F KLY   S   + +EG+DW+ ID +S+  LE  F+E+EI KAI  +   K+PGPDG      ++ W+++K DLV VF EF ++ I+N+ TN ++I L+
Subjt:  FFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLI

Query:  PKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKL
        PKK ++ ++ D+RPI+L+TSLYK+IAKVL ERL+ VL  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD +L++
Subjt:  PKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKL

Query:  KGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        KGF  RWR W+RGCL + ++ V+VNG ++  + A+RGLRQGD LSP L
Subjt:  KGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

A0A803QI00 Uncharacterized protein1.0e-8649.41Show/hide
Query:  DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS
        DRLE  NS     +EER+KLK +  +L  +E+RS+  K K +W KEGD NS FFH      K R  IS +E++ G II KE+EI  E++ FFSKLY  ++
Subjt:  DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS

Query:  SPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVK
             IE ++W  I Y S+C LE +F E+E+++++      K+PGPDG      +N W  +K DL+EVF+ F +   +    NET+ICLIPK+  + KVK
Subjt:  SPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVK

Query:  DYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNW
        D+RPI+L+TS+YK++AK L  RL+ VL  TIS+ Q+AFV+GRQIL ++L+A+E VED   R +KG + K+DLEKAYD V+WDFLD +LK KGF   WR W
Subjt:  DYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNW

Query:  IRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        IRGC+ +++FS+++NGR R K   +RGLRQGD LSP L
Subjt:  IRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

A0A803QI00 Uncharacterized protein2.5e-0832.74Show/hide
Query:  RARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGHSEGWITGVYDPCGYQEISQLLQELYDLQGLCQGVWCLADNFNLI-SPKEFQ
        R+R  A + + AI  +  TL++W    + V+DS++G FS+S+    +G    W +GVY PC Y+       EL  L  +C   WC+  +FN+   P E  
Subjt:  RARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGHSEGWITGVYDPCGYQEISQLLQELYDLQGLCQGVWCLADNFNLI-SPKEFQ

Query:  QPLSADYSSKSFN
           S   S K F+
Subjt:  QPLSADYSSKSFN

A0A803QI00 Uncharacterized protein8.6e-8647.01Show/hide
Query:  EEIANKSFVALLTDRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGE
        E+IAN        D +E++  +  + L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI VLE + G ++   D I+ E
Subjt:  EEIANKSFVALLTDRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGE

Query:  ILSFFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYI
        IL +F KLY   S   + +EG+DW+ I  +S+  LE  F E+EI KAI  +   K+PGPDG      ++ W+++K DLV VF EF ++ I+N+ TN ++I
Subjt:  ILSFFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYI

Query:  CLIPKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFI
         L+PKK  A K+ DYRPI+L+TSLYK+IAKVLV RL+ +L  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD +
Subjt:  CLIPKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFI

Query:  LKLKGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        ++ KGF  +WR WIRGCL + +F+++VNG ++  + A+RGLRQGD LSP L
Subjt:  LKLKGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

A5BQD9 Reverse transcriptase domain-containing protein1.1e-8547.93Show/hide
Query:  DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS
        D +E++  +  + L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI VLE + G ++   D I+ EIL +F KLY   S
Subjt:  DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS

Query:  SPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVK
           + +EG+DW+ I  +S+  LE  F+E+EI KAI  +   K+PGPDG      ++ W+++K DLV VF EF ++ I+N+ TN ++I L+PKK  A K+ 
Subjt:  SPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVK

Query:  DYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNW
        DYRPI+L+TSLYK+IAKVL  RL+ VL  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD +++ KGF  RWR W
Subjt:  DYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNW

Query:  IRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        IRGCL + +F+++VNG ++  + A+RGLRQGD LSP L
Subjt:  IRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

A5BQD9 Reverse transcriptase domain-containing protein1.8e-0329.35Show/hide
Query:  ARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGHSEGWITGVYDPCGYQEISQLLQELYDLQGLCQGVWCLADNFNLI
        AR+     L A  ++   L+MW   ++   + V+G+FSVS++    G  + W++ VY P          +EL D+  L    WC+  +FN+I
Subjt:  ARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGHSEGWITGVYDPCGYQEISQLLQELYDLQGLCQGVWCLADNFNLI

A5BQD9 Reverse transcriptase domain-containing protein1.1e-8547.41Show/hide
Query:  KSFVALLT--DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILS
        K  +A+L   D LE++  +  ++L +R   K +L EL++ E+    QK +++W+KEGD NS FFH+ A   +NR FI  LE +SG ++   + I+ EIL 
Subjt:  KSFVALLT--DRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILS

Query:  FFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLI
        +F KLY   S   + +EG+DW+ ID +S+  LE  F+E+EI KAI  +   K+PGPDG      ++ W+++K DLV VF EF ++ I+N+ TN ++I LI
Subjt:  FFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLI

Query:  PKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKL
        PKK  + ++ DYRPI+L+TSLYK+IAKVL  RL+ VL  TI   Q AFVQGRQIL A+L+A+E+V+++    E+GV+ K+D EKAYD V+WDFLD +L++
Subjt:  PKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKL

Query:  KGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
        KGF  RWR W+RGCL + +++V+VNG ++  + A+RGLRQGD LSP L
Subjt:  KGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

SwissProt top hitse value%identityAlignment
F4INA9 ATP-dependent DNA helicase homolog RECG, chloroplastic1.3e-3353.59Show/hide
Query:  LARLFQMLEGLGSAIEKDGLLDKYRQPHLNAAYMKEWSCLTQKFLKALPYSLTESQ--------------------MKGVVGCGKSVVAFLACMEVIGAG
        LARL+QML+ LG+ IEKD LL+K+R+P LN+ Y++EWS LT+ FLKALPYSLT SQ                    ++G VGCGK+VVAFLACMEVIG+G
Subjt:  LARLFQMLEGLGSAIEKDGLLDKYRQPHLNAAYMKEWSCLTQKFLKALPYSLTESQ--------------------MKGVVGCGKSVVAFLACMEVIGAG

Query:  YQAAFMVLTELFAIQHYQHLLGLLETMEEIANKSFVALLTDRLEEQNSIQSQQ
        YQAAFM  TEL AIQHY+    LLE ME +++K  + LLT     + S   +Q
Subjt:  YQAAFMVLTELFAIQHYQHLLGLLETMEEIANKSFVALLTDRLEEQNSIQSQQ

P11369 LINE-1 retrotransposable element ORF2 protein2.2e-2226.45Show/hide
Query:  EEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIR-WLKEG-DENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLY-----
        +E NS +  + +E  KL+ ++ +  ++ +R++ +  + R W  E  ++      R     +++  I+ +  + G+I T  +EI+  I SF+ +LY     
Subjt:  EEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIR-WLKEG-DENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLY-----

Query:  NFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPK-KKK
        N D   KF ++      ++     +L    S +EI+  I  L   KSPGPDG   EF + +   L P L ++F +      +     E  I LIPK +K 
Subjt:  NFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPK-KKK

Query:  ADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVE-DQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFR
          K++++RPI+L+    K++ K+L  R+++ +   I   Q  F+ G Q    I  +  V+      +++  +++ LD EKA+D +   F+  +L+  G +
Subjt:  ADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVE-DQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFR

Query:  CRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL
          + N I+        ++ VNG   + I    G RQG  LSP L
Subjt:  CRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSL

P14381 Transposon TX1 uncharacterized 149 kDa protein8.4e-2225.82Show/hide
Query:  EQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS-SPK
        E  ++Q + LE ++ L+     +   + R    + +++ L + D  S FF+       NR  I+ L  + G  +   + I     SF+  L++ D  SP 
Subjt:  EQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS-SPK

Query:  FVIEGVDWAH-IDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVKDY
           E  D    +  +    LE   +  E+ +A+  + + KSPG DG+  EF + +W+ L PD   V  E F+   +        + L+PKK     +K++
Subjt:  FVIEGVDWAH-IDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVKDY

Query:  RPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNWIR
        RP++L+++ YK++AK +  RLK VL   I   Q+  V GR I   + +  +++            + LD EKA+D V+  +L   L+   F  ++  +++
Subjt:  RPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNWIR

Query:  GCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSLH
            ++   V +N      +   RG+RQG  LS  L+
Subjt:  GCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSLH

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)5.8e-0729.14Show/hide
Query:  LIPKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFIL
        LIPK    +   ++RPI + ++L +L+ ++L +RL+  + L  +    A + G  ++ ++L+ + +   +  R    V + LD+ KA+D V+   +   L
Subjt:  LIPKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFIL

Query:  KLKGFRCRWRNWIRGCLRNSNFSVMVN-GRSRDKIVATRGLRQGDLLSPSL
        +  G      N+I G L +S  ++ V  G    KI   RG++QGD LSP L
Subjt:  KLKGFRCRWRNWIRGCLRNSNFSVMVN-GRSRDKIVATRGLRQGDLLSPSL

Q03278 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)4.5e-0725.38Show/hide
Query:  GPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQI
        GPDGM        WN +   +  +F     +    +R  ++   LIPK+        +RP+++ +   +   ++L  R+ +  LL     Q AF+    +
Subjt:  GPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQI

Query:  LGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSLHHC
             + S ++++   + +   +  LD++KA+D V    +   L+ K      RN+I    RNS   + V       I   RG+RQGD LSP L +C
Subjt:  LGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNWIRGCLRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSLHHC

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.0e-1829.35Show/hide
Query:  QKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS---SPKFV--IEGVDWAHIDYQSSCNLEENFSEQEI
        QK +I+WL++GD N+ FFH+     + +  I  L  D    +    +++  I+++++ L   DS   +P  V  I+ +     +   +  L    S++EI
Subjt:  QKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEIEGEILSFFSKLYNFDS---SPKFV--IEGVDWAHIDYQSSCNLEENFSEQEI

Query:  QKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVKDYRPINLVTSLYKLI
          A+  +   K+PGPD    EF    W ++K   +   +EFF+   + KR N T I LIPK    D++  +RP++  T +YK+I
Subjt:  QKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKKKADKVKDYRPINLVTSLYKLI

AT2G01440.1 DEAD/DEAH box RNA helicase family protein8.9e-3553.59Show/hide
Query:  LARLFQMLEGLGSAIEKDGLLDKYRQPHLNAAYMKEWSCLTQKFLKALPYSLTESQ--------------------MKGVVGCGKSVVAFLACMEVIGAG
        LARL+QML+ LG+ IEKD LL+K+R+P LN+ Y++EWS LT+ FLKALPYSLT SQ                    ++G VGCGK+VVAFLACMEVIG+G
Subjt:  LARLFQMLEGLGSAIEKDGLLDKYRQPHLNAAYMKEWSCLTQKFLKALPYSLTESQ--------------------MKGVVGCGKSVVAFLACMEVIGAG

Query:  YQAAFMVLTELFAIQHYQHLLGLLETMEEIANKSFVALLTDRLEEQNSIQSQQ
        YQAAFM  TEL AIQHY+    LLE ME +++K  + LLT     + S   +Q
Subjt:  YQAAFMVLTELFAIQHYQHLLGLLETMEEIANKSFVALLTDRLEEQNSIQSQQ

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.6e-0742.17Show/hide
Query:  LVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGV----LMKLDLEKAYDMVNWDFLDFILKLKGFRCRW
        +VERLK ++   I   Q +F+ GR     I+   E V   S R +KGV    L+KLDLEKAYD + WD+L+  L   GF   W
Subjt:  LVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGV----LMKLDLEKAYDMVNWDFLDFILKLKGFRCRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGGTCCACGGACAAGGGGAAAAGCCCACAGAGGAAAGCTGAGGAGGCCTGGAACGGTGATCTAGGGATGGATTCAGACTTATCCATTTCAAGTCCAACCAGTAG
TGAAAGCGATAGGGAGGGGGTAACGAGAAACGGTTACGGTGAAGAAACGAAAAACGACTTCCTGGAAGTCTCCCCGAATGCCTTAGCCATCATTCCTTCGCCGACTGTGC
AGAGAAATCCTATCGATAAAGAGGTTGGAAACCCTTTAATTAGCAAAGATCTGATCCTCACCCATAGAAAAAATAATTTATGCATTAGACCGATATCAGACATGAATGCT
AAGAAAGGAAACTCTACCAGAAAAAGGCGAATCAAAGAGGTAACTAATCTCTTGAGAACTTGGGAGAAAGAAGCAAAACTGAGCATGGAGATTGATAACGAAGAGGAAGA
AGATGTAGACATTAATAGCTTTTTAGAAAGGGCCGACGAGGTCTCCCATGAAGATTATCTCCTGGAACGTGAGAGGGCTAGGCACATAGCTCGGGTGGCCCTAGACGCTA
TTAATTCCGCTGTATGCACTCTCATTATGTGGAAAGATTCTGAGGTTGATGTGGTGGATTCAGTAATGGGGGCTTTTTCTGTGTCGATTAGATGTACCTTCCAGGGTCAT
TCTGAAGGGTGGATCACGGGGGTTTATGACCCTTGTGGCTATCAGGAGATATCCCAGTTATTGCAGGAGTTGTATGATTTGCAAGGGTTATGTCAGGGGGTTTGGTGTTT
AGCAGACAACTTTAACCTAATCAGCCCTAAGGAATTCCAGCAGCCACTGTCCGCGGATTATTCTTCCAAAAGTTTTAATGTTAGCATTGGAAAAATGTGTAACCAGGAAG
CTCCCAAAAGTATTAATGATTGCATTGGAAAAGTGTGCAATCAGCAGGCAGCTTTGAAATCTTATGTTGGCTGTAATGATCCTTCCAAGATGATTAATGATAATAGTTGT
AATTTGATTAATGATTTACAACAGATTCCACGAGAGAAGGACCAATTTAATGAAGCTTTGGGTTCTCCAAAAGGTGCTTCATTGCATGAAGAGTGTATTAATTATGTTGG
AGGTAATGGTATTAAAGATAGTATTAATGAGCCGGTTTTTGTTCTCTCTACATCCAAAGATGATAATGTGTTTGATGTGTTCAGCCCTCAGGAAGTCCGACAGTCCCAGT
TTTTGGAATCTCCTTCTAAGAATTTGAATGTTGTTAATTGCAATTCAAATAATAATGTCCAGCAGGTTTCGCGATCTTCAAATGTCAATTCTGGAAATGGTTTGTTTCAG
GATGATGAGTCTATGGTTAGTGTAAGCAGTGAGGATTCTGATCAGTTGTTGGCACGCTTATTCCAAATGCTTGAAGGCCTCGGTTCAGCAATAGAGAAAGATGGTTTGCT
GGACAAGTATCGTCAACCTCATTTAAATGCTGCATATATGAAGGAATGGTCTTGTCTCACCCAAAAGTTTCTTAAGGCTCTTCCATATTCCCTTACAGAGAGTCAGATGA
AAGGTGTCGTAGGATGTGGGAAGTCAGTGGTTGCTTTTCTGGCATGTATGGAAGTTATAGGCGCTGGATATCAGGCAGCTTTCATGGTTCTGACTGAGTTGTTTGCTATT
CAGCATTATCAACATTTGCTTGGTTTGTTAGAGACCATGGAAGAAATCGCTAATAAATCTTTTGTTGCTTTACTAACAGATCGTTTAGAGGAGCAAAATAGCATCCAAAG
TCAGCAGCTAGAGGAAAGAAAGAAGCTTAAAGCAGACCTTCTTGAATTATTGATGGATGAACAGAGAAGCTTAAACCAAAAATGCAAAATTAGATGGCTTAAAGAGGGGG
ATGAAAACTCTACATTCTTTCATAGATGGGCCACGACTATGAAAAACAGGGCCTTCATTTCAGTGTTAGAGAAAGATAGTGGGGAGATTATTACTAAAGAAGATGAGATT
GAGGGGGAAATTCTGTCTTTTTTCAGCAAGCTTTATAATTTTGATTCTAGCCCAAAATTTGTGATTGAGGGTGTGGATTGGGCCCATATTGACTACCAGAGCAGTTGTAA
TCTGGAAGAGAACTTCAGTGAGCAGGAAATTCAAAAGGCTATTTGTGGGTTGGGAAATCTGAAATCTCCGGGTCCGGATGGTATGATGGGAGAGTTTTTAAAAAATTATT
GGAACATCTTGAAGCCAGATTTAGTAGAGGTGTTCCAAGAATTTTTTCAAAACAGCATAGTGAACAAGCGGACTAATGAAACTTACATTTGTTTGATTCCCAAGAAGAAA
AAGGCAGATAAAGTTAAAGATTATAGACCGATCAACCTTGTTACCTCCTTGTATAAGCTTATAGCTAAGGTGCTTGTCGAGAGGCTTAAGAAAGTTTTGCTCCTTACAAT
TAGTGACTGCCAAACGGCTTTTGTTCAAGGTAGACAAATCCTTGGTGCCATTTTAGTGGCTAGTGAGGTTGTTGAAGACCAAAGTTGTAGGAATGAGAAGGGTGTTCTTA
TGAAGCTTGATCTTGAAAAAGCTTACGATATGGTTAATTGGGATTTCCTCGACTTTATTCTCAAATTGAAGGGATTCAGATGTAGATGGAGGAATTGGATCAGAGGGTGC
CTCAGAAATTCAAATTTCTCGGTTATGGTCAATGGAAGGTCGAGAGATAAAATTGTGGCAACCAGAGGGCTTCGACAAGGGGACCTTTTATCCCCTTCTCTTCACCATTG
TTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGGTCCACGGACAAGGGGAAAAGCCCACAGAGGAAAGCTGAGGAGGCCTGGAACGGTGATCTAGGGATGGATTCAGACTTATCCATTTCAAGTCCAACCAGTAG
TGAAAGCGATAGGGAGGGGGTAACGAGAAACGGTTACGGTGAAGAAACGAAAAACGACTTCCTGGAAGTCTCCCCGAATGCCTTAGCCATCATTCCTTCGCCGACTGTGC
AGAGAAATCCTATCGATAAAGAGGTTGGAAACCCTTTAATTAGCAAAGATCTGATCCTCACCCATAGAAAAAATAATTTATGCATTAGACCGATATCAGACATGAATGCT
AAGAAAGGAAACTCTACCAGAAAAAGGCGAATCAAAGAGGTAACTAATCTCTTGAGAACTTGGGAGAAAGAAGCAAAACTGAGCATGGAGATTGATAACGAAGAGGAAGA
AGATGTAGACATTAATAGCTTTTTAGAAAGGGCCGACGAGGTCTCCCATGAAGATTATCTCCTGGAACGTGAGAGGGCTAGGCACATAGCTCGGGTGGCCCTAGACGCTA
TTAATTCCGCTGTATGCACTCTCATTATGTGGAAAGATTCTGAGGTTGATGTGGTGGATTCAGTAATGGGGGCTTTTTCTGTGTCGATTAGATGTACCTTCCAGGGTCAT
TCTGAAGGGTGGATCACGGGGGTTTATGACCCTTGTGGCTATCAGGAGATATCCCAGTTATTGCAGGAGTTGTATGATTTGCAAGGGTTATGTCAGGGGGTTTGGTGTTT
AGCAGACAACTTTAACCTAATCAGCCCTAAGGAATTCCAGCAGCCACTGTCCGCGGATTATTCTTCCAAAAGTTTTAATGTTAGCATTGGAAAAATGTGTAACCAGGAAG
CTCCCAAAAGTATTAATGATTGCATTGGAAAAGTGTGCAATCAGCAGGCAGCTTTGAAATCTTATGTTGGCTGTAATGATCCTTCCAAGATGATTAATGATAATAGTTGT
AATTTGATTAATGATTTACAACAGATTCCACGAGAGAAGGACCAATTTAATGAAGCTTTGGGTTCTCCAAAAGGTGCTTCATTGCATGAAGAGTGTATTAATTATGTTGG
AGGTAATGGTATTAAAGATAGTATTAATGAGCCGGTTTTTGTTCTCTCTACATCCAAAGATGATAATGTGTTTGATGTGTTCAGCCCTCAGGAAGTCCGACAGTCCCAGT
TTTTGGAATCTCCTTCTAAGAATTTGAATGTTGTTAATTGCAATTCAAATAATAATGTCCAGCAGGTTTCGCGATCTTCAAATGTCAATTCTGGAAATGGTTTGTTTCAG
GATGATGAGTCTATGGTTAGTGTAAGCAGTGAGGATTCTGATCAGTTGTTGGCACGCTTATTCCAAATGCTTGAAGGCCTCGGTTCAGCAATAGAGAAAGATGGTTTGCT
GGACAAGTATCGTCAACCTCATTTAAATGCTGCATATATGAAGGAATGGTCTTGTCTCACCCAAAAGTTTCTTAAGGCTCTTCCATATTCCCTTACAGAGAGTCAGATGA
AAGGTGTCGTAGGATGTGGGAAGTCAGTGGTTGCTTTTCTGGCATGTATGGAAGTTATAGGCGCTGGATATCAGGCAGCTTTCATGGTTCTGACTGAGTTGTTTGCTATT
CAGCATTATCAACATTTGCTTGGTTTGTTAGAGACCATGGAAGAAATCGCTAATAAATCTTTTGTTGCTTTACTAACAGATCGTTTAGAGGAGCAAAATAGCATCCAAAG
TCAGCAGCTAGAGGAAAGAAAGAAGCTTAAAGCAGACCTTCTTGAATTATTGATGGATGAACAGAGAAGCTTAAACCAAAAATGCAAAATTAGATGGCTTAAAGAGGGGG
ATGAAAACTCTACATTCTTTCATAGATGGGCCACGACTATGAAAAACAGGGCCTTCATTTCAGTGTTAGAGAAAGATAGTGGGGAGATTATTACTAAAGAAGATGAGATT
GAGGGGGAAATTCTGTCTTTTTTCAGCAAGCTTTATAATTTTGATTCTAGCCCAAAATTTGTGATTGAGGGTGTGGATTGGGCCCATATTGACTACCAGAGCAGTTGTAA
TCTGGAAGAGAACTTCAGTGAGCAGGAAATTCAAAAGGCTATTTGTGGGTTGGGAAATCTGAAATCTCCGGGTCCGGATGGTATGATGGGAGAGTTTTTAAAAAATTATT
GGAACATCTTGAAGCCAGATTTAGTAGAGGTGTTCCAAGAATTTTTTCAAAACAGCATAGTGAACAAGCGGACTAATGAAACTTACATTTGTTTGATTCCCAAGAAGAAA
AAGGCAGATAAAGTTAAAGATTATAGACCGATCAACCTTGTTACCTCCTTGTATAAGCTTATAGCTAAGGTGCTTGTCGAGAGGCTTAAGAAAGTTTTGCTCCTTACAAT
TAGTGACTGCCAAACGGCTTTTGTTCAAGGTAGACAAATCCTTGGTGCCATTTTAGTGGCTAGTGAGGTTGTTGAAGACCAAAGTTGTAGGAATGAGAAGGGTGTTCTTA
TGAAGCTTGATCTTGAAAAAGCTTACGATATGGTTAATTGGGATTTCCTCGACTTTATTCTCAAATTGAAGGGATTCAGATGTAGATGGAGGAATTGGATCAGAGGGTGC
CTCAGAAATTCAAATTTCTCGGTTATGGTCAATGGAAGGTCGAGAGATAAAATTGTGGCAACCAGAGGGCTTCGACAAGGGGACCTTTTATCCCCTTCTCTTCACCATTG
TTGGTGA
Protein sequenceShow/hide protein sequence
MVWSTDKGKSPQRKAEEAWNGDLGMDSDLSISSPTSSESDREGVTRNGYGEETKNDFLEVSPNALAIIPSPTVQRNPIDKEVGNPLISKDLILTHRKNNLCIRPISDMNA
KKGNSTRKRRIKEVTNLLRTWEKEAKLSMEIDNEEEEDVDINSFLERADEVSHEDYLLERERARHIARVALDAINSAVCTLIMWKDSEVDVVDSVMGAFSVSIRCTFQGH
SEGWITGVYDPCGYQEISQLLQELYDLQGLCQGVWCLADNFNLISPKEFQQPLSADYSSKSFNVSIGKMCNQEAPKSINDCIGKVCNQQAALKSYVGCNDPSKMINDNSC
NLINDLQQIPREKDQFNEALGSPKGASLHEECINYVGGNGIKDSINEPVFVLSTSKDDNVFDVFSPQEVRQSQFLESPSKNLNVVNCNSNNNVQQVSRSSNVNSGNGLFQ
DDESMVSVSSEDSDQLLARLFQMLEGLGSAIEKDGLLDKYRQPHLNAAYMKEWSCLTQKFLKALPYSLTESQMKGVVGCGKSVVAFLACMEVIGAGYQAAFMVLTELFAI
QHYQHLLGLLETMEEIANKSFVALLTDRLEEQNSIQSQQLEERKKLKADLLELLMDEQRSLNQKCKIRWLKEGDENSTFFHRWATTMKNRAFISVLEKDSGEIITKEDEI
EGEILSFFSKLYNFDSSPKFVIEGVDWAHIDYQSSCNLEENFSEQEIQKAICGLGNLKSPGPDGMMGEFLKNYWNILKPDLVEVFQEFFQNSIVNKRTNETYICLIPKKK
KADKVKDYRPINLVTSLYKLIAKVLVERLKKVLLLTISDCQTAFVQGRQILGAILVASEVVEDQSCRNEKGVLMKLDLEKAYDMVNWDFLDFILKLKGFRCRWRNWIRGC
LRNSNFSVMVNGRSRDKIVATRGLRQGDLLSPSLHHCW