; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011989 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011989
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr1:36095849..36100943
RNA-Seq ExpressionLag0011989
SyntenyLag0011989
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera]2.5e-8935.22Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++   F + +   LP+              F  GPTPFRFEN+WL+HP F+  F  WW+     GW G+K M KL+ +K  ++ WNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +K+++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEGD N+ FFH+ A   +NR FI  LE+EN     +
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT
           I+ +IL            +  ++E L          + +    +  + C   +  + D+           F   W +  ++  + F         N 
Subjt:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT

Query:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN
            +F  L+PKK  + ++ DFRPISL+TS YKIIAK LA R+++VL   I   Q +FV GRQILDA+L+A + V++ R   ++GV+ K+D EKAY+ V+
Subjt:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN

Query:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT
        W+ LD ++ MKGFGI+WR W+RGCL + +F+V++N   +G + A+ GLRQ DPLSPFLFTIV D++S+ +    E+ +L+G+ VG+N+  VS  Q+ DDT
Subjt:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT

Query:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
        + FS + E D+    ++L +    S L VN+ K+++ GIN++ N ++  A+   C A   P+ YL
Subjt:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

CAN80093.1 hypothetical protein VITISV_010721 [Vitis vinifera]2.9e-9036.04Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++   F + +   LP+              F  GPTPF FEN+WL+HP F+  F  WW+     GW G+K M KL+ +K  ++EWNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +KK++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEGD N+ FFH+ A   +NR FI  LE+EN      
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRWLKEDDENTNFFHRWAIAMKNRAF-ISVLEKSSTNTLRRHTFCLIPKKKKAAKVRDFRP
                                 L MN   S+            +E   +F +   +     + +  L+ S  +        L+PKK  + ++ DFRP
Subjt:  EAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRWLKEDDENTNFFHRWAIAMKNRAF-ISVLEKSSTNTLRRHTFCLIPKKKKAAKVRDFRP

Query:  ISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRMWIRGC
        ISL+TS YKIIAK LA R++ VL   I   Q +FV GRQILDA+L+A + V++ R   ++GV+ K+D EKAY+ V+W+ LD +L MKGFG++WR W+RGC
Subjt:  ISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRMWIRGC

Query:  LINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIMEG
        L + +F+V++N   +G + A+ GLRQ DPLSPFLFTIV D++S+ +    E+ +L+G+ VG+N+  VS  Q+ DDT+ FS + E D+    ++L +    
Subjt:  LINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIMEG

Query:  SRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
        S L VN+ K+++ GIN++ N ++  A+   C     P+ YL
Subjt:  SRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

CAN83313.1 hypothetical protein VITISV_001463 [Vitis vinifera]3.2e-8937.02Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++  +F + +   LP+              F  GP PFRFEN+WL+HP+F+  F  WW+     GW G+K M KL+ +K  ++EWNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +KK++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEG+ N+ FFH+ A   +NR FI+ LE+EN+    +
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRW--LKEDDENT-NFFHRWAIAMKNRAFISVLEKSSTNTLRRHTFCLIPKKKKAAKVRDF
           I+ +IL  +        I +            Q C   W  +KED       FHR           S +   STN        L+PKK  + +  DF
Subjt:  EAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRW--LKEDDENT-NFFHRWAIAMKNRAFISVLEKSSTNTLRRHTFCLIPKKKKAAKVRDF

Query:  RPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRMWIR
        RPISL+TS YKIIAK L  RL+ VL   I   Q +FV GRQILDA+L+A + V++ R   ++G++ K+D EKAY+ V+W+ LD +L MKGF  +WR W++
Subjt:  RPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRMWIR

Query:  GCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIM
        GCL + +F+V++N   +G + A+ GLRQD+PLSPFLFTIV D++S+ +    E+ +L+G+ VG+N++ VS  Q+ DDT+ FS   E D+    ++L +  
Subjt:  GCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIM

Query:  EGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
          S L VN+ K+++ GIN++ N ++  A+   C A   P+ YL
Subjt:  EGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.5e-8935.22Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++   F + +   LP+              F  GPTPFRFEN+WL+HP F+  F  WW+     GW G+K M KL+ +K  ++ WNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +K+++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEGD N+ FFH+ A   +NR FI  LE+EN     +
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT
           I+ +IL            +  ++E L          + +    +  + C   +  + D+           F   W +  ++  + F         N 
Subjt:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT

Query:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN
            +F  L+PKK  + ++ DFRPISL+TS YKIIAK LA R+++VL   I   Q +FV GRQILDA+L+A + V++ R   ++GV+ K+D EKAY+ V+
Subjt:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN

Query:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT
        W+ LD ++ MKGFGI+WR W+RGCL + +F+V++N   +G + A+ GLRQ DPLSPFLFTIV D++S+ +    E+ +L+G+ VG+N+  VS  Q+ DDT
Subjt:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT

Query:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
        + FS + E D+    ++L +    S L VN+ K+++ GIN++ N ++  A+   C A   P+ YL
Subjt:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.5e-8935.22Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++   F + +   LP+              F  GPTPFRFEN+WL+HP F+  F  WW+     GW G+K M KL+ +K  ++ WNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +K+++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEGD N+ FFH+ A   +NR FI  LE+EN     +
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT
           I+ +IL            +  ++E L          + +    +  + C   +  + D+           F   W +  ++  + F         N 
Subjt:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT

Query:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN
            +F  L+PKK  + ++ DFRPISL+TS YKIIAK LA R+++VL   I   Q +FV GRQILDA+L+A + V++ R   ++GV+ K+D EKAY+ V+
Subjt:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN

Query:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT
        W+ LD ++ MKGFGI+WR W+RGCL + +F+V++N   +G + A+ GLRQ DPLSPFLFTIV D++S+ +    E+ +L+G+ VG+N+  VS  Q+ DDT
Subjt:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT

Query:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
        + FS + E D+    ++L +    S L VN+ K+++ GIN++ N ++  A+   C A   P+ YL
Subjt:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

TrEMBL top hitse value%identityAlignment
A0A438GDE7 LINE-1 retrotransposable element ORF2 protein1.2e-8935.22Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++   F + +   LP+              F  GPTPFRFEN+WL+HP F+  F  WW+     GW G+K M KL+ +K  ++ WNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +K+++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEGD N+ FFH+ A   +NR FI  LE+EN     +
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT
           I+ +IL            +  ++E L          + +    +  + C   +  + D+           F   W +  ++  + F         N 
Subjt:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT

Query:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN
            +F  L+PKK  + ++ DFRPISL+TS YKIIAK LA R+++VL   I   Q +FV GRQILDA+L+A + V++ R   ++GV+ K+D EKAY+ V+
Subjt:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN

Query:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT
        W+ LD ++ MKGFGI+WR W+RGCL + +F+V++N   +G + A+ GLRQ DPLSPFLFTIV D++S+ +    E+ +L+G+ VG+N+  VS  Q+ DDT
Subjt:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT

Query:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
        + FS + E D+    ++L +    S L VN+ K+++ GIN++ N ++  A+   C A   P+ YL
Subjt:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

A0A438JX47 LINE-1 retrotransposable element ORF2 protein1.2e-8935.22Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++   F + +   LP+              F  GPTPFRFEN+WL+HP F+  F  WW+     GW G+K M KL+ +K  ++ WNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +K+++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEGD N+ FFH+ A   +NR FI  LE+EN     +
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT
           I+ +IL            +  ++E L          + +    +  + C   +  + D+           F   W +  ++  + F         N 
Subjt:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT

Query:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN
            +F  L+PKK  + ++ DFRPISL+TS YKIIAK LA R+++VL   I   Q +FV GRQILDA+L+A + V++ R   ++GV+ K+D EKAY+ V+
Subjt:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN

Query:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT
        W+ LD ++ MKGFGI+WR W+RGCL + +F+V++N   +G + A+ GLRQ DPLSPFLFTIV D++S+ +    E+ +L+G+ VG+N+  VS  Q+ DDT
Subjt:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT

Query:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
        + FS + E D+    ++L +    S L VN+ K+++ GIN++ N ++  A+   C A   P+ YL
Subjt:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

A5AI05 Reverse transcriptase domain-containing protein1.4e-9036.04Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++   F + +   LP+              F  GPTPF FEN+WL+HP F+  F  WW+     GW G+K M KL+ +K  ++EWNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +KK++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEGD N+ FFH+ A   +NR FI  LE+EN      
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRWLKEDDENTNFFHRWAIAMKNRAF-ISVLEKSSTNTLRRHTFCLIPKKKKAAKVRDFRP
                                 L MN   S+            +E   +F +   +     + +  L+ S  +        L+PKK  + ++ DFRP
Subjt:  EAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRWLKEDDENTNFFHRWAIAMKNRAF-ISVLEKSSTNTLRRHTFCLIPKKKKAAKVRDFRP

Query:  ISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRMWIRGC
        ISL+TS YKIIAK LA R++ VL   I   Q +FV GRQILDA+L+A + V++ R   ++GV+ K+D EKAY+ V+W+ LD +L MKGFG++WR W+RGC
Subjt:  ISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRMWIRGC

Query:  LINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIMEG
        L + +F+V++N   +G + A+ GLRQ DPLSPFLFTIV D++S+ +    E+ +L+G+ VG+N+  VS  Q+ DDT+ FS + E D+    ++L +    
Subjt:  LINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIMEG

Query:  SRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
        S L VN+ K+++ GIN++ N ++  A+   C     P+ YL
Subjt:  SRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

A5BJC9 Reverse transcriptase domain-containing protein1.6e-8937.02Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++  +F + +   LP+              F  GP PFRFEN+WL+HP+F+  F  WW+     GW G+K M KL+ +K  ++EWNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +KK++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEG+ N+ FFH+ A   +NR FI+ LE+EN+    +
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRW--LKEDDENT-NFFHRWAIAMKNRAFISVLEKSSTNTLRRHTFCLIPKKKKAAKVRDF
           I+ +IL  +        I +            Q C   W  +KED       FHR           S +   STN        L+PKK  + +  DF
Subjt:  EAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRW--LKEDDENT-NFFHRWAIAMKNRAFISVLEKSSTNTLRRHTFCLIPKKKKAAKVRDF

Query:  RPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRMWIR
        RPISL+TS YKIIAK L  RL+ VL   I   Q +FV GRQILDA+L+A + V++ R   ++G++ K+D EKAY+ V+W+ LD +L MKGF  +WR W++
Subjt:  RPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRMWIR

Query:  GCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIM
        GCL + +F+V++N   +G + A+ GLRQD+PLSPFLFTIV D++S+ +    E+ +L+G+ VG+N++ VS  Q+ DDT+ FS   E D+    ++L +  
Subjt:  GCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIM

Query:  EGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
          S L VN+ K+++ GIN++ N ++  A+   C A   P+ YL
Subjt:  EGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

A5CAA2 Reverse transcriptase domain-containing protein1.2e-8935.22Show/hide
Query:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK
        PV +R+   L+S ++   F + +   LP+              F  GPTPFRFEN+WL+HP F+  F  WW+     GW G+K M KL+ +K  ++ WNK
Subjt:  PVIERIQHKLHSWKYAFIFRKVVDIFLPK-------------SFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGWAGYKVMEKLKHLKVHIREWNK

Query:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS
            +   +K+++LS + +   LE++G +    + +R   K EL +L + E+    QK +++W+KEGD N+ FFH+ A   +NR FI  LE+EN     +
Subjt:  DILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNRAFISVLESENEDARTS

Query:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT
           I+ +IL            +  ++E L          + +    +  + C   +  + D+           F   W +  ++  + F         N 
Subjt:  EAEIEIKILS-----LEIEERKRLKIELL---------KLTMNEQRSLNQKCTIRWLKEDDENTN--------FFHRWAIAMKN--RAFISVLEKSSTNT

Query:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN
            +F  L+PKK  + ++ DFRPISL+TS YKIIAK LA R+++VL   I   Q +FV GRQILDA+L+A + V++ R   ++GV+ K+D EKAY+ V+
Subjt:  LRRHTF-CLIPKKKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVN

Query:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT
        W+ LD ++ MKGFGI+WR W+RGCL + +F+V++N   +G + A+ GLRQ DPLSPFLFTIV D++S+ +    E+ +L+G+ VG+N+  VS  Q+ DDT
Subjt:  WELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT

Query:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL
        + FS + E D+    ++L +    S L VN+ K+++ GIN++ N ++  A+   C A   P+ YL
Subjt:  LIFSPNNESDVAIWWDILKLIMEGSRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.6e-1729.13Show/hide
Query:  NTLRRHTFCLIPKK-KKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKG-VLVKLDLEKAYN
        N+    +  LIPK  +   K  +FRPISL+    KI+ K LA R+++ +   I   QV F+ G Q    I  +   ++ +   KDK  V++ +D EKA++
Subjt:  NTLRRHTFCLIPKK-KKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKG-VLVKLDLEKAYN

Query:  IVNWELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYV
         +    +   LN  G    +   IR        ++ILN +        TG RQ  PLSP LF IV ++++++++   ++  +KG  +GK +V +S+  + 
Subjt:  IVNWELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYV

Query:  DDTLIFSPNNESDVAIWWDILKLIMEGSRL
        DD +++    E+ +    ++LKLI   S++
Subjt:  DDTLIFSPNNESDVAIWWDILKLIMEGSRL

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-1527.73Show/hide
Query:  LEKSSTNTLRRHTFCLIPK-KKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKG-VLVKLD
        +E +  N+    T  LIPK +K   K+ +FRPISL+    KI+ K LA R+++ +   I   QV F+ G Q    I  +   +  +   KDK  +++ LD
Subjt:  LEKSSTNTLRRHTFCLIPK-KKKAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKG-VLVKLD

Query:  LEKAYNIVNWELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVV
         EKA++ +    +  +L   G    +   I+        ++ +N      +   +G RQ  PLSP+LF IV ++++++++   ++  +KG  +GK +V +
Subjt:  LEKAYNIVNWELLDAILNMKGFGIKWRMWIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVV

Query:  SVQQYVDDTLIF--SPNNES
        S+    DD +++   P N +
Subjt:  SVQQYVDDTLIF--SPNNES

Q6K4V3 Zinc finger CCCH domain-containing protein 153.0e-2142.55Show/hide
Query:  DSRATATLETETEFSRDARAIRERVLKQVEEALK------------GKGARVLGVRSCI---KGSMPMWITRLASEGNTQFQVKEPLLI-LELQKDF--I
        DSRATATLETETEF RDARAIRER LKQ EE+LK            G G    G+        G            G +   ++    I L  + D+   
Subjt:  DSRATATLETETEFSRDARAIRERVLKQVEEALK------------GKGARVLGVRSCI---KGSMPMWITRLASEGNTQFQVKEPLLI-LELQKDF--I

Query:  ISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK
        I   +  T           MHDR DY+S W        +EKARK ++AMG D  D +A E+ D+DDE+ALP AC+IC E FVDPVVTK
Subjt:  ISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK

Q8GX84 Zinc finger CCCH domain-containing protein 14.8e-1939.79Show/hide
Query:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK
        K+++  NDS ATATLETET+F++DARAIRERVLK+ +EALKG   +         + G      G            G +   ++    I      + Q 
Subjt:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK

Query:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK
        D  I   +  T           +HDR DY+  W        +EK RK   AMG +D D++A + SDE DE+ALP ACFIC E FVDPVVTK
Subjt:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK

Q9FNG6 Zinc finger CCCH domain-containing protein 516.9e-1838.74Show/hide
Query:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK
        K+++  NDS ATATLETET+F++DARAIRERVLK+ + ALKG   +         + G      G            G +   ++    I      + Q 
Subjt:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK

Query:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK
        D  I   +  T           +HDR DY+  W        +EK RK   AMG +D D++A + SDE DE+ALP ACFIC E F+DPVVTK
Subjt:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK

Arabidopsis top hitse value%identityAlignment
AT1G01350.1 Zinc finger (CCCH-type/C3HC4-type RING finger) family protein3.4e-2039.79Show/hide
Query:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK
        K+++  NDS ATATLETET+F++DARAIRERVLK+ +EALKG   +         + G      G            G +   ++    I      + Q 
Subjt:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK

Query:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK
        D  I   +  T           +HDR DY+  W        +EK RK   AMG +D D++A + SDE DE+ALP ACFIC E FVDPVVTK
Subjt:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.6e-0840.74Show/hide
Query:  LAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKD-KG-VLVKLDLEKAYNIVNWELLDAILNMKGFGIKW
        + ERLK ++ + I   Q SF+ GR   D I+   +AV  +R +K  KG +L+KLDLEKAY+ + W+ L+  L   GF   W
Subjt:  LAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKD-KG-VLVKLDLEKAYNIVNWELLDAILNMKGFGIKW

AT5G06420.1 Zinc finger (CCCH-type/C3HC4-type RING finger) family protein4.9e-1938.74Show/hide
Query:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK
        K+++  NDS ATATLETET+F++DARAIRERVLK+ + ALKG   +         + G      G            G +   ++    I      + Q 
Subjt:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK

Query:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK
        D  I   +  T           +HDR DY+  W        +EK RK   AMG +D D++A + SDE DE+ALP ACFIC E F+DPVVTK
Subjt:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK

AT5G06420.2 Zinc finger (CCCH-type/C3HC4-type RING finger) family protein4.9e-1938.74Show/hide
Query:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK
        K+++  NDS ATATLETET+F++DARAIRERVLK+ + ALKG   +         + G      G            G +   ++    I      + Q 
Subjt:  KKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGAR---------VLGVRSCIKGSMPMWITRLASEGNTQFQVKEPLLI-----LELQK

Query:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK
        D  I   +  T           +HDR DY+  W        +EK RK   AMG +D D++A + SDE DE+ALP ACFIC E F+DPVVTK
Subjt:  DFIISLIFARTSRRLVTTVVTEMHDRVDYESRWL-------SEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICGEAFVDPVVTK

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.8e-0636.76Show/hide
Query:  ILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT
        I+N  P+G +  + GLRQ DPLSP+LF +  +++S   +   E+G L G  V  N   ++   + DDT
Subjt:  ILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAAGCGTTGAGGACGACAAGGTTAAAGCAGCGGCATGACGGAGAGGAGCAATACGGCGAATTGAAGCTTCAGTGGTGGGGAACGGCGACTTTATTATCT
CATAGTGCAAATATGGGTCGAATTGCTTCTCATCCGGTTGGGAATCCTCAGCTTCATGTGAATCATTTACAATTCGTTGATGATACGTTATTATTCTCCATTTAT
TGTAAAGATGCACTGGTTAACTTGTTCGATATTATAAAAATTTTTGAGATGGTTTCTGGGTTGAATATTAACTATGCCAAGAGTGAGGTCTTGGGGATTCATTTA
GATGATTCAAAATTGGAGTGGTTGACAACAACTTTTGGATGCAAACAAGGGACTTGGCCTTCTACCTATCTTGGTTTACCTTTGGGTGGCAATGCAAAATCAATT
CATTTTTGGCCACCGGTGATAGAAAGAATTCAACATAAGCTTCATAGTTGGAAGTATGCTTTTATATTTCGAAAGGTGGTCGACATATTCTTACCCAAGTCGTTC
TCTCTAGGGCCAACCCCGTTTAGATTTGAGAACGTTTGGCTGGAGCACCCAGACTTTGAAGGAAAGTTTGAGCATTGGTGGCAATACGAAAACCCGTGTGGCTGG
GCAGGGTACAAAGTGATGGAAAAACTTAAGCACCTGAAGGTGCACATTAGAGAATGGAATAAAGACATTCTTGCAAAGTCTGTGAGTAAGAAAAAGGAAGTTTTG
AGCCAAATTGATCACAATTACTGGCTCGAAGAGCAGGGTAATATTCATAGCCTCGAGATTGAAGAGAGAAAGCGCCTTAAGATTGAGTTGCTTAAGTTGACCATG
AATGAGCAAAGAAGCCTCAACCAAAAATGTAAGATTAGATGGCTTAAGGAAGGTGATGAGAACACCAATTTCTTTCATAGATGGGCCATAGCTATGAAAAACCGG
GCCTTCATCTCGGTCCTTGAAAGTGAGAATGAGGATGCCCGTACTTCTGAAGCTGAGATTGAAATAAAGATTCTTAGCCTCGAGATTGAAGAGAGAAAGCGCCTT
AAGATTGAGTTGCTTAAGTTGACCATGAATGAGCAAAGAAGCCTCAACCAAAAATGTACGATTAGATGGCTTAAGGAAGATGATGAGAACACCAATTTCTTTCAT
AGATGGGCCATAGCTATGAAAAACCGGGCCTTCATCTCGGTCCTTGAAAAATCATCAACAAACACACTAAGGAGACATACATTTTGCTTGATTCCTAAGAAGAAA
AAAGCTGCCAAAGTTAGGGACTTTAGGCCGATTAGCCTAGTTACCTCTCCTTATAAAATAATAGCAAAGGCGTTGGCTGAGAGACTCAAGAAAGTGCTTCCACAC
GCTATCAGTGATTGCCAAGTTTCTTTTGTGCATGGGAGACAAATCCTAGATGCTATTTTAGTAGCTACCAAAGCTGTGGAAGACTTAAGGATTAGAAAAGATAAG
GGTGTACTGGTCAAGCTTGATTTAGAAAAGGCCTACAATATAGTAAATTGGGAGCTCCTCGATGCTATTCTCAATATGAAGGGCTTTGGCATTAAATGGAGGATG
TGGATAAGGGGCTGCCTCATAAACACTAATTTCTCGGTAATTTTAAATGCTAGACCAAGAGGGAAGCTTGTGGCCACTACAGGTTTAAGACAAGACGATCCACTC
TCCCCCTTTCTTTTTACAATAGTTGGGGACATTATTAGCAAATCTGTGCAATTCTGCTTAGAAAAGGGGATTTTGAAAGGTTGGCTAGTTGGGAAAAACAAGGTT
GTGGTTTCTGTTCAGCAATATGTTGATGATACTTTGATTTTTAGCCCCAATAACGAGTCAGATGTGGCCATATGGTGGGATATTCTCAAGCTTATAATGGAAGGG
TCAAGGCTGGTTGTTAATATGGCAAAGACTTCCATGATAGGGATCAACATGGATGGAAATGATGTGACTAACTGGGCCAAGTCCCATGGATGTCTAGCGGATTCA
CTCCCTGTTAACTATCTAGACCAATTTTCCAATTTGAATCTTCAAAAGAAACTCAAGTTCATCAATGATAGCAGAGCCACTGCAACTCTAGAAACAGAGACTGAG
TTCTCAAGAGATGCACGAGCCATCCGTGAGAGAGTTTTGAAGCAAGTTGAGGAGGCTTTGAAGGGTAAAGGGGCCAGAGTTCTGGGGGTGAGAAGTTGTATAAAG
GGCTCAATGCCTATGTGGATTACAAGGCTGGCTTCAGAAGGGAACACACAATTTCAAGTGAAAGAGCCTCTGCTCATATTAGAGCTTCAGAAAGATTTTATTATC
AGCCTGATATTTGCAAGGACTTCAAGGAGACTGGTTACTACTGTGGTTACGGAGATGCATGACCGAGTGGACTACGAGTCTCGGTGGCTGTCAGAGAAAGCCAGG
AAGAGTAAATTAGCTATGGGATCTGATGATAGGGATGAGGATGCTGCAGAACAGAGTGATGAAGATGATGAAGATGCCTTGCCCTTGGCTTGTTTTATTTGTGGT
GAGGCATTTGTGGATCCTGTTGTGACCAAAGTGCAAGCACTACTTCTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTGAAGCGTTGAGGACGACAAGGTTAAAGCAGCGGCATGACGGAGAGGAGCAATACGGCGAATTGAAGCTTCAGTGGTGGGGAACGGCGACTTTATTATCT
CATAGTGCAAATATGGGTCGAATTGCTTCTCATCCGGTTGGGAATCCTCAGCTTCATGTGAATCATTTACAATTCGTTGATGATACGTTATTATTCTCCATTTAT
TGTAAAGATGCACTGGTTAACTTGTTCGATATTATAAAAATTTTTGAGATGGTTTCTGGGTTGAATATTAACTATGCCAAGAGTGAGGTCTTGGGGATTCATTTA
GATGATTCAAAATTGGAGTGGTTGACAACAACTTTTGGATGCAAACAAGGGACTTGGCCTTCTACCTATCTTGGTTTACCTTTGGGTGGCAATGCAAAATCAATT
CATTTTTGGCCACCGGTGATAGAAAGAATTCAACATAAGCTTCATAGTTGGAAGTATGCTTTTATATTTCGAAAGGTGGTCGACATATTCTTACCCAAGTCGTTC
TCTCTAGGGCCAACCCCGTTTAGATTTGAGAACGTTTGGCTGGAGCACCCAGACTTTGAAGGAAAGTTTGAGCATTGGTGGCAATACGAAAACCCGTGTGGCTGG
GCAGGGTACAAAGTGATGGAAAAACTTAAGCACCTGAAGGTGCACATTAGAGAATGGAATAAAGACATTCTTGCAAAGTCTGTGAGTAAGAAAAAGGAAGTTTTG
AGCCAAATTGATCACAATTACTGGCTCGAAGAGCAGGGTAATATTCATAGCCTCGAGATTGAAGAGAGAAAGCGCCTTAAGATTGAGTTGCTTAAGTTGACCATG
AATGAGCAAAGAAGCCTCAACCAAAAATGTAAGATTAGATGGCTTAAGGAAGGTGATGAGAACACCAATTTCTTTCATAGATGGGCCATAGCTATGAAAAACCGG
GCCTTCATCTCGGTCCTTGAAAGTGAGAATGAGGATGCCCGTACTTCTGAAGCTGAGATTGAAATAAAGATTCTTAGCCTCGAGATTGAAGAGAGAAAGCGCCTT
AAGATTGAGTTGCTTAAGTTGACCATGAATGAGCAAAGAAGCCTCAACCAAAAATGTACGATTAGATGGCTTAAGGAAGATGATGAGAACACCAATTTCTTTCAT
AGATGGGCCATAGCTATGAAAAACCGGGCCTTCATCTCGGTCCTTGAAAAATCATCAACAAACACACTAAGGAGACATACATTTTGCTTGATTCCTAAGAAGAAA
AAAGCTGCCAAAGTTAGGGACTTTAGGCCGATTAGCCTAGTTACCTCTCCTTATAAAATAATAGCAAAGGCGTTGGCTGAGAGACTCAAGAAAGTGCTTCCACAC
GCTATCAGTGATTGCCAAGTTTCTTTTGTGCATGGGAGACAAATCCTAGATGCTATTTTAGTAGCTACCAAAGCTGTGGAAGACTTAAGGATTAGAAAAGATAAG
GGTGTACTGGTCAAGCTTGATTTAGAAAAGGCCTACAATATAGTAAATTGGGAGCTCCTCGATGCTATTCTCAATATGAAGGGCTTTGGCATTAAATGGAGGATG
TGGATAAGGGGCTGCCTCATAAACACTAATTTCTCGGTAATTTTAAATGCTAGACCAAGAGGGAAGCTTGTGGCCACTACAGGTTTAAGACAAGACGATCCACTC
TCCCCCTTTCTTTTTACAATAGTTGGGGACATTATTAGCAAATCTGTGCAATTCTGCTTAGAAAAGGGGATTTTGAAAGGTTGGCTAGTTGGGAAAAACAAGGTT
GTGGTTTCTGTTCAGCAATATGTTGATGATACTTTGATTTTTAGCCCCAATAACGAGTCAGATGTGGCCATATGGTGGGATATTCTCAAGCTTATAATGGAAGGG
TCAAGGCTGGTTGTTAATATGGCAAAGACTTCCATGATAGGGATCAACATGGATGGAAATGATGTGACTAACTGGGCCAAGTCCCATGGATGTCTAGCGGATTCA
CTCCCTGTTAACTATCTAGACCAATTTTCCAATTTGAATCTTCAAAAGAAACTCAAGTTCATCAATGATAGCAGAGCCACTGCAACTCTAGAAACAGAGACTGAG
TTCTCAAGAGATGCACGAGCCATCCGTGAGAGAGTTTTGAAGCAAGTTGAGGAGGCTTTGAAGGGTAAAGGGGCCAGAGTTCTGGGGGTGAGAAGTTGTATAAAG
GGCTCAATGCCTATGTGGATTACAAGGCTGGCTTCAGAAGGGAACACACAATTTCAAGTGAAAGAGCCTCTGCTCATATTAGAGCTTCAGAAAGATTTTATTATC
AGCCTGATATTTGCAAGGACTTCAAGGAGACTGGTTACTACTGTGGTTACGGAGATGCATGACCGAGTGGACTACGAGTCTCGGTGGCTGTCAGAGAAAGCCAGG
AAGAGTAAATTAGCTATGGGATCTGATGATAGGGATGAGGATGCTGCAGAACAGAGTGATGAAGATGATGAAGATGCCTTGCCCTTGGCTTGTTTTATTTGTGGT
GAGGCATTTGTGGATCCTGTTGTGACCAAAGTGCAAGCACTACTTCTGTGA
Protein sequenceShow/hide protein sequence
MIEALRTTRLKQRHDGEEQYGELKLQWWGTATLLSHSANMGRIASHPVGNPQLHVNHLQFVDDTLLFSIYCKDALVNLFDIIKIFEMVSGLNINYAKSEVLGIHL
DDSKLEWLTTTFGCKQGTWPSTYLGLPLGGNAKSIHFWPPVIERIQHKLHSWKYAFIFRKVVDIFLPKSFSLGPTPFRFENVWLEHPDFEGKFEHWWQYENPCGW
AGYKVMEKLKHLKVHIREWNKDILAKSVSKKKEVLSQIDHNYWLEEQGNIHSLEIEERKRLKIELLKLTMNEQRSLNQKCKIRWLKEGDENTNFFHRWAIAMKNR
AFISVLESENEDARTSEAEIEIKILSLEIEERKRLKIELLKLTMNEQRSLNQKCTIRWLKEDDENTNFFHRWAIAMKNRAFISVLEKSSTNTLRRHTFCLIPKKK
KAAKVRDFRPISLVTSPYKIIAKALAERLKKVLPHAISDCQVSFVHGRQILDAILVATKAVEDLRIRKDKGVLVKLDLEKAYNIVNWELLDAILNMKGFGIKWRM
WIRGCLINTNFSVILNARPRGKLVATTGLRQDDPLSPFLFTIVGDIISKSVQFCLEKGILKGWLVGKNKVVVSVQQYVDDTLIFSPNNESDVAIWWDILKLIMEG
SRLVVNMAKTSMIGINMDGNDVTNWAKSHGCLADSLPVNYLDQFSNLNLQKKLKFINDSRATATLETETEFSRDARAIRERVLKQVEEALKGKGARVLGVRSCIK
GSMPMWITRLASEGNTQFQVKEPLLILELQKDFIISLIFARTSRRLVTTVVTEMHDRVDYESRWLSEKARKSKLAMGSDDRDEDAAEQSDEDDEDALPLACFICG
EAFVDPVVTKVQALLL