; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037471 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037471
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr2:6582213..6584232
RNA-Seq ExpressionLag0037471
SyntenyLag0037471
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69274.1 TatD related DNase [Prunus dulcis]1.3e-9133.04Show/hide
Query:  VLPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPS
        VL +TI++ Q AFV  RQILDA L+ANE++++     +KG++ K+D EKA+D V+W F+D ++  KGFG  WR WI+GC+ S N+SI+ING+PRGK   S
Subjt:  VLPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPS

Query:  RGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS-----------
        RG+RQGDPLSPFLF LVSD LSR++  +  +  +     G+ Q+ V+ LQFAD T+ F    ++  +N+ +++K+F   SG+ IN +KS           
Subjt:  RGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPS-IIQRSSHKSPWRLISSTMDLVYSRAKRSLG
                                     RN AL AKW+WRF  E N+LWH++I +KY     S+ W +  I + S ++PWR IS          + S+G
Subjt:  ---------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPS-IIQRSSHKSPWRLISSTMDLVYSRAKRSLG

Query:  NGLATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQ----TAWDLSLRRNLNDLETEEWVELSLILSSISL-QNRNDSWSWPLESSNIFSVK
        NG    FW D WL  GIL   FPRLY L+ R    +   W A+       WD   RRNL++ E  E V L  IL ++ L  +R D  SW +E    FS K
Subjt:  NGLATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQ----TAWDLSLRRNLNDLETEEWVELSLILSSISL-QNRNDSWSWPLESSNIFSVK

Query:  SLMKDLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN
        S    L+    +    Y+ IW    P KI+ F+W  ++G INT + +QRR P   LSPSWC+ C   +E+  HLF+HCS++ + W  +LDA G   V P 
Subjt:  SLMKDLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN

Query:  CIKDVLTLIFVSHPFHGEKK---ILWLALNRVFFWFLWGERNSRIFR-DSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS
          K    L+ ++    G+ K   IL   L    FW +W ERN RIF+  S    ++  D I F A  W      F DY  S
Subjt:  CIKDVLTLIFVSHPFHGEKK---ILWLALNRVFFWFLWGERNSRIFR-DSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS

KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.1e-9131.86Show/hide
Query:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR
        LP T+AENQMAFV  RQI+DA L+ANE ID W + + +G +IKLD+EKAFDK++W F+D +L+ KG+   WR WI  CISSV YSIIINGRPRGKI PSR
Subjt:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR

Query:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------
        GIRQGDP+SPF+F+L  D +SRLL+    +G  +        +++  L FAD  LLF    + ++ N+  II +F+LASGL+IN +KS            
Subjt:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG
                                  ++ N ALL KW+WR++HED+ LW K+I AKY + +   + P +   SS +SPW  I   ++         + NG
Subjt:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG

Query:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMK---
         + SFWH  W     L++++PRLY L+    S + + W  +   WDL+ RR L + E   W EL   L++   +N NDS  W L S+ +++V S+ K   
Subjt:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMK---

Query:  ----DLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN
            +L+D+     N +K +W  S PKK   F+W L + ++NTA +L +R+P+    PSWC+MC    E   HLF+ C  A   W  I      +    N
Subjt:  ----DLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN

Query:  CIK-DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS
        C+    L +   S     +K ++         W +W ERN+RIF     +  +  + I   A  W      FS+Y  S
Subjt:  CIK-DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.1e-9132.14Show/hide
Query:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR
        LP TI+E+QMAFV  RQI +A LIANE +D W   +++G +IKLD+EKAFDK++W F+D +L+ K +   WRK I  CISSV YSI+INGRPRG+I PSR
Subjt:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR

Query:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------
        GIRQGDPLSPF+F+L  D LSRLL++     +I      +  L++  + FAD  L+F     D + N+  I+ +FE ASGLNIN SKS            
Subjt:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG
                                   + N ALL KW+W+FL E + LW +LI++KY    + S +PS  + SS+ SPW+ ++  +   Y      + +G
Subjt:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG

Query:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMKDLV
           SFW D+W     L+   PRL+ L+   +  V E W  S   W L + R L D E   W  +   L +      +    W L S+NIF   S+ + + 
Subjt:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMKDLV

Query:  DYLV----IEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCIK
        +  +       NLYK +W   +PKK K F+W L HG INTA+RLQ+R+P++ LSP+WC MC    E   HLF+HC ++ + WS+      W++  P  ++
Subjt:  DYLV----IEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCIK

Query:  DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSS-FDKFMDIILFHALYWCKCHHPFSDY
         ++  I  S     +K ++    N    W +W ERN+RIF+    +  D + D +    L+ CK    FS+Y
Subjt:  DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSS-FDKFMDIILFHALYWCKCHHPFSDY

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.4e-9032.44Show/hide
Query:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR
        LP TI+E QMAFV  RQI +A LIANE +D W   +++G +IKLD+EKAFDK++W F+D +L+ K +   WR  I  CISSV YSI+INGRPRG+I P+R
Subjt:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR

Query:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------
        GIRQGDPLSPF+F+L  D LS LL +    G+I     G   L++  + FAD  L+F    +D + N+  I+ +FE ASGLNIN SKS            
Subjt:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG
                                   + N ALL KW+W+FL E   LW +LI++KY    +   +PS  + SS+ SPW+ +++ +   Y      + +G
Subjt:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG

Query:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWS-WPLESSNIFSVKSLMKDL
           SFW D+W     L+   PRL+ L+   +  V + W  S   W++ + R L D E   W  +   L +  L +R  S   W L S+NIF   S+ KDL
Subjt:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWS-WPLESSNIFSVKSLMKDL

Query:  VDYLVIEDN----LYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCI
         +      N    LYK +W   +PKK K F+W L HG INTA+RLQ+R+P++ LSP+WC MC    E   HLF+HC ++ + WS+      W++  PN +
Subjt:  VDYLVIEDN----LYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCI

Query:  KDVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDY
        K +   I  S     +K ++      +  W +W ERN+RIF+     F    + IL     W      FS+Y
Subjt:  KDVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDY

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]2.1e-9131.86Show/hide
Query:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR
        LP T+AENQMAFV  RQI+DA L+ANE ID W + + +G +IKLD+EKAFDK++W F+D +L+ KG+   WR WI  CISSV YSIIINGRPRGKI PSR
Subjt:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR

Query:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------
        GIRQGDP+SPF+F+L  D +SRLL+    +G  +        +++  L FAD  LLF    + ++ N+  II +F+LASGL+IN +KS            
Subjt:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG
                                  ++ N ALL KW+WR++HED+ LW K+I AKY + +   + P +   SS +SPW  I   ++         + NG
Subjt:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG

Query:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMK---
         + SFWH  W     L++++PRLY L+    S + + W  +   WDL+ RR L + E   W EL   L++   +N NDS  W L S+ +++V S+ K   
Subjt:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMK---

Query:  ----DLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN
            +L+D+     N +K +W  S PKK   F+W L + ++NTA +L +R+P+    PSWC+MC    E   HLF+ C  A   W  I      +    N
Subjt:  ----DLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN

Query:  CIK-DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS
        C+    L +   S     +K ++         W +W ERN+RIF     +  +  + I   A  W      FS+Y  S
Subjt:  CIK-DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein1.0e-9131.86Show/hide
Query:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR
        LP T+AENQMAFV  RQI+DA L+ANE ID W + + +G +IKLD+EKAFDK++W F+D +L+ KG+   WR WI  CISSV YSIIINGRPRGKI PSR
Subjt:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR

Query:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------
        GIRQGDP+SPF+F+L  D +SRLL+    +G  +        +++  L FAD  LLF    + ++ N+  II +F+LASGL+IN +KS            
Subjt:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG
                                  ++ N ALL KW+WR++HED+ LW K+I AKY + +   + P +   SS +SPW  I   ++         + NG
Subjt:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG

Query:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMK---
         + SFWH  W     L++++PRLY L+    S + + W  +   WDL+ RR L + E   W EL   L++   +N NDS  W L S+ +++V S+ K   
Subjt:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMK---

Query:  ----DLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN
            +L+D+     N +K +W  S PKK   F+W L + ++NTA +L +R+P+    PSWC+MC    E   HLF+ C  A   W  I      +    N
Subjt:  ----DLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN

Query:  CIK-DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS
        C+    L +   S     +K ++         W +W ERN+RIF     +  +  + I   A  W      FS+Y  S
Subjt:  CIK-DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein1.0e-9132.14Show/hide
Query:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR
        LP TI+E+QMAFV  RQI +A LIANE +D W   +++G +IKLD+EKAFDK++W F+D +L+ K +   WRK I  CISSV YSI+INGRPRG+I PSR
Subjt:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR

Query:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------
        GIRQGDPLSPF+F+L  D LSRLL++     +I      +  L++  + FAD  L+F     D + N+  I+ +FE ASGLNIN SKS            
Subjt:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG
                                   + N ALL KW+W+FL E + LW +LI++KY    + S +PS  + SS+ SPW+ ++  +   Y      + +G
Subjt:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG

Query:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMKDLV
           SFW D+W     L+   PRL+ L+   +  V E W  S   W L + R L D E   W  +   L +      +    W L S+NIF   S+ + + 
Subjt:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMKDLV

Query:  DYLV----IEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCIK
        +  +       NLYK +W   +PKK K F+W L HG INTA+RLQ+R+P++ LSP+WC MC    E   HLF+HC ++ + WS+      W++  P  ++
Subjt:  DYLV----IEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCIK

Query:  DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSS-FDKFMDIILFHALYWCKCHHPFSDY
         ++  I  S     +K ++    N    W +W ERN+RIF+    +  D + D +    L+ CK    FS+Y
Subjt:  DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSS-FDKFMDIILFHALYWCKCHHPFSDY

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein6.7e-9132.44Show/hide
Query:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR
        LP TI+E QMAFV  RQI +A LIANE +D W   +++G +IKLD+EKAFDK++W F+D +L+ K +   WR  I  CISSV YSI+INGRPRG+I P+R
Subjt:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR

Query:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------
        GIRQGDPLSPF+F+L  D LS LL +    G+I     G   L++  + FAD  L+F    +D + N+  I+ +FE ASGLNIN SKS            
Subjt:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG
                                   + N ALL KW+W+FL E   LW +LI++KY    +   +PS  + SS+ SPW+ +++ +   Y      + +G
Subjt:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG

Query:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWS-WPLESSNIFSVKSLMKDL
           SFW D+W     L+   PRL+ L+   +  V + W  S   W++ + R L D E   W  +   L +  L +R  S   W L S+NIF   S+ KDL
Subjt:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWS-WPLESSNIFSVKSLMKDL

Query:  VDYLVIEDN----LYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCI
         +      N    LYK +W   +PKK K F+W L HG INTA+RLQ+R+P++ LSP+WC MC    E   HLF+HC ++ + WS+      W++  PN +
Subjt:  VDYLVIEDN----LYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCI

Query:  KDVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDY
        K +   I  S     +K ++      +  W +W ERN+RIF+     F    + IL     W      FS+Y
Subjt:  KDVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDY

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein1.0e-9131.86Show/hide
Query:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR
        LP T+AENQMAFV  RQI+DA L+ANE ID W + + +G +IKLD+EKAFDK++W F+D +L+ KG+   WR WI  CISSV YSIIINGRPRGKI PSR
Subjt:  LPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR

Query:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------
        GIRQGDP+SPF+F+L  D +SRLL+    +G  +        +++  L FAD  LLF    + ++ N+  II +F+LASGL+IN +KS            
Subjt:  GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG
                                  ++ N ALL KW+WR++HED+ LW K+I AKY + +   + P +   SS +SPW  I   ++         + NG
Subjt:  --------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNG

Query:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMK---
         + SFWH  W     L++++PRLY L+    S + + W  +   WDL+ RR L + E   W EL   L++   +N NDS  W L S+ +++V S+ K   
Subjt:  LATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMK---

Query:  ----DLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN
            +L+D+     N +K +W  S PKK   F+W L + ++NTA +L +R+P+    PSWC+MC    E   HLF+ C  A   W  I      +    N
Subjt:  ----DLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN

Query:  CIK-DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS
        C+    L +   S     +K ++         W +W ERN+RIF     +  +  + I   A  W      FS+Y  S
Subjt:  CIK-DVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS

A0A5H2XQW2 TatD related DNase6.1e-9233.04Show/hide
Query:  VLPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPS
        VL +TI++ Q AFV  RQILDA L+ANE++++     +KG++ K+D EKA+D V+W F+D ++  KGFG  WR WI+GC+ S N+SI+ING+PRGK   S
Subjt:  VLPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPS

Query:  RGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS-----------
        RG+RQGDPLSPFLF LVSD LSR++  +  +  +     G+ Q+ V+ LQFAD T+ F    ++  +N+ +++K+F   SG+ IN +KS           
Subjt:  RGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPS-IIQRSSHKSPWRLISSTMDLVYSRAKRSLG
                                     RN AL AKW+WRF  E N+LWH++I +KY     S+ W +  I + S ++PWR IS          + S+G
Subjt:  ---------------------------ENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPS-IIQRSSHKSPWRLISSTMDLVYSRAKRSLG

Query:  NGLATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQ----TAWDLSLRRNLNDLETEEWVELSLILSSISL-QNRNDSWSWPLESSNIFSVK
        NG    FW D WL  GIL   FPRLY L+ R    +   W A+       WD   RRNL++ E  E V L  IL ++ L  +R D  SW +E    FS K
Subjt:  NGLATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQ----TAWDLSLRRNLNDLETEEWVELSLILSSISL-QNRNDSWSWPLESSNIFSVK

Query:  SLMKDLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN
        S    L+    +    Y+ IW    P KI+ F+W  ++G INT + +QRR P   LSPSWC+ C   +E+  HLF+HCS++ + W  +LDA G   V P 
Subjt:  SLMKDLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPN

Query:  CIKDVLTLIFVSHPFHGEKK---ILWLALNRVFFWFLWGERNSRIFR-DSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS
          K    L+ ++    G+ K   IL   L    FW +W ERN RIF+  S    ++  D I F A  W      F DY  S
Subjt:  CIKDVLTLIFVSHPFHGEKK---ILWLALNRVFFWFLWGERNSRIFR-DSFSSFDKFMDIILFHALYWCKCHHPFSDYSLS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.1e-0829.41Show/hide
Query:  KKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSH
        K  ++I +D EKAFDK+   F+   L   G   ++ K I         +II+NG+         G RQG PLSP LF +V + L+R +        I   
Subjt:  KKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSH

Query:  PIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKSE
         +G  ++ ++   FAD  +++      +  N+ ++I  F   SG  IN  KS+
Subjt:  PIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKSE

P08548 LINE-1 reverse transcriptase homolog1.3e-0932.05Show/hide
Query:  LSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR-GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGR
        L  K  M++ +D EKAFD +   F+   L   G    + K I    S    +II+NG  + K  P R G RQG PLSP LF +V + L+  +     +  
Subjt:  LSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSR-GIRQGDPLSPFLFILVSDCLSRLLSHSTYMGR

Query:  IVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS
        I    IG+ ++ ++   FAD  +++    +D+   + E+IK +   SG  IN  KS
Subjt:  IVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS

P0C2F6 Putative ribonuclease H protein At1g657504.7e-1726.35Show/hide
Query:  GLNINYSKSENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNL-SSVWPSIIQRSSHKSPWRLIS-STMDLVYSRAKRSLGNGLATSFWHDSWLSCGI
        GL +  +KS NR  AL++K  WR L E N+LW  ++  KY+   +  S W  +I + S  S WR I+    D+V        G+G    FW D W+S G 
Subjt:  GLNINYSKSENRNLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNL-SSVWPSIIQRSSHKSPWRLIS-STMDLVYSRAKRSLGNGLATSFWHDSWLSCGI

Query:  LATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMKDLVDYLVIEDNL---YKI
                 R T+    +  + WI  +  WD +         T   +EL  ++  + +    D  SW       FSV+S  + L    V   N+   +  
Subjt:  LATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSLILSSISLQNRNDSWSWPLESSNIFSVKSLMKDLVDYLVIEDNL---YKI

Query:  IWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPS-WCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCIKDVLTLIFVSHPFHGE
        +W    P+++K FLW + + A+ T     RR    HLS S  C +C  G E   H+   C      W  ++        F   + + L          G 
Subjt:  IWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPS-WCIMCAAGSEHSGHLFVHCSFASRYWSEILDAFGWSTVFPNCIKDVLTLIFVSHPFHGE

Query:  KKILWLALNRVFFWFLWGERNSRIFRDSFSSFDK
        + I W  +  V  W+ W  R   IF ++    D+
Subjt:  KKILWLALNRVFFWFLWGERNSRIFRDSFSSFDK

P11369 LINE-1 retrotransposable element ORF2 protein2.1e-0930.97Show/hide
Query:  LSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRI
        L  K  M+I LD EKAFDK+   F+  +L   G    +   I    S    +I +NG     I    G RQG PLSP+LF +V + L+R +        I
Subjt:  LSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRI

Query:  VSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS
            IG  ++ ++ L  AD  +++    K++   +  +I  F    G  IN +KS
Subjt:  VSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKS

P92555 Uncharacterized mitochondrial protein AtMg012501.7e-1150Show/hide
Query:  IINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYT
        IING P+G + PSRG+RQGDPLSP+LFIL ++ LS L   +   GR+    + N+   +N L FAD T
Subjt:  IINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYT

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.0e-0735.53Show/hide
Query:  VLPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKG----MMIKLDLEKAFDKVDWDFLDAILLAKGFGTVW
        ++ + I   Q +F+  R   D  +   E +   ++ RKKG    M++KLDLEKA+D++ WD+L+  L++ GF  VW
Subjt:  VLPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKG----MMIKLDLEKAFDKVDWDFLDAILLAKGFGTVW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.1e-0432.93Show/hide
Query:  NLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNGLATSFWHDSWL
        N ALLAK  +R +H+ +TL  +L+ ++Y+  + S +  S+  R S+   WR I    +L+     R++G+G+ T  W D W+
Subjt:  NLALLAKWIWRFLHEDNTLWHKLIVAKYYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNGLATSFWHDSWL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.2e-1250Show/hide
Query:  IINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYT
        IING P+G + PSRG+RQGDPLSP+LFIL ++ LS L   +   GR+    + N+   +N L FAD T
Subjt:  IINGRPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATTGCCATCCACTATTGCGGAAAATCAAATGGCTTTTGTGGCTAACAGACAAATTCTAGATGCTTCACTTATAGCAAACGAGTTGATTGACGATTGGAATTTATC
TCGTAAGAAAGGTATGATGATTAAATTAGATCTTGAAAAGGCTTTTGATAAAGTCGATTGGGATTTTCTGGATGCAATTCTTCTAGCCAAGGGTTTTGGTACGGTTTGGA
GAAAATGGATTTATGGCTGCATTTCTAGTGTTAACTACTCTATTATTATCAATGGAAGACCACGAGGCAAGATTATTCCCTCTCGAGGCATTCGTCAAGGGGATCCCCTT
TCTCCTTTCCTTTTCATCTTGGTGTCTGATTGCCTAAGTCGCTTATTATCTCACAGTACTTATATGGGTCGAATTGTTTCTCATCCGATAGGGAATTCACAACTTCACGT
GAATCGTTTACAATTTGCTGATTATACATTATTATTCTCCATATTTCATAAGGATGCATTGGTTAACATGTTTGAAATCATTAAAATTTTTGAGCTGGCTTCTGGGCTGA
ATATTAACTATTCCAAGAGTGAGAATCGCAATCTAGCTCTTCTAGCAAAGTGGATTTGGAGATTTTTACATGAGGATAACACTCTATGGCATAAACTGATTGTAGCTAAG
TATTATAACTCTAATTTGAGTAGTGTTTGGCCTAGCATTATTCAGAGAAGTTCACACAAATCTCCTTGGCGATTAATTTCTTCTACTATGGACCTGGTATATTCTCGCGC
TAAAAGAAGTTTGGGTAATGGTCTCGCTACATCTTTCTGGCATGATTCATGGTTAAGTTGTGGTATTCTGGCTACAAATTTTCCTCGTCTTTATCGTTTAACTAATCGTC
CGAGGAGTTTGGTTGGTGAGACATGGATTGCTTCTCAAACAGCATGGGACCTGAGTCTTAGGCGAAATTTGAATGATTTAGAGACAGAAGAATGGGTGGAATTATCTCTT
ATTCTTTCCTCCATCAGCCTTCAGAACCGTAATGATTCCTGGTCATGGCCATTGGAATCGTCCAATATTTTTTCTGTTAAATCCCTTATGAAGGATCTTGTAGACTATCT
AGTTATTGAGGATAATCTCTATAAGATAATTTGGGCCGATTCCTATCCAAAGAAGATCAAGATTTTTCTATGGGAGCTTAGTCATGGTGCTATTAATACAGCTAATCGAC
TTCAACGTCGAATGCCTCATTTTCATTTATCCCCATCTTGGTGCATTATGTGTGCCGCTGGTTCAGAACATTCTGGCCACCTATTTGTTCACTGTTCCTTCGCTTCCAGA
TATTGGTCAGAGATTCTTGATGCTTTTGGATGGTCCACCGTTTTTCCAAATTGCATTAAGGATGTTCTCACTCTCATTTTTGTGAGTCATCCCTTCCATGGAGAAAAGAA
GATCTTATGGCTTGCTTTGAACAGAGTCTTCTTTTGGTTTTTATGGGGTGAGAGAAATTCTCGAATTTTCAGAGATTCTTTCTCATCCTTTGATAAATTCATGGATATAA
TTCTCTTTCATGCTTTATACTGGTGTAAATGTCATCACCCTTTTTCTGATTATAGTTTATCTTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATTGCCATCCACTATTGCGGAAAATCAAATGGCTTTTGTGGCTAACAGACAAATTCTAGATGCTTCACTTATAGCAAACGAGTTGATTGACGATTGGAATTTATC
TCGTAAGAAAGGTATGATGATTAAATTAGATCTTGAAAAGGCTTTTGATAAAGTCGATTGGGATTTTCTGGATGCAATTCTTCTAGCCAAGGGTTTTGGTACGGTTTGGA
GAAAATGGATTTATGGCTGCATTTCTAGTGTTAACTACTCTATTATTATCAATGGAAGACCACGAGGCAAGATTATTCCCTCTCGAGGCATTCGTCAAGGGGATCCCCTT
TCTCCTTTCCTTTTCATCTTGGTGTCTGATTGCCTAAGTCGCTTATTATCTCACAGTACTTATATGGGTCGAATTGTTTCTCATCCGATAGGGAATTCACAACTTCACGT
GAATCGTTTACAATTTGCTGATTATACATTATTATTCTCCATATTTCATAAGGATGCATTGGTTAACATGTTTGAAATCATTAAAATTTTTGAGCTGGCTTCTGGGCTGA
ATATTAACTATTCCAAGAGTGAGAATCGCAATCTAGCTCTTCTAGCAAAGTGGATTTGGAGATTTTTACATGAGGATAACACTCTATGGCATAAACTGATTGTAGCTAAG
TATTATAACTCTAATTTGAGTAGTGTTTGGCCTAGCATTATTCAGAGAAGTTCACACAAATCTCCTTGGCGATTAATTTCTTCTACTATGGACCTGGTATATTCTCGCGC
TAAAAGAAGTTTGGGTAATGGTCTCGCTACATCTTTCTGGCATGATTCATGGTTAAGTTGTGGTATTCTGGCTACAAATTTTCCTCGTCTTTATCGTTTAACTAATCGTC
CGAGGAGTTTGGTTGGTGAGACATGGATTGCTTCTCAAACAGCATGGGACCTGAGTCTTAGGCGAAATTTGAATGATTTAGAGACAGAAGAATGGGTGGAATTATCTCTT
ATTCTTTCCTCCATCAGCCTTCAGAACCGTAATGATTCCTGGTCATGGCCATTGGAATCGTCCAATATTTTTTCTGTTAAATCCCTTATGAAGGATCTTGTAGACTATCT
AGTTATTGAGGATAATCTCTATAAGATAATTTGGGCCGATTCCTATCCAAAGAAGATCAAGATTTTTCTATGGGAGCTTAGTCATGGTGCTATTAATACAGCTAATCGAC
TTCAACGTCGAATGCCTCATTTTCATTTATCCCCATCTTGGTGCATTATGTGTGCCGCTGGTTCAGAACATTCTGGCCACCTATTTGTTCACTGTTCCTTCGCTTCCAGA
TATTGGTCAGAGATTCTTGATGCTTTTGGATGGTCCACCGTTTTTCCAAATTGCATTAAGGATGTTCTCACTCTCATTTTTGTGAGTCATCCCTTCCATGGAGAAAAGAA
GATCTTATGGCTTGCTTTGAACAGAGTCTTCTTTTGGTTTTTATGGGGTGAGAGAAATTCTCGAATTTTCAGAGATTCTTTCTCATCCTTTGATAAATTCATGGATATAA
TTCTCTTTCATGCTTTATACTGGTGTAAATGTCATCACCCTTTTTCTGATTATAGTTTATCTTTTTGA
Protein sequenceShow/hide protein sequence
MVLPSTIAENQMAFVANRQILDASLIANELIDDWNLSRKKGMMIKLDLEKAFDKVDWDFLDAILLAKGFGTVWRKWIYGCISSVNYSIIINGRPRGKIIPSRGIRQGDPL
SPFLFILVSDCLSRLLSHSTYMGRIVSHPIGNSQLHVNRLQFADYTLLFSIFHKDALVNMFEIIKIFELASGLNINYSKSENRNLALLAKWIWRFLHEDNTLWHKLIVAK
YYNSNLSSVWPSIIQRSSHKSPWRLISSTMDLVYSRAKRSLGNGLATSFWHDSWLSCGILATNFPRLYRLTNRPRSLVGETWIASQTAWDLSLRRNLNDLETEEWVELSL
ILSSISLQNRNDSWSWPLESSNIFSVKSLMKDLVDYLVIEDNLYKIIWADSYPKKIKIFLWELSHGAINTANRLQRRMPHFHLSPSWCIMCAAGSEHSGHLFVHCSFASR
YWSEILDAFGWSTVFPNCIKDVLTLIFVSHPFHGEKKILWLALNRVFFWFLWGERNSRIFRDSFSSFDKFMDIILFHALYWCKCHHPFSDYSLSF