CuGenDBv2

Lag0019045 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019045
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr5:37967302..37968619
RNA-Seq Expression: Lag0019045
Synteny: Lag0019045
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
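The genome location field uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and reports the span length. The function names are hypothetical, and the assumption that the coordinates are 1-based and inclusive is mine, not something this page states.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr5:37967302..37968619")
print(chrom, span_length(start, end))  # prints: chr5 1318
```

Under that inclusive-coordinate assumption the gene spans 1318 bp on chr5.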


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value 1.1e-98; identity 45.79%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        F    +IN+ VNET I LI KK      +DFRPISLT+ +Y+LIAKTL +RLK TL  TI+E+Q+AFVKGRQIT+AIL+ANE +DFW+S + RG +IKLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFLIFSLQM-------------EFSCFLNM-
        IEKAFDK+NW FID VL+ K +S  W   I +CI SV YSIL+NG+PR R+      +     +PF+  L + +  ++             +FS  LN+ 
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFLIFSLQM-------------EFSCFLNM-

Query:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN
                          + NL  I+ +FE +SGLNIN SKS+I  IN+ + R   +A  WG     LP SYLG  LGG PSS++FW  +++KI  KL N
Subjt:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN

Query:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES
        W++  +SKGGR+TLI +TL S+P Y +S+FK P+ +  KI+   R+FLW+G     N+ L+ WN + +PKE GGLG      +N AL  KWLW++  E+ 
Subjt:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES

Query:  PLWKRFISAKYSAPRPGALPNQARFSSS
        PLWKR I +KY   + G+ P+  +FSS+
Subjt:  PLWKRFISAKYSAPRPGALPNQARFSSS

KAA0045262.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value 7.4e-100; identity 50.93%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        FF+KGVIN+N+N TYIALI KK    Q  DFRPISLT+ +Y++IAKTL  RLK TL  TI+ NQLAF+K RQITDAIL+ANE +D+WK  + +G I+KLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKP--RVRLFRRSRAKASLRDAPFLQYLFL---IFSLQMEFSCFLNMLENLFNIIK
        IEKAFD +NWDFID VL  K F N+W  WI  CI +V+YS+++NG+P  R++  R  R   SL +   + ++     I     +  CFLN   NL   + 
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKP--RVRLFRRSRAKASLRDAPFLQYLFL---IFSLQMEFSCFLNMLENLFNIIK

Query:  VFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYML
        +FE +SGL IN SKS++  +N+   R  + AS WG     LP++YLG  LGGNP S+ FW  + +KI  KL NW++  ISKGGRLTLI+ TL+S+P Y L
Subjt:  VFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYML

Query:  SIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEESPLWK
        S+F+AP S    I+K  R+FLW G + S    L+NW  V  PKE GGLG  + +++N AL  KWLWRYY E + LW+
Subjt:  SIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEESPLWK

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value 2.1e-99; identity 47.66%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        F + G++N NVN T+IALI KK K  + SD+RPISLT+ LY+L+AK L  RLK  L  TIAENQ+AF+KGRQI DAIL+ANE ID WK  + +G ++KLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS--------LQMEFSCFLNM-
        +EKAFDKI+W FID +L  K F + W  WI ACI +V YSILLNG P+ R+      +     +PF     + YL  + S          + F+ + N+ 
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS--------LQMEFSCFLNM-

Query:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN
                          L NL   + +FE +SGL  N SKS+IS IN+ + R  Q+AS +G     LP++YLG  LGGNP S SFW+  IE IH KL  
Subjt:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN

Query:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES
        W++  ISKGGRLTL++A+L+S+PTY LS FKAP SV+ +I+K  R FLW G +   N  L+NWN+  +PKE GGLG  K + +N AL  KWLWRY+ E +
Subjt:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES

Query:  PLWKRFISAKYSAPRPGALPNQARFSSS
         LWK+ I AKY+    G +P   R SS+
Subjt:  PLWKRFISAKYSAPRPGALPNQARFSSS

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value 1.4e-98; identity 47.43%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        F + G++N NVN T+IALI KK K  + SD+RPISLT+ LY+++AK L  RLK  L  TIAENQ+AF+KGRQI DAIL+ANE ID WK  + +G ++KLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS----------LQMEFSCFLN
        IEKAFDKI+W FID +L  K F + W  WI ACI +V YSILLNG P+ R+      +     +PF     + YL  + S          +     C ++
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS----------LQMEFSCFLN

Query:  -----------------MLENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN
                          L NL   + +FE +SGL  N SKS+IS IN+ + R  Q+AS +G     LP++YLG  LGGNP S SFW   IE IH KL  
Subjt:  -----------------MLENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN

Query:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES
        W++  ISKGGRLTL++A+L+S+PTY LS FKAP SV+ +I+K  R FLW G +   N  L+NWN+  +PKE GGLG  K + +N AL  KWLWRY+ E +
Subjt:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES

Query:  PLWKRFISAKYSAPRPGALPNQARFSSS
         LWK+ I AKY+    G +P   R SS+
Subjt:  PLWKRFISAKYSAPRPGALPNQARFSSS

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value 1.3e-99; identity 47.9%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        F + G++N NVN T+IALI KK K  + SD+RPISLT+ LY+++AK L  RLK  L  TIAENQ+AF+KGRQI DAIL+ANEVID WK  + +G ++KLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS--------LQMEFSCFLNM-
        IEKAFDKI+W FID +L  K F + W  WI ACI +V YSILLNG P+ R+      +     +PF     + YL  + S          + F+ + N+ 
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS--------LQMEFSCFLNM-

Query:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN
                          L NL   + +FE +SGL  N SKS+IS IN+ + R  Q+AS +G     LP++YLG  LGGNP S SFW   IE IH KL  
Subjt:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN

Query:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES
        W++  ISKGGRLTL++A+L+S+PTY LS FKAP SV+ +I+K  R FLW G +   N  L+NWN+  +PKE GGLG  K + +N AL  KWLWRY+ E +
Subjt:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES

Query:  PLWKRFISAKYSAPRPGALPNQARFSSS
         LWK+ I AKY+    G +P   R SS+
Subjt:  PLWKRFISAKYSAPRPGALPNQARFSSS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein; e value 5.2e-99; identity 45.79%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        F    +IN+ VNET I LI KK      +DFRPISLT+ +Y+LIAKTL +RLK TL  TI+E+Q+AFVKGRQIT+AIL+ANE +DFW+S + RG +IKLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFLIFSLQM-------------EFSCFLNM-
        IEKAFDK+NW FID VL+ K +S  W   I +CI SV YSIL+NG+PR R+      +     +PF+  L + +  ++             +FS  LN+ 
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFLIFSLQM-------------EFSCFLNM-

Query:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN
                          + NL  I+ +FE +SGLNIN SKS+I  IN+ + R   +A  WG     LP SYLG  LGG PSS++FW  +++KI  KL N
Subjt:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN

Query:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES
        W++  +SKGGR+TLI +TL S+P Y +S+FK P+ +  KI+   R+FLW+G     N+ L+ WN + +PKE GGLG      +N AL  KWLW++  E+ 
Subjt:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES

Query:  PLWKRFISAKYSAPRPGALPNQARFSSS
        PLWKR I +KY   + G+ P+  +FSS+
Subjt:  PLWKRFISAKYSAPRPGALPNQARFSSS

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein; e value 6.7e-99; identity 47.43%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        F + G++N NVN T+IALI KK K  + SD+RPISLT+ LY+++AK L  RLK  L  TIAENQ+AF+KGRQI DAIL+ANE ID WK  + +G ++KLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS----------LQMEFSCFLN
        IEKAFDKI+W FID +L  K F + W  WI ACI +V YSILLNG P+ R+      +     +PF     + YL  + S          +     C ++
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS----------LQMEFSCFLN

Query:  -----------------MLENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN
                          L NL   + +FE +SGL  N SKS+IS IN+ + R  Q+AS +G     LP++YLG  LGGNP S SFW   IE IH KL  
Subjt:  -----------------MLENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN

Query:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES
        W++  ISKGGRLTL++A+L+S+PTY LS FKAP SV+ +I+K  R FLW G +   N  L+NWN+  +PKE GGLG  K + +N AL  KWLWRY+ E +
Subjt:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES

Query:  PLWKRFISAKYSAPRPGALPNQARFSSS
         LWK+ I AKY+    G +P   R SS+
Subjt:  PLWKRFISAKYSAPRPGALPNQARFSSS

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein; e value 1.0e-99; identity 47.66%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        F + G++N NVN T+IALI KK K  + SD+RPISLT+ LY+L+AK L  RLK  L  TIAENQ+AF+KGRQI DAIL+ANE ID WK  + +G ++KLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS--------LQMEFSCFLNM-
        +EKAFDKI+W FID +L  K F + W  WI ACI +V YSILLNG P+ R+      +     +PF     + YL  + S          + F+ + N+ 
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS--------LQMEFSCFLNM-

Query:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN
                          L NL   + +FE +SGL  N SKS+IS IN+ + R  Q+AS +G     LP++YLG  LGGNP S SFW+  IE IH KL  
Subjt:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN

Query:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES
        W++  ISKGGRLTL++A+L+S+PTY LS FKAP SV+ +I+K  R FLW G +   N  L+NWN+  +PKE GGLG  K + +N AL  KWLWRY+ E +
Subjt:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES

Query:  PLWKRFISAKYSAPRPGALPNQARFSSS
         LWK+ I AKY+    G +P   R SS+
Subjt:  PLWKRFISAKYSAPRPGALPNQARFSSS

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein; e value 6.1e-100; identity 47.9%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        F + G++N NVN T+IALI KK K  + SD+RPISLT+ LY+++AK L  RLK  L  TIAENQ+AF+KGRQI DAIL+ANEVID WK  + +G ++KLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS--------LQMEFSCFLNM-
        IEKAFDKI+W FID +L  K F + W  WI ACI +V YSILLNG P+ R+      +     +PF     + YL  + S          + F+ + N+ 
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPF-----LQYLFLIFS--------LQMEFSCFLNM-

Query:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN
                          L NL   + +FE +SGL  N SKS+IS IN+ + R  Q+AS +G     LP++YLG  LGGNP S SFW   IE IH KL  
Subjt:  ------------------LENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLEN

Query:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES
        W++  ISKGGRLTL++A+L+S+PTY LS FKAP SV+ +I+K  R FLW G +   N  L+NWN+  +PKE GGLG  K + +N AL  KWLWRY+ E +
Subjt:  WRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEES

Query:  PLWKRFISAKYSAPRPGALPNQARFSSS
         LWK+ I AKY+    G +P   R SS+
Subjt:  PLWKRFISAKYSAPRPGALPNQARFSSS

A0A5D3DZ07 LINE-1 retrotransposable element ORF2 protein; e value 3.6e-100; identity 50.93%
Query:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD
        FF+KGVIN+N+N TYIALI KK    Q  DFRPISLT+ +Y++IAKTL  RLK TL  TI+ NQLAF+K RQITDAIL+ANE +D+WK  + +G I+KLD
Subjt:  FFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKP--RVRLFRRSRAKASLRDAPFLQYLFL---IFSLQMEFSCFLNMLENLFNIIK
        IEKAFD +NWDFID VL  K F N+W  WI  CI +V+YS+++NG+P  R++  R  R   SL +   + ++     I     +  CFLN   NL   + 
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKP--RVRLFRRSRAKASLRDAPFLQYLFL---IFSLQMEFSCFLNMLENLFNIIK

Query:  VFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYML
        +FE +SGL IN SKS++  +N+   R  + AS WG     LP++YLG  LGGNP S+ FW  + +KI  KL NW++  ISKGGRLTLI+ TL+S+P Y L
Subjt:  VFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYML

Query:  SIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEESPLWK
        S+F+AP S    I+K  R+FLW G + S    L+NW  V  PKE GGLG  + +++N AL  KWLWRYY E + LW+
Subjt:  SIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEESPLWK

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein; e value 2.2e-14; identity 21.67%
Query:  QKGVINRNVNETYIALIPKKSK-SLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRG-IIIKLD
        ++G++  +  E  I LIPK  + + +  +FRPISL +I  +++ K L  R++  +   I  +Q+ F+ G Q    I  +  VI     ++ +  +II +D
Subjt:  QKGVINRNVNETYIALIPKKSK-SLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRG-IIIKLD

Query:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFL-----------------IFSLQMEFSCF
         EKAFDKI   F+ K L   G    +   I A     + +I+LNG+       ++  +     +P L  + L                 +   +++ S F
Subjt:  IEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFL-----------------IFSLQMEFSCF

Query:  LNML-----------ENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSS--ASFWAPMIEKIHHKLENWR
         + +           +NL  +I  F   SG  IN  KS     N      +Q+  +    ++   I YLG  L  +        + P++++I      W+
Subjt:  LNML-----------ENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSS--ASFWAPMIEKIHHKLENWR

Query:  FFYISKGGRLTLIQATLNSIPTYMLSI--FKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQ-EE
            S  GR+ +++  +     Y  +    K P + + +++K    F+W     +     +  ++++   + GG+     ++   A   K  W +YQ  +
Subjt:  FFYISKGGRLTLIQATLNSIPTYMLSI--FKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQ-EE

Query:  SPLWKR
           W R
Subjt:  SPLWKR

P0C2F6 Putative ribonuclease H protein At1g65750; e value 9.6e-18; identity 36.07%
Query:  MIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQI
        ++E++  ++  WR   +S  GRLTL +A L+S+P + +S    PQS+  ++D++ R+FLW          LV W+ V +PK+ GGLG   ++  N AL  
Subjt:  MIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQI

Query:  KWLWRYYQEESPLWKRFISAKY
        K  WR  QE++ LW   +  KY
Subjt:  KWLWRYYQEESPLWKRFISAKY

P11369 LINE-1 retrotransposable element ORF2 protein; e value 1.9e-13; identity 22.66%
Query:  EHSQTFDHGSVP---RFFFQ---KGVINRNVNETYIALIPKKSKS-LQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILV
        E  QTF    +P   + F +   +G +  +  E  I LIPK  K   +I +FRPISL +I  +++ K L  R++  + A I  +Q+ F+ G Q    I  
Subjt:  EHSQTFDHGSVP---RFFFQ---KGVINRNVNETYIALIPKKSKS-LQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILV

Query:  ANEVIDFWKSSRTRG-IIIKLDIEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFL------
        +  VI +    + +  +II LD EKAFDKI   F+ KVL   G    + + I A       +I +NG+    +  +S  +     +P+L  + L      
Subjt:  ANEVIDFWKSSRTRG-IIIKLDIEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFL------

Query:  ------------------IFSLQMEFSCFL----NMLENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPS
                          I  L  +   ++    N    L N+I  F    G  IN +KS             ++       +    I YLG  L     
Subjt:  ------------------IFSLQMEFSCFL----NMLENLFNIIKVFEISSGLNINFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPS

Query:  SA--SFWAPMIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYMLSI--FKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFF
              +  + ++I   L  W+    S  GR+ +++  +     Y  +    K P   + +++  +  F+W     +   P +  +++   +  GG+   
Subjt:  SA--SFWAPMIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYMLSI--FKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFF

Query:  KSRISNNALQIKWLWRYYQE-ESPLWKR
          ++   A+ IK  W +Y++ +   W R
Subjt:  KSRISNNALQIKWLWRYYQE-ESPLWKR

P14381 Transposon TX1 uncharacterized 149 kDa protein; e value 2.4e-16; identity 24.46%
Query:  FQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLDI
        F+KG +  +     ++L+PKK     I ++RP+SL S  Y+++AK +  RLK  LA  I  +Q   V GR I D + +  +++ F + +      + LD 
Subjt:  FQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGIIIKLDI

Query:  EKAFDKINWDFIDKVLLFKGFSNTWCSWIMA------CIFSVSYSIL--------------LNG-------KPRVRLFRRSRAKASLRDAPFLQYLFLIF
        EKAFD+++  ++   L    F   +  ++        C+  +++S+               L+G       +P + L R+      L++      L    
Subjt:  EKAFDKINWDFIDKVLLFKGFSNTWCSWIMA------CIFSVSYSIL--------------LNG-------KPRVRLFRRSRAKASLRDAPFLQYLFLIF

Query:  SLQMEFSCFLNMLENLFNIIKVFEISSGLNINFSKSS---ISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGN--PSSASFWAPMIEKIHHKLENW
           +  +  L  LE      +V+  +S   IN+SKSS      + ++    A     W   +    I YLG  L     P S +F   + E +  +L  W
Subjt:  SLQMEFSCFLNMLENLFNIIKVFEISSGLNINFSKSS---ISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGN--PSSASFWAPMIEKIHHKLENW

Query:  RFF--YISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRY-YQE
        + F   +S  GR  +I   + S   Y L      Q    KI + +  FLW G         V+  V + P + GG G    R   +  +++ + RY Y +
Subjt:  RFF--YISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSFLWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRY-YQE

Query:  ESPLWKRFISAKYSAPR
         SP W    S+ Y   R
Subjt:  ESPLWKRFISAKYSAPR

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment); e value 3.5e-04; identity 24.59%
Query:  QTFDHGSVPRFFFQKGVINRNVNETYIA----LIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVID
        Q      +PR F Q  ++  +V   + A    LIPK       S++RPI++ S L RL+ + L +RL+  +    A+   A + G  +   +L  +  I 
Subjt:  QTFDHGSVPRFFFQKGVINRNVNETYIA----LIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVID

Query:  FWKSSRTRGIIIKLDIEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLN-GKPRVRLFRRSRAKASLRDAPFL
          +  R    ++ LD+ KAFD ++   I + L   G      ++I   +   + +I +  G    ++  R   K     +PFL
Subjt:  FWKSSRTRGIIIKLDIEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLN-GKPRVRLFRRSRAKASLRDAPFL

Arabidopsis top hits (e value, % identity, alignment)
AT4G20520.1 RNA binding;RNA-directed DNA polymerases; e value 1.4e-08; identity 34.78%
Query:  LVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGI----IIKLDIEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIF
        +VERLK  +   I   Q +F+ GR  TD I+   E +   +  R +G+    ++KLD+EKA+D+I WD+++  L+  GF   W   I    F
Subjt:  LVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGI----IIKLDIEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIF
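The % identity reported for each hit can be related to the alignment blocks above. As a rough sketch, assuming the unlabelled middle line of each block is a BLAST-style match line (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), identity is the number of letters in that line divided by the number of alignment columns. The function name and the toy strings are hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity from one aligned query row and its match line.

    ASSUMPTION: letters in the match line mark identical residues;
    '+' and spaces do not count. Gap columns in the query count toward
    the alignment length.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment")
    # The match line may have lost trailing spaces; pad or trim to the query width.
    midline = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / columns, 2)


# Toy example: 6 identical positions out of 8 columns.
print(percent_identity("MKLDIEKA", "M+LD EKA"))  # prints: 75.0
```

For a hit split over several blocks, the rows would need to be concatenated before calling the function. Counting the letters in the AT4G20520.1 match line gives 32 identical positions over 92 alignment columns, i.e. 34.78%, which agrees with the value listed for that hit.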


Sequences
CDS sequence
ATGGAACACTCTCAAACTTTCGATCATGGAAGTGTTCCACGATTTTTTTTTCAGAAGGGTGTTATCAACCGTAATGTCAATGAAACCTACATTGCTCTGATTCCGAAGAA
GAGCAAATCATTACAAATTTCTGATTTCAGGCCGATCAGTCTCACTTCTATCCTGTATCGTCTTATAGCTAAAACTCTGGTAGAAAGACTTAAGTGTACGCTTGCTGCAA
CCATAGCAGAAAACCAGTTAGCTTTTGTCAAAGGACGTCAGATCACTGATGCTATTTTAGTGGCGAATGAGGTTATTGATTTCTGGAAAAGCTCTCGAACGCGAGGAATT
ATCATTAAGCTTGACATAGAAAAAGCTTTTGACAAGATCAACTGGGATTTTATTGACAAAGTTCTTTTATTTAAAGGCTTTTCGAATACTTGGTGTAGTTGGATCATGGC
TTGTATCTTCTCAGTATCCTATTCCATTTTATTAAATGGGAAGCCCAGAGTCCGCTTATTCAGGCGGTCGAGAGCAAAGGCCTCATTAAGGGATGCTCCTTTCCTTCAGT
ATCTGTTTCTCATCTTCTCTTTGCAGATGGAATTCTCCTGTTTCCTCAACATGCTTGAAAATCTGTTCAACATTATTAAAGTTTTTGAGATTTCCTCTGGCCTCAATATC
AACTTCAGTAAGTCTTCTATCTCGGGCATTAATCTGGAAAGTTACAGAGGGGCTCAAGTTGCCTCTAAATGGGGTTGCCCTCTTTCCCCCCTCCCTATTTCATACTTGGG
TACTCTGCTTGGTGGTAATCCTTCATCAGCTTCCTTTTGGGCACCTATGATTGAGAAAATTCATCACAAGTTGGAGAACTGGCGTTTCTTTTATATTTCTAAGGGGGGTC
GACTAACTTTAATTCAAGCCACTTTAAACAGCATCCCCACTTACATGCTATCTATTTTCAAGGCTCCTCAGTCGGTTTGGTTTAAGATTGACAAGATTGTTAGATCTTTT
CTTTGGCATGGTTTGGATCTCAGTGGTAATCTCCCTCTCGTTAATTGGAATGTGGTGGCTGCTCCTAAAGAATTCGGGGGTCTTGGCTTTTTCAAGTCTAGGATCTCAAA
TAATGCGCTTCAAATTAAATGGCTTTGGAGATATTATCAAGAGGAATCTCCACTTTGGAAACGTTTTATCTCAGCCAAGTACTCCGCTCCCAGGCCAGGCGCTCTTCCAA
ATCAGGCTCGGTTCTCTAGTTCTTGA
mRNA sequence
ATGGAACACTCTCAAACTTTCGATCATGGAAGTGTTCCACGATTTTTTTTTCAGAAGGGTGTTATCAACCGTAATGTCAATGAAACCTACATTGCTCTGATTCCGAAGAA
GAGCAAATCATTACAAATTTCTGATTTCAGGCCGATCAGTCTCACTTCTATCCTGTATCGTCTTATAGCTAAAACTCTGGTAGAAAGACTTAAGTGTACGCTTGCTGCAA
CCATAGCAGAAAACCAGTTAGCTTTTGTCAAAGGACGTCAGATCACTGATGCTATTTTAGTGGCGAATGAGGTTATTGATTTCTGGAAAAGCTCTCGAACGCGAGGAATT
ATCATTAAGCTTGACATAGAAAAAGCTTTTGACAAGATCAACTGGGATTTTATTGACAAAGTTCTTTTATTTAAAGGCTTTTCGAATACTTGGTGTAGTTGGATCATGGC
TTGTATCTTCTCAGTATCCTATTCCATTTTATTAAATGGGAAGCCCAGAGTCCGCTTATTCAGGCGGTCGAGAGCAAAGGCCTCATTAAGGGATGCTCCTTTCCTTCAGT
ATCTGTTTCTCATCTTCTCTTTGCAGATGGAATTCTCCTGTTTCCTCAACATGCTTGAAAATCTGTTCAACATTATTAAAGTTTTTGAGATTTCCTCTGGCCTCAATATC
AACTTCAGTAAGTCTTCTATCTCGGGCATTAATCTGGAAAGTTACAGAGGGGCTCAAGTTGCCTCTAAATGGGGTTGCCCTCTTTCCCCCCTCCCTATTTCATACTTGGG
TACTCTGCTTGGTGGTAATCCTTCATCAGCTTCCTTTTGGGCACCTATGATTGAGAAAATTCATCACAAGTTGGAGAACTGGCGTTTCTTTTATATTTCTAAGGGGGGTC
GACTAACTTTAATTCAAGCCACTTTAAACAGCATCCCCACTTACATGCTATCTATTTTCAAGGCTCCTCAGTCGGTTTGGTTTAAGATTGACAAGATTGTTAGATCTTTT
CTTTGGCATGGTTTGGATCTCAGTGGTAATCTCCCTCTCGTTAATTGGAATGTGGTGGCTGCTCCTAAAGAATTCGGGGGTCTTGGCTTTTTCAAGTCTAGGATCTCAAA
TAATGCGCTTCAAATTAAATGGCTTTGGAGATATTATCAAGAGGAATCTCCACTTTGGAAACGTTTTATCTCAGCCAAGTACTCCGCTCCCAGGCCAGGCGCTCTTCCAA
ATCAGGCTCGGTTCTCTAGTTCTTGA
Protein sequence
MEHSQTFDHGSVPRFFFQKGVINRNVNETYIALIPKKSKSLQISDFRPISLTSILYRLIAKTLVERLKCTLAATIAENQLAFVKGRQITDAILVANEVIDFWKSSRTRGI
IIKLDIEKAFDKINWDFIDKVLLFKGFSNTWCSWIMACIFSVSYSILLNGKPRVRLFRRSRAKASLRDAPFLQYLFLIFSLQMEFSCFLNMLENLFNIIKVFEISSGLNI
NFSKSSISGINLESYRGAQVASKWGCPLSPLPISYLGTLLGGNPSSASFWAPMIEKIHHKLENWRFFYISKGGRLTLIQATLNSIPTYMLSIFKAPQSVWFKIDKIVRSF
LWHGLDLSGNLPLVNWNVVAAPKEFGGLGFFKSRISNNALQIKWLWRYYQEESPLWKRFISAKYSAPRPGALPNQARFSSS
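The CDS and protein sequences above can be cross-checked against each other by translation. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene; the `translate` helper is hypothetical and only the first 42 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nt of the CDS listed above.
cds_start = "ATGGAACACTCTCAAACTTTCGATCATGGAAGTGTTCCACGA"
print(translate(cds_start))  # prints: MEHSQTFDHGSVPR
```

The result matches the first 14 residues of the protein sequence. Pasting the full CDS into `translate` should reproduce the whole protein if the two records are consistent.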