; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017270 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017270
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:1600183..1601490
RNA-Seq ExpressionLag0017270
SyntenyLag0017270
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBG98819.1 VIRB2-interacting protein 2 [Prunus dulcis]7.0e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

BBN69746.1 VIRB2-interacting protein 2 [Prunus dulcis]7.0e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

VVA13439.1 Hypothetical predicted protein, partial [Prunus dulcis]7.0e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

VVA21938.1 Hypothetical predicted protein, partial [Prunus dulcis]7.0e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

VVA41200.1 PREDICTED: RNA-directed DNA polymerase, partial [Prunus dulcis]7.0e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

TrEMBL top hitse value%identityAlignment
A0A4Y1R3V4 VIRB2-interacting protein 23.4e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

A0A5E4EEP2 Reverse transcriptase domain-containing protein (Fragment)3.4e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

A0A5E4F859 Reverse transcriptase domain-containing protein (Fragment)3.4e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

A0A5E4GN72 PREDICTED: RNA-directed DNA polymerase (Fragment)3.4e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

A0A5H2Y6K0 VIRB2-interacting protein 23.4e-11749.54Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI
        R+RN I +L    G  +V +  IE E ++F++ L+S   +  W L  + +W  IS   +  L+ PF E+EV RAV D G +KSPG DGF+   F+  W+I
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKK-DGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNI

Query:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR
        +K+D+M V  DFF    INA  NET+ICLIPKK  +  V D+RPISL + LYK+V++VL+ RL++VL  TI  YQS FV  RQILDA+L+ANE+++E +R
Subjt:  LKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQR

Query:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK
          + G+  K+D+EKA+D V+W F+DE+L  KGFG  WR WIRGC+   NFS++ING+PRGKF ASRGLRQGDPLSPFLF +V+D LSR++ KA+  D   
Subjt:  KKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIK

Query:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW
        GL  G    + I+HLQFADDTI F    E + +NL + ++LF    G+ IN  K   +GI LD  ++  +A  +GC +G WP  YLGLPL G P+++ FW
Subjt:  GLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFW

Query:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL
          V+EK+E RL  W    L KG RLT+IQA L
Subjt:  ASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.8e-3628.34Show/hide
Query:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKKDGNWFLPDIEDWGTISDSLS---------ASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAE
        R +N I  + +  G    D + I+T   ++Y+ L++ K     L ++E+  T  D+ +          SL  P T  E+   +N L + KSPG DGFTAE
Subjt:  RRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKKDGNWFLPDIEDWGTISDSLS---------ASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAE

Query:  FFKKSWNILKKDIMGVFNDFFKSATINANLNETYICLIPKK-IGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVA
        F+++    L   ++ +F    K   +  +  E  I LIPK         ++RPISL +   KI+ ++L+ R+++ +   I   Q  F+   Q       +
Subjt:  FFKKSWNILKKDIMGVFNDFFKSATINANLNETYICLIPKK-IGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVA

Query:  NELIDEWQRKKEKG-VCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRML
          +I    R K+K  V I +D EKAFD +   F+ + L   G    + + IR        +II+NG+    F    G RQG PLSP LF +V++ L+R +
Subjt:  NELIDEWQRKKEKG-VCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRML

Query:  IKAEQQDLIKGLHVGGSHA-LSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLP
            Q+  IKG+ +G     LS+    FADD I++         NL K I  F +  G  IN  K++      + Q    +       I      YLG+ 
Subjt:  IKAEQQDLIKGLHVGGSHA-LSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLP

Query:  LNGKPKSL--TFWASVLEKIEKRLHSW
        L    K L    +  +L++I++  + W
Subjt:  LNGKPKSL--TFWASVLEKIEKRLHSW

P08548 LINE-1 reverse transcriptase homolog4.0e-3028.65Show/hide
Query:  MAATRRRNSIMELLS--RSGQSLV--DDSSIETEFVDFYRKLFSKKDGNWFLPDIEDW------GTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLD
        +A   R+  +  L+S  R+G   +  D S I+    ++Y+KL+S K  N  L +I+ +        +S      L  P +  E+   + +L   KSPG D
Subjt:  MAATRRRNSIMELLS--RSGQSLV--DDSSIETEFVDFYRKLFSKKDGNWFLPDIEDW------GTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLD

Query:  GFTAEFFKKSWNILKKDIMGVFNDFFKSATINANLNETYICLIPKK-IGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILD
        GFT+EF++     L   ++ +F +  K   +     E  I LIPK         +YRPISL +   KI+ ++L+ R+++ +   I   Q  F+   Q   
Subjt:  GFTAEFFKKSWNILKKDIMGVFNDFFKSATINANLNETYICLIPKK-IGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILD

Query:  ASLVANELIDEWQRKKEKG-VCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDC
            +  +I    + K K  + + +D EKAFD +   F+   L+  G   T+ + I    S    +II+NG     F    G RQG PLSP LF +V++ 
Subjt:  ASLVANELIDEWQRKKEKG-VCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDC

Query:  LSRMLIKAEQQDLIKGLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKT
        L+   I   ++  IKG+H+G      I    FADD I++          L + IK +    G  IN  K+
Subjt:  LSRMLIKAEQQDLIKGLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKT

P11369 LINE-1 retrotransposable element ORF2 protein8.0e-3126.82Show/hide
Query:  IMELLSRSGQSLVDDSSIETEFVDFYRKLFSKKDGNWFLPDIEDWGTISDSLSA---------SLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKS
        I ++ +  G    D   I+     FY++L+S K     L ++++     D              L  P + KE+   +N L + KSPG DGF+AEF++  
Subjt:  IMELLSRSGQSLVDDSSIETEFVDFYRKLFSKKDGNWFLPDIEDWGTISDSLSA---------SLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKS

Query:  WNILKKDIMGVFNDFFKSATINANLNETY----ICLIPK-KIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVAN
            K+D++ + +  F    +   L  ++    I LIPK +     + ++RPISL +   KI+ ++L+ R+++ +   I   Q  F+   Q       + 
Subjt:  WNILKKDIMGVFNDFFKSATINANLNETY----ICLIPK-KIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVAN

Query:  ELIDEWQRKKEKG-VCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLI
         +I    + K+K  + I LD EKAFD +   F+ ++L   G    +   I+   S    +I +NG+         G RQG PLSP+LF +V++ L+R + 
Subjt:  ELIDEWQRKKEKG-VCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLI

Query:  KAEQQDLIKGLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLN
           QQ  IKG+ + G   + I+ L  ADD I++ S  +     L   I  F E  G  IN  K+       + Q    + +     I      YLG+ L 
Subjt:  KAEQQDLIKGLHVGGSHALSITHLQFADDTILFSSPNEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLN

Query:  GKPKSL--TFWASVLEKIEKRLHSW
         + K L    + S+ ++I++ L  W
Subjt:  GKPKSL--TFWASVLEKIEKRLHSW

P14381 Transposon TX1 uncharacterized 149 kDa protein7.2e-3233.11Show/hide
Query:  RNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKKDGNWFLPDI--EDWG---TISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSW
        R  I  L +  G  L D  +I      FY+ LFS        PD   E W     +S+     LE P T  E+ +A+  +  NKSPGLDG T EFF+  W
Subjt:  RNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKKDGNWFLPDI--EDWG---TISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSW

Query:  NILKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEW
        + L  D   V  + FK   +  +     + L+PKK   + + ++RP+SL S  YKIVA+ +S RLK VL   I   QS  V  R I D   +  +L+   
Subjt:  NILKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEW

Query:  QRKKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVD---CLSR
        +R       + LD EKAFD VD ++L   L+   FG  +  +++   +     + IN          RG+RQG PLS  L+ + ++   CL R
Subjt:  QRKKEKGVCIKLDIEKAFDMVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVD---CLSR

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.5e-1328.07Show/hide
Query:  LIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQRKKEKGVCIKLDIEKAFDMVDWEFLDEIL
        LIPK    ++  ++RPI++ S L +++ R+L++RL+  +   +   Q  +      L  SL+ +  I   + +++    + LD+ KAFD V    +   L
Subjt:  LIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQRKKEKGVCIKLDIEKAFDMVDWEFLDEIL

Query:  RVKGFGYTWRRWIRGCISLVNFSIIIN-GKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIKGLHVGGSHALSITHLQFADDTILFSSP
        +  G       +I G +S    +I +  G    K    RG++QGDPLSPFLF  V+D    +L   +    I G  +G      I  L FADD +L    
Subjt:  RVKGFGYTWRRWIRGCISLVNFSIIIN-GKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIKGLHVGGSHALSITHLQFADDTILFSSP

Query:  NEAHLDNLFKSIKLFEEAPGLNINCFKT
        N+  L     ++  F    G+++N  K+
Subjt:  NEAHLDNLFKSIKLFEEAPGLNINCFKT

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.8e-1436.63Show/hide
Query:  SDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNILKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKI
        +D+L++ L    ++KE+  AV  +  NK+PG D FTAEFF +SW ++K   +    +FF++  +    N T I LIPK  G   +  +RP+S  + +YKI
Subjt:  SDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNILKKDIMGVFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKI

Query:  V
        +
Subjt:  V

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.4e-0635.44Show/hide
Query:  ERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQRKK-EKG-VCIKLDIEKAFDMVDWEFLDEILRVKGFGYTW
        ERLK ++ + I   Q+ F+  R   D  +   E +   +RKK  KG + +KLD+EKA+D + W++L++ L   GF   W
Subjt:  ERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQRKK-EKG-VCIKLDIEKAFDMVDWEFLDEILRVKGFGYTW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.3e-1250.72Show/hide
Query:  IINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIKGLHVGGSHALSITHLQFADDT
        IING P+G    SRGLRQGDPLSP+LFI+  + LS +  +A++Q  + G+ V  +++  I HL FADDT
Subjt:  IINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIKGLHVGGSHALSITHLQFADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCACTCGAAGAAGGAACTCCATTATGGAACTTTTATCTCGATCGGGTCAGAGTTTGGTTGATGATTCTAGCATTGAAACAGAGTTTGTGGATTTCTATAGGAA
GTTATTTTCTAAAAAGGATGGCAATTGGTTTTTACCTGACATAGAAGATTGGGGTACAATATCAGATAGCCTTAGTGCTAGCCTGGAAGTTCCCTTCACCGAAAAAGAAG
TCCATAGAGCTGTCAACGATTTGGGATCCAATAAATCTCCCGGCCTGGATGGTTTCACGGCTGAATTCTTTAAAAAATCATGGAACATTCTTAAGAAAGACATTATGGGA
GTGTTCAATGATTTTTTTAAGAGTGCTACTATTAACGCCAACTTAAATGAGACTTATATTTGTCTTATCCCAAAGAAAATTGGAGCTAAATCGGTTGGTGACTATAGACC
CATTAGCCTTACATCATGCCTCTACAAAATTGTGGCTCGTGTCTTATCAGAAAGATTGAAGAAAGTCTTGCCCCACACTATCATCGAATACCAATCTGTCTTTGTCGCAG
ATAGACAAATCTTAGATGCCTCTCTCGTTGCCAATGAGCTTATTGACGAGTGGCAAAGGAAAAAGGAAAAAGGAGTTTGCATCAAACTTGATATTGAAAAGGCCTTTGAT
ATGGTTGACTGGGAATTCCTTGACGAGATTCTTCGTGTTAAGGGTTTTGGTTACACATGGAGGAGATGGATTAGGGGATGTATATCATTGGTAAACTTTTCTATTATCAT
AAATGGGAAACCAAGAGGAAAATTCGATGCATCTCGTGGCCTTCGACAAGGTGACCCTTTATCTCCATTCTTATTTATTATGGTTGTTGATTGCCTTAGTAGGATGCTTA
TCAAGGCCGAGCAACAAGATCTTATTAAGGGCCTACACGTTGGTGGGTCTCATGCTCTCTCCATCACCCATCTACAATTTGCGGATGACACAATCCTTTTCTCCTCCCCA
AACGAAGCTCACCTTGACAATCTCTTCAAATCGATAAAGCTTTTTGAGGAAGCTCCTGGGTTGAATATTAATTGTTTTAAAACAGAGTTCATGGGCATTGGCTTGGATCC
ACAAATCCTTTATTCATTGGCTGATCGTTATGGATGCAAAATTGGTGGCTGGCCAAACACGTATTTGGGTCTTCCTTTGAATGGGAAGCCAAAGTCCTTAACTTTTTGGG
CGTCTGTTTTAGAGAAAATTGAGAAAAGACTTCATTCTTGGGGATCCCAACACCTCCCGAAAGGATGTAGACTTACCCTTATACAAGCTACTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGCCACTCGAAGAAGGAACTCCATTATGGAACTTTTATCTCGATCGGGTCAGAGTTTGGTTGATGATTCTAGCATTGAAACAGAGTTTGTGGATTTCTATAGGAA
GTTATTTTCTAAAAAGGATGGCAATTGGTTTTTACCTGACATAGAAGATTGGGGTACAATATCAGATAGCCTTAGTGCTAGCCTGGAAGTTCCCTTCACCGAAAAAGAAG
TCCATAGAGCTGTCAACGATTTGGGATCCAATAAATCTCCCGGCCTGGATGGTTTCACGGCTGAATTCTTTAAAAAATCATGGAACATTCTTAAGAAAGACATTATGGGA
GTGTTCAATGATTTTTTTAAGAGTGCTACTATTAACGCCAACTTAAATGAGACTTATATTTGTCTTATCCCAAAGAAAATTGGAGCTAAATCGGTTGGTGACTATAGACC
CATTAGCCTTACATCATGCCTCTACAAAATTGTGGCTCGTGTCTTATCAGAAAGATTGAAGAAAGTCTTGCCCCACACTATCATCGAATACCAATCTGTCTTTGTCGCAG
ATAGACAAATCTTAGATGCCTCTCTCGTTGCCAATGAGCTTATTGACGAGTGGCAAAGGAAAAAGGAAAAAGGAGTTTGCATCAAACTTGATATTGAAAAGGCCTTTGAT
ATGGTTGACTGGGAATTCCTTGACGAGATTCTTCGTGTTAAGGGTTTTGGTTACACATGGAGGAGATGGATTAGGGGATGTATATCATTGGTAAACTTTTCTATTATCAT
AAATGGGAAACCAAGAGGAAAATTCGATGCATCTCGTGGCCTTCGACAAGGTGACCCTTTATCTCCATTCTTATTTATTATGGTTGTTGATTGCCTTAGTAGGATGCTTA
TCAAGGCCGAGCAACAAGATCTTATTAAGGGCCTACACGTTGGTGGGTCTCATGCTCTCTCCATCACCCATCTACAATTTGCGGATGACACAATCCTTTTCTCCTCCCCA
AACGAAGCTCACCTTGACAATCTCTTCAAATCGATAAAGCTTTTTGAGGAAGCTCCTGGGTTGAATATTAATTGTTTTAAAACAGAGTTCATGGGCATTGGCTTGGATCC
ACAAATCCTTTATTCATTGGCTGATCGTTATGGATGCAAAATTGGTGGCTGGCCAAACACGTATTTGGGTCTTCCTTTGAATGGGAAGCCAAAGTCCTTAACTTTTTGGG
CGTCTGTTTTAGAGAAAATTGAGAAAAGACTTCATTCTTGGGGATCCCAACACCTCCCGAAAGGATGTAGACTTACCCTTATACAAGCTACTCTTTAG
Protein sequenceShow/hide protein sequence
MAATRRRNSIMELLSRSGQSLVDDSSIETEFVDFYRKLFSKKDGNWFLPDIEDWGTISDSLSASLEVPFTEKEVHRAVNDLGSNKSPGLDGFTAEFFKKSWNILKKDIMG
VFNDFFKSATINANLNETYICLIPKKIGAKSVGDYRPISLTSCLYKIVARVLSERLKKVLPHTIIEYQSVFVADRQILDASLVANELIDEWQRKKEKGVCIKLDIEKAFD
MVDWEFLDEILRVKGFGYTWRRWIRGCISLVNFSIIINGKPRGKFDASRGLRQGDPLSPFLFIMVVDCLSRMLIKAEQQDLIKGLHVGGSHALSITHLQFADDTILFSSP
NEAHLDNLFKSIKLFEEAPGLNINCFKTEFMGIGLDPQILYSLADRYGCKIGGWPNTYLGLPLNGKPKSLTFWASVLEKIEKRLHSWGSQHLPKGCRLTLIQATL