; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031352 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031352
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr11:7409373..7411799
RNA-Seq ExpressionLag0031352
SyntenyLag0031352
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.5e-22649.55Show/hide
Query:  LWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDL
        +W+    S + +  G FS+SI     +G  WWLS IYGPAKR++R   W EL  L  +C   W+LGGDFNV RW  E+++  PA  SM  FN+FI+N +L
Subjt:  LWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDL

Query:  VDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGY
        +DPPL N  +TWSN+R +A +SRLDRFL++  W   F  H S+ L R TSDHFPI+LE+  ++WGPSPFRF N  L +  + +NI+ WW  T Q G+ GY
Subjt:  VDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGY

Query:  SFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARR
        SF+RRLKQLA ++K W +      +  KKA   +I+ ID LE +G    IH +KR +LK DL +  + E +   Q+CK+ W+ EGDEN+SFFHKIC+AR+
Subjt:  SFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARR

Query:  RKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLK
        +K  IS++I+++  + + D  +    + HF++I+         I NL+W PI   ++  + +PF E E+W  LKS   NK+PGPDG+ ++F +KSW+ +K
Subjt:  RKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLK

Query:  SPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSH
          I  +F +F    +IN+ VNE+ I LI KK +    +++RPISLTT +Y+LIAK+LA+R+K TLP TI+ESQ AFVKGRQI +AIL+ANE +D WR   
Subjt:  SPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSH

Query:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGC
          GF+IKLDIEKAFDK++W FI+ +L  K +   W   I +CISSV YSIL+NG+PRG+I+  RGIRQGDP+SPFIFVLAMDYLSRL+     K  I G 
Subjt:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGC

Query:  SIN-DISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIV
          + ++++TH+LFADDIL+FV D D ++ N  MI+  FE ASGLNIN SKS+I  INV  DR   IA  WG S   LP SYLG PLG +PS+ +FW  ++
Subjt:  SIN-DISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIV

Query:  EKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLV
        +KI + L  W+YS +SKGGR+TL+ STL S  IY +SVFK P+ I  +I+  +RNFLW G  N  H I L+
Subjt:  EKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLV

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.1e-22348.62Show/hide
Query:  SKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFN
        S  +   I S  ++  A     G  GGIL+LWD       DI  G +S+S+N  L     WWL+ +YGP K  DR KLW EL  L  LC   WL+ GDFN
Subjt:  SKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFN

Query:  VFRWISESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFR
        + RW  E+++ +  KR+M+NFN FI+  +L+DPP +N  FTWSN+R     SRLDRFL S  W   F  H SR L R  SDHFPI+LE+P + WGP PFR
Subjt:  VFRWISESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFR

Query:  FDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQEL
         +N  L +K F +N   WW+ ++Q G PGY+FI+ L  L+  +K+W+   + +    KKAL  +I+ ID LE QG M   HHQKRISLK DL      + 
Subjt:  FDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQEL

Query:  RFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVW
        +   QR ++ W   GDEN S+FH+IC+  +RKN I  +      SL +   + +  + HF+NI+        +I NL+W PI     + + +PF E E+ 
Subjt:  RFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVW

Query:  QNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIA
          + S  + K+PGPDG+T+ F+KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+++AK+LA R+K  LP TIA
Subjt:  QNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIA

Query:  ESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGD
        E+Q AF+KGRQI DAIL+ANE +D W+     GF++KLDIEKAFDKISW FI+ ML  K FP  W  WIKACIS+V YSILLNG P+G+I+A RGIRQGD
Subjt:  ESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGD

Query:  PISPFIFVLAMDYLSRLIQAAEHKGLIEGCSIND-ISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW
        P+SPFIFVLAMDYLSRL+   E KG I+G S N+  +++HLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +
Subjt:  PISPFIFVLAMDYLSRLIQAAEHKGLIEGCSIND-ISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW

Query:  GCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDH
        G   + LP++YLG PLG  P + SFW   +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++ +R+FLW G+E+  +
Subjt:  GCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDH

TYJ99326.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.3e-22950.58Show/hide
Query:  LWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDL
        +W+    S   +  G FS+SI     +G  WWLS IYGPAKR++R   W EL +L  +C   W+LGGDFNV RW  E+S+  PA  SM  FN FI+N +L
Subjt:  LWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDL

Query:  VDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGY
        +DPPL N  FTWSN+R +A +SRLDRFL+S  W   F  H S+ L R TSDHFPI+LE+  ++WGPSPFRF N  L +  + +NI+ WW  T Q G  GY
Subjt:  VDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGY

Query:  SFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARR
        SF+ RLKQLA  +K W ++     +  KKA   +I+ I+ LE +G    IH +KRI+LK DL +  + E +   Q+CK+ W+ EGDEN+SFFHKIC+AR+
Subjt:  SFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARR

Query:  RKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLK
        +K  IS++I+    + + D  +    + HF+ I+         I NL+W PI   ++  + +PF E E+W  LKS   NK+PGPDGFT++F +KSW+ +K
Subjt:  RKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLK

Query:  SPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSH
          I  +F +F  +  IN+ VNE+ I  I KK N   ++++RPISLTT +Y+LIAK LA+R+K TLP TI+ESQ AFVKGRQI +AIL+ANE +DLWR   
Subjt:  SPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSH

Query:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGC
          GF+IKLDIEKAFDK++W FI+ ML  K +   W   I +CISSV YSIL+NG+PRG+I+  RGIRQGDP+SPFIFVLAMDYLSRL+     KG I G 
Subjt:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGC

Query:  SIN-DISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIV
        +   ++++TH+LFADDIL+FV D D ++ N  MI+  FE ASGLNIN SKS+I  INV  DR + IA  WG S   LP SYLG PLG KPS+ +FW  ++
Subjt:  SIN-DISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIV

Query:  EKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK
        +KI + L  W+YS +SKG R+TL+ STL S  IY LSVFK P+ I  +I+  +RNFLW GT N  H I    G K
Subjt:  EKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]9.1e-21148.65Show/hide
Query:  LWDALKLSAVDIITGVFSLSINFSLGDG---FRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINN
        +WD L+ +  D I G FSLSIN +  DG     WWLS IYGP+  R+R+  W EL DL   C+  WLL GDFNV R+ SE+S+  P+K SM  FN FI +
Subjt:  LWDALKLSAVDIITGVFSLSINFSLGDG---FRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINN

Query:  LDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGH
         +L+DPPL N  FTWSN+R   V+SR+DRFLY+  W   F+ H S+ L+RVTSDHFPI+LE+  ++WGPSPF+  N  L E  F  NI  WW   +QEGH
Subjt:  LDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGH

Query:  PGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICS
        PG+SF+R+LKQL++++++ ++KN       K A   +I++ID LE +G +      +R  LK D+  +  +E +  +Q+ K+ W+ EGDENTSFFHKICS
Subjt:  PGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICS

Query:  ARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPG-WIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSW
        AR+R++ IS + S++ V   T+  + +  + HF++I+        W+I NLNW PI  + A  +   FTE+E+ + L +   NKSP     TV       
Subjt:  ARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPG-WIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSW

Query:  TTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLW
                            +  +N + IALI KK      ++YRPISLTT +Y+LIAK +AER+K TLP T+AE+Q AFVK RQI+DAIL+ANE +D W
Subjt:  TTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLW

Query:  RVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGL
        R     GF+IKLDIEKAFDK++W FI+ ML  KG+P  W  WI+ACISSV YSI++NG+PRGKIQ  RGIRQGDPISPFIFVLAMDY+SRL+ +   K  
Subjt:  RVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGL

Query:  IEGCSI-NDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFW
        I+G  +  +I++THLLFADDILLFV D++  ++N   II  F+ ASGL+IN +KS+IS INV   R   IAS+WG S + LPI+YLG PLG K +  +FW
Subjt:  IEGCSI-NDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFW

Query:  SPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK
          + EKI + L  W+YS +SKGG++TL++S+L S   Y LS+FKAP S C  I++ +RNFLWK    + HK+ LV   K
Subjt:  SPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]7.2e-22448.74Show/hide
Query:  SKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFN
        S  +   I S  ++  A     G  GGIL+LWD       DI  G +S+S+N  L     WWL+ +YGP K  DR KLW EL  L  LC   WL+ GDFN
Subjt:  SKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFN

Query:  VFRWISESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFR
        + RW  E+++ +  KR+M+NFN FI+  +L+DPP +N  FTWSN+R     SRLDRFL S  W   F  H SR L R  SDHFPI+LE+P + WGP PFR
Subjt:  VFRWISESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFR

Query:  FDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQEL
         +N  L +K F +N   WW+ ++Q G PGY+FI+ L  L+  +K+W+   + +    KKAL  +I+ ID LE QG M   HHQKRISLK DL      + 
Subjt:  FDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQEL

Query:  RFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVW
        +   QR ++ W   GDEN S+FH+IC+  +RKN I  +      SL +   + +  + HF+NI+        +I NL+W PI     + + +PF E E+ 
Subjt:  RFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVW

Query:  QNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIA
          + S  + K+PGPDG+T+ F+KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+++AK+LA R+K  LP TIA
Subjt:  QNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIA

Query:  ESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGD
        E+Q AF+KGRQI DAIL+ANEV+D W+     GF++KLDIEKAFDKISW FI+ ML  K FP  W  WIKACIS+V YSILLNG P+G+I+A RGIRQGD
Subjt:  ESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGD

Query:  PISPFIFVLAMDYLSRLIQAAEHKGLIEGCSIND-ISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW
        P+SPFIFVLAMDYLSRL+   E KG I+G S N+  +++HLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +
Subjt:  PISPFIFVLAMDYLSRLIQAAEHKGLIEGCSIND-ISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW

Query:  GCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDH
        G   + LP++YLG PLG  P + SFW   +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++ +R+FLW G+E+  +
Subjt:  GCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDH

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein2.2e-22649.55Show/hide
Query:  LWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDL
        +W+    S + +  G FS+SI     +G  WWLS IYGPAKR++R   W EL  L  +C   W+LGGDFNV RW  E+++  PA  SM  FN+FI+N +L
Subjt:  LWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDL

Query:  VDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGY
        +DPPL N  +TWSN+R +A +SRLDRFL++  W   F  H S+ L R TSDHFPI+LE+  ++WGPSPFRF N  L +  + +NI+ WW  T Q G+ GY
Subjt:  VDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGY

Query:  SFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARR
        SF+RRLKQLA ++K W +      +  KKA   +I+ ID LE +G    IH +KR +LK DL +  + E +   Q+CK+ W+ EGDEN+SFFHKIC+AR+
Subjt:  SFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARR

Query:  RKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLK
        +K  IS++I+++  + + D  +    + HF++I+         I NL+W PI   ++  + +PF E E+W  LKS   NK+PGPDG+ ++F +KSW+ +K
Subjt:  RKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLK

Query:  SPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSH
          I  +F +F    +IN+ VNE+ I LI KK +    +++RPISLTT +Y+LIAK+LA+R+K TLP TI+ESQ AFVKGRQI +AIL+ANE +D WR   
Subjt:  SPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSH

Query:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGC
          GF+IKLDIEKAFDK++W FI+ +L  K +   W   I +CISSV YSIL+NG+PRG+I+  RGIRQGDP+SPFIFVLAMDYLSRL+     K  I G 
Subjt:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGC

Query:  SIN-DISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIV
          + ++++TH+LFADDIL+FV D D ++ N  MI+  FE ASGLNIN SKS+I  INV  DR   IA  WG S   LP SYLG PLG +PS+ +FW  ++
Subjt:  SIN-DISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIV

Query:  EKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLV
        +KI + L  W+YS +SKGGR+TL+ STL S  IY +SVFK P+ I  +I+  +RNFLW G  N  H I L+
Subjt:  EKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLV

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein1.0e-22348.62Show/hide
Query:  SKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFN
        S  +   I S  ++  A     G  GGIL+LWD       DI  G +S+S+N  L     WWL+ +YGP K  DR KLW EL  L  LC   WL+ GDFN
Subjt:  SKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFN

Query:  VFRWISESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFR
        + RW  E+++ +  KR+M+NFN FI+  +L+DPP +N  FTWSN+R     SRLDRFL S  W   F  H SR L R  SDHFPI+LE+P + WGP PFR
Subjt:  VFRWISESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFR

Query:  FDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQEL
         +N  L +K F +N   WW+ ++Q G PGY+FI+ L  L+  +K+W+   + +    KKAL  +I+ ID LE QG M   HHQKRISLK DL      + 
Subjt:  FDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQEL

Query:  RFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVW
        +   QR ++ W   GDEN S+FH+IC+  +RKN I  +      SL +   + +  + HF+NI+        +I NL+W PI     + + +PF E E+ 
Subjt:  RFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVW

Query:  QNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIA
          + S  + K+PGPDG+T+ F+KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+++AK+LA R+K  LP TIA
Subjt:  QNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIA

Query:  ESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGD
        E+Q AF+KGRQI DAIL+ANE +D W+     GF++KLDIEKAFDKISW FI+ ML  K FP  W  WIKACIS+V YSILLNG P+G+I+A RGIRQGD
Subjt:  ESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGD

Query:  PISPFIFVLAMDYLSRLIQAAEHKGLIEGCSIND-ISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW
        P+SPFIFVLAMDYLSRL+   E KG I+G S N+  +++HLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +
Subjt:  PISPFIFVLAMDYLSRLIQAAEHKGLIEGCSIND-ISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW

Query:  GCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDH
        G   + LP++YLG PLG  P + SFW   +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++ +R+FLW G+E+  +
Subjt:  GCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDH

A0A5D3BJP3 LINE-1 retrotransposable element ORF2 protein2.1e-22950.58Show/hide
Query:  LWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDL
        +W+    S   +  G FS+SI     +G  WWLS IYGPAKR++R   W EL +L  +C   W+LGGDFNV RW  E+S+  PA  SM  FN FI+N +L
Subjt:  LWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDL

Query:  VDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGY
        +DPPL N  FTWSN+R +A +SRLDRFL+S  W   F  H S+ L R TSDHFPI+LE+  ++WGPSPFRF N  L +  + +NI+ WW  T Q G  GY
Subjt:  VDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGY

Query:  SFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARR
        SF+ RLKQLA  +K W ++     +  KKA   +I+ I+ LE +G    IH +KRI+LK DL +  + E +   Q+CK+ W+ EGDEN+SFFHKIC+AR+
Subjt:  SFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARR

Query:  RKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLK
        +K  IS++I+    + + D  +    + HF+ I+         I NL+W PI   ++  + +PF E E+W  LKS   NK+PGPDGFT++F +KSW+ +K
Subjt:  RKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLK

Query:  SPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSH
          I  +F +F  +  IN+ VNE+ I  I KK N   ++++RPISLTT +Y+LIAK LA+R+K TLP TI+ESQ AFVKGRQI +AIL+ANE +DLWR   
Subjt:  SPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSH

Query:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGC
          GF+IKLDIEKAFDK++W FI+ ML  K +   W   I +CISSV YSIL+NG+PRG+I+  RGIRQGDP+SPFIFVLAMDYLSRL+     KG I G 
Subjt:  TSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGC

Query:  SIN-DISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIV
        +   ++++TH+LFADDIL+FV D D ++ N  MI+  FE ASGLNIN SKS+I  INV  DR + IA  WG S   LP SYLG PLG KPS+ +FW  ++
Subjt:  SIN-DISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIV

Query:  EKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK
        +KI + L  W+YS +SKG R+TL+ STL S  IY LSVFK P+ I  +I+  +RNFLW GT N  H I    G K
Subjt:  EKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein4.4e-21148.65Show/hide
Query:  LWDALKLSAVDIITGVFSLSINFSLGDG---FRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINN
        +WD L+ +  D I G FSLSIN +  DG     WWLS IYGP+  R+R+  W EL DL   C+  WLL GDFNV R+ SE+S+  P+K SM  FN FI +
Subjt:  LWDALKLSAVDIITGVFSLSINFSLGDG---FRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINN

Query:  LDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGH
         +L+DPPL N  FTWSN+R   V+SR+DRFLY+  W   F+ H S+ L+RVTSDHFPI+LE+  ++WGPSPF+  N  L E  F  NI  WW   +QEGH
Subjt:  LDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGH

Query:  PGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICS
        PG+SF+R+LKQL++++++ ++KN       K A   +I++ID LE +G +      +R  LK D+  +  +E +  +Q+ K+ W+ EGDENTSFFHKICS
Subjt:  PGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICS

Query:  ARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPG-WIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSW
        AR+R++ IS + S++ V   T+  + +  + HF++I+        W+I NLNW PI  + A  +   FTE+E+ + L +   NKSP     TV       
Subjt:  ARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPG-WIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSW

Query:  TTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLW
                            +  +N + IALI KK      ++YRPISLTT +Y+LIAK +AER+K TLP T+AE+Q AFVK RQI+DAIL+ANE +D W
Subjt:  TTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLW

Query:  RVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGL
        R     GF+IKLDIEKAFDK++W FI+ ML  KG+P  W  WI+ACISSV YSI++NG+PRGKIQ  RGIRQGDPISPFIFVLAMDY+SRL+ +   K  
Subjt:  RVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGL

Query:  IEGCSI-NDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFW
        I+G  +  +I++THLLFADDILLFV D++  ++N   II  F+ ASGL+IN +KS+IS INV   R   IAS+WG S + LPI+YLG PLG K +  +FW
Subjt:  IEGCSI-NDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFW

Query:  SPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK
          + EKI + L  W+YS +SKGG++TL++S+L S   Y LS+FKAP S C  I++ +RNFLWK    + HK+ LV   K
Subjt:  SPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein3.5e-22448.74Show/hide
Query:  SKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFN
        S  +   I S  ++  A     G  GGIL+LWD       DI  G +S+S+N  L     WWL+ +YGP K  DR KLW EL  L  LC   WL+ GDFN
Subjt:  SKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFN

Query:  VFRWISESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFR
        + RW  E+++ +  KR+M+NFN FI+  +L+DPP +N  FTWSN+R     SRLDRFL S  W   F  H SR L R  SDHFPI+LE+P + WGP PFR
Subjt:  VFRWISESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFR

Query:  FDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQEL
         +N  L +K F +N   WW+ ++Q G PGY+FI+ L  L+  +K+W+   + +    KKAL  +I+ ID LE QG M   HHQKRISLK DL      + 
Subjt:  FDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQEL

Query:  RFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVW
        +   QR ++ W   GDEN S+FH+IC+  +RKN I  +      SL +   + +  + HF+NI+        +I NL+W PI     + + +PF E E+ 
Subjt:  RFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVW

Query:  QNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIA
          + S  + K+PGPDG+T+ F+KK W  LK  +++VF +F + G++N NVN ++IALI KK    + S+YRPISLTT LY+++AK+LA R+K  LP TIA
Subjt:  QNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIA

Query:  ESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGD
        E+Q AF+KGRQI DAIL+ANEV+D W+     GF++KLDIEKAFDKISW FI+ ML  K FP  W  WIKACIS+V YSILLNG P+G+I+A RGIRQGD
Subjt:  ESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGD

Query:  PISPFIFVLAMDYLSRLIQAAEHKGLIEGCSIND-ISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW
        P+SPFIFVLAMDYLSRL+   E KG I+G S N+  +++HLLFADD+L+FV DN+ +L N  M +  FE+ASGL  N SKS+IS IN+S  R   IAS +
Subjt:  PISPFIFVLAMDYLSRLIQAAEHKGLIEGCSIND-ISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW

Query:  GCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDH
        G   + LP++YLG PLG  P + SFW   +E I++ L+ W+YS ISKGGRLTLL+++L+S   Y LS FKAP S+   I++ +R+FLW G+E+  +
Subjt:  GCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKGTENSDH

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.9e-4423.74Show/hide
Query:  LLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDLVD------PPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIIL
        L+ GDFN    I + S+     +     N+ ++  DL+D      P      F           S++D  + S A   +    ++  +    SDH  I L
Subjt:  LLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDLVD------PPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIIL

Query:  ENPHLNWGPS---PFRFDNYLLNE----KRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDS-LERQGLMEG
        E    N   S    ++ +N LLN+          I M++   + +     +     K +       + K I +   ++K     I+ + S L+     E 
Subjt:  ENPHLNWGPS---PFRFDNYLLNE----KRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDS-LERQGLMEG

Query:  IHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKI----------CSARRRKNSISELISSNEVSLVTD-HQLEQEVVGHFKNIF----
         H +     ++    A ++E+       +KT  K  +  + FF +I             +R KN I + I +++  + TD  +++  +  ++K+++    
Subjt:  IHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKI----------CSARRRKNSISELISSNEVSLVTD-HQLEQEVVGHFKNIF----

Query:  HSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKA-N
         +       +       ++ +   ++ RP T  E+   + S+   KSPGPDGFT EF+++    L   ++ +F    + G++  +  E+ I LIPK   +
Subjt:  HSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKA-N

Query:  SLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLA-NEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFP
        + +   +RPISL  +  +++ K LA RI+  +   I   Q  F+ G Q    I  + N +  + R    +  II +D EKAFDKI   F+   L   G  
Subjt:  SLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLA-NEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFP

Query:  DIWCGWIKACISSVSYSILLNGKPRGKIQAF---RGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVFLEN
         ++   I+A     + +I+LNG+   K++AF    G RQG P+SP +F + ++ L+R I+  +    I+G  +    V   LFADD+++++ +  V  +N
Subjt:  DIWCGWIKACISSVSYSILLNGKPRGKIQAF---RGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVFLEN

Query:  YIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSF---WSPIVEKIYRHLDKWQYSYISKGGRLTLLQST
         + +I  F + SG  IN  KS     N +    S I      +  +  I YLG  L  +   D F   + P++++I    +KW+    S  GR+ +++  
Subjt:  YIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSF---WSPIVEKIYRHLDKWQYSYISKGGRLTLLQST

Query:  LNSSLIYPLSV--FKAPQSICTRIDRIFRNFLW
        +   +IY  +    K P +  T +++    F+W
Subjt:  LNSSLIYPLSV--FKAPQSICTRIDRIFRNFLW

P08548 LINE-1 reverse transcriptase homolog1.5e-3821.72Show/hide
Query:  IYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDLVD------PPLINGGFTWSNMRERAVMSRLDRFLY
        IY P      Q +   L D+  L +   ++ GDFN    + + SS     + + + N+ I +LDL D      P      F  S        S++D  L 
Subjt:  IYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDLVD------PPLINGGFTWSNMRERAVMSRLDRFLY

Query:  SPAWAMQFSDHQSRRLNRVTSDHFPIILE---NPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIE--II
          +   +F   +   +  + SDH  I +E   N +L+     ++ +N +L +   I  I    +   ++ +   +  + L   A  V   K   ++  + 
Subjt:  SPAWAMQFSDHQSRRLNRVTSDHFPIILE---NPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIE--II

Query:  KNRKKALSDDIEAIDSLERQ--GLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQL
        K  ++ +++ +  +  LE++     +    ++   ++ +L+E   + +     + K  + ++ ++       +   +R K+ IS + + N+       ++
Subjt:  KNRKKALSDDIEAIDSLERQ--GLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQL

Query:  EQEVVGHFKNI----FHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINR
        ++ +  ++K +    + +       +   +   +       + RP +  E+   ++++   KSPGPDGFT EF++     L   ++++F    + G++  
Subjt:  EQEVVGHFKNI----FHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINR

Query:  NVNESYIALIPKKA-NSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLA-NEVVDLWRVSHTSGFIIKLDIEKAFDK
           E+ I LIPK   +  R   YRPISL  +  +++ K L  RI+  +   I   Q  F+ G Q    I  + N +  + ++ +    I+ +D EKAFD 
Subjt:  NVNESYIALIPKKA-NSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLA-NEVVDLWRVSHTSGFIIKLDIEKAFDK

Query:  ISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDI
        I   F+   L+  G    +   I+A  S  + +I+LNG          G RQG P+SP +F + M+ L+  I   E K  I+G  I    +   LFADD+
Subjt:  ISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDI

Query:  LLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSF---WSPIVEKIYRHLDKWQYSY
        ++++ +        + +IK +   SG  IN  KS       +      +      +     + YLG  L  K   D +   +  + ++I   ++KW+   
Subjt:  LLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSF---WSPIVEKIYRHLDKWQYSY

Query:  ISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSICTRIDRIFRNFLW
         S  GR+ +++ ++    IY  +    KAP S    +++I  +F+W
Subjt:  ISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSICTRIDRIFRNFLW

P11369 LINE-1 retrotransposable element ORF2 protein4.0e-3627.43Show/hide
Query:  IDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPK-KANSLRISEYRPISLTTVLY
        ++ D  + +  P +  E+   + S+   KSPGPDGF+ EF++     L   +  +FH+    G +  +  E+ I LIPK + +  +I  +RPISL  +  
Subjt:  IDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPK-KANSLRISEYRPISLTTVLY

Query:  RLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVD-LWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYS
        +++ K LA RI+  +   I   Q  F+ G Q    I  +  V+  + ++   +  II LD EKAFDKI   F+  +L   G    +   IKA  S    +
Subjt:  RLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVD-LWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYS

Query:  ILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSK
        I +NG+    I    G RQG P+SP++F + ++ L+R I+  +    I+G  I    V   L ADD+++++ D        + +I +F +  G  IN +K
Subjt:  ILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSK

Query:  SSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPL--GAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSIC
        S       ++     I      S  T  I YLG  L    K   D  +  + ++I   L +W+    S  GR+ +++  +    IY  +    K P    
Subjt:  SSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPL--GAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLSV--FKAPQSIC

Query:  TRIDRIFRNFLW
          ++     F+W
Subjt:  TRIDRIFRNFLW

P14381 Transposon TX1 uncharacterized 149 kDa protein9.3e-4925.54Show/hide
Query:  LSAVDIITG-VFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTEL--YDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDLVD-
        LSA  +I G +  L +  S   G  + L  +Y P    +R + +  L  Y      ++  ++GGDFN      + + P     S S     I +  LVD 
Subjt:  LSAVDIITG-VFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTEL--YDLGGLCNDCWLLGGDFNVFRWISESSSPTPAKRSMSNFNAFINNLDLVD-

Query:  -----PPLINGGFTWSNMRERAV-MSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPS-----PFRFDNYLLNEKRFIQNI-DMW--
             P  +   FT+  +R+  V  SR+DR +Y  +  M  +   + RL    SDH  + L    ++  PS      + F+N LL ++ F +++ D W  
Subjt:  -----PPLINGGFTWSNMRERAV-MSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPS-----PFRFDNYLLNEKRFIQNI-DMW--

Query:  WSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNI-------EIIKNRKKALSDDIEAIDS--LERQGLMEGIHHQ----KRISLKMDLHEAAMQELRFHWQ
        W   Q E              A++ + W    +       E  K+     + +IEA++   L+ +  + G   Q    + +  K  L     ++ R  + 
Subjt:  WSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNI-------EIIKNRKKALSDDIEAIDS--LERQGLMEGIHHQ----KRISLKMDLHEAAMQELRFHWQ

Query:  RCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAP-PPGWIISNLNWFPIDADSANTIIR-PFTEDEVWQNL
        R +   L + D  + FF+ +   +  +  I+ L + +   L     +       ++N+F   P  P       +  P+ ++     +  P T DE+ Q L
Subjt:  RCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAP-PPGWIISNLNWFPIDADSANTIIR-PFTEDEVWQNL

Query:  KSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQ
        + M HNKSPG DG T+EFF+  W TL      V  E ++ G +  +   + ++L+PKK +   I  +RP+SL +  Y+++AK+++ R+K  L   I   Q
Subjt:  KSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQ

Query:  FAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPIS
           V GR I D + L  +++   R +  S   + LD EKAFD++   ++   L+   F   + G++K   +S    + +N      +   RG+RQG P+S
Subjt:  FAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPIS

Query:  PFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW-GCS
          ++ LA++    L++     GL+      D+ V    +ADD++L  +D  V LE      + +  AS   IN+SKS  SG+     +V  +   +   S
Subjt:  PFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRW-GCS

Query:  AQTLPISYLGTPLGAK--PSNDSFWSPIVEKIYRHLDKWQ--YSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKG
         ++  I YLG  L A+  P + +F   + E +   L KW+     +S  GR  ++   + S + Y L      Q    +I R   +FLW G
Subjt:  AQTLPISYLGTPLGAK--PSNDSFWSPIVEKIYRHLDKWQ--YSYISKGGRLTLLQSTLNSSLIYPLSVFKAPQSICTRIDRIFRNFLWKG

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)2.2e-2128.35Show/hide
Query:  RPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIA----LIPKKANSLRISEYRPISLTTVLYRLIAKSL
        RP   +E+   +K      +PG DG TV+    + T  + P       F +  ++  +V   + A    LIPK  +    S +RPI++ + L RL+ + L
Subjt:  RPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIA----LIPKKANSLRISEYRPISLTTVLYRLIAKSL

Query:  AERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLN-GKP
        A+R++  +    A+  +A + G  +    LL +  +   R    +  ++ LD+ KAFD +S   I   L+  G  +    +I   +S  + +I +  G  
Subjt:  AERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDKISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLN-GKP

Query:  RGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGIN
          KI   RG++QGDP+SPF+F   +D L   +Q+    G+  G +I +  +  L FADD+LL + DNDV L   +  +  F +  G+++N +K S+S   
Subjt:  RGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVFLENYIMIIKAFEQASGLNINFSKSSISGIN

Query:  VSEDRVSLIASRWGCSAQTLP
             +S+ AS   C  +T P
Subjt:  VSEDRVSLIASRWGCSAQTLP

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein1.2e-1124.21Show/hide
Query:  DRQKLWTELYDLGG---LCNDCWLLGGDFNVFRWISESSSPTPAK---RSMSNFNAFINNLDLVDPPLINGGFTWSN-MRERAVMSRLDRFLYSPAWAMQ
        +R+ LW ++  L     LCN  WL+ GDFN    ++E  S  P+    + + +  A + + DLVD P     +TWSN  ++  ++ +LDR + +  W   
Subjt:  DRQKLWTELYDLGG---LCNDCWLLGGDFNVFRWISESSSPTPAK---RSMSNFNAFINNLDLVDPPLINGGFTWSN-MRERAVMSRLDRFLYSPAWAMQ

Query:  FSDHQSRRLNRVTSDHFP--IILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDD
        F    +       SDH    +IL N         F++ ++L     FI +I   W      G   +S    LK+     +   ++    I+ +  +   D
Subjt:  FSDHQSRRLNRVTSDHFP--IILENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDD

Query:  IEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGD
                   L    H  ++     +   AA++   F+ Q+ +  WLKEGD
Subjt:  IEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGD

AT1G43760.1 DNAse I-like superfamily protein6.0e-3527.18Show/hide
Query:  LLGGDFNVFRWISESSSPTPAK---RSMSNFNAFINNLDLVDPPLINGGFTWSNMR-ERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFP--IIL
        +L GDF+     S+  S        R +  F   + + DLVD P     +TWSN + +  ++ +LDR + +  W   F    +       SDH P  IIL
Subjt:  LLGGDFNVFRWISESSSPTPAK---RSMSNFNAFINNLDLVDPPLINGGFTWSNMR-ERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFP--IIL

Query:  ENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRIS
        EN         FR+ ++L     F+ ++ + W      G   +S    LK      K   ++    I+++ K   D +E+I S       + +   + ++
Subjt:  ENPHLNWGPSPFRFDNYLLNEKRFIQNIDMWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRIS

Query:  LKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHS-----APPPGWIISNLNWFPI
         K     AA  E  F+ Q+ +  WL++GD NT FFHK+  A + KN I  L   ++V +    Q+++ +V ++ ++  S      P     I +++ F  
Subjt:  LKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKICSARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHS-----APPPGWIISNLNWFPI

Query:  DADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRL
        +   A+ +    ++ E+   + +M  NK+PGPD FT EFF +SW  +K   ++   EF+  G + +  N + I LIPK     ++S +RP+S  TV+Y++
Subjt:  DADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSVFHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRL

Query:  I
        I
Subjt:  I

AT4G20520.1 RNA binding;RNA-directed DNA polymerases7.4e-0937.04Show/hide
Query:  LAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVV-DLWRVSHTSGF-IIKLDIEKAFDKISWDFIESMLRFKGFPDIW
        + ER+K  +   I  +Q +F+ GR   D I+   E V  + R     G+ ++KLD+EKA+D+I WD++E  L   GFP++W
Subjt:  LAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVV-DLWRVSHTSGF-IIKLDIEKAFDKISWDFIESMLRFKGFPDIW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.7e-1143.28Show/hide
Query:  LLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDIS--VTHLLFADD
        ++NG P+G +   RG+RQGDP+SP++F+L  + LS L + A+ +G + G  +++ S  + HLLFADD
Subjt:  LLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDIS--VTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTCCATCTCAAAAAAATTTATTAAATCAATTTGGAGTTCCGTTAGCATTAAATGGGCATCTCTGGATTCTTGTGGCAGATCAGGAGGCATCCTTATTCTTTGGGA
TGCTTTAAAGCTTTCAGCTGTTGATATTATTACGGGTGTTTTCTCTCTTTCTATTAATTTCTCTCTGGGAGATGGGTTTCGATGGTGGCTCTCGGGCATTTATGGTCCTG
CTAAGAGAAGGGATAGGCAGAAGCTTTGGACAGAGCTTTATGACCTTGGAGGGCTGTGCAATGATTGTTGGCTTTTGGGTGGTGACTTCAATGTATTTCGGTGGATCTCT
GAATCTTCCTCTCCCACCCCTGCAAAGAGAAGTATGTCCAATTTTAATGCCTTTATTAATAACCTGGATCTTGTCGATCCGCCACTTATTAATGGAGGCTTTACGTGGTC
AAATATGAGAGAAAGAGCTGTTATGTCTAGGCTGGACAGGTTCCTCTACTCCCCTGCATGGGCCATGCAGTTCTCAGACCACCAATCTCGAAGGCTAAATAGGGTTACGT
CTGACCATTTTCCAATCATTTTGGAAAATCCTCATCTAAATTGGGGTCCAAGTCCTTTCCGATTTGACAACTACCTGCTTAATGAAAAGAGGTTCATTCAAAATATTGAC
ATGTGGTGGTCCCTCACCCAGCAGGAAGGTCACCCTGGATACTCTTTCATTCGAAGACTTAAACAGCTAGCTTCTATGGTTAAAGACTGGAAGAAGAAAAACATTGAGAT
TATTAAAAATAGAAAAAAGGCCTTATCGGATGATATTGAAGCTATTGACTCTCTGGAAAGGCAAGGTCTAATGGAAGGAATTCACCACCAGAAGAGGATATCTCTCAAAA
TGGATCTTCACGAGGCAGCTATGCAGGAATTGAGATTCCACTGGCAAAGATGCAAAAAGACTTGGCTAAAGGAAGGGGATGAAAATACTTCCTTCTTTCATAAAATATGC
TCTGCCCGCCGAAGGAAAAACTCTATTTCAGAACTAATTTCTTCCAATGAAGTTAGTCTAGTTACAGACCACCAGCTTGAGCAGGAGGTGGTTGGGCATTTCAAGAATAT
TTTCCATTCCGCCCCTCCTCCGGGATGGATTATCTCAAACCTTAATTGGTTTCCTATTGATGCGGATTCGGCGAATACTATTATTCGTCCTTTCACAGAAGATGAGGTTT
GGCAAAACTTGAAGTCTATGGGTCACAATAAATCTCCAGGACCGGATGGCTTTACGGTGGAGTTTTTTAAAAAATCATGGACCACTCTAAAATCACCCATTATGTCTGTA
TTCCATGAATTCTGGGAGCATGGAGTGATCAACCGAAATGTCAATGAATCATACATTGCTTTGATTCCTAAAAAGGCAAATTCCTTGAGAATTTCTGAATATCGGCCGAT
TAGCTTAACAACGGTCCTTTATAGATTGATAGCAAAGTCCCTCGCTGAAAGGATAAAATGCACTCTCCCCTGCACCATTGCTGAAAGTCAATTTGCTTTTGTTAAAGGTC
GCCAGATTCTTGACGCCATTTTATTAGCTAATGAGGTCGTAGATCTTTGGAGAGTCTCGCACACAAGTGGCTTTATCATTAAGCTTGATATTGAAAAGGCCTTTGACAAG
ATCAGTTGGGACTTCATAGAAAGTATGCTTCGTTTTAAAGGTTTCCCTGATATCTGGTGTGGATGGATAAAAGCGTGCATCTCTTCTGTTTCATATTCCATTCTGCTCAA
TGGTAAGCCGAGGGGTAAAATTCAGGCTTTCAGAGGAATTCGCCAGGGAGATCCTATTTCTCCATTCATCTTTGTCCTTGCCATGGATTACCTTAGTAGACTCATCCAAG
CAGCTGAGCATAAGGGTCTCATTGAGGGCTGTTCGATCAATGACATATCCGTCACTCATCTTCTTTTTGCGGACGATATTCTCCTATTCGTTAGAGATAACGATGTTTTC
TTGGAGAACTACATTATGATCATAAAGGCATTTGAGCAAGCCTCGGGTCTAAACATCAACTTTTCCAAATCTTCCATCTCCGGTATAAATGTTTCCGAGGATAGAGTTTC
CTTGATTGCATCTAGATGGGGCTGTTCTGCTCAAACTCTTCCAATCTCCTATTTGGGTACTCCCTTAGGAGCTAAACCCTCCAATGATTCTTTCTGGAGCCCGATTGTGG
AGAAAATCTACAGACATCTTGATAAATGGCAATATTCATACATTTCAAAAGGAGGTCGATTAACCCTCTTGCAGTCCACTCTGAATAGTTCTCTTATCTACCCCTTGTCG
GTCTTCAAAGCACCGCAATCTATTTGTACACGCATTGACCGAATCTTTCGAAACTTTCTATGGAAGGGGACCGAAAATTCTGATCACAAGATCCCTTTGGTGGGTGGAAT
AAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCATTCCATCTCAAAAAAATTTATTAAATCAATTTGGAGTTCCGTTAGCATTAAATGGGCATCTCTGGATTCTTGTGGCAGATCAGGAGGCATCCTTATTCTTTGGGA
TGCTTTAAAGCTTTCAGCTGTTGATATTATTACGGGTGTTTTCTCTCTTTCTATTAATTTCTCTCTGGGAGATGGGTTTCGATGGTGGCTCTCGGGCATTTATGGTCCTG
CTAAGAGAAGGGATAGGCAGAAGCTTTGGACAGAGCTTTATGACCTTGGAGGGCTGTGCAATGATTGTTGGCTTTTGGGTGGTGACTTCAATGTATTTCGGTGGATCTCT
GAATCTTCCTCTCCCACCCCTGCAAAGAGAAGTATGTCCAATTTTAATGCCTTTATTAATAACCTGGATCTTGTCGATCCGCCACTTATTAATGGAGGCTTTACGTGGTC
AAATATGAGAGAAAGAGCTGTTATGTCTAGGCTGGACAGGTTCCTCTACTCCCCTGCATGGGCCATGCAGTTCTCAGACCACCAATCTCGAAGGCTAAATAGGGTTACGT
CTGACCATTTTCCAATCATTTTGGAAAATCCTCATCTAAATTGGGGTCCAAGTCCTTTCCGATTTGACAACTACCTGCTTAATGAAAAGAGGTTCATTCAAAATATTGAC
ATGTGGTGGTCCCTCACCCAGCAGGAAGGTCACCCTGGATACTCTTTCATTCGAAGACTTAAACAGCTAGCTTCTATGGTTAAAGACTGGAAGAAGAAAAACATTGAGAT
TATTAAAAATAGAAAAAAGGCCTTATCGGATGATATTGAAGCTATTGACTCTCTGGAAAGGCAAGGTCTAATGGAAGGAATTCACCACCAGAAGAGGATATCTCTCAAAA
TGGATCTTCACGAGGCAGCTATGCAGGAATTGAGATTCCACTGGCAAAGATGCAAAAAGACTTGGCTAAAGGAAGGGGATGAAAATACTTCCTTCTTTCATAAAATATGC
TCTGCCCGCCGAAGGAAAAACTCTATTTCAGAACTAATTTCTTCCAATGAAGTTAGTCTAGTTACAGACCACCAGCTTGAGCAGGAGGTGGTTGGGCATTTCAAGAATAT
TTTCCATTCCGCCCCTCCTCCGGGATGGATTATCTCAAACCTTAATTGGTTTCCTATTGATGCGGATTCGGCGAATACTATTATTCGTCCTTTCACAGAAGATGAGGTTT
GGCAAAACTTGAAGTCTATGGGTCACAATAAATCTCCAGGACCGGATGGCTTTACGGTGGAGTTTTTTAAAAAATCATGGACCACTCTAAAATCACCCATTATGTCTGTA
TTCCATGAATTCTGGGAGCATGGAGTGATCAACCGAAATGTCAATGAATCATACATTGCTTTGATTCCTAAAAAGGCAAATTCCTTGAGAATTTCTGAATATCGGCCGAT
TAGCTTAACAACGGTCCTTTATAGATTGATAGCAAAGTCCCTCGCTGAAAGGATAAAATGCACTCTCCCCTGCACCATTGCTGAAAGTCAATTTGCTTTTGTTAAAGGTC
GCCAGATTCTTGACGCCATTTTATTAGCTAATGAGGTCGTAGATCTTTGGAGAGTCTCGCACACAAGTGGCTTTATCATTAAGCTTGATATTGAAAAGGCCTTTGACAAG
ATCAGTTGGGACTTCATAGAAAGTATGCTTCGTTTTAAAGGTTTCCCTGATATCTGGTGTGGATGGATAAAAGCGTGCATCTCTTCTGTTTCATATTCCATTCTGCTCAA
TGGTAAGCCGAGGGGTAAAATTCAGGCTTTCAGAGGAATTCGCCAGGGAGATCCTATTTCTCCATTCATCTTTGTCCTTGCCATGGATTACCTTAGTAGACTCATCCAAG
CAGCTGAGCATAAGGGTCTCATTGAGGGCTGTTCGATCAATGACATATCCGTCACTCATCTTCTTTTTGCGGACGATATTCTCCTATTCGTTAGAGATAACGATGTTTTC
TTGGAGAACTACATTATGATCATAAAGGCATTTGAGCAAGCCTCGGGTCTAAACATCAACTTTTCCAAATCTTCCATCTCCGGTATAAATGTTTCCGAGGATAGAGTTTC
CTTGATTGCATCTAGATGGGGCTGTTCTGCTCAAACTCTTCCAATCTCCTATTTGGGTACTCCCTTAGGAGCTAAACCCTCCAATGATTCTTTCTGGAGCCCGATTGTGG
AGAAAATCTACAGACATCTTGATAAATGGCAATATTCATACATTTCAAAAGGAGGTCGATTAACCCTCTTGCAGTCCACTCTGAATAGTTCTCTTATCTACCCCTTGTCG
GTCTTCAAAGCACCGCAATCTATTTGTACACGCATTGACCGAATCTTTCGAAACTTTCTATGGAAGGGGACCGAAAATTCTGATCACAAGATCCCTTTGGTGGGTGGAAT
AAAGTGA
Protein sequenceShow/hide protein sequence
MHSISKKFIKSIWSSVSIKWASLDSCGRSGGILILWDALKLSAVDIITGVFSLSINFSLGDGFRWWLSGIYGPAKRRDRQKLWTELYDLGGLCNDCWLLGGDFNVFRWIS
ESSSPTPAKRSMSNFNAFINNLDLVDPPLINGGFTWSNMRERAVMSRLDRFLYSPAWAMQFSDHQSRRLNRVTSDHFPIILENPHLNWGPSPFRFDNYLLNEKRFIQNID
MWWSLTQQEGHPGYSFIRRLKQLASMVKDWKKKNIEIIKNRKKALSDDIEAIDSLERQGLMEGIHHQKRISLKMDLHEAAMQELRFHWQRCKKTWLKEGDENTSFFHKIC
SARRRKNSISELISSNEVSLVTDHQLEQEVVGHFKNIFHSAPPPGWIISNLNWFPIDADSANTIIRPFTEDEVWQNLKSMGHNKSPGPDGFTVEFFKKSWTTLKSPIMSV
FHEFWEHGVINRNVNESYIALIPKKANSLRISEYRPISLTTVLYRLIAKSLAERIKCTLPCTIAESQFAFVKGRQILDAILLANEVVDLWRVSHTSGFIIKLDIEKAFDK
ISWDFIESMLRFKGFPDIWCGWIKACISSVSYSILLNGKPRGKIQAFRGIRQGDPISPFIFVLAMDYLSRLIQAAEHKGLIEGCSINDISVTHLLFADDILLFVRDNDVF
LENYIMIIKAFEQASGLNINFSKSSISGINVSEDRVSLIASRWGCSAQTLPISYLGTPLGAKPSNDSFWSPIVEKIYRHLDKWQYSYISKGGRLTLLQSTLNSSLIYPLS
VFKAPQSICTRIDRIFRNFLWKGTENSDHKIPLVGGIK