CuGenDBv2

Lag0037569 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0037569
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:7353103..7354899
RNA-Seq Expression: Lag0037569
Synteny: Lag0037569
Gene Ontology terms: NA
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits
CAB4268750.1 unnamed protein product [Prunus armeniaca] (e value: 5.6e-133; % identity: 43.23)
Query:  MWVSSISFSLLSFSKNHIDGWISWGGC--RWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCG
        MW   ++ SL S S NHID  +   G   RWR TGFYG P      +SW LL RL      PWL  GDFN +L  DEK GGR +   ++  F+  VD CG
Subjt:  MWVSSISFSLLSFSKNHIDGWISWGGC--RWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKPLSELAAFQSVVDLCG

Query:  LRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQR-IVRFEETWLRQPGLQQLVGRSWAA
         +DLGF G  FTW    P  E I  RLDR    T W   +P   V HL+ ++SDH P+ + ++   +  S +  R + RFEE W++     + +   W  
Subjt:  LRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQR-IVRFEETWLRQPGLQQLVGRSWAA

Query:  GPGESSD--ITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRA
            S+   +T   KR    ++GW R   G+   +I+S   ++   +    T  + +   +  A+L+ ++ + E+YW+QRSR LWL+ GDRNT++FH +A
Subjt:  GPGESSD--ITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRA

Query:  SYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYR
        S R++ N I GLED+ G+WQ  +  +   +  YFQ LFSS+G +  D+E     +   V +EMN+ LL  FT EE+  AL Q HP+KAPGPDG S  FY+
Subjt:  SYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYR

Query:  HHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECI
         +WD+VG D++ + L  L+ G     +N T + LIPKV   + M   RPISL NV YK+ +KVL NR+K ILP LIS +QSAF+PGR + DN+I+ FE +
Subjt:  HHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECI

Query:  HELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
        H + +R+ GR  + ALKLDMSKAYDRVEW FL  +ML +GF ++WV LI+ C++SVS+SF LNG  VG V+P RGL
Subjt:  HELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

CCA66054.1 hypothetical protein [Beta vulgaris subsp. vulgaris] (e value: 1.3e-132; % identity: 42.14)
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQ
        + + K  LGF N F V   GR+GGL L W   + FSL+SFS++HI G +  G  +WR  G YG+   +    +WSLL  L      P L+GGDFN +L  
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQ

Query:  DEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQR
         EKEGG ++   E+  F+  +D   LRDLG+VG  +TW   R     I ERLDR   +  W+DLYP+ V  H    +SDH  +  VL        +   R
Subjt:  DEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQR

Query:  IVRFEETWLRQPGLQQLVGRSWAAGPGESSDITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWK
         + FE +WL     + +V  SW    GE   +T       Q +V W   K  N  ++I +A + +  A     +  +    V  E +L+E+  + E YW 
Subjt:  IVRFEETWLRQPGLQQLVGRSWAAGPGESSDITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWK

Query:  QRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLL
         RSR   +++GD+NT++FH +AS R+K N + GL D  G W++E   +  + T YF  +F+SS PS    E  +  +EP V +E N  LL PF+K+E+L 
Subjt:  QRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLL

Query:  ALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQ
        AL+Q HP KAPGPDG+   FY+  W IVG D+      +L+   SP  VN+T I LIPKVK   +  +FRPI+LCNV YKL+SK +V R+K  LP++IS+
Subjt:  ALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQ

Query:  NQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
        NQSAF+PGR + DNA++  E  H ++ R+R R    A+KLDMSKAYDRVEW FLR+++L +GF  +WV+LI+  VSSV++SF +NG   G VVP+RGL
Subjt:  NQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

XP_012836341.1 PREDICTED: uncharacterized protein LOC105956976 [Erythranthe guttata] (e value: 4.8e-140; % identity: 42.66)
Query:  NCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDK
        N F VD  GRSGG+ L W   +   L+S+S NHID  +       +WR+TGFYGFP       SWSLL  LR     PW++GGDFN +LC  EKEGG  K
Subjt:  NCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDK

Query:  PLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEETWL
          + + AF+  +D+C L DLGF G  FTW N +    T+ ERLDRV  N  W   YP   V HL+Y  SDH P++L+L PP   +    +R  RFE  WL
Subjt:  PLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEETWL

Query:  RQPGLQQLVGRSWA----AGPGESSDITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRE
        R+   + +V   ++    A P E+  +    + C  +++ W ++      RRI    +R+   +  L T D++  + Q + ++E+  +E ++YW+QRS+ 
Subjt:  RQPGLQQLVGRSWA----AGPGESSDITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRE

Query:  LWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQT
         W++EGDRNT++FH +A+ R ++NR+  L+DD G+W+  +  + +++++YF+ LFSS+GPS Q+ +  L ++   +  E  Q L  PFT +EV  A+ Q 
Subjt:  LWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQT

Query:  HPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAF
         P K+PGPDGL   FY  +W I+G D++   L  LN    P  +N T IVLIPKVK   ++ D+RPISLCNV YK  +KV+ NR+K +L  LIS  QSAF
Subjt:  HPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAF

Query:  IPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
        +P R + DN ++ +E  H ++  S  R  + ALKLD+SKAYDR+EW FL+ ++LR G    +VDLI+ CVSSVSFSF  NG + G V PSRGL
Subjt:  IPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

XP_030510497.1 uncharacterized protein LOC115725200 [Cannabis sativa] (e value: 9.6e-133; % identity: 42.9)
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWIS-WGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLC
        M   +  LGFD CF VD  G SGGLAL+W  S    +  FS NHID  I       WR TG YG P  D+  ++W+LL  L+  +  PW I GD N +  
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWIS-WGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLC

Query:  QDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQ
        Q EK+GGR  P S +  F   +  C L DL  VG  FTW   R  GE I ERLD+   N  W+ ++   V+ +L++S SDH P++LV       +  +  
Subjt:  QDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQ

Query:  RIVRFEETWLRQPGLQQLVGRSWAAGPGESSDITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYW
           RFE  WLR+P  +QLV   W  G G +S I      C + +  WGR  +G+F  RI     R++S   G   S S     +A+AQL EVL + E++W
Subjt:  RIVRFEETWLRQPGLQQLVGRSWAAGPGESSDITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYW

Query:  KQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVL
        KQRS++ WL  GD+N+++FH  AS R++ N I  L+D+ G W   ++ +  V+T YF  LF SS  ++      L  + PSV  + N  LL P +++EV 
Subjt:  KQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVL

Query:  LALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLIS
         AL Q HP+K+PGPDG++ +FY+ HW IVGPD++Q      + G  P  +NDT IVLIPK K   +M D RPISLCNV YK+ SKV+ NRMK +L   IS
Subjt:  LALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLIS

Query:  QNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
        + QSAF+ GR + DN ++ FE +H L+R+++GR  + ALKLDMSKAYDRVEW FL  V+  +GF+++W+ L++ CV+SV +     G K+G ++P+RG+
Subjt:  QNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

XP_038718167.1 uncharacterized protein LOC120011171 [Tripterygium wilfordii] (e value: 1.6e-132; % identity: 42.43)
Query:  MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGC--RWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVL
        M    R LGF  C    C   + GLAL+W    S  LLSFSKNHIDG +   G   +W +TGFYG P       SWSLL  L+GC++ PWL+ GDFN +L
Subjt:  MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGC--RWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVL

Query:  CQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSG
           EK GGR + +  +  FQS ++ C L DLG+VG  +TWCN R  G  I+ERLDR   +  W  L+P+ +VNH   + SDH P+ +V   P     ++ 
Subjt:  CQDEKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSG

Query:  QRIVRFEETWLRQPGLQQLVGRSWAAGPGESSD-ITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEV
        +R  RFEE W R  GL+ +V + W  G G++ D I  L+      +  W RS  G+  RR++  ++R+       S+ + R+ +   ++++ E+L  EE+
Subjt:  QRIVRFEETWLRQPGLQQLVGRSWAAGPGESSD-ITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEV

Query:  YWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEE
         W+QRSR  WL++GD+NT++FH +AS R K NRI G+ED  G W Q++  V  ++ ++F+ LF S  P   D   AL  +   +  +MN+ L+R   ++E
Subjt:  YWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEE

Query:  VLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQL
        V  AL + HP KAPGPDG+   F++ +W++V  D++         G     VNDT I LIPKV   +R+ DFRPISLCNV YK+ISKVL NR+K ++P L
Subjt:  VLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQL

Query:  ISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRG
        +S +QSAF+ GR + DN ++ +E +H L+ R  G+  + ALKLDM+KAYDRVEW FL  VM  +GF  +WV LI+ CV +V +S  +NG   G  +P RG
Subjt:  ISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRG

Query:  L
        L
Subjt:  L

TrEMBL top hits
A0A2N9EWI8 Uncharacterized protein (e value: 3.9e-140; % identity: 43.14)
Query:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI-SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGG
        LG   CF V+  G  GGLAL+W  S+   + S+SK+HID W+ S  G  WR TGFYG P       SW LL RL+G ++ PWL+ GDFN ++  DEK+G 
Subjt:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI-SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGG

Query:  RDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPV----ELVLTPPPQCWSQSGQRIV
          +  +++A F+  ++ C L DLGF G  FTW N R   E + ERLDR      WMDL+P   + H+ ++ SDH  +    E V+  P +    S +R  
Subjt:  RDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPV----ELVLTPPPQCWSQSGQRIV

Query:  RFEETWLRQPGLQQLVGRSWAAGPGESS--DITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWK
         FE  WLR+ G ++ + ++W      ++   ++   K+C   ++ W ++      + I    +R+     G  +  +         +L  +LQ+EE+YW+
Subjt:  RFEETWLRQPGLQQLVGRSWAAGPGESS--DITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWK

Query:  QRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLL
        QRSR  WLREGDRNT +FH  AS R+K N I G+ D Q VWQ+E+T +  V+  YF  +++++ P   D    +R+++  V  +MNQ LL+PFT+EEV +
Subjt:  QRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLL

Query:  ALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQ
        AL Q  P+KAPGPDG++  F++  W IVG D+  + L  LN G    ++N T I LIPKVK+   M  FRPISLCNV YK+ISKVLVNRMK ILP ++S 
Subjt:  ALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQ

Query:  NQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
        +QSAF+PGR + DN ++ FE IH L+ +  G+    A KLDMSKAY+RVEW +L+++ML+LGF ++WV LI+ CV+SVS+S  +NG+  G V PSRGL
Subjt:  NQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

A0A2N9GJ35 Uncharacterized protein (e value: 2.3e-140; % identity: 43.41)
Query:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWG-GCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGG
        LG   C  V+ HG+ GGLAL+W SS+  ++ S+S++HIDG +    G RWRLTGFYG+P A +  +SWSLL  LR  ++ PW+I GDFN +   +EK G 
Subjt:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWG-GCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGG

Query:  RDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVL----TPPPQCWSQSGQRIV
         D+  +++AAF+  +  C L+D+GF G  FTW N R  G+ +  RLDR   +  W+ L+P+  +NHL  + SDH  V L+L      P     Q  +R+ 
Subjt:  RDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVL----TPPPQCWSQSGQRIV

Query:  RFEETWLRQPGLQQLVGRSWAAGPGESS--DITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQS-AIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYW
        RFE++WL++ G ++++  +W   P  ++   +    K+C   ++ W +S      + I S  +++Q   +      DSR + +  +  L  + ++ E+ W
Subjt:  RFEETWLRQPGLQQLVGRSWAAGPGESS--DITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQS-AIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYW

Query:  KQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVL
        +QRSR +WL EGDRNT++FH  AS R+K+N I GL D Q  W+ E   V Q+  DYF  LF+SS P   D    L ++E  V   MN  L+RPFT+EE+ 
Subjt:  KQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVL

Query:  LALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLIS
         AL Q HP+K+PGPDG+S  F++ +W IV  D+  + L  L  G   G++N T +VLIPKV A   +  FRPISLCNV YK++SKVLVNRMK ILPQ+IS
Subjt:  LALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLIS

Query:  QNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
         +QSAF+PGR + DN I+ FE IH L+    G     A+KLDMSKAYDRVEW +L+ +M++LGF  QWV L++ CV + ++S  +NGE  G + P RGL
Subjt:  QNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

A0A2N9I946 Uncharacterized protein (e value: 6.7e-140; % identity: 43.36)
Query:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDG-WISWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGG
        LG   CF V+ +G  GGLAL+W SS++  + SFS NHID   +   G +WR+TGFYG P   +   SW+LL +L      PWL+ GDFN VL  +E+ G 
Subjt:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDG-WISWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGG

Query:  RDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEE
         D+ LS++AAF+  +  C L+DLG+ G  F+W NRR  G  +  RLDR   N  W+ L+P+Y V+H+ ++ SDH  + ++L PPP   S + ++  RFE 
Subjt:  RDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEE

Query:  TWLRQPGLQQLVGRSWAAGPGESSDITPLA---KRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRS
         W+R+ G +  +  +W+  P   + +  +A   K C   ++ W +S+     R I     R+    +      S   +     ++  ++++EE++W+QRS
Subjt:  TWLRQPGLQQLVGRSWAAGPGESSDITPLA---KRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRS

Query:  RELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALK
        R  WL+EGDRNT+++H  AS R+K N I GL DDQG+WQ E  A+  +  +YF  LF SS P        +  ++  V   MN ALLR F+ EE+  AL 
Subjt:  RELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALK

Query:  QTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQS
        Q  P+KAPGPDG++  F++ +W IVG D+  + L   + G   G++N T IVLIPKVK    M  FRPISLCNV YK+ SKVLVNRMK ILP +IS +QS
Subjt:  QTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQS

Query:  AFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
        AF+PGR + DN I+ FE +H L+    G     A KLDMSKAYDRVEW FL+ ++L+LGF ++WVDLI+ CV+S S+S  +NG   G + PSRGL
Subjt:  AFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

A0A2N9IMU5 Uncharacterized protein (e value: 2.2e-138; % identity: 42.02)
Query:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGG-CRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGG
        LG   CF VD HG  GGLAL+W SS+S  + S+S  HID  +      +WR+TGFYG P   +   SW+LL  L      PWL+ GDFN ++  +E+ G 
Subjt:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGG-CRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGG

Query:  RDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEE
         D+ L ++AAF+  + +C L+DLG+ G  F+W NRR  G  +  RLDR   N  WM L+P+Y V+H+ ++ SDH  + ++L PP      + +++ RF+ 
Subjt:  RDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEE

Query:  TWLRQPGLQQLVGRSWAAGPGESSDITPLA---KRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRS
        TW+R+ G ++ +  +W+  P   + +  +A   K C  +++ W + ++    R I S   R+    +      +   +    +++  +  +EE++W+QRS
Subjt:  TWLRQPGLQQLVGRSWAAGPGESSDITPLA---KRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRS

Query:  RELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALK
        R  WL+EGDRNT++FH  A+ R+K+N I GL DD GVWQ E  A+  +  +YF HLF SS P+       +  ++  V   MN+ALL+  + EE+  AL 
Subjt:  RELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALK

Query:  QTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQS
        Q  P+KAPGPDG++  FY+ +W IVG D+  +       G   G++N T IVLIPKVK    M  FRPISLCNV YK+ SKVLVNRMK ILP++IS +QS
Subjt:  QTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQS

Query:  AFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
        AF+PGR + DN I+ FE +H L+    G     A KLDMSKAYDRVEW FL+ ++L+ GF ++WVDLI+ CVS+ S++  +NG   G + PSRGL
Subjt:  AFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

A0A2N9IPS8 Reverse transcriptase domain-containing protein (e value: 1.3e-143; % identity: 41.81)
Query:  FDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGR
        FD  FCV   G  GGLA++W++ +   L ++S+NHID  I     G  +RLTGFYG P      +SW+LL  L   + +PWL  GDFN +L  +E+ G  
Subjt:  FDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI--SWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGR

Query:  DKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEET
         +P  ++  F+  V  CGL DLG+VG+ +TW  +R G   +  RLDR+  +  W+  Y   VV+HL    SDH P+ L +   P    +  +++ RFE  
Subjt:  DKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEET

Query:  WLRQPGLQQLVGRSWAAGPGESSD---ITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAI----AGLSTSDSRDLLVQAEAQLEEVLQEEEVYWK
        W++    ++++  +W  G  E S    +    K C  S++GW R + G+    I+   +++Q  I    +G ST      +++ +  L  +L++EE++W+
Subjt:  WLRQPGLQQLVGRSWAAGPGESSD---ITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAI----AGLSTSDSRDLLVQAEAQLEEVLQEEEVYWK

Query:  QRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLL
        QRSR  W+ EGD+NT++FH + + R++ N I GL D  GVWQ EKT + ++  DYFQ +F+SS PS +     L+ +E  V + MN  L   FTK+EV L
Subjt:  QRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLL

Query:  ALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQ
        ALKQ +P KAPGPDG+S  FY+ +WDIVGP++ Q+ L++L+ G     +N T I LIPKVK    + DFRPISLCNV YK++SKVL NR+K +LP +IS+
Subjt:  ALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQ

Query:  NQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
         QSAF+PGR + DN ++ FE +H +  + +G+    ALKLDMSKAYDRVEW FL  +M  +GFA++W+ L++ C+ SVS+S  +NGE+ G    SRG+
Subjt:  NQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

SwissProt top hits
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.8e-25; % identity: 22.67)
Query:  RSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTA
        RSK      +++   ++ Q+     S +  R  + +  A+L+E+  ++ +     SR  +    ++  R        +++ N+I  +++D+G    + T 
Subjt:  RSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTA

Query:  VIQVMTDYFQHLFSSSGPSVQDFEVALRDLE-PSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSP
        +   + +Y++HL+++   ++++ +  L     P ++ E  ++L RP T  E++  +      K+PGPDG +  FY+ + + + P +++   ++   G  P
Subjt:  VIQVMTDYFQHLFSSSGPSVQDFEVALRDLE-PSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSP

Query:  GAVNDTMIVLIPKV-KAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKA
         +  +  I+LIPK  +   +  +FRPISL N+  K+++K+L NR++  + +LI  +Q  FIPG     N       I  + R          + +D  KA
Subjt:  GAVNDTMIVLIPKV-KAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKA

Query:  YDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKV
        +D+++  F+ + + +LG    ++ +I       + +  LNG+K+
Subjt:  YDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKV

P08548 LINE-1 reverse transcriptase homolog (e value: 2.5e-19; % identity: 24.3)
Query:  EKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLE-PSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNL
        + + + +++ +Y++ L+S    ++++ +  L     P +  +  + L RP +  E+   ++     K+PGPDG +  FY+   + + P ++     +   
Subjt:  EKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLE-PSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNL

Query:  GCSPGAVNDTMIVLIPKV-KAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGR---CVVDNAILGFECIHELRRRSRGRAKWAAL
        G  P    +  I LIPK  K   R  ++RPISL N+  K+++K+L NR++  + ++I  +Q  FIPG      +  +I   + I++L+ +         L
Subjt:  GCSPGAVNDTMIVLIPKV-KAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGR---CVVDNAILGFECIHELRRRSRGRAKWAAL

Query:  KLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKV
         +D  KA+D ++  F+   + ++G    ++ LI    S  + +  LNG K+
Subjt:  KLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKV

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 4.3e-19; % identity: 24.52)
Query:  IGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVAL-RDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVG
        I  + +++G    +   +   +  +++ L+S+   ++ + +  L R   P ++ +    L  P + +E+   +      K+PGPDG S  FY+   + + 
Subjt:  IGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVAL-RDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVG

Query:  PDIIQSCLAVLNLGCSPGAVNDTMIVLIPK-VKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRR
        P + +    +   G  P +  +  I LIPK  K   ++ +FRPISL N+  K+++K+L NR++  +  +I  +Q  FIPG     N       IH + + 
Subjt:  PDIIQSCLAVLNLGCSPGAVNDTMIVLIPK-VKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRR

Query:  SRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKV
                 + LD  KA+D+++  F+ +V+ R G    ++++I    S    +  +NGEK+
Subjt:  SRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKV

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.3e-34; % identity: 26.47)
Query:  LIGGDFNAVL-CQD----EKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPV
        +IGGDFN  L  +D    +K    +  L EL A  S+VD+   R+       FT+   R  G     R+DR++ ++  M    +  +    +  SDH  V
Subjt:  LIGGDFNAVL-CQD----EKEGGRDKPLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPV

Query:  ELVLTPPPQCWSQSGQRIVRFEETWLRQPGLQQLV---GRSWAAGPGESSDITP--------LAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGL
         L ++  P   S        F  + L   G  + V    R W A   E + +          L   C +    + +S SG     I + N  V      L
Subjt:  ELVLTPPPQCWSQSGQRIVRFEETWLRQPGLQQLV---GRSWAAGPGESSDITP--------LAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGL

Query:  STSDSRDL---LVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQD
        S S+ + L    ++ +  L  + Q +      RSR   L + DR +R+F+     +    +I  L  + G   ++  A+      ++Q+LFS   P   D
Subjt:  STSDSRDL---LVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQD

Query:  FEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDF
            L D  P V +   + L  P T +E+  AL+    NK+PG DGL+  F++  WD +GPD  +        G  P +    ++ L+PK    R + ++
Subjt:  FEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDF

Query:  RPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVD
        RP+SL +  YK+++K +  R+K +L ++I  +QS  +PGR + DN  L  + +H  RR        A L LD  KA+DRV+  +L   +    F  Q+V 
Subjt:  RPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVD

Query:  LILRCVSSVSFSFNLNGEKVGQVVPSRGL
         +    +S      +N      +   RG+
Subjt:  LILRCVSSVSFSFNLNGEKVGQVVPSRGL

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM (e value: 3.7e-10; % identity: 24)
Query:  QVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTK-EEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGA
        ++M  Y++ + +   PS    EV           +M+ +L R ++   E  L   +   + +PGPDG++    R   ++    +++    +L  G  P +
Subjt:  QVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTK-EEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGA

Query:  VNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDR
        +     V IPK   A+R  DFRPIS+ +V  + ++ +L  R+   +       Q  F+P     DNA +  + +  LR   +         LD+SKA+D 
Subjt:  VNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDR

Query:  VEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
        +  A + + +   G  + +VD +         S N +G    + VP+RG+
Subjt:  VEWAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL

Arabidopsis top hits
AT1G43760.1 DNAse I-like superfamily protein (e value: 8.9e-28; % identity: 26.67)
Query:  PLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEETWL
        P+  L  FQ+ +    L D+   G  +TW N +     I  +LDR   N  W   +P+ +        SDH P  ++L   P    +  ++  R+     
Subjt:  PLSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEETWL

Query:  RQPGLQQLVGRSWAAGPGESSDITPL------AKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEE--------
          P     +  +W       S +  L      AK+C + +    R   GN   + + A   ++S  + L T+ S  L      ++E V +++        
Subjt:  RQPGLQQLVGRSWAAGPGESSDITPL------AKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEE--------

Query:  EVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLF-SSSGPSVQDFEVALRDLEP-SVDDEMNQALLRPF
        E +++Q+SR  WL++GD NTR+FH      Q  N I  L  D  V  +  T V +++  Y+ HL  S S     D    ++D+ P   +D +   L    
Subjt:  EVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDDQGVWQQEKTAVIQVMTDYFQHLF-SSSGPSVQDFEVALRDLEP-SVDDEMNQALLRPF

Query:  TKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLIS
        + +E+  A+     NKAPGPD  +  F+   W +V    I +       G      N T I LIPKV    ++  FRP+S C V YK+I+
Subjt:  TKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVLNLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 2.5e-14; % identity: 39.77)
Query:  LVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILR
        +V R+K ++  LI   Q++FIPGR   DN +   E +H +RR+ +G   W  LKLD+ KAYDR+ W +L + ++  GF + W+  I R
Subjt:  LVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVEWAFLREVMLRLGFAQQWVDLILR


Sequences
CDS sequence
ATGTCTTCGGCGAAGCGTTTGTTGGGGTTTGATAACTGCTTTTGTGTTGATTGTCATGGACGGAGTGGTGGGTTGGCCCTCATGTGGGTTTCCTCGATATCCTTCAGCCT
TCTTTCATTCTCCAAGAATCACATTGATGGATGGATCTCATGGGGTGGTTGTAGGTGGCGGCTCACTGGTTTCTATGGTTTCCCTTCAGCTGATATGCACCCCCAGTCAT
GGTCTCTTCTGTCTAGACTGAGGGGCTGTGCTGAGACACCGTGGCTGATTGGTGGGGATTTTAACGCCGTTTTATGTCAGGATGAGAAGGAGGGGGGCAGGGATAAGCCA
TTGTCTGAACTGGCTGCCTTTCAGAGTGTGGTTGATTTGTGTGGTCTACGGGACTTGGGTTTTGTGGGGGATTGCTTTACTTGGTGTAACAGACGACCAGGAGGGGAGAC
GATTTATGAGCGGTTGGATCGGGTCTTTGGCAACACGCCTTGGATGGACCTCTATCCGAACTATGTGGTTAACCACCTGGATTACAGCCGGTCTGACCATAGGCCAGTGG
AGCTCGTTCTTACCCCTCCGCCGCAGTGTTGGTCTCAGAGTGGCCAGCGGATTGTTCGGTTTGAGGAAACTTGGCTTCGGCAACCGGGGTTGCAACAGTTGGTTGGTAGG
TCGTGGGCCGCCGGGCCTGGGGAGTCTAGTGATATAACGCCTCTGGCTAAGAGATGTATGCAGTCGATGGTCGGTTGGGGTCGATCAAAGTCCGGGAACTTCCTGAGGCG
CATTCGTAGTGCGAACCAGAGGGTTCAGTCGGCCATCGCTGGTCTTAGTACATCGGACTCTCGTGACTTGCTTGTTCAGGCCGAGGCTCAGTTGGAGGAGGTGTTGCAGG
AGGAAGAGGTATATTGGAAACAGAGGTCAAGGGAGTTATGGCTTCGAGAAGGGGATCGCAATACTCGGTGGTTCCACTGTCGAGCCTCTTACCGGCAGAAGCTTAATCGC
ATTGGAGGATTGGAGGATGATCAGGGAGTGTGGCAACAGGAGAAGACTGCAGTTATTCAGGTGATGACTGATTATTTTCAGCACCTGTTTTCCTCATCAGGTCCGAGTGT
TCAGGACTTTGAGGTGGCGTTGCGGGATTTGGAGCCTTCTGTGGATGATGAGATGAACCAAGCGTTATTGCGACCTTTTACCAAAGAGGAGGTTCTGTTGGCTTTGAAGC
AGACGCATCCTAACAAAGCCCCAGGTCCAGATGGGCTGTCGAGGAGCTTTTACAGGCACCACTGGGACATTGTTGGGCCAGATATTATTCAGAGTTGTTTGGCAGTTCTG
AATCTTGGGTGCTCCCCAGGTGCTGTTAATGATACTATGATCGTGCTCATCCCGAAGGTTAAAGCAGCCCGGCGGATGGTTGATTTTCGACCCATCTCCCTGTGTAATGT
GAGTTACAAGCTGATTTCGAAGGTCTTGGTCAACCGCATGAAGTATATACTGCCTCAGCTGATCTCGCAGAACCAGAGTGCTTTTATTCCAGGCAGATGTGTTGTGGACA
ATGCTATTCTGGGGTTTGAGTGTATCCATGAGCTGCGGAGGAGAAGTAGGGGGAGGGCCAAATGGGCAGCGCTAAAACTGGATATGAGTAAAGCATATGACAGGGTGGAA
TGGGCTTTCCTCCGGGAGGTTATGCTACGACTGGGTTTTGCGCAGCAGTGGGTTGATCTGATCCTCCGGTGTGTCAGCTCGGTTTCGTTTTCCTTCAATCTGAATGGGGA
AAAGGTGGGGCAGGTGGTACCGTCTAGGGGTCTCTAG
mRNA sequence
ATGTCTTCGGCGAAGCGTTTGTTGGGGTTTGATAACTGCTTTTGTGTTGATTGTCATGGACGGAGTGGTGGGTTGGCCCTCATGTGGGTTTCCTCGATATCCTTCAGCCT
TCTTTCATTCTCCAAGAATCACATTGATGGATGGATCTCATGGGGTGGTTGTAGGTGGCGGCTCACTGGTTTCTATGGTTTCCCTTCAGCTGATATGCACCCCCAGTCAT
GGTCTCTTCTGTCTAGACTGAGGGGCTGTGCTGAGACACCGTGGCTGATTGGTGGGGATTTTAACGCCGTTTTATGTCAGGATGAGAAGGAGGGGGGCAGGGATAAGCCA
TTGTCTGAACTGGCTGCCTTTCAGAGTGTGGTTGATTTGTGTGGTCTACGGGACTTGGGTTTTGTGGGGGATTGCTTTACTTGGTGTAACAGACGACCAGGAGGGGAGAC
GATTTATGAGCGGTTGGATCGGGTCTTTGGCAACACGCCTTGGATGGACCTCTATCCGAACTATGTGGTTAACCACCTGGATTACAGCCGGTCTGACCATAGGCCAGTGG
AGCTCGTTCTTACCCCTCCGCCGCAGTGTTGGTCTCAGAGTGGCCAGCGGATTGTTCGGTTTGAGGAAACTTGGCTTCGGCAACCGGGGTTGCAACAGTTGGTTGGTAGG
TCGTGGGCCGCCGGGCCTGGGGAGTCTAGTGATATAACGCCTCTGGCTAAGAGATGTATGCAGTCGATGGTCGGTTGGGGTCGATCAAAGTCCGGGAACTTCCTGAGGCG
CATTCGTAGTGCGAACCAGAGGGTTCAGTCGGCCATCGCTGGTCTTAGTACATCGGACTCTCGTGACTTGCTTGTTCAGGCCGAGGCTCAGTTGGAGGAGGTGTTGCAGG
AGGAAGAGGTATATTGGAAACAGAGGTCAAGGGAGTTATGGCTTCGAGAAGGGGATCGCAATACTCGGTGGTTCCACTGTCGAGCCTCTTACCGGCAGAAGCTTAATCGC
ATTGGAGGATTGGAGGATGATCAGGGAGTGTGGCAACAGGAGAAGACTGCAGTTATTCAGGTGATGACTGATTATTTTCAGCACCTGTTTTCCTCATCAGGTCCGAGTGT
TCAGGACTTTGAGGTGGCGTTGCGGGATTTGGAGCCTTCTGTGGATGATGAGATGAACCAAGCGTTATTGCGACCTTTTACCAAAGAGGAGGTTCTGTTGGCTTTGAAGC
AGACGCATCCTAACAAAGCCCCAGGTCCAGATGGGCTGTCGAGGAGCTTTTACAGGCACCACTGGGACATTGTTGGGCCAGATATTATTCAGAGTTGTTTGGCAGTTCTG
AATCTTGGGTGCTCCCCAGGTGCTGTTAATGATACTATGATCGTGCTCATCCCGAAGGTTAAAGCAGCCCGGCGGATGGTTGATTTTCGACCCATCTCCCTGTGTAATGT
GAGTTACAAGCTGATTTCGAAGGTCTTGGTCAACCGCATGAAGTATATACTGCCTCAGCTGATCTCGCAGAACCAGAGTGCTTTTATTCCAGGCAGATGTGTTGTGGACA
ATGCTATTCTGGGGTTTGAGTGTATCCATGAGCTGCGGAGGAGAAGTAGGGGGAGGGCCAAATGGGCAGCGCTAAAACTGGATATGAGTAAAGCATATGACAGGGTGGAA
TGGGCTTTCCTCCGGGAGGTTATGCTACGACTGGGTTTTGCGCAGCAGTGGGTTGATCTGATCCTCCGGTGTGTCAGCTCGGTTTCGTTTTCCTTCAATCTGAATGGGGA
AAAGGTGGGGCAGGTGGTACCGTCTAGGGGTCTCTAG
Protein sequence
MSSAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISWGGCRWRLTGFYGFPSADMHPQSWSLLSRLRGCAETPWLIGGDFNAVLCQDEKEGGRDKP
LSELAAFQSVVDLCGLRDLGFVGDCFTWCNRRPGGETIYERLDRVFGNTPWMDLYPNYVVNHLDYSRSDHRPVELVLTPPPQCWSQSGQRIVRFEETWLRQPGLQQLVGR
SWAAGPGESSDITPLAKRCMQSMVGWGRSKSGNFLRRIRSANQRVQSAIAGLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNR
IGGLEDDQGVWQQEKTAVIQVMTDYFQHLFSSSGPSVQDFEVALRDLEPSVDDEMNQALLRPFTKEEVLLALKQTHPNKAPGPDGLSRSFYRHHWDIVGPDIIQSCLAVL
NLGCSPGAVNDTMIVLIPKVKAARRMVDFRPISLCNVSYKLISKVLVNRMKYILPQLISQNQSAFIPGRCVVDNAILGFECIHELRRRSRGRAKWAALKLDMSKAYDRVE
WAFLREVMLRLGFAQQWVDLILRCVSSVSFSFNLNGEKVGQVVPSRGL
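
The genome location, CDS sequence and protein sequence in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, it assumes the coordinates are 1-based and inclusive, and it assumes the gene is a single uninterrupted CDS ending in a stop codon (the CDS listed above ends in TAG).

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    _, start, end = parse_location(location)
    return end - start + 1


def expected_residues(cds_length: int) -> int:
    """Residues encoded by a CDS of this length, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


span = span_length("chr2:7353103..7354899")
print(span, expected_residues(span))
```

Under these assumptions the span is 1797 bp, which would encode 598 residues plus a stop codon. Both figures can be compared against the lengths of the CDS and protein sequences listed above.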