; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028876 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028876
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:32242756..32244141
RNA-Seq ExpressionLag0028876
SyntenyLag0028876
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4268750.1 unnamed protein product [Prunus armeniaca]2.2e-8439.48Show/hide
Query:  VWNSSTRLNVVSYSKGHIDSVIE--DQFGKWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG
        +W     +++ S S  HID+ +E     G+WRFTGFYG P   ER  SW LL RL     LPW+  G FNEI+  +EK GG +R A+QM+ F+  ++ CG
Subjt:  VWNSSTRLNVVSYSKGHIDSVIE--DQFGKWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG

Query:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN
          D+GF G  FTW+R+  +   IR RL+R  A+     +    KV HLN  +SDH P+   + K  +++   + +   +FEE WV+ E C   I+E W  
Subjt:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN

Query:  SEIAS-PINISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEI-QRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTK
         E  S P  ++ K++   ++L  W++  + GSL   I   + ++ + L+A  +P   E       +LD L+ + E+YWR RSR  WL  GD NTK+FH K
Subjt:  SEIAS-PINISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEI-QRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTK

Query:  ASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFF
        AS RKR N I G+ D +  W + E E+  T   YF+ LFSS G  D    EV   +  ++SEE ++ L  DF+ EEI  AL  M P+KA GPDG   LF+
Subjt:  ASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFF

Query:  QKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYRQG
        Q+YW  +G DV +  L  L+ G+ +  +N T++ LIPK  +PK +T +RPISL N +Y+ G
Subjt:  QKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYRQG

XP_012836341.1 PREDICTED: uncharacterized protein LOC105956976 [Erythranthe guttata]7.0e-8335.78Show/hide
Query:  MLVWNSSTRLNVVSYSKGHIDSVIED--QFGKWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINL
        +L W     ++++SYS  HID+ + D     KWR TGFYG P    R +SW+LL  L+    +PW+VGG FNEI+ + EK GG  ++   +E F+  +++
Subjt:  MLVWNSSTRLNVVSYSKGHIDSVIED--QFGKWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINL

Query:  CGLVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIG-KKKRQCKFEEAWVKFEECNSIIKES
        C L D+GFEG  FTW  +     ++RERL+R  A++   ++    KV+HL    SDH P+  ++  +    R   +KKR  +FE  W++ +EC SI+   
Subjt:  CGLVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIG-KKKRQCKFEEAWVKFEECNSIIKES

Query:  WNNSEIASPIN-ISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAK-EGELDKLLEEEELYWRIRSREEWLLWGDNNTKWF
        +++  +A P+  +  K + C + L  W +T +    +  I +    +  L      ++ +  I + + E++K  EE ++YWR RS+ +W+  GD NTK+F
Subjt:  WNNSEIASPIN-ISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAK-EGELDKLLEEEELYWRIRSREEWLLWGDNNTKWF

Query:  HTKASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHA
        H KA+ R R N+++ + D+   W + + +I    + YF+ LFSS GP ++ I+EV   +   IS E ++ L   F+ +E+ +A+  M+P K+ GPDG   
Subjt:  HTKASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHA

Query:  LFFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYRQG
        +F+ KYW  +G DV T  L  LN       LN T+I LIPK K P+++TD RPISLCN +Y+ G
Subjt:  LFFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYRQG

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.0e-8940.43Show/hide
Query:  LVWNSSTRLNVVSYSKGHIDSVIEDQF--GKWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLC
        L+W     L V SY++ HID+VI +     KWR TGFYG P   +R  SW LL  L     LPW+  G FNEI+   EK GG +R   QM+ F+ ++N C
Subjt:  LVWNSSTRLNVVSYSKGHIDSVIEDQF--GKWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLC

Query:  GLVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWN
        G  D+G+ G  +TW      +N I  RL+R  A+   + K   +KV HL     DH  LL         +R   + ++  FE  W K E+C +II+ SW 
Subjt:  GLVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWN

Query:  -NSEIASPINISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAK-EGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT
           ++++P  IS  ++ C ++LS W+ T + G +   I      +  L       +  L I +   E++ LL++EE YW  R++  WL  GD NTK+FH 
Subjt:  -NSEIASPINISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAK-EGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT

Query:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF
        +AS+R++ N I GI+D   RW  +E+ I     SYF +++SS  P    I EV +AI FK++EE ++ L R+F++EE+  ALK + P KA GPDG  A+F
Subjt:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF

Query:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        FQKYW  +G +VT + L VLN    + +LNKT I LIPK  NPKR+TD RPISLCN VY+
Subjt:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR

XP_028090832.1 uncharacterized protein LOC114291041 [Camellia sinensis]3.2e-8337.39Show/hide
Query:  LVWNSSTRLNVVSYSKGHIDSVIEDQFGK--WRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLC
        L+W ++ +++V S+S GHID  ++   G+  WRF GFYG P + +R++SW LL RL     +PW+  G FNEI+F  EK G   R   QME F++V+   
Subjt:  LVWNSSTRLNVVSYSKGHIDSVIEDQFGK--WRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLC

Query:  GLVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFE---MKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKE
         L D+GF G  FTW  +      +RERL+R FA+D   L   + KV  +    S+H P+  +   +K++ + +  G   R  +FE  W++   C  ++ +
Subjt:  GLVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFE---MKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKE

Query:  SWNNSEIASPINISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNEL--WIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKW
        +W  S  A+  ++ S I H    L  W++  + G +K  +    D++QRL    S   +++        ++D+LLE+E ++W  R+R  WL  GD NT +
Subjt:  SWNNSEIASPINISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNEL--WIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKW

Query:  FHTKASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFS-----SLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASG
        FH+KA+QR +  KI GI D +  W  D+  +     +YF+ LFS     ++GPI + I E       ++S+  +  L R FSE E+  AL  M PTKA G
Subjt:  FHTKASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFS-----SLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASG

Query:  PDGAHALFFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        PDG  ALFFQK+W+ +G  V++V L VLN  ++++ +N T+I LIPK KNPK +++ RPISLCN VY+
Subjt:  PDGAHALFFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR

XP_030932272.1 uncharacterized protein LOC115958047 [Quercus lobata]3.4e-8540.13Show/hide
Query:  LVWNSSTRLNVVSYSKGHIDSVIED-QFGK-WRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLC
        L+W    ++++ S+S  HID++++    GK WR TGFYG+P   +RQ SW LL+RL     LPW+  G FNE++   EK GG +R AKQME F   IN+ 
Subjt:  LVWNSSTRLNVVSYSKGHIDSVIED-QFGK-WRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLC

Query:  GLVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRI-GKKKRQCKFEEAWVKFEECNSIIKESW
         L D+G+ G  FTW R   N+  I+ERL+R   S   A +   +++ H     SDH  L+    KE++  R  G++K+  +FEE W+K E+C  ++ E+W
Subjt:  GLVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRI-GKKKRQCKFEEAWVKFEECNSIIKESW

Query:  NNS-EIASPINISSKIQHCIMKLSAWNQTRL-KGSLKGAIFRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFH
        +    + S   +SS ++ C   LSAWN         K A+ +T  E        S +  E+   +  E++ LL+ EE+ WR RSR  WL  GD NT +FH
Subjt:  NNS-EIASPINISSKIQHCIMKLSAWNQTRL-KGSLKGAIFRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFH

Query:  TKASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHAL
        TKAS R   N I G+ DNS  W S++  I  +   YF SLF++  P+     E+ +AI  K++E  +  L RDF   EI  ALK M PT A GPDG   +
Subjt:  TKASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHAL

Query:  FFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        F+QK+W  +   V    L  LN G    + N+T+I LIPK K P+R+TD RPISLCN VY+
Subjt:  FFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR

TrEMBL top hitse value%identityAlignment
A0A2N9EL92 Reverse transcriptase domain-containing protein1.9e-8638.7Show/hide
Query:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG
        L+WN   ++ + ++S+ H+DS ++   G KWRFTGFYG P    +  SW LLD+L    + PW+  G FNEI+   E+ G      ++M+DF  V+N CG
Subjt:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG

Query:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN
        LVD+GF G  FTW      +  I++RL+R  A+       +   V H+    SDH PLL  M    +      K+R  KFEE W    EC +II++ W+ 
Subjt:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN

Query:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT
         E I SP+ I   KI+HC   L  W +  + G  +  I   T      + +N +  NN    + + E++KLL  EEL+WR RSR  WL  GD+NTK+FH+
Subjt:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT

Query:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF
        +A+QR+R+N + G++++ N W +DE +I     SYF  +F +  P++  + +   A+  +++ E ++ L + F+ +E+  AL  M P+KA GPDG  + F
Subjt:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF

Query:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        FQKYW  +G DV    L VLN G+ +  +N T+I LIPK KNP+R+++ RPISLCN VY+
Subjt:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR

A0A2N9HRH8 Reverse transcriptase domain-containing protein1.5e-8638.7Show/hide
Query:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG
        L+WN   ++ + ++S+ H+DS ++   G KWRFTGFYG P    +  SW LLD+L    + PW+  G FNEI+   E+ G      ++M+DF  V+N CG
Subjt:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG

Query:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN
        LVD+GF G  FTW      +  I++RL+R  A+       +   V H+    SDH PLL  M    +      K+R  KFEE W    EC +II++ W+ 
Subjt:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN

Query:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT
         E I SP+ I   KI+HC   L  W +  + G  +  I   T      + +N +  NN    + + E++KLL  EEL+WR RSR  WL  GD+NTK+FH+
Subjt:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT

Query:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF
        +A+QR+R+N + G++++ N W +DE +I     SYF  +F +  P++  + +   A+  +++ E ++ L + F+ +E+  AL  M P+KA GPDG  + F
Subjt:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF

Query:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        FQKYW  +G DV    L VLN G+ +  +N T+I LIPK KNP+R+++ RPISLCN VY+
Subjt:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR

A0A2N9HU09 Reverse transcriptase domain-containing protein1.9e-8638.7Show/hide
Query:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG
        L+WN   ++ + ++S+ H+DS ++   G KWRFTGFYG P    +  SW LLD+L    + PW+  G FNEI+   E+ G      ++M+DF  V+N CG
Subjt:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG

Query:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN
        LVD+GF G  FTW      +  I++RL+R  A+       +   V H+    SDH PLL  M    +      K+R  KFEE W    EC +II++ W+ 
Subjt:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN

Query:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT
         E I SP+ I   KI+HC   L  W +  + G  +  I   T      + +N +  NN    + + E++KLL  EEL+WR RSR  WL  GD+NTK+FH+
Subjt:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT

Query:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF
        +A+QR+R+N + G++++ N W +DE +I     SYF  +F +  P++  + +   A+  +++ E ++ L + F+ +E+  AL  M P+KA GPDG  + F
Subjt:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF

Query:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        FQKYW  +G DV    L VLN G+ +  +N T+I LIPK KNP+R+++ RPISLCN VY+
Subjt:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR

A0A2N9I475 Reverse transcriptase domain-containing protein1.9e-8638.7Show/hide
Query:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG
        L+WN   ++ + ++S+ H+DS ++   G KWRFTGFYG P    +  SW LLD+L    + PW+  G FNEI+   E+ G      ++M+DF  V+N CG
Subjt:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG

Query:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN
        LVD+GF G  FTW      +  I++RL+R  A+       +   V H+    SDH PLL  M    +      K+R  KFEE W    EC +II++ W+ 
Subjt:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN

Query:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT
         E I SP+ I   KI+HC   L  W +  + G  +  I   T      + +N +  NN    + + E++KLL  EEL+WR RSR  WL  GD+NTK+FH+
Subjt:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT

Query:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF
        +A+QR+R+N + G++++ N W +DE +I     SYF  +F +  P++  + +   A+  +++ E ++ L + F+ +E+  AL  M P+KA GPDG  + F
Subjt:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF

Query:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        FQKYW  +G DV    L VLN G+ +  +N T+I LIPK KNP+R+++ RPISLCN VY+
Subjt:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR

A0A2N9IT57 Reverse transcriptase domain-containing protein1.9e-8638.7Show/hide
Query:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG
        L+WN   ++ + ++S+ H+DS ++   G KWRFTGFYG P    +  SW LLD+L    + PW+  G FNEI+   E+ G      ++M+DF  V+N CG
Subjt:  LVWNSSTRLNVVSYSKGHIDSVIEDQFG-KWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCG

Query:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN
        LVD+GF G  FTW      +  I++RL+R  A+       +   V H+    SDH PLL  M    +      K+R  KFEE W    EC +II++ W+ 
Subjt:  LVDVGFEGDSFTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNN

Query:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT
         E I SP+ I   KI+HC   L  W +  + G  +  I   T      + +N +  NN    + + E++KLL  EEL+WR RSR  WL  GD+NTK+FH+
Subjt:  SE-IASPINI-SSKIQHCIMKLSAWNQTRLKGSLKGAI-FRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHT

Query:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF
        +A+QR+R+N + G++++ N W +DE +I     SYF  +F +  P++  + +   A+  +++ E ++ L + F+ +E+  AL  M P+KA GPDG  + F
Subjt:  KASQRKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALF

Query:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        FQKYW  +G DV    L VLN G+ +  +N T+I LIPK KNP+R+++ RPISLCN VY+
Subjt:  FQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.1e-0920.58Show/hide
Query:  GSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCGLVDV----GFEGDSFTWYRSP----------INKNSI
        G+PR  ++     +L  L+   D   ++ G FN  +   ++     +V K  ++  S ++   L+D+      +   +T++ +P          +   ++
Subjt:  GSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCGLVDV----GFEGDSFTWYRSP----------INKNSI

Query:  RERLNRF-----FASDLEALKVDNIKVRHLNLHQSDH---RPLLF-------EMKKEVKVVRIGKKKRQCKFEEAWVKFEE-CNS--IIKESWNNSEIAS
          +  R      + SD  A+K++ +++++L   +S       LL        EMK E+K+     + +   ++  W  F+  C    I   ++   +  S
Subjt:  RERLNRF-----FASDLEALKVDNIKVRHLNLHQSDH---RPLLF-------EMKKEVKVVRIGKKKRQCKFEEAWVKFEE-CNS--IIKESWNNSEIAS

Query:  PIN-ISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTKASQRKR
         I+ ++S+++    +L    QT  K S +        EI +++A            KE E  K L+      +I     W     N       +  ++KR
Subjt:  PIN-ISSKIQHCIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTKASQRKR

Query:  -SNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSS-LGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFFQKYW
          N+I+ I ++     +D  EI+ T   Y+K L+++ L  ++     +      ++++EE + L+R  +  EI+  +  +   K+ GPDG  A F+Q+Y 
Subjt:  -SNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSS-LGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFFQKYW

Query:  KDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPK-CKNPKRLTDMRPISLCN
        +++   +  +   +  +G       +  I LIPK  ++  +  + RPISL N
Subjt:  KDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPK-CKNPKRLTDMRPISLCN

P11369 LINE-1 retrotransposable element ORF2 protein6.5e-0724.19Show/hide
Query:  IQRLQAN-PSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTKASQRKRSN-KIEGIFDNSNRWVSDEDEIRATTASYFKSLFSS-
        +++ +AN P     +  I   GE+++ +E      RI     W     N       + ++  R    I  I +      +D +EI+ T  S++K L+S+ 
Subjt:  IQRLQAN-PSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTKASQRKRSN-KIEGIFDNSNRWVSDEDEIRATTASYFKSLFSS-

Query:  LGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPK-CK
        L  +D     + +    K+++++   L+   S +EI   +  +   K+ GPDG  A F+Q + +D+   +  +  ++  +G       +  I LIPK  K
Subjt:  LGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPK-CK

Query:  NPKRLTDMRPISLCN
        +P ++ + RPISL N
Subjt:  NPKRLTDMRPISLCN

Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein1.4e-0432.97Show/hide
Query:  ERQSSWTLLDRLKHH---CDLPWIVGGGFNEIMFDEEKLG-GPSRVAKQ-MEDFKSVINLCGLVDVGFEGDSFTWYRSPINKNSIRERLNR
        ER+S W  + RL      C+ PW+V G FN+I    E     PS ++ Q +ED ++ +    LVD+   G  +TW       N I  +L+R
Subjt:  ERQSSWTLLDRLKHH---CDLPWIVGGGFNEIMFDEEKLG-GPSRVAKQ-MEDFKSVINLCGLVDVGFEGDSFTWYRSPINKNSIRERLNR

AT1G43760.1 DNAse I-like superfamily protein7.2e-1721.83Show/hide
Query:  SYSKGHIDSVIEDQFGKWRFTGFYGSPRVEERQSSW--TLLDRLKHHCDLPWIVGGGFNEI--MFDEEKLGGPSRVAKQMEDFKSVINLCGLVDVGFEGD
        S S+ +  ++++     WR    Y    +      W  ++   +    D   I+ G F++I    D   +   S   + +E+F++ +    LVD+   G 
Subjt:  SYSKGHIDSVIEDQFGKWRFTGFYGSPRVEERQSSW--TLLDRLKHHCDLPWIVGGGFNEI--MFDEEKLGGPSRVAKQMEDFKSVINLCGLVDVGFEGD

Query:  SFTWYRSPINKNSIRERLNRFFAS-DLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSI-IKESWNNSEIASPI
         +TW     + N I  +L+R  A+ D  +     I V  L+   SDH P +  ++       + K+ ++C    +++       + +  +W   +I    
Subjt:  SFTWYRSPINKNSIRERLNRFFAS-DLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSI-IKESWNNSEIASPI

Query:  NISSKIQH------CIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTKASQ
        ++ S  +H      C   L+      ++   K A+   E    +L  NPS     +      + +      E ++R +SR +WL  GD NT++FH     
Subjt:  NISSKIQH------CIMKLSAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTKASQ

Query:  RKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPI--DRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFFQ
         +  N I+ +  + +  V +  +++    +Y+  L  S   I    ++  +K    F+ ++  +  L    S++EI  A+  M   KA GPD   A FF 
Subjt:  RKRSNKIEGIFDNSNRWVSDEDEIRATTASYFKSLFSSLGPI--DRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFFQ

Query:  KYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR
        + W  +              G  +   N T I LIPK     +L+  RP+S C  VY+
Subjt:  KYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNPKRLTDMRPISLCNGVYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGTGTGGAATAGCAGTACAAGGCTGAACGTTGTATCCTATTCGAAAGGGCACATAGATTCGGTCATTGAAGATCAGTTCGGTAAGTGGAGATTTACAGGTTTCTA
CGGAAGCCCAAGAGTGGAGGAGAGGCAGTCTTCTTGGACTCTATTAGATAGACTTAAACATCATTGTGATCTCCCGTGGATTGTAGGTGGGGGTTTCAATGAAATTATGT
TTGATGAGGAGAAGTTAGGTGGCCCCAGTCGTGTGGCTAAGCAGATGGAAGATTTTAAAAGTGTGATTAATTTGTGTGGTTTAGTTGATGTGGGCTTTGAAGGGGATAGT
TTCACTTGGTATAGAAGCCCAATAAACAAAAATTCCATAAGGGAGAGGCTGAATAGGTTCTTTGCTTCTGATTTGGAGGCCTTGAAGGTGGACAATATCAAAGTAAGACA
TCTAAATCTTCACCAATCAGACCACAGGCCTTTGCTATTTGAAATGAAAAAAGAGGTCAAGGTAGTCAGAATTGGGAAAAAGAAAAGGCAATGTAAGTTTGAAGAGGCTT
GGGTGAAATTTGAGGAATGCAACTCTATTATTAAGGAGAGTTGGAACAACTCTGAGATAGCCTCTCCTATCAACATTTCCTCCAAAATCCAGCATTGTATAATGAAGCTA
TCGGCTTGGAATCAGACCAGACTGAAAGGGTCATTAAAGGGAGCCATATTTAGAACAGAAGATGAAATTCAGAGACTTCAAGCGAACCCATCTCCTATGAATAATGAGTT
GTGGATAGCTAAGGAAGGTGAGTTGGATAAGTTGCTCGAGGAGGAAGAATTGTATTGGAGAATTCGGTCGAGAGAAGAGTGGTTATTGTGGGGGGATAATAACACCAAGT
GGTTCCACACTAAAGCCTCTCAAAGGAAGAGATCCAATAAAATAGAGGGGATCTTTGACAACTCCAATAGGTGGGTGAGCGATGAAGACGAGATAAGAGCCACGACTGCT
TCCTATTTCAAATCTCTTTTCAGCTCCTTGGGTCCTATAGACAGGGCCATTAACGAGGTGAAGAAGGCTATTTGCTTCAAGATCAGTGAAGAAGAATCCAAGTGGTTGGA
TAGGGATTTTTCCGAAGAAGAGATCCTTAAAGCGCTTAAGGGTATGAGCCCAACTAAAGCCTCGGGTCCAGACGGAGCTCATGCTTTATTTTTTCAAAAGTATTGGAAAG
ACATAGGGAAGGATGTCACCACTGTTTGTTTGAGAGTGCTGAACCAAGGGGAGGACATGGCTGATCTTAATAAAACTTACATCTTCCTTATTCCAAAATGCAAAAATCCA
AAGAGGTTGACCGATATGAGACCCATTAGCCTTTGCAATGGGGTGTATCGCCAAGGCCCTTGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGTGTGGAATAGCAGTACAAGGCTGAACGTTGTATCCTATTCGAAAGGGCACATAGATTCGGTCATTGAAGATCAGTTCGGTAAGTGGAGATTTACAGGTTTCTA
CGGAAGCCCAAGAGTGGAGGAGAGGCAGTCTTCTTGGACTCTATTAGATAGACTTAAACATCATTGTGATCTCCCGTGGATTGTAGGTGGGGGTTTCAATGAAATTATGT
TTGATGAGGAGAAGTTAGGTGGCCCCAGTCGTGTGGCTAAGCAGATGGAAGATTTTAAAAGTGTGATTAATTTGTGTGGTTTAGTTGATGTGGGCTTTGAAGGGGATAGT
TTCACTTGGTATAGAAGCCCAATAAACAAAAATTCCATAAGGGAGAGGCTGAATAGGTTCTTTGCTTCTGATTTGGAGGCCTTGAAGGTGGACAATATCAAAGTAAGACA
TCTAAATCTTCACCAATCAGACCACAGGCCTTTGCTATTTGAAATGAAAAAAGAGGTCAAGGTAGTCAGAATTGGGAAAAAGAAAAGGCAATGTAAGTTTGAAGAGGCTT
GGGTGAAATTTGAGGAATGCAACTCTATTATTAAGGAGAGTTGGAACAACTCTGAGATAGCCTCTCCTATCAACATTTCCTCCAAAATCCAGCATTGTATAATGAAGCTA
TCGGCTTGGAATCAGACCAGACTGAAAGGGTCATTAAAGGGAGCCATATTTAGAACAGAAGATGAAATTCAGAGACTTCAAGCGAACCCATCTCCTATGAATAATGAGTT
GTGGATAGCTAAGGAAGGTGAGTTGGATAAGTTGCTCGAGGAGGAAGAATTGTATTGGAGAATTCGGTCGAGAGAAGAGTGGTTATTGTGGGGGGATAATAACACCAAGT
GGTTCCACACTAAAGCCTCTCAAAGGAAGAGATCCAATAAAATAGAGGGGATCTTTGACAACTCCAATAGGTGGGTGAGCGATGAAGACGAGATAAGAGCCACGACTGCT
TCCTATTTCAAATCTCTTTTCAGCTCCTTGGGTCCTATAGACAGGGCCATTAACGAGGTGAAGAAGGCTATTTGCTTCAAGATCAGTGAAGAAGAATCCAAGTGGTTGGA
TAGGGATTTTTCCGAAGAAGAGATCCTTAAAGCGCTTAAGGGTATGAGCCCAACTAAAGCCTCGGGTCCAGACGGAGCTCATGCTTTATTTTTTCAAAAGTATTGGAAAG
ACATAGGGAAGGATGTCACCACTGTTTGTTTGAGAGTGCTGAACCAAGGGGAGGACATGGCTGATCTTAATAAAACTTACATCTTCCTTATTCCAAAATGCAAAAATCCA
AAGAGGTTGACCGATATGAGACCCATTAGCCTTTGCAATGGGGTGTATCGCCAAGGCCCTTGCTAA
Protein sequenceShow/hide protein sequence
MLVWNSSTRLNVVSYSKGHIDSVIEDQFGKWRFTGFYGSPRVEERQSSWTLLDRLKHHCDLPWIVGGGFNEIMFDEEKLGGPSRVAKQMEDFKSVINLCGLVDVGFEGDS
FTWYRSPINKNSIRERLNRFFASDLEALKVDNIKVRHLNLHQSDHRPLLFEMKKEVKVVRIGKKKRQCKFEEAWVKFEECNSIIKESWNNSEIASPINISSKIQHCIMKL
SAWNQTRLKGSLKGAIFRTEDEIQRLQANPSPMNNELWIAKEGELDKLLEEEELYWRIRSREEWLLWGDNNTKWFHTKASQRKRSNKIEGIFDNSNRWVSDEDEIRATTA
SYFKSLFSSLGPIDRAINEVKKAICFKISEEESKWLDRDFSEEEILKALKGMSPTKASGPDGAHALFFQKYWKDIGKDVTTVCLRVLNQGEDMADLNKTYIFLIPKCKNP
KRLTDMRPISLCNGVYRQGPC