; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036062 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036062
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:38191742..38194735
RNA-Seq ExpressionLag0036062
SyntenyLag0036062
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU30590.1 hypothetical protein TSUD_392810 [Trifolium subterraneum]5.7e-7743.11Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        +IK+GF ++W+   M C+SS N+S L+N E  G     RGLRQ DP SPYLF+LVAE L+ LI     +G L GV   +  PS+SHLLFADD  +FC++ 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
          E   L  +L  YE  SG+ IN SKS +FFS+  SR     LS+++GV +V   G YL +PS++ R   + F+FI D++W+ +  W+    S AGKEI+
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        IK + QAIP+Y+MS++ LP  L  +I +    FWWG  +N K + W +WKK+ +PK  G L F D   FN A++AKQ    I NP+ L A+  K+ YF  
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI
         ++L +  G N S  WR ++  R+++  G R+SI
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI

GAU34086.1 hypothetical protein TSUD_255820 [Trifolium subterraneum]6.8e-7843.41Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        +IK+GF  +W+   M C+SS N+S L+N E  G     RGLRQ DP SPYLF++VAEGL+ LI      G + G+   +  PS+SHLLFADD  +FC+A 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
          E   L  +L +YE  SG+ IN +KS +FFS+ +SR     LS+++GV +V   G YL +PS++ R K + F+FI DK+W+ +  W+    S AG EI+
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        IK + QAIP+Y+MS++ LP  L ++I R    FWWG  +N K + W +WK++  PK  G L F D + FN A++AKQ   LI NP+ L AK  K+ YF  
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI
         ++L ++ G N S  WR ++  R+++  G R+SI
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI

XP_013645762.1 uncharacterized protein LOC106350421 [Brassica napus]4.7e-7946.11Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        M ++GF   WI   M C+SS +FS+LING P+G     RG+RQ DP SPYLF+L AE LSHL++    + SLLGV  +   P+++HLLFADD+  F +A 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
              LK +LNLYE VSG+++N +KS++ F    S      +++LLG+      G YL +P     KK + F +I+DK+ EA  GW R F S  GKEIL
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        +K +  A+P + MS+F+LPK + EEI    A FWWGS  N K MHW SW++IC PK  G L F D+E FNQAL+ KQV RL+  P+ LAA+ +K+ Y+ +
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI
         +IL A+  R  S++W+ L + R+L+ QG+RF I
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI

XP_023889222.1 LOW QUALITY PROTEIN: uncharacterized protein LOC112001275 [Quercus suber]1.2e-7742.51Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        M K+GFD+ W+AL M+CI++ ++S++INGEP  V   SRG+RQ DP SPYLFLL  EGL +L+      G + GVS  +  P ++HL FADD+ +FC+A 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
          E   ++ LL+ YE  SG+ +N  K++LFFSK    D    + + LGV  V+    YL +P+++ + K + F +I +++W  L GWK    S AG+E+L
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        +K + QAIP+Y MS FKLP  L  EI     +FWWG    ++K+HW  W  +C PK+LG + F D++ FN+A++AKQV RL+ N D L  +F K+ +F +
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI
        G+IL+AE G N S  W+ +   R +I +G+ + +
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI

XP_030478973.1 uncharacterized protein LOC115696051 [Cannabis sativa]3.0e-7838.66Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        M+   F  S+++L MKCI+++N S LING   G    SRG+RQ DP SPYLFLL AEGLS L+ +   +GSL GV+ S+  PSISHLLFADD+ +FC A+
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
             +L  +LNLY   SG+ INF+KS++ FS    +D      +   +     +  YL +P  L R K + F+F+ DK+   L  W   +FS AGKEIL
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        +K + QAIP+Y+M+ F+LP +L + I    ARFWWGSS N  K+HW SWKK+C  KS+G L F  +  FNQA++AKQ  ++   PD L  + +K+ YF +
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI-------------------------------------------------------VKSGYNLFVK-
         ++L+A  G N S+ WR + W R+L+  G+ + I                                                       VKS Y+L +  
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI-------------------------------------------------------VKSGYNLFVK-

Query:  NKISAESSNNNMRKFWTNLWSLKIPMKYAELI
          I + SS+ N RKFW  +W  K P K    I
Subjt:  NKISAESSNNNMRKFWTNLWSLKIPMKYAELI

TrEMBL top hitse value%identityAlignment
A0A2N9EMZ0 Reverse transcriptase domain-containing protein6.0e-8045.9Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        M K+GF N W+ L M+CIS+ ++S+L+NGEP G    SRGLRQ DP SPYLFLL AEGL  LI    I+G+L GVS S+  P I+HL FADD+ +FCKA 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
          ++I ++G+L+ YE  SG+ +N  K+ LFFSK       + + ++LGV  ++    YL +PS + R K   F+ I +++W  L GWK    S AG+EIL
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        IK + QAIP+Y MS F+LP RL +EI     RFWWG   +K KMHW  W+ +C  K  G +   D+  FN+AL+AKQV RL+ NP  L +K  K+ YF  
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQG
         +IL+A+     S+ W+ +   R+LI +G
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQG

A0A2N9FFZ2 Reverse transcriptase domain-containing protein6.0e-8045.9Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        M K+GF N W+ L M+CIS+ ++S+L+NGEP G    SRGLRQ DP SPYLFLL AEGL  LI    I+G+L GVS S+  P I+HL FADD+ +FCKA 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
          ++I ++G+L+ YE  SG+ +N  K+ LFFSK       + + ++LGV  ++    YL +PS + R K   F+ I +++W  L GWK    S AG+EIL
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        IK + QAIP+Y MS F+LP RL +EI     RFWWG   +K KMHW  W+ +C  K  G +   D+  FN+AL+AKQV RL+ NP  L +K  K+ YF  
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQG
         +IL+A+     S+ W+ +   R+LI +G
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQG

A0A2N9HYE3 Reverse transcriptase domain-containing protein6.0e-8045.9Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        M K+GF N W+ L M+CIS+ ++S+L+NGEP G    SRGLRQ DP SPYLFLL AEGL  LI    I+G+L GVS S+  P I+HL FADD+ +FCKA 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
          ++I ++G+L+ YE  SG+ +N  K+ LFFSK       + + ++LGV  ++    YL +PS + R K   F+ I +++W  L GWK    S AG+EIL
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        IK + QAIP+Y MS F+LP RL +EI     RFWWG   +K KMHW  W+ +C  K  G +   D+  FN+AL+AKQV RL+ NP  L +K  K+ YF  
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQG
         +IL+A+     S+ W+ +   R+LI +G
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQG

A0A803PTB0 Uncharacterized protein1.9e-8146.96Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        M+KLGFD+ WI    +CISS +FSVLING P  +F   RGLRQ DP SP+LFL  AE LS LI      G + G+   +   S+SHL FADD+ +F +A 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
        + E   +K +L  YEA SG++INF+KS + F K          + L GV  V+    YL IP  + RKK + F+ I  ++W  L GWKRS FS+  KE+L
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        IK + QA+P Y M VF LPK+   E+    ARFWWGSS+ KKK HW +W K+C PK  G L F D+E FN+AL+AKQV R++ NP  L  K +KS YF  
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSIVKSGYNLFVKN
         +IL A+ G  +S +WR L W RE++  G R+  V SG N+ + N
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSIVKSGYNLFVKN

A0A803PWX1 Uncharacterized protein5.4e-8138.34Show/hide
Query:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE
        M+KLG+D  W++  M+C++S  FS LINGE +G     RGLRQ DP SP+LFLL AE  S LI     +G L G+   +   S+SHL FADD+ VF  A 
Subjt:  MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAE

Query:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
        E E    K LL  Y   SG+ +NF KS + F +  +     +L++++GV  V++ G YL +PS + R K + F FI +K+W  L GWK SFFS+AGKE+L
Subjt:  EHELINLKGLLNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD
        IK + QAIP+Y MS F+LPK+    I    ARFWWGSSE   K+HWC W  +C  K  G L F D+  FNQAL+AKQV R I  P+ L ++ +K+ Y+ +
Subjt:  IKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKD

Query:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI---------------------------------------------VKSGYN---LFVKNKISAESS
        G +++A+ G + S +WR L W +++I +G R+ I                                              +SGY+      K++   +S 
Subjt:  GNILQAEFGRNTSHLWRILFWDRELIYQGVRFSI---------------------------------------------VKSGYN---LFVKNKISAESS

Query:  NNNMRKFWTNLWSLKIPMKYAELIAIM--EGLKLSIVLGRKNIAVE
             K+W+ LW LKIP K    +  M    +  +  L  ++I VE
Subjt:  NNNMRKFWTNLWSLKIPMKYAELIAIM--EGLKLSIVLGRKNIAVE

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657504.7e-1332.5Show/hide
Query:  ILTRKKTKD-FSFIMDKLWEALGGWKRSFFSSAGKEILIKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLN
        +L ++  KD F  I++++   + GW+    S AG+  L K +  ++P + MS   LP+ +   + +    F WGS+  KKK H   W K+C PK  G L 
Subjt:  ILTRKKTKD-FSFIMDKLWEALGGWKRSFFSSAGKEILIKCLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLN

Query:  FTDMEGFNQALIAKQVSRLI
            +  N+ALI+K   RL+
Subjt:  FTDMEGFNQALIAKQVSRLI

P11369 LINE-1 retrotransposable element ORF2 protein7.0e-0924.1Show/hide
Query:  GFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLI-SSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAEEHE
        G    ++ +     S    ++ +NGE     P   G RQ  P SPYLF +V E L+  I     I G  +G         +   L ADD  V+    ++ 
Subjt:  GFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLI-SSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAEEHE

Query:  LINLKGLLNLYEAVSGESINFSKS-ALFFSKGYSRDRGVYLSSLLGV--NYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL
           L  L+N +  V G  IN +KS A  ++K    ++ +  ++   +  N ++ LG  + +   +     K+F  +  ++ E L  WK    S  G+  +
Subjt:  LINLKGLLNLYEAVSGESINFSKS-ALFFSKGYSRDRGVYLSSLLGV--NYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEIL

Query:  IK--CLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAK
        +K   L +AI  +     K+P +   E+     +F W    NKK     S  K    ++ G +   D++ + +A++ K
Subjt:  IK--CLDQAIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAK

P92555 Uncharacterized mitochondrial protein AtMg012509.7e-1153.73Show/hide
Query:  LINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADD
        +ING P+G+   SRGLRQ DP SPYLF+L  E LS L       G L G+  S   P I+HLLFADD
Subjt:  LINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADD

P93295 Uncharacterized mitochondrial protein AtMg003101.1e-2240Show/hide
Query:  AIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPK-SLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKDGNILQ
        A+P Y MS F+L K L +++T     FWW S ENK+K+ W +W+K+C  K   G L F D+  FNQAL+AKQ  R+I  P  L ++ ++S YF   ++++
Subjt:  AIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPK-SLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKDGNILQ

Query:  AEFGRNTSHLWRILFWDRELIYQGV
           G   S+ WR +   REL+ +G+
Subjt:  AEFGRNTSHLWRILFWDRELIYQGV

Arabidopsis top hitse value%identityAlignment
AT2G13980.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.0e-0431.65Show/hide
Query:  AELIAIMEGLKLSIVLGRKNIAVESDSLQALNLIKGDEEPRSELGSLVDEIRQLASAFDGISFSFISRNLNIIADTIAK
        AE  A++  L+ + + G   + +E D     NL+ G     + L +L+D+IR  A  F  + FSF+ R  N +A  +AK
Subjt:  AELIAIMEGLKLSIVLGRKNIAVESDSLQALNLIKGDEEPRSELGSLVDEIRQLASAFDGISFSFISRNLNIIADTIAK

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.4e-0632.26Show/hide
Query:  SLKIPMKYAELIAIMEGLKLSIVLGRKNIAVESDSLQALNLIKGDEEPRSELGSLVDEIRQLASAFDGISFSFISRNLNIIADTIAKKAKIEF
        ++++P+  AE IA+   L+ +  +G   +++ SDS Q +  I   E P +E   ++ +I  L+  F  +SFSF+ R+ N +AD +AK + I F
Subjt:  SLKIPMKYAELIAIMEGLKLSIVLGRKNIAVESDSLQALNLIKGDEEPRSELGSLVDEIRQLASAFDGISFSFISRNLNIIADTIAKKAKIEF

AT4G29090.1 Ribonuclease H-like superfamily protein8.7e-2335.77Show/hide
Query:  AIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKDGNILQA
        A+P+Y M+ F LPK + ++I    A FWW + +  K MHW +W  +   K+ G + F D+E FN AL+ KQ+ R++  P+ L AK  KS YF   + L A
Subjt:  AIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKDGNILQA

Query:  EFGRNTSHLWRILFWDRELIYQGVRFSIVKSGYNLFV
          G   S +W+ +   +E++ QG R ++V +G ++ +
Subjt:  EFGRNTSHLWRILFWDRELIYQGVRFSIVKSGYNLFV

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.9e-2440Show/hide
Query:  AIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPK-SLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKDGNILQ
        A+P Y MS F+L K L +++T     FWW S ENK+K+ W +W+K+C  K   G L F D+  FNQAL+AKQ  R+I  P  L ++ ++S YF   ++++
Subjt:  AIPSYLMSVFKLPKRLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPK-SLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKDGNILQ

Query:  AEFGRNTSHLWRILFWDRELIYQGV
           G   S+ WR +   REL+ +G+
Subjt:  AEFGRNTSHLWRILFWDRELIYQGV

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)6.9e-1253.73Show/hide
Query:  LINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADD
        +ING P+G+   SRGLRQ DP SPYLF+L  E LS L       G L G+  S   P I+HLLFADD
Subjt:  LINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAAATTAGGATTTGACAATAGTTGGATTGCTTTAACTATGAAATGTATATCATCTGCTAACTTTTCAGTTTTAATCAATGGAGAGCCTAAAGGAGTATTTCCTTC
TAGTCGTGGACTTCGACAAAGAGATCCAAGTTCTCCATATCTTTTTCTTCTAGTTGCAGAAGGGCTTTCACATCTTATTTCCTCTGTTAACATTAATGGCAGCCTATTAG
GTGTTTCTTGTTCGCAGATGTGTCCATCTATTTCTCACTTACTGTTTGCTGATGACAACCAAGTTTTTTGTAAAGCTGAAGAACATGAGCTGATCAATCTGAAAGGATTA
CTTAACCTCTATGAAGCTGTCTCAGGAGAGAGCATCAATTTCTCAAAATCTGCCTTATTTTTTTCCAAAGGTTACAGCCGAGACAGAGGAGTCTACCTAAGCAGCCTGCT
GGGAGTGAACTATGTTGAAGATCTGGGGAACTACTTGGACATCCCATCTATCCTCACAAGGAAAAAAACAAAGGATTTCAGTTTTATCATGGATAAGCTATGGGAAGCAT
TAGGCGGATGGAAAAGATCTTTTTTCTCATCAGCAGGAAAAGAAATTTTAATCAAGTGCCTGGATCAGGCAATTCCTTCTTACTTGATGAGTGTCTTCAAGCTTCCCAAG
CGCCTCCACGAAGAAATAACCAGAACATTTGCTAGGTTCTGGTGGGGCTCCTCGGAGAACAAGAAGAAGATGCACTGGTGCTCTTGGAAGAAAATTTGCTTCCCAAAGAG
CCTTGGCAGCCTAAACTTCACAGACATGGAGGGATTCAACCAAGCTTTAATCGCTAAACAGGTGTCGAGACTTATAATCAATCCTGACATTTTGGCCGCTAAGTTCATTA
AGAGTATATACTTTAAGGATGGAAATATCCTCCAAGCAGAATTTGGAAGGAATACATCTCACCTATGGCGAATCCTCTTCTGGGATCGTGAGCTAATATATCAAGGTGTC
CGTTTTAGTATCGTCAAAAGCGGCTACAATCTCTTCGTGAAGAACAAAATCAGTGCTGAATCGAGCAACAATAACATGAGGAAATTCTGGACAAACCTGTGGTCTTTGAA
AATTCCAATGAAGTATGCAGAGCTAATTGCAATAATGGAAGGCCTAAAGCTTAGTATTGTCCTGGGAAGGAAAAACATTGCGGTAGAATCAGACAGTCTTCAAGCGCTCA
ACCTCATAAAAGGGGATGAAGAACCCAGAAGTGAGTTGGGTTCGTTAGTGGATGAAATCCGACAGTTGGCCTCAGCCTTTGATGGAATCTCTTTCTCTTTTATTTCGAGA
AATTTGAACATAATAGCTGACACTATTGCAAAAAAGGCAAAAATCGAATTCTGTAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGATAAAATTAGGATTTGACAATAGTTGGATTGCTTTAACTATGAAATGTATATCATCTGCTAACTTTTCAGTTTTAATCAATGGAGAGCCTAAAGGAGTATTTCCTTC
TAGTCGTGGACTTCGACAAAGAGATCCAAGTTCTCCATATCTTTTTCTTCTAGTTGCAGAAGGGCTTTCACATCTTATTTCCTCTGTTAACATTAATGGCAGCCTATTAG
GTGTTTCTTGTTCGCAGATGTGTCCATCTATTTCTCACTTACTGTTTGCTGATGACAACCAAGTTTTTTGTAAAGCTGAAGAACATGAGCTGATCAATCTGAAAGGATTA
CTTAACCTCTATGAAGCTGTCTCAGGAGAGAGCATCAATTTCTCAAAATCTGCCTTATTTTTTTCCAAAGGTTACAGCCGAGACAGAGGAGTCTACCTAAGCAGCCTGCT
GGGAGTGAACTATGTTGAAGATCTGGGGAACTACTTGGACATCCCATCTATCCTCACAAGGAAAAAAACAAAGGATTTCAGTTTTATCATGGATAAGCTATGGGAAGCAT
TAGGCGGATGGAAAAGATCTTTTTTCTCATCAGCAGGAAAAGAAATTTTAATCAAGTGCCTGGATCAGGCAATTCCTTCTTACTTGATGAGTGTCTTCAAGCTTCCCAAG
CGCCTCCACGAAGAAATAACCAGAACATTTGCTAGGTTCTGGTGGGGCTCCTCGGAGAACAAGAAGAAGATGCACTGGTGCTCTTGGAAGAAAATTTGCTTCCCAAAGAG
CCTTGGCAGCCTAAACTTCACAGACATGGAGGGATTCAACCAAGCTTTAATCGCTAAACAGGTGTCGAGACTTATAATCAATCCTGACATTTTGGCCGCTAAGTTCATTA
AGAGTATATACTTTAAGGATGGAAATATCCTCCAAGCAGAATTTGGAAGGAATACATCTCACCTATGGCGAATCCTCTTCTGGGATCGTGAGCTAATATATCAAGGTGTC
CGTTTTAGTATCGTCAAAAGCGGCTACAATCTCTTCGTGAAGAACAAAATCAGTGCTGAATCGAGCAACAATAACATGAGGAAATTCTGGACAAACCTGTGGTCTTTGAA
AATTCCAATGAAGTATGCAGAGCTAATTGCAATAATGGAAGGCCTAAAGCTTAGTATTGTCCTGGGAAGGAAAAACATTGCGGTAGAATCAGACAGTCTTCAAGCGCTCA
ACCTCATAAAAGGGGATGAAGAACCCAGAAGTGAGTTGGGTTCGTTAGTGGATGAAATCCGACAGTTGGCCTCAGCCTTTGATGGAATCTCTTTCTCTTTTATTTCGAGA
AATTTGAACATAATAGCTGACACTATTGCAAAAAAGGCAAAAATCGAATTCTGTAACTAA
Protein sequenceShow/hide protein sequence
MIKLGFDNSWIALTMKCISSANFSVLINGEPKGVFPSSRGLRQRDPSSPYLFLLVAEGLSHLISSVNINGSLLGVSCSQMCPSISHLLFADDNQVFCKAEEHELINLKGL
LNLYEAVSGESINFSKSALFFSKGYSRDRGVYLSSLLGVNYVEDLGNYLDIPSILTRKKTKDFSFIMDKLWEALGGWKRSFFSSAGKEILIKCLDQAIPSYLMSVFKLPK
RLHEEITRTFARFWWGSSENKKKMHWCSWKKICFPKSLGSLNFTDMEGFNQALIAKQVSRLIINPDILAAKFIKSIYFKDGNILQAEFGRNTSHLWRILFWDRELIYQGV
RFSIVKSGYNLFVKNKISAESSNNNMRKFWTNLWSLKIPMKYAELIAIMEGLKLSIVLGRKNIAVESDSLQALNLIKGDEEPRSELGSLVDEIRQLASAFDGISFSFISR
NLNIIADTIAKKAKIEFCN