; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018781 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018781
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:34268175..34270526
RNA-Seq ExpressionLag0018781
SyntenyLag0018781
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]7.7e-9840.67Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        MMLK+GF+A WV  +  CIS   +S    G   G++ P    RQG P SPYLFL+C EG SCLL  AER   + G+++A   PS++H  FAD+S+LF +A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE
           + +A+  +   YE   GQ INY KS +S         F+   GV     + +  + YLGL     + R      +KD++W+ I GWK KLL   G+E
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE

Query:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF
        +L+K+++QAIP YSM CF++PK L ++++ +MARFWW   K+ R IHWV                                              + RY 
Subjt:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF

Query:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI
          V FLEAEVG+ PSFIWRSL WGKELL KG+ WRVG+G  I+VY   W+    C K+ SP  LP    V DL T  GQWN  LL+      EV+ IL I
Subjt:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI

Query:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP
        PL  +   D  IWH+E++G+YSVKS YRL        +   S+   L S +WK+ W + IP+K+K FLWR   D LP
Subjt:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]6.9e-9940.59Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        MMLK+GF+A WV  +  CIS   +S    G   G++ P    RQG P SPYLFL+C EG SCLLH AER   + G+++A   PS++H  FAD+S+LF +A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSST----------GVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGR
             +A+  +   YE   GQ INY KS +S + +           G+N    + +  + YLGL     + R      +KD++W+ I GWK KLL   G+
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSST----------GVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGR

Query:  EVLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERY
        E+L+K+++QAIP YSM CFQ+PK L ++++ +MARFWW   K+ R IHWV                                              + RY
Subjt:  EVLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERY

Query:  FSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILT
           V FLEAEVG+ PSFIW SL WGKELL KG+ WRVG+G  I+VY   W+    C K+ SP  LP    V DL T  GQWN  LL+      EV+ IL 
Subjt:  FSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILT

Query:  IPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP
        IPL  +   D  IWH+E++G+YSVKS YRL +      +   S+   L S +WK+ W + IP+K+K FLWR   D LP
Subjt:  IPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]2.7e-9540.88Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        MMLK+GF+A WV  +  CIS   +S    G   G++ P    RQG P SPYLFL+C EG SCLL  AER   + G+++A  GPS++H  FAD+S+LF +A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE
              A+  +   YE   GQ INY KS  S         F+   GV     + Q  ++YLGL     + R      +KD++W+ I GWK KLL   G+E
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE

Query:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF
        +L+K+++QAIP YSM CF++PK L ++++ +MARFWW   K+ R IHWV                                              + RY 
Subjt:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF

Query:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI
          V FLEAEVG+ PSFIWRSL WGKELL KG+ WRVGNG  I+VY   W+      K+ SP  LP    V DL T  GQWN  LL+      EV+  L I
Subjt:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI

Query:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP
        PL  +   D  IWH+E++G+YSVKS YRL        +   S    L S +WK+ W + IP+K+K FLWR   D LP
Subjt:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]8.5e-9739.46Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        MM+K+G++ GWV  I  C++ V++SF +NG   G V P    RQGDP SP+LFLLCAE  S L+ +AE+   + G+     G  +SH FFAD+SL+F  A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSSTGVNAQNQLSQI--------FQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREV
         E+E    + +L KY    GQ +N+ KS + F  +     + QL+ +        + +YLGL  F+ R +     FIK+RVW +++GWKG       +EV
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSSTGVNAQNQLSQI--------FQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREV

Query:  LLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHW----VLKERYFSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKI
        L+K+IVQAIP Y+M CF+LPK+ I  I  + ARFWW   ++D +IHW    VLK  Y+     LEA+ G+  SF+WRSL+WGK++++KG  WR+GN   +
Subjt:  LLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHW----VLKERYFSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKI

Query:  RVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTIPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSS
        RV   +W+   +  K+     LP   +V DL    G W++E +R    P + + IL++P      ED+ +WH+ K G YSV+S YR+   + L      S
Subjt:  RVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTIPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSS

Query:  SNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVKGVDV
          E+   WWK  WK+ IP KVK F+W++    +PT   LA + + V
Subjt:  SNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVKGVDV

XP_030964220.1 uncharacterized protein LOC115985421 [Quercus lobata]3.1e-9136.43Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        +M+K+GF   W++L+  C+S V YS  +NG  CGN++PS   RQGDP SP LFLLCAEG S L+HEA R++ I+G+ +    P I+H FFAD+SLLF +A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSSTGVNAQNQLSQIF--------QEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREV
           E  A+  +L KYE   GQ IN DKS + F+ +T  + +  +  I          +YLGL   + + +T   + +KDRV +++ GWKGKLL  GGRE+
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSSTGVNAQNQLSQIF--------QEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREV

Query:  LLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHW---------------------------------------------VLKERYFS
        L+K++ QA+P Y+M CFQLPK L  D+  +M  FWW    ++ +I W                                             V K +YF 
Subjt:  LLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHW---------------------------------------------VLKERYFS

Query:  GVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSP-ITLPHDAHVADLITVEGQ-WNEELLRHHLRPHEVNFILT
          D L ++ GS PS+ WRS+    E++R+G  WRVGNG +I ++   W+      KV SP +       V+ LI ++ + W  +++R    PHE + IL 
Subjt:  GVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSP-ITLPHDAHVADLITVEGQ-WNEELLRHHLRPHEVNFILT

Query:  IPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAP--TSSSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVKGVDVLNEILMGSE
        IP+ +   +D  IW   K G +SVKS Y +    L+D +    SSS+ S    WKR W+  +P K+KIF WR C++ LPT+  L  +G+   +       
Subjt:  IPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAP--TSSSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVKGVDVLNEILMGSE

Query:  WGPLLRNVRAGSMINILQDIKAKMDWG
        + PL   V   ++  +L    AKM WG
Subjt:  WGPLLRNVRAGSMINILQDIKAKMDWG

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein3.7e-9840.67Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        MMLK+GF+A WV  +  CIS   +S    G   G++ P    RQG P SPYLFL+C EG SCLL  AER   + G+++A   PS++H  FAD+S+LF +A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE
           + +A+  +   YE   GQ INY KS +S         F+   GV     + +  + YLGL     + R      +KD++W+ I GWK KLL   G+E
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE

Query:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF
        +L+K+++QAIP YSM CF++PK L ++++ +MARFWW   K+ R IHWV                                              + RY 
Subjt:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF

Query:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI
          V FLEAEVG+ PSFIWRSL WGKELL KG+ WRVG+G  I+VY   W+    C K+ SP  LP    V DL T  GQWN  LL+      EV+ IL I
Subjt:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI

Query:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP
        PL  +   D  IWH+E++G+YSVKS YRL        +   S+   L S +WK+ W + IP+K+K FLWR   D LP
Subjt:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP

A0A5E4FZN9 PREDICTED: retrotransposon1.3e-9540.88Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        MMLK+GF+A WV  +  CIS   +S    G   G++ P    RQG P SPYLFL+C EG SCLL  AER   + G+++A  GPS++H  FAD+S+LF +A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE
              A+  +   YE   GQ INY KS  S         F+   GV     + Q  ++YLGL     + R      +KD++W+ I GWK KLL   G+E
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE

Query:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF
        +L+K+++QAIP YSM CF++PK L ++++ +MARFWW   K+ R IHWV                                              + RY 
Subjt:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF

Query:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI
          V FLEAEVG+ PSFIWRSL WGKELL KG+ WRVGNG  I+VY   W+      K+ SP  LP    V DL T  GQWN  LL+      EV+  L I
Subjt:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI

Query:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP
        PL  +   D  IWH+E++G+YSVKS YRL        +   S    L S +WK+ W + IP+K+K FLWR   D LP
Subjt:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP

A0A803QCG6 Uncharacterized protein3.5e-9639.88Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        +ML+MGFA+ WV LI  CI+   +SF++NG   G+V P    RQGDP SPYLFL+C+EGLS LLH  E    + GL+L    P++SH  FAD+SLLF RA
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSSTGVNAQN--------QLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREV
        N+  A+AI+  L  Y    GQ +N +KS +SF+ +T + AQ+         +++  + YLGL  +  RD+    S IK++VW+ +  W  ++   GG+EV
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSSTGVNAQN--------QLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREV

Query:  LLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHW---VLKERYFSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIR
        LLK++VQ+IP Y+M CF+L K+    +  +MA FWW  ++   +IHW   +LK RYFS   FL+A +G  PS+ W+S+ WG+ELL KG+ ++VGNG  I 
Subjt:  LLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHW---VLKERYFSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIR

Query:  VYGSNWISDDVCLKVQSPITL--PHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTIPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTS
             WI      K   PI+   P +  V+ LIT +  WN E+L    +P +V  ILTIPL    S DR IWH   SG Y+VKS + L   S L++    
Subjt:  VYGSNWISDDVCLKVQSPITL--PHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTIPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTS

Query:  SSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGL-AVKGVDVLNEILMGSEW---GPLLRNVRAGSMINILQDIKAKMDWGSGIVVRNSVGLAM
        S+++  L+WWK  W + +P K++IF W++  + LPT   L   K +D     L  S W   G  L + +      I QD K   D+     ++N   L +
Subjt:  SSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGL-AVKGVDVLNEILMGSEW---GPLLRNVRAGSMINILQDIKAKMDWGSGIVVRNSVGLAM

Query:  CSTV
         S+V
Subjt:  CSTV

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)3.7e-9840.67Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        MMLK+GF+A WV  +  CIS   +S    G   G++ P    RQG P SPYLFL+C EG SCLL  AER   + G+++A   PS++H  FAD+S+LF +A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE
           + +A+  +   YE   GQ INY KS +S         F+   GV     + +  + YLGL     + R      +KD++W+ I GWK KLL   G+E
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLIS---------FNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGRE

Query:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF
        +L+K+++QAIP YSM CF++PK L ++++ +MARFWW   K+ R IHWV                                              + RY 
Subjt:  VLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERYF

Query:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI
          V FLEAEVG+ PSFIWRSL WGKELL KG+ WRVG+G  I+VY   W+    C K+ SP  LP    V DL T  GQWN  LL+      EV+ IL I
Subjt:  SGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILTI

Query:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP
        PL  +   D  IWH+E++G+YSVKS YRL        +   S+   L S +WK+ W + IP+K+K FLWR   D LP
Subjt:  PLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP

M5WJW2 Reverse transcriptase domain-containing protein3.4e-9940.59Show/hide
Query:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA
        MMLK+GF+A WV  +  CIS   +S    G   G++ P    RQG P SPYLFL+C EG SCLLH AER   + G+++A   PS++H  FAD+S+LF +A
Subjt:  MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRA

Query:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSST----------GVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGR
             +A+  +   YE   GQ INY KS +S + +           G+N    + +  + YLGL     + R      +KD++W+ I GWK KLL   G+
Subjt:  NESEALAIRVMLLKYECPLGQTINYDKSLISFNSST----------GVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGR

Query:  EVLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERY
        E+L+K+++QAIP YSM CFQ+PK L ++++ +MARFWW   K+ R IHWV                                              + RY
Subjt:  EVLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV---------------------------------------------LKERY

Query:  FSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILT
           V FLEAEVG+ PSFIW SL WGKELL KG+ WRVG+G  I+VY   W+    C K+ SP  LP    V DL T  GQWN  LL+      EV+ IL 
Subjt:  FSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRHHLRPHEVNFILT

Query:  IPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP
        IPL  +   D  IWH+E++G+YSVKS YRL +      +   S+   L S +WK+ W + IP+K+K FLWR   D LP
Subjt:  IPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLS-WWKRCWKMGIPSKVKIFLWRLCIDRLP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.9e-1523.2Show/hide
Query:  RDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREVLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWVLKERYFS-------GV---
        R    T   I +RV  ++ GW+ K L   GR  L K+++ ++P +SM    LP+ ++  + ++   F W    E ++ H V   +  S       GV   
Subjt:  RDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREVLLKSIVQAIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWVLKERYFS-------GV---

Query:  ----DFLEAEVGSRP---------------------------------SFIWRSLMWG-KELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQS-PITL
              L ++VG R                                  S  WRS+  G ++++  G+ W  G+G++IR +   W+S    L++ +     
Subjt:  ----DFLEAEVGSRP---------------------------------SFIWRSLMWG-KELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQS-PITL

Query:  PHDAHVADLITVEGQ-WNEELLRHHLRPHEVNFILTIPLRHV-WSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLSWWKRCWKMGIPSK
          D  VA  + + G+ W+   +  +   +    +  + L  V  + DR  W F + G +SV+S Y +     +D+ P      ++ S++   WK+ +P +
Subjt:  PHDAHVADLITVEGQ-WNEELLRHHLRPHEVNFILTIPLRHV-WSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLSWWKRCWKMGIPSK

Query:  VKIFLW
        VK FLW
Subjt:  VKIFLW

P92555 Uncharacterized mitochondrial protein AtMg012501.5e-0844.93Show/hide
Query:  FNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADES
        F +NG   G V PS   RQGDP SPYLF+LC E LS L   A+    + G++++   P I+H  FAD++
Subjt:  FNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADES

P93295 Uncharacterized mitochondrial protein AtMg003103.2e-1428.19Show/hide
Query:  AIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV----------------------------------------------LKERYFSGVDFLE
        A+P Y+M CF+L K L + ++  M  FWW+  +  R+I WV                                              L+ RYF     +E
Subjt:  AIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV----------------------------------------------LKERYFSGVDFLE

Query:  AEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCL
          VG+RPS+ WRS++ G+ELL +G+   +G+G   +V+   WI D+  L
Subjt:  AEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCL

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein5.4e-1726.9Show/hide
Query:  LKERYFSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQ---WNEELLRHHLRP
        +K RYF  V  L+A+V  + S+ W SL+ G  LL+KG    +G+G+ IR+   N +       + +  T   +  + +L   +G    W++  +   +  
Subjt:  LKERYFSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQ---WNEELLRHHLRP

Query:  HEVNFILTIPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVKGVDV
         +  FI  I L      D+ IW++  +G Y+V+S Y L         P  +     +    R W + I  K+K FLWR     L T + L  +G+ +
Subjt:  HEVNFILTIPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVKGVDV

AT4G29090.1 Ribonuclease H-like superfamily protein3.2e-2527.97Show/hide
Query:  AIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHW---------------------------------------------VLKERYFSGVDFLEA
        A+P Y+M CF LPK + + I  V+A FWW   +E + +HW                                             V K RYF   D L A
Subjt:  AIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHW---------------------------------------------VLKERYFSGVDFLEA

Query:  EVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISD---DVCLKV-----QSPITLPHDAHVADLITVEG-QWNEELLRHHLRPHEVNFILT
         +GSRPSF+W+S+   +E+LR+G    VGNGE I ++   W+        L++     Q   ++     V+DLI   G +W ++++       E   I  
Subjt:  EVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISD---DVCLKV-----QSPITLPHDAHVADLITVEG-QWNEELLRHHLRPHEVNFILT

Query:  IPLRHVWSEDRAIWHFEKSGVYSVKSRY-RLGQRSLLDQAPTSSSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVK
        +        D   W +  SG Y+VKS Y  L Q      +P   S  SL   +++ WK     K++ FLW+   + LP    LA +
Subjt:  IPLRHVWSEDRAIWHFEKSGVYSVKSRY-RLGQRSLLDQAPTSSSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVK

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.3e-1528.19Show/hide
Query:  AIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV----------------------------------------------LKERYFSGVDFLE
        A+P Y+M CF+L K L + ++  M  FWW+  +  R+I WV                                              L+ RYF     +E
Subjt:  AIPCYSMKCFQLPKRLIQDISRVMARFWWNGDKEDRRIHWV----------------------------------------------LKERYFSGVDFLE

Query:  AEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCL
          VG+RPS+ WRS++ G+ELL +G+   +G+G   +V+   WI D+  L
Subjt:  AEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.1e-0944.93Show/hide
Query:  FNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADES
        F +NG   G V PS   RQGDP SPYLF+LC E LS L   A+    + G++++   P I+H  FAD++
Subjt:  FNVNGVRCGNVAPS---RQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTTGAAGATGGGTTTCGCTGCTGGCTGGGTAGAACTAATTCACCTCTGTATTTCATTAGTCAGGTACTCTTTTAATGTGAATGGCGTTAGGTGTGGGAATGTGGC
TCCAAGTAGACAGGGTGATCCACCATCTCCGTATCTGTTTTTGTTGTGTGCTGAGGGTCTCTCTTGTTTGTTGCATGAGGCTGAGAGGTCTAAGACTATCTCAGGGCTAA
AACTAGCTTGGGTTGGCCCATCCATATCCCACCGGTTCTTTGCTGACGAAAGTCTTCTATTTTTTCGGGCCAATGAGAGTGAGGCCCTAGCTATTCGTGTGATGCTCTTG
AAGTATGAATGTCCCTTGGGTCAAACTATCAACTATGACAAGTCTCTTATCTCATTTAACTCGAGTACAGGTGTGAACGCCCAAAACCAGTTGAGTCAAATTTTTCAGGA
GTATTTGGGCTTGTCGTTCTTTATGCCTCGAGACAGAACGAGCACATTGAGTTTTATTAAGGATCGTGTCTGGCAGCAAATTCAAGGCTGGAAGGGTAAGTTGTTGTATG
AAGGGGGCAGAGAGGTCTTGTTGAAATCTATTGTCCAGGCTATCCCGTGCTACTCTATGAAATGCTTTCAGTTGCCCAAGAGGCTTATTCAAGACATTAGTAGGGTCATG
GCTCGCTTCTGGTGGAATGGGGATAAAGAGGATCGTAGGATCCATTGGGTGTTGAAGGAACGATATTTTTCGGGTGTAGATTTTTTGGAGGCTGAGGTGGGGTCGAGACC
CTCTTTCATATGGAGGAGTCTCATGTGGGGGAAGGAGCTTTTGAGAAAGGGTATTCACTGGAGAGTGGGGAATGGTGAGAAAATCAGGGTGTATGGGTCTAATTGGATCT
CGGATGATGTTTGCCTGAAGGTGCAGTCCCCAATCACCTTACCGCATGATGCTCATGTTGCTGACCTTATCACAGTTGAGGGGCAGTGGAACGAAGAGTTACTTCGGCAC
CACCTCCGCCCCCACGAGGTAAATTTCATCCTTACTATTCCTCTTCGACATGTTTGGTCTGAGGATAGAGCTATCTGGCATTTTGAGAAAAGCGGTGTTTACTCTGTTAA
GAGTAGGTACCGGTTAGGCCAAAGGAGCTTGCTTGACCAGGCCCCAACCTCTTCTTCTAATGAGTCTTTACTTAGTTGGTGGAAGAGATGTTGGAAGATGGGGATCCCTA
GTAAAGTGAAGATCTTTTTGTGGAGACTTTGTATTGATCGTCTTCCTACAGTGGATGGTTTGGCGGTTAAAGGTGTTGATGTTTTGAATGAAATTTTGATGGGCTCTGAA
TGGGGTCCTTTGTTGAGGAATGTGCGAGCAGGCTCCATGATTAATATTCTCCAGGATATCAAGGCTAAGATGGATTGGGGTTCGGGGATTGTAGTCAGGAATAGTGTTGG
CTTGGCCATGTGCTCGACAGTAGTAAAACATGTGAATGTGAGATGCTCGGATATGACCGAGGGGCTGGCAGTAGTAGATGGTTTTCGACTTGCGTCGAAAATGGGCCTTT
TTCCTCTGATCCTGAAGTCAGATTCCATGAGGATTGTTCAGCTGCTGCGTGGTGAGGGAATATCAGACTTGTCTGTGGTAGGGGCGGTGGTTACTACTTTGCGCAAGGAG
GTTCCAGGGGGATGTGGTTTCGTTGCAGTGGATGTGAGCCTCCTGCTAGAAGATGACGCCCAATCTTTCTTTGCATTAGAGCCTGAAAGAGTCAAGGCAGGATTAGCAAT
GAAATCCAAGAATCCGCTATACCAGAAAGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGTTGAAGATGGGTTTCGCTGCTGGCTGGGTAGAACTAATTCACCTCTGTATTTCATTAGTCAGGTACTCTTTTAATGTGAATGGCGTTAGGTGTGGGAATGTGGC
TCCAAGTAGACAGGGTGATCCACCATCTCCGTATCTGTTTTTGTTGTGTGCTGAGGGTCTCTCTTGTTTGTTGCATGAGGCTGAGAGGTCTAAGACTATCTCAGGGCTAA
AACTAGCTTGGGTTGGCCCATCCATATCCCACCGGTTCTTTGCTGACGAAAGTCTTCTATTTTTTCGGGCCAATGAGAGTGAGGCCCTAGCTATTCGTGTGATGCTCTTG
AAGTATGAATGTCCCTTGGGTCAAACTATCAACTATGACAAGTCTCTTATCTCATTTAACTCGAGTACAGGTGTGAACGCCCAAAACCAGTTGAGTCAAATTTTTCAGGA
GTATTTGGGCTTGTCGTTCTTTATGCCTCGAGACAGAACGAGCACATTGAGTTTTATTAAGGATCGTGTCTGGCAGCAAATTCAAGGCTGGAAGGGTAAGTTGTTGTATG
AAGGGGGCAGAGAGGTCTTGTTGAAATCTATTGTCCAGGCTATCCCGTGCTACTCTATGAAATGCTTTCAGTTGCCCAAGAGGCTTATTCAAGACATTAGTAGGGTCATG
GCTCGCTTCTGGTGGAATGGGGATAAAGAGGATCGTAGGATCCATTGGGTGTTGAAGGAACGATATTTTTCGGGTGTAGATTTTTTGGAGGCTGAGGTGGGGTCGAGACC
CTCTTTCATATGGAGGAGTCTCATGTGGGGGAAGGAGCTTTTGAGAAAGGGTATTCACTGGAGAGTGGGGAATGGTGAGAAAATCAGGGTGTATGGGTCTAATTGGATCT
CGGATGATGTTTGCCTGAAGGTGCAGTCCCCAATCACCTTACCGCATGATGCTCATGTTGCTGACCTTATCACAGTTGAGGGGCAGTGGAACGAAGAGTTACTTCGGCAC
CACCTCCGCCCCCACGAGGTAAATTTCATCCTTACTATTCCTCTTCGACATGTTTGGTCTGAGGATAGAGCTATCTGGCATTTTGAGAAAAGCGGTGTTTACTCTGTTAA
GAGTAGGTACCGGTTAGGCCAAAGGAGCTTGCTTGACCAGGCCCCAACCTCTTCTTCTAATGAGTCTTTACTTAGTTGGTGGAAGAGATGTTGGAAGATGGGGATCCCTA
GTAAAGTGAAGATCTTTTTGTGGAGACTTTGTATTGATCGTCTTCCTACAGTGGATGGTTTGGCGGTTAAAGGTGTTGATGTTTTGAATGAAATTTTGATGGGCTCTGAA
TGGGGTCCTTTGTTGAGGAATGTGCGAGCAGGCTCCATGATTAATATTCTCCAGGATATCAAGGCTAAGATGGATTGGGGTTCGGGGATTGTAGTCAGGAATAGTGTTGG
CTTGGCCATGTGCTCGACAGTAGTAAAACATGTGAATGTGAGATGCTCGGATATGACCGAGGGGCTGGCAGTAGTAGATGGTTTTCGACTTGCGTCGAAAATGGGCCTTT
TTCCTCTGATCCTGAAGTCAGATTCCATGAGGATTGTTCAGCTGCTGCGTGGTGAGGGAATATCAGACTTGTCTGTGGTAGGGGCGGTGGTTACTACTTTGCGCAAGGAG
GTTCCAGGGGGATGTGGTTTCGTTGCAGTGGATGTGAGCCTCCTGCTAGAAGATGACGCCCAATCTTTCTTTGCATTAGAGCCTGAAAGAGTCAAGGCAGGATTAGCAAT
GAAATCCAAGAATCCGCTATACCAGAAAGGGTGA
Protein sequenceShow/hide protein sequence
MMLKMGFAAGWVELIHLCISLVRYSFNVNGVRCGNVAPSRQGDPPSPYLFLLCAEGLSCLLHEAERSKTISGLKLAWVGPSISHRFFADESLLFFRANESEALAIRVMLL
KYECPLGQTINYDKSLISFNSSTGVNAQNQLSQIFQEYLGLSFFMPRDRTSTLSFIKDRVWQQIQGWKGKLLYEGGREVLLKSIVQAIPCYSMKCFQLPKRLIQDISRVM
ARFWWNGDKEDRRIHWVLKERYFSGVDFLEAEVGSRPSFIWRSLMWGKELLRKGIHWRVGNGEKIRVYGSNWISDDVCLKVQSPITLPHDAHVADLITVEGQWNEELLRH
HLRPHEVNFILTIPLRHVWSEDRAIWHFEKSGVYSVKSRYRLGQRSLLDQAPTSSSNESLLSWWKRCWKMGIPSKVKIFLWRLCIDRLPTVDGLAVKGVDVLNEILMGSE
WGPLLRNVRAGSMINILQDIKAKMDWGSGIVVRNSVGLAMCSTVVKHVNVRCSDMTEGLAVVDGFRLASKMGLFPLILKSDSMRIVQLLRGEGISDLSVVGAVVTTLRKE
VPGGCGFVAVDVSLLLEDDAQSFFALEPERVKAGLAMKSKNPLYQKG