; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011722 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011722
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:31652943..31655451
RNA-Seq ExpressionLag0011722
SyntenyLag0011722
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4309574.1 unnamed protein product [Prunus armeniaca]7.5e-3630Show/hide
Query:  VSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS------------DR---PSPSNPDRMHAWWSGLWRLNVPSKHKFFLW
        V DLF +SG WN  +L+  F + + +AILRIPL      D L+WH+E++G +S            D+    S +  D    +W  +W L +P+K KFFLW
Subjt:  VSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS------------DR---PSPSNPDRMHAWWSGLWRLNVPSKHKFFLW

Query:  RLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFELVVIFWWSAQQRDNGECG
        R   + LP    L KR +  +S+C  C   VE   H  W C V + +W  S +          +F +  ++  ++LP              Q        
Subjt:  RLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFELVVIFWWSAQQRDNGECG

Query:  GWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGM
         WRPP  G  K+ VD ++K        G V+R   GE FMAAC      W   L   +     E  A  +G+  A  +GF+D ++E D+   +  +    
Subjt:  GWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGM

Query:  EDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRV-WLEEGPSEISDALRGDVIS
        E       L+++V   L  +      ++PR GN+VAH LA  A    D V W+EE P  +   L  DV++
Subjt:  EDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRV-WLEEGPSEISDALRGDVIS

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]2.6e-3629.06Show/hide
Query:  VSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS------------DRPS--PS-NPDRMHAWWSGLWRLNVPSKHKFFLW
        V DLF +SG WN  +L+  F + + +A L+IPL    G D LIWH+E++G +S            D+ S  PS   D    +W  +W L +P+K KFFLW
Subjt:  VSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS------------DRPS--PS-NPDRMHAWWSGLWRLNVPSKHKFFLW

Query:  RLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFELVVIFWWSAQQRDN----
        R   + LP    L  R +  + +C  C    E   H  W C   + +W  S +      +    F E+  A++ S  G +  L     W    R N    
Subjt:  RLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFELVVIFWWSAQQRDN----

Query:  ---------------------------------------GECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEG
                                                   GWRPP  G  K+NVD +VK        G V+R ANGE FMAAC          +   
Subjt:  ---------------------------------------GECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEG

Query:  WSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRV-WLEEGP
        +     E  A  +G+  A  +GF   V+E D+   +  +    E     G L+++V   LH +      ++PR GN+VAH LA  AF   + V W+EE P
Subjt:  WSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRV-WLEEGP

Query:  SEISDALRGDVIS
          +   L  DV+S
Subjt:  SEISDALRGDVIS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.7e-4329.78Show/hide
Query:  GGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFSDR---------------PSPSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHNRLP
        GGW   V+R  F   + + IL IP+  G  +DRLIW++EK G +S R               PS S+ + +  WW+G W++++P+K K FLWRL  +RLP
Subjt:  GGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFSDR---------------PSPSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHNRLP

Query:  TKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFA---------PHHRSFSQFRFEE---VIWAMKDSLPGPDFE----------LVVI
        T  NL KR + +++ C  C    ED  HLFW C   E++W+ SKF            H S S+  FEE   VIW + +      F           + ++
Subjt:  TKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFA---------PHHRSFSQFRFEE---VIWAMKDSLPGPDFE----------LVVI

Query:  FWWSAQQRDNGECGG--------------WRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGV
         W +    +  E                 W+PP  G  K+N DAS       A  G ++    G++  AA   L+           SVD+AE  A  +G+
Subjt:  FWWSAQQRDNGECGG--------------WRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGV

Query:  WLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNS---KILFSPRQGNRVAHALASLAFSYAD-RVWLEEGPSEISDALRGDVI
         LA ++G                +H  +ED+SE G +   V +A + W  S      F  R+GN+ AH LA  A    +  +W+E+ P E+   L  + +
Subjt:  WLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNS---KILFSPRQGNRVAHALASLAFSYAD-RVWLEEGPSEISDALRGDVI

Query:  SSL
          L
Subjt:  SSL

XP_023904177.1 uncharacterized protein LOC112015942 [Quercus suber]2.2e-3528.57Show/hide
Query:  WNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFSDRPS---PSNPDRMHAW------------WSGLWRLNVPSKHKFFLWRLFHNRLPTK
        W   ++   F+  D +AI RIPL     DD +IW   K+G +S +          R   W            W  LW++NVP+K K F WR     LPT+
Subjt:  WNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFSDRPS---PSNPDRMHAW------------WSGLWRLNVPSKHKFFLWRLFHNRLPTK

Query:  VNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFELVVI-FWWSAQQRDNGECGG--------
        +NL KR +   + C +C +  E+  H+ W C V + +W  S+         Q    ++   +   L   D  L ++  W    QR++   GG        
Subjt:  VNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFELVVI-FWWSAQQRDNGECGG--------

Query:  ------------------------------WRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKG
                                      WRPPA    K+N DA++  D+G +  G V+R   GE+ MAA  ++    S       S D AE  A  K 
Subjt:  ------------------------------WRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKG

Query:  VWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSY-ADRVWLEEGPSEISDALRGDVIS
        +  A   GF + ++E DS+ ++K L     D+S VG ++ D++  +         +  R  NRVAHALA  A S   D  W+E+ P    DAL  D IS
Subjt:  VWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSY-ADRVWLEEGPSEISDALRGDVIS

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]4.8e-3528.57Show/hide
Query:  SCPSFA--CTVSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS--------------DRPSPSNPDRMHAWWSGLWRLNV
        S PS     TV++L      W E ++  HF   D EAI++IPL     +D+LIWH++K G +S              + PS SN D+    W  +W+L +
Subjt:  SCPSFA--CTVSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS--------------DRPSPSNPDRMHAWWSGLWRLNV

Query:  PSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAM----KDSLPGPDFELVVIF
        P K K FLWR  H+ LPT  NL K+ +    +C  C   VE   H    C     +W  S  A   R   +    +++W +    +        E+  + 
Subjt:  PSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAM----KDSLPGPDFELVVIF

Query:  W--WSAQQR--------------DNGEC------------------------GGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQ
        W  W A+ +               N E                           W PP  G  K+NVDA+V  +   A  G V+R ++G    AA  SL+
Subjt:  W--WSAQQR--------------DNGEC------------------------GGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQ

Query:  RCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYA
           SV +AE  +++    W    G+ +A +      + E+DSL ++ +++     ++E+G L+ D++  L  + N K   SPR  N  AH+LA LA    
Subjt:  RCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYA

Query:  DRV-WLEEGPSEI
        + V WL+E P EI
Subjt:  DRV-WLEEGPSEI

TrEMBL top hitse value%identityAlignment
A0A5E4FZN9 PREDICTED: retrotransposon1.2e-3629.06Show/hide
Query:  VSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS------------DRPS--PS-NPDRMHAWWSGLWRLNVPSKHKFFLW
        V DLF +SG WN  +L+  F + + +A L+IPL    G D LIWH+E++G +S            D+ S  PS   D    +W  +W L +P+K KFFLW
Subjt:  VSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS------------DRPS--PS-NPDRMHAWWSGLWRLNVPSKHKFFLW

Query:  RLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFELVVIFWWSAQQRDN----
        R   + LP    L  R +  + +C  C    E   H  W C   + +W  S +      +    F E+  A++ S  G +  L     W    R N    
Subjt:  RLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFELVVIFWWSAQQRDN----

Query:  ---------------------------------------GECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEG
                                                   GWRPP  G  K+NVD +VK        G V+R ANGE FMAAC          +   
Subjt:  ---------------------------------------GECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEG

Query:  WSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRV-WLEEGP
        +     E  A  +G+  A  +GF   V+E D+   +  +    E     G L+++V   LH +      ++PR GN+VAH LA  AF   + V W+EE P
Subjt:  WSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRV-WLEEGP

Query:  SEISDALRGDVIS
          +   L  DV+S
Subjt:  SEISDALRGDVIS

A0A6J1DAR4 uncharacterized protein LOC1110189548.0e-4429.78Show/hide
Query:  GGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFSDR---------------PSPSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHNRLP
        GGW   V+R  F   + + IL IP+  G  +DRLIW++EK G +S R               PS S+ + +  WW+G W++++P+K K FLWRL  +RLP
Subjt:  GGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFSDR---------------PSPSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHNRLP

Query:  TKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFA---------PHHRSFSQFRFEE---VIWAMKDSLPGPDFE----------LVVI
        T  NL KR + +++ C  C    ED  HLFW C   E++W+ SKF            H S S+  FEE   VIW + +      F           + ++
Subjt:  TKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFA---------PHHRSFSQFRFEE---VIWAMKDSLPGPDFE----------LVVI

Query:  FWWSAQQRDNGECGG--------------WRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGV
         W +    +  E                 W+PP  G  K+N DAS       A  G ++    G++  AA   L+           SVD+AE  A  +G+
Subjt:  FWWSAQQRDNGECGG--------------WRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGV

Query:  WLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNS---KILFSPRQGNRVAHALASLAFSYAD-RVWLEEGPSEISDALRGDVI
         LA ++G                +H  +ED+SE G +   V +A + W  S      F  R+GN+ AH LA  A    +  +W+E+ P E+   L  + +
Subjt:  WLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNS---KILFSPRQGNRVAHALASLAFSYAD-RVWLEEGPSEISDALRGDVI

Query:  SSL
          L
Subjt:  SSL

M5VU98 Reverse transcriptase domain-containing protein1.9e-3730.5Show/hide
Query:  VSDLFATSGG--WNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS----------------DRPSPSNPDRMHAWWSGLWRLNVPSKHKF
        VS+L    G   W+   L   F   D   I+RIPL      DR++W+++KHG F+                D  S SN D     W  +W   VP+K K 
Subjt:  VSDLFATSGG--WNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS----------------DRPSPSNPDRMHAWWSGLWRLNVPSKHKF

Query:  FLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPH-HRSFSQFRFEEVIWAMK---DSLPGPDFELVVIFWWSAQQ
        F WR+ H+ LPTK NL+K+ +++  +C+ C +  E   H+   C    + W  S    H H+   +   E V +A +   + +   D    V    + + 
Subjt:  FLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPH-HRSFSQFRFEEVIWAMK---DSLPGPDFELVVIFWWSAQQ

Query:  RDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLV
        RD      W  P +G LK N D +  P +G    G V R A+G    A   S        + E  S + AE  A  +GV LA  LG    + E DS  +V
Subjt:  RDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLV

Query:  KILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADR-VWLEEGPSEISDALRGDVIS
          +    +D S +G +++DV+     + +S   F+PR+ N VAH LA       D  +W E  P  I DAL  DV+S
Subjt:  KILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADR-VWLEEGPSEISDALRGDVIS

M5XHI9 Reverse transcriptase domain-containing protein7.3e-3730.24Show/hide
Query:  VSDLFATSGG--WNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS----------------DRPSPSNPDRMHAWWSGLWRLNVPSKHKF
        VS+L    G   W+   L   F   D   I+RIPL      DR++W+++KHG F+                D  S SN D     W  +W   VP+K K 
Subjt:  VSDLFATSGG--WNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS----------------DRPSPSNPDRMHAWWSGLWRLNVPSKHKF

Query:  FLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPH-HRSFSQFRFEEVIWAMK---DSLPGPDFELVVIFWWSAQQ
        F WR+ H+ LPTK NL+K+ +++  +C+ C +  E   H+   C    + W  S    H H+   +   E V +A +   + +   D    V    + + 
Subjt:  FLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPH-HRSFSQFRFEEVIWAMK---DSLPGPDFELVVIFWWSAQQ

Query:  RDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLV
        RD      W  P +G LK N D +  P +G    G V R A+G    A   S        + E  S + AE     +GV LA  LG    + E DS  +V
Subjt:  RDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLV

Query:  KILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADR-VWLEEGPSEISDALRGDVIS
          +    +D S +G +++DV+     + +S   F+PR+ N VAH LA       D  +W E  P  I DAL  DV+S
Subjt:  KILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADR-VWLEEGPSEISDALRGDVIS

M5XK32 Reverse transcriptase domain-containing protein (Fragment)2.8e-3629.71Show/hide
Query:  VSDLFATSGG--WNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS----------------DRPSPSNPDRMHAWWSGLWRLNVPSKHKF
        VS+L    G   W+   L   F   D    +RIPL      DR++W+++KHG F+                D  S SN D     W  +W   VP+K K 
Subjt:  VSDLFATSGG--WNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFS----------------DRPSPSNPDRMHAWWSGLWRLNVPSKHKF

Query:  FLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPH-HRSFSQFRFEEVIWAMK---DSLPGPDFELVVIFWWSAQQ
        F WR+ H+ LPTK NL+K+ +++  +C+ C +  E   H+   C    + W  S    H H+   +   + V +A +   + +   D    V    + + 
Subjt:  FLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPH-HRSFSQFRFEEVIWAMK---DSLPGPDFELVVIFWWSAQQ

Query:  RDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLV
        RD      W  P++G LK N D +  P +G    G V R A+G    A   S        + E  S + AE  A  +GV LA  LG    + E DS  +V
Subjt:  RDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLV

Query:  KILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADR-VWLEEGPSEISDALRGDVIS
          +    +D S +G +++DV+     + +S   F+PR+ N V H LA       D  +W E  P  I DAL  DV+S
Subjt:  KILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADR-VWLEEGPSEISDALRGDVIS

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657509.2e-2124.7Show/hide
Query:  DLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLG-DDRLIWHFEKHGAFSDR-----------PSPSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHN
        DL+    GW+ A +  +   +    +  + L    G  DRL W F + G FS R           P P+    M ++++ LW++ VP + K FLW + + 
Subjt:  DLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLG-DDRLIWHFEKHGAFSDR-----------PSPSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHN

Query:  RLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRS---FSQFRFEEVIWAMKDSLPGPDFE----LVVIFWWSAQQR----
         + T+    +R L+ S++C +C   VE   H+   C     +W+  +  P  R    FS+  FE +   + D     D        VI WW  + R    
Subjt:  RLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRS---FSQFRFEEVIWAMKDSLPGPDFE----LVVIFWWSAQQR----

Query:  --DNGECG----------------------------------GWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGW
          +N +C                                   GW  P  G +K+N D + + + G A  G VLR   G        ++ RC         
Subjt:  --DNGECG----------------------------------GWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGW

Query:  SVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRVW-LEEGPS
        S   AE W VY G++ A +       +E DS  +V  L  G+ D   +  L+      L      +I+   R+ NR+A  LA+ AFS +      +  P 
Subjt:  SVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRVW-LEEGPS

Query:  EISDALRGDVISS
         +S  LR D + S
Subjt:  EISDALRGDVISS

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.1e-0828.71Show/hide
Query:  DDRLIWHFEKHGAFSDRPSPSNPDRMH------AWWSGLWRLNVPSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESM
        DD  IW  + H   +   +      +H       W+  +W  N   KH F  W +  NRL T+  L    L++ ++C+LC+   E R HLF+ C    ++
Subjt:  DDRLIWHFEKHGAFSDRPSPSNPDRMH------AWWSGLWRLNVPSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESM

Query:  W
        W
Subjt:  W

AT2G02650.1 Ribonuclease H-like superfamily protein6.6e-1423.85Show/hide
Query:  RPSPSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDR--RHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEE
        +P P + +   A    +W+L+V  K K FLWR     L T   L  R+++   +C  C  C+E+    H+ ++C   +S+W  +     ++      FE+
Subjt:  RPSPSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDR--RHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEE

Query:  ------------------------VIWAM---------KDSLPGPDFEL-----VVIFWWSAQ---------------QRDNGECGGWRPPATGELKLNV
                                ++W +         +     PD+E          W +A                Q    +   W PP  G +K N 
Subjt:  ------------------------VIWAM---------KDSLPGPDFEL-----VVIFWWSAQ---------------QRDNGECGGWRPPATGELKLNV

Query:  DASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVR
        D+     +   R G  +R  NG I +     LQ       AE      A G+     V  A  L +V F  E+DS  LV +++ G ED S +G L+ D+R
Subjt:  DASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVR

Query:  RALHPWDNSKILFSPRQGNRVAHALAS
          +       + F  R+ N  A ALAS
Subjt:  RALHPWDNSKILFSPRQGNRVAHALAS

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.5e-0727.72Show/hide
Query:  DDRLIWHFEKHGAFSDRPSPSNPDRMH------AWWSGLWRLNVPSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESM
        DD  +W  + H   +   +P     +H       W   +W  N   KH F  W +  NRL T+  L    L++ + C+LC+   + R HLF+ C     +
Subjt:  DDRLIWHFEKHGAFSDRPSPSNPDRMH------AWWSGLWRLNVPSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESM

Query:  W
        W
Subjt:  W

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-1824.94Show/hide
Query:  VSDLFATSG-GWNEAVLRAHFNESDCEAI--LRIPLRHGLGDDRLIWHFEKHGAFS---------------DRPSPSNPDRMHAWWSGLWRLNVPSKHKF
        VSDL   SG  W + V+   F E + + I  LR   R  L  D   W +   G ++                 P   +   ++  +  +W+     K + 
Subjt:  VSDLFATSG-GWNEAVLRAHFNESDCEAI--LRIPLRHGLGDDRLIWHFEKHGAFS---------------DRPSPSNPDRMHAWWSGLWRLNVPSKHKF

Query:  FLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFA-------------------------PHHRSFSQ----------
        FLW+   N LP    L  R L+  S C+ C  C E   HL + C      W  S                            P     SQ          
Subjt:  FLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFA-------------------------PHHRSFSQ----------

Query:  ------------FRFEEVIWAMKDSLPGPDFELVVIFWWSAQQRDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCW
                    F  +EV+   +D L             +  Q +   CG WRPP    +K N DA+   D      G VLR   GE+      +L +  
Subjt:  ------------FRFEEVIWAMKDSLPGPDFELVVIFWWSAQQRDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCW

Query:  SVDLAEGWSVDLAEGWAVYKGVWLAR-QLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSY
        SV  AE      A  WAV     L+R Q  +V F  E+DS  L++IL+   E    +   + D++R L  +   K +F PR+GN +A  +A  + S+
Subjt:  SVDLAEGWSVDLAEGWAVYKGVWLAR-QLGFVDFVVETDSLRLVKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSY

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.0e-0828.97Show/hide
Query:  SGGWNEAVLRAHFNESDCEAILRIPLRH-GLGDDRLIWHFEKHGAFSDRPSPSNPD-----RMHA----WWSGLWRLNVPSKHKFFLWRLFHNRLPTKVN
        +G W     R+  ++    A+   P+ H   G D  +W   ++ A S  PS S+ D     R+H+    W   +W      +     W  F  RLPT+  
Subjt:  SGGWNEAVLRAHFNESDCEAILRIPLRH-GLGDDRLIWHFEKHGAFSDRPSPSNPD-----RMHA----WWSGLWRLNVPSKHKFFLWRLFHNRLPTKVN

Query:  LLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMW--LCSKFAP
        L    +N+ S  VLCS   E   HLF+ C    ++W    SKF P
Subjt:  LLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMW--LCSKFAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAAGGAACTGGTGAGGACAATCAGACAGAAATCGGACTGGGAGATGGACCCAAGAGGCGAAACCGGCAAGTGGGACGGGCCAAGACTGAAGGCCGAGCTCGTCCG
GCTCCGTTTGGTCCTCATCGCCTCTAGCTGCCCCAGTTTCGCCTGTACAGTTAGCGATCTATTTGCCACATCGGGTGGATGGAATGAGGCTGTGCTCAGAGCCCATTTTA
ATGAGTCGGATTGTGAGGCCATCTTGAGAATCCCATTACGACATGGCCTGGGGGATGATCGTTTAATTTGGCACTTTGAGAAGCATGGGGCCTTTTCTGACCGACCTTCA
CCGTCGAATCCTGATAGGATGCATGCGTGGTGGTCTGGCCTCTGGAGGCTAAATGTGCCTAGCAAGCACAAGTTTTTCTTATGGCGATTGTTCCATAACCGCTTGCCTAC
TAAGGTAAACCTTCTCAAACGTGACCTTAATGTTTCTAGCCTGTGTGTTTTGTGTAGTGAGTGTGTGGAGGACCGCCGACATCTGTTCTGGAGTTGCCTTGTGATTGAGA
GTATGTGGTTGTGCTCCAAGTTTGCCCCTCACCACCGGTCCTTTTCCCAATTTCGGTTCGAGGAAGTCATTTGGGCGATGAAGGATAGTCTTCCAGGGCCGGATTTTGAG
CTTGTGGTCATTTTCTGGTGGTCTGCTCAACAAAGGGATAACGGAGAATGCGGTGGGTGGAGGCCGCCAGCTACTGGAGAGCTGAAGCTTAATGTCGATGCCTCTGTCAA
GCCTGATACAGGGGAAGCTAGGGGTGGCTGTGTACTGCGTGGGGCTAATGGTGAGATTTTTATGGCGGCCTGTTTCAGCTTACAGAGGTGTTGGAGCGTGGATTTGGCAG
AGGGTTGGAGCGTGGATTTGGCAGAGGGTTGGGCTGTGTATAAAGGGGTTTGGCTTGCTCGCCAGCTGGGGTTTGTTGATTTTGTGGTGGAGACTGACTCTTTGAGATTG
GTCAAAATCCTTCATGGTGGTATGGAGGATGTGTCGGAAGTAGGACGGTTGATGGATGACGTCCGAAGGGCCCTCCATCCTTGGGACAATAGTAAGATTTTGTTTTCGCC
ACGCCAGGGGAACAGGGTGGCACATGCTTTGGCTAGCTTGGCCTTTTCTTATGCTGACCGTGTTTGGCTTGAAGAAGGGCCTAGCGAGATTTCTGATGCCCTAAGGGGTG
ATGTTATTTCATCTTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACAAGGAACTGGTGAGGACAATCAGACAGAAATCGGACTGGGAGATGGACCCAAGAGGCGAAACCGGCAAGTGGGACGGGCCAAGACTGAAGGCCGAGCTCGTCCG
GCTCCGTTTGGTCCTCATCGCCTCTAGCTGCCCCAGTTTCGCCTGTACAGTTAGCGATCTATTTGCCACATCGGGTGGATGGAATGAGGCTGTGCTCAGAGCCCATTTTA
ATGAGTCGGATTGTGAGGCCATCTTGAGAATCCCATTACGACATGGCCTGGGGGATGATCGTTTAATTTGGCACTTTGAGAAGCATGGGGCCTTTTCTGACCGACCTTCA
CCGTCGAATCCTGATAGGATGCATGCGTGGTGGTCTGGCCTCTGGAGGCTAAATGTGCCTAGCAAGCACAAGTTTTTCTTATGGCGATTGTTCCATAACCGCTTGCCTAC
TAAGGTAAACCTTCTCAAACGTGACCTTAATGTTTCTAGCCTGTGTGTTTTGTGTAGTGAGTGTGTGGAGGACCGCCGACATCTGTTCTGGAGTTGCCTTGTGATTGAGA
GTATGTGGTTGTGCTCCAAGTTTGCCCCTCACCACCGGTCCTTTTCCCAATTTCGGTTCGAGGAAGTCATTTGGGCGATGAAGGATAGTCTTCCAGGGCCGGATTTTGAG
CTTGTGGTCATTTTCTGGTGGTCTGCTCAACAAAGGGATAACGGAGAATGCGGTGGGTGGAGGCCGCCAGCTACTGGAGAGCTGAAGCTTAATGTCGATGCCTCTGTCAA
GCCTGATACAGGGGAAGCTAGGGGTGGCTGTGTACTGCGTGGGGCTAATGGTGAGATTTTTATGGCGGCCTGTTTCAGCTTACAGAGGTGTTGGAGCGTGGATTTGGCAG
AGGGTTGGAGCGTGGATTTGGCAGAGGGTTGGGCTGTGTATAAAGGGGTTTGGCTTGCTCGCCAGCTGGGGTTTGTTGATTTTGTGGTGGAGACTGACTCTTTGAGATTG
GTCAAAATCCTTCATGGTGGTATGGAGGATGTGTCGGAAGTAGGACGGTTGATGGATGACGTCCGAAGGGCCCTCCATCCTTGGGACAATAGTAAGATTTTGTTTTCGCC
ACGCCAGGGGAACAGGGTGGCACATGCTTTGGCTAGCTTGGCCTTTTCTTATGCTGACCGTGTTTGGCTTGAAGAAGGGCCTAGCGAGATTTCTGATGCCCTAAGGGGTG
ATGTTATTTCATCTTTGTGA
Protein sequenceShow/hide protein sequence
MHKELVRTIRQKSDWEMDPRGETGKWDGPRLKAELVRLRLVLIASSCPSFACTVSDLFATSGGWNEAVLRAHFNESDCEAILRIPLRHGLGDDRLIWHFEKHGAFSDRPS
PSNPDRMHAWWSGLWRLNVPSKHKFFLWRLFHNRLPTKVNLLKRDLNVSSLCVLCSECVEDRRHLFWSCLVIESMWLCSKFAPHHRSFSQFRFEEVIWAMKDSLPGPDFE
LVVIFWWSAQQRDNGECGGWRPPATGELKLNVDASVKPDTGEARGGCVLRGANGEIFMAACFSLQRCWSVDLAEGWSVDLAEGWAVYKGVWLARQLGFVDFVVETDSLRL
VKILHGGMEDVSEVGRLMDDVRRALHPWDNSKILFSPRQGNRVAHALASLAFSYADRVWLEEGPSEISDALRGDVISSL