; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025332 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025332
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase
Genome locationscaffold13:41105411..41106598
RNA-Seq ExpressionSpg025332
SyntenySpg025332
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_021847414.1 uncharacterized protein LOC110787151 [Spinacia oleracea]2.3e-2928.45Show/hide
Query:  SKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIF
        ++D + W+ E  G +SV+SAY        + +   SS+ S  S+WK IWK N +P  K+  W+  ++ALPT+  ++K+    +  C +C    ES  H  
Subjt:  SKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIF

Query:  WICKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNISELK
          CK ++ +W +        W  C      +D+W+   + L  E     I L W +W ARN + +     D     +   T  +E           S   
Subjt:  WICKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNISELK

Query:  SLSSHLHWEPPDPGCWKLNADASWSEKDEIG-GIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVN
             + W  P+ G +K+N DA      E+G G+G  IRD  G ++    ++  + W     EAKA++ G++   + G          +VVESD + V+ 
Subjt:  SLSSHLHWEPPDPGCWKLNADASWSEKDEIG-GIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVN

Query:  AVNRVADDATELCLFVDEIDSFRRS--NRVRSFSKCSRQSNSLAHELA
        A+ R A   +E  L V++I +F  +  N V SF K  R  N +AH+LA
Subjt:  AVNRVADDATELCLFVDEIDSFRRS--NRVRSFSKCSRQSNSLAHELA

XP_023915006.1 uncharacterized protein LOC112026546 [Quercus suber]2.7e-3028.98Show/hide
Query:  KDEIIWKYEDKGSFSVKSAYHLAKS-LSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIF
        +D ++W    KGSF+VKSAY++A S L S +    SS      +WK IW     P  KI  W++  + LPT  N++ +G+H +  C LC    E+TAH  
Subjt:  KDEIIWKYEDKGSFSVKSAYHLAKS-LSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIF

Query:  WICKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNISELK
          C  +K  WA +    + L       ++P+D    I +N S     + I + W IW  RN +         +++       + E     ++  + S L 
Subjt:  WICKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNISELK

Query:  SLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNA
         +SS ++W PP  G  K+N D + S+       G  IRDS GS I   C+ +S  +     EA A+ +G+    + G          ++ ESD++ ++ A
Subjt:  SLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNA

Query:  VNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNG
        +N   +   E+   +  I     S    +F    R+ N  AHELARAA   G
Subjt:  VNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNG

XP_024190234.1 uncharacterized protein LOC112194221 [Rosa chinensis]3.4e-3326.99Show/hide
Query:  DEIIWKYEDKGSFSVKSAYHLAKSLSSKDIAS-GSSDASSK---SIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAH
        D I+W Y   G ++VKS   LA  L      S G S++S+K    +W ++WK       K+ +W+     LP   N+ ++ +  +  CC C   +E+T H
Subjt:  DEIIWKYEDKGSFSVKSAYHLAKSLSSKDIAS-GSSDASSK---SIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAH

Query:  IFWICKFSKKLWADFIPNASPLWLRCRDWEKP--IDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNI
          W C +SKK+W     N        + W +P  ID +  + +  SKEE  +  ++ W +W +RN       + +   +       ++   + Q  N+N+
Subjt:  IFWICKFSKKLWADFIPNASPLWLRCRDWEKP--IDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNI

Query:  S-ELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSV
        + +       + W PP     KLN DA+   K +   +G  +RD  G L   G K +    +I  +EA A+  GL  Y ++ GF+       LVVESDS 
Subjt:  S-ELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSV

Query:  EVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNGDFLFFVNFSLLYGEEDRFWREVPFPEWCRSVV
         V++A+N+   D +     +D+I     +     + K +R+ N  AH++A+ A              L  +EDR W+E   P W   ++
Subjt:  EVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNGDFLFFVNFSLLYGEEDRFWREVPFPEWCRSVV

XP_027118368.1 uncharacterized protein LOC113735569 [Coffea arabica]7.7e-3028.33Show/hide
Query:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSS--KDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKEST
        S + D++IW Y   GSF+VKSAYHL K+ S+  +D+ASG S+ S + IW  +WK N     KI +W+I    LPT + +  +G+H +  C LC+   E  
Subjt:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSS--KDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKEST

Query:  AHIFWICKFSKKLWADFIPNASPLWLRCR-DWEKPIDFW-NFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQ
         H+F  C  + + WA      S L L+ R D  K +  W   +   L K++  +  ++LW++W  RN +  +    D   +   ++ SI+     +E N 
Subjt:  AHIFWICKFSKKLWADFIPNASPLWLRCR-DWEKPIDFW-NFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQ

Query:  N-ISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESD
        +  + +++  + LHW  P  G  K N D +   +    G+G  +RD  G+ +     KIS +++ + +EA A    +             + P +++E D
Subjt:  N-ISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESD

Query:  SVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNGD
        S+++VN +  +  D +   + +D+I    ++      +   R++N  AH LAR A    D
Subjt:  SVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNGD

XP_042939495.1 uncharacterized protein LOC122274525 [Carya illinoinensis]2.0e-3027.82Show/hide
Query:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAH
        SG  D+ IW Y   G FSVKSAYHL   +S K    G S       WK +W  N   + K+ +WK LND LPT+TN+ K+ +  + CC +     ES  H
Subjt:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAH

Query:  IFWICKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQ-NIS
          W C+ S  +WAD     SPL     +  +  + W  +   ++KEE  + ++++  +W  RN     N   D   +  + A S++     Q   +  + 
Subjt:  IFWICKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQ-NIS

Query:  ELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEV
                  W+ P   C K+N DAS ++KDE  GIG  +RD  G ++   C       +    E +A+   +K   +   FE       ++ E D++ V
Subjt:  ELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEV

Query:  VNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNGDFLFFVNFSLLYGEEDRFWREVPFPE
        VNAVN   +        V+++    +  +        R+ N +AH L +           V+F+     E++ W E   PE
Subjt:  VNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNGDFLFFVNFSLLYGEEDRFWREVPFPE

TrEMBL top hitse value%identityAlignment
A0A2N9H3I8 Uncharacterized protein2.0e-3129.43Show/hide
Query:  DEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIFWI
        D +IW    +G FSVKSAYHL  SL +   A+ SS  S  SIW SIW     P  K+ VWK  +D LPT+T + +KGI  +  C  C    E+  H+ W 
Subjt:  DEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIFWI

Query:  CKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIA-TSIDERPNLQEDNQNISELKS
        C+F+ K+W       S  +  C  ++   DF   +   L      IA    W +W ARN +     +P ++ + + +A  ++D    L+ +++ + ++  
Subjt:  CKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIA-TSIDERPNLQEDNQNISELKS

Query:  LSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNAV
        L     W+ P  G  KLN    +S      G G  IRD+ G ++ +    I    ++    A+A+++ ++   D     G RR   LVVE D++E+ N +
Subjt:  LSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNAV

Query:  NRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKN
                 + + V++I S+     + SF    +  N  AH LA  AA +
Subjt:  NRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKN

A0A6P6WSG1 uncharacterized protein LOC1137355693.7e-3028.33Show/hide
Query:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSS--KDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKEST
        S + D++IW Y   GSF+VKSAYHL K+ S+  +D+ASG S+ S + IW  +WK N     KI +W+I    LPT + +  +G+H +  C LC+   E  
Subjt:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSS--KDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKEST

Query:  AHIFWICKFSKKLWADFIPNASPLWLRCR-DWEKPIDFW-NFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQ
         H+F  C  + + WA      S L L+ R D  K +  W   +   L K++  +  ++LW++W  RN +  +    D   +   ++ SI+     +E N 
Subjt:  AHIFWICKFSKKLWADFIPNASPLWLRCR-DWEKPIDFW-NFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQ

Query:  N-ISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESD
        +  + +++  + LHW  P  G  K N D +   +    G+G  +RD  G+ +     KIS +++ + +EA A    +             + P +++E D
Subjt:  N-ISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESD

Query:  SVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNGD
        S+++VN +  +  D +   + +D+I    ++      +   R++N  AH LAR A    D
Subjt:  SVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKNGD

A0A7C9A7V4 Uncharacterized protein3.4e-3130.03Show/hide
Query:  KDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASS--KSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHI
        +D +IW     G F+V+SA  LA+ +  +   +  +  S      W  IW A   P AK   W+   DALPT  N+ K+G+  ++ C +C NG E+T HI
Subjt:  KDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASS--KSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHI

Query:  FWICKFSKKLWADFIPNASPLWLRCRD-WEKPIDFW-NFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNIS
        F  C  + + W       SP     RD  E     W +   + L K +  +   LLW +W  RN    +    D+     R   S +    + + N    
Subjt:  FWICKFSKKLWADFIPNASPLWLRCRD-WEKPIDFW-NFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNIS

Query:  ELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEV
          K     L WEPP  G  K+N DAS + +D + G+G  +RD NG +  + C+     W ++ +EA A++ GL+     G          L +E+DS +V
Subjt:  ELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEV

Query:  VNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAK
         +A+N   D       F+ +  +      V SFS  SR +N +AHELAR A +
Subjt:  VNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAK

A0A803PAX6 Uncharacterized protein1.3e-3026.06Show/hide
Query:  SKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIF
        S D++IW +   G ++V+S Y +A+  + K  A  SS+A+++  W SIWK    P  +  +W++ N  +P  T + ++G++ N  C  C   +E+  H  
Subjt:  SKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIF

Query:  WICKFSKKLWADFIPNASPLWLRCRDWEKP-IDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPD----INRVKRRIATSIDERPNLQEDNQN
        W C +SK++W  +     P W   +  +   ID    I+  ++KEE    I++ W IW+ RN   +N+++      +   +  +   + + P L+  N  
Subjt:  WICKFSKKLWADFIPNASPLWLRCRDWEKP-IDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPD----INRVKRRIATSIDERPNLQEDNQN

Query:  ISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSV
           L     HL WEPP    + +N+DAS    +   G+G  IR + G +     + ++ +++I+  EA AI  G+     +          P +++SD +
Subjt:  ISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSV

Query:  EVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAA
         V+  +       T+    +DE+ S       ++ S  SR +N +AH LA+ A
Subjt:  EVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAA

A0A803QH76 Uncharacterized protein6.4e-3028.79Show/hide
Query:  DEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIFWI
        D+I+W + D G ++V+S YHLA SL S+D    SS ++++  W  +W        KI  W+++NDALPT  N+A + I ++  C LC+   ES  H  + 
Subjt:  DEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIFWI

Query:  CKFSKKLWAD-----FIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQN-I
        C  +K +W       FIPN + +        K  D ++ +    +  +  I   LLW IW  RN   I+   P    +    A +   +  +    Q   
Subjt:  CKFSKKLWAD-----FIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQN-I

Query:  SELKSLSSHL------------HWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRR
        S L S S +              W+PPD GC+KLN DA+ +E   + G G  +RD +G +I    K     ++ K +E  A+   L+       +     
Subjt:  SELKSLSSHL------------HWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRR

Query:  KPPLVVESDSVEVVNAVNRVADDATELCLFVDEID--SFRRSNRVR-SFSKCSRQSNSLAHELARAAAKNGDFLFFVNFSLLYGEEDRFWREVPFP
         P   +E+D++ VVNA+N+ A     L  F D ++  S+  S   R   S   R +N  AH LA             NF+L   E+  +  E P P
Subjt:  KPPLVVESDSVEVVNAVNRVADDATELCLFVDEID--SFRRSNRVR-SFSKCSRQSNSLAHELARAAAKNGDFLFFVNFSLLYGEEDRFWREVPFP

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.3e-0820.95Show/hide
Query:  MSGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTA
        ++G++D + WK+   G FSV+SAY +   L+  ++       +  S +  +WK       K  +W + N A+ T+    ++ +  +  C +C+ G ES  
Subjt:  MSGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTA

Query:  HIFWICKFSKKLWADFIPNASPLWLRCRDWEKPIDFW---NFIQRNLSKE---ETRIAILLLW-HIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQ
        H+   C     +W   +P           + K +  W   N   R+  ++    T  A+++ W   W   N    N    D  +  +  A  +  R +  
Subjt:  HIFWICKFSKKLWADFIPNASPLWLRCRDWEKPIDFW---NFIQRNLSKE---ETRIAILLLW-HIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQ

Query:  EDNQNISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVV
             I++ + +   + W  P  G  K+N D +      +   G  +RD  G+  G     I +  +    E   +  GL        F   ++ P + +
Subjt:  EDNQNISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVV

Query:  ESDSVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAA
        E DS  +V  +     D+  L   V     F + + +       R++N LA  LA  A
Subjt:  ESDSVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAA

Arabidopsis top hitse value%identityAlignment
AT1G52990.1 thioredoxin family protein1.1e-0528.57Show/hide
Query:  PGC-WKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNAVNRVADDATEL
        P C  K N DAS  E D + G+GW IR+S G+++  G  K       +  E  A+I  ++A    G          ++ E D+  V   +N  +D+   L
Subjt:  PGC-WKLNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNAVNRVADDATEL

Query:  CLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKN
          ++D I S+  S     F    R+ N  A  L + A K+
Subjt:  CLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELARAAAKN

AT3G09510.1 Ribonuclease H-like superfamily protein3.1e-2125.5Show/hide
Query:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAH
        S   D+IIW Y   G ++V+S Y L     S +I + +    S  +   IW    +P  K  +W+ L+ AL T   +  +G+  +  C  C    ES  H
Subjt:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAH

Query:  IFWICKFSKKLWADFIPNASPLWLRCRDWEKPI-DFWNFIQ-RNLSKEETRIAILLLWHIWDARNTSNINNYLPDINR-VKRRIATSIDERPNLQEDNQN
          + C F+   W     +     L   D+E+ I +  NF+Q   +S     + + L+W IW ARN    N +    ++ V    A + D     Q   + 
Subjt:  IFWICKFSKKLWADFIPNASPLWLRCRDWEKPI-DFWNFIQ-RNLSKEETRIAILLLWHIWDARNTSNINNYLPDINR-VKRRIATSIDERPNLQEDNQN

Query:  ISELKSLSSH-LHWEPPDPGCWKLNADASWS-EKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESD
         S  + ++ + + W  P     K N DA +  +K E  G GW IR+  G+ I  G  K++   N    E KA++  L+           R    + +E D
Subjt:  ISELKSLSSH-LHWEPPDPGCWKLNADASWS-EKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESD

Query:  SVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELAR
           ++N +N ++  ++ L   +++I  +        F    R+ N LAH LA+
Subjt:  SVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAHELAR

AT4G09775.1 BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT2G02650.1)2.4e-0525.19Show/hide
Query:  LWHIWDARNTSNINNYLPDINRVKRRIA------------TSIDERPNLQEDNQNISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDS
        +W +W +RN          I+R   ++A            T+I++  N     Q        S    W PP  G  K N D+ + +  +     W IRDS
Subjt:  LWHIWDARNTSNINNYLPDINRVKRRIA------------TSIDERPNLQEDNQNISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDS

Query:  NGSLIGLGCKKISKLWNIKCLEAKAIIEGLK
        NG +I  GC K+ + ++    EA   +  L+
Subjt:  NGSLIGLGCKKISKLWNIKCLEAKAIIEGLK

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.4e-0726.79Show/hide
Query:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIW-KSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTA
        S  +D  +W+          +A     S SS+D        S    W K +W    IP   +  W    + LPT+  +   G++      LC NG E+ A
Subjt:  SGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIW-KSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTA

Query:  HIFWICKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLL----WHIWDARN
        H+F+ C FS  +W  F     P       +  P      +Q  L    T I  LLL    +H+W  RN
Subjt:  HIFWICKFSKKLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLL----WHIWDARN

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0925.96Show/hide
Query:  LLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPN---------LQEDNQNISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNG
        L+W IW + N    N+        + +  T+++   N         +  + QN +     S +  W PP     K N DAS  E++ + G+GW +R+S G
Subjt:  LLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPN---------LQEDNQNISELKSLSSHLHWEPPDPGCWKLNADASWSEKDEIGGIGWAIRDSNG

Query:  SLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAH
        ++I  G  K       +  E   +I  ++A   S GF G ++   ++ E D+  +   +N  + +   L  F+D I S+  S     FS   R+ N  A 
Subjt:  SLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNAVNRVADDATELCLFVDEIDSFRRSNRVRSFSKCSRQSNSLAH

Query:  ELARAAAK
         LA+ A K
Subjt:  ELARAAAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGGGTCCAAGGACGAGATAATATGGAAGTACGAGGACAAAGGCAGCTTCTCTGTAAAGAGTGCTTACCACCTAGCGAAAAGTTTGAGCTCCAAAGATATAGCTTC
CGGGTCCAGCGATGCTAGTTCAAAATCCATTTGGAAGTCTATATGGAAGGCCAATTGTATCCCTATAGCTAAGATCACCGTGTGGAAAATCCTCAATGACGCCCTCCCAA
CTAAGACCAATATTGCTAAAAAAGGAATTCATACTAATCTTTGTTGTTGTCTGTGCAGGAATGGCAAGGAGTCCACAGCTCACATATTCTGGATATGTAAGTTCTCCAAA
AAGCTTTGGGCTGATTTCATCCCTAATGCTAGCCCTCTTTGGCTTCGGTGCAGGGACTGGGAGAAGCCAATTGATTTTTGGAACTTCATTCAAAGGAACCTCTCTAAGGA
GGAGACCAGAATTGCCATCCTTTTGCTGTGGCATATTTGGGATGCAAGGAATACAAGCAACATCAACAACTATTTGCCAGACATAAACAGAGTTAAAAGGCGAATTGCGA
CCAGCATTGATGAAAGGCCTAATCTTCAAGAGGACAACCAGAACATTTCAGAATTAAAGAGCCTTTCGAGTCACTTACATTGGGAACCTCCTGACCCCGGCTGCTGGAAG
CTTAATGCTGATGCTTCCTGGTCTGAGAAAGATGAGATTGGCGGGATTGGATGGGCCATTCGTGACTCTAACGGATCTTTAATTGGTTTGGGCTGCAAAAAAATTTCCAA
ATTGTGGAACATCAAATGTCTTGAAGCGAAAGCCATTATTGAAGGATTGAAAGCTTACGAAGACAGTGGCGGTTTCGAAGGGAGCAGACGCAAGCCACCGCTGGTTGTTG
AGTCAGATTCCGTAGAAGTCGTGAACGCTGTTAATCGAGTCGCCGATGACGCCACGGAACTCTGCTTGTTCGTGGATGAAATCGACAGTTTCAGGCGCTCGAACCGGGTG
AGATCCTTCTCCAAATGCTCGAGGCAGAGCAACTCTTTGGCGCACGAGCTTGCGCGAGCGGCGGCCAAAAATGGCGATTTCCTGTTTTTTGTAAATTTCTCTCTCCTCTA
TGGAGAAGAGGATAGGTTTTGGAGGGAAGTTCCTTTCCCCGAGTGGTGTAGGTCTGTTGTTAATGTGGTGGGTGTAACTGTTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGGGTCCAAGGACGAGATAATATGGAAGTACGAGGACAAAGGCAGCTTCTCTGTAAAGAGTGCTTACCACCTAGCGAAAAGTTTGAGCTCCAAAGATATAGCTTC
CGGGTCCAGCGATGCTAGTTCAAAATCCATTTGGAAGTCTATATGGAAGGCCAATTGTATCCCTATAGCTAAGATCACCGTGTGGAAAATCCTCAATGACGCCCTCCCAA
CTAAGACCAATATTGCTAAAAAAGGAATTCATACTAATCTTTGTTGTTGTCTGTGCAGGAATGGCAAGGAGTCCACAGCTCACATATTCTGGATATGTAAGTTCTCCAAA
AAGCTTTGGGCTGATTTCATCCCTAATGCTAGCCCTCTTTGGCTTCGGTGCAGGGACTGGGAGAAGCCAATTGATTTTTGGAACTTCATTCAAAGGAACCTCTCTAAGGA
GGAGACCAGAATTGCCATCCTTTTGCTGTGGCATATTTGGGATGCAAGGAATACAAGCAACATCAACAACTATTTGCCAGACATAAACAGAGTTAAAAGGCGAATTGCGA
CCAGCATTGATGAAAGGCCTAATCTTCAAGAGGACAACCAGAACATTTCAGAATTAAAGAGCCTTTCGAGTCACTTACATTGGGAACCTCCTGACCCCGGCTGCTGGAAG
CTTAATGCTGATGCTTCCTGGTCTGAGAAAGATGAGATTGGCGGGATTGGATGGGCCATTCGTGACTCTAACGGATCTTTAATTGGTTTGGGCTGCAAAAAAATTTCCAA
ATTGTGGAACATCAAATGTCTTGAAGCGAAAGCCATTATTGAAGGATTGAAAGCTTACGAAGACAGTGGCGGTTTCGAAGGGAGCAGACGCAAGCCACCGCTGGTTGTTG
AGTCAGATTCCGTAGAAGTCGTGAACGCTGTTAATCGAGTCGCCGATGACGCCACGGAACTCTGCTTGTTCGTGGATGAAATCGACAGTTTCAGGCGCTCGAACCGGGTG
AGATCCTTCTCCAAATGCTCGAGGCAGAGCAACTCTTTGGCGCACGAGCTTGCGCGAGCGGCGGCCAAAAATGGCGATTTCCTGTTTTTTGTAAATTTCTCTCTCCTCTA
TGGAGAAGAGGATAGGTTTTGGAGGGAAGTTCCTTTCCCCGAGTGGTGTAGGTCTGTTGTTAATGTGGTGGGTGTAACTGTTCCTTAA
Protein sequenceShow/hide protein sequence
MSGSKDEIIWKYEDKGSFSVKSAYHLAKSLSSKDIASGSSDASSKSIWKSIWKANCIPIAKITVWKILNDALPTKTNIAKKGIHTNLCCCLCRNGKESTAHIFWICKFSK
KLWADFIPNASPLWLRCRDWEKPIDFWNFIQRNLSKEETRIAILLLWHIWDARNTSNINNYLPDINRVKRRIATSIDERPNLQEDNQNISELKSLSSHLHWEPPDPGCWK
LNADASWSEKDEIGGIGWAIRDSNGSLIGLGCKKISKLWNIKCLEAKAIIEGLKAYEDSGGFEGSRRKPPLVVESDSVEVVNAVNRVADDATELCLFVDEIDSFRRSNRV
RSFSKCSRQSNSLAHELARAAAKNGDFLFFVNFSLLYGEEDRFWREVPFPEWCRSVVNVVGVTVP