; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026648 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026648
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr10:40082812..40083986
RNA-Seq ExpressionLag0026648
SyntenyLag0026648
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU43361.1 hypothetical protein TSUD_82070 [Trifolium subterraneum]1.6e-1324.19Show/hide
Query:  SYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKG------IATQVSKDRDPKELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTP
        S+ R +  W  +W ++A P++K  + +I    + T+A +Q KG      +++ ++ + +   +   M+  LSKE+    A ILWSIW   N  I NN+T 
Subjt:  SYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKG------IATQVSKDRDPKELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTP

Query:  ASEALQISIETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGL
        A   +       ++E     +    S + +   +  +W +P     K N D ++   + + GIG  +RD  G+ I ++ +  S    +   EA  +F  L
Subjt:  ASEALQISIETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGL

Query:  NCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAF---SVSFIHCNIILNTVAHCVANEA
        N + +       P+  E D+  V+      + D +    I E         +   SV F+      N VAH +A  A
Subjt:  NCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAF---SVSFIHCNIILNTVAHCVANEA

XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia]4.5e-1634.21Show/hide
Query:  KELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISIETNIKERILTFDKQR-NSRTARSQVSQ--------------AMWERPKLNQ
        K+ W W+++ LS EE+     I W IW   N+ I    T   + L  SI   I   I   DK    S+T RSQ +                 W  P  N 
Subjt:  KELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISIETNIKERILTFDKQR-NSRTARSQVSQ--------------AMWERPKLNQ

Query:  WKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLS
        WKLN+D +W E+ +  GIGWI+ D  G ++ +    I     I  LE   I  GL  I     Q R P+ +ESD++EVI+++KKE+ DL+
Subjt:  WKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLS

XP_022148549.1 uncharacterized protein LOC111017181 [Momordica charantia]7.9e-1328.37Show/hide
Query:  WAWMIDNLSKEELERGASILWSIWNFINKIIHNNL-TPASEALQISIETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRG
        W W   +  K   + G  +LWSIW + N+I+H+ +  P S+ ++   E+ I E         N    ++      W  P  + WKLN D  W +     G
Subjt:  WAWMIDNLSKEELERGASILWSIWNFINKIIHNNL-TPASEALQISIETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRG

Query:  IGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAFSVSFIHCNII
        +GWIVRDS G  I +E         +K L      E             I + +ESD LEV+ ++ K    L+ +  I E +   ++      F H  + 
Subjt:  IGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAFSVSFIHCNII

Query:  LNTVAHCVANEACNF
         N VAH +A  AC F
Subjt:  LNTVAHCVANEACNF

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia]2.4e-1730.53Show/hide
Query:  IDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISI----------ETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEK
        +D   +EE  R   I W IW   NK I   +   +  +Q+ I          +TN+K +    D     R   +  + A W+ P  N WKLN+D AW   
Subjt:  IDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISI----------ETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEK

Query:  DKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIK-DTCK----QRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAF
            GIGWI+RD  G +I ++ ++I     I  LE  AI EGL  I+ + C+    +   P+ +ESD+LE I +L ++ +D + +  + E +   +++  
Subjt:  DKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIK-DTCK----QRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAF

Query:  SVSFIHCNIILNTVAHCVANEACNFD
         VS  H +   N VAH +A  A   D
Subjt:  SVSFIHCNIILNTVAHCVANEACNFD

XP_042979954.1 uncharacterized protein LOC122310138 [Carya illinoinensis]2.1e-1326.47Show/hide
Query:  SSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKGIATQVSKDRDPKELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISIE
        + + WKSIW +     +K  + K  N+I+PT++N+  K I      + +  ELW  +I+  +++ELE  A+ +  IW  +N  I      +   L +   
Subjt:  SSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKGIATQVSKDRDPKELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISIE

Query:  TNIKERILTFDKQRNSRTA--RSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCK
          +++ I T    + +R    RS++ +  W++   N+   N D     K++  G+  ++RD+ G ++ S      N     + E  A+++ +   KD   
Subjt:  TNIKERILTFDKQRNSRTA--RSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCK

Query:  QRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVD--EAFSVSFIHCNIILNTVAHCVANEACNFD
         +   +++E DA  VIK +   EEDLS +  I E V + V   + + V FI      N VAH +A  A   D
Subjt:  QRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVD--EAFSVSFIHCNIILNTVAHCVANEACNFD

TrEMBL top hitse value%identityAlignment
A0A2Z6M8I1 Uncharacterized protein3.8e-1325.83Show/hide
Query:  TDSYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKGIATQ---VSKDRDPKEL--------WA--------------------------WM
        T +    S  WK++W++++ P+    + +I+++++P K+N+  KGI         ++ P+ +        WA                          +M
Subjt:  TDSYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKGIATQ---VSKDRDPKEL--------WA--------------------------WM

Query:  IDNLSKEELERGASILWSIW-NFINKIIHNNLTPASEALQISIETNIKERILTFDKQRNSRTARSQV-SQAMWERPKLNQWKLNSDVAWFEKDKRRGIGW
        + N +KE ++  ++I +SIW    NK+ HN  TPASEA++ +++T  +   L  D+  +S+   S V +   W  P  N  KLN D A    D R G G 
Subjt:  IDNLSKEELERGASILWSIW-NFINKIIHNNLTPASEALQISIETNIKERILTFDKQRNSRTARSQV-SQAMWERPKLNQWKLNSDVAWFEKDKRRGIGW

Query:  IVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSH
        +VR   G  + +  +V +      M EA  +FE L  ++   K      +IE DA +++ V+ K +  +S+
Subjt:  IVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSH

A0A2Z6NHQ6 Reverse transcriptase domain-containing protein7.7e-1424.19Show/hide
Query:  SYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKG------IATQVSKDRDPKELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTP
        S+ R +  W  +W ++A P++K  + +I    + T+A +Q KG      +++ ++ + +   +   M+  LSKE+    A ILWSIW   N  I NN+T 
Subjt:  SYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKG------IATQVSKDRDPKELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTP

Query:  ASEALQISIETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGL
        A   +       ++E     +    S + +   +  +W +P     K N D ++   + + GIG  +RD  G+ I ++ +  S    +   EA  +F  L
Subjt:  ASEALQISIETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGL

Query:  NCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAF---SVSFIHCNIILNTVAHCVANEA
        N + +       P+  E D+  V+      + D +    I E         +   SV F+      N VAH +A  A
Subjt:  NCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAF---SVSFIHCNIILNTVAHCVANEA

A0A6J1CQG0 uncharacterized protein LOC1110132162.2e-1634.21Show/hide
Query:  KELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISIETNIKERILTFDKQR-NSRTARSQVSQ--------------AMWERPKLNQ
        K+ W W+++ LS EE+     I W IW   N+ I    T   + L  SI   I   I   DK    S+T RSQ +                 W  P  N 
Subjt:  KELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISIETNIKERILTFDKQR-NSRTARSQVSQ--------------AMWERPKLNQ

Query:  WKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLS
        WKLN+D +W E+ +  GIGWI+ D  G ++ +    I     I  LE   I  GL  I     Q R P+ +ESD++EVI+++KKE+ DL+
Subjt:  WKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLS

A0A6J1DSV1 uncharacterized protein LOC1110236081.2e-1730.53Show/hide
Query:  IDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISI----------ETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEK
        +D   +EE  R   I W IW   NK I   +   +  +Q+ I          +TN+K +    D     R   +  + A W+ P  N WKLN+D AW   
Subjt:  IDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISI----------ETNIKERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEK

Query:  DKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIK-DTCK----QRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAF
            GIGWI+RD  G +I ++ ++I     I  LE  AI EGL  I+ + C+    +   P+ +ESD+LE I +L ++ +D + +  + E +   +++  
Subjt:  DKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIK-DTCK----QRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAF

Query:  SVSFIHCNIILNTVAHCVANEACNFD
         VS  H +   N VAH +A  A   D
Subjt:  SVSFIHCNIILNTVAHCVANEACNFD

A0A803QC75 Uncharacterized protein1.3e-1324.85Show/hide
Query:  TDSYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKGIATQVSKDRDPKELW---------------AWMIDNL------------------
        + S  R S  WK  WSLQ  P++K+   + ++D +P   ++  + + T  S     ++ W                W   NL                  
Subjt:  TDSYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKGIATQVSKDRDPKELW---------------AWMIDNL------------------

Query:  -----SKEELERGASILWSIWNFINKIIHN------------------NLTPASEALQISIETNIKER-----ILTFDKQRNSRTARSQVSQA--MWERP
             SK E+E+    LW+IW   N+I+H                   N   A + ++++  T    R     + +F  Q  S + + QV  A   W  P
Subjt:  -----SKEELERGASILWSIWNFINKIIHN------------------NLTPASEALQISIETNIKER-----ILTFDKQRNSRTARSQVSQA--MWERP

Query:  KLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPL-VIESDALEVIKVLKKEEEDLSHLKTI
        + N +KLN D A     K  GIG I+RDS G+++ +  K     +    +EA A+F+ LN +     Q+++P+ ++ESDAL V+  L+     +S    +
Subjt:  KLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPL-VIESDALEVIKVLKKEEEDLSHLKTI

Query:  TEAVSLAVDEAFSVSFIHCNIILNTVAHCVANEACNFD
           V   +     V+  H     N  AHC+A  A   D
Subjt:  TEAVSLAVDEAFSVSFIHCNIILNTVAHCVANEACNFD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.5e-0423.87Show/hide
Query:  ILWSIWNFINKIIHNNLT-PASEALQISIETNIKERILTFDKQRNSRTARSQVSQ---AMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICS
        +LW +W   N+++       A E L+ ++E + +E   +  ++   + +  QV +     W+ P     K N+D  W  ++ R GIGWI+R+ +G ++  
Subjt:  ILWSIWNFINKIIHNNLT-PASEALQISIETNIKERILTFDKQRNSRTARSQVSQ---AMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICS

Query:  EMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIP-LVIESDALEVIKVLKKEE
          + +  P    +LEA+   E L     T  +     ++ ESDA  ++ +L  ++
Subjt:  EMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIP-LVIESDALEVIKVLKKEE

AT4G29090.1 Ribonuclease H-like superfamily protein2.6e-0625.13Show/hide
Query:  ILWSIWNFINKIIHNNLTPASEALQISIETNIKE-RILTFDKQRNSRTARSQVSQAMWERPKLNQW-KLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEM
        +LW +W   N+++       ++ +    E +++E RI T  +   ++   ++ S   W RP  +QW K N+D  W   ++R GIGW++R+  G +    M
Subjt:  ILWSIWNFINKIIHNNLTPASEALQISIETNIKE-RILTFDKQRNSRTARSQVSQAMWERPKLNQW-KLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEM

Query:  KVISNPWPIKMLEAQAIFEGLN-CIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAFSVSFIHCNIILNTVAHCVANEACNF
           + P    +LEA+   E +   +    + +   ++ ESD+  +I++L  +E   S LK   + +   + +   V F+      NT+A  VA E+ +F
Subjt:  KVISNPWPIKMLEAQAIFEGLN-CIKDTCKQRRIPLVIESDALEVIKVLKKEEEDLSHLKTITEAVSLAVDEAFSVSFIHCNIILNTVAHCVANEACNF

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.2e-0424.74Show/hide
Query:  ILWSIWNFINKIIHNNL-TPASEALQISIETNIK--ERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMI
        ++W IW   N ++ N+  T     +++++    +  +  +T ++Q  +R A        W  P  ++ K N D +  E++   G+GWI+R+S G++I
Subjt:  ILWSIWNFINKIIHNNL-TPASEALQISIETNIK--ERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGACAGTTATAGGAGGTCTTCTTCGCCTTGGAAATCAATATGGTCTTTACAAGCTCTCCCTAGACTAAAAGTTGGGGTGTTGAAGATTGTGAACGATATTATCCC
CACAAAGGCTAATATCCAGTCCAAAGGGATTGCTACTCAAGTCTCGAAAGATCGGGATCCGAAGGAGCTTTGGGCGTGGATGATCGACAATCTGTCAAAGGAAGAGTTGG
AAAGGGGAGCGTCTATTCTTTGGAGCATTTGGAATTTCATAAACAAGATTATCCACAACAATTTGACCCCAGCATCAGAAGCTTTGCAAATTTCCATTGAAACGAATATA
AAGGAGCGAATTCTTACATTCGACAAGCAGAGGAACTCGAGAACAGCGAGGAGCCAAGTGAGTCAGGCTATGTGGGAAAGACCTAAATTGAATCAGTGGAAGTTGAATTC
GGATGTCGCCTGGTTCGAGAAAGACAAGCGTAGAGGCATTGGATGGATCGTGCGTGACTCAACTGGATCCATGATTTGTTCTGAGATGAAGGTCATTTCCAATCCATGGC
CAATTAAAATGTTGGAGGCTCAAGCGATCTTTGAAGGCCTCAATTGCATCAAGGACACCTGCAAGCAACGCAGAATTCCCCTAGTTATCGAATCAGACGCCTTGGAGGTC
ATAAAGGTGCTGAAGAAAGAAGAGGAAGACTTATCTCACTTGAAGACGATCACGGAAGCCGTCTCCCTTGCTGTCGATGAAGCCTTTTCCGTGAGTTTTATCCACTGCAA
TATAATTTTGAACACAGTAGCGCACTGTGTTGCCAATGAGGCTTGTAATTTTGATTTTGTCGTAGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTGACAGTTATAGGAGGTCTTCTTCGCCTTGGAAATCAATATGGTCTTTACAAGCTCTCCCTAGACTAAAAGTTGGGGTGTTGAAGATTGTGAACGATATTATCCC
CACAAAGGCTAATATCCAGTCCAAAGGGATTGCTACTCAAGTCTCGAAAGATCGGGATCCGAAGGAGCTTTGGGCGTGGATGATCGACAATCTGTCAAAGGAAGAGTTGG
AAAGGGGAGCGTCTATTCTTTGGAGCATTTGGAATTTCATAAACAAGATTATCCACAACAATTTGACCCCAGCATCAGAAGCTTTGCAAATTTCCATTGAAACGAATATA
AAGGAGCGAATTCTTACATTCGACAAGCAGAGGAACTCGAGAACAGCGAGGAGCCAAGTGAGTCAGGCTATGTGGGAAAGACCTAAATTGAATCAGTGGAAGTTGAATTC
GGATGTCGCCTGGTTCGAGAAAGACAAGCGTAGAGGCATTGGATGGATCGTGCGTGACTCAACTGGATCCATGATTTGTTCTGAGATGAAGGTCATTTCCAATCCATGGC
CAATTAAAATGTTGGAGGCTCAAGCGATCTTTGAAGGCCTCAATTGCATCAAGGACACCTGCAAGCAACGCAGAATTCCCCTAGTTATCGAATCAGACGCCTTGGAGGTC
ATAAAGGTGCTGAAGAAAGAAGAGGAAGACTTATCTCACTTGAAGACGATCACGGAAGCCGTCTCCCTTGCTGTCGATGAAGCCTTTTCCGTGAGTTTTATCCACTGCAA
TATAATTTTGAACACAGTAGCGCACTGTGTTGCCAATGAGGCTTGTAATTTTGATTTTGTCGTAGGTTGA
Protein sequenceShow/hide protein sequence
MTDSYRRSSSPWKSIWSLQALPRLKVGVLKIVNDIIPTKANIQSKGIATQVSKDRDPKELWAWMIDNLSKEELERGASILWSIWNFINKIIHNNLTPASEALQISIETNI
KERILTFDKQRNSRTARSQVSQAMWERPKLNQWKLNSDVAWFEKDKRRGIGWIVRDSTGSMICSEMKVISNPWPIKMLEAQAIFEGLNCIKDTCKQRRIPLVIESDALEV
IKVLKKEEEDLSHLKTITEAVSLAVDEAFSVSFIHCNIILNTVAHCVANEACNFDFVVG