CuGenDBv2

ClCG05G010420 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G010420
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: RNase H domain-containing protein
Genome location: CG_Chr05:11601576..11605279
RNA-Seq Expression: ClCG05G010420
Synteny: ClCG05G010420
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
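
The genome location field follows a `chromosome:start..end` pattern. The sketch below shows one way to split that string and derive the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("CG_Chr05:11601576..11605279")
# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be confirmed before relying on the number.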


Homology
GenBank top hits
XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 2.5e-17; % identity: 35.98)
Query:  GEQTDNKGKG----LILPTHIPTHIGRQ--PPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISIS
        G  T+ KGK     L L   I  + G Q  PP S+ WKLN +AAW A+TN+GG GW+LRD    V     + +   + +  L+VMAIC GL+ +      
Subjt:  GEQTDNKGKG----LILPTHIPTHIGRQ--PPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISIS

Query:  NTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLE
             SD LE I LL+    + +EI + +EE     +++ I+S  H+ R +  +AH +AR+++E
Subjt:  NTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLE

XP_022148737.1 uncharacterized protein LOC111017329 [Momordica charantia] (e value: 3.1e-12; % identity: 30.53)
Query:  SFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKD
        +FWKLN  AAW AN   GG GW++R+    ++  G + +   + +  L++MAI  G++ + S S  + +  S+ LE I L+  +  N++EI + +++ K+
Subjt:  SFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKD

Query:  RGRELGIISFSHVCRSSTVLAHCVARKSLEG
         G    +  F HV R    +A  +A ++ +G
Subjt:  RGRELGIISFSHVCRSSTVLAHCVARKSLEG

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 9.0e-12; % identity: 33.11)
Query:  QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLE-GSDCLEVIILLNDVATNLSEIFFF
        +PP    W LN  A+WS +T+ GG GW++R     + L G +FV  C  VKLL+  AI  GL+ L+++ +   L   +D  EV  LLN    +L++  + 
Subjt:  QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLE-GSDCLEVIILLNDVATNLSEIFFF

Query:  IEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLEGWESSILSTPYP
        +EE  +      I++F+ V R +   AH +A+++    ES I    +P
Subjt:  IEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLEGWESSILSTPYP

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 7.9e-16; % identity: 33.14)
Query:  GEQTDNKGKGLILPTHIPTHIGR------QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISIS
        G  T+ KGK      H+   IG       +PP S+ WKLN  AAW A+TN+GG GW+LRD    V     + +   + +  L+VMAIC GL+ +      
Subjt:  GEQTDNKGKGLILPTHIPTHIGR------QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISIS

Query:  NTLE--------GSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLE
           +         SD LE I LL+    + +EI + +EE      ++ I+S  H+ R +  +AH +AR+++E
Subjt:  NTLE--------GSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLE

XP_038886170.1 uncharacterized protein LOC120076417 [Benincasa hispida] (e value: 1.4e-44; % identity: 54.49)
Query:  MVFFRQLTDVESICDMIARHVTDPCFCGEQTDNKGKGLILPTHIPTHIGRQPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKV
        ++F R+  D E+ICDMI RH++  C  GE  D KGKGL  P   PT +G  P    +WKLN+ A+W++  ++ G GWV  DHL  V + GLKFV RCQKV
Subjt:  MVFFRQLTDVESICDMIARHVTDPCFCGEQTDNKGKGLILPTHIPTHIGRQPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKV

Query:  KLLKVMAICFGLKTLSSISISNTLEGSDCL-EVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAH
         +L+ +AICFGL+ LSSI ISN +  SDCL EVI LLND   +LSE+ F  EEAKDRG  LG+ISFSHV R   VLA+
Subjt:  KLLKVMAICFGLKTLSSISISNTLEGSDCL-EVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAH

TrEMBL top hits
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 1.2e-17; % identity: 35.98)
Query:  GEQTDNKGKG----LILPTHIPTHIGRQ--PPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISIS
        G  T+ KGK     L L   I  + G Q  PP S+ WKLN +AAW A+TN+GG GW+LRD    V     + +   + +  L+VMAIC GL+ +      
Subjt:  GEQTDNKGKG----LILPTHIPTHIGRQ--PPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISIS

Query:  NTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLE
             SD LE I LL+    + +EI + +EE     +++ I+S  H+ R +  +AH +AR+++E
Subjt:  NTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLE

A0A6J1D5W1 uncharacterized protein LOC111017329 (e value: 1.5e-12; % identity: 30.53)
Query:  SFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKD
        +FWKLN  AAW AN   GG GW++R+    ++  G + +   + +  L++MAI  G++ + S S  + +  S+ LE I L+  +  N++EI + +++ K+
Subjt:  SFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKD

Query:  RGRELGIISFSHVCRSSTVLAHCVARKSLEG
         G    +  F HV R    +A  +A ++ +G
Subjt:  RGRELGIISFSHVCRSSTVLAHCVARKSLEG

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 8.5e-08; % identity: 41.38)
Query:  QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLEGSDCLEVIILLN
        +PP S+ WKLN  AAW A+TN+ G GW+LRD    V   G + +   + +  L+VMAIC GL+ +           SD LE I LL+
Subjt:  QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLEGSDCLEVIILLN

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 4.4e-12; % identity: 33.11)
Query:  QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLE-GSDCLEVIILLNDVATNLSEIFFF
        +PP    W LN  A+WS +T+ GG GW++R     + L G +FV  C  VKLL+  AI  GL+ L+++ +   L   +D  EV  LLN    +L++  + 
Subjt:  QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISISNTLE-GSDCLEVIILLNDVATNLSEIFFF

Query:  IEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLEGWESSILSTPYP
        +EE  +      I++F+ V R +   AH +A+++    ES I    +P
Subjt:  IEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLEGWESSILSTPYP

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 3.8e-16; % identity: 33.14)
Query:  GEQTDNKGKGLILPTHIPTHIGR------QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISIS
        G  T+ KGK      H+   IG       +PP S+ WKLN  AAW A+TN+GG GW+LRD    V     + +   + +  L+VMAIC GL+ +      
Subjt:  GEQTDNKGKGLILPTHIPTHIGR------QPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICFGLKTLSSISIS

Query:  NTLE--------GSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLE
           +         SD LE I LL+    + +EI + +EE      ++ I+S  H+ R +  +AH +AR+++E
Subjt:  NTLE--------GSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLE

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGTGTTTTTCAGACAACTGACTGATGTTGAGTCTATTTGTGATATGATTGCAAGGCATGTTACTGATCCTTGCTTTTGTGGTGAGCAAACTGATAATAAAGGAAAGGG
GTTGATCCTTCCCACCCACATCCCAACTCATATTGGTCGGCAACCCCCTTATTCATCTTTTTGGAAGTTGAATGTTCATGCAGCTTGGAGCGCTAATACCAATTCAGGTG
GGTGGGGTTGGGTGCTTCGTGACCATTTGGATCATGTGCGCTTAGTAGGGTTGAAGTTTGTTCCTAGATGCCAAAAGGTGAAGCTCCTTAAAGTTATGGCAATTTGTTTT
GGGCTTAAGACTCTATCTTCTATAAGTATATCGAATACATTAGAGGGATCTGATTGTTTGGAGGTCATCATTCTATTGAATGATGTTGCTACAAATCTTTCTGAAATCTT
CTTTTTTATCGAGGAGGCTAAGGATAGAGGTCGAGAGCTAGGAATCATCTCCTTTTCCCATGTCTGCCGTAGTTCGACTGTGTTGGCACACTGTGTTGCGCGTAAATCTT
TGGAGGGTTGGGAGTCCTCAATTCTGTCTACTCCTTATCCTGAATGCTCAGAGTATCTCGAGACTAGAGTAGGGAAAGGCAGGTCGGAGGTCAAGGCCACTGAGCACCTG
TAG
mRNA sequence
ATGGTGTTTTTCAGACAACTGACTGATGTTGAGTCTATTTGTGATATGATTGCAAGGCATGTTACTGATCCTTGCTTTTGTGGTGAGCAAACTGATAATAAAGGAAAGGG
GTTGATCCTTCCCACCCACATCCCAACTCATATTGGTCGGCAACCCCCTTATTCATCTTTTTGGAAGTTGAATGTTCATGCAGCTTGGAGCGCTAATACCAATTCAGGTG
GGTGGGGTTGGGTGCTTCGTGACCATTTGGATCATGTGCGCTTAGTAGGGTTGAAGTTTGTTCCTAGATGCCAAAAGGTGAAGCTCCTTAAAGTTATGGCAATTTGTTTT
GGGCTTAAGACTCTATCTTCTATAAGTATATCGAATACATTAGAGGGATCTGATTGTTTGGAGGTCATCATTCTATTGAATGATGTTGCTACAAATCTTTCTGAAATCTT
CTTTTTTATCGAGGAGGCTAAGGATAGAGGTCGAGAGCTAGGAATCATCTCCTTTTCCCATGTCTGCCGTAGTTCGACTGTGTTGGCACACTGTGTTGCGCGTAAATCTT
TGGAGGGTTGGGAGTCCTCAATTCTGTCTACTCCTTATCCTGAATGCTCAGAGTATCTCGAGACTAGAGTAGGGAAAGGCAGGTCGGAGGTCAAGGCCACTGAGCACCTG
TAG
Protein sequence
MVFFRQLTDVESICDMIARHVTDPCFCGEQTDNKGKGLILPTHIPTHIGRQPPYSSFWKLNVHAAWSANTNSGGWGWVLRDHLDHVRLVGLKFVPRCQKVKLLKVMAICF
GLKTLSSISISNTLEGSDCLEVIILLNDVATNLSEIFFFIEEAKDRGRELGIISFSHVCRSSTVLAHCVARKSLEGWESSILSTPYPECSEYLETRVGKGRSEVKATEHL
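
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The sketch below illustrates that check on two short excerpts of the CDS, assuming the standard genetic code (NCBI translation table 1). The record does not say which table or tool produced the protein, and `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the CDS listed in this record.
print(translate("ATGGTGTTTTTCAGACAACTGACTGATGTT"))
print(translate("GAGCACCTGTAG"))
```

Passing the full CDS with its line breaks intact works as well, since whitespace is removed before translation; the result can then be compared with the listed protein sequence.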