CuGenDBv2

CaUC02G047500 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC02G047500
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: RNase H domain-containing protein
Genome location: Ciama_Chr02:35317796..35320831
RNA-Seq Expression: CaUC02G047500
Synteny: CaUC02G047500
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
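
The genome location string above can be split into its parts with a few lines of Python. This is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(loc):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1

if __name__ == "__main__":
    print(parse_location("Ciama_Chr02:35317796..35320831"))
```

Under that assumption the gene spans 3036 bp on Ciama_Chr02.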


Homology
GenBank top hits (e value, % identity, alignment)
KAE8769349.1 Casein kinase I-2-like protein [Hordeum vulgare] | e value: 2.1e-06 | % identity: 31.48
Query:  PPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSW
        P +AE L +++G+ F   +   N+I+E+D L+ +NL N   +          EI  L D F S V +  +RS NG      + A + S+   W S+ P++
Subjt:  PPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSW

Query:  IVSQLRKD
        +VS L  D
Subjt:  IVSQLRKD

MBA0784548.1 hypothetical protein [Gossypium trilobum] | e value: 1.2e-06 | % identity: 31.86
Query:  DLDYTPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFS
        DLD      E+ A +E ++FARA N P ++ E+D +  ++ IN +  +        +EI    + FSS    + NRSCN +TD + K A + + N+ +  
Subjt:  DLDYTPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFS

Query:  DFPSWIVSQLRKD
        D+P  I   + KD
Subjt:  DFPSWIVSQLRKD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] | e value: 9.0e-10 | % identity: 37.82
Query:  TPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAK-NARNCSLNMDWFSDFP
        +P LAEI  I EG++FA A N  +L VESD L AI LI  E         W  EI++L   F+ I F   +R CN     +AK    + S    W  +FP
Subjt:  TPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAK-NARNCSLNMDWFSDFP

Query:  SWIVSQLRKDDCIGFAQVA
        +W++  +++D    FA VA
Subjt:  SWIVSQLRKDDCIGFAQVA

XP_022158489.1 uncharacterized protein LOC111024968 [Momordica charantia] | e value: 1.6e-06 | % identity: 30.99
Query:  GGFSLNERVAKSTENSPLGSNNKGKEDEEITRYEDLDY--TPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRD
        G   +N   A S E S +G   +    E +    D     TP LA+ILAI+EG+  A    V  ++VE+D L+A+NLI  +      A  W E+IR+   
Subjt:  GGFSLNERVAKSTENSPLGSNNKGKEDEEITRYEDLDY--TPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRD

Query:  FFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWI
         F  I F+   R  N +   + +   +      W  DFP W+
Subjt:  FFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWI

XP_027061994.1 uncharacterized protein LOC113688384 [Coffea arabica] | e value: 1.6e-06 | % identity: 32.69
Query:  EILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWIVSQ
        E LAI+  +  A+     N+ V+SDY   ++LIN +  +        E+I  L+  F S VF F  RS N  +  +A+ A   + N +W   FP+W+   
Subjt:  EILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWIVSQ

Query:  LRKD
         RKD
Subjt:  LRKD

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DX30 uncharacterized protein LOC111024874 | e value: 4.4e-10 | % identity: 37.82
Query:  TPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAK-NARNCSLNMDWFSDFP
        +P LAEI  I EG++FA A N  +L VESD L AI LI  E         W  EI++L   F+ I F   +R CN     +AK    + S    W  +FP
Subjt:  TPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAK-NARNCSLNMDWFSDFP

Query:  SWIVSQLRKDDCIGFAQVA
        +W++  +++D    FA VA
Subjt:  SWIVSQLRKDDCIGFAQVA

A0A6J1DZK3 uncharacterized protein LOC111024968 | e value: 7.7e-07 | % identity: 30.99
Query:  GGFSLNERVAKSTENSPLGSNNKGKEDEEITRYEDLDY--TPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRD
        G   +N   A S E S +G   +    E +    D     TP LA+ILAI+EG+  A    V  ++VE+D L+A+NLI  +      A  W E+IR+   
Subjt:  GGFSLNERVAKSTENSPLGSNNKGKEDEEITRYEDLDY--TPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRD

Query:  FFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWI
         F  I F+   R  N +   + +   +      W  DFP W+
Subjt:  FFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWI

A0A6P6S857 uncharacterized protein LOC113688384 | e value: 7.7e-07 | % identity: 32.69
Query:  EILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWIVSQ
        E LAI+  +  A+     N+ V+SDY   ++LIN +  +        E+I  L+  F S VF F  RS N  +  +A+ A   + N +W   FP+W+   
Subjt:  EILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWIVSQ

Query:  LRKD
         RKD
Subjt:  LRKD

A0A6P6SPA8 uncharacterized protein LOC113693334 | e value: 1.7e-06 | % identity: 28.47
Query:  KSTENSPLGSNNKGKEDEEITRYEDLDYTPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNR
        K T  S +  N +GK  +   R E     P + E  AI+ G++ A   N   +  +SD  + +++IN E+D+        E+I ++R  F    F F +R
Subjt:  KSTENSPLGSNNKGKEDEEITRYEDLDYTPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNR

Query:  SCNGLTDRIAKNARNCSLNMDWFSDFPSWIVSQLRKD
        + N     +AK A   + N++W   FP W+    +KD
Subjt:  SCNGLTDRIAKNARNCSLNMDWFSDFPSWIVSQLRKD

A0A7J9FH86 RNase H domain-containing protein | e value: 5.9e-07 | % identity: 31.86
Query:  DLDYTPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFS
        DLD      E+ A +E ++FARA N P ++ E+D +  ++ IN +  +        +EI    + FSS    + NRSCN +TD + K A + + N+ +  
Subjt:  DLDYTPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFS

Query:  DFPSWIVSQLRKD
        D+P  I   + KD
Subjt:  DFPSWIVSQLRKD

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTTATCAAAACATGGTGGAATAAATGGATATATTTTAGGTACAATTGATCATGTGATAAAAGAATGTAAAGATACAGTGATAGATGCTGAACAAGACCATAAAGGACG
GCAATATGGTAGCTGGCTAAGATTATTCCAAGGTCTAAATCCAAGGAGGAGGAGGAAAGCTTCTCCCCAAAAAAACGAAAAAGAAAAATCACCCTTTGAATCCTCTGCTA
TAAAAACTGGTGGTTTCTCCCTCAATGAAAGAGTAGCCAAATCTACAGAAAATTCTCCGCTAGGCAGCAATAACAAAGGAAAAGAAGACGAGGAGATCACTCGTTACGAA
GATCTTGACTACACTCCCCCCTTGGCGGAAATTCTTGCTATTCAGGAGGGCATTAGATTTGCGCGTGCTCAGAATGTACCAAATTTAATTGTGGAATCTGACTACTTACA
AGCAATCAATCTAATCAATTTTGAAGAAGATGAGTTTAGAGGGGCGGACTGTTGGCCGGAAGAGATCAGAAGCTTGAGAGATTTCTTCTCATCCATCGTTTTCAAATTTT
GTAATCGAAGTTGTAATGGCTTGACTGATAGAATAGCAAAAAATGCTAGGAACTGTAGTCTCAATATGGATTGGTTTTCTGACTTTCCATCCTGGATTGTTAGCCAACTC
AGAAAAGATGATTGTATCGGTTTTGCCCAAGTGGCGGATAAATAA
mRNA sequence
ATGTTATCAAAACATGGTGGAATAAATGGATATATTTTAGGTACAATTGATCATGTGATAAAAGAATGTAAAGATACAGTGATAGATGCTGAACAAGACCATAAAGGACG
GCAATATGGTAGCTGGCTAAGATTATTCCAAGGTCTAAATCCAAGGAGGAGGAGGAAAGCTTCTCCCCAAAAAAACGAAAAAGAAAAATCACCCTTTGAATCCTCTGCTA
TAAAAACTGGTGGTTTCTCCCTCAATGAAAGAGTAGCCAAATCTACAGAAAATTCTCCGCTAGGCAGCAATAACAAAGGAAAAGAAGACGAGGAGATCACTCGTTACGAA
GATCTTGACTACACTCCCCCCTTGGCGGAAATTCTTGCTATTCAGGAGGGCATTAGATTTGCGCGTGCTCAGAATGTACCAAATTTAATTGTGGAATCTGACTACTTACA
AGCAATCAATCTAATCAATTTTGAAGAAGATGAGTTTAGAGGGGCGGACTGTTGGCCGGAAGAGATCAGAAGCTTGAGAGATTTCTTCTCATCCATCGTTTTCAAATTTT
GTAATCGAAGTTGTAATGGCTTGACTGATAGAATAGCAAAAAATGCTAGGAACTGTAGTCTCAATATGGATTGGTTTTCTGACTTTCCATCCTGGATTGTTAGCCAACTC
AGAAAAGATGATTGTATCGGTTTTGCCCAAGTGGCGGATAAATAA
Protein sequence
MLSKHGGINGYILGTIDHVIKECKDTVIDAEQDHKGRQYGSWLRLFQGLNPRRRRKASPQKNEKEKSPFESSAIKTGGFSLNERVAKSTENSPLGSNNKGKEDEEITRYE
DLDYTPPLAEILAIQEGIRFARAQNVPNLIVESDYLQAINLINFEEDEFRGADCWPEEIRSLRDFFSSIVFKFCNRSCNGLTDRIAKNARNCSLNMDWFSDFPSWIVSQL
RKDDCIGFAQVADK
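
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: the function name `translate` is hypothetical, it assumes the standard (NCBI table 1) code applies to this nuclear gene, and it checks just the first 30 and last 15 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Assumption: the sequence starts in frame and contains only A, C, G, T.
    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

if __name__ == "__main__":
    # First 30 nt and last 15 nt of the CDS sequence in this record.
    print(translate("ATGTTATCAAAACATGGTGGAATAAATGGA"))
    print(translate("GTGGCGGATAAATAA"))
```

Applied to the full CDS, the same function should reproduce the protein sequence shown in the record if the two are consistent; the 705 nt CDS corresponds to 234 codons plus a stop codon.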