; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC09G160890 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC09G160890
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionDNA glycosylase superfamily protein
Genome locationCiama_Chr09:2929189..2929716
RNA-Seq ExpressionCaUC09G160890
SyntenyCaUC09G160890
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0008725 - DNA-3-methyladenine glycosylase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593364.1 hypothetical protein SDJN03_12840, partial [Cucurbita argyrosperma subsp. sororia]1.7e-6687.95Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTGTC
        NSRASSARGTRQRGPNLRRK SSTVK A+KAVE V A  VVA  +T GCLE KKRCAWVT NT  C
Subjt:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTGTC

XP_004136097.2 uncharacterized protein LOC101205558 [Cucumis sativus]6.6e-7493.33Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VV DT GCLESKKRCAWVTPNT  C
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC

XP_008461179.1 PREDICTED: probable GMP synthase [glutamine-hydrolyzing] [Cucumis melo]2.3e-7493.94Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VVADT GCLESKKRCAWVTPNT  C
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC

XP_022960311.1 uncharacterized protein LOC111461081 [Cucurbita moschata]1.7e-6687.95Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTGTC
        NSRASSARGTRQRGPNLRRK SSTVK A+KAVE V A  VVA  +T GCLE KKRCAWVT NT  C
Subjt:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTGTC

XP_038900164.1 probable GMP synthase [glutamine-hydrolyzing] [Benincasa hispida]9.5e-7391.07Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKAR VETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKA-----VESVAAVGVVADTAGCLESKKRCAWVTPNTGTC
        SRASSARGTRQRGPNLRRKQ+STVKGA K+     VES A V VVADT GCLESKKRCAWVTPNT  C
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKA-----VESVAAVGVVADTAGCLESKKRCAWVTPNTGTC

TrEMBL top hitse value%identityAlignment
A0A0A0K8L6 Uncharacterized protein3.2e-7493.33Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VV DT GCLESKKRCAWVTPNT  C
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC

A0A1S3CE52 probable GMP synthase [glutamine-hydrolyzing]1.1e-7493.94Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VVADT GCLESKKRCAWVTPNT  C
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC

A0A5A7UYZ9 Putative GMP synthase1.1e-7493.94Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC
        SRASSARGTRQRGPNLRRKQ STVKGADKAVE V   +V VVADT GCLESKKRCAWVTPNT  C
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVA--AVGVVADTAGCLESKKRCAWVTPNTGTC

A0A6J1H7A2 uncharacterized protein LOC1114610818.4e-6787.95Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTGTC
        NSRASSARGTRQRGPNLRRK SSTVK A+KAVE V A  VVA  +T GCLE KKRCAWVT NT  C
Subjt:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVA--DTAGCLESKKRCAWVTPNTGTC

A0A6J1KPI7 uncharacterized protein LOC1114975441.9e-6687.35Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF
        MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRK GVKPLKKLEKP QE ESKDKRVPLSPPQCV TVPSVLRQQDRHQAIL LSMNASCSSDASSDSF
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCV-TVPSVLRQQDRHQAILNLSMNASCSSDASSDSF

Query:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGV--VADTAGCLESKKRCAWVTPNTGTC
        NSRASSARGTRQRGPNLRRK SSTVK A+KA+E V A  V  VA+T GCLE KKRCAWVT NT  C
Subjt:  NSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGV--VADTAGCLESKKRCAWVTPNTGTC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15970.1 DNA glycosylase superfamily protein4.6e-0932.78Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---VESKDKR-----VPLSP----PQCVTVPSVLRQQDRHQAILNLSMNA
        MS PPR RS+N  + + R VLGPTGNK +    RKP   P  KLEKP  E   ++SKD++      P SP     QC ++ S + +++      + S +A
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQE---VESKDKR-----VPLSP----PQCVTVPSVLRQQDRHQAILNLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTCTLSLN
        S S ++S  S  S +S  +  R+ G     ++ S  K  +K      A G           +KRCAW+TP    C ++ +
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTCTLSLN

AT1G80850.1 DNA glycosylase superfamily protein2.1e-0932.14Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN
        MS PPR+RS++ +D + R VLGP GNK +     KP  KP+ +  K     E   +  PLSPP       +LR+         +SM AS SSDASS   +
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFN

Query:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTCTLSLN
        S  S    +   G  + R+  S    +             A    C + +KRCAW+TP +  C ++ +
Subjt:  SRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTCTLSLN

AT5G57970.1 DNA glycosylase superfamily protein3.9e-1638.29Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTC
        S SSDAS DSF+SRAS+ R  R      R K   +         SV + G +       E+KKRC WVTPN+  C
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTC

AT5G57970.2 DNA glycosylase superfamily protein3.9e-1638.29Show/hide
Query:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA
        MSG PR++SMNVA++++R  LG T  KA    T K   K L+KLE+        D++   + P            +   S+LR   RH+  L  NLS+NA
Subjt:  MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPP----------QCVTVPSVLRQQDRHQAIL--NLSMNA

Query:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTC
        S SSDAS DSF+SRAS+ R  R      R K   +         SV + G +       E+KKRC WVTPN+  C
Subjt:  SCSSDASSDSFNSRASSARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTACAGGGAACAAAGCGCGTACAGTAGAGACTAGAAAA
CCTGGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCGCCGCCTCAATGCGTTACAGTTCCATCG
GTTTTGAGGCAACAGGACCGCCACCAAGCGATTCTCAATCTGTCGATGAATGCTTCGTGTTCTTCTGATGCGTCGTCTGATTCGTTTAATAGCCGGGCATCTAGT
GCAAGGGGTACGAGACAGCGCGGTCCGAATTTGAGGAGAAAGCAAAGTAGTACGGTTAAGGGGGCTGACAAGGCCGTTGAAAGTGTGGCGGCGGTGGGGGTGGTG
GCGGATACAGCTGGTTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGGTACTTGTACATTGAGTTTAAATTTTCTGTTTGTTTCTGTTAGC
TAA
mRNA sequenceShow/hide mRNA sequence
ATGTCAGGCCCTCCGAGAATCCGGTCTATGAATGTGGCGGATTCCGATTCACGACCGGTACTTGGGCCTACAGGGAACAAAGCGCGTACAGTAGAGACTAGAAAA
CCTGGTGTGAAGCCATTGAAGAAGCTTGAAAAGCCTCGTCAAGAAGTTGAATCAAAGGACAAAAGGGTGCCATTGTCGCCGCCTCAATGCGTTACAGTTCCATCG
GTTTTGAGGCAACAGGACCGCCACCAAGCGATTCTCAATCTGTCGATGAATGCTTCGTGTTCTTCTGATGCGTCGTCTGATTCGTTTAATAGCCGGGCATCTAGT
GCAAGGGGTACGAGACAGCGCGGTCCGAATTTGAGGAGAAAGCAAAGTAGTACGGTTAAGGGGGCTGACAAGGCCGTTGAAAGTGTGGCGGCGGTGGGGGTGGTG
GCGGATACAGCTGGTTGCTTAGAGTCCAAAAAACGATGTGCTTGGGTAACGCCTAATACAGGTACTTGTACATTGAGTTTAAATTTTCTGTTTGTTTCTGTTAGC
TAA
Protein sequenceShow/hide protein sequence
MSGPPRIRSMNVADSDSRPVLGPTGNKARTVETRKPGVKPLKKLEKPRQEVESKDKRVPLSPPQCVTVPSVLRQQDRHQAILNLSMNASCSSDASSDSFNSRASS
ARGTRQRGPNLRRKQSSTVKGADKAVESVAAVGVVADTAGCLESKKRCAWVTPNTGTCTLSLNFLFVSVS