CuGenDBv2

ClCG05G016680 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G016680
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Beta-1,3-N-Acetylglucosaminyltransferase family protein
Genome location: CG_Chr05:28984489..28985592
RNA-Seq Expression: ClCG05G016680
Synteny: ClCG05G016680
Gene Ontology terms: GO:0001709 - cell fate determination (biological process)
InterPro domains: IPR040361 - Tapetum determinant 1


Homology
GenBank top hits (e value, % identity, alignment)
KAA0063321.1 TPD1 protein-like protein 1A-like [Cucumis melo var. makuwa] (e value: 1.7e-58; % identity: 88.98)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLSTMIAAI+IL LSFINEG A GSCSLDTINIGTQRSGREIGGQPEWNVQ+INNCDCPQK+I+LSC GFQT+EPVDPSILSKQ D CLLINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSVSFSYAWDPPVIM PRFSV  CPT
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

XP_022141679.1 uncharacterized protein LOC111011980 [Momordica charantia] (e value: 4.1e-57; % identity: 88.89)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLSTMIAAILIL LSFIN+GSA G+CSLDTINIGT+RSGREIGGQPEWNVQVINNCDCPQK+IVLSCPGFQT EPV PSILSKQGDTCLLING  VQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCP
         +SVSFSYAWDPP IMFPRFSV QCP
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCP

XP_022937408.1 uncharacterized protein LOC111443708 [Cucurbita moschata] (e value: 3.1e-57; % identity: 88.19)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLS MIAAILILFLSFINEG    SCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQK+IVLSCPGFQTVEPVDPSI+SKQGD C LINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSV FSYAWDPP I+FPRFSV QC T
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

XP_022976215.1 uncharacterized protein LOC111476671 [Cucurbita maxima] (e value: 1.3e-58; % identity: 89.76)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLS MIAAILILFLSFINEG   GSCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQK+IVLSCPGFQTVEPVDPSI+SKQGD CLLINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSV FSYAWDPP I+FPRFSV QC T
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

XP_038899927.1 uncharacterized protein LOC120087115 [Benincasa hispida] (e value: 3.9e-60; % identity: 91.34)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINL TMIAAI ILFLSFINEGSA GSCSLDTINIGTQRSGREIGGQPEWNVQVINNC+CPQK+IVLSCPGFQT+EPVDPSILSKQ DTCLLINGG VQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSVSFSYAWDPP+IMFPR SV QCPT
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7VC88 TPD1 protein-like protein 1A-like (e value: 8.0e-59; % identity: 88.98)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLSTMIAAI+IL LSFINEG A GSCSLDTINIGTQRSGREIGGQPEWNVQ+INNCDCPQK+I+LSC GFQT+EPVDPSILSKQ D CLLINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSVSFSYAWDPPVIM PRFSV  CPT
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

A0A5D3E811 TPD1 protein-like protein 1A-like (e value: 2.8e-56; % identity: 89.26)
Query:  MIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSF
        MIAAI+IL LSFINEG A GSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQK+I+LSC GFQT+EPVDPSILSKQ D CLLINGGIVQPGSSVSF
Subjt:  MIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSF

Query:  SYAWDPPVIMFPRFSVPQCPT
        SYAWDPPVIM PRFSV  CPT
Subjt:  SYAWDPPVIMFPRFSVPQCPT

A0A6J1CJF7 uncharacterized protein LOC111011980 (e value: 2.0e-57; % identity: 88.89)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLSTMIAAILIL LSFIN+GSA G+CSLDTINIGT+RSGREIGGQPEWNVQVINNCDCPQK+IVLSCPGFQT EPV PSILSKQGDTCLLING  VQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCP
         +SVSFSYAWDPP IMFPRFSV QCP
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCP

A0A6J1FFZ8 uncharacterized protein LOC111443708 (e value: 1.5e-57; % identity: 88.19)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLS MIAAILILFLSFINEG    SCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQK+IVLSCPGFQTVEPVDPSI+SKQGD C LINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSV FSYAWDPP I+FPRFSV QC T
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

A0A6J1IIV5 uncharacterized protein LOC111476671 (e value: 6.1e-59; % identity: 89.76)
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLS MIAAILILFLSFINEG   GSCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQK+IVLSCPGFQTVEPVDPSI+SKQGD CLLINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSV FSYAWDPP I+FPRFSV QC T
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

SwissProt top hits (e value, % identity, alignment)
Q1G3T1 TPD1 protein homolog 1 (e value: 5.8e-06; % identity: 34.12)
Query:  CSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKKIVLSCPGFQTVEPVDPSILSK-QGDTCLLINGGIVQPGSSVSFSYA
        CS D I +    +     G P + V++ N+C  DC   +I +SC  F +V  V+P +  +   D CL+ +G  + PG S+SF YA
Subjt:  CSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKKIVLSCPGFQTVEPVDPSILSK-QGDTCLLINGGIVQPGSSVSFSYA

Arabidopsis top hits (e value, % identity, alignment)
AT1G32583.1 FUNCTIONS IN: molecular_function unknown (e value: 4.1e-07; % identity: 34.12)
Query:  CSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKKIVLSCPGFQTVEPVDPSILSK-QGDTCLLINGGIVQPGSSVSFSYA
        CS D I +    +     G P + V++ N+C  DC   +I +SC  F +V  V+P +  +   D CL+ +G  + PG S+SF YA
Subjt:  CSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKKIVLSCPGFQTVEPVDPSILSK-QGDTCLLINGGIVQPGSSVSFSYA

AT4G32090.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein (e value: 1.4e-23; % identity: 47.06)
Query:  ILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW
        +++L LS ++ G  +  C+   I IG  R+GREIGGQPEW V VIN C+C QK + LSC GF   +PV P +L  QG+TCL+I G  +  G++  F+YA 
Subjt:  ILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW

Query:  DP
         P
Subjt:  DP

AT4G32100.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein (e value: 5.7e-17; % identity: 38)
Query:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAWD
        L+L  +F+ +G    S SL+++++   ++G  +  +PEW V+V+N+  C      LSC  F++V P+D  +LSK GDTCLL NG  +     +SF Y WD
Subjt:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAWD

AT4G32105.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein (e value: 1.4e-18; % identity: 42.57)
Query:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW
        L+LFL+F+N+G   G C L+ +++   ++G+ +  +PEW V+V N C +C  +   LSC GFQ+V PV  S+LSK GD CLL  G  + P     F+Y W
Subjt:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW

Query:  D
        D
Subjt:  D

AT4G32110.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein (e value: 1.4e-18; % identity: 39.6)
Query:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW
        L+LFL+F+N+G   G CSL+++++   ++G+ +  +PEW V+V N C +C  +   L C GF +V P+D S+L K GD CL+  G  + P   + F Y W
Subjt:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW

Query:  D
        D
Subjt:  D


Sequences
CDS sequence
ATGATCAACCTCTCCACCATGATTGCAGCAATCCTCATCCTATTCCTAAGTTTCATCAATGAAGGGTCTGCTATTGGAAGCTGCAGTTTGGACACCATTAACATAGGAAC
ACAAAGAAGCGGAAGGGAGATTGGAGGGCAACCTGAATGGAACGTGCAAGTAATTAACAATTGCGATTGTCCTCAGAAGAAGATCGTCTTGTCCTGTCCAGGGTTTCAGA
CTGTTGAACCCGTTGATCCATCGATTCTTTCAAAGCAAGGGGACACTTGCCTTCTCATAAACGGAGGAATTGTACAACCTGGTTCTTCAGTTTCATTTTCCTATGCTTGG
GATCCCCCTGTCATCATGTTCCCTCGTTTCTCTGTTCCTCAGTGTCCTACCTGA
mRNA sequence
AATGAGGAAAGCCATTCAAAATACAGATATGAAAACACTAACATACTATAATTTCCAAGATAAAGTAATCATAAAAACAAAGAAGTTGGTTTGTAAACCCACTTATCTCA
AAAGCACTGCCTGCCAGCAAAAAGTTCCCTTCAGCTATGATCAACCTCTCCACCATGATTGCAGCAATCCTCATCCTATTCCTAAGTTTCATCAATGAAGGGTCTGCTAT
TGGAAGCTGCAGTTTGGACACCATTAACATAGGAACACAAAGAAGCGGAAGGGAGATTGGAGGGCAACCTGAATGGAACGTGCAAGTAATTAACAATTGCGATTGTCCTC
AGAAGAAGATCGTCTTGTCCTGTCCAGGGTTTCAGACTGTTGAACCCGTTGATCCATCGATTCTTTCAAAGCAAGGGGACACTTGCCTTCTCATAAACGGAGGAATTGTA
CAACCTGGTTCTTCAGTTTCATTTTCCTATGCTTGGGATCCCCCTGTCATCATGTTCCCTCGTTTCTCTGTTCCTCAGTGTCCTACCTGAAGATCCAAAAACAATTTTTT
TTTAAAAAAAAAACGGTTATCAATTTAGTCTGAGTTGATATTGTAGTTAAGAAAAAAAAATTGAAAAATAATCGTTCCTTCTGTATGAAAAGAAGAATATAGGTGATGTC
TTAAGAGTTAAGCCCATATTTCTTGAGGTTGTTTCCTTCACCTGATTGATCTACTAAAGATGGGACTGTCAGCTTATAGTTATTTGGTACAAGTGAGAAATTTTACCTGT
AGGAATGTCTTTTCTCTAAACAAATGTTGCCATCCTCACATGAGTTTGACAATAGAATAGAACCTGCCTTTCTGAAAGATTAATATAATATAAGCAACTTGTATAGCTTG
TATTTGTACAGCATGATTTCTCATTCAGGGTATGTGACAGTTGGAAAGACAGAGTGTTGTATGTATAAATAAAAGAAGCTTTCATCAAACTGTATTTTGTCATCTGAGTC
CA
Protein sequence
MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW
DPPVIMFPRFSVPQCPT
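
As a consistency check on the sequences listed above, the CDS can be translated and compared with the protein sequence. The following is a minimal Python sketch and is not part of CuGenDB. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and the names `CODON_TABLE`, `translate`, `CDS`, and `PROTEIN` are introduced here for illustration only.

```python
# Standard genetic code (NCBI translation table 1, an assumption for this gene),
# built with codon positions ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# CDS and protein copied from the record for ClCG05G016680.
CDS = (
    "ATGATCAACCTCTCCACCATGATTGCAGCAATCCTCATCCTATTCCTAAGTTTCATCAATGAAGGGTCTGCTATTGGAAGCTGCAGTTTGGACACCATTAACATAGGAAC"
    "ACAAAGAAGCGGAAGGGAGATTGGAGGGCAACCTGAATGGAACGTGCAAGTAATTAACAATTGCGATTGTCCTCAGAAGAAGATCGTCTTGTCCTGTCCAGGGTTTCAGA"
    "CTGTTGAACCCGTTGATCCATCGATTCTTTCAAAGCAAGGGGACACTTGCCTTCTCATAAACGGAGGAATTGTACAACCTGGTTCTTCAGTTTCATTTTCCTATGCTTGG"
    "GATCCCCCTGTCATCATGTTCCCTCGTTTCTCTGTTCCTCAGTGTCCTACCTGA"
)
PROTEIN = (
    "MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW"
    "DPPVIMFPRFSVPQCPT"
)


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


if __name__ == "__main__":
    # The CDS ends with a stop codon, so the translation carries a trailing '*'.
    print(translate(CDS) == PROTEIN + "*")
```

The trailing `*` in the comparison accounts for the terminal stop codon (TGA), which is present in the CDS but not in the protein sequence.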