; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC05G097610 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC05G097610
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionBeta-1,3-N-Acetylglucosaminyltransferase family protein
Genome locationCmU531Chr05:27215171..27218843
RNA-Seq ExpressionCmUC05G097610
SyntenyCmUC05G097610
Gene Ontology termsGO:0001709 - cell fate determination (biological process)
InterPro domainsIPR040361 - Tapetum determinant 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063321.1 TPD1 protein-like protein 1A-like [Cucumis melo var. makuwa]1.7e-5888.98Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLSTMIAAI+IL LSFINEG A GSCSLDTINIGTQRSGREIGGQPEWNVQ+INNCDCPQK+I+LSC GFQT+EPVDPSILSKQ D CLLINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSVSFSYAWDPPVIM PRFSV  CPT
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

XP_022141679.1 uncharacterized protein LOC111011980 [Momordica charantia]4.1e-5788.89Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLSTMIAAILIL LSFIN+GSA G+CSLDTINIGT+RSGREIGGQPEWNVQVINNCDCPQK+IVLSCPGFQT EPV PSILSKQGDTCLLING  VQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCP
         +SVSFSYAWDPP IMFPRFSV QCP
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCP

XP_022937408.1 uncharacterized protein LOC111443708 [Cucurbita moschata]3.1e-5788.19Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLS MIAAILILFLSFINEG    SCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQK+IVLSCPGFQTVEPVDPSI+SKQGD C LINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSV FSYAWDPP I+FPRFSV QC T
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

XP_022976215.1 uncharacterized protein LOC111476671 [Cucurbita maxima]1.3e-5889.76Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLS MIAAILILFLSFINEG   GSCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQK+IVLSCPGFQTVEPVDPSI+SKQGD CLLINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSV FSYAWDPP I+FPRFSV QC T
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

XP_038899927.1 uncharacterized protein LOC120087115 [Benincasa hispida]3.9e-6091.34Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINL TMIAAI ILFLSFINEGSA GSCSLDTINIGTQRSGREIGGQPEWNVQVINNC+CPQK+IVLSCPGFQT+EPVDPSILSKQ DTCLLINGG VQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSVSFSYAWDPP+IMFPR SV QCPT
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

TrEMBL top hitse value%identityAlignment
A0A5A7VC88 TPD1 protein-like protein 1A-like8.0e-5988.98Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLSTMIAAI+IL LSFINEG A GSCSLDTINIGTQRSGREIGGQPEWNVQ+INNCDCPQK+I+LSC GFQT+EPVDPSILSKQ D CLLINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSVSFSYAWDPPVIM PRFSV  CPT
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

A0A5D3E811 TPD1 protein-like protein 1A-like2.8e-5689.26Show/hide
Query:  MIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSF
        MIAAI+IL LSFINEG A GSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQK+I+LSC GFQT+EPVDPSILSKQ D CLLINGGIVQPGSSVSF
Subjt:  MIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSF

Query:  SYAWDPPVIMFPRFSVPQCPT
        SYAWDPPVIM PRFSV  CPT
Subjt:  SYAWDPPVIMFPRFSVPQCPT

A0A6J1CJF7 uncharacterized protein LOC1110119802.0e-5788.89Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLSTMIAAILIL LSFIN+GSA G+CSLDTINIGT+RSGREIGGQPEWNVQVINNCDCPQK+IVLSCPGFQT EPV PSILSKQGDTCLLING  VQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCP
         +SVSFSYAWDPP IMFPRFSV QCP
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCP

A0A6J1FFZ8 uncharacterized protein LOC1114437081.5e-5788.19Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLS MIAAILILFLSFINEG    SCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQK+IVLSCPGFQTVEPVDPSI+SKQGD C LINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSV FSYAWDPP I+FPRFSV QC T
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

A0A6J1IIV5 uncharacterized protein LOC1114766716.1e-5989.76Show/hide
Query:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP
        MINLS MIAAILILFLSFINEG   GSCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQK+IVLSCPGFQTVEPVDPSI+SKQGD CLLINGGIVQP
Subjt:  MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPVIMFPRFSVPQCPT
        GSSV FSYAWDPP I+FPRFSV QC T
Subjt:  GSSVSFSYAWDPPVIMFPRFSVPQCPT

SwissProt top hitse value%identityAlignment
Q1G3T1 TPD1 protein homolog 15.8e-0634.12Show/hide
Query:  CSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKKIVLSCPGFQTVEPVDPSILSK-QGDTCLLINGGIVQPGSSVSFSYA
        CS D I +    +     G P + V++ N+C  DC   +I +SC  F +V  V+P +  +   D CL+ +G  + PG S+SF YA
Subjt:  CSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKKIVLSCPGFQTVEPVDPSILSK-QGDTCLLINGGIVQPGSSVSFSYA

Arabidopsis top hitse value%identityAlignment
AT1G32583.1 FUNCTIONS IN: molecular_function unknown4.1e-0734.12Show/hide
Query:  CSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKKIVLSCPGFQTVEPVDPSILSK-QGDTCLLINGGIVQPGSSVSFSYA
        CS D I +    +     G P + V++ N+C  DC   +I +SC  F +V  V+P +  +   D CL+ +G  + PG S+SF YA
Subjt:  CSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKKIVLSCPGFQTVEPVDPSILSK-QGDTCLLINGGIVQPGSSVSFSYA

AT4G32090.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein1.4e-2347.06Show/hide
Query:  ILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW
        +++L LS ++ G  +  C+   I IG  R+GREIGGQPEW V VIN C+C QK + LSC GF   +PV P +L  QG+TCL+I G  +  G++  F+YA 
Subjt:  ILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW

Query:  DP
         P
Subjt:  DP

AT4G32100.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein5.7e-1738Show/hide
Query:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAWD
        L+L  +F+ +G    S SL+++++   ++G  +  +PEW V+V+N+  C      LSC  F++V P+D  +LSK GDTCLL NG  +     +SF Y WD
Subjt:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAWD

AT4G32105.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein1.4e-1842.57Show/hide
Query:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW
        L+LFL+F+N+G   G C L+ +++   ++G+ +  +PEW V+V N C +C  +   LSC GFQ+V PV  S+LSK GD CLL  G  + P     F+Y W
Subjt:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW

Query:  D
        D
Subjt:  D

AT4G32110.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein1.4e-1839.6Show/hide
Query:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW
        L+LFL+F+N+G   G CSL+++++   ++G+ +  +PEW V+V N C +C  +   L C GF +V P+D S+L K GD CL+  G  + P   + F Y W
Subjt:  LILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVSFSYAW

Query:  D
        D
Subjt:  D


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAACCTCTCCACCATGATTGCAGCAATCCTCATCCTATTCCTAAGTTTCATCAATGAAGGGTCTGCTATTGGAAGCTGCAGTTTGGACACCATTAACATA
GGAACACAAAGAAGCGGAAGGGAGATTGGAGGGCAACCTGAATGGAACGTGCAAGTAATTAACAATTGCGATTGTCCTCAGAAGAAGATCGTCTTGTCCTGTCCA
GGGTTTCAGACTGTTGAACCCGTTGATCCATCGATTCTTTCAAAGCAAGGGGACACTTGCCTTCTCATAAACGGAGGAATTGTACAACCTGGTTCTTCAGTTTCA
TTTTCCTATGCTTGGGATCCCCCTGTCATCATGTTCCCTCGTTTCTCTGTTCCTCAGTGTCCTACCTGA
mRNA sequenceShow/hide mRNA sequence
AAAATTTTAAATTTACCTAATCTATTAAAGCATATCAATGCCCATGTTAATTTAACAAAGTGCTTATCGTTGAACTCATTTTTAACCCCACCCCAGGCTTGTAAA
TTGTAACTATTAAGCTAACGTTCAATGACTAAAAATCAAAAATTTGAATGTCCACTACACGTATATAAATAAATAAATAAATAAATAATTAAAAAGTTTCAAAAA
TATTTAAGAAAAAGTAAATAACACCAACCCACAATGAGATTATAACTTTTACATTACAAGCTAAACACGAGAACACTTTCCATCTGCACAAGATTTTTTAACCAA
CTTGTTTTAGATCACAAATTAAATTGGATGCACATATCAAACTTGATCTCTTTTGTGTTTGTCCTCTGGAGAAATTAACAATTACCAATGGATAGAAACTCCAAC
TGGCGCAATGGTAAATGGCAAAATAGAATAGAAATTTACAATCACCAATAAATGCCAAAATTGCACTCAAGCCACGTTTATCCCAACTGAAACGCACTCCATCAA
TGGTAAAATAAAAGGCACAATCTAATTGAGGAATAGAAGGTGTTCAATTTGATTGTGGTTGAAAATTACGTGAAACTCACTACTTTTACCGTTACCATTATCACT
ATCGTTACAAATATTGTTACTATAACAATATGAAAATTATGAATCTAGTACATTTATTGTTACTTTTATTTTTACCTTTACCGTTACCGTTAGGGTCAAGTAGTA
ATAGTAACGATAACGGTAACATTAAATTGTGACAAATTTGTAATTCTAAGTAAATGCGATCAAAAGCAATTCAATTACATTTGCGATTCAGACTTACTACAATTG
ATCCAACAATAAAATTTCATTAAACTACACCAATCTTCTTTACAATTTGGTAAACGTAGCCTCAGTTCAATGTGAGATTGAAGAGCGGAGGTTTCAACACAGCCA
AAGCCATCAAACCCACCAGTTTAGTCGAGTTGGGGGATCAATGTGTGGTTAATGATGGCTTTCCAATCTATTCTGCTGTTCTTATCTCCTTCACTACTGCATGGG
ACCCTTCATTAAATTTCAAGCCTTTTTCTCCCAAGTTGCTTGTTCCTAGTTAGTTACTCAAGGTTTTTGCTTTCTCTTTTCTCCAACTTATAGTGTAAGGAGTCC
TTACTCCTTTTATGTTTTGTACTACGAGTCTTTGGCAAAGTTCAATATATTTATAGAGGTTTAAGACTTATTTTGGTCTCTTCGAACTTTCATATATTTAAATGG
TTTTGGAACTTTTTGCATGAGCAAAAGCTGATCAAGGTCAGGTAATGAGGTCGGCCAATAATCCTGATACCAACTTTCAATTGGAGATGGCATTTACTTTTTCTT
ACCTATCAAGCTTCAGCACAAGTCTCGTCACTCGCAGTCCAAGAAAGGTAGTTTCAATCTATACGTCTCCCACTCTCTATTGCAGCAAACCAGATTGGTTCGAGA
TGACTAGTGAGATGTTGGGGGATGGTCTATTGGGTGTTACAAAGAGAGTGTAGTTTGGGTGTTAGGCAGGTTAAGAGGGGGAGGGTTGTATGAAAAAAGATGTTG
ATGTGATATACAATAAATGGAAGGAGCATGAAAAAGATATTCATACTCCTATAATATTGTAGATTTTCCCGAGGGAGAGAGAGGTGTATGAAAAAAATGTAAGAA
AAATTTCCCATACTCTCGTACTACTGAAAATATGCCGAGGGACTAGGGGCAGGAAGAGAGGGGCTATATTAGAAAGAGAGGGAATGGAAAAGGAGTAGAGTCATG
CATGCAAGAGAGAGAAAATGGTGGTCTTGTAATGTTAAAAGAGAGAATGAATGAGTGGAAGGTCATGGGAGAGAGAAGAGAGATGGATTTGATTGGCAGACTTGC
ATGTGAAAGATGGATTTGATTGGTGCATGTGCATGTTAGAAAGGAAAGAAGAGTACTGAGAACTTAATAAGTGAGAAAGAAAAGGAGAATAGGGAAGAGAGAAGA
ATGACAAAAGGTTGAAAGATAATAAAGGAAGAGAGAGGAACACAATAGGAACAGAAATAAAGAGTTTTGGATGCAGGGTCGGTCAGAAGGAGGTGTATATACTCT
CCTAACTAAACAAGGGGTTAGTTTTGGATATTCGGGTTCGGTCATGAACTTGGTCTGATTCAGTCTTTTAATTTAACTCCCCGAATCTTTTGAAGTATACCCTTC
AAACTTATAAAATGGTGTAGCAATTAACTCCAACTATTTAGGGAATTAAAACAATAGAGTTAAGTTATAAGAATTTTGTTAGCATAAAGGGATAATTATTAATTA
TTATGTGCCGAAAGTTGCCTAAATAAAATTTAAACAAAGCATCAATTTGATGGAAAAAGAAAAGAAAAAGGAAACGTCGAGAGAATAATAGGGTATAAAATTACG
TCACGATATGCATCTTCCTATTTTTGTTGTTCTTAGTTACATGGGATGTTCAATGTTTAAAAACGGTCAACCCAAATGAAATATTATGTTGACAGCACATCACAC
TAAAGACTTGCGTGAGAGTTTTAAATTTATTTCTTTTTCTTCGAGACTATTATTAATATAATTATTGCCATAATAACCAATGAGGAAAGCCATTCAAAATACAGA
TATGAAAACACTAACATACTATAATTTCCAAGATAAAGTAATCATAAAAACAAAGAAGTTGGTTTGTAAACCCACTTATCTCAAAAGCACTGCCTGCCAGCAAAA
AGTTCCCTTCAGCTATGATCAACCTCTCCACCATGATTGCAGCAATCCTCATCCTATTCCTAAGTTTCATCAATGAAGGGTCTGCTATTGGAAGCTGCAGTTTGG
ACACCATTAACATAGGAACACAAAGAAGCGGAAGGGAGATTGGAGGGCAACCTGAATGGAACGTGCAAGTAATTAACAATTGCGATTGTCCTCAGAAGAAGATCG
TCTTGTCCTGTCCAGGGTTTCAGACTGTTGAACCCGTTGATCCATCGATTCTTTCAAAGCAAGGGGACACTTGCCTTCTCATAAACGGAGGAATTGTACAACCTG
GTTCTTCAGTTTCATTTTCCTATGCTTGGGATCCCCCTGTCATCATGTTCCCTCGTTTCTCTGTTCCTCAGTGTCCTACCTGAAGATCCAAAAACAATTTTTTTT
AAAAAAAAAAACGGTTATCAATTTAGTCTGAGTTGATATTGTAGTTAAGAAAAAAAAATTGAAAAATAATCGTTCCTTCTGTATGAAAAGAAGAATATAGGTGAT
GTCTTAAGAGTTAAGCCCATATTTCTTGAGGTTGTTTCCTTCACCTGATTGATCTACTAAAGATGGGACTGTCAGCTTATAGTTATTTGGTACAAGTGAGAAATT
TTACCTGTAGGAATGTCTTTTCTCTAAACAAATGTTGCCATCCTCACATGAGTTTGACAATAGAATAGAACCTGCCTTTCTGAAAGATTAATATAATATAAGCAA
CTTGTATAGCTTGTATTTGTACAGCATGATTTCTCATTCAGGGTATGTGACAGTTGGAAAGACAGAGTGTTGTATGTATAAATAAAAGAAGCTTTC
Protein sequenceShow/hide protein sequence
MINLSTMIAAILILFLSFINEGSAIGSCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKKIVLSCPGFQTVEPVDPSILSKQGDTCLLINGGIVQPGSSVS
FSYAWDPPVIMFPRFSVPQCPT