; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC02G048230 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC02G048230
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
Descriptionaxoneme-associated protein mst101(2)-like
Genome locationCmU531Chr02:35835851..35840727
RNA-Seq ExpressionCmUC02G048230
SyntenyCmUC02G048230
Gene Ontology termsGO:0000462 - maturation of SSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (biological process)
GO:0005730 - nucleolus (cellular component)
InterPro domainsIPR027973 - Protein of unknown function DUF4602


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140183.1 uncharacterized protein LOC101203308 [Cucumis sativus]9.8e-8090.16Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH RDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+VRQFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDVKHLLR +SSRN +G SRNSDPFRN+DFG+EM G GRRKGGKKKNNKSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

XP_008449630.1 PREDICTED: uncharacterized protein LOC103491457 [Cucumis melo]1.4e-8191.19Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH RDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+VRQFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDVKHLLR +SSRN +GPSRNSDPFRN+DFG+EM GKGRRKGGKKKNNKSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

XP_022944773.1 uncharacterized protein LOC111449125 [Cucurbita moschata]3.9e-6881.87Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH RDK AKAGSSR+STDME  M MRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    H NV QFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        G ASS++SR SLGKRKPE QVLKSSEGFFKHGVLDVKHLLR A             P RNSDFG+EMAGKGRRKGGKKKNNKS KKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

XP_022985670.1 uncharacterized protein LOC111483659 [Cucurbita maxima]8.6e-6881.35Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH RDK AKAGSSR+STDME  M MRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPI+KKQKEREQKMVQE    H NV QFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASS++SR SLGKRKPE+QVLKSSEGFFKHGVLDVKHLLR A             P RNSD G+EMAGKGRRKGGKKKNNKS KKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

XP_038900589.1 uncharacterized protein LOC120087770 isoform X1 [Benincasa hispida]3.5e-7787.56Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH R+KRAK GSSR STDMET+MKMRNIKKEIEFLTSSHMSWKD+KEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HSNVRQFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMA SSNSRRSLG+RKPEEQVLKSSEGFFKHGVLDVKHLL  ASSRN +GP+RNSDPFRN+DFG+EM  KGRRKGGKKK+ KSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

TrEMBL top hitse value%identityAlignment
A0A0A0KEY3 Uncharacterized protein4.7e-8090.16Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH RDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+VRQFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDVKHLLR +SSRN +G SRNSDPFRN+DFG+EM G GRRKGGKKKNNKSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

A0A1S3BNE0 uncharacterized protein LOC1034914576.6e-8291.19Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH RDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+VRQFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDVKHLLR +SSRN +GPSRNSDPFRN+DFG+EM GKGRRKGGKKKNNKSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

A0A5A7V3A9 Uncharacterized protein4.5e-6289.81Show/hide
Query:  SSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDV
        SSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+V QFGGMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDV
Subjt:  SSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDV

Query:  KHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        KHLLR +SSRN SGPSRNSDPFRN+DFG+EM GKGRRKGGKKKNNKSKKKGGGKKRH
Subjt:  KHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

A0A6J1FZ06 uncharacterized protein LOC1114491251.9e-6881.87Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH RDK AKAGSSR+STDME  M MRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    H NV QFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        G ASS++SR SLGKRKPE QVLKSSEGFFKHGVLDVKHLLR A             P RNSDFG+EMAGKGRRKGGKKKNNKS KKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

A0A6J1JBZ2 uncharacterized protein LOC1114836594.2e-6881.35Show/hide
Query:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MH RDK AKAGSSR+STDME  M MRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPI+KKQKEREQKMVQE    H NV QFG
Subjt:  MHCRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASS++SR SLGKRKPE+QVLKSSEGFFKHGVLDVKHLLR A             P RNSD G+EMAGKGRRKGGKKKNNKS KKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G44820.1 unknown protein9.9e-3050.56Show/hide
Query:  METQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPE
        ++  +  +NI K++    SSHM+WKDKK +E KK+ +LGGK QK  RLPLSVAR  MKKQK+RE+KM+++    +  + +FGG  SSS  + +  KR PE
Subjt:  METQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPE

Query:  EQVLKSSEGFFKHGVLDVKHLLRSASSRNS------SGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKR
        E+VLKS+ G FK GVLDVKHLLRS  S +S        PSR          G  +  KG++KGGK K NK KKKGGGKKR
Subjt:  EQVLKSSEGFFKHGVLDVKHLLRSASSRNS------SGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKR

AT2G44820.2 unknown protein9.9e-3050.56Show/hide
Query:  METQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPE
        ++  +  +NI K++    SSHM+WKDKK +E KK+ +LGGK QK  RLPLSVAR  MKKQK+RE+KM+++    +  + +FGG  SSS  + +  KR PE
Subjt:  METQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPE

Query:  EQVLKSSEGFFKHGVLDVKHLLRSASSRNS------SGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKR
        E+VLKS+ G FK GVLDVKHLLRS  S +S        PSR          G  +  KG++KGGK K NK KKKGGGKKR
Subjt:  EQVLKSSEGFFKHGVLDVKHLLRSASSRNS------SGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTAGAGTTCCAGTTGGTAGCACTTTTTGACAACCAAGCCGCACCGTCCGATCAGCACTGGCAGCTCTGTTTTTCTTTTCCTTCTATCTTAGCTTCCTCTGTATT
GCCGTCGCAGATCGCCGTCACACCCCCACTGCCATTTTTCTTTCTCCTCTTCTTATCTCACGCTCTGCTGCAGCCTCACGCCGCCACGCTAACCCCGTCTTCTGCTGCAG
CCATCCGTCAATTGTTTTTGCTTCGCAGTGACAAGATGCATTGCAGGGATAAAAGGGCAAAAGCGGGATCATCTAGGGACTCGACGGATATGGAAACTCAAATGAAAATG
AGAAATATCAAGAAAGAAATTGAATTCCTCACCTCCTCTCATATGTCATGGAAAGACAAAAAGGAGATCGAGAGTAAGAAAATTGTTTCTCTGGGTGGAAAGCCTCAAAA
GAAACAAAGACTGCCTCTAAGTGTAGCACGACCAATCATGAAGAAGCAGAAGGAAAGAGAACAAAAGATGGTACAAGAGGTAAGGAAATCCCATTCGAATGTTAGACAAT
TTGGTGGGATGGCTAGTAGTAGCAACTCTAGAAGATCTTTGGGGAAGAGGAAGCCGGAGGAGCAGGTTCTTAAGTCGAGTGAAGGCTTTTTTAAACATGGTGTGCTTGAT
GTCAAGCATCTACTACGTTCAGCCTCTTCTAGGAATAGTAGTGGCCCTTCTAGGAATAGTGACCCTTTTAGGAATAGTGACTTTGGAAGTGAAATGGCCGGTAAAGGTAG
AAGAAAAGGAGGAAAAAAGAAGAATAATAAAAGTAAGAAAAAGGGTGGCGGTAAGAAACGCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTAGAGTTCCAGTTGGTAGCACTTTTTGACAACCAAGCCGCACCGTCCGATCAGCACTGGCAGCTCTGTTTTTCTTTTCCTTCTATCTTAGCTTCCTCTGTATT
GCCGTCGCAGATCGCCGTCACACCCCCACTGCCATTTTTCTTTCTCCTCTTCTTATCTCACGCTCTGCTGCAGCCTCACGCCGCCACGCTAACCCCGTCTTCTGCTGCAG
CCATCCGTCAATTGTTTTTGCTTCGCAGTGACAAGATGCATTGCAGGGATAAAAGGGCAAAAGCGGGATCATCTAGGGACTCGACGGATATGGAAACTCAAATGAAAATG
AGAAATATCAAGAAAGAAATTGAATTCCTCACCTCCTCTCATATGTCATGGAAAGACAAAAAGGAGATCGAGAGTAAGAAAATTGTTTCTCTGGGTGGAAAGCCTCAAAA
GAAACAAAGACTGCCTCTAAGTGTAGCACGACCAATCATGAAGAAGCAGAAGGAAAGAGAACAAAAGATGGTACAAGAGGTAAGGAAATCCCATTCGAATGTTAGACAAT
TTGGTGGGATGGCTAGTAGTAGCAACTCTAGAAGATCTTTGGGGAAGAGGAAGCCGGAGGAGCAGGTTCTTAAGTCGAGTGAAGGCTTTTTTAAACATGGTGTGCTTGAT
GTCAAGCATCTACTACGTTCAGCCTCTTCTAGGAATAGTAGTGGCCCTTCTAGGAATAGTGACCCTTTTAGGAATAGTGACTTTGGAAGTGAAATGGCCGGTAAAGGTAG
AAGAAAAGGAGGAAAAAAGAAGAATAATAAAAGTAAGAAAAAGGGTGGCGGTAAGAAACGCCATTGAAATCATTCAATACTTGGAACATTCTTGTTTTACAACGAGAATA
TGGATATCCTTTCTTGCATATTTGTTGCAATTCAATGAAATTTTGCATCTCTTTGGTTC
Protein sequenceShow/hide protein sequence
MDVEFQLVALFDNQAAPSDQHWQLCFSFPSILASSVLPSQIAVTPPLPFFFLLFLSHALLQPHAATLTPSSAAAIRQLFLLRSDKMHCRDKRAKAGSSRDSTDMETQMKM
RNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLD
VKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH