; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G048030 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G048030
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionaxoneme-associated protein mst101(2)-like
Genome locationCiama_Chr02:35892192..35897116
RNA-Seq ExpressionCaUC02G048030
SyntenyCaUC02G048030
Gene Ontology termsGO:0000462 - maturation of SSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (biological process)
GO:0005730 - nucleolus (cellular component)
InterPro domainsIPR027973 - Protein of unknown function DUF4602


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140183.1 uncharacterized protein LOC101203308 [Cucumis sativus]4.4e-8090.67Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+VRQFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDVKHLLR +SSRN +G SRNSDPFRN+DFG+EM G GRRKGGKKKNNKSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

XP_008449630.1 PREDICTED: uncharacterized protein LOC103491457 [Cucumis melo]6.1e-8291.71Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+VRQFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDVKHLLR +SSRN +GPSRNSDPFRN+DFG+EM GKGRRKGGKKKNNKSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

XP_022944773.1 uncharacterized protein LOC111449125 [Cucurbita moschata]1.7e-6882.38Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSRDK AKAGSSR+STDME  M MRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    H NV QFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        G ASS++SR SLGKRKPE QVLKSSEGFFKHGVLDVKHLLR A             P RNSDFG+EMAGKGRRKGGKKKNNKS KKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

XP_022985670.1 uncharacterized protein LOC111483659 [Cucurbita maxima]3.9e-6881.87Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSRDK AKAGSSR+STDME  M MRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPI+KKQKEREQKMVQE    H NV QFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASS++SR SLGKRKPE+QVLKSSEGFFKHGVLDVKHLLR A             P RNSD G+EMAGKGRRKGGKKKNNKS KKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

XP_038900589.1 uncharacterized protein LOC120087770 isoform X1 [Benincasa hispida]1.6e-7788.08Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSR+KRAK GSSR STDMET+MKMRNIKKEIEFLTSSHMSWKD+KEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HSNVRQFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMA SSNSRRSLG+RKPEEQVLKSSEGFFKHGVLDVKHLL  ASSRN +GP+RNSDPFRN+DFG+EM  KGRRKGGKKK+ KSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

TrEMBL top hitse value%identityAlignment
A0A0A0KEY3 Uncharacterized protein2.1e-8090.67Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+VRQFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDVKHLLR +SSRN +G SRNSDPFRN+DFG+EM G GRRKGGKKKNNKSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

A0A1S3BNE0 uncharacterized protein LOC1034914573.0e-8291.71Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+VRQFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDVKHLLR +SSRN +GPSRNSDPFRN+DFG+EM GKGRRKGGKKKNNKSKKKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

A0A5A7V3A9 Uncharacterized protein4.5e-6289.81Show/hide
Query:  SSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDV
        SSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    HS+V QFGGMASSSNSR+S GKR+PEEQVLKSSEGFFKHGVLDV
Subjt:  SSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDV

Query:  KHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        KHLLR +SSRN SGPSRNSDPFRN+DFG+EM GKGRRKGGKKKNNKSKKKGGGKKRH
Subjt:  KHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

A0A6J1FZ06 uncharacterized protein LOC1114491258.4e-6982.38Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSRDK AKAGSSR+STDME  M MRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQE    H NV QFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        G ASS++SR SLGKRKPE QVLKSSEGFFKHGVLDVKHLLR A             P RNSDFG+EMAGKGRRKGGKKKNNKS KKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

A0A6J1JBZ2 uncharacterized protein LOC1114836591.9e-6881.87Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        MHSRDK AKAGSSR+STDME  M MRNIKKEIEFLTSSHMSWKDKKEIES+KIVSLGGKPQKKQRLPLSVARPI+KKQKEREQKMVQE    H NV QFG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH
        GMASS++SR SLGKRKPE+QVLKSSEGFFKHGVLDVKHLLR A             P RNSD G+EMAGKGRRKGGKKKNNKS KKGGGKKRH
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G44820.1 unknown protein2.2e-2947.98Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        ++ R    K      ST ++  +  +NI K++    SSHM+WKDKK +E KK+ +LGGK QK  RLPLSVAR  MKKQK+RE+KM+++    +  + +FG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNS------SGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKR
        G  SSS  + +  KR PEE+VLKS+ G FK GVLDVKHLLRS  S +S        PSR          G  +  KG++KGGK K NK KKKGGGKKR
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNS------SGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKR

AT2G44820.2 unknown protein2.2e-2947.98Show/hide
Query:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG
        ++ R    K      ST ++  +  +NI K++    SSHM+WKDKK +E KK+ +LGGK QK  RLPLSVAR  MKKQK+RE+KM+++    +  + +FG
Subjt:  MHSRDKRAKAGSSRDSTDMETQMKMRNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFG

Query:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNS------SGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKR
        G  SSS  + +  KR PEE+VLKS+ G FK GVLDVKHLLRS  S +S        PSR          G  +  KG++KGGK K NK KKKGGGKKR
Subjt:  GMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLDVKHLLRSASSRNS------SGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTAGAGTTCCAGTTGGTAACACTTTTTGATAACCAAGCCGCACCGTCCGATCAGCACTGGCAGCTCTGTTTTTCTTTTCCTTCTATCTTAGCTTCCTCTGTATT
GCCGTCGCAGATCGCCGTCACACCCCCACTGCCATTTTTCTTTCTCCTCTTCTTATCTCACGCTCTGCTGCAGCCTCACGCCGCCACGCCAACCCCGTCTTCTGCTGCAG
CCATCCGTCAATTGTTTTTGCTTCGCAGTGACAAGATGCATTCCAGGGATAAAAGGGCAAAAGCGGGATCATCTAGGGACTCGACGGATATGGAAACTCAAATGAAAATG
AGAAATATCAAGAAAGAAATTGAATTCCTCACCTCCTCGCATATGTCATGGAAAGACAAAAAGGAGATCGAGAGTAAGAAAATTGTTTCTCTGGGTGGAAAGCCTCAAAA
GAAACAAAGACTGCCTCTAAGTGTAGCACGACCAATCATGAAGAAGCAGAAGGAAAGAGAACAAAAGATGGTACAAGAGGTAAGGAAATCCCATTCGAATGTTAGACAAT
TTGGTGGGATGGCTAGTAGTAGCAACTCTAGAAGATCTTTGGGGAAGAGGAAGCCGGAGGAGCAGGTTCTTAAGTCGAGTGAAGGCTTTTTTAAACATGGTGTGCTTGAT
GTCAAGCATCTACTACGTTCAGCCTCTTCTAGGAATAGTAGTGGCCCTTCTAGGAATAGTGACCCTTTTAGGAATAGTGACTTTGGAAGTGAAATGGCCGGTAAAGGTAG
AAGAAAAGGAGGAAAAAAGAAGAATAATAAAAGTAAGAAAAAGGGTGGCGGTAAGAAACGCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGTAGAGTTCCAGTTGGTAACACTTTTTGATAACCAAGCCGCACCGTCCGATCAGCACTGGCAGCTCTGTTTTTCTTTTCCTTCTATCTTAGCTTCCTCTGTATT
GCCGTCGCAGATCGCCGTCACACCCCCACTGCCATTTTTCTTTCTCCTCTTCTTATCTCACGCTCTGCTGCAGCCTCACGCCGCCACGCCAACCCCGTCTTCTGCTGCAG
CCATCCGTCAATTGTTTTTGCTTCGCAGTGACAAGATGCATTCCAGGGATAAAAGGGCAAAAGCGGGATCATCTAGGGACTCGACGGATATGGAAACTCAAATGAAAATG
AGAAATATCAAGAAAGAAATTGAATTCCTCACCTCCTCGCATATGTCATGGAAAGACAAAAAGGAGATCGAGAGTAAGAAAATTGTTTCTCTGGGTGGAAAGCCTCAAAA
GAAACAAAGACTGCCTCTAAGTGTAGCACGACCAATCATGAAGAAGCAGAAGGAAAGAGAACAAAAGATGGTACAAGAGGTAAGGAAATCCCATTCGAATGTTAGACAAT
TTGGTGGGATGGCTAGTAGTAGCAACTCTAGAAGATCTTTGGGGAAGAGGAAGCCGGAGGAGCAGGTTCTTAAGTCGAGTGAAGGCTTTTTTAAACATGGTGTGCTTGAT
GTCAAGCATCTACTACGTTCAGCCTCTTCTAGGAATAGTAGTGGCCCTTCTAGGAATAGTGACCCTTTTAGGAATAGTGACTTTGGAAGTGAAATGGCCGGTAAAGGTAG
AAGAAAAGGAGGAAAAAAGAAGAATAATAAAAGTAAGAAAAAGGGTGGCGGTAAGAAACGCCATTGAAATCATTCAATACTTGGAACATTCTTGTTTTACAACGAGAATA
TGGATATCCTTTCTTGCATATTTGTTGCAATTCAATGAAATTTTGCATCTCTTTGTTTC
Protein sequenceShow/hide protein sequence
MDVEFQLVTLFDNQAAPSDQHWQLCFSFPSILASSVLPSQIAVTPPLPFFFLLFLSHALLQPHAATPTPSSAAAIRQLFLLRSDKMHSRDKRAKAGSSRDSTDMETQMKM
RNIKKEIEFLTSSHMSWKDKKEIESKKIVSLGGKPQKKQRLPLSVARPIMKKQKEREQKMVQEVRKSHSNVRQFGGMASSSNSRRSLGKRKPEEQVLKSSEGFFKHGVLD
VKHLLRSASSRNSSGPSRNSDPFRNSDFGSEMAGKGRRKGGKKKNNKSKKKGGGKKRH