; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC08G149100 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC08G149100
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionEmbryogenesis-like protein
Genome locationCmU531Chr08:18474098..18474577
RNA-Seq ExpressionCmUC08G149100
SyntenyCmUC08G149100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602205.1 hypothetical protein SDJN03_07438, partial [Cucurbita argyrosperma subsp. sororia]3.1e-6282.74Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLLAGKSQAS SSSS S +RILD ++ P SEFP K R DDLHFSY+MRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

KAG7032887.1 hypothetical protein SDJN02_06937, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-6182.14Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLLAGKSQAS SSSS S +RILD ++ P SEFP K R DDLHF Y+MRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

XP_022964579.1 uncharacterized protein LOC111464557 [Cucurbita moschata]1.2e-6182.14Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLLAGKSQAS SSSS S +RILD ++ P SEFP K R DDLHFSY+MRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYF+EEAECARDAVKEVLDMYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

XP_022990474.1 uncharacterized protein LOC111487326 [Cucurbita maxima]5.3e-6282.74Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLLAGKSQAS SSSS S +RILD  + P SEFP K R DDLHFSYLMRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

XP_038885743.1 uncharacterized protein LOC120076030 [Benincasa hispida]1.1e-7092.45Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMES
        M+PRLRLP YRLL GKSQASISSSS SF+RI DFTV PKSEFPN+ RADDLHFSYLM SNR YSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMES
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMES

Query:  KETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        KETVYFDEEAECARDAVKEVL+MYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
Subjt:  KETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

TrEMBL top hitse value%identityAlignment
A0A1S3C573 uncharacterized protein LOC1034965821.2e-5676.79Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MSPR RLPLYR L G SQ SISSSS  F+RILDF+  P       WRADDLHFSYLM SN         R +S  SG+SEP+FDQVREVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL++YEGLLAKL +SERKALQRSMGLKIEQLKAEL QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

A0A5D3BXL9 Uncharacterized protein1.2e-5676.79Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MSPR RLPLYR L G SQ SISSSS  F+RILDF+  P       WRADDLHFSYLM SN         R +S  SG+SEP+FDQVREVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL++YEGLLAKL +SERKALQRSMGLKIEQLKAEL QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1BV77 uncharacterized protein LOC1110060461.4e-5778.57Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        M+ R RL L R L GKSQAS SSSS S +RI D  + PKSEFPN WRA D HFS LMR N         RRYSG S  SEPEFDQVREVD INLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL+M+EGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1HL89 uncharacterized protein LOC1114645575.7e-6282.14Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLLAGKSQAS SSSS S +RILD ++ P SEFP K R DDLHFSY+MRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYF+EEAECARDAVKEVLDMYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1JQ75 uncharacterized protein LOC1114873262.5e-6282.74Show/hide
Query:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLLAGKSQAS SSSS S +RILD  + P SEFP K R DDLHFSYLMRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

SwissProt top hitse value%identityAlignment
Q9M9H3 Embryogenesis-like protein9.4e-3055.94Show/hide
Query:  SQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAECARDA
        S++S+S +S S    L F      + PN       + S+ + + RRYS GS +  P  D  + VD INLKFAEAREEIE AM++KETVYF+EEAECARDA
Subjt:  SQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAECARDA

Query:  VKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        V EVL+M++GLL K+ E E+ +LQRSMGLKIEQLKAEL+QL+E
Subjt:  VKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

Arabidopsis top hitse value%identityAlignment
AT1G71730.1 unknown protein6.7e-3155.94Show/hide
Query:  SQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAECARDA
        S++S+S +S S    L F      + PN       + S+ + + RRYS GS +  P  D  + VD INLKFAEAREEIE AM++KETVYF+EEAECARDA
Subjt:  SQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAECARDA

Query:  VKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        V EVL+M++GLL K+ E E+ +LQRSMGLKIEQLKAEL+QL+E
Subjt:  VKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCCTCGTTTACGCCTTCCACTCTACAGATTGCTTGCCGGAAAATCTCAGGCTTCAATTTCTTCATCTTCGTATAGTTTTCAGCGAATTCTGGATTTCACTGTGAA
CCCAAAATCTGAGTTTCCTAACAAATGGCGGGCAGATGATCTCCATTTTTCGTATCTAATGAGGTCCAATCGGAGGTACAGTGGTGGTTCGGGCACATCGGAGCCCGAAT
TCGATCAGGTTAGAGAGGTAGACAGGATCAATCTCAAGTTTGCCGAAGCGAGGGAAGAGATAGAGTCAGCTATGGAGTCTAAAGAGACCGTATATTTTGATGAAGAGGCC
GAGTGTGCTCGGGATGCTGTGAAAGAAGTTTTAGATATGTACGAGGGGCTTCTGGCGAAGTTGCCCGAGAGCGAAAGGAAGGCGTTGCAGAGGTCAATGGGGCTTAAGAT
CGAACAGCTGAAGGCTGAGCTCAAACAGCTTGACGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCCTCGTTTACGCCTTCCACTCTACAGATTGCTTGCCGGAAAATCTCAGGCTTCAATTTCTTCATCTTCGTATAGTTTTCAGCGAATTCTGGATTTCACTGTGAA
CCCAAAATCTGAGTTTCCTAACAAATGGCGGGCAGATGATCTCCATTTTTCGTATCTAATGAGGTCCAATCGGAGGTACAGTGGTGGTTCGGGCACATCGGAGCCCGAAT
TCGATCAGGTTAGAGAGGTAGACAGGATCAATCTCAAGTTTGCCGAAGCGAGGGAAGAGATAGAGTCAGCTATGGAGTCTAAAGAGACCGTATATTTTGATGAAGAGGCC
GAGTGTGCTCGGGATGCTGTGAAAGAAGTTTTAGATATGTACGAGGGGCTTCTGGCGAAGTTGCCCGAGAGCGAAAGGAAGGCGTTGCAGAGGTCAATGGGGCTTAAGAT
CGAACAGCTGAAGGCTGAGCTCAAACAGCTTGACGAGTAA
Protein sequenceShow/hide protein sequence
MSPRLRLPLYRLLAGKSQASISSSSYSFQRILDFTVNPKSEFPNKWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEA
ECARDAVKEVLDMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE