; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh19G004830 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh19G004830
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr19:5772449..5775811
RNA-Seq ExpressionCmoCh19G004830
SyntenyCmoCh19G004830
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571853.1 hypothetical protein SDJN03_28581, partial [Cucurbita argyrosperma subsp. sororia]3.4e-9298.93Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
        MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK

Query:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
        MIEEAE VAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKS+ADINEQMKKKEE
Subjt:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE

KAG7011540.1 hypothetical protein SDJN02_26446, partial [Cucurbita argyrosperma subsp. argyrosperma]5.7e-9298.92Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
        MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK

Query:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKE
        MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKS+ADINEQ+KKKE
Subjt:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKE

XP_022952908.1 uncharacterized protein LOC111455450 isoform X1 [Cucurbita moschata]2.3e-93100Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
        MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK

Query:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
        MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
Subjt:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE

XP_022952909.1 uncharacterized protein LOC111455450 isoform X2 [Cucurbita moschata]2.3e-93100Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
        MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK

Query:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
        MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
Subjt:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE

XP_022972002.1 uncharacterized protein LOC111470651 [Cucurbita maxima]1.9e-8290.76Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
        MSSNSGHFWSFTILRFR+LLLQLS+KINTSAPAGG RRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRS+CPRAKWIFGSLLSL VPSWNKWQA EDEAEK
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK

Query:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKK
         IEEAE+VAEVVEKVAELTEKVSAEIGEK+ E+S++K+AAE VEKYSKEIAH ALLAQHILHKVEEWKQKLDKS+ADINEQMKK
Subjt:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKK

TrEMBL top hitse value%identityAlignment
A0A0A0K622 Uncharacterized protein1.3e-4155.66Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQ------SSSVAVVPYSFRSFGLSSLLRSFCPR----------------------
        MSS  GH        +RTLLLQL  K + S+    +RRR VSEP Q      SSSVA+VP+   ++ +SSLLRS+C R                      
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQ------SSSVAVVPYSFRSFGLSSLLRSFCPR----------------------

Query:  --------AKWIFGSLLSLLVPSWNKWQAFEDEAEKMIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKV
                AKWIFG+LLSLLVP+WNK Q  EDEAE +IEEAENVAEVVEKVAELTEKVS +I EK+ E+SK+KEAAEVVE YSKEIAH A L Q ILHKV
Subjt:  --------AKWIFGSLLSLLVPSWNKWQAFEDEAEKMIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKV

Query:  EEWKQKLDKSKADINEQMKKK
        EEWK K+DKSK D NE  K++
Subjt:  EEWKQKLDKSKADINEQMKKK

A0A6J1CKS7 uncharacterized protein LOC1110122121.9e-4054.84Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYS-----------FRSFG---------------LSSLLRSFCPR-A
        MSS SGHFWS T++     LLQL +K       GG  RR  S PP  SSVA++PY+           FR FG                SS    F P   
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYS-----------FRSFG---------------LSSLLRSFCPR-A

Query:  KWIFGSLLSLLVPSW----NKWQAFEDEAEKMIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQ
        KWIFGSLLSLL+P+W    NK Q  E EAE +IEEAE+VAEVVEK AE+ EK SAEI +K+ E+SK+KEAAEVVE YSK+IAH A L Q ILHKVEEWKQ
Subjt:  KWIFGSLLSLLVPSW----NKWQAFEDEAEKMIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQ

Query:  KLDKSKADINEQMKKKE
        KLDKS+  INEQ++KKE
Subjt:  KLDKSKADINEQMKKKE

A0A6J1GLX4 uncharacterized protein LOC111455450 isoform X11.1e-93100Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
        MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK

Query:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
        MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
Subjt:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE

A0A6J1GN56 uncharacterized protein LOC111455450 isoform X21.1e-93100Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
        MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK

Query:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
        MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE
Subjt:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE

A0A6J1I3G6 uncharacterized protein LOC1114706519.0e-8390.76Show/hide
Query:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK
        MSSNSGHFWSFTILRFR+LLLQLS+KINTSAPAGG RRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRS+CPRAKWIFGSLLSL VPSWNKWQA EDEAEK
Subjt:  MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEK

Query:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKK
         IEEAE+VAEVVEKVAELTEKVSAEIGEK+ E+S++K+AAE VEKYSKEIAH ALLAQHILHKVEEWKQKLDKS+ADINEQMKK
Subjt:  MIEEAENVAEVVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14095.1 unknown protein1.5e-1335.29Show/hide
Query:  LQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWN-----KWQAFEDEAEKMIEEAENVAEVVEKV
        L+LSN +N    AG +     ++     S     ++F S+G            +W+ GS +SL++  WN     K +  E EAE ++E  E VAE+VEKV
Subjt:  LQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWN-----KWQAFEDEAEKMIEEAENVAEVVEKV

Query:  AELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKK
        A  T++++ E+ EK+ E++K+K+ A V+E  S+  AH A L Q  LHKVE+  Q +D  +A I   + KK
Subjt:  AELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCAACTCTGGTCATTTCTGGAGCTTCACAATCCTCAGATTCCGAACCTTGCTTCTTCAACTTTCTAATAAGATAAACACCTCCGCCCCCGCCGGTGGGTTCCG
TCGTCGTCCGGTATCTGAGCCGCCGCAGAGCTCCTCCGTTGCCGTGGTTCCGTACAGCTTCCGTTCCTTTGGTCTGTCGTCCCTCCTCAGATCTTTTTGTCCCAGGGCAA
AATGGATATTTGGGTCCCTATTGTCCCTCTTGGTACCTAGTTGGAATAAATGGCAAGCTTTTGAAGACGAAGCGGAAAAGATGATCGAAGAGGCGGAAAATGTGGCAGAA
GTAGTAGAAAAGGTAGCAGAGTTAACAGAGAAGGTATCAGCAGAAATTGGGGAGAAAGTTGGTGAGGAGAGTAAGGTGAAAGAAGCAGCTGAAGTGGTAGAGAAGTATTC
CAAGGAAATTGCCCATCATGCCCTCCTAGCACAACACATCCTCCACAAGGTGGAAGAATGGAAGCAAAAGCTAGACAAATCAAAGGCAGATATTAATGAACAGATGAAAA
AGAAAGAAGAATAG
mRNA sequenceShow/hide mRNA sequence
GCAACAACAATCCATCATATTTCATCATCTTCAAATCCTCTGACCCTTTACTTTCTTGTGTTCATCCGCCGTCTCGTCTTCGGAGATCCTCACCGGAGAAGCCGCCGCCC
GCCGTCCGTCATGTCGTCCAACTCTGGTCATTTCTGGAGCTTCACAATCCTCAGATTCCGAACCTTGCTTCTTCAACTTTCTAATAAGATAAACACCTCCGCCCCCGCCG
GTGGGTTCCGTCGTCGTCCGGTATCTGAGCCGCCGCAGAGCTCCTCCGTTGCCGTGGTTCCGTACAGCTTCCGTTCCTTTGGTCTGTCGTCCCTCCTCAGATCTTTTTGT
CCCAGGGCAAAATGGATATTTGGGTCCCTATTGTCCCTCTTGGTACCTAGTTGGAATAAATGGCAAGCTTTTGAAGACGAAGCGGAAAAGATGATCGAAGAGGCGGAAAA
TGTGGCAGAAGTAGTAGAAAAGGTAGCAGAGTTAACAGAGAAGGTATCAGCAGAAATTGGGGAGAAAGTTGGTGAGGAGAGTAAGGTGAAAGAAGCAGCTGAAGTGGTAG
AGAAGTATTCCAAGGAAATTGCCCATCATGCCCTCCTAGCACAACACATCCTCCACAAGGTGGAAGAATGGAAGCAAAAGCTAGACAAATCAAAGGCAGATATTAATGAA
CAGATGAAAAAGAAAGAAGAATAGTAAATTAATTCAATCCATAA
Protein sequenceShow/hide protein sequence
MSSNSGHFWSFTILRFRTLLLQLSNKINTSAPAGGFRRRPVSEPPQSSSVAVVPYSFRSFGLSSLLRSFCPRAKWIFGSLLSLLVPSWNKWQAFEDEAEKMIEEAENVAE
VVEKVAELTEKVSAEIGEKVGEESKVKEAAEVVEKYSKEIAHHALLAQHILHKVEEWKQKLDKSKADINEQMKKKEE