CuGenDBv2

CmoCh14G004310 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G004310
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Myb family transcription factor family protein
Genome location: Cmo_Chr14:2043373..2043849
RNA-Seq Expression: CmoCh14G004310
Synteny: CmoCh14G004310
Gene Ontology terms: GO:0005488 - binding (molecular function)
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6580813.1 hypothetical protein SDJN03_20815, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 6.8e-78; identity: 97.52%)
Query:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS---A
        MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS   A
Subjt:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS---A

Query:  ADHLAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
        ADHLAGYLQWLEDRDKEE+LCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
Subjt:  ADHLAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL

XP_022935250.1 uncharacterized protein LOC111442186 [Cucurbita moschata] (e-value: 5.6e-80; identity: 100%)
Query:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADH
        MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADH
Subjt:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADH

Query:  LAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
        LAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
Subjt:  LAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL

XP_022983138.1 uncharacterized protein LOC111481779 [Cucurbita maxima] (e-value: 1.9e-75; identity: 96.88%)
Query:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS--AA
        MKMVSLSP SSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTW DNS  AA
Subjt:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS--AA

Query:  DHLAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
        DHLAGYLQWLEDRDKEEELCRH NEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
Subjt:  DHLAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL

XP_023526429.1 uncharacterized protein LOC111789933 [Cucurbita pepo subsp. pepo] (e-value: 6.4e-76; identity: 95.65%)
Query:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS---A
        MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTW DNS   A
Subjt:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS---A

Query:  ADHLAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
        ADHL GYLQWLE +DKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
Subjt:  ADHLAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL

XP_038906153.1 uncharacterized protein LOC120092033 [Benincasa hispida] (e-value: 3.8e-44; identity: 65.14%)
Query:  MVSLSPSS---SSLQIFPSSSS---------LKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCS-SSNY---HVA
        M SL+ SS   SSLQI PSSSS          KA+LQTLILSLARAISRAKTTA HILKQANHQ AIA KRNK KLL+GSFRLHYNWCS SSNY   HV 
Subjt:  MVSLSPSS---SSLQIFPSSSS---------LKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCS-SSNY---HVA

Query:  PPPLTW------GDNSAADHLAGYLQWLEDRDKEEELCRH--VNEIDKLADIFIARCHEKFRLEKQESYRKFQEM
        PP +TW      G     D L GYL+WLE+R+   ++     VNEIDKLA+IFIAR HEKF+LEKQESYR+FQ+M
Subjt:  PPPLTW------GDNSAADHLAGYLQWLEDRDKEEELCRH--VNEIDKLADIFIARCHEKFRLEKQESYRKFQEM

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LBV5 Uncharacterized protein (e-value: 5.3e-36; identity: 59.78%)
Query:  SPSSSSLQIFPSSSS--------LKALLQTLILSLARAISRAKTTALHILKQANHQSA-IAFKRNKNKLLFGSFRLHYNWCS-SSN--YHVAPPPLT---
        S SSSSLQ+ PS SS         KALLQTLILSLARAISRAKTTA         QSA  A KRNK KLL+GSFRLHYNWCS SSN   HV P  LT   
Subjt:  SPSSSSLQIFPSSSS--------LKALLQTLILSLARAISRAKTTALHILKQANHQSA-IAFKRNKNKLLFGSFRLHYNWCS-SSN--YHVAPPPLT---

Query:  --WGDNSAADHLAGYLQWLEDRD-----------KEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARS
           G     D L GYLQWLE+RD           +++   + VNEIDKLA+IFIARCHEKF+LEKQESYR+FQ+M ARS
Subjt:  --WGDNSAADHLAGYLQWLEDRD-----------KEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARS

A0A1S3B8H1 uncharacterized protein LOC103487158 (e-value: 3.1e-36; identity: 60.22%)
Query:  SLSPSSSSLQIFPSSSS--------LKALLQTLILSLARAISRAKTTALHILKQANHQSA-IAFKRNKNKLLFGSFRLHYNWCS-SSN--YHVAPPPLTW
        S S SSSSLQ+ PS SS         KALLQTLI SLARAISRAKTTA         QSA IA KRNK KLL+GSFRLHYNWCS SSN   HV P  LT+
Subjt:  SLSPSSSSLQIFPSSSS--------LKALLQTLILSLARAISRAKTTALHILKQANHQSA-IAFKRNKNKLLFGSFRLHYNWCS-SSN--YHVAPPPLTW

Query:  ----GDNSAADHLAGYLQWLEDRD------------KEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARS
            G  +  D L GYLQWLE+RD              E+    VNEIDKLA+IFIARCHEKF+LEKQESYR+FQ+M ARS
Subjt:  ----GDNSAADHLAGYLQWLEDRD------------KEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARS

A0A5A7TJT8 Uncharacterized protein (e-value: 3.1e-36; identity: 60.22%)
Query:  SLSPSSSSLQIFPSSSS--------LKALLQTLILSLARAISRAKTTALHILKQANHQSA-IAFKRNKNKLLFGSFRLHYNWCS-SSN--YHVAPPPLTW
        S S SSSSLQ+ PS SS         KALLQTLI SLARAISRAKTTA         QSA IA KRNK KLL+GSFRLHYNWCS SSN   HV P  LT+
Subjt:  SLSPSSSSLQIFPSSSS--------LKALLQTLILSLARAISRAKTTALHILKQANHQSA-IAFKRNKNKLLFGSFRLHYNWCS-SSN--YHVAPPPLTW

Query:  ----GDNSAADHLAGYLQWLEDRD------------KEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARS
            G  +  D L GYLQWLE+RD              E+    VNEIDKLA+IFIARCHEKF+LEKQESYR+FQ+M ARS
Subjt:  ----GDNSAADHLAGYLQWLEDRD------------KEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARS

A0A6J1FA41 uncharacterized protein LOC111442186 (e-value: 2.7e-80; identity: 100%)
Query:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADH
        MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADH
Subjt:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADH

Query:  LAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
        LAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
Subjt:  LAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL

A0A6J1J6X1 uncharacterized protein LOC111481779 (e-value: 9.0e-76; identity: 96.88%)
Query:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS--AA
        MKMVSLSP SSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTW DNS  AA
Subjt:  MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNS--AA

Query:  DHLAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
        DHLAGYLQWLEDRDKEEELCRH NEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
Subjt:  DHLAGYLQWLEDRDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT2G42180.1 unknown protein (e-value: 1.0e-10; identity: 38.04%)
Query:  VSLSPSSSSLQIFPSSSSLKALLQTLI----LSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYH-------------VA
        + L  SSSS      S++LK L   LI      L R++SRA++  + I            K NK +L    F   Y   SS N H               
Subjt:  VSLSPSSSSLQIFPSSSSLKALLQTLI----LSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYH-------------VA

Query:  PPPLTW-GDNSAADHL-AGYLQWLEDR--------DKEEELCRHV--NEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
        P P +  G     D+L + YLQWLE+R        D +    R V  ++ID+LAD FIARCHEKF LEK ESYR+FQ+M ARSL
Subjt:  PPPLTW-GDNSAADHL-AGYLQWLEDR--------DKEEELCRHV--NEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL

AT3G57950.1 unknown protein (e-value: 1.3e-23; identity: 42.7%)
Query:  MKMVSLSPSSSSLQIFPSSSSLKALLQTL----ILSLARAISRAKTTALHILKQANHQS--------AIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPP
        M++ S SPSSSS     SS  LK L+Q L    +    RA+++AK+  L I K  ++               +N+ K+ FGSFRLHYNWCSS   HV P 
Subjt:  MKMVSLSPSSSSLQIFPSSSSLKALLQTL----ILSLARAISRAKTTALHILKQANHQS--------AIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPP

Query:  PLTW--------GDNSAADHLAGYLQWLEDR--DKEEELCRHV-----NEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL
        P  +        G+      L+GYL+WLE +  D  EE+   V     ++ID LAD+FIA CHEKF LEK ESYR+FQEM  R L
Subjt:  PLTW--------GDNSAADHLAGYLQWLEDR--DKEEELCRHV-----NEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL

AT5G06790.1 unknown protein (e-value: 1.4e-20; identity: 41.41%)
Query:  SLSPSSSSLQIFPSSSS--LKALLQTLILS----LARAISRAKTTALHIL--KQANHQSAIAF-------KRNKNKLLFGSFRLHYNWCSSSNYHVAPP-
        S+S S  S    P+SSS  LK+L+QTLI+S    L R ISR  +  + +L  KQ N  S  +        K+ KN +LFGSFRLHYN+CSS    V+ P 
Subjt:  SLSPSSSSLQIFPSSSS--LKALLQTLILS----LARAISRAKTTALHIL--KQANHQSAIAF-------KRNKNKLLFGSFRLHYNWCSSSNYHVAPP-

Query:  -------------PLTW-------------GDNSAADHLAGYLQWLEDRDK---EEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARS
                       TW              D+     L+ YL+ LED+ K   EEE    +NEIDKLAD FIA CHEKF LEK +SYR+ Q    RS
Subjt:  -------------PLTW-------------GDNSAADHLAGYLQWLEDRDK---EEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARS

AT5G47920.1 unknown protein (e-value: 5.3e-04; identity: 41.3%)
Query:  RDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAAR
        +++E E  +  +EID +AD+FI+R H++ +L+K  S++++QEM AR
Subjt:  RDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAAR


Sequences
CDS sequence
ATGAAAATGGTTTCCCTTTCGCCATCTTCTTCTTCGCTTCAAATTTTCCCATCATCTTCATCGCTCAAAGCCCTTTTACAGACGCTAATTCTCTCTCTAGCCCGAGCCAT
TTCTCGAGCCAAAACGACGGCGCTTCACATCCTAAAACAGGCCAACCACCAATCCGCCATAGCTTTCAAGAGGAACAAGAACAAGCTCCTCTTCGGCTCCTTCAGACTCC
ATTACAACTGGTGCTCCTCCTCCAATTATCACGTGGCTCCCCCGCCGCTCACGTGGGGTGACAACTCCGCCGCCGACCACCTTGCTGGGTACTTGCAGTGGCTGGAAGAC
AGAGATAAAGAAGAGGAATTATGTCGCCACGTTAATGAAATTGATAAATTAGCAGATATTTTTATTGCCAGGTGTCATGAGAAATTCAGGCTCGAAAAACAGGAGTCTTA
CCGGAAATTTCAAGAGATGGCGGCAAGAAGCTTATGA
mRNA sequence
ATGAAAATGGTTTCCCTTTCGCCATCTTCTTCTTCGCTTCAAATTTTCCCATCATCTTCATCGCTCAAAGCCCTTTTACAGACGCTAATTCTCTCTCTAGCCCGAGCCAT
TTCTCGAGCCAAAACGACGGCGCTTCACATCCTAAAACAGGCCAACCACCAATCCGCCATAGCTTTCAAGAGGAACAAGAACAAGCTCCTCTTCGGCTCCTTCAGACTCC
ATTACAACTGGTGCTCCTCCTCCAATTATCACGTGGCTCCCCCGCCGCTCACGTGGGGTGACAACTCCGCCGCCGACCACCTTGCTGGGTACTTGCAGTGGCTGGAAGAC
AGAGATAAAGAAGAGGAATTATGTCGCCACGTTAATGAAATTGATAAATTAGCAGATATTTTTATTGCCAGGTGTCATGAGAAATTCAGGCTCGAAAAACAGGAGTCTTA
CCGGAAATTTCAAGAGATGGCGGCAAGAAGCTTATGA
Protein sequence
MKMVSLSPSSSSLQIFPSSSSLKALLQTLILSLARAISRAKTTALHILKQANHQSAIAFKRNKNKLLFGSFRLHYNWCSSSNYHVAPPPLTWGDNSAADHLAGYLQWLED
RDKEEELCRHVNEIDKLADIFIARCHEKFRLEKQESYRKFQEMAARSL