CuGenDBv2

CmoCh04G029100 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G029100
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF1997)
Genome location: Cmo_Chr04:20670395..20671424
RNA-Seq Expression: CmoCh04G029100
Synteny: CmoCh04G029100
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997
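
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split into its parts and the span length derived. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end).

    Assumption: start and end are 1-based, inclusive coordinates.
    """
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return match["seqid"], start, end


seqid, start, end = parse_location("Cmo_Chr04:20670395..20671424")
span = end - start + 1  # inclusive span length under the stated assumption
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 1030 bp on Cmo_Chr04.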


Homology
GenBank top hits | e value | % identity | Alignment
KAG6602512.1 hypothetical protein SDJN03_07745, partial [Cucurbita argyrosperma subsp. sororia] | 4.5e-117 | 99.07
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
        MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPG+SEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE

Query:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
        GYPSDVPHNITKVLDLQMTNWELNGI RDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
Subjt:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR

Query:  LVEDYCAFRKEKKK
        LVEDYCAFRKEKKK
Subjt:  LVEDYCAFRKEKKK

KAG7033185.1 hypothetical protein SDJN02_07239, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.0e-117 | 99.53
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
        MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE

Query:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
        GYPSDVPHNITKVLDLQMTNWELNGI RDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
Subjt:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR

Query:  LVEDYCAFRKEKKK
        LVEDYCAFRKEKKK
Subjt:  LVEDYCAFRKEKKK

XP_022961709.1 uncharacterized protein LOC111462397 [Cucurbita moschata] | 2.4e-118 | 100
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
        MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE

Query:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
        GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
Subjt:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR

Query:  LVEDYCAFRKEKKK
        LVEDYCAFRKEKKK
Subjt:  LVEDYCAFRKEKKK

XP_023517004.1 uncharacterized protein LOC111780797 isoform X1 [Cucurbita pepo subsp. pepo] | 5.9e-117 | 98.6
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
        MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPR+VKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE

Query:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
        GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYS+KFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLK MVEDVKLKAMDR
Subjt:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR

Query:  LVEDYCAFRKEKKK
        LVEDYCAFRKEKKK
Subjt:  LVEDYCAFRKEKKK

XP_023517005.1 uncharacterized protein LOC111780797 isoform X2 [Cucurbita pepo subsp. pepo] | 1.3e-111 | 95.79
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
        MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPR+VKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE

Query:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
        GYPSDVPHNITK      TNWELNGIHRDYRPSSANVCSRGAIYS+KFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLK MVEDVKLKAMDR
Subjt:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR

Query:  LVEDYCAFRKEKKK
        LVEDYCAFRKEKKK
Subjt:  LVEDYCAFRKEKKK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KSD5 Uncharacterized protein | 2.2e-77 | 68.37
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTS-G
        +  WKCFA+    QK     NLLSVS  SFSD+PLYE  GKASFD+YLEDKPRLVKATFPGK++QLNQEEWRIETPKI+ LFLKI PTID+KIISKT+ G
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTS-G

Query:  EGYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMD
        E YP  VPH I K+L  QMTNWE+NGIH++YRPSSANVCS G IY +K G RSRLKFQL I+LSF +PDAL FVP DV + I+E  +K MVED+K K + 
Subjt:  EGYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMD

Query:  RLVEDYCAFRKEKKK
        +LVEDY  FR EK+K
Subjt:  RLVEDYCAFRKEKKK

A0A1S4E357 uncharacterized protein LOC103498744 | 2.5e-65 | 67.76
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTS-G
        + +W+CFA+ +SQ+    D NLLSVS  SFSD+ L+E  GKASFD+YLEDKPRL+KATFPGK +QLNQEEWRIETPKI+ LFLKIWPT+D+KIISKT+ G
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTS-G

Query:  EGYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIV
        E YP DVP+ I KVL  +MTNWE+NGI++DYRPSSANVCS G IY EK G RS LKF+L I+LSF +PDAL FVP DV + ++
Subjt:  EGYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIV

A0A6J1C174 uncharacterized protein LOC111006493 isoform X1 | 1.9e-76 | 70.33
Query:  KCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGEGYPS
        K  AVSK+QQ   + QNLLS S+  FSDIPL E  GKASFDQYLEDKPR++KATFPGKS+QLNQEEWRIETPK+E L LKIWP ID+KIISKTSG+ YP 
Subjt:  KCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGEGYPS

Query:  DVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDRLVED
         VPH+ITK+L L+MTNWE+NGIHR+YRPSSANV S+GAIYSEK G  SRLKFQ  +N +F +P AL+F+PKD+F+SI E  LK M+ED+  KA+D+LVED
Subjt:  DVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDRLVED

Query:  YCAFRKEKK
        Y  FRKEKK
Subjt:  YCAFRKEKK

A0A6J1HEU2 uncharacterized protein LOC111462397 | 1.2e-118 | 100
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
        MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE

Query:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
        GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
Subjt:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR

Query:  LVEDYCAFRKEKKK
        LVEDYCAFRKEKKK
Subjt:  LVEDYCAFRKEKKK

A0A6J1JUJ3 uncharacterized protein LOC111487627 | 5.1e-106 | 91.59
Query:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
        MSRWKCFAVSK+          LSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKA FPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE
Subjt:  MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGE

Query:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR
        GYPSDVPHNIT+VLDLQMTNWELNGI RDY PSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTF+PKDVFQSIVE GLK MVEDVKLKAMDR
Subjt:  GYPSDVPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDR

Query:  LVEDYCAFRKEKKK
        LVEDYCAFRKEKKK
Subjt:  LVEDYCAFRKEKKK

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT5G39520.1 Protein of unknown function (DUF1997) | 2.2e-37 | 42.47
Query:  SDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSE--QLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGEGYPSDVPHNITKVLDLQMTNWELNGIHR
        +DI L+E   +A FD+YLEDK R+ +A FP K +  +LN+EEWRI+   I+F FL   P + ++I  K++G+ YPSDVP +ITKVL+L MT WEL G+ R
Subjt:  SDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSE--QLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGEGYPSDVPHNITKVLDLQMTNWELNGIHR

Query:  DYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDRLVEDYCAFRKEKKK
           P+   +  +GA+Y ++ G  +RLK +L+  +SF +P  L  VP+DV +++    L  +V+++K + ++ LV DY  F+ E+KK
Subjt:  DYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDRLVEDYCAFRKEKKK

AT5G39530.1 Protein of unknown function (DUF1997) | 9.6e-41 | 43.68
Query:  SLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGK--SEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGEGYPSDVPHNITKVLDLQMTNWEL
        S R  +DIPL E   +A FD+YLEDK R+ +A FP K  S +LN+EEWRI+   I FLFL +WP +D+++  K++G+ YP DVP +ITKVL+L M  W+L
Subjt:  SLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGK--SEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGEGYPSDVPHNITKVLDLQMTNWEL

Query:  NGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDRLVEDYCAFRKEKK
         G+ R   P+  ++  +GA+Y ++ G  +RL+ QL++N+SF +P  L  VP+DV +++    L  +VE++K K    L+ DY  F+ E+K
Subjt:  NGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDRLVEDYCAFRKEKK
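
In the alignments above, the unlabeled middle line marks identical residues with the residue letter, conservative substitutions with `+`, and other positions with a space. The sketch below shows one way to estimate percent identity from those middle lines, taking identity as identical positions divided by aligned positions. The function name `percent_identity` is hypothetical, and values computed this way from the displayed blocks may differ from the reported % identity column.

```python
def percent_identity(blocks):
    """Estimate percent identity from (query_segment, midline_segment) pairs.

    Each pair is one displayed alignment block with the 'Query:' label and
    leading indentation removed. Letters in the midline count as identities;
    '+' and spaces do not. The aligned length is taken from the query segment,
    so trailing spaces lost from the midline do not matter.
    """
    identical = 0
    aligned = 0
    for query, midline in blocks:
        identical += sum(ch.isalpha() for ch in midline)
        aligned += len(query)
    if aligned == 0:
        raise ValueError("no aligned positions supplied")
    return 100.0 * identical / aligned


# Final block of the XP_022961709.1 alignment (all positions identical).
full_match = percent_identity([("LVEDYCAFRKEKKK", "LVEDYCAFRKEKKK")])

# Final block of the A0A0A0KSD5 alignment (10 identities over 15 positions).
partial_match = percent_identity([("RLVEDYCAFRKEKKK", "+LVEDY  FR EK+K")])

print(full_match, round(partial_match, 2))
```

Passing every block of one hit in a single call gives the identity over the whole displayed alignment rather than over one block.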


Sequences
CDS sequence
ATGAGTAGGTGGAAGTGCTTTGCAGTGTCAAAATCACAGCAGAAACCAAGAAAAGATCAGAACTTGTTATCTGTTTCTTTGAGATCTTTCAGTGACATACCGCTT
TATGAGCCTCAAGGGAAAGCTTCTTTTGATCAGTACTTGGAAGATAAACCCAGATTGGTGAAAGCAACATTTCCAGGAAAAAGTGAACAGCTCAACCAGGAAGAG
TGGAGAATTGAGACACCAAAAATCGAGTTTCTGTTTCTGAAGATATGGCCAACCATTGATATCAAAATCATCAGTAAAACCAGTGGAGAAGGCTACCCATCTGAT
GTTCCTCATAATATCACAAAAGTTCTTGACCTTCAAATGACAAATTGGGAGCTCAATGGGATCCATAGAGACTACAGGCCATCTTCAGCCAATGTTTGTTCTAGA
GGAGCTATTTACAGTGAAAAATTTGGAATCAGAAGCCGCCTTAAGTTTCAACTCCAAATAAATCTCAGCTTCCATATCCCGGACGCTCTGACTTTCGTTCCGAAA
GACGTTTTCCAGAGCATTGTAGAGAAGGGTTTGAAGACAATGGTTGAGGACGTGAAGCTTAAAGCCATGGATAGATTGGTTGAGGATTATTGTGCATTCAGAAAG
GAGAAGAAGAAGTAA
mRNA sequence
AGCCAGGTCTGTGCGTTAAAAATGGCGTTGTTCTTCAGAAATCCCACCTCAAAAACCAAAAAATGAGTAGGTGGAAGTGCTTTGCAGTGTCAAAATCACAGCAGA
AACCAAGAAAAGATCAGAACTTGTTATCTGTTTCTTTGAGATCTTTCAGTGACATACCGCTTTATGAGCCTCAAGGGAAAGCTTCTTTTGATCAGTACTTGGAAG
ATAAACCCAGATTGGTGAAAGCAACATTTCCAGGAAAAAGTGAACAGCTCAACCAGGAAGAGTGGAGAATTGAGACACCAAAAATCGAGTTTCTGTTTCTGAAGA
TATGGCCAACCATTGATATCAAAATCATCAGTAAAACCAGTGGAGAAGGCTACCCATCTGATGTTCCTCATAATATCACAAAAGTTCTTGACCTTCAAATGACAA
ATTGGGAGCTCAATGGGATCCATAGAGACTACAGGCCATCTTCAGCCAATGTTTGTTCTAGAGGAGCTATTTACAGTGAAAAATTTGGAATCAGAAGCCGCCTTA
AGTTTCAACTCCAAATAAATCTCAGCTTCCATATCCCGGACGCTCTGACTTTCGTTCCGAAAGACGTTTTCCAGAGCATTGTAGAGAAGGGTTTGAAGACAATGG
TTGAGGACGTGAAGCTTAAAGCCATGGATAGATTGGTTGAGGATTATTGTGCATTCAGAAAGGAGAAGAAGAAGTAA
Protein sequence
MSRWKCFAVSKSQQKPRKDQNLLSVSLRSFSDIPLYEPQGKASFDQYLEDKPRLVKATFPGKSEQLNQEEWRIETPKIEFLFLKIWPTIDIKIISKTSGEGYPSD
VPHNITKVLDLQMTNWELNGIHRDYRPSSANVCSRGAIYSEKFGIRSRLKFQLQINLSFHIPDALTFVPKDVFQSIVEKGLKTMVEDVKLKAMDRLVEDYCAFRK
EKKK
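
The protein sequence above should correspond to the translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code applies (the record does not name a translation table), the check below translates the first ten and last five codons of the CDS listed above. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS shown in this record.
cds_start = "ATGAGTAGGTGGAAGTGCTTTGCAGTGTCA"
cds_end = "GAGAAGAAGAAGTAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Joining all CDS lines into one string and passing it to `translate` extends the same check to the full sequence.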