; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh17G008570 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh17G008570
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionDUF4050 family protein
Genome locationCma_Chr17:6620593..6627493
RNA-Seq ExpressionCmaCh17G008570
SyntenyCmaCh17G008570
Gene Ontology termsNA
InterPro domainsIPR025124 - Domain of unknown function DUF4050


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575593.1 hypothetical protein SDJN03_26232, partial [Cucurbita argyrosperma subsp. sororia]5.1e-6372.32Show/hide
Query:  FFIGFSFPLQILISCAFFIHGNSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-----------------------
        F + F FP + L    F   G  GCLGC TKPKLKTKLNEPS+GQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ                       
Subjt:  FFIGFSFPLQILISCAFFIHGNSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-----------------------

Query:  --------GFLLWNQTRQRWTGNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
                GFLLWNQTRQRWTGNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
Subjt:  --------GFLLWNQTRQRWTGNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

KAG7014135.1 hypothetical protein SDJN02_24308, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-6378.85Show/hide
Query:  NSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWT
        NSGCLGC TKPKLKTKLNEPS+GQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ                               GFLLWNQTRQRWT
Subjt:  NSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWT

Query:  GNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        GNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
Subjt:  GNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

XP_022953471.1 uncharacterized protein LOC111456011 [Cucurbita moschata]9.6e-6278.57Show/hide
Query:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN
        GCLGCYTKPKLKTKLNEPS+GQ IHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ                               GFLLWNQTRQRWTGN
Subjt:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN

Query:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
Subjt:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

XP_022991286.1 uncharacterized protein LOC111487987 [Cucurbita maxima]3.9e-6379.87Show/hide
Query:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN
        GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ                               GFLLWNQTRQRWTGN
Subjt:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN

Query:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
Subjt:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

XP_023549085.1 uncharacterized protein LOC111807551 [Cucurbita pepo subsp. pepo]2.1e-6178.57Show/hide
Query:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN
        GCLGCYTKPKLKTKLNEPS+GQPIH DGLEKPSRSEDFWTTSTFDVDNSAGQSQ                               GFLLWNQTRQRWTGN
Subjt:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN

Query:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
Subjt:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

TrEMBL top hitse value%identityAlignment
A0A0A0KBV0 Uncharacterized protein1.7e-5168.59Show/hide
Query:  NSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWT
        NSGCLGCYTKPKLKT LNEPS+GQ I C GL KPS SEDFWTTSTFDVDNSAGQSQ                               G LLWNQTRQRWT
Subjt:  NSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWT

Query:  GNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        GN+   + PQFQEPKLD N TYE+LLGSNKPFRQPIPL EMVDFLVDVWEQEGLYD
Subjt:  GNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

A0A1S3CF66 uncharacterized protein LOC103499796 isoform X22.6e-5268.79Show/hide
Query:  NSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWT
        NSGCLGCYTKP+LKT LNEPS+GQ I C G+ KPS SEDFWTTSTFDV+NSAGQSQ                               G LLWNQTRQRWT
Subjt:  NSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWT

Query:  GNR-SENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        GN+ SE RAPQFQEPKLD N TYE+LLGSNKPFRQPIPL EMVDFLVDVWEQEGLYD
Subjt:  GNR-SENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

A0A5D3CFJ1 Uncharacterized protein2.6e-5268.79Show/hide
Query:  NSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWT
        NSGCLGCYTKP+LKT LNEPS+GQ I C G+ KPS SEDFWTTSTFDV+NSAGQSQ                               G LLWNQTRQRWT
Subjt:  NSGCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWT

Query:  GNR-SENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        GN+ SE RAPQFQEPKLD N TYE+LLGSNKPFRQPIPL EMVDFLVDVWEQEGLYD
Subjt:  GNR-SENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

A0A6J1GN31 uncharacterized protein LOC1114560114.6e-6278.57Show/hide
Query:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN
        GCLGCYTKPKLKTKLNEPS+GQ IHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ                               GFLLWNQTRQRWTGN
Subjt:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN

Query:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
Subjt:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

A0A6J1JQB3 uncharacterized protein LOC1114879871.9e-6379.87Show/hide
Query:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN
        GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ                               GFLLWNQTRQRWTGN
Subjt:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ-------------------------------GFLLWNQTRQRWTGN

Query:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
Subjt:  RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15350.1 unknown protein4.8e-2746.45Show/hide
Query:  GCLGCYTKPK--LKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDN----------SAGQ------------------SQGFLLWNQTRQRWTG-N
        GC+GCY + +    +  + PS      C   +KPS SEDFW+TST D+DN          S+ Q                  +QG LLWNQTR+RW G +
Subjt:  GCLGCYTKPK--LKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDN----------SAGQ------------------SQGFLLWNQTRQRWTG-N

Query:  RSENRAPQFQEPKLDCN-ATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        +  N     Q  KL+ N ATY++LLGSNK F QPIPL+EMVDFLVD+WEQEGLYD
Subjt:  RSENRAPQFQEPKLDCN-ATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

AT1G15350.2 unknown protein4.8e-2746.45Show/hide
Query:  GCLGCYTKPK--LKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDN----------SAGQ------------------SQGFLLWNQTRQRWTG-N
        GC+GCY + +    +  + PS      C   +KPS SEDFW+TST D+DN          S+ Q                  +QG LLWNQTR+RW G +
Subjt:  GCLGCYTKPK--LKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDN----------SAGQ------------------SQGFLLWNQTRQRWTG-N

Query:  RSENRAPQFQEPKLDCN-ATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        +  N     Q  KL+ N ATY++LLGSNK F QPIPL+EMVDFLVD+WEQEGLYD
Subjt:  RSENRAPQFQEPKLDCN-ATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

AT3G15770.1 unknown protein4.1e-2643.4Show/hide
Query:  SGCLGCYTKPKLKTKLNEPSEGQ-----PIHCDGLEKPS--RSEDFWTTSTFDVDNSAGQS----------------------------QGFLLWNQTRQ
        S CL C+ K K KT ++ P  G            L KPS   SEDFWT +T D++++A  S                             G +LWNQTRQ
Subjt:  SGCLGCYTKPKLKTKLNEPSEGQ-----PIHCDGLEKPS--RSEDFWTTSTFDVDNSAGQS----------------------------QGFLLWNQTRQ

Query:  RWTGN-RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLY
        +W G+ RSE+R    +EP L+ N TYE+LLGSNK F +PIPL EMV FLV+VWE+EGLY
Subjt:  RWTGN-RSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLY

AT5G25360.1 unknown protein3.4e-3348.67Show/hide
Query:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ---------------------------GFLLWNQTRQRWTGNRSEN
        GC GC  KP L   ++EPS+G  I    ++KPS SEDFW+TST ++DNS  QSQ                           G  LWNQTRQ+W  N +  
Subjt:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ---------------------------GFLLWNQTRQRWTGNRSEN

Query:  RAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        +  + +EP +  NATYE+LLG NK F +PIPL EMVDFLVDVWEQEGLYD
Subjt:  RAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD

AT5G25360.2 unknown protein3.4e-3348.67Show/hide
Query:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ---------------------------GFLLWNQTRQRWTGNRSEN
        GC GC  KP L   ++EPS+G  I    ++KPS SEDFW+TST ++DNS  QSQ                           G  LWNQTRQ+W  N +  
Subjt:  GCLGCYTKPKLKTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQ---------------------------GFLLWNQTRQRWTGNRSEN

Query:  RAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD
        +  + +EP +  NATYE+LLG NK F +PIPL EMVDFLVDVWEQEGLYD
Subjt:  RAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGLYD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGCATATCAGCCATATCATCTATCAGCCTGTGAAACCTGCACGTGGTCAAGTTCATGGGTTCCGTTGGCTCTATGGTACGAACATCTTGACGGAAGGTTGCAGAA
GAATAAGAAGAAGAAGAGAGGCGGCCAGGATAATCGGAGGAGATTTCGATTCCTGAAGGAAGGTCGGAGTGCGATTGACAGTGACGAGCTGCTCTTTGATCTTCTTTCCT
GGAATCGATTTTTCATTGGCTTTTCTTTTCCCCTGCAAATCCTTATCAGTTGCGCGTTTTTCATTCATGGGAACAGTGGTTGTCTTGGATGCTACACAAAGCCGAAACTA
AAAACTAAGCTCAACGAACCGTCGGAAGGTCAACCAATTCATTGTGATGGACTAGAGAAACCCAGCAGATCAGAGGATTTCTGGACTACTAGTACATTTGATGTGGACAA
CAGTGCAGGTCAATCACAAGGCTTTCTTCTGTGGAATCAAACTAGGCAACGCTGGACGGGGAATAGATCCGAAAACCGAGCACCGCAGTTTCAAGAACCCAAGCTAGACT
GCAATGCAACATATGAAAATTTACTAGGGAGCAACAAGCCATTCCGCCAACCAATTCCACTCAGCGAAATGGTAGATTTTCTTGTGGATGTATGGGAACAAGAAGGGTTG
TATGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGCATATCAGCCATATCATCTATCAGCCTGTGAAACCTGCACGTGGTCAAGTTCATGGGTTCCGTTGGCTCTATGGTACGAACATCTTGACGGAAGGTTGCAGAA
GAATAAGAAGAAGAAGAGAGGCGGCCAGGATAATCGGAGGAGATTTCGATTCCTGAAGGAAGGTCGGAGTGCGATTGACAGTGACGAGCTGCTCTTTGATCTTCTTTCCT
GGAATCGATTTTTCATTGGCTTTTCTTTTCCCCTGCAAATCCTTATCAGTTGCGCGTTTTTCATTCATGGGAACAGTGGTTGTCTTGGATGCTACACAAAGCCGAAACTA
AAAACTAAGCTCAACGAACCGTCGGAAGGTCAACCAATTCATTGTGATGGACTAGAGAAACCCAGCAGATCAGAGGATTTCTGGACTACTAGTACATTTGATGTGGACAA
CAGTGCAGGTCAATCACAAGGCTTTCTTCTGTGGAATCAAACTAGGCAACGCTGGACGGGGAATAGATCCGAAAACCGAGCACCGCAGTTTCAAGAACCCAAGCTAGACT
GCAATGCAACATATGAAAATTTACTAGGGAGCAACAAGCCATTCCGCCAACCAATTCCACTCAGCGAAATGGTAGATTTTCTTGTGGATGTATGGGAACAAGAAGGGTTG
TATGATTGATTAATTTGAAGAGTGCACAAGGCTTCATAGTCTCACTGTGAGTCAATGGCTTTATTTGAACCATCAAAATCTGCCTTGTATTGCTGAAACTTCAAATGAAC
ATTTCCCCTGCTGCTGCCGACCCCTCTTTTTGGTGCTTGGTTTGATTTTCTCGTAGTGCCTTGGAGCTTTACATCTTTACTCTGCTGTAGAAAATTCAGCTGTGCTAGAA
GTTACTGTTTTTGCAAATTATTTTTCTTTTCTTTCCTCTAAGTAATGATAAACAAATCTCAACATGAACGAACAGTAAATTAAATTACAAATTTACTCA
Protein sequenceShow/hide protein sequence
MGAYQPYHLSACETCTWSSSWVPLALWYEHLDGRLQKNKKKKRGGQDNRRRFRFLKEGRSAIDSDELLFDLLSWNRFFIGFSFPLQILISCAFFIHGNSGCLGCYTKPKL
KTKLNEPSEGQPIHCDGLEKPSRSEDFWTTSTFDVDNSAGQSQGFLLWNQTRQRWTGNRSENRAPQFQEPKLDCNATYENLLGSNKPFRQPIPLSEMVDFLVDVWEQEGL
YD