CuGenDBv2

CmoCh20G003410 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh20G003410
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome location: Cmo_Chr20:1673781..1674359
RNA-Seq Expression: CmoCh20G003410
Synteny: CmoCh20G003410
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
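The genome location field can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end, length).

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return seqid, start, end, end - start + 1


seqid, start, end, length = parse_location("Cmo_Chr20:1673781..1674359")
print(seqid, start, end, length)
```

Under that assumption the span is 579 bp, which matches a 192-residue protein plus a stop codon (193 codons).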


Homology
GenBank top hits (hit, E-value, % identity, alignment)
KAG6570626.1 hypothetical protein SDJN03_29541, partial [Cucurbita argyrosperma subsp. sororia] | E-value: 3.7e-94 | Identity: 98.44%
Query:  MRTKMAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGE
        MRTKMAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSV LNLSLLLDLS+ENPNKVAFEYSY+TAVVSYRGE
Subjt:  MRTKMAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGE

Query:  ELGEAPIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        ELGEAPIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  ELGEAPIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

KAG7010478.1 hypothetical protein SDJN02_27272, partial [Cucurbita argyrosperma subsp. argyrosperma] | E-value: 1.2e-92 | Identity: 98.94%
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLS+ENPNKVAFEYSY+TAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

XP_022944105.1 uncharacterized protein LOC111448649 [Cucurbita moschata] | E-value: 3.1e-93 | Identity: 100%
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

XP_022986213.1 uncharacterized protein LOC111484029 [Cucurbita maxima] | E-value: 4.4e-87 | Identity: 94.68%
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
        M APSRKLRSICIPVLLSVTLL+ISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNL LLLDLS+ENPNKVAFEYSY+TAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
         PIPAG L ADRTEKMNLTL MMADRLLAKSELFSDA+SGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

XP_023512272.1 uncharacterized protein LOC111777064 [Cucurbita pepo subsp. pepo] | E-value: 5.5e-90 | Identity: 96.81%
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLL+ISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLS+ENPNKVAFEYSY+TAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAG LPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCD TIGIGNRSI+DQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
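The % identity values above can be cross-checked against the alignment midlines. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides identical positions by the alignment length; `percent_identity` is a hypothetical helper, and the fragment is copied from the first GenBank alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Return identical positions / alignment length, as a percentage.

    A position counts as identical when the midline shows a letter;
    '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    identical = sum(1 for m in midline if m.isalpha())
    return 100.0 * identical / len(query)


# 20-residue fragment of the first GenBank alignment (query row and midline).
query_fragment = "LSVDLNLSLLLDLSIENPNK"
midline_fragment = "LSV LNLSLLLDLS+ENPNK"
print(round(percent_identity(query_fragment, midline_fragment), 2))
```

Applied to the full first alignment, the midline shows three non-identical positions across 192 aligned residues, and 189/192 rounds to the reported 98.44%.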

TrEMBL top hits (hit, E-value, % identity, alignment)
A0A5A7SSE6 Putative Harpin-induced 1 | E-value: 3.4e-53 | Identity: 60.73%
Query:  TKMAAPSRK-LRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEE
        +KMAAP+ K LR+ CI ++LS+ LL++ +L+LAFT FKP+RP I VDSVSLLDLN++L      VDLNLS+ +DL++ENPNKVAFEYS +TAVV YRGE+
Subjt:  TKMAAPSRK-LRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEE

Query:  LGEAPIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        +GEAPIP G LP   T+KMNLTLT+M +R+L +SE+FSD +SG++ I+   RL+G VKV+GV KIHVVAS+SCDL I + N S  DQ C +
Subjt:  LGEAPIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

A0A6J1FYG9 uncharacterized protein LOC111448649 | E-value: 1.5e-93 | Identity: 100%
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

A0A6J1G8C1 uncharacterized protein LOC111451800 | E-value: 2.1e-63 | Identity: 72.87%
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
        MAA +RK R+ICI VLLS+ LL+I ILILAFT FKPK+PTI VDS+SLLDLNISLDAAR  VDLNL+L++ L++ENPNKVAF++S  TAVVSYRGEE+ E
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIP+G L AD TEKMNLTLTMMADRLLAKSEL SD ++GE+PI+ F RL G V VIGVFKI VVA SSCDLTI I  R++EDQ+C Y
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

A0A6J1JFV2 uncharacterized protein LOC111484029 | E-value: 2.1e-87 | Identity: 94.68%
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
        M APSRKLRSICIPVLLSVTLL+ISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNL LLLDLS+ENPNKVAFEYSY+TAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
         PIPAG L ADRTEKMNLTL MMADRLLAKSELFSDA+SGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

A0A6J1L4F9 uncharacterized protein LOC111499794 | E-value: 8.7e-65 | Identity: 73.4%
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE
        MAA +RK R+ICI VLLS+ +L+I ILILAFT FKPK+PTI VDSVSLLDLNISL+AAR  VDLNL+L++ L++ENPNKVAF++S  TAVVSYRGEE+ E
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIP+G L  D TEKMNLTLTMMADRLLAKSELFSD I+GE+PI+ F RL+G + VIGVFKI VVA SSCDLTI I NRS+EDQ+C Y
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

SwissProt top hits (hit, E-value, % identity, alignment)
No hits found
Arabidopsis top hits (hit, E-value, % identity, alignment)
AT1G64450.1 Glycine-rich protein family | E-value: 2.8e-07 | Identity: 29.46%
Query:  MAAPSRKLRS-------ICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSY
        MA P  + RS        C    + + +L++ +L++ FT FKPK P I+V++V L    +S + A      N S    +++ NPN+  F +  ++  + Y
Subjt:  MAAPSRKLRS-------ICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSY

Query:  RGEELGEAPIPAGWLPADRTEKMNLTLTM
         G ++G   IPAG + + R + M  T T+
Subjt:  RGEELGEAPIPAGWLPADRTEKMNLTLTM

AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | E-value: 1.9e-08 | Identity: 23%
Query:  RTKMAAPSRKLRSICIPVLL---SVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARL---SVDLNLSLLLDLSIENPNKVAFEYSYTTAVV
        R+  ++ S  L+  C  + L    + LL+++++++   A KPK+P   +  V+++ + IS  +A L   +  L+L++ +  +  NPNKV   Y  ++  V
Subjt:  RTKMAAPSRKLRSICIPVLL---SVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARL---SVDLNLSLLLDLSIENPNKVAFEYSYTTAVV

Query:  SYRGEELGEAPIPAGWLPADRTEKMNLTLTMMADRLLA--KSELFSDA-ISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
         Y+G  LG A +P  +  A  T+ +  T+++    L+    ++L  DA ++  V + +   +   ++V+      V  S +C + I    +++  ++C +
Subjt:  SYRGEELGEAPIPAGWLPADRTEKMNLTLTMMADRLLA--KSELFSDA-ISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | E-value: 1.1e-19 | Identity: 34.62%
Query:  SICIPVLLSVTLLIIS--ILILAFTAFKPKRPTIAVDSVSL--LDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPIPA
        SIC+    + T LI++  +L L FT F+ K P I ++ V +  LD     +  +L +  N+S+++D+S++NPN  +F+YS TT  + Y+G  +GEA    
Subjt:  SICIPVLLSVTLLIIS--ILILAFTAFKPKRPTIAVDSVSL--LDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPIPA

Query:  GWLPADRTEKMNLTLTMMADRLLAKSELFSD-AISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKC
        G     RT +MN+T+ +M DR+L+   L  + + SG V +  +TR+ G VK++G+ K HV    +C + + I  ++I+D  C
Subjt:  GWLPADRTEKMNLTLTMMADRLLAKSELFSD-AISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKC

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | E-value: 6.2e-15 | Identity: 28.8%
Query:  RKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAV--DSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPI
        R++  I   ++  + ++ ++ LILA   FKPK P +     +V  +  NISL      V LN +L L++ ++NPN   FEY     +V YR   +G   +
Subjt:  RKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAV--DSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPI

Query:  PAGWLPADRTEKMNLTLTMMADRLLAK-SELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKC
        P+  LPA  +  +   L +  D+ +A   ++  D + G++ +    ++ G + ++G+FKI + + S C+L +G  +  +EDQ C
Subjt:  PAGWLPADRTEKMNLTLTMMADRLLAK-SELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKC

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | E-value: 6.0e-42 | Identity: 47.75%
Query:  ICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPIPAGWLPA
        IC  +LL + L+ I I+ILAFT FKPKRPT  +DSV++  L  S++   L V LNL+L +DLS++NPN++ F Y  ++A+++YRG+ +GEAP+PA  + A
Subjt:  ICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPIPAGWLPA

Query:  DRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
         +T  +N+TLT+MADRLL++++L SD ++G +P+N F +++G V V+ +FKI V +SSSCDL+I + +R++  Q C Y
Subjt:  DRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY


Sequences
CDS sequence
ATGCGCACTAAAATGGCGGCTCCCAGTAGGAAACTACGAAGCATTTGCATACCCGTATTGCTCTCTGTAACTCTGCTCATAATTTCGATCCTCATTTTAGCCTTTACTGC
TTTCAAGCCCAAGCGGCCCACCATCGCCGTCGATTCAGTTTCTCTGCTCGATCTGAACATTTCTCTGGACGCCGCCAGGCTGAGCGTCGATCTGAATTTGTCTCTCTTGC
TTGATCTCTCCATTGAGAATCCGAATAAGGTAGCCTTCGAATACTCCTATACCACCGCCGTCGTGAGTTACAGAGGCGAAGAACTCGGAGAAGCACCGATTCCGGCCGGC
TGGTTGCCGGCCGACAGGACTGAGAAAATGAACCTAACATTAACGATGATGGCTGACCGGCTCCTGGCTAAGTCGGAGCTATTCTCCGACGCGATCTCTGGTGAAGTCCC
GATCAACATTTTCACCCGATTGTCCGGGATTGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCCTCGTCGTCTTGTGATCTCACCATCGGCATTGGAAACAGAA
GCATTGAAGATCAGAAATGCCATTACTAG
mRNA sequence
ATGCGCACTAAAATGGCGGCTCCCAGTAGGAAACTACGAAGCATTTGCATACCCGTATTGCTCTCTGTAACTCTGCTCATAATTTCGATCCTCATTTTAGCCTTTACTGC
TTTCAAGCCCAAGCGGCCCACCATCGCCGTCGATTCAGTTTCTCTGCTCGATCTGAACATTTCTCTGGACGCCGCCAGGCTGAGCGTCGATCTGAATTTGTCTCTCTTGC
TTGATCTCTCCATTGAGAATCCGAATAAGGTAGCCTTCGAATACTCCTATACCACCGCCGTCGTGAGTTACAGAGGCGAAGAACTCGGAGAAGCACCGATTCCGGCCGGC
TGGTTGCCGGCCGACAGGACTGAGAAAATGAACCTAACATTAACGATGATGGCTGACCGGCTCCTGGCTAAGTCGGAGCTATTCTCCGACGCGATCTCTGGTGAAGTCCC
GATCAACATTTTCACCCGATTGTCCGGGATTGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCCTCGTCGTCTTGTGATCTCACCATCGGCATTGGAAACAGAA
GCATTGAAGATCAGAAATGCCATTACTAG
Protein sequence
MRTKMAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSIENPNKVAFEYSYTTAVVSYRGEELGEAPIPAG
WLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
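The protein sequence should follow from the CDS by codon-wise translation. Below is a minimal sketch, assuming the standard nuclear genetic code (the record does not name a translation table); `translate` is a hypothetical helper, and the two inputs are the first 30 and last 15 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence and drop the trailing stop."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


print(translate("ATGCGCACTAAAATGGCGGCTCCCAGTAGG"))  # start of the CDS
print(translate("AAATGCCATTACTAG"))                 # end of the CDS, with stop
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above; running the full CDS through the same function is a quick consistency check on the record.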