CuGenDBv2

Carg03389 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg03389
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
Genome location: Carg_Chr20:1601143..1601709
RNA-Seq Expression: Carg03389
Synteny: Carg03389
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology
GenBank top hits | E-value | % identity
KAG6570626.1 hypothetical protein SDJN03_29541, partial [Cucurbita argyrosperma subsp. sororia] | 2.6e-92 | 99.47
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSV LNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

KAG7010478.1 hypothetical protein SDJN02_27272, partial [Cucurbita argyrosperma subsp. argyrosperma] | 4.0e-93 | 100
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

XP_022944105.1 uncharacterized protein LOC111448649 [Cucurbita moschata] | 1.2e-92 | 98.94
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLS+ENPNKVAFEYSY+TAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

XP_022986213.1 uncharacterized protein LOC111484029 [Cucurbita maxima] | 1.5e-87 | 95.74
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        M APSRKLRSICIPVLLSVTLL+ISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNL LLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
         PIPAG L ADRTEKMNLTL MMADRLLAKSELFSDA+SGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

XP_023512272.1 uncharacterized protein LOC111777064 [Cucurbita pepo subsp. pepo] | 1.9e-90 | 97.87
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLL+ISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAG LPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCD TIGIGNRSI+DQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
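
The % identity column can be checked against the alignment midlines above. The following is a minimal sketch, assuming the usual BLAST text convention in which the midline repeats a letter for an identical residue, shows `+` for a conservative substitution, and is blank for a mismatch. The function name `identity_from_blast_block` is hypothetical and not part of CuGenDB. Because a mismatch at the first or last position appears only as whitespace, the midline is sliced at the column where the query sequence starts rather than stripped.

```python
def identity_from_blast_block(query_line: str, midline: str) -> tuple[int, int]:
    """Return (identical, aligned) for one Query/midline pair of a BLAST text block.

    Assumption: letters in the midline mark identities; '+' and blanks do not.
    """
    # Split the "Query:" label from the sequence, then locate the sequence column.
    _label, seq = query_line.split(None, 1)
    offset = query_line.index(seq)
    seq = seq.rstrip()
    # Trailing mismatches are blanks that may have been trimmed, so pad back.
    mid = midline[offset:offset + len(seq)].ljust(len(seq))
    identical = sum(ch.isalpha() for ch in mid)
    return identical, len(seq)


def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Combine several Query/midline pairs into one rounded percentage."""
    counts = [identity_from_blast_block(q, m) for q, m in blocks]
    identical = sum(c[0] for c in counts)
    aligned = sum(c[1] for c in counts)
    return round(100 * identical / aligned, 2)


# Short excerpt in the same layout as the first GenBank hit (one blank = one mismatch).
print(identity_from_blast_block("Query:  ARLSVDLNLS", "        ARLSV LNLS"))
```

Applied to both blocks of the KAG6570626.1 alignment, this counting gives 187 identical positions out of 188, which rounds to the listed 99.47.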

TrEMBL top hits | E-value | % identity
A0A5A7SSE6 Putative Harpin-induced 1 | 7.4e-53 | 61.9
Query:  MAAPSRK-LRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELG
        MAAP+ K LR+ CI ++LS+ LL++ +L+LAFT FKP+RP I VDSVSLLDLN++L      VDLNLS+ +DL+VENPNKVAFEYS STAVV YRGE++G
Subjt:  MAAPSRK-LRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELG

Query:  EAPIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        EAPIP G LP   T+KMNLTLT+M +R+L +SE+FSD +SG++ I+   RL+G VKV+GV KIHVVAS+SCDL I + N S  DQ C +
Subjt:  EAPIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

A0A6J1FYG9 uncharacterized protein LOC111448649 | 5.6e-93 | 98.94
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLS+ENPNKVAFEYSY+TAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

A0A6J1G8C1 uncharacterized protein LOC111451800 | 9.4e-64 | 73.4
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        MAA +RK R+ICI VLLS+ LL+I ILILAFT FKPK+PTI VDS+SLLDLNISLDAAR  VDLNL+L++ L+VENPNKVAF++S  TAVVSYRGEE+ E
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIP+G L AD TEKMNLTLTMMADRLLAKSEL SD ++GE+PI+ F RL G V VIGVFKI VVA SSCDLTI I  R++EDQ+C Y
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

A0A6J1JFV2 uncharacterized protein LOC111484029 | 7.1e-88 | 95.74
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        M APSRKLRSICIPVLLSVTLL+ISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNL LLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
         PIPAG L ADRTEKMNLTL MMADRLLAKSELFSDA+SGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

A0A6J1L4F9 uncharacterized protein LOC111499794 | 3.8e-65 | 73.94
Query:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE
        MAA +RK R+ICI VLLS+ +L+I ILILAFT FKPK+PTI VDSVSLLDLNISL+AAR  VDLNL+L++ L+VENPNKVAF++S  TAVVSYRGEE+ E
Subjt:  MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGE

Query:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
        APIP+G L  D TEKMNLTLTMMADRLLAKSELFSD I+GE+PI+ F RL+G + VIGVFKI VVA SSCDLTI I NRS+EDQ+C Y
Subjt:  APIPAGWLPADRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

SwissProt top hits | E-value | % identity
No hits found
Arabidopsis top hits | E-value | % identity
AT1G64450.1 Glycine-rich protein family | 1.2e-07 | 31.01
Query:  MAAPSRKLRS-------ICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSY
        MA P  + RS        C    + + +L++ +L++ FT FKPK P I+V++V L    +S + A      N S    ++V NPN+  F +  S+  + Y
Subjt:  MAAPSRKLRS-------ICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSY

Query:  RGEELGEAPIPAGWLPADRTEKMNLTLTM
         G ++G   IPAG + + R + M  T T+
Subjt:  RGEELGEAPIPAGWLPADRTEKMNLTLTM

AT2G01080.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.1e-08 | 23.47
Query:  AAPSRKLRSICIPVLL---SVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARL---SVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRG
        ++ S  L+  C  + L    + LL+++++++   A KPK+P   +  V+++ + IS  +A L   +  L+L++ +  +  NPNKV   Y  S+  V Y+G
Subjt:  AAPSRKLRSICIPVLL---SVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARL---SVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRG

Query:  EELGEAPIPAGWLPADRTEKMNLTLTMMADRLLA--KSELFSDA-ISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
          LG A +P  +  A  T+ +  T+++    L+    ++L  DA ++  V + +   +   ++V+      V  S +C + I    +++  ++C +
Subjt:  EELGEAPIPAGWLPADRTEKMNLTLTMMADRLLA--KSELFSDA-ISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 2.4e-19 | 34.62
Query:  SICIPVLLSVTLLIIS--ILILAFTAFKPKRPTIAVDSVSL--LDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPIPA
        SIC+    + T LI++  +L L FT F+ K P I ++ V +  LD     +  +L +  N+S+++D+SV+NPN  +F+YS +T  + Y+G  +GEA    
Subjt:  SICIPVLLSVTLLIIS--ILILAFTAFKPKRPTIAVDSVSL--LDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPIPA

Query:  GWLPADRTEKMNLTLTMMADRLLAKSELFSD-AISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKC
        G     RT +MN+T+ +M DR+L+   L  + + SG V +  +TR+ G VK++G+ K HV    +C + + I  ++I+D  C
Subjt:  GWLPADRTEKMNLTLTMMADRLLAKSELFSD-AISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKC

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 1.4e-14 | 28.8
Query:  RKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAV--DSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPI
        R++  I   ++  + ++ ++ LILA   FKPK P +     +V  +  NISL      V LN +L L++ ++NPN   FEY     +V YR   +G   +
Subjt:  RKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAV--DSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPI

Query:  PAGWLPADRTEKMNLTLTMMADRLLAK-SELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKC
        P+  LPA  +  +   L +  D+ +A   ++  D + G++ +    ++ G + ++G+FKI + + S C+L +G  +  +EDQ C
Subjt:  PAGWLPADRTEKMNLTLTMMADRLLAK-SELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKC

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family | 4.5e-42 | 48.31
Query:  ICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPIPAGWLPA
        IC  +LL + L+ I I+ILAFT FKPKRPT  +DSV++  L  S++   L V LNL+L +DLS++NPN++ F Y  S+A+++YRG+ +GEAP+PA  + A
Subjt:  ICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPIPAGWLPA

Query:  DRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
         +T  +N+TLT+MADRLL++++L SD ++G +P+N F +++G V V+ +FKI V +SSSCDL+I + +R++  Q C Y
Subjt:  DRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY


Sequences
CDS sequence
ATGGCGGCTCCCAGTAGGAAACTACGAAGCATTTGCATACCCGTATTGCTCTCTGTAACTCTGCTCATAATTTCGATCCTCATTTTAGCCTTTACTGCTTTCAAGCCCAA
GCGGCCCACCATCGCCGTCGATTCAGTTTCTCTGCTCGATCTGAACATTTCTCTGGACGCCGCCAGGCTGAGCGTCGATCTGAATTTGTCTCTCTTGCTCGATCTCTCCG
TTGAGAATCCGAATAAGGTAGCCTTCGAATACTCCTATAGCACCGCCGTCGTGAGTTACAGAGGCGAAGAACTCGGAGAAGCGCCGATTCCGGCCGGCTGGTTGCCGGCC
GACAGGACTGAGAAAATGAACCTAACATTAACGATGATGGCTGACCGGCTCCTGGCTAAGTCGGAGCTATTCTCCGACGCGATCTCTGGTGAAGTCCCGATCAACATTTT
CACCCGATTGTCCGGGATTGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCCTCGTCGTCTTGTGATCTCACCATCGGCATTGGAAATAGAAGCATTGAAGATC
AGAAATGCCATTACTAG
mRNA sequence
ATGGCGGCTCCCAGTAGGAAACTACGAAGCATTTGCATACCCGTATTGCTCTCTGTAACTCTGCTCATAATTTCGATCCTCATTTTAGCCTTTACTGCTTTCAAGCCCAA
GCGGCCCACCATCGCCGTCGATTCAGTTTCTCTGCTCGATCTGAACATTTCTCTGGACGCCGCCAGGCTGAGCGTCGATCTGAATTTGTCTCTCTTGCTCGATCTCTCCG
TTGAGAATCCGAATAAGGTAGCCTTCGAATACTCCTATAGCACCGCCGTCGTGAGTTACAGAGGCGAAGAACTCGGAGAAGCGCCGATTCCGGCCGGCTGGTTGCCGGCC
GACAGGACTGAGAAAATGAACCTAACATTAACGATGATGGCTGACCGGCTCCTGGCTAAGTCGGAGCTATTCTCCGACGCGATCTCTGGTGAAGTCCCGATCAACATTTT
CACCCGATTGTCCGGGATTGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCCTCGTCGTCTTGTGATCTCACCATCGGCATTGGAAATAGAAGCATTGAAGATC
AGAAATGCCATTACTAG
Protein sequence
MAAPSRKLRSICIPVLLSVTLLIISILILAFTAFKPKRPTIAVDSVSLLDLNISLDAARLSVDLNLSLLLDLSVENPNKVAFEYSYSTAVVSYRGEELGEAPIPAGWLPA
DRTEKMNLTLTMMADRLLAKSELFSDAISGEVPINIFTRLSGIVKVIGVFKIHVVASSSCDLTIGIGNRSIEDQKCHY
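
The three sequence fields and the genome location can be cross-checked with a few lines of code. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and inclusive 1-based coordinates in the `start..end` location string. The helper names `translate` and `span_length` are hypothetical and not part of CuGenDB.

```python
# Standard genetic code (assumed: NCBI translation table 1), with bases in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon; a stop codon is rendered as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def span_length(location: str) -> int:
    """Length in bp of a 'seqid:start..end' location (inclusive coordinates assumed)."""
    _seqid, coords = location.rsplit(":", 1)
    start, end = coords.split("..")
    return int(end) - int(start) + 1


print(translate("ATGGCGGCTCCCAGTAGG"))              # first six codons of the CDS: MAAPSR
print(translate("CATTACTAG"))                       # last three codons of the CDS: HY*
print(span_length("Carg_Chr20:1601143..1601709"))   # 567
```

Under these assumptions, the 567 bp genomic span equals the CDS length listed above, which in turn corresponds to the 188 residues of the protein sequence plus one stop codon.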