CuGenDBv2

CSPI03G22550 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G22550
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: 11S globulin seed storage protein 2-like
Genome location: Chr3:19130127..19131783
RNA-Seq Expression: CSPI03G22550
Synteny: CSPI03G22550
Gene Ontology terms: GO:0045735 - nutrient reservoir activity (molecular function)
InterPro domains:
IPR006044 - 11-S seed storage protein, plant
IPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold
IPR022379 - 11-S seed storage protein, conserved site
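
The genome location given in the summary can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for records like this one, but not stated on this page). The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr3:19130127..19131783")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 1657 bp on Chr3.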


Homology
GenBank top hits (E-value, % identity, alignment)
KAA0038062.1 11S globulin seed storage protein 2-like [Cucumis melo var. makuwa] (E-value: 2.6e-178; identity: 93%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        MATKVVLGILL LFAFESVVSTHHAPERH FREEAQQCRLDR QARPPSRRIE EGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAY+
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV
        E+GEGFLGLNFPGCNVE YEAQSAQL RSSRRIR+DKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAF+DLNNDDNQLDLRVRSSYLAGGV
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV

Query:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE
        PREARRVSKSDD VNIF+GFNKEFLEEAYNIPSDLA+KMQEERS GLIVKCDEEMSFMTPEE EEELS  PFSRREEDSNGLEETICTA+VQHNMNTQRE
Subjt:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE

Query:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        AD YSREAGRVNILNQLKLPILRFMGMSAEKGHLFP    +LH
Subjt:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
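
In BLAST-style alignments such as the one above, the middle line carries a letter where query and subject are identical, a `+` where the substitution is conservative, and a space otherwise. The sketch below counts these for a single alignment block. The helper name `midline_stats` is hypothetical, and because it covers only one block it is not expected to reproduce the identity reported for the whole hit.

```python
def midline_stats(midline):
    """Return (identities, positives, length) for a BLAST-style midline."""
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    conservative = midline.count("+")                 # '+' marks conservative substitutions
    return identities, identities + conservative, len(midline)


# Midline of the last alignment block of the KAA0038062.1 hit, copied from above
# without the leading indentation.
midline = "AD YSREAGRVNILNQLKLPILRFMGMSAEKGHLFP    +LH"
identities, positives, length = midline_stats(midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```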

XP_004152047.1 11S globulin seed storage protein 2 [Cucumis sativus] (E-value: 1.5e-189; identity: 97.96%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQ RLDR QARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV
        ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV

Query:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE
        PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE
Subjt:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE

Query:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFP    +LH
Subjt:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH

XP_022963932.1 11S globulin seed storage protein 2-like [Cucurbita moschata] (E-value: 2.3e-142; identity: 76.35%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        M T+VVL ILLCL A  S    H   ER  FREEAQQCRLDR ++ PPSRRIE EGGITE+WDEA E+FQCAGVAA RNIIRPN LSLPKFHSSPML YI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSS-----RRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSY
        E+GEGFLGLNFPGC  E YEAQSAQ SR S     RRI   KE+D+HQKVRRVRRGDMIV+PAGTVQWC+ND GQDLV VAF+DLNN+DNQLDLR+R S+
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSS-----RRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSY

Query:  LAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ
        LAGG+PRE RR    SK++D VNI+NGF+++FL +AYN+P+DL R+MQEERS GLIVKCDE+MSF+TPEEEEEELS  P SRREE SNGLEETICTARVQ
Subjt:  LAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ

Query:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        HNMNTQREADVYSREAGR+NILNQLKLPILRFMGMSAEKGHLFP    +LH
Subjt:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH

XP_022967670.1 11S globulin seed storage protein 2-like [Cucurbita maxima] (E-value: 6.0e-143; identity: 76.64%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        MAT+VVL ILLCL A  S    H   ERH FREEAQQCRLDR ++ PPSRRIE EGGITE+WDEA E+FQCAGVAA RNIIRPN LSLPKFHSSPML YI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSS-----RRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSY
        E+GEGFLGLN+PGC  E YEAQSAQ SR S     RRI   KE+D+HQKVRRVRRGDMIV+PAGTVQWC+ND GQDL+ +AF+DLNN+DNQLDLR+R S+
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSS-----RRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSY

Query:  LAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ
        LAGG+PRE RR    SK++D VNIF+GF++EFL +AYN+P+DL RKMQEERS GLIVKCDE+MSF+TPEEEEEELS  P SRREE SNGLEETICTARVQ
Subjt:  LAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ

Query:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF     +LH
Subjt:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH

XP_038889063.1 11S globulin seed storage protein 2-like [Benincasa hispida] (E-value: 6.6e-158; identity: 84.55%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        MATKVV+GILLCL AF+S+VS   APERH FREEAQQCRL+R QAR PSR IE EGGIT+VW+EANEEFQCAGVAAFRNIIRPNSLSLPK+HS+PML YI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV
        E+GEGFLGL+FPGC  E YEAQS   SRSSRRIR DKEEDKHQKVR+V RGDMIVIPAGTVQWC+ND GQDLVVVAFMDLNNDDNQLDLRVRSSYLA GV
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV

Query:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE
        PREARR +K+DD VNIF+GFN EFL EAYNIP DL +KMQEER  GLIVKCDEEMSFMTPEEEEEELS  PFSRREEDSNGLEETICTARVQHNMNTQ+E
Subjt:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE

Query:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF     +LH
Subjt:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0LAB5 Cupin type-1 domain-containing protein (E-value: 1.3e-196; identity: 99.14%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQ RLDR QARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV
        ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV

Query:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE
        PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE
Subjt:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE

Query:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLHHYFIQ
        ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH+YFIQ
Subjt:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLHHYFIQ

A0A5A7T3J7 11S globulin seed storage protein 2-like (E-value: 1.2e-178; identity: 93%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        MATKVVLGILL LFAFESVVSTHHAPERH FREEAQQCRLDR QARPPSRRIE EGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAY+
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV
        E+GEGFLGLNFPGCNVE YEAQSAQL RSSRRIR+DKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAF+DLNNDDNQLDLRVRSSYLAGGV
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGV

Query:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE
        PREARRVSKSDD VNIF+GFNKEFLEEAYNIPSDLA+KMQEERS GLIVKCDEEMSFMTPEE EEELS  PFSRREEDSNGLEETICTA+VQHNMNTQRE
Subjt:  PREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQRE

Query:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        AD YSREAGRVNILNQLKLPILRFMGMSAEKGHLFP    +LH
Subjt:  ADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH

A0A5D3DAK7 11S globulin seed storage protein 2-like (E-value: 1.3e-140; identity: 73.94%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFR--EEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLA
        MA KV+L ILLC FA ES+V+     ER  FR   EAQ C+LDR + RPPSRRIE EGGITE+WDEA+EEFQCAGV A RN IRPNSLSLPKFH++PML 
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFR--EEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLA

Query:  YIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRI--RVD---KEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRS
        YIE+GEGF G+N+PGC  E YE+QSAQ SRS+RR+  R+     EED+HQK+RRVRRGDMIVIPAGTVQWCYND G+DL+ VAF+DLNNDDNQLDLRVR 
Subjt:  YIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRI--RVD---KEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRS

Query:  SYLAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTAR
        S+LAGGVP EARR    SKSD+ VNIFNG ++EFL EA+NIPSDL R+MQEERS GLIVKCDEEMSF+TPEEEEEELS   +SRR  + NG+EETICTAR
Subjt:  SYLAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTAR

Query:  VQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        VQHNMNTQREAD++SREAGRVNILNQLKLPILRF+GMSAEKGHLFP    +LH
Subjt:  VQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH

A0A6J1HGI1 11S globulin seed storage protein 2-like (E-value: 1.1e-142; identity: 76.35%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        M T+VVL ILLCL A  S    H   ER  FREEAQQCRLDR ++ PPSRRIE EGGITE+WDEA E+FQCAGVAA RNIIRPN LSLPKFHSSPML YI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSS-----RRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSY
        E+GEGFLGLNFPGC  E YEAQSAQ SR S     RRI   KE+D+HQKVRRVRRGDMIV+PAGTVQWC+ND GQDLV VAF+DLNN+DNQLDLR+R S+
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSS-----RRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSY

Query:  LAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ
        LAGG+PRE RR    SK++D VNI+NGF+++FL +AYN+P+DL R+MQEERS GLIVKCDE+MSF+TPEEEEEELS  P SRREE SNGLEETICTARVQ
Subjt:  LAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ

Query:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        HNMNTQREADVYSREAGR+NILNQLKLPILRFMGMSAEKGHLFP    +LH
Subjt:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH

A0A6J1HV45 11S globulin seed storage protein 2-like (E-value: 2.9e-143; identity: 76.64%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        MAT+VVL ILLCL A  S    H   ERH FREEAQQCRLDR ++ PPSRRIE EGGITE+WDEA E+FQCAGVAA RNIIRPN LSLPKFHSSPML YI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSS-----RRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSY
        E+GEGFLGLN+PGC  E YEAQSAQ SR S     RRI   KE+D+HQKVRRVRRGDMIV+PAGTVQWC+ND GQDL+ +AF+DLNN+DNQLDLR+R S+
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSS-----RRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSY

Query:  LAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ
        LAGG+PRE RR    SK++D VNIF+GF++EFL +AYN+P+DL RKMQEERS GLIVKCDE+MSF+TPEEEEEELS  P SRREE SNGLEETICTARVQ
Subjt:  LAGGVPREARRV---SKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ

Query:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH
        HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF     +LH
Subjt:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVKHTHLH

SwissProt top hits (E-value, % identity, alignment)
A0A1L6K371 11S globulin (E-value: 6.8e-57; identity: 37.27%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        MA  ++L I LCL A   +V+   A      +    +C+L R  A  PS RIE E G+ E WD  N++FQCAGVA  R  I PN L LP++ ++P L YI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIR-VDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAG-
         +G G  G+ FPGC     E+Q  Q SR    +R    + D+HQK+R  R GD+I  PAG   WCYND    +V VA MD  N+ NQLD   R+ YLAG 
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIR-VDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAG-

Query:  --------------------------GVPREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERS-GGLIVKCD-EEMSFMTP-----EEEEE
                                  G P + +R S +    N+F+GF+ +FL +A+N+ ++ AR++Q E      IV+ +  ++  + P     E+E E
Subjt:  --------------------------GVPREARRVSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERS-GGLIVKCD-EEMSFMTP-----EEEEE

Query:  ELSALPFSRREE-----------DSNGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF
        E       R  E           D NGLEETICT R++ N+     AD+Y+ EAGR++  N   LP+LR++ +SAE+G L+
Subjt:  ELSALPFSRREE-----------DSNGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF

B5KVH4 11S globulin seed storage protein 1 (E-value: 1.6e-58; identity: 36.41%)
Query:  MATKVVLGILLCLF---AFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPML
        MA  ++L I LCL     F   ++     ++H F     QC+L+R  A  P+ RIE E G+ E WD  +++ QCAGVA  R  I PN L LP + ++P L
Subjt:  MATKVVLGILLCLF---AFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPML

Query:  AYIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLA
         YI RG G  G+ FPGC  E +E    Q  +  RR   + ++D+HQK+R  R GD+I  PAG   WCYND    +V +  +D +N+ NQLD   R+ YLA
Subjt:  AYIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLA

Query:  GGVPREARRVSKS-----------------------DDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERS-GGLIVKCD-EEMSFMTP-----EEEEEEL
        G    E R   +                        D   N+F+GF+ EFL +A+N+ ++ AR++Q E    G IV+ +  ++  + P     E+E EE 
Subjt:  GGVPREARRVSKS-----------------------DDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERS-GGLIVKCD-EEMSFMTP-----EEEEEEL

Query:  SALPFSRREE-----------DSNGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF
              R  E           D NGLEETICT  ++ N+     AD+Y+ EAGR++ +N   LPILR++ +SAE+G L+
Subjt:  SALPFSRREE-----------DSNGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF

P13744 11S globulin subunit beta (E-value: 4.0e-57; identity: 41.9%)
Query:  EEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRR
        +  + CRL+  +A+ P RR E E   TEVWD+ N+EFQCAGV   R+ IRP  L LP F ++P L ++ +G G  G+  PGC     E     L RS   
Subjt:  EEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRR

Query:  IRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAG-------GV---PREARRVSKSDDFVNIFNGFNK
            K  D+HQK+R  R GD++V+PAG   W YN    DLV++ F D  N  NQ+D  +R  YLAG       GV    R +R+ S  +   NIF+GF  
Subjt:  IRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAG-------GV---PREARRVSKSDDFVNIFNGFNK

Query:  EFLEEAYNIPSDLARKMQ-EERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDS-NGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLP
        EFLEEA+ I   L RK++ E+     IV+ DE+   + PE++EEE S   +   E +S NGLEETICT R++ N+     ADV++   GR++  N   LP
Subjt:  EFLEEAYNIPSDLARKMQ-EERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDS-NGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLP

Query:  ILRFMGMSAEKGHLF
        ILR + +SAE+G L+
Subjt:  ILRFMGMSAEKGHLF

Q8GZP6 11S globulin seed storage protein Ana o 2.0101 (Fragment) (E-value: 2.3e-60; identity: 38.08%)
Query:  APERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGCNVEEYEA--Q
        A  + W  ++  +C++DR  A  P  R+E+E G  E WD  +E+F+CAGVA  R+ I+PN L LP++ ++P L Y+ +GEG  G+++PGC  E Y+A  Q
Subjt:  APERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGCNVEEYEA--Q

Query:  SAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPREA--RRVSKSDDFVNIFNGF
          Q  +S R       +D+HQK+RR RRGD+I IPAG   WCYN+    +V V  +D++N  NQLD   R  +LAG  P++   ++        N+F+GF
Subjt:  SAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPREA--RRVSKSDDFVNIFNGF

Query:  NKEFLEEAYNIPSDLARKMQ-EERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREED-------SNGLEETICTARVQHNMNTQREADVYSREAGRVN
        + E L EA+ +   L ++++ E+  GG++   D+E+  + P   + E  +      E++        NG+EETICT R++ N+N    AD+Y+ E GR+ 
Subjt:  NKEFLEEAYNIPSDLARKMQ-EERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREED-------SNGLEETICTARVQHNMNTQREADVYSREAGRVN

Query:  ILNQLKLPILRFMGMSAEKGHLF
         LN L LPIL+++ +S EKG L+
Subjt:  ILNQLKLPILRFMGMSAEKGHLF

Q9XHP0 11S globulin seed storage protein 2 (E-value: 7.7e-77; identity: 45.48%)
Query:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI
        +A K +L + L L     +VS   A  R     + QQCR  R     PS RI+ EGG TE+WDE  E+FQCAG+ A R+ IRPN LSLP +H SP L YI
Subjt:  MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYI

Query:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDK------HQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSS
        ERG+G + +  PGC  E Y+   +Q  R+  R    +++D+      HQKV R+R+GD++ IP+G   WCYND  +DLV V+  D+N+  NQLD + R+ 
Subjt:  ERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDK------HQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSS

Query:  YLAGGVPREARRVSKS-DDFVNIFNGFNKEFLEEAYNIPSDLARKMQ-EERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ
        YLAGGVPR   +  ++   F NIF  F+ E L EA+N+P +  R+MQ EE   GLIV   E M+F+ P+EEE E       R  +  NGLEET CT + +
Subjt:  YLAGGVPREARRVSKS-DDFVNIFNGFNKEFLEEAYNIPSDLARKMQ-EERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQ

Query:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF
         N+ ++READ++SR+AGRV+++++ KLPIL++M +SAEKG+L+
Subjt:  HNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLF

Arabidopsis top hits (E-value, % identity, alignment)
AT1G03880.1 cruciferin 2 (E-value: 3.0e-44; identity: 35.53%)
Query:  QCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVD
        +C+LD+  A  PS+ I+ EGG  EVWD    + +C+G A  R +I P  L LP F ++  L ++  G G +G   PGC   E   +S        + +  
Subjt:  QCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVD

Query:  KEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPR-----EARRVSKSDDFVNIFNGFNKEFLEEAYNI
           D HQKV  +R GD I  P+G  QW YN+  + L++VA  DL ++ NQLD  +R   +AG  P+     + R+  K +   NIFNGF  E L +A+ I
Subjt:  KEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPR-----EARRVSKSDDFVNIFNGFNKEFLEEAYNI

Query:  PSDLARKMQEERSG-GLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAE
          + A+++Q ++   G IVK +     + P     E    P     E +NGLEET+CT R   N++   +ADVY    G ++ LN   LPILR + +SA 
Subjt:  PSDLARKMQEERSG-GLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAE

Query:  KGHL
        +G +
Subjt:  KGHL

AT1G03890.1 RmlC-like cupins superfamily protein (E-value: 1.2e-45; identity: 33.03%)
Query:  LFAFESVVST------HHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGF
        LF+  SVVS       H A  R         C   +  +  P++  ++E G  EVWD  + E +CAGV   R  ++PNS+ LP F S P LAY+ +GEG 
Subjt:  LFAFESVVST------HHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGF

Query:  LGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPREARR
        +G    GC  E +        R        + ED HQK+   RRGD+    AG  QW YN    D V+V  +D+ N +NQLD   R   LAG   +E  +
Subjt:  LGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPREARR

Query:  VSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSG-GLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQREADVYS
                N F+GF+   + EA+ I  + A+++Q ++   G I++ +  + F+ P   E +   +        +NG+EET CTA++  N++    +D +S
Subjt:  VSKSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSG-GLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQREADVYS

Query:  REAGRVNILNQLKLPILRFMGMSAEKGHLF
          AGR++ LN L LP+LR + ++A +G+L+
Subjt:  REAGRVNILNQLKLPILRFMGMSAEKGHLF

AT4G28520.2 cruciferin 3 (E-value: 1.1e-33; identity: 28.69%)
Query:  QCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGC----------------------
        +C LD       +  I+ E G  E WD  + + +C GV+  R +I    L LP F +SP ++Y+ +G G  G   PGC                      
Subjt:  QCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGC----------------------

Query:  -----NVEEYEAQSAQLSRSSRRIRVDKEE-------------------------DKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDD
                E + Q  Q  R  +      ++                         D HQKV  VRRGD+     G+  W YN   Q LV++A +D+ N  
Subjt:  -----NVEEYEAQSAQLSRSSRRIRVDKEE-------------------------DKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDD

Query:  NQLDLRVRSSYLAGGVPREARRVS-KSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEER-SGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGL
        NQLD   R  +LAG   +     S +  +  N+++GF+ + + +A  I   LA+++Q ++ S G IV+       + P   +   S      R    NGL
Subjt:  NQLDLRVRSSYLAGGVPREARRVS-KSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEER-SGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGL

Query:  EETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHL
        EETIC+ R   N++    ADVY    GRV  +N   LPIL ++ +SA +G L
Subjt:  EETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHL

AT4G28520.4 cruciferin 3 (E-value: 3.7e-34; identity: 28.93%)
Query:  QCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGC----------------------
        +C LD       +  I+ E G  E WD  + + +C GV+  R +I    L LP F +SP ++Y+ +G G  G   PGC                      
Subjt:  QCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGC----------------------

Query:  -----NVEEYEAQSAQLSRSSRRIRVDKEE-------------------------DKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDD
                E + Q  Q  R  +      ++                         D HQKV  VRRGD+     G+  W YN   Q LV++A +D+ N  
Subjt:  -----NVEEYEAQSAQLSRSSRRIRVDKEE-------------------------DKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDD

Query:  NQLDLRVRSSYLAGGVPREARRVS-KSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEER-SGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGL
        NQLD   R  +LAG   +     S +  +  N+++GF+ + + +A  I   LA+++Q ++ S G IV+       + P   +   S      R    NGL
Subjt:  NQLDLRVRSSYLAGGVPREARRVS-KSDDFVNIFNGFNKEFLEEAYNIPSDLARKMQEER-SGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGL

Query:  EETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVK
        EETIC+ R   N++    ADVY    GRV  +N   LPIL ++ +SA +G L  VK
Subjt:  EETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFMGMSAEKGHLFPVK

AT5G44120.3 RmlC-like cupins superfamily protein (E-value: 1.0e-44; identity: 35.92%)
Query:  QCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVD
        +C+LD+  A  PS  ++ E G  EVWD    + +C+GV+  R II    L LP F ++  L+++ +G G +G   PGC  E ++  S    R   + +  
Subjt:  QCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEGFLGLNFPGCNVEEYEAQSAQLSRSSRRIRVD

Query:  KEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPREA--RRVSKSDDFVNIFNGFNKEFLEEAYNIPSD
        +  D HQKV  +R GD I    G  QW YND  + LV+V+  DL +  NQLD   R  YLAG  P+     +  +     NIFNGF  E + +A  I   
Subjt:  KEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPREA--RRVSKSDDFVNIFNGFNKEFLEEAYNIPSD

Query:  LARKMQ-EERSGGLIVKCDEEMSFMTP--------EEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFM
         A+++Q ++ + G IV+       + P        EEEEEE       R     NGLEETIC+AR   N++    ADVY  + G ++ LN   LPILRF+
Subjt:  LARKMQ-EERSGGLIVKCDEEMSFMTP--------EEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQREADVYSREAGRVNILNQLKLPILRFM

Query:  GMSAEKGHL
         +SA +G +
Subjt:  GMSAEKGHL


Sequences
CDS sequence
ATGGCTACCAAAGTTGTACTGGGGATTTTGCTGTGTTTGTTTGCATTTGAGTCGGTGGTGAGCACTCATCACGCACCGGAGAGGCACTGGTTCAGGGAAGAAGCT
CAGCAATGCAGGCTGGACAGGACTCAGGCGAGGCCACCATCGCGTCGTATCGAGTGGGAGGGAGGTATCACTGAGGTTTGGGATGAAGCTAATGAAGAGTTTCAG
TGTGCTGGAGTTGCTGCCTTTAGAAACATCATAAGGCCCAACTCTCTCTCTTTGCCTAAGTTCCATAGCTCCCCAATGCTTGCTTACATTGAGCGAGGTGAAGGT
TTCTTGGGCCTGAACTTCCCAGGGTGTAATGTAGAGGAGTACGAGGCACAATCAGCACAACTTTCAAGGTCATCGAGGCGAATACGCGTGGACAAAGAGGAAGAC
AAACACCAAAAGGTTCGCAGGGTCCGTCGAGGTGACATGATAGTCATCCCCGCTGGCACTGTCCAATGGTGCTACAACGACTGTGGCCAAGACCTTGTAGTCGTT
GCCTTCATGGATCTCAACAACGACGACAACCAACTCGACCTCCGTGTTAGGTCCTCCTACTTGGCTGGTGGAGTTCCAAGAGAAGCAAGAAGAGTATCAAAATCA
GATGATTTTGTGAACATCTTCAATGGGTTCAATAAGGAGTTTCTTGAAGAGGCATACAACATTCCATCAGACTTGGCAAGGAAGATGCAAGAAGAAAGAAGCGGT
GGTTTGATTGTGAAGTGTGATGAAGAAATGTCGTTTATGACGCCAGAGGAGGAGGAGGAAGAATTGAGCGCCTTGCCATTTTCAAGAAGGGAAGAGGACTCAAAT
GGACTGGAAGAAACTATTTGCACTGCTAGAGTTCAACACAACATGAACACACAAAGAGAAGCAGATGTATACTCTAGAGAAGCTGGTAGAGTTAACATTTTGAAT
CAACTCAAGCTTCCTATTCTAAGATTCATGGGCATGAGTGCCGAGAAAGGACACCTTTTTCCGGTAAAACATACTCATTTACACCATTACTTTATACAATAG
mRNA sequence
ATGGCTACCAAAGTTGTACTGGGGATTTTGCTGTGTTTGTTTGCATTTGAGTCGGTGGTGAGCACTCATCACGCACCGGAGAGGCACTGGTTCAGGGAAGAAGCT
CAGCAATGCAGGCTGGACAGGACTCAGGCGAGGCCACCATCGCGTCGTATCGAGTGGGAGGGAGGTATCACTGAGGTTTGGGATGAAGCTAATGAAGAGTTTCAG
TGTGCTGGAGTTGCTGCCTTTAGAAACATCATAAGGCCCAACTCTCTCTCTTTGCCTAAGTTCCATAGCTCCCCAATGCTTGCTTACATTGAGCGAGGTGAAGGT
TTCTTGGGCCTGAACTTCCCAGGGTGTAATGTAGAGGAGTACGAGGCACAATCAGCACAACTTTCAAGGTCATCGAGGCGAATACGCGTGGACAAAGAGGAAGAC
AAACACCAAAAGGTTCGCAGGGTCCGTCGAGGTGACATGATAGTCATCCCCGCTGGCACTGTCCAATGGTGCTACAACGACTGTGGCCAAGACCTTGTAGTCGTT
GCCTTCATGGATCTCAACAACGACGACAACCAACTCGACCTCCGTGTTAGGTCCTCCTACTTGGCTGGTGGAGTTCCAAGAGAAGCAAGAAGAGTATCAAAATCA
GATGATTTTGTGAACATCTTCAATGGGTTCAATAAGGAGTTTCTTGAAGAGGCATACAACATTCCATCAGACTTGGCAAGGAAGATGCAAGAAGAAAGAAGCGGT
GGTTTGATTGTGAAGTGTGATGAAGAAATGTCGTTTATGACGCCAGAGGAGGAGGAGGAAGAATTGAGCGCCTTGCCATTTTCAAGAAGGGAAGAGGACTCAAAT
GGACTGGAAGAAACTATTTGCACTGCTAGAGTTCAACACAACATGAACACACAAAGAGAAGCAGATGTATACTCTAGAGAAGCTGGTAGAGTTAACATTTTGAAT
CAACTCAAGCTTCCTATTCTAAGATTCATGGGCATGAGTGCCGAGAAAGGACACCTTTTTCCGGTAAAACATACTCATTTACACCATTACTTTATACAATAG
Protein sequence
MATKVVLGILLCLFAFESVVSTHHAPERHWFREEAQQCRLDRTQARPPSRRIEWEGGITEVWDEANEEFQCAGVAAFRNIIRPNSLSLPKFHSSPMLAYIERGEG
FLGLNFPGCNVEEYEAQSAQLSRSSRRIRVDKEEDKHQKVRRVRRGDMIVIPAGTVQWCYNDCGQDLVVVAFMDLNNDDNQLDLRVRSSYLAGGVPREARRVSKS
DDFVNIFNGFNKEFLEEAYNIPSDLARKMQEERSGGLIVKCDEEMSFMTPEEEEEELSALPFSRREEDSNGLEETICTARVQHNMNTQREADVYSREAGRVNILN
QLKLPILRFMGMSAEKGHLFPVKHTHLHHYFIQ
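
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The sketch below does this for the final CDS line only, which is 102 nt and happens to correspond to the final protein line. It assumes the standard nuclear genetic code applies, and the names `CODON_TABLE` and `translate` are illustrative rather than taken from any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Last line of the CDS sequence and last line of the protein sequence, copied from above.
last_cds_line = (
    "CAACTCAAGCTTCCTATTCTAAGATTCATGGGCATGAGTGCCGAGAAAGGACACCTTTTT"
    "CCGGTAAAACATACTCATTTACACCATTACTTTATACAATAG"
)
last_protein_line = "QLKLPILRFMGMSAEKGHLFPVKHTHLHHYFIQ"

# The CDS ends with a stop codon, which the protein listing omits.
print(translate(last_cds_line) == last_protein_line + "*")
```

Concatenating all ten CDS lines and calling `translate` on the result should give the full protein in the same way.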