; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037849 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037849
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Description11S globulin seed storage protein 2-like
Genome locationscaffold12:42430755..42433706
RNA-Seq ExpressionSpg037849
SyntenySpg037849
Gene Ontology termsGO:0045735 - nutrient reservoir activity (molecular function)
InterPro domainsIPR006044 - 11-S seed storage protein, plant
IPR006045 - Cupin 1
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold
IPR022379 - 11-S seed storage protein, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571523.1 hypothetical protein SDJN03_28251, partial [Cucurbita argyrosperma subsp. sororia]1.4e-15080.52Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        MAT+VVLAILLC V      S+    ERR FREEAQQCRLDRI++ PPSRRIESEGGITELWDEA+E+ QCAGVAAIRN IRPNCLSLPKFHS+PMLIYI
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGF+GLNFPGCAETYEAQSAQSSRRSSRRMGR IGA +E+DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAF+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ
        AGG+PRE  R  RGS+ ++DLVN+++GFDQ+ LA+AYN+P++L R+M+EE+S GLIVKCDE MSFLTPEE+EEELS  S SR +  SNGLEETICTARVQ
Subjt:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ

Query:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP
        HNMNTQREADVY REAGR+NILNQ KLPILRFM MSAEKGHLFP
Subjt:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP

XP_022963932.1 11S globulin seed storage protein 2-like [Cucurbita moschata]5.4e-15080.23Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        M T+VVLAILLC V      S+    ERR FREEAQQCRLDRI++ PPSRRIESEGGITELWDEA+E+ QCAGVAAIRN IRPNCLSLPKFHS+PMLIYI
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGF+GLNFPGCAETYEAQSAQSSRRSSRRMGR IGA +E+DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAF+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ
        AGG+PRE  R  RGS+ ++DLVN+++GFDQ+ LA+AYN+P++L R+M+EE+S GLIVKCDE MSFLTPEE+EEELS  S SR +  SNGLEETICTARVQ
Subjt:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ

Query:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP
        HNMNTQREADVY REAGR+NILNQ KLPILRFM MSAEKGHLFP
Subjt:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP

XP_022967670.1 11S globulin seed storage protein 2-like [Cucurbita maxima]3.2e-15080.47Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        MAT+VVLAILLC V      S+    ER  FREEAQQCRLDRI++ PPSRRIESEGGITELWDEA+E+ QCAGVAA+RN IRPNCLSLPKFHS+PMLIYI
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGF+GLN+PGCAETYEAQSAQSSRRSSRRMGR IGA +E+DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDL+A+AF+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ
        AGG+PRE  R  RGS+ ++DLVN+FSGFDQE LA+AYN+P++L RKM+EE+S GLIVKCDE MSFLTPEE+EEELS  S SR +  SNGLEETICTARVQ
Subjt:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ

Query:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF
        HNMNTQREADVY REAGRVNILNQ KLPILRFM MSAEKGHLF
Subjt:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF

XP_023532597.1 11S globulin seed storage protein 2-like [Cucurbita pepo subsp. pepo]4.9e-15179.43Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        MATK+VLAILLCF V SSLVSAQ    RR FREEAQQCRLDR+QARPPSRRIESEGGI+E+WDE++EE QCAGVAA+R+ IRPN L++P F S+PMLIY+
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGFMGLNFPGCAETYEAQS+QSSRRSSRR+GR +GA +EEDQHQKVRRVRRGDMIVVPAGTV+WCHNDGGQDLV V+F+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAIR-----GE-RGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETI
        AGG+PREA+R     GE RGSRSS+DLVN+F GFDQELLAEAYNIPS+LARK++E++S GLIVKC+EDMSFLTPEE+EEE SAS S      SNGLEETI
Subjt:  AGGVPREAIR-----GE-RGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETI

Query:  CTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP
        CTARVQHNMNTQ+EADVY RE+GR+NILN+ KLPIL++MDMSAEKGHLFP
Subjt:  CTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP

XP_023553732.1 11S globulin seed storage protein 2-like [Cucurbita pepo subsp. pepo]1.2e-14979.94Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        MAT+VVLAILLC V      S+    ERR FREEAQQCRLDRI++ PPSRRIESEGGITELWDEA+E+ QCAGVAAIRN IRPNCLSLPKFHS+PMLIYI
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGF+GLN+PGCAETYEAQ AQSSRRSSRRMGR IGA +E+DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAF+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ
        AGG+PRE  R  RGS+ ++DLVN+++GFDQ+ LA+AYN+P++L R+M+EE+S GLIVKCDE MSFLTPEE+EEELS  S SR +  SNGLEETICTARVQ
Subjt:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ

Query:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP
        HNMNTQREADVY REAGR+NILNQ KLPILRFM MSAEKGHLFP
Subjt:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP

TrEMBL top hitse value%identityAlignment
A0A5D3DAK7 11S globulin seed storage protein 2-like2.3e-14677.52Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFR--EEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLI
        MA KV+LAILLCF    SLV+AQD +ERR FR   EAQ C+LDRI+ RPPSRRIESEGGITELWDEADEE QCAGV AIRNTIRPN LSLPKFH+APML+
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFR--EEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLI

Query:  YIERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADR-EEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRT
        YIE+GEGF G+N+PGCAETYE+QSAQSS RS+RRMGR IGA R EEDQHQK+RRVRRGDMIV+PAGTVQWC+NDGG+DL+AVAF+DLNN+DNQLDLR+R 
Subjt:  YIERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADR-EEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRT

Query:  SYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTA
        S+LAGGVP EA R  RGS+ SD+LVN+F+G DQE L+EA+NIPS+L R+M+EE+S GLIVKCDE+MSFLTPEE+EEELS +S SR + + NG+EETICTA
Subjt:  SYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTA

Query:  RVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP
        RVQHNMNTQREAD++ REAGRVNILNQ KLPILRF+ MSAEKGHLFP
Subjt:  RVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP

A0A6J1EX10 11S globulin seed storage protein 2-like7.7e-15078.86Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        MATK+VLAILLCF V SSLVSAQ   ERR FREEAQQCRLDR+QARPPSRRIESEGGI+E+WDE++EE QCAGVAA+R+ IRPN L++P F S+PMLIY+
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGF+GLNFPGCAETYEAQS+QSSRRSSRR+GR +GA +EEDQHQKVRRVRRGDMIVVPAGTV+WCHNDGGQDLV V+F+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAI------RGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETI
        AGG+PREA+      R  RGSRSS+DLVN+F GFDQELLAEAYNIPS+LARK++E++S GLIVKC+EDMSFLTPEE+EEE SAS S      SNGLEETI
Subjt:  AGGVPREAI------RGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETI

Query:  CTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP
        CTARVQHNMNTQ+EADVY RE+GR+NILN+ KLPIL++MDMSAEKGHLFP
Subjt:  CTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP

A0A6J1HGI1 11S globulin seed storage protein 2-like2.6e-15080.23Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        M T+VVLAILLC V      S+    ERR FREEAQQCRLDRI++ PPSRRIESEGGITELWDEA+E+ QCAGVAAIRN IRPNCLSLPKFHS+PMLIYI
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGF+GLNFPGCAETYEAQSAQSSRRSSRRMGR IGA +E+DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAF+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ
        AGG+PRE  R  RGS+ ++DLVN+++GFDQ+ LA+AYN+P++L R+M+EE+S GLIVKCDE MSFLTPEE+EEELS  S SR +  SNGLEETICTARVQ
Subjt:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ

Query:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP
        HNMNTQREADVY REAGR+NILNQ KLPILRFM MSAEKGHLFP
Subjt:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP

A0A6J1HV45 11S globulin seed storage protein 2-like1.5e-15080.47Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        MAT+VVLAILLC V      S+    ER  FREEAQQCRLDRI++ PPSRRIESEGGITELWDEA+E+ QCAGVAA+RN IRPNCLSLPKFHS+PMLIYI
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGF+GLN+PGCAETYEAQSAQSSRRSSRRMGR IGA +E+DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDL+A+AF+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ
        AGG+PRE  R  RGS+ ++DLVN+FSGFDQE LA+AYN+P++L RKM+EE+S GLIVKCDE MSFLTPEE+EEELS  S SR +  SNGLEETICTARVQ
Subjt:  AGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQ

Query:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF
        HNMNTQREADVY REAGRVNILNQ KLPILRFM MSAEKGHLF
Subjt:  HNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF

A0A6J1K8H1 11S globulin seed storage protein 2-like1.3e-14978.86Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        MATK+VLAILLCF V SSLVSAQ   ERR FREEAQQCRLDR+QAR PSRRIESEGGI+E+WDE++EE QCAGVAA+R+ IRPN L++P F S+PMLIY+
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
        E+GEGF+GLNFPGCAETYEAQS+QSSRRSSRR+GR +GA +EEDQHQKVRRVRRGDMIVVPAGTV+WCHNDGGQDLV V+F+DLNNEDNQLDLRIR S+L
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPREAI------RGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETI
        AGG+PREA+      R  RGSRSS+DLVN+FSGFDQELLAEAYNIPS+LARK++E++S GLIVKC+EDMSFLTPEE+EEE SAS S      SNGLEETI
Subjt:  AGGVPREAI------RGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETI

Query:  CTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP
        CTARVQHNMNTQ+EADVY RE+GR+NILN+ KLPIL++MDMSAEKGHLFP
Subjt:  CTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFP

SwissProt top hitse value%identityAlignment
A0A1L6K371 11S globulin2.3e-6640.1Show/hide
Query:  MATKVVLAILLCFV--VSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLI
        MA  ++L+I LC V  V+  L  +  R + R+      +C+L R+ A  PS RIE+E G+ E WD  +++ QCAGVA +R TI PN L LP++ +AP L+
Subjt:  MATKVVLAILLCFV--VSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLI

Query:  YIERGEGFMGLNFPGCAETY-EAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRT
        YI +G G  G+ FPGC ET+ E+Q  QS  R S R      A  + D+HQK+R  R GD+I  PAG   WC+NDG   +VAVA MD  N  NQLD   R 
Subjt:  YIERGEGFMGLNFPGCAETY-EAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRT

Query:  SYLAGGVPREAIR--------------------GERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKS-GGLIVKCD-EDMSFLTP-------
         YLAG  P +  R                    GE G +      NVFSGFD + LA+A+N+ +E AR+++ E      IV+ +   +  + P       
Subjt:  SYLAGGVPREAIR--------------------GERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKS-GGLIVKCD-EDMSFLTP-------

Query:  ---------EEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF
                  E+E E     S RG RD NGLEETICT R++ N+     AD+Y  EAGR++  N H LP+LR++ +SAE+G L+
Subjt:  ---------EEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF

B5KVH4 11S globulin seed storage protein 12.1e-6739.21Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        MA  ++L+I LC ++ +       +S  R  + +  QC+L+R+ A  P+ RIE+E G+ E WD   ++ QCAGVA +R TI PN L LP + +AP L+YI
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL
         RG G  G+ FPGC ET+E +S + S++  RR       + ++D+HQK+R  R GD+I  PAG   WC+NDG   +VA+  +D +N  NQLD   R  YL
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYL

Query:  AGGVPRE-------------------AIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKS-GGLIVKCD-EDMSFLTP-----------
        AG    E                     RGE G +  D   NVFSGFD E LA+A+N+ +E AR+++ E    G IV+ +   +  + P           
Subjt:  AGGVPRE-------------------AIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKS-GGLIVKCD-EDMSFLTP-----------

Query:  -----EEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF
              E+E E     S RG RD NGLEETICT  ++ N+     AD+Y  EAGR++ +N H LPILR++ +SAE+G L+
Subjt:  -----EEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF

Q2TPW5 11S globulin seed storage protein Jug r 41.9e-6537.86Show/hide
Query:  MATKVVLAILLCFVV---SSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPML
        MA  ++L+I L  +V   +  L  +  R ++++      QC+L+R+ A  P+ RIE+E G+ E WD  +++ QCAGVA +R TI PN L LP++ +AP L
Subjt:  MATKVVLAILLCFVV---SSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPML

Query:  IYIERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRT
        +YI RG G  G+ FPGC ET+E    QS +  SR          ++D+HQK+R  R GD+I  PAG   W +NDG   +VA++ +D NN  NQLD   R 
Subjt:  IYIERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRT

Query:  SYLAGG-------------------VPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKS-GGLIVKCD-EDMSFLTP--------
         YLAG                      R+   GE G +      NVFSGFD + LA+A+N+ +E AR+++ E      IV+ +   +  + P        
Subjt:  SYLAGG-------------------VPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMKEEKS-GGLIVKCD-EDMSFLTP--------

Query:  --------EEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF
                 E+E E     S RG RD NGLEETICT R++ N+     AD+Y  EAGR++ +N H LP+LR++ +SAE+G L+
Subjt:  --------EEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLF

Q8GZP6 11S globulin seed storage protein Ana o 2.0101 (Fragment)1.0e-6641.57Show/hide
Query:  SERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETYEAQSAQS
        S + W  ++  +C++DR+ A  P  R+E E G  E WD   E+ +CAGVA +R+TI+PN L LP++ +AP LIY+ +GEG  G+++PGC ETY+A     
Subjt:  SERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETYEAQSAQS

Query:  SRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVPREAIRGERGSRSSDDLVNVF
          +  R+ G+   + R +D+HQK+RR RRGD+I +PAG   WC+N+G   +V V  +D++N  NQLD   R  +LAG  P++  + ++  +S     N+F
Subjt:  SRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVPREAIRGERGSRSSDDLVNVF

Query:  SGFDQELLAEAYNIPSELARKMKEEKSGGLIVKC-DEDMSFLTP--------EEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREA
        SGFD ELLAEA+ +   L +++K E + G IVK  D+++  + P         E EEE        GQRD NG+EETICT R++ N+N    AD+Y  E 
Subjt:  SGFDQELLAEAYNIPSELARKMKEEKSGGLIVKC-DEDMSFLTP--------EEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREA

Query:  GRVNILNQHKLPILRFMDMSAEKGHLFPPHLV
        GR+  LN   LPIL+++ +S EKG L+   LV
Subjt:  GRVNILNQHKLPILRFMDMSAEKGHLFPPHLV

Q9XHP0 11S globulin seed storage protein 24.9e-8547.32Show/hide
Query:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI
        +A K +LA+ L  +VS+++  AQ R  R     + QQCR  RI    PS RI+SEGG TELWDE  E+ QCAG+ A+R+TIRPN LSLP +H +P L+YI
Subjt:  MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYI

Query:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREE-----DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRI
        ERG+G + +  PGCAETY+        RS R M R   +++++     D HQKV R+R+GD++ +P+G   WC+NDG +DLVAV+  D+N+  NQLD + 
Subjt:  ERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREE-----DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRI

Query:  RTSYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMK-EEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETI
        R  YLAGGVPR    GE+  ++     N+F  FD ELL+EA+N+P E  R+M+ EE+  GLIV   E M+F+ P+E+E E       RG++  NGLEET 
Subjt:  RTSYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMK-EEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETI

Query:  CTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFPPHLVA
        CT + + N+ ++READ++ R+AGRV++++++KLPIL++MD+SAEKG+L+   LV+
Subjt:  CTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHLFPPHLVA

Arabidopsis top hitse value%identityAlignment
AT1G03880.1 cruciferin 23.2e-4735.9Show/hide
Query:  QCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRG
        +C+LD++ A  PS+ I+SEGG  E+WD    + +C+G A  R  I P  L LP F +A  L ++  G G MG   PGCAET+           S   G G
Subjt:  QCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRG

Query:  IGADREE---DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVP--REAIRGERGSRSSDDLVNVFSGFDQE
         G  + +   D HQKV  +R GD I  P+G  QW +N+G + L+ VA  DL +  NQLD  +R   +AG  P  +E ++G +  + +    N+F+GF  E
Subjt:  IGADREE---DQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVP--REAIRGERGSRSSDDLVNVFSGFDQE

Query:  LLAEAYNIPSELARKMKEEKSG-GLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPIL
        +LA+A+ I  E A++++ ++   G IVK +     + P  +  E       +    +NGLEET+CT R   N++   +ADVY    G ++ LN + LPIL
Subjt:  LLAEAYNIPSELARKMKEEKSG-GLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPIL

Query:  RFMDMSAEKGHL
        R + +SA +G +
Subjt:  RFMDMSAEKGHL

AT1G03890.1 RmlC-like cupins superfamily protein8.3e-4835.65Show/hide
Query:  CRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGI
        C   +I +  P++  + E G  E+WD    E +CAGV   R T++PN + LP F S P L Y+ +GEG MG    GC ET+      S        GRG 
Subjt:  CRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETYEAQSAQSSRRSSRRMGRGI

Query:  GAD---REEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLA
        G D   R ED HQK+   RRGD+    AG  QW +N G  D V V  +D+ N +NQLD   R   LAG   +E    E    +     N FSGFD  ++A
Subjt:  GAD---REEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLA

Query:  EAYNIPSELARKMKEEKSG-GLIVKCDEDMSFLTPEEQEEELSASSSSRGQRD--SNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILR
        EA+ I  E A++++ +K   G I++ +  + F+ P  +E           Q+D  +NG+EET CTA++  N++    +D +   AGR++ LN   LP+LR
Subjt:  EAYNIPSELARKMKEEKSG-GLIVKCDEDMSFLTPEEQEEELSASSSSRGQRD--SNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILR

Query:  FMDMSAEKGHLFPPHLV
         + ++A +G+L+   +V
Subjt:  FMDMSAEKGHLFPPHLV

AT4G28520.2 cruciferin 36.4e-4030.7Show/hide
Query:  QCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETY------------------
        +C LD +     +  I+SE G  E WD    + +C GV+  R  I    L LP F ++P + Y+ +G G  G   PGCAET+                  
Subjt:  QCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETY------------------

Query:  ----------EAQSAQSSRRSSRRMGRG--------------------IGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNED
                  + Q  Q  R+     G+G                     G     D HQKV  VRRGD+     G+  W +N G Q LV +A +D+ N  
Subjt:  ----------EAQSAQSSRRSSRRMGRG--------------------IGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNED

Query:  NQLDLRIRTSYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMK-EEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDS
        NQLD   R  +LAG   +    G  GS+   +  N++SGFD +++A+A  I  +LA++++ ++ S G IV+       + P  ++   S           
Subjt:  NQLDLRIRTSYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMK-EEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDS

Query:  NGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHL
        NGLEETIC+ R   N++    ADVY    GRV  +N + LPIL ++ +SA +G L
Subjt:  NGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHL

AT4G28520.4 cruciferin 36.4e-4030.7Show/hide
Query:  QCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETY------------------
        +C LD +     +  I+SE G  E WD    + +C GV+  R  I    L LP F ++P + Y+ +G G  G   PGCAET+                  
Subjt:  QCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLNFPGCAETY------------------

Query:  ----------EAQSAQSSRRSSRRMGRG--------------------IGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNED
                  + Q  Q  R+     G+G                     G     D HQKV  VRRGD+     G+  W +N G Q LV +A +D+ N  
Subjt:  ----------EAQSAQSSRRSSRRMGRG--------------------IGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNED

Query:  NQLDLRIRTSYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMK-EEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDS
        NQLD   R  +LAG   +    G  GS+   +  N++SGFD +++A+A  I  +LA++++ ++ S G IV+       + P  ++   S           
Subjt:  NQLDLRIRTSYLAGGVPREAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMK-EEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDS

Query:  NGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHL
        NGLEETIC+ R   N++    ADVY    GRV  +N + LPIL ++ +SA +G L
Subjt:  NGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHL

AT5G44120.3 RmlC-like cupins superfamily protein1.9e-4735.26Show/hide
Query:  VLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEG
        +L+  L  ++     +AQ   + + F  E   C+LD++ A  PS  ++SE G  E+WD    + +C+GV+  R  I    L LP F +   L ++ +G G
Subjt:  VLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEG

Query:  FMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVP
         MG   PGCAET++     SS    R  G+G  + R  D HQKV  +R GD I    G  QW +NDG + LV V+  DL +  NQLD   R  YLAG  P
Subjt:  FMGLNFPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVP

Query:  REAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMK-EEKSGGLIVKCDEDMSFLTP--------EEQEEELSASSSSRGQRDSNGLEETICT
        +  +  +   R      N+F+GF  E++A+A  I  + A++++ ++ + G IV+       + P        EE+EEE       R  R  NGLEETIC+
Subjt:  REAIRGERGSRSSDDLVNVFSGFDQELLAEAYNIPSELARKMK-EEKSGGLIVKCDEDMSFLTP--------EEQEEELSASSSSRGQRDSNGLEETICT

Query:  ARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHL
        AR   N++    ADVY  + G ++ LN + LPILRF+ +SA +G +
Subjt:  ARVQHNMNTQREADVYCREAGRVNILNQHKLPILRFMDMSAEKGHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACCAAAGTTGTACTGGCGATTTTGCTGTGTTTCGTCGTATCGTCCTCTCTGGTGAGCGCTCAGGACCGGTCTGAGAGGCGCTGGTTCAGGGAAGAAGCTCAGCA
ATGCCGGCTGGACAGGATTCAGGCGAGGCCGCCGTCGCGTCGGATCGAGTCGGAGGGAGGCATCACTGAGCTTTGGGATGAAGCTGATGAAGAGTGTCAGTGTGCTGGAG
TAGCAGCCATTAGAAACACCATAAGGCCCAACTGTCTCTCTCTGCCTAAATTCCACAGCGCCCCCATGCTCATTTACATCGAGCGAGGTGAAGGGTTTATGGGGCTGAAC
TTCCCAGGGTGTGCAGAGACATACGAGGCACAATCAGCGCAATCTTCAAGAAGGTCCTCAAGGCGTATGGGACGTGGAATTGGCGCAGACAGGGAGGAAGACCAACACCA
AAAGGTGCGCAGAGTCCGCCGTGGTGACATGATCGTCGTCCCGGCCGGCACCGTCCAATGGTGCCACAACGACGGCGGCCAAGATCTCGTCGCCGTTGCCTTCATGGATC
TCAACAACGAGGACAACCAGCTCGACCTCCGCATCAGGACATCTTACTTGGCTGGTGGAGTGCCGAGAGAAGCGATAAGGGGAGAAAGAGGATCAAGATCTTCAGATGAT
CTAGTGAACGTCTTTAGTGGGTTCGATCAGGAGCTTCTTGCTGAGGCTTACAACATTCCATCAGAGTTGGCGAGGAAAATGAAAGAAGAAAAGAGTGGCGGATTGATCGT
GAAGTGCGACGAAGATATGTCGTTTTTGACGCCGGAGGAACAGGAGGAAGAATTGAGTGCATCCTCATCTTCAAGAGGGCAACGAGACTCAAATGGATTGGAAGAAACCA
TCTGCACTGCTAGAGTCCAGCACAACATGAACACTCAAAGAGAAGCTGATGTATACTGTAGGGAGGCTGGTAGAGTTAACATTTTGAACCAACACAAGCTCCCCATTCTC
AGATTCATGGACATGAGTGCTGAGAAAGGCCATCTTTTCCCGCCTCATTTGGTGGCCTCACCGAATGAACGCTCAATACAACCTGCACTGGTCAATGACAGACCACAGAA
TGGTGTACGTGGTAGAGGGAGAGGCAGAAATTCAAATAGCCGACGACTACGGCAACCTAGTGTTGAACGAGAGAGTCTCAAGAGGAAACATGTTCGTCATTCCCCAATTC
TACGTCACAATGGCTCGAGCAGGGCCAGAAGGGTTCGAATGGATCACTTTCAAGACCTCAAGCCAGCCCATCAAGAGCCCTGTGGCTGGCTACACATCGCTCTTCAGAGC
CCTCCCACTCCAAGTCCTCGAACAATCGTTCCAAATCACAACGAGAGAGGCCGAGCAACTCAAGCAGACCAGAAGGCAGCACACTTTCCTCTTCCCTCCGACGAGCAGCA
GCAGCCGCCGCAGCAGCCGCTATTGAAAAGAACCTTTTGCCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACCAAAGTTGTACTGGCGATTTTGCTGTGTTTCGTCGTATCGTCCTCTCTGGTGAGCGCTCAGGACCGGTCTGAGAGGCGCTGGTTCAGGGAAGAAGCTCAGCA
ATGCCGGCTGGACAGGATTCAGGCGAGGCCGCCGTCGCGTCGGATCGAGTCGGAGGGAGGCATCACTGAGCTTTGGGATGAAGCTGATGAAGAGTGTCAGTGTGCTGGAG
TAGCAGCCATTAGAAACACCATAAGGCCCAACTGTCTCTCTCTGCCTAAATTCCACAGCGCCCCCATGCTCATTTACATCGAGCGAGGTGAAGGGTTTATGGGGCTGAAC
TTCCCAGGGTGTGCAGAGACATACGAGGCACAATCAGCGCAATCTTCAAGAAGGTCCTCAAGGCGTATGGGACGTGGAATTGGCGCAGACAGGGAGGAAGACCAACACCA
AAAGGTGCGCAGAGTCCGCCGTGGTGACATGATCGTCGTCCCGGCCGGCACCGTCCAATGGTGCCACAACGACGGCGGCCAAGATCTCGTCGCCGTTGCCTTCATGGATC
TCAACAACGAGGACAACCAGCTCGACCTCCGCATCAGGACATCTTACTTGGCTGGTGGAGTGCCGAGAGAAGCGATAAGGGGAGAAAGAGGATCAAGATCTTCAGATGAT
CTAGTGAACGTCTTTAGTGGGTTCGATCAGGAGCTTCTTGCTGAGGCTTACAACATTCCATCAGAGTTGGCGAGGAAAATGAAAGAAGAAAAGAGTGGCGGATTGATCGT
GAAGTGCGACGAAGATATGTCGTTTTTGACGCCGGAGGAACAGGAGGAAGAATTGAGTGCATCCTCATCTTCAAGAGGGCAACGAGACTCAAATGGATTGGAAGAAACCA
TCTGCACTGCTAGAGTCCAGCACAACATGAACACTCAAAGAGAAGCTGATGTATACTGTAGGGAGGCTGGTAGAGTTAACATTTTGAACCAACACAAGCTCCCCATTCTC
AGATTCATGGACATGAGTGCTGAGAAAGGCCATCTTTTCCCGCCTCATTTGGTGGCCTCACCGAATGAACGCTCAATACAACCTGCACTGGTCAATGACAGACCACAGAA
TGGTGTACGTGGTAGAGGGAGAGGCAGAAATTCAAATAGCCGACGACTACGGCAACCTAGTGTTGAACGAGAGAGTCTCAAGAGGAAACATGTTCGTCATTCCCCAATTC
TACGTCACAATGGCTCGAGCAGGGCCAGAAGGGTTCGAATGGATCACTTTCAAGACCTCAAGCCAGCCCATCAAGAGCCCTGTGGCTGGCTACACATCGCTCTTCAGAGC
CCTCCCACTCCAAGTCCTCGAACAATCGTTCCAAATCACAACGAGAGAGGCCGAGCAACTCAAGCAGACCAGAAGGCAGCACACTTTCCTCTTCCCTCCGACGAGCAGCA
GCAGCCGCCGCAGCAGCCGCTATTGAAAAGAACCTTTTGCCAGTGA
Protein sequenceShow/hide protein sequence
MATKVVLAILLCFVVSSSLVSAQDRSERRWFREEAQQCRLDRIQARPPSRRIESEGGITELWDEADEECQCAGVAAIRNTIRPNCLSLPKFHSAPMLIYIERGEGFMGLN
FPGCAETYEAQSAQSSRRSSRRMGRGIGADREEDQHQKVRRVRRGDMIVVPAGTVQWCHNDGGQDLVAVAFMDLNNEDNQLDLRIRTSYLAGGVPREAIRGERGSRSSDD
LVNVFSGFDQELLAEAYNIPSELARKMKEEKSGGLIVKCDEDMSFLTPEEQEEELSASSSSRGQRDSNGLEETICTARVQHNMNTQREADVYCREAGRVNILNQHKLPIL
RFMDMSAEKGHLFPPHLVASPNERSIQPALVNDRPQNGVRGRGRGRNSNSRRLRQPSVERESLKRKHVRHSPILRHNGSSRARRVRMDHFQDLKPAHQEPCGWLHIALQS
PPTPSPRTIVPNHNERGRATQADQKAAHFPLPSDEQQQPPQQPLLKRTFCQ