CuGenDBv2

Sed0006402 (gene) of Chayote v1 genome

Gene ID: Sed0006402
Organism: Sechium edule (Chayote v1)
Description: 11S globulin seed storage protein 2-like
Genome location: LG01:71212488..71214618
RNA-Seq Expression: Sed0006402
Synteny: Sed0006402
Gene Ontology terms: GO:0045735 - nutrient reservoir activity (molecular function)
InterPro domains: IPR006044 - 11-S seed storage protein, plant
                  IPR006045 - Cupin 1
                  IPR011051 - RmlC-like cupin domain superfamily
                  IPR014710 - RmlC-like jelly roll fold
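
The genome location above is written as a sequence name followed by start and end coordinates. The following is a minimal Python sketch of splitting that string and computing the span of the gene. It assumes the coordinates are 1-based and inclusive, which the page does not state, and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(loc):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("LG01:71212488..71214618")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under a 0-based, half-open convention the span would instead be `end - start`, so the convention should be checked before the value is used downstream.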


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6606375.1 hypothetical protein SDJN03_03692, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.3e-197 | identity: 74.42%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MATKLVLAILLCF VSSLVSAQ  E RR R++AQQCR DR+Q RPPSRRIESEGGI+E+WDES++EFQCAG AA+R+IIRPNSL++P F SSP+LI++EQ
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        GEG++GLNFPGCAE YEA+S+QSSR SSRR+GR++GAG+E+DQHQKVRRVRRGDMI+VPAGTV+WCHNDG QDL+ V+F+DLNNEDNQLDLRIR S+LAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE
        G+PREA+R        R S+SSEDLVNIF GFDQELLAEAYNIPSDL R++QE++SSGLIVKCEEDMS+LTP+++E+E SA  SS    E       QH 
Subjt:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE

Query:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA
         ++++   +YSRE+GR+NILN+ KLP+L++MDMSAEKGHLFPNAQYNL WSMT+HRLVYVVEGEAE+QI+DDYGNQV NERVSKGNMF+IPQFYASLAQA
Subjt:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA

Query:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR
          EGFEW+TFKTS QPMKSPV GY SLFRALP QVLEQSFQ+TA EA+QLKQTR +HTFLFPP S SS  S R
Subjt:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR

KAG7036316.1 hypothetical protein SDJN02_03119, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 3.3e-197 | identity: 74.42%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MATKLVLAILLCF VSSLVSAQ  E RR R++AQQCR DR+Q RPPSRRIESEGGI+E+WDES++EFQCAG AA+R+IIRPNSL++P F SSP+LI++EQ
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        GEG++GLNFPGCAE YEA+S+QSSR SSRR+GR++GAG+E+DQHQKVRRVRRGDMI+VPAGTV+WCHNDG QDL+ V+F+DLNNEDNQLDLRIR S+LAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE
        G+PREA+R        R S+SSEDLVNIF GFDQELLAEAYNIPSDL R++QE++SSGLIVKCEEDMS+LTP+++E+E SA  SS    E       QH 
Subjt:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE

Query:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA
         ++++   +YSRE+GR+NILN+ KLP+L++MDMSAEKGHLFPNAQYNL WSMT+HRLVYVVEGEAE+QI+DDYGNQV NERVSKGNMF+IPQFYASLAQA
Subjt:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA

Query:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR
          EGFEW+TFKTS QPMKSPV GY SLFRALP QVLEQSFQ+TA EA+QLKQTR +HTFLFPP S SS  S R
Subjt:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR

XP_022930973.1 11S globulin seed storage protein 2-like [Cucurbita moschata] | e-value: 2.5e-197 | identity: 74.42%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MATKLVLAILLCF VSSLVSAQ  E RR R++AQQCR DR+Q RPPSRRIESEGGI+E+WDES++EFQCAG AA+R+IIRPNSL++P F SSP+LI++EQ
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        GEG++GLNFPGCAE YEA+S+QSSR SSRR+GR++GAG+E+DQHQKVRRVRRGDMI+VPAGTV+WCHNDG QDL+ V+F+DLNNEDNQLDLRIR S+LAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE
        G+PREA+R        R S+SSEDLVNIF GFDQELLAEAYNIPSDL R++QE++SSGLIVKCEEDMS+LTP+++E+E SA  SS    E       QH 
Subjt:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE

Query:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA
         ++++   +YSRE+GR+NILN+ KLP+L++MDMSAEKGHLFPNAQYNL WSMT+HRLVYVVEGEAE+QI+DDYGNQV NERVSKGNMF+IPQFYASLAQA
Subjt:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA

Query:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR
          EGFEW+TFKTS QPMKSPV GY SLFRALP QVLEQSFQ+TA EA+QLKQTR +HTFLFPP S SS   RR
Subjt:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR

XP_022995608.1 11S globulin seed storage protein 2-like [Cucurbita maxima] | e-value: 4.8e-196 | identity: 74%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MATKLVLAILLCF VSSLVSAQ  E RR R++AQQCR DR+Q R PSRRIESEGGI+E+WDES++EFQCAG AA+R+IIRPNSL++P F SSP+LI++EQ
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        GEG++GLNFPGCAE YEA+S+QSSR SSRR+GR++GAG+E+DQHQKVRRVRRGDMI+VPAGTV+WCHNDG QDL+ V+F+DLNNEDNQLDLRIR S+LAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE
        G+PREA+R        R S+SSEDLVNIFSGFDQELLAEAYNIPSDL R++QE++S+GLIVKCEEDMS+LTP+++E+E SA  SS    E       QH 
Subjt:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE

Query:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA
         ++++   +YSRE+GR+NILN+ KLP+L++MDMSAEKGHLFPNAQYNL WSMT+HRLVYVVEGEAE+QI+DDYGNQVFN RVSKG+MF+IPQFYASLAQA
Subjt:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA

Query:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR
          EGFEW+TFKTS QPMKSPV GY SLFRALP QVLEQSFQ+TA EA+QLKQTR +HTFLFPP S SS   RR
Subjt:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR

XP_023532597.1 11S globulin seed storage protein 2-like [Cucurbita pepo subsp. pepo] | e-value: 1.5e-197 | identity: 75.16%
Query:  MATKLVLAILLCFTVSSLVSAQSHE-SRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIE
        MATKLVLAILLCF VSSLVSAQ  E  RR R++AQQCR DR+Q RPPSRRIESEGGI+E+WDES++EFQCAG AA+R+IIRPNSL++P F SSP+LI++E
Subjt:  MATKLVLAILLCFTVSSLVSAQSHE-SRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIE

Query:  QGEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLA
        QGEG+MGLNFPGCAE YEA+S+QSSR SSRR+GR++GAG+E+DQHQKVRRVRRGDMI+VPAGTV+WCHNDG QDL+ V+F+DLNNEDNQLDLRIR S+LA
Subjt:  QGEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLA

Query:  GGIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQH
        GGIPREA+R     G  R S+SSEDLVNIF GFDQELLAEAYNIPSDL R++QE++SSGLIVKCEEDMS+LTP+++E+E SA  SS    E       QH
Subjt:  GGIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQH

Query:  E-HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQ
          ++++   +YSRE+GR+NILN+ KLP+L++MDMSAEKGHLFPNAQYNL WS+T+HRLVYVVEGEAE+QI+DDYGNQVF ERVSKGNMF+IPQFYASLAQ
Subjt:  E-HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQ

Query:  ASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRR
        A  EGFEW+TFKTS QPMKSPV GY SLFRALP QVLEQSFQ+TA EA+QLKQTR +HTFLFPP S SSRR
Subjt:  ASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRR

TrEMBL top hits (e-value, % identity, alignment)
A0A5D3DAK7 11S globulin seed storage protein 2-like | e-value: 1.7e-186 | identity: 70.69%
Query:  MATKLVLAILLC-FTVSSLVSAQSHESR---RLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLI
        MA K++LAILLC F   SLV+AQ    R   R   +AQ C+ DRI++RPPSRRIESEGGITELWDE+D+EFQCAG  AIRN IRPNSLSLPKFH++P+L+
Subjt:  MATKLVLAILLC-FTVSSLVSAQSHESR---RLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLI

Query:  FIEQGEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQ-EQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRG
        +IEQGEG+ G+N+PGCAE YE++SAQSSR S+RRMGR+IGAG+ E+DQHQK+RRVRRGDMI++PAGTVQWC+NDG +DLIAVAF+DLNN+DNQLDLR+RG
Subjt:  FIEQGEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQ-EQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRG

Query:  SYLAGGIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNG-------
        S+LAGG+P EA R  E R   S++LVNIF+G DQE L+EA+NIPSDLVRRMQEE+SSGLIVKC+E+MS+LTP+++E+ELS  + SRR    NG       
Subjt:  SYLAGGIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNG-------

Query:  -NAQHE-HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFY
           QH  +++R   ++SREAGRVNILNQ KLP+LRF+ MSAEKGHLFPNAQ+NL WSMT+HR+VYVV+GEAE+QI+DDYGNQ+FNERVS+GNMF+IPQFY
Subjt:  -NAQHE-HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFY

Query:  ASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSS-RRSRRS
         +LA+A  EGFEW+TFKTSNQPMKSPVAGY S FRALPLQ+LEQSFQ+T AEA+QLKQTR QHT LFPP S SS  RSRRS
Subjt:  ASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSS-RRSRRS

A0A6J1EX10 11S globulin seed storage protein 2-like | e-value: 1.2e-197 | identity: 74.42%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MATKLVLAILLCF VSSLVSAQ  E RR R++AQQCR DR+Q RPPSRRIESEGGI+E+WDES++EFQCAG AA+R+IIRPNSL++P F SSP+LI++EQ
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        GEG++GLNFPGCAE YEA+S+QSSR SSRR+GR++GAG+E+DQHQKVRRVRRGDMI+VPAGTV+WCHNDG QDL+ V+F+DLNNEDNQLDLRIR S+LAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE
        G+PREA+R        R S+SSEDLVNIF GFDQELLAEAYNIPSDL R++QE++SSGLIVKCEEDMS+LTP+++E+E SA  SS    E       QH 
Subjt:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE

Query:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA
         ++++   +YSRE+GR+NILN+ KLP+L++MDMSAEKGHLFPNAQYNL WSMT+HRLVYVVEGEAE+QI+DDYGNQV NERVSKGNMF+IPQFYASLAQA
Subjt:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA

Query:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR
          EGFEW+TFKTS QPMKSPV GY SLFRALP QVLEQSFQ+TA EA+QLKQTR +HTFLFPP S SS   RR
Subjt:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR

A0A6J1HGI1 11S globulin seed storage protein 2-like | e-value: 2.2e-194 | identity: 74.53%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        M T++VLAILLC  V+   S+Q HE R  R++AQQCR DRI+  PPSRRIESEGGITELWDE++++FQCAG AAIRNIIRPN LSLPKFHSSP+LI+IEQ
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        GEG++GLNFPGCAE YEA+SAQSSR SSRRMGR+IGAG+E DQHQKVRRVRRGDMI+VPAGTVQWCHNDG QDL+AVAF+DLNNEDNQLDLRIRGS+LAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNG--------NAQH
        GIPRE  R  E R   + DLVNI++GFDQ+ LA+AYN+P+DLVRRMQEE+SSGLIVKC+E MS+LTP+++E+ELS  + SRRE  SNG          QH
Subjt:  GIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNG--------NAQH

Query:  E-HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQ
          +++R   +YSREAGR+NILNQ KLP+LRFM MSAEKGHLFPNAQYNL WSMT+HRLVYVV+GEA  QI+DDYGNQVFNERVS+GNMF+IPQFY +L Q
Subjt:  E-HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQ

Query:  ASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPS--MSSRRSR
        A  EGFEWITFKTSNQPMKSP+AGY S FRALPLQ+LEQSFQ+T AEA+QLKQTR QHTFLFPP S   SSRR R
Subjt:  ASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPS--MSSRRSR

A0A6J1HV45 11S globulin seed storage protein 2-like | e-value: 9.7e-195 | identity: 74.53%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MAT++VLAILLC  V+   S+Q HE    R++AQQCR DRI+  PPSRRIESEGGITELWDE++++FQCAG AA+RNIIRPN LSLPKFHSSP+LI+IEQ
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        GEG++GLN+PGCAE YEA+SAQSSR SSRRMGR+IGAG+E DQHQKVRRVRRGDMI+VPAGTVQWCHNDG QDLIA+AF+DLNNEDNQLDLRIRGS+LAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNG--------NAQH
        GIPRE  R  E R   + DLVNIFSGFDQE LA+AYN+P+DLVR+MQEE+SSGLIVKC+E MS+LTP+++E+ELS  + SRRE  SNG          QH
Subjt:  GIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNG--------NAQH

Query:  E-HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQ
          +++R   +YSREAGRVNILNQ KLP+LRFM MSAEKGHLF NAQYNL WSMT+HRLVYVV+GEA  QI+DDYGNQVFNERVS+GNMF+IPQFY +L Q
Subjt:  E-HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQ

Query:  ASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRRS
        A  EGFEWITFKTSNQPMKSP+AGY S FRALPLQ+LEQSFQ+T AEA+QLKQTR QHTFLFPP S SS R  RS
Subjt:  ASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRRS

A0A6J1K8H1 11S globulin seed storage protein 2-like | e-value: 2.3e-196 | identity: 74%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MATKLVLAILLCF VSSLVSAQ  E RR R++AQQCR DR+Q R PSRRIESEGGI+E+WDES++EFQCAG AA+R+IIRPNSL++P F SSP+LI++EQ
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        GEG++GLNFPGCAE YEA+S+QSSR SSRR+GR++GAG+E+DQHQKVRRVRRGDMI+VPAGTV+WCHNDG QDL+ V+F+DLNNEDNQLDLRIR S+LAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE
        G+PREA+R        R S+SSEDLVNIFSGFDQELLAEAYNIPSDL R++QE++S+GLIVKCEEDMS+LTP+++E+E SA  SS    E       QH 
Subjt:  GIPREAIR-----GGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSR--REHNSNGNAQHE

Query:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA
         ++++   +YSRE+GR+NILN+ KLP+L++MDMSAEKGHLFPNAQYNL WSMT+HRLVYVVEGEAE+QI+DDYGNQVFN RVSKG+MF+IPQFYASLAQA
Subjt:  -HSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQA

Query:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR
          EGFEW+TFKTS QPMKSPV GY SLFRALP QVLEQSFQ+TA EA+QLKQTR +HTFLFPP S SS   RR
Subjt:  SSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSRR

SwissProt top hits (e-value, % identity, alignment)
A0A1L6K371 11S globulin | e-value: 1.3e-79 | identity: 34.89%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MA  ++L+I LC  V+ +    +    R + +  +C+  R+    PS RIE+E G+ E WD ++++FQCAG A +R  I PN L LP++ ++P L++I +
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        G G  G+ FPGC E +E    +S +  SR       A  ++D+HQK+R  R GD+I  PAG   WC+NDG   ++AVA MD  N  NQLD   R  YLAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPRE------------------AIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKS-SGLIVKCE-EDMSYLTPQDDEKELSAPTSS
            E                    R GE   +      N+FSGFD + LA+A+N+ ++  RR+Q E      IV+ E   +  + P+   +E       
Subjt:  GIPRE------------------AIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKS-SGLIVKCE-EDMSYLTPQDDEKELSAPTSS

Query:  RREHNSNGNAQHEHSKRS-------------------------RYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEG
         RE      ++   S+R                            IY+ EAGR++  N H LP+LR++ +SAE+G L+ +A Y   W++  H +VY + G
Subjt:  RREHNSNGNAQHEHSKRS-------------------------RYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEG

Query:  EAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFL--F
         AEVQ+ D++G  VF++ + +G +  IPQ +A + +A +EGFEW++FKT+   M SP+AG  S  RALP +VL  + Q+   +A++LK  R + T +   
Subjt:  EAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFL--F

Query:  PPPSMSSRRSRRS
        P  S SSR  RR+
Subjt:  PPPSMSSRRSRRS

B5KVH4 11S globulin seed storage protein 1 | e-value: 4.8e-82 | identity: 34.96%
Query:  MATKLVLAILLCFTVSSLVS-AQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIE
        MA  ++L+I LC  + +L +   +    R + +  QC+ +R+    P+ RIE+E G+ E WD + ++ QCAG A +R  I PN L LP + ++P L++I 
Subjt:  MATKLVLAILLCFTVSSLVS-AQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIE

Query:  QGEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLA
        +G G  G+ FPGC E +E    QS +   R          +QD+HQK+R  R GD+I  PAG   WC+NDGS  ++A+  +D +N  NQLD   R  YLA
Subjt:  QGEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLA

Query:  GGIPRE------------------AIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKS-SGLIVKCE-EDMSYLTPQDDEKELSAPTS
        G    E                    R GE   +  +   N+FSGFD E LA+A+N+ ++  RR+Q E    G IV+ E   +  + P+   +E      
Subjt:  GGIPRE------------------AIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKS-SGLIVKCE-EDMSYLTPQDDEKELSAPTS

Query:  SRREHNSNGNAQHEHSKRS-------------------------RYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVE
          RE      ++   S+R                            IY+ EAGR++ +N H LP+LR++ +SAE+G L+ +A Y   W++  H +VY + 
Subjt:  SRREHNSNGNAQHEHSKRS-------------------------RYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVE

Query:  GEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFP
        G AEVQ+ D++G  VF++ + +G +  IPQ +A + +A  EGFEW++FKT+   M SP+AG  S  RALP +VL  +FQ+   +A++LK  R + T L  
Subjt:  GEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFP

Query:  PPSMSSRRSRRS
          S SSR  RR+
Subjt:  PPSMSSRRSRRS

Q2TPW5 11S globulin seed storage protein Jug r 4 | e-value: 2.2e-82 | identity: 34.89%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        MA  ++L+I L   V+      +    R ++Q  QC+ +R+    P+ RIE+E G+ E WD ++++FQCAG A +R  I PN L LP++ ++P L++I +
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        G G  G+ FPGC E +E    QS +  SR          +QD+HQK+R  R GD+I  PAG   W +NDGS  ++A++ +D NN  NQLD   R  YLAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIRGGE------RRSKSSEDLV------------NIFSGFDQELLAEAYNIPSDLVRRMQEEKS-SGLIVKCE-EDMSYLTPQDDEKELSAPTSS
            E    G+      RR +  +               N+FSGFD + LA+A+N+ ++  RR+Q E      IV+ E   +  + P+   +E       
Subjt:  GIPREAIRGGE------RRSKSSEDLV------------NIFSGFDQELLAEAYNIPSDLVRRMQEEKS-SGLIVKCE-EDMSYLTPQDDEKELSAPTSS

Query:  RREHNSNGNAQHEHSKRS-------------------------RYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEG
         RE      ++   S+R                            IY+ EAGR++ +N H LP+LR++ +SAE+G L+ +A Y   W++  H +VY + G
Subjt:  RREHNSNGNAQHEHSKRS-------------------------RYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEG

Query:  EAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFL--F
         AEVQ+ D++G  VF++ + +G +  IPQ +A + +A +EGFEW++FKT+   M SP+AG  S  RALP +VL  +FQ+   +A++LK  R + T +   
Subjt:  EAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFL--F

Query:  PPPSMSSRRSRRS
        P  S SSR  RR+
Subjt:  PPPSMSSRRSRRS

Q8GZP6 11S globulin seed storage protein Ana o 2.0101 (Fragment) | e-value: 9.1e-81 | identity: 35.84%
Query:  SRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFPGCAEAYEAESAQSSR
        SR+  +Q  +C+ DR+    P  R+E E G  E WD + ++F+CAG A +R+ I+PN L LP++ ++P LI++ QGEG  G+++PGC E Y+A       
Subjt:  SRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFPGCAEAYEAESAQSSR

Query:  SSSRRMGRQIG-AGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGIPREAIRGGERRSKSSEDLVNIF
            + GRQ G +G+ QD+HQK+RR RRGD+I +PAG   WC+N+G+  ++ V  +D++N  NQLD   R  +LAG  P++  +  ++         N+F
Subjt:  SSSRRMGRQIG-AGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGIPREAIRGGERRSKSSEDLVNIF

Query:  SGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQHEHSKR--------------------------S
        SGFD ELLAEA+ +   L+++++ E + G IVK          +DDE  +  P+ S+ E  S    + E  KR                           
Subjt:  SGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQHEHSKR--------------------------S

Query:  RYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFE
          IY+ E GR+  LN   LP+L+++ +S EKG L+ NA     W++  H ++Y  +G+ +VQ+ D++GN+VF+  V +G M ++PQ +A + +A  E FE
Subjt:  RYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFE

Query:  WITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHT
        WI+FKT+++ M SP+AG  S+   +P +VL  +FQ++  +A+++K    Q T
Subjt:  WITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHT

Q9XHP0 11S globulin seed storage protein 2 | e-value: 7.6e-112 | identity: 45.38%
Query:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ
        +A K +LA+ L   VS+ + AQ+ E R    Q QQCRF RI    PS RI+SEGG TELWDE  ++FQCAG  A+R+ IRPN LSLP +H SP L++IE+
Subjt:  MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQ

Query:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG
        G+G + +  PGCAE Y+   +Q +   +    +Q   G  +D HQKV R+R+GD++ +P+G   WC+NDGS+DL+AV+  D+N+  NQLD + R  YLAG
Subjt:  GEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAG

Query:  GIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQ-EEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQ--------
        G+P    R GE+  ++ +   NIF  FD ELL+EA+N+P + +RRMQ EE+  GLIV   E M+++ P ++E E       R     NG  +        
Subjt:  GIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQ-EEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQ--------

Query:  -HEHSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLA
         +  S+R   I+SR+AGRV++++++KLP+L++MD+SAEKG+L+ NA  +  WSMT H +VYV  G+A+VQ+ D  G  + N+RV++G MF++PQ+Y S A
Subjt:  -HEHSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLA

Query:  QASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPP
        +A + GFEW+ FKT+  PM+SP+AGY S+ RA+PLQV+  S+Q++  +A+ LK  R   +FL  P
Subjt:  QASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPP

Arabidopsis top hits (e-value, % identity, alignment)
AT1G03880.1 cruciferin 2 | e-value: 2.4e-60 | identity: 32.96%
Query:  QCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQ
        +C+ D++    PS+ I+SEGG  E+WD    + +C+G A  R +I P  L LP F ++  L F+  G G MG   PGCAE +           S   G  
Subjt:  QCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQ

Query:  IGAGQEQ---DQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGIP--REAIRGGERRSKSSEDLVNIFSGFDQ
         G GQ Q   D HQKV  +R GD I  P+G  QW +N+G++ LI VA  DL +  NQLD  +R   +AG  P  +E ++G +++ ++     NIF+GF  
Subjt:  IGAGQEQ---DQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGIP--REAIRGGERRSKSSEDLVNIFSGFDQ

Query:  ELLAEAYNIPSDLVRRMQEEKSS-GLIVKCEEDMSYLTPQDDEKELSAPTSSRREHN-SNGNAQHEHSKR---------SRYIYSREAGRVNILNQHKLP
        E+LA+A+ I  +  +++Q ++ + G IVK       + P      L      ++ H  +NG  +   + R            +Y    G ++ LN + LP
Subjt:  ELLAEAYNIPSDLVRRMQEEKSS-GLIVKCEEDMSYLTPQDDEKELSAPTSSRREHN-SNGNAQHEHSKR---------SRYIYSREAGRVNILNQHKLP

Query:  LLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMS
        +LR + +SA +G +  NA    QW++  +  +YV  G+A +Q+ +D G +VF++ +S G + ++PQ ++ +  A  E FEWI FKT+     + +AG  S
Subjt:  LLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMS

Query:  LFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSR
        + R LPL+V+   +Q++  EAK++K + ++ T     P MS  R R
Subjt:  LFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSR

AT1G03890.1 RmlC-like cupins superfamily protein | e-value: 2.3e-63 | identity: 32.97%
Query:  KLVLAILLCFTVSSLVSAQSHESR-RLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGE
        KL+ ++L   ++S L+     E+R R       C F +I    P++  + E G  E+WD    E +CAG    R  ++PNS+ LP F S P L ++ QGE
Subjt:  KLVLAILLCFTVSSLVSAQSHESR-RLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGE

Query:  GYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGI
        G MG    GC E +      S R      GR+      +D HQK+   RRGD+    AG  QW +N G  D + V  +D+ N +NQLD   R   LAG  
Subjt:  GYMGLNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGI

Query:  PREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSS-GLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQH--EHSKRSR
         +E     E +  +     N FSGFD  ++AEA+ I  +  +++Q +K + G I++    + ++ P   E +     +   E           +  +RS 
Subjt:  PREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSS-GLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQH--EHSKRSR

Query:  YIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEW
        + +S  AGR++ LN   LP+LR + ++A +G+L+       QW+   H ++YV  G+A++Q+ DD G  VFNE+V +G + +IPQ +A    A   GFEW
Subjt:  YIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEW

Query:  ITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSS
        I+FKT++    + ++G  S  RA+P+ V++ S+ +   EAK++K ++ Q T L   PS SS
Subjt:  ITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSS

AT4G28520.1 cruciferin 3 | e-value: 3.6e-56 | identity: 29.75%
Query:  QCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFPGCAEAY-EAESAQSSRSSSRRMGR
        +C  D + +   +  I+SE G  E WD +  + +C G +  R +I    L LP F +SP + ++ QG G  G   PGCAE + +++  Q  +      GR
Subjt:  QCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFPGCAEAY-EAESAQSSRSSSRRMGR

Query:  Q------------------------------------------IGAGQE-----QDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNED
        Q                                           G GQ+     +D HQKV  VRRGD+     G+  W +N G Q L+ +A +D+ N  
Subjt:  Q------------------------------------------IGAGQE-----QDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNED

Query:  NQLDLRIRGSYLAGGIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSS-GLIVKCEEDMSYLTPQDDEKELSAPTSSRREHN
        NQLD   R  +LAG       +GG   S+  ++  N++SGFD +++A+A  I   L +++Q ++ S G IV+ +     + P   +   S      R   
Subjt:  NQLDLRIRGSYLAGGIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSS-GLIVKCEEDMSYLTPQDDEKELSAPTSSRREHN

Query:  SNGNAQHEHSKRSRY---------IYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSK
         NG  +   S RS           +Y    GRV  +N + LP+L ++ +SA +G L  NA    +++M  + ++Y   G+  +Q+ +D G  V +++V K
Subjt:  SNGNAQHEHSKRSRY---------IYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSK

Query:  GNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHT
        G + +IPQ +A + Q+    FEWI+FKT+   M S +AG  SL RALPL+V+   FQ++  EA+++K   ++ T
Subjt:  GNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHT

AT4G28520.3 cruciferin 3 | e-value: 1.1e-49 | identity: 27.37%
Query:  QCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFPGCAEAY-EAESAQSSRSSSRRMGR
        +C  D + +   +  I+SE G  E WD +  + +C G +  R +I    L LP F +SP + ++ QG G  G   PGCAE + +++  Q  +      GR
Subjt:  QCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFPGCAEAY-EAESAQSSRSSSRRMGR

Query:  Q------------------------------------------IGAGQE-----QDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNED
        Q                                           G GQ+     +D HQKV  VRRGD+     G+  W +N G Q L+ +A +D+ N  
Subjt:  Q------------------------------------------IGAGQE-----QDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNED

Query:  NQLDLRIRGSYLAGGIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNS
        NQLD   R  +LAG       +GG   S+  ++  N++SGFD +++A+A  I                                                
Subjt:  NQLDLRIRGSYLAGGIPREAIRGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNS

Query:  NGNAQHEHSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFY
                      +Y    GRV  +N + LP+L ++ +SA +G L  NA    +++M  + ++Y   G+  +Q+ +D G  V +++V KG + +IPQ +
Subjt:  NGNAQHEHSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFY

Query:  ASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHT
        A + Q+    FEWI+FKT+   M S +AG  SL RALPL+V+   FQ++  EA+++K   ++ T
Subjt:  ASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHT

AT5G44120.3 RmlC-like cupins superfamily protein (e value 9.7e-62; identity 31.37%)
Query:  LLCFTVSSLVSAQSHESRRLRKQAQ---QCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMG
        LL F ++ L+    + +++ ++  Q   +C+ D++    PS  ++SE G  E+WD    + +C+G +  R II    L LP F ++  L F+ +G G MG
Subjt:  LLCFTVSSLVSAQSHESRRLRKQAQ---QCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMG

Query:  LNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQ---DQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGIP
           PGCAE ++         SS    R  G GQ Q   D HQKV  +R GD I    G  QW +NDG + L+ V+  DL +  NQLD   R  YLAG  P
Subjt:  LNFPGCAEAYEAESAQSSRSSSRRMGRQIGAGQEQ---DQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGIP

Query:  REAI--RGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQ-EEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQHEHSKRSRY
        +  +  +G E++ +      NIF+GF  E++A+A  I     +++Q ++ + G IV+ +     + P    +           H  +GN   E    +R 
Subjt:  REAI--RGGERRSKSSEDLVNIFSGFDQELLAEAYNIPSDLVRRMQ-EEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQHEHSKRSRY

Query:  -----------IYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASL
                   +Y  + G ++ LN + LP+LRF+ +SA +G +  NA    QW+   + ++YV +GEA++QI +D GN+VF+ +VS+G +  +PQ ++ +
Subjt:  -----------IYSREAGRVNILNQHKLPLLRFMDMSAEKGHLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASL

Query:  AQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSR
         +A+S  F+W+ FKT+     + +AG  S+ R LPL+V+   FQ++  EA+++K   ++ T        S  R R
Subjt:  AQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAKQLKQTRMQHTFLFPPPSMSSRRSR


Sequences
CDS sequence
ATGGCCACCAAGCTTGTACTGGCGATTCTGCTGTGTTTCACTGTATCGTCTCTCGTGAGCGCTCAGAGCCACGAGAGCCGTCGGTTAAGGAAACAAGCTCAGCAATGCCG
CTTTGACAGGATTCAGATGAGGCCACCCTCGCGTCGGATCGAGTCGGAGGGAGGTATTACTGAGCTTTGGGATGAATCTGATAAAGAATTTCAGTGTGCTGGAGCTGCTG
CCATTAGAAACATCATAAGGCCCAACTCTCTCTCTTTGCCTAAATTCCACAGCTCCCCCGTGCTCATCTTCATTGAGCAAGGTGAAGGGTACATGGGGTTGAACTTCCCA
GGGTGTGCAGAGGCATACGAGGCAGAATCAGCTCAATCATCAAGAAGCTCTTCGAGGCGTATGGGGCGTCAAATTGGCGCCGGCCAAGAACAAGACCAACACCAAAAGGT
TCGGAGAGTCCGCCGTGGTGACATGATCATCGTCCCTGCCGGTACCGTCCAATGGTGCCACAATGATGGCAGCCAAGACCTCATTGCGGTTGCCTTCATGGATCTCAACA
ACGAGGACAACCAGCTCGACCTCCGCATCAGGGGATCTTATTTGGCTGGTGGAATTCCAAGAGAAGCAATAAGAGGAGGAGAAAGAAGATCAAAATCTTCCGAGGATCTA
GTGAACATCTTCAGTGGATTCGATCAGGAGCTTCTTGCAGAGGCTTACAACATTCCATCAGACTTGGTGAGAAGAATGCAGGAAGAAAAGAGCAGCGGATTGATCGTGAA
ATGCGAAGAAGACATGTCGTATTTGACACCGCAGGACGACGAGAAAGAATTGAGTGCACCGACATCTTCGAGAAGGGAACATAACTCGAATGGTAATGCACAACATGAAC
ACTCAAAGAGAAGCAGATATATATACTCAAGGGAGGCAGGCAGAGTGAACATTTTGAACCAACACAAGCTCCCCCTTCTCAGATTCATGGACATGAGTGCTGAAAAAGGC
CATCTTTTCCCTAACGCTCAGTACAACCTGCAGTGGTCAATGACAGAGCACAGATTGGTGTACGTGGTAGAGGGAGAGGCAGAGGTTCAAATCGCCGATGACTACGGCAA
CCAAGTGTTCAACGAGAGAGTTTCGAAAGGGAACATGTTCTTGATTCCGCAATTCTATGCCTCGCTAGCTCAGGCAAGTTCAGAAGGGTTTGAATGGATCACTTTCAAGA
CCTCAAACCAGCCCATGAAGAGCCCTGTGGCTGGCTACATGTCCCTTTTCAGAGCCCTCCCGCTCCAAGTCCTGGAACAATCATTCCAAATGACAGCAGCTGAGGCTAAG
CAGCTCAAGCAGACCAGAATGCAACACACTTTCCTGTTCCCTCCTCCTAGCATGAGCAGCCGCCGCAGCCGCAGGTCCTCTTAA
mRNA sequence
ATGGCCACCAAGCTTGTACTGGCGATTCTGCTGTGTTTCACTGTATCGTCTCTCGTGAGCGCTCAGAGCCACGAGAGCCGTCGGTTAAGGAAACAAGCTCAGCAATGCCG
CTTTGACAGGATTCAGATGAGGCCACCCTCGCGTCGGATCGAGTCGGAGGGAGGTATTACTGAGCTTTGGGATGAATCTGATAAAGAATTTCAGTGTGCTGGAGCTGCTG
CCATTAGAAACATCATAAGGCCCAACTCTCTCTCTTTGCCTAAATTCCACAGCTCCCCCGTGCTCATCTTCATTGAGCAAGGTGAAGGGTACATGGGGTTGAACTTCCCA
GGGTGTGCAGAGGCATACGAGGCAGAATCAGCTCAATCATCAAGAAGCTCTTCGAGGCGTATGGGGCGTCAAATTGGCGCCGGCCAAGAACAAGACCAACACCAAAAGGT
TCGGAGAGTCCGCCGTGGTGACATGATCATCGTCCCTGCCGGTACCGTCCAATGGTGCCACAATGATGGCAGCCAAGACCTCATTGCGGTTGCCTTCATGGATCTCAACA
ACGAGGACAACCAGCTCGACCTCCGCATCAGGGGATCTTATTTGGCTGGTGGAATTCCAAGAGAAGCAATAAGAGGAGGAGAAAGAAGATCAAAATCTTCCGAGGATCTA
GTGAACATCTTCAGTGGATTCGATCAGGAGCTTCTTGCAGAGGCTTACAACATTCCATCAGACTTGGTGAGAAGAATGCAGGAAGAAAAGAGCAGCGGATTGATCGTGAA
ATGCGAAGAAGACATGTCGTATTTGACACCGCAGGACGACGAGAAAGAATTGAGTGCACCGACATCTTCGAGAAGGGAACATAACTCGAATGGTAATGCACAACATGAAC
ACTCAAAGAGAAGCAGATATATATACTCAAGGGAGGCAGGCAGAGTGAACATTTTGAACCAACACAAGCTCCCCCTTCTCAGATTCATGGACATGAGTGCTGAAAAAGGC
CATCTTTTCCCTAACGCTCAGTACAACCTGCAGTGGTCAATGACAGAGCACAGATTGGTGTACGTGGTAGAGGGAGAGGCAGAGGTTCAAATCGCCGATGACTACGGCAA
CCAAGTGTTCAACGAGAGAGTTTCGAAAGGGAACATGTTCTTGATTCCGCAATTCTATGCCTCGCTAGCTCAGGCAAGTTCAGAAGGGTTTGAATGGATCACTTTCAAGA
CCTCAAACCAGCCCATGAAGAGCCCTGTGGCTGGCTACATGTCCCTTTTCAGAGCCCTCCCGCTCCAAGTCCTGGAACAATCATTCCAAATGACAGCAGCTGAGGCTAAG
CAGCTCAAGCAGACCAGAATGCAACACACTTTCCTGTTCCCTCCTCCTAGCATGAGCAGCCGCCGCAGCCGCAGGTCCTCTTAAAGATGCAGCTTTTGCCACTGATATAG
ATCAGTAGAGAGATAAGAAAAATAAGGCCACAGAAGTTAGCTCGCTAGGTTTGTCTTTTGTTATCATTGTTGGGGGGTTGAATGTGCTTCTTCTTGTAGAATAGAGAAGA
AGGGTTTGGCATTCTGACAATGCAAGAGAGTTTTGAAATTATGGAAAGTGTAATACTATATGTAATAATATATGAAAGCAATAAAAGGTGTGGCTGTTCTTCTTTCCTCA
TTTTGTGTTCATGTGTTAAGAGTAATTATTTTAC
Protein sequence
MATKLVLAILLCFTVSSLVSAQSHESRRLRKQAQQCRFDRIQMRPPSRRIESEGGITELWDESDKEFQCAGAAAIRNIIRPNSLSLPKFHSSPVLIFIEQGEGYMGLNFP
GCAEAYEAESAQSSRSSSRRMGRQIGAGQEQDQHQKVRRVRRGDMIIVPAGTVQWCHNDGSQDLIAVAFMDLNNEDNQLDLRIRGSYLAGGIPREAIRGGERRSKSSEDL
VNIFSGFDQELLAEAYNIPSDLVRRMQEEKSSGLIVKCEEDMSYLTPQDDEKELSAPTSSRREHNSNGNAQHEHSKRSRYIYSREAGRVNILNQHKLPLLRFMDMSAEKG
HLFPNAQYNLQWSMTEHRLVYVVEGEAEVQIADDYGNQVFNERVSKGNMFLIPQFYASLAQASSEGFEWITFKTSNQPMKSPVAGYMSLFRALPLQVLEQSFQMTAAEAK
QLKQTRMQHTFLFPPPSMSSRRSRRSS
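
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming NCBI translation table 1 applies to this nuclear gene; the function name `translate` and the short test fragments (the first 24 and last 15 nucleotides of the CDS) are chosen here for illustration and are not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumed to apply to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as wrapped lines. Ambiguous bases (e.g. N) raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt and last 15 nt of the CDS listed above.
print(translate("ATGGCCACCAAGCTTGTACTGGCG"))
print(translate("CGCAGGTCCTCTTAA"))
```

Passing the full CDS to `translate` and comparing the result with the protein sequence is a quick consistency check on a downloaded record; the two fragments shown correspond to the start (MATKLVLA) and end (RRSS) of the listed protein.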