; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr008966 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr008966
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein LOW PSII ACCUMULATION 1, chloroplastic
Genome locationtig00007332:39940..45627
RNA-Seq ExpressionSgr008966
SyntenySgr008966
Gene Ontology termsGO:0010270 - photosystem II oxygen evolving complex assembly (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat
IPR021883 - Protein LOW PSII ACCUMULATION 1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152258.2 protein LOW PSII ACCUMULATION 1, chloroplastic [Cucumis sativus]1.6e-21788.24Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        MA+ATLPL+H L  LSNP++ T LRPRLP+       SQ+ F +SI+ CSSTSQSPEAN+++AESCVN GLQLFSKGRVKEALVQFEAALN+ P+PMEAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEVQAPFRGVR+FFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLF IPRL+RAIQGGD APDVWETAGNL VN+GGIIV VALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPAN-AATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPG
        AIQKAERFRTELLRRGVLLVPV+WGEGREPQ+EKKGFGAP   AA ALPSIGE+FEKRAQSITAKSKLKAEIRFRAEV+SPAEWESWIR+QQ+SEGVTPG
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPAN-AATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPG

Query:  EDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        EDVYIILRLDGRVRRSGRGMPDW KI+EELPPMEALLSKLE+
Subjt:  EDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

XP_008454363.1 PREDICTED: protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 [Cucumis melo]6.3e-22289.57Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        MA+ATLPL+H L  LSNP++ T LRPRLP+       SQ+ FH+SI+ CSSTSQSPEAN+++AESCVNLGLQLFSKGRVKEALVQFEAALN+ P+PMEAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AA YNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLF IPRL+RAIQGGD APDVWETAGNL VN+GGIIV VALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE
        AIQKAERFRTELLRRGVLLVPV+WGEGREPQ+EKKGFGAPA AATALPSIGE+FEKRAQSITAKSKLKAEIRFRAEV+SPAEWESWIRDQQKSEGVTPGE
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE

Query:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        DVYIILRLDGR+RRSGRGMPDW KI+EELPPMEALLSKLE+
Subjt:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

XP_022143429.1 protein LOW PSII ACCUMULATION 1, chloroplastic isoform X1 [Momordica charantia]6.9e-22992.29Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        MAVATLPLYH LL  SNP++RT+LRPRLP+ST N     KNFH+SI +CSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQF+AALNL P+P+EAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AA YNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVR+FFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLFTIPRL+RAIQGGDEAPDVWETAGNL VN+GGIIVLVALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE
        AIQKAERFRTELLRRGVLLVPVVWGEGREPQ+EK+GFGAP NA   LPSIGE+FEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE

Query:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        DVYIILRLDGRVRRSGRGMPDWPKI+EELPPMEALLSKLER
Subjt:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

XP_022983449.1 protein LOW PSII ACCUMULATION 1, chloroplastic [Cucurbita maxima]3.6e-21787.78Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        + +ATLP++HQLL LSNP++ T LR RLP+S      SQ+ FHVSI+ CSSTSQSPE NVE+AES VNLGLQLFSKGRVKEALVQFEAAL+++P+PMEAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLF +PRL+RAIQGG+EAPDVWET GNL VN+GGI+V VALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNR+VELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANA-ATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPG
        AIQKAERFRTELLRRGVLLVPV+W EGREP+MEKKGFGAPA A + ALPSIGE+FEKRAQSITAKSKLKAEIRFRA+V+SPAEWESWIRDQQKSEGVTPG
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANA-ATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPG

Query:  EDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        EDVYIILRLDGRVRRSGRGMPDW KI+EELPPM+ALLSKLER
Subjt:  EDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

XP_038905239.1 protein LOW PSII ACCUMULATION 1, chloroplastic [Benincasa hispida]3.2e-21889.12Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        MA+ATLPL+H LL  S+P++ T LRPR       LL SQ+ FHVSI+  SSTSQSPEAN+E+AESCVNLGLQLFSKGRVKEALVQFEAALN+ P+PMEAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEVQAPFRGVRRFF VALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLF IPRL+RAIQGGD APDVWETAGNL VN+GGI+V VALFLWDNKKEEEQL+QISR+ETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE
        AIQKAERFRTELLRRGVLLVPV+WGEGREPQ+EKKGFGAPA  A ALPSIGE+FEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSE VTPGE
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE

Query:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        DVYIILRLDGRVRRSGRGMPDW KI+EELPPMEALLSKLER
Subjt:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

TrEMBL top hitse value%identityAlignment
A0A0A0KT96 TPR_REGION domain-containing protein7.7e-21888.24Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        MA+ATLPL+H L  LSNP++ T LRPRLP+       SQ+ F +SI+ CSSTSQSPEAN+++AESCVN GLQLFSKGRVKEALVQFEAALN+ P+PMEAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEVQAPFRGVR+FFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLF IPRL+RAIQGGD APDVWETAGNL VN+GGIIV VALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPAN-AATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPG
        AIQKAERFRTELLRRGVLLVPV+WGEGREPQ+EKKGFGAP   AA ALPSIGE+FEKRAQSITAKSKLKAEIRFRAEV+SPAEWESWIR+QQ+SEGVTPG
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPAN-AATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPG

Query:  EDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        EDVYIILRLDGRVRRSGRGMPDW KI+EELPPMEALLSKLE+
Subjt:  EDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

A0A1S3BYE8 protein LOW PSII ACCUMULATION 1, chloroplastic isoform X13.0e-22289.57Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        MA+ATLPL+H L  LSNP++ T LRPRLP+       SQ+ FH+SI+ CSSTSQSPEAN+++AESCVNLGLQLFSKGRVKEALVQFEAALN+ P+PMEAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AA YNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLF IPRL+RAIQGGD APDVWETAGNL VN+GGIIV VALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE
        AIQKAERFRTELLRRGVLLVPV+WGEGREPQ+EKKGFGAPA AATALPSIGE+FEKRAQSITAKSKLKAEIRFRAEV+SPAEWESWIRDQQKSEGVTPGE
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE

Query:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        DVYIILRLDGR+RRSGRGMPDW KI+EELPPMEALLSKLE+
Subjt:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

A0A5A7TR76 Protein LOW PSII ACCUMULATION 13.0e-22289.57Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        MA+ATLPL+H L  LSNP++ T LRPRLP+       SQ+ FH+SI+ CSSTSQSPEAN+++AESCVNLGLQLFSKGRVKEALVQFEAALN+ P+PMEAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AA YNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLF IPRL+RAIQGGD APDVWETAGNL VN+GGIIV VALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE
        AIQKAERFRTELLRRGVLLVPV+WGEGREPQ+EKKGFGAPA AATALPSIGE+FEKRAQSITAKSKLKAEIRFRAEV+SPAEWESWIRDQQKSEGVTPGE
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE

Query:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        DVYIILRLDGR+RRSGRGMPDW KI+EELPPMEALLSKLE+
Subjt:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

A0A6J1CPA1 protein LOW PSII ACCUMULATION 1, chloroplastic isoform X13.3e-22992.29Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        MAVATLPLYH LL  SNP++RT+LRPRLP+ST N     KNFH+SI +CSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQF+AALNL P+P+EAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AA YNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVR+FFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLFTIPRL+RAIQGGDEAPDVWETAGNL VN+GGIIVLVALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE
        AIQKAERFRTELLRRGVLLVPVVWGEGREPQ+EK+GFGAP NA   LPSIGE+FEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGE

Query:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        DVYIILRLDGRVRRSGRGMPDWPKI+EELPPMEALLSKLER
Subjt:  DVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

A0A6J1J7F4 protein LOW PSII ACCUMULATION 1, chloroplastic1.7e-21787.78Show/hide
Query:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ
        + +ATLP++HQLL LSNP++ T LR RLP+S      SQ+ FHVSI+ CSSTSQSPE NVE+AES VNLGLQLFSKGRVKEALVQFEAAL+++P+PMEAQ
Subjt:  MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQ

Query:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
        AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEAR+GGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG
Subjt:  AALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAG

Query:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS
        ISLLF +PRL+RAIQGG+EAPDVWET GNL VN+GGI+V VALFLWDNKKEEEQLAQISR+ETLSRLPLRLSTNR+VELVQLRDTVRPVILAGKKETVSS
Subjt:  ISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSS

Query:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANA-ATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPG
        AIQKAERFRTELLRRGVLLVPV+W EGREP+MEKKGFGAPA A + ALPSIGE+FEKRAQSITAKSKLKAEIRFRA+V+SPAEWESWIRDQQKSEGVTPG
Subjt:  AIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANA-ATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPG

Query:  EDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        EDVYIILRLDGRVRRSGRGMPDW KI+EELPPM+ALLSKLER
Subjt:  EDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

SwissProt top hitse value%identityAlignment
Q94BS2 Protein MET1, chloroplastic5.7e-0835.96Show/hide
Query:  GLQLFSKGRVKEALVQFEAALNLHPSPMEAQAALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEE
        GLQ    G+ +EAL +FE+ L   P+P EA  A YN ACC++   + +     L  AL+     F  I +DPDL + R   +F  L ++
Subjt:  GLQLFSKGRVKEALVQFEAALNLHPSPMEAQAALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEE

Q9SRY4 Protein LOW PSII ACCUMULATION 1, chloroplastic4.4e-17870.86Show/hide
Query:  MAVATLPLY--HQLLNLSNPRART-SLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVE---------TAESCVNLGLQLFSKGRVKEALVQFEA
        MAVAT P    H    +SN  +R    RP LP     L  S++N+   +   +S+S SP ++           TAE CVN GL LF +GRVK+ALVQFE 
Subjt:  MAVATLPLY--HQLLNLSNPRART-SLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVE---------TAESCVNLGLQLFSKGRVKEALVQFEA

Query:  ALNLHPSPMEAQAALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVR
        AL+L P+P+E+QAA YNKACCHAYRGEGKKA DCLR+ALR+YNLKF TILNDPDLASFRALPEFKELQEEARLGGEDIG  FRRDLKLISEV+APFRGVR
Subjt:  ALNLHPSPMEAQAALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVR

Query:  RFFYVALSAAAGISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRP
        +FFY A +AAAGIS+ FT+PRL +AI+GGD AP++ ET GN  +NIGGI+V+V+LFLW+NKKEEEQ+ QI+RDETLSRLPLRLSTNR+VELVQLRDTVRP
Subjt:  RFFYVALSAAAGISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRP

Query:  VILAGKKETVSSAIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIR
        VILAGKKETV+ A+QKA+RFRTELLRRGVLLVPVVWGE + P++EKKGFGA + AAT+LPSIGE+F+ RAQS+ A+SKLK EIRF+AE VSP EWE WIR
Subjt:  VILAGKKETVSSAIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIR

Query:  DQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        DQQ SEGV PG+DVYIILRLDGRVRRSGRGMPDW +I +ELPPM+ +LSKLER
Subjt:  DQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

Arabidopsis top hitse value%identityAlignment
AT1G02910.1 tetratricopeptide repeat (TPR)-containing protein3.1e-17970.86Show/hide
Query:  MAVATLPLY--HQLLNLSNPRART-SLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVE---------TAESCVNLGLQLFSKGRVKEALVQFEA
        MAVAT P    H    +SN  +R    RP LP     L  S++N+   +   +S+S SP ++           TAE CVN GL LF +GRVK+ALVQFE 
Subjt:  MAVATLPLY--HQLLNLSNPRART-SLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVE---------TAESCVNLGLQLFSKGRVKEALVQFEA

Query:  ALNLHPSPMEAQAALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVR
        AL+L P+P+E+QAA YNKACCHAYRGEGKKA DCLR+ALR+YNLKF TILNDPDLASFRALPEFKELQEEARLGGEDIG  FRRDLKLISEV+APFRGVR
Subjt:  ALNLHPSPMEAQAALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVR

Query:  RFFYVALSAAAGISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRP
        +FFY A +AAAGIS+ FT+PRL +AI+GGD AP++ ET GN  +NIGGI+V+V+LFLW+NKKEEEQ+ QI+RDETLSRLPLRLSTNR+VELVQLRDTVRP
Subjt:  RFFYVALSAAAGISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRP

Query:  VILAGKKETVSSAIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIR
        VILAGKKETV+ A+QKA+RFRTELLRRGVLLVPVVWGE + P++EKKGFGA + AAT+LPSIGE+F+ RAQS+ A+SKLK EIRF+AE VSP EWE WIR
Subjt:  VILAGKKETVSSAIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIR

Query:  DQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER
        DQQ SEGV PG+DVYIILRLDGRVRRSGRGMPDW +I +ELPPM+ +LSKLER
Subjt:  DQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLER

AT1G55480.1 protein containing PDZ domain, a K-box domain, and a TPR region4.0e-0935.96Show/hide
Query:  GLQLFSKGRVKEALVQFEAALNLHPSPMEAQAALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEE
        GLQ    G+ +EAL +FE+ L   P+P EA  A YN ACC++   + +     L  AL+     F  I +DPDL + R   +F  L ++
Subjt:  GLQLFSKGRVKEALVQFEAALNLHPSPMEAQAALYNKACCHAYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEE

AT4G28740.1 FUNCTIONS IN: molecular_function unknown3.4e-3231.34Show/hide
Query:  DLKLISEVQAPFRGVRRFFYVALSAAAGISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRL-
        D ++ SEV +PFR VR FFY+A  A+  +  L    RL  A+     + +V E    L V+IG   +   L+  +NK +  Q+A++SR+E L +L +R+ 
Subjt:  DLKLISEVQAPFRGVRRFFYVALSAAAGISLLFTIPRLYRAIQGGDEAPDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRL-

Query:  STNRIVELVQLRDTVRPVILAGKKETVSSAIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEI
          N+++ +  LR   R VI AG  E +  A ++++ +   L+ RGV++V     +G  P +E                  EE  +R + +          
Subjt:  STNRIVELVQLRDTVRPVILAGKKETVSSAIQKAERFRTELLRRGVLLVPVVWGEGREPQMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEI

Query:  RFRAEVVSPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKL
         +R   V   EWE W+ +Q+K   V+    VY+ LRLDGRVR SG G P W   + +LPP++ + + L
Subjt:  RFRAEVVSPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGTTGCTACTCTTCCTCTGTACCACCAGCTACTCAACCTCTCGAACCCCAGAGCAAGAACCAGCCTGAGACCGCGGCTACCGAGTTCCACGCCCAACCTCTTGTG
TTCCCAAAAGAATTTCCATGTCTCTATCGTATATTGCTCTTCTACTTCTCAGTCCCCAGAAGCTAACGTCGAAACGGCAGAGTCCTGTGTCAATCTGGGTCTCCAGCTCT
TCTCTAAAGGACGGGTCAAAGAAGCTTTAGTCCAATTTGAAGCAGCACTTAATTTGCATCCCAGCCCAATGGAGGCCCAAGCTGCTTTGTACAATAAAGCATGCTGTCAT
GCCTATCGTGGGGAAGGAAAGAAAGCTGCTGATTGTCTGCGTGTTGCATTAAGAGAATATAACCTCAAATTTGGCACAATTTTGAATGATCCTGACTTGGCCTCATTCAG
AGCTCTTCCTGAATTCAAGGAATTGCAAGAAGAGGCTAGGCTGGGTGGAGAGGATATTGGATATGGCTTTCGAAGAGATCTTAAACTCATTAGTGAAGTCCAAGCACCTT
TTCGTGGGGTTAGGAGGTTCTTTTATGTGGCACTATCTGCAGCAGCTGGAATTTCATTGTTGTTTACTATACCCAGATTATATCGTGCTATTCAAGGCGGTGATGAAGCT
CCCGATGTTTGGGAAACTGCTGGAAATTTGACTGTTAATATTGGAGGTATTATTGTTCTCGTGGCATTATTCTTATGGGACAACAAGAAAGAAGAGGAACAGCTTGCACA
AATATCAAGAGATGAAACACTATCAAGGTTGCCTCTACGTCTTTCCACCAATCGGATTGTTGAACTTGTACAGCTTCGAGATACTGTAAGACCGGTCATTTTAGCTGGGA
AAAAGGAGACGGTTTCTTCAGCCATCCAGAAGGCAGAAAGGTTCAGAACTGAGCTTCTTAGACGAGGCGTTCTCTTAGTTCCTGTCGTATGGGGTGAAGGTAGGGAACCC
CAAATGGAAAAGAAAGGGTTTGGTGCTCCAGCCAATGCAGCTACTGCTCTGCCGTCTATTGGGGAAGAGTTTGAGAAACGAGCTCAGTCCATAACTGCAAAATCGAAGTT
GAAAGCTGAAATTCGATTCAGGGCCGAGGTTGTATCTCCTGCAGAATGGGAAAGTTGGATAAGGGACCAGCAGAAGTCTGAAGGGGTCACCCCTGGTGAGGATGTCTACA
TTATATTGCGACTGGATGGTCGAGTTCGAAGATCAGGGAGAGGAATGCCTGACTGGCCAAAAATTCTTGAAGAGCTGCCACCAATGGAAGCTCTTCTAAGCAAGCTAGAA
AGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGTTGCTACTCTTCCTCTGTACCACCAGCTACTCAACCTCTCGAACCCCAGAGCAAGAACCAGCCTGAGACCGCGGCTACCGAGTTCCACGCCCAACCTCTTGTG
TTCCCAAAAGAATTTCCATGTCTCTATCGTATATTGCTCTTCTACTTCTCAGTCCCCAGAAGCTAACGTCGAAACGGCAGAGTCCTGTGTCAATCTGGGTCTCCAGCTCT
TCTCTAAAGGACGGGTCAAAGAAGCTTTAGTCCAATTTGAAGCAGCACTTAATTTGCATCCCAGCCCAATGGAGGCCCAAGCTGCTTTGTACAATAAAGCATGCTGTCAT
GCCTATCGTGGGGAAGGAAAGAAAGCTGCTGATTGTCTGCGTGTTGCATTAAGAGAATATAACCTCAAATTTGGCACAATTTTGAATGATCCTGACTTGGCCTCATTCAG
AGCTCTTCCTGAATTCAAGGAATTGCAAGAAGAGGCTAGGCTGGGTGGAGAGGATATTGGATATGGCTTTCGAAGAGATCTTAAACTCATTAGTGAAGTCCAAGCACCTT
TTCGTGGGGTTAGGAGGTTCTTTTATGTGGCACTATCTGCAGCAGCTGGAATTTCATTGTTGTTTACTATACCCAGATTATATCGTGCTATTCAAGGCGGTGATGAAGCT
CCCGATGTTTGGGAAACTGCTGGAAATTTGACTGTTAATATTGGAGGTATTATTGTTCTCGTGGCATTATTCTTATGGGACAACAAGAAAGAAGAGGAACAGCTTGCACA
AATATCAAGAGATGAAACACTATCAAGGTTGCCTCTACGTCTTTCCACCAATCGGATTGTTGAACTTGTACAGCTTCGAGATACTGTAAGACCGGTCATTTTAGCTGGGA
AAAAGGAGACGGTTTCTTCAGCCATCCAGAAGGCAGAAAGGTTCAGAACTGAGCTTCTTAGACGAGGCGTTCTCTTAGTTCCTGTCGTATGGGGTGAAGGTAGGGAACCC
CAAATGGAAAAGAAAGGGTTTGGTGCTCCAGCCAATGCAGCTACTGCTCTGCCGTCTATTGGGGAAGAGTTTGAGAAACGAGCTCAGTCCATAACTGCAAAATCGAAGTT
GAAAGCTGAAATTCGATTCAGGGCCGAGGTTGTATCTCCTGCAGAATGGGAAAGTTGGATAAGGGACCAGCAGAAGTCTGAAGGGGTCACCCCTGGTGAGGATGTCTACA
TTATATTGCGACTGGATGGTCGAGTTCGAAGATCAGGGAGAGGAATGCCTGACTGGCCAAAAATTCTTGAAGAGCTGCCACCAATGGAAGCTCTTCTAAGCAAGCTAGAA
AGATGA
Protein sequenceShow/hide protein sequence
MAVATLPLYHQLLNLSNPRARTSLRPRLPSSTPNLLCSQKNFHVSIVYCSSTSQSPEANVETAESCVNLGLQLFSKGRVKEALVQFEAALNLHPSPMEAQAALYNKACCH
AYRGEGKKAADCLRVALREYNLKFGTILNDPDLASFRALPEFKELQEEARLGGEDIGYGFRRDLKLISEVQAPFRGVRRFFYVALSAAAGISLLFTIPRLYRAIQGGDEA
PDVWETAGNLTVNIGGIIVLVALFLWDNKKEEEQLAQISRDETLSRLPLRLSTNRIVELVQLRDTVRPVILAGKKETVSSAIQKAERFRTELLRRGVLLVPVVWGEGREP
QMEKKGFGAPANAATALPSIGEEFEKRAQSITAKSKLKAEIRFRAEVVSPAEWESWIRDQQKSEGVTPGEDVYIILRLDGRVRRSGRGMPDWPKILEELPPMEALLSKLE
R