; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011657 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011657
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPeroxisomal membrane protein 2
Genome locationChr01:8865664..8868601
RNA-Seq ExpressionHG10011657
SyntenyHG10011657
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007248 - Mpv17/PMP22


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046854.1 Peroxisomal membrane protein 2 [Cucumis melo var. makuwa]7.1e-20695.2Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHATAPQ FTSFTTAKRAPAPSHPR A+NFH PKLPERS FSRNGRKSNWALNSAVEEFDVIPV+S+D TDQQEG+ VGR ERDG EGELGSAVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSSASVGDG GTESGEMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NELK
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
        ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ

XP_004149288.2 uncharacterized protein LOC101205134 [Cucumis sativus]1.5e-20093.07Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHAT P  FTSFTTAKRAPAPS PRA +NFHAPKLPERSIFS  GR SNWALNSAVEEFDVIPVQS+DFTDQQEGV +GR ERDG EGE+G+AVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSSASV DG GTE+GEMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NELK
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
        ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATD SSD LPTDSTQ
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ

XP_008452276.1 PREDICTED: uncharacterized protein LOC103493348 isoform X1 [Cucumis melo]1.5e-20894.99Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHATAPQ FTSFTTAKRAPAPSHPR A+NFH PKLP+RSIFSRNGRKSNWALNSAVEEFDVIP++S+D TDQQEG+ VGR ERDG EGELGSAVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSSASVGDG GTESGEMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NELK
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVFWS
        ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVFWS
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVFWS

XP_008452277.1 PREDICTED: uncharacterized protein LOC103493348 isoform X2 [Cucumis melo]1.2e-20594.93Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHATAPQ FTSFTTAKRAPAPSHPR A+NFH PKLP+RSIFSRNGRKSNWALNSAVEEFDVIP++S+D TDQQEG+ VGR ERDG EGELGSAVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSSASVGDG GTESGEMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NELK
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
        ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ

XP_038905304.1 uncharacterized protein LOC120091374 [Benincasa hispida]9.2e-20695.76Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPK PERSIFSRNG++SNWALNSAVEE DVIPVQS DFTDQQEGV VGRVERDGVEGELGSAVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGD--GDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMV
        GELSLGGAGEIQGFSSSASVGD  G GTE+ EMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMV
Subjt:  GELSLGGAGEIQGFSSSASVGD--GDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMV

Query:  ISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNE
        ISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NE
Subjt:  ISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNE

Query:  LKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
        LKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
Subjt:  LKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ

TrEMBL top hitse value%identityAlignment
A0A0A0L5A9 Uncharacterized protein7.4e-20193.07Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHAT P  FTSFTTAKRAPAPS PRA +NFHAPKLPERSIFS  GR SNWALNSAVEEFDVIPVQS+DFTDQQEGV +GR ERDG EGE+G+AVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSSASV DG GTE+GEMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NELK
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
        ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATD SSD LPTDSTQ
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ

A0A1S3BUA2 uncharacterized protein LOC103493348 isoform X17.4e-20994.99Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHATAPQ FTSFTTAKRAPAPSHPR A+NFH PKLP+RSIFSRNGRKSNWALNSAVEEFDVIP++S+D TDQQEG+ VGR ERDG EGELGSAVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSSASVGDG GTESGEMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NELK
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVFWS
        ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVFWS
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVFWS

A0A1S3BUL8 uncharacterized protein LOC103493348 isoform X25.8e-20694.93Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHATAPQ FTSFTTAKRAPAPSHPR A+NFH PKLP+RSIFSRNGRKSNWALNSAVEEFDVIP++S+D TDQQEG+ VGR ERDG EGELGSAVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSSASVGDG GTESGEMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NELK
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
        ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ

A0A5D3BU84 Peroxisomal membrane protein 23.4e-20695.2Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHATAPQ FTSFTTAKRAPAPSHPR A+NFH PKLPERS FSRNGRKSNWALNSAVEEFDVIPV+S+D TDQQEG+ VGR ERDG EGELGSAVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSSASVGDG GTESGEMERVMIDRIINATIVLAAGS+ALTKLLTIDQDYWHGWT+YEILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NELK
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
        ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ

A0A6J1EMY0 uncharacterized protein LOC111436049 isoform X11.2e-19089.6Show/hide
Query:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF
        MASVHA APQRFTSF T KRA     PRAA NF+A KLP+RSIFSRNGRK +WALNSAVEE DVIPVQSADFTDQQEGV V RVER+ VEGE+GSAVGGF
Subjt:  MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGF

Query:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS
        GELSLGGAGEIQGFSSS +VGDG  TESGE E+V+IDRIINATIVLAAGS+A+TKLLTIDQDYWHGWT++EILRYAPQHNWSAYEEALKTHPVLAKMVIS
Subjt:  GELSLGGAGEIQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVIS

Query:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK
        GVVYSLGDWIAQC EGKPLFEFDRTRMFRSGLVGF+LHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLG LRLESPVSI+NEL+
Subjt:  GVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELK

Query:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ
        ATFWPMLTAGWKLWPFAHLITYGV+PVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDS SDSL TDSTQ
Subjt:  ATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQ

SwissProt top hitse value%identityAlignment
Q06563 Protein SYM12.3e-1327.84Show/hide
Query:  YEEALKTHPVLAKMVISGVVYSLGDWIAQCF--EGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLF-----PFQDWWVVPAKVAFDQTAWSAVW
        YE +LK  P     +++G ++ +GD  AQ      K    +D  R  R+ + G  +   +   +Y            P   W  +  +VA DQ A++ + 
Subjt:  YEEALKTHPVLAKMVISGVVYSLGDWIAQCF--EGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLF-----PFQDWWVVPAKVAFDQTAWSAVW

Query:  NSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEK
           YF  + I+   S      ++K  +WP L   W +WP    I + V+P++ RLL V+ V + W T LS Y N K
Subjt:  NSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEK

Q2KIY1 Peroxisomal membrane protein 29.8e-1727.84Show/hide
Query:  PQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFE-----FDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTA
        P+   S Y   L+ +PVL K   SG++ +LG+++AQ  E K   E      D +   R  + GF   G L H++Y   E   P +       ++  D+  
Subjt:  PQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFE-----FDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTA

Query:  WSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILST
        ++  + S++F+V+  L  +   +   ++K+ FWP L   W++W     I    IPV+ R+L+ + V L W   L++
Subjt:  WSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILST

Q6CAW5 Protein SYM12.4e-1528.41Show/hide
Query:  NWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQ-CFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNS
        NW  Y   L+ +P    +  +  ++ +GD ++Q  F  KP   ++  R  R+G+       +++ ++    +   P      V AKVA DQ  ++     
Subjt:  NWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQ-CFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNS

Query:  IYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSE
         YF V+G+L  +SP +I+  LK  +W  L  GW +WP   L  +G++P   R+L  +C  L+W T L+  +  K E
Subjt:  IYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSE

Q754F0 Protein SYM11.3e-1628.98Show/hide
Query:  YEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCE----GLFPFQDWWVVPAKVAFDQTAWSAVWNSI
        Y+ +L++HP     + +G ++ LGD +AQ    +P   +D  R  R  L G  L   +   +Y F      G  P   W  V A+VA DQ  ++ +   +
Subjt:  YEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCE----GLFPFQDWWVVPAKVAFDQTAWSAVWNSI

Query:  YFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEA
        Y+  + ++   S   +   L   +W  L A W +WP   L  + ++PV+ RLL V+ + + W T LS YSN  + +
Subjt:  YFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEA

Q7SCY7 Protein sym-13.3e-1225.6Show/hide
Query:  SAYEEALKTHPVLAKMVISGVVYSLGDWIA-QCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEG--LFPFQDWWVVPAKVAFDQTAWSAVWNS
        S Y+  L   P+L + V + +++ +GD  A Q  + + L   D TR  R  L G  + G  +  ++ F +   + P      + A+VA DQ  ++  +  
Subjt:  SAYEEALKTHPVLAKMVISGVVYSLGDWIA-QCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEG--LFPFQDWWVVPAKVAFDQTAWSAVWNS

Query:  IYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILS
        I+   LG + +     +  +L+  +W  L+  W +WPF  ++ + V+P++ R+L+V+ + + W   LS
Subjt:  IYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILS

Arabidopsis top hitse value%identityAlignment
AT1G52870.1 Peroxisomal membrane 22 kDa (Mpv17/PMP22) family protein2.1e-9964Show/hide
Query:  PRAALNFHAPKL-PERS-IFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGFGELSLGGAGEIQGF----SSSASV
        PR+ L    P L P RS I  RN +++  +     +E D+IPVQS D TD +EG  V       V+G   S V GF   +  G   ++GF    SS A +
Subjt:  PRAALNFHAPKL-PERS-IFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGFGELSLGGAGEIQGF----SSSASV

Query:  GDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLF
        GD    E+ EME+ MIDR INATIVLAAGS+A+TKLLTID DYWHGWT++EILRYAPQHNW AYEEALK +PVLAKMVISGVVYS+GDWIAQC+EGKPLF
Subjt:  GDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLF

Query:  EFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLI
        E DR R  RSGLVGFTLHGSLSH+YY FCE LFPFQDWWVVP KVAFDQT WSA+WNSIYF VLG LR ESP+SI+ ELKATF PMLT G     F HL+
Subjt:  EFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLI

AT1G52870.2 Peroxisomal membrane 22 kDa (Mpv17/PMP22) family protein2.3e-13068.66Show/hide
Query:  PRAALNFHAPKL-PERS-IFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGFGELSLGGAGEIQGF----SSSASV
        PR+ L    P L P RS I  RN +++  +     +E D+IPVQS D TD +EG  V       V+G   S V GF   +  G   ++GF    SS A +
Subjt:  PRAALNFHAPKL-PERS-IFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGFGELSLGGAGEIQGF----SSSASV

Query:  GDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLF
        GD    E+ EME+ MIDR INATIVLAAGS+A+TKLLTID DYWHGWT++EILRYAPQHNW AYEEALK +PVLAKMVISGVVYS+GDWIAQC+EGKPLF
Subjt:  GDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLF

Query:  EFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLI
        E DR R  RSGLVGFTLHGSLSH+YY FCE LFPFQDWWVVP KVAFDQT WSA+WNSIYF VLG LR ESP+SI+ ELKATF PMLTAGWKLWPFAHLI
Subjt:  EFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLI

Query:  TYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPT
        TYG++PVEQRLLWVDCVELIWVTILSTYSNEKSEARISE   ++SS S  T
Subjt:  TYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPT

AT2G14860.1 Peroxisomal membrane 22 kDa (Mpv17/PMP22) family protein4.2e-1528.57Show/hide
Query:  YEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVV
        Y   +K+HPV+ K V S ++Y   D  +Q         +D  R  R G  G  + G   HY+++F   LFP QD      K+A  QT +  +   I+F +
Subjt:  YEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVV

Query:  LGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARIS
           L+ E    I   LK    P L  G   WP    IT+   PV  + L  +    +W   ++  +N +    IS
Subjt:  LGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWVDCVELIWVTILSTYSNEKSEARIS

AT4G03410.1 Peroxisomal membrane 22 kDa (Mpv17/PMP22) family protein5.7e-11378.63Show/hide
Query:  MIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVG
        ++ R INA IVLAAG+ A+TKLLTID DYW GWT+YEILRYAP+HNW AYE+ LKT+PVLAKM ISG+VYSLGDWIAQC+EGKPLFEFDRTR+ RSGLVG
Subjt:  MIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVG

Query:  FTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWV
        FTLHGSLSHYYY FCE LFPFQ+WWVVPAKVAFDQT WSA+WNSIYF VLG+LR +SP  I++E+K TF PMLTAGWKLWP AHL+TYGVIPV+QRLLWV
Subjt:  FTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWV

Query:  DCVELIWVTILSTYSNEKSEARISEVATDSSSDS
        DC+ELIWVTILSTYSNEK+EA+ SE    SS  S
Subjt:  DCVELIWVTILSTYSNEKSEARISEVATDSSSDS

AT4G03410.2 Peroxisomal membrane 22 kDa (Mpv17/PMP22) family protein7.5e-11377.78Show/hide
Query:  MIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVG
        ++ R INA IVLAAG+ A+TKLLTID DYW GWT+YEILRYAP+HNW AYE+ LKT+PVLAKM ISG+VYSLGDWIAQC+EGKPLFEFDRTR+ RSGLVG
Subjt:  MIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLFEFDRTRMFRSGLVG

Query:  FTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWV
        FTLHGSLSHYYY FCE LFPFQ+WWVVPAKVAFDQT WSA+WNSIYF VLG+LR +SP  I++E+K TF PMLTAGWKLWP AHL+TYGVIPV+QRLLWV
Subjt:  FTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQRLLWV

Query:  DCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVF
        DC+ELIWVTILSTYSNEK+EA+ SE  T+SSS S  ++  QVF
Subjt:  DCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGGTCCACGCCACAGCTCCTCAGAGGTTCACTTCCTTCACCACAGCAAAACGAGCTCCCGCGCCGTCTCATCCGCGCGCCGCCCTCAATTTCCACGCGCCGAA
GCTTCCGGAGCGTTCGATTTTCTCGAGAAACGGGCGGAAGTCCAATTGGGCGTTGAACTCGGCGGTGGAAGAGTTCGATGTTATCCCGGTGCAGAGCGCTGATTTCACCG
ACCAGCAGGAAGGCGTGACGGTGGGTCGGGTGGAGAGGGATGGCGTGGAGGGAGAGCTGGGGAGTGCGGTAGGTGGATTCGGCGAGCTCTCGTTGGGAGGAGCTGGTGAA
ATTCAGGGGTTTTCTTCTTCTGCTTCTGTTGGCGATGGCGATGGAACAGAGAGCGGGGAAATGGAGAGGGTTATGATCGATCGGATTATAAATGCGACCATTGTTCTTGC
GGCGGGTTCTTTTGCTCTCACGAAGTTGCTTACCATCGATCAAGATTATTGGCATGGATGGACAATTTATGAAATACTGAGATATGCCCCTCAACACAACTGGAGTGCTT
ATGAGGAAGCTCTTAAGACGCACCCGGTCCTTGCTAAAATGGTGATTAGTGGAGTGGTGTACTCTCTTGGAGATTGGATTGCACAGTGTTTTGAAGGAAAACCTCTATTT
GAATTTGATCGCACACGCATGTTCAGATCAGGCCTTGTTGGCTTTACTCTACATGGTTCCCTTTCCCACTACTACTATCATTTTTGTGAGGGTCTTTTTCCTTTTCAAGA
TTGGTGGGTAGTTCCTGCAAAAGTAGCATTTGATCAAACGGCATGGTCAGCAGTTTGGAACAGTATTTATTTTGTGGTATTAGGAATCCTGCGGCTTGAGTCCCCAGTAT
CTATATATAATGAACTAAAGGCGACATTTTGGCCTATGCTTACCGCGGGTTGGAAACTTTGGCCGTTTGCTCATCTTATCACATACGGTGTTATTCCAGTAGAACAAAGA
CTCTTGTGGGTTGATTGTGTGGAGCTTATCTGGGTGACCATACTCTCAACTTATTCAAACGAGAAATCGGAGGCCAGAATCTCTGAGGTAGCAACAGATTCCAGTTCAGA
TTCTCTTCCCACAGATTCAACTCAGGTATTTTGGTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCGGTCCACGCCACAGCTCCTCAGAGGTTCACTTCCTTCACCACAGCAAAACGAGCTCCCGCGCCGTCTCATCCGCGCGCCGCCCTCAATTTCCACGCGCCGAA
GCTTCCGGAGCGTTCGATTTTCTCGAGAAACGGGCGGAAGTCCAATTGGGCGTTGAACTCGGCGGTGGAAGAGTTCGATGTTATCCCGGTGCAGAGCGCTGATTTCACCG
ACCAGCAGGAAGGCGTGACGGTGGGTCGGGTGGAGAGGGATGGCGTGGAGGGAGAGCTGGGGAGTGCGGTAGGTGGATTCGGCGAGCTCTCGTTGGGAGGAGCTGGTGAA
ATTCAGGGGTTTTCTTCTTCTGCTTCTGTTGGCGATGGCGATGGAACAGAGAGCGGGGAAATGGAGAGGGTTATGATCGATCGGATTATAAATGCGACCATTGTTCTTGC
GGCGGGTTCTTTTGCTCTCACGAAGTTGCTTACCATCGATCAAGATTATTGGCATGGATGGACAATTTATGAAATACTGAGATATGCCCCTCAACACAACTGGAGTGCTT
ATGAGGAAGCTCTTAAGACGCACCCGGTCCTTGCTAAAATGGTGATTAGTGGAGTGGTGTACTCTCTTGGAGATTGGATTGCACAGTGTTTTGAAGGAAAACCTCTATTT
GAATTTGATCGCACACGCATGTTCAGATCAGGCCTTGTTGGCTTTACTCTACATGGTTCCCTTTCCCACTACTACTATCATTTTTGTGAGGGTCTTTTTCCTTTTCAAGA
TTGGTGGGTAGTTCCTGCAAAAGTAGCATTTGATCAAACGGCATGGTCAGCAGTTTGGAACAGTATTTATTTTGTGGTATTAGGAATCCTGCGGCTTGAGTCCCCAGTAT
CTATATATAATGAACTAAAGGCGACATTTTGGCCTATGCTTACCGCGGGTTGGAAACTTTGGCCGTTTGCTCATCTTATCACATACGGTGTTATTCCAGTAGAACAAAGA
CTCTTGTGGGTTGATTGTGTGGAGCTTATCTGGGTGACCATACTCTCAACTTATTCAAACGAGAAATCGGAGGCCAGAATCTCTGAGGTAGCAACAGATTCCAGTTCAGA
TTCTCTTCCCACAGATTCAACTCAGGTATTTTGGTCTTAG
Protein sequenceShow/hide protein sequence
MASVHATAPQRFTSFTTAKRAPAPSHPRAALNFHAPKLPERSIFSRNGRKSNWALNSAVEEFDVIPVQSADFTDQQEGVTVGRVERDGVEGELGSAVGGFGELSLGGAGE
IQGFSSSASVGDGDGTESGEMERVMIDRIINATIVLAAGSFALTKLLTIDQDYWHGWTIYEILRYAPQHNWSAYEEALKTHPVLAKMVISGVVYSLGDWIAQCFEGKPLF
EFDRTRMFRSGLVGFTLHGSLSHYYYHFCEGLFPFQDWWVVPAKVAFDQTAWSAVWNSIYFVVLGILRLESPVSIYNELKATFWPMLTAGWKLWPFAHLITYGVIPVEQR
LLWVDCVELIWVTILSTYSNEKSEARISEVATDSSSDSLPTDSTQVFWS