CuGenDBv2

Lsi09G007550 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi09G007550
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: charged multivesicular body protein 7
Genome location: chr09:8658587..8669040
RNA-Seq Expression: Lsi09G007550
Synteny: Lsi09G007550
Gene Ontology terms:
GO:0006900 - vesicle budding from membrane (biological process)
GO:0032511 - late endosome to vacuole transport via multivesicular body sorting pathway (biological process)
GO:0000815 - ESCRT III complex (cellular component)
GO:0005771 - multivesicular body (cellular component)
GO:0009898 - cytoplasmic side of plasma membrane (cellular component)
InterPro domains: NA
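
The genome location field above uses a `chromosome:start..end` form. The following is a minimal sketch of how such a string could be split and its span computed. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr09:8658587..8669040")
print(chrom, start, end)          # chr09 8658587 8669040
print(span_length(start, end))    # 10454 under the inclusive assumption
```

If the database instead reports half-open or 0-based coordinates, the span would differ by one, so the convention should be checked before relying on the number.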


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579154.1 Charged multivesicular body protein 7, partial [Cucurbita argyrosperma subsp. sororia]; e value 2.5e-198; identity 77.87%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESK   VREFIREK PDWD+E+VATARFKAFSGQKSDWE RYLFWRDLILTI+ QFNF+F+KPSE+KNQWFSRGGL PLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPR GQLSYLFKKLS LMGT KKN D  L DDYIVLACVL+         DRAAEV+KCLS SNWTSSC+ITMVKFQNICGGPDEAT ILSYL  C
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKARYLSKE+KEL+E                      GVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQ YDVS+QSALASLKSGNKKTALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASLLNRVEEVLNA+ADAESTKT                           VSEAIQIGARVMKEHEVSWD LQHSLQE+EASIDIQ
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELS
        KQVAS IDSAPSGSILE+EDIEEEFKKLELEV AGQNLD STS++GVNIATG  VATVSDDSLSAALSNLKLVEETGKET  QKSNSKSKSKIMELS
Subjt:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELS

XP_011654554.1 charged multivesicular body protein 7 [Cucumis sativus]; e value 5.6e-198; identity 77.35%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VREFIREK PDWDDEVVATARFKAFSGQKSDWE RYLFWRDLILT++RQFNF+ IKPSE+KNQWF RGGLTPLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPRSGQLSY+FKKLS LMGT KKNPDSLL DDYIVLACVL+         DRAAEVIKCLSLS+WTSSCIITMVKFQNICGGPDEATVILSYLI C
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKA++LSKEKKELLE                      GVKVSLSATTV GIT+LDYDILHL+WT EKLQQQLDVIDQ YDVSKQSAL SLKSGN+KTALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASL NRVEEVLNA+ADAE TKT                           VSEAIQIGARVMKEHEV+WDQLQ SLQE+EAS+DIQ
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS
        KQVA+AIDS PS SI +DEDIEEEFKKLELE+TAGQ LD STSESGVNIATGETVA V DDSLS ALSNLKLVEET KE  N  S+SK KSKIME+ IS
Subjt:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS

XP_023551290.1 charged multivesicular body protein 7 [Cucurbita pepo subsp. pepo]; e value 3.3e-198; identity 77.87%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESK   VREFIREK PDWD+EVVATARFKAFSGQKSDWE RYLFWRDLILTI+ QFNF+F+KPSE+KNQWFSRGGL PLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPR GQLSYLFKKLS LMGT KKNPDSLL DDYI LA VL+         DRAAEV+KCLS SNWTSSC+ITMVKFQNICGGPDEAT ILSYL  C
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKARYLSKE+KEL+E                      GVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQ YDVS+QSALASLKSGNKKTALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESR+KVASLLNRVEEVLNA+ADAESTKT                           VSEAIQIGARVMKEHEVSWDQLQHSLQE+EASIDIQ
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELS
        KQVAS IDSAPSGSILE+EDI+EEFKKLELEV AGQNLD STS++GVNIATG  VATVSDDSLSAALSNLKLVEETGKET  QKSNSKSK KIMELS
Subjt:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELS

XP_038875993.1 uncharacterized protein LOC120068336 isoform X1 [Benincasa hispida]; e value 8.1e-213; identity 81.64%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESKGSRVREFIREK PDWDDEVVATARFKAFSGQKSDWE RY  WRDLI+TI+R+FNF+FIKPSE+KNQWFSRGGL+PLCLDHVLH+MYIEGDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPRSGQLSYLFKKLS LMGT KKNPDSLL DDY+VLACVL+F+MK FHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIG 
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKARYLSKEKKELLE                      GVK+SL+A TV GITTLDYDILHLIWTTEKLQQQLDVIDQ YDVS+QSALASLKSGNKKTALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASLLNRVEEVLNA+ADAESTKT                           VSEAIQIGARVMKEHEVSWDQLQ+SLQE+E SID+Q
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAI--DSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSI
        KQVASAI  DSAPSGSI EDEDIEEEFKKLELEVTAGQNLD STSES VNIATGETVATVSDD LS ALSNLKLVEETG  TA QKSNSKSKSK+MEL I
Subjt:  KQVASAI--DSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSI

Query:  S
        S
Subjt:  S

XP_038875996.1 uncharacterized protein LOC120068336 isoform X2 [Benincasa hispida]; e value 5.6e-206; identity 80.24%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESKGSRVREFIREK PDWDDEVVATARFKAFSGQKSDWE RY  WRDLI+TI+R+FNF+FIKPSE+KNQWFSRGGL+PLCLDHVLH+MYIEGDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPRSGQLSYLFKKLS LMGT KKNPDSLL DDY+VLACVL+         DRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIG 
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKARYLSKEKKELLE                      GVK+SL+A TV GITTLDYDILHLIWTTEKLQQQLDVIDQ YDVS+QSALASLKSGNKKTALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASLLNRVEEVLNA+ADAESTKT                           VSEAIQIGARVMKEHEVSWDQLQ+SLQE+E SID+Q
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAI--DSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSI
        KQVASAI  DSAPSGSI EDEDIEEEFKKLELEVTAGQNLD STSES VNIATGETVATVSDD LS ALSNLKLVEETG  TA QKSNSKSKSK+MEL I
Subjt:  KQVASAI--DSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSI

Query:  S
        S
Subjt:  S

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMY2 Uncharacterized protein; e value 2.7e-198; identity 77.35%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VREFIREK PDWDDEVVATARFKAFSGQKSDWE RYLFWRDLILT++RQFNF+ IKPSE+KNQWF RGGLTPLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPRSGQLSY+FKKLS LMGT KKNPDSLL DDYIVLACVL+         DRAAEVIKCLSLS+WTSSCIITMVKFQNICGGPDEATVILSYLI C
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKA++LSKEKKELLE                      GVKVSLSATTV GIT+LDYDILHL+WT EKLQQQLDVIDQ YDVSKQSAL SLKSGN+KTALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASL NRVEEVLNA+ADAE TKT                           VSEAIQIGARVMKEHEV+WDQLQ SLQE+EAS+DIQ
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS
        KQVA+AIDS PS SI +DEDIEEEFKKLELE+TAGQ LD STSESGVNIATGETVA V DDSLS ALSNLKLVEET KE  N  S+SK KSKIME+ IS
Subjt:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS

A0A1S3CRA4 charged multivesicular body protein 7 isoform X1; e value 3.9e-189; identity 74.4%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VREFIREK  DWDDEVVATARFKAFSGQKSDWE RYLFWRDLILT++RQ NF+ IKPSE+KNQWFSRGGLTPLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPRSGQLSY+FK+LS LMGT KKNP+SLL DDYI+LACVL+         DRA EVIKCLSLSNWTSS IITMVKFQNICGGPDEATVILSYLI C
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKA++LSK K +LLE                      GVKVS SATTV GITTLDYDILHL+WT EKLQQQLD I+Q YDVSKQSAL SLKSGNKK ALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASL NRVEEVLNA+ DAE TK+                           VSEAIQIGARVMKEHEV+WDQLQHSLQE+E SIDIQ
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAIDSAPSGSILED-EDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS
        KQVA+ IDS PS SI  D EDIEE FKKLELE+TA Q LD STSES VNIATGETV  V DDSLS+ LSNLKLVEE  KE ANQKSNSK  SKIMEL IS
Subjt:  KQVASAIDSAPSGSILED-EDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS

A0A1S3CRC5 charged multivesicular body protein 7 isoform X2; e value 6.3e-187; identity 73.8%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VREFIREK  DWDDEVVATARFKAFSGQKSDWE RYLFWRDLILT++RQ NF+ IKPSE+KNQWFSRGGLTPLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPRSGQLSY+FK+LS LMGT KKNP+SLL DDYI+LACVL+         DRA EVIKCLSLSNWTSS IITMVKFQNICGGPDEATVILSYLI C
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKA++LSK K +LLE                         VS SATTV GITTLDYDILHL+WT EKLQQQLD I+Q YDVSKQSAL SLKSGNKK ALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASL NRVEEVLNA+ DAE TK+                           VSEAIQIGARVMKEHEV+WDQLQHSLQE+E SIDIQ
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAIDSAPSGSILED-EDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS
        KQVA+ IDS PS SI  D EDIEE FKKLELE+TA Q LD STSES VNIATGETV  V DDSLS+ LSNLKLVEE  KE ANQKSNSK  SKIMEL IS
Subjt:  KQVASAIDSAPSGSILED-EDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS

A0A6J1FF90 charged multivesicular body protein 7; e value 6.7e-197; identity 77.26%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESK   VREFIREK PDWD+E+VATARFKAFSGQKSDWE RYLFWRDLILTI+ QFNF+F+KPSE+KNQWFSRGGL PLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPR GQLSYLFKKLS LMGT KKN D LL DDYIVLACVL+         DRAAEV+KCLS SNWTSSC+ITMVKFQNICGGPDEAT  LSYL  C
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKARYLSKE+KEL+E                      GVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQ YDVS+QSALASLKSGNKKTALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASLLNRVEEVLNA+ADAESTKT                           VSEAIQIGAR MKEHEVSWD LQHSLQE+EASIDIQ
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELS
        KQVAS IDSAPSG ILE+EDIEEEFKKLELEV AGQNLD STS++G NIATG  VATVSDDSLSAALSNLKLVEETGKET  QKSNSKSKSKIMELS
Subjt:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELS

A0A6J1K252 charged multivesicular body protein 7; e value 6.7e-197; identity 77.46%
Query:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR
        MEKESK   VREFIREK PDWD+EVVATA FKAFSGQKSDWE RYLFWRDLIL IS QFNF+FIKPSE+KNQWFSRGGL PLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRR

Query:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPR GQLSYLFKKLS +MGT KKNPD LL DDYIVLACVL+         DRAAEV+KCLS SNWTSSC+ITMVKFQNICGGPDEAT ILSYL  C
Subjt:  RDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK
        GKARYLSKE+KEL+E                      GVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQ YDVS+QSALASLKSGNKKTALK
Subjt:  GKARYLSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALK

Query:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ
        HARELKITTESREKVASLLNRVEEVLNA+ADAESTKT                           VSEAIQIGARVMKEHEVSWDQLQHSL E+EASIDIQ
Subjt:  HARELKITTESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQ

Query:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELS
        KQV S IDSAPSGSILE+EDIEEEFKKLELEV AGQNLD +TS++GVNIATG  VATVSDDSLSAALSNLKLV ET KET  QKSNSKSKSKIMELS
Subjt:  KQVASAIDSAPSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G62080.1 SNF7 family protein; e value 3.5e-105; identity 46.14%
Query:  VREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRRRDMLDPRSG
        V+EFIR + PDWDDEVVA ARFKAFSGQ+SDWEL++ FWRDLI+ +SRQF    I P +VK  WF RGG+TPLC+D V+ LM+ EGD++R  D+ DP SG
Subjt:  VREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRRRDMLDPRSG

Query:  QLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        +++ L + +  LM       + +L +  +++  +          K++AA+V+K LS  +WTS+C++T+ KF+N+C G +EA+ +LS+L GCGKA  +S  
Subjt:  QLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALKHARELKITT
        + EL+E                      GVKVS S T + GI+TLD DILHL+ TTEKLQ QL+V+DQ  + SK+SALASLKSG++K AL+HARELK+ T
Subjt:  KKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALKHARELKITT

Query:  ESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQKQVASAIDS
        ESREK  SLLNRVEEVLN +AD+ESTK                            VSEAI+ GARVMK+ ++S D +   L+E+E +I+ QKQV  A++S
Subjt:  ESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQKQVASAIDS

Query:  APSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKL------VEETGKETANQKSNSKSKSKIME
        AP   I +DEDIEEE  +LE        +D+ +  S V  AT +T      DSL+   S LKL      +EE   E A  K + K   KI+E
Subjt:  APSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKL------VEETGKETANQKSNSKSKSKIME

AT3G62080.2 SNF7 family protein; e value 3.5e-105; identity 46.14%
Query:  VREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRRRDMLDPRSG
        V+EFIR + PDWDDEVVA ARFKAFSGQ+SDWEL++ FWRDLI+ +SRQF    I P +VK  WF RGG+TPLC+D V+ LM+ EGD++R  D+ DP SG
Subjt:  VREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEGDIIRRRDMLDPRSG

Query:  QLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        +++ L + +  LM       + +L +  +++  +          K++AA+V+K LS  +WTS+C++T+ KF+N+C G +EA+ +LS+L GCGKA  +S  
Subjt:  QLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALKHARELKITT
        + EL+E                      GVKVS S T + GI+TLD DILHL+ TTEKLQ QL+V+DQ  + SK+SALASLKSG++K AL+HARELK+ T
Subjt:  KKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALKHARELKITT

Query:  ESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQKQVASAIDS
        ESREK  SLLNRVEEVLN +AD+ESTK                            VSEAI+ GARVMK+ ++S D +   L+E+E +I+ QKQV  A++S
Subjt:  ESREKVASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQKQVASAIDS

Query:  APSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKL------VEETGKETANQKSNSKSKSKIME
        AP   I +DEDIEEE  +LE        +D+ +  S V  AT +T      DSL+   S LKL      +EE   E A  K + K   KI+E
Subjt:  APSGSILEDEDIEEEFKKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKL------VEETGKETANQKSNSKSKSKIME


Sequences
CDS sequence
ATGGAAATTGCAGCTCTATTCATAGGTGTGAACATTGCAAACAAAATGGAAAAGGAATCAAAAGGGTCTCGCGTAAGAGAGTTCATTAGGGAAAAATTCCCAGACTGGGA
TGACGAAGTGGTGGCTACAGCTCGGTTCAAGGCATTTAGTGGGCAGAAATCTGATTGGGAACTCAGATACCTATTTTGGAGGGATTTGATCCTCACAATTTCCCGCCAAT
TCAACTTCGTCTTCATTAAACCTTCTGAAGTAAAGAATCAATGGTTTTCTCGAGGAGGGTTGACTCCACTGTGCCTTGACCATGTTCTGCATCTAATGTATATTGAGGGT
GACATTATAAGACGTAGAGACATGCTGGATCCAAGGAGTGGCCAACTCTCCTACTTGTTTAAAAAACTAAGCTATTTGATGGGTACATATAAAAAGAACCCCGACAGTTT
GCTTCATGATGATTATATAGTTCTTGCCTGTGTATTAGAGTTTAATATGAAATGTTTCCATGCCAAGGATAGAGCAGCTGAAGTTATCAAGTGTTTATCTCTTAGTAATT
GGACCTCATCCTGCATTATTACAATGGTGAAGTTCCAAAACATCTGTGGAGGACCTGATGAAGCGACTGTTATCTTGAGTTACTTGATTGGATGTGGTAAGGCAAGGTAT
CTCTCTAAGGAAAAAAAAGAACTTCTAGAGTTTTTCAGTGAGTTATCGCCTTCCGAGCTCTCGTTTGATTTCTCTTTCCCCCCTTGTAATAACCAGGGTGTAAAGGTCTC
TCTTTCAGCAACGACGGTTGCTGGCATCACAACTCTCGATTATGACATTTTGCACTTAATTTGGACAACAGAAAAGCTTCAGCAACAACTTGATGTGATTGACCAGTGCT
ATGATGTGTCGAAACAATCCGCACTGGCTTCTTTGAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTGAAGATCACCACAGAAAGTCGGGAAAAAGTT
GCATCTCTCTTAAACAGAGTGGAGGAAGTCCTAAATGCTGTTGCAGATGCCGAATCAACAAAAACGGTAGGTTACCATCTTCAGTTTTCTTCTTTGATAAGTGGAAGGGG
CTCTGCCCGTACCTTTGCCTCACTAAAAATGACCGGAGTTTCTGAGGCTATTCAAATTGGTGCTCGAGTAATGAAAGAACACGAGGTTAGTTGGGATCAACTCCAGCATA
GTTTGCAGGAAGTAGAAGCAAGTATTGATATACAAAAGCAAGTAGCAAGTGCTATAGATTCAGCTCCATCTGGCTCAATTCTGGAAGATGAAGATATTGAGGAGGAGTTT
AAGAAGCTTGAGTTGGAAGTAACAGCAGGCCAAAACCTCGACGTGTCGACATCAGAATCTGGGGTTAATATTGCAACTGGTGAAACAGTGGCTACAGTTTCCGACGACTC
ATTGAGCGCTGCATTATCAAATCTAAAGCTTGTTGAAGAAACAGGAAAGGAGACAGCGAACCAGAAGTCGAATTCTAAGAGTAAGTCGAAAATTATGGAGCTTAGCATTT
CTTAG
mRNA sequence
ATGGAAATTGCAGCTCTATTCATAGGTGTGAACATTGCAAACAAAATGGAAAAGGAATCAAAAGGGTCTCGCGTAAGAGAGTTCATTAGGGAAAAATTCCCAGACTGGGA
TGACGAAGTGGTGGCTACAGCTCGGTTCAAGGCATTTAGTGGGCAGAAATCTGATTGGGAACTCAGATACCTATTTTGGAGGGATTTGATCCTCACAATTTCCCGCCAAT
TCAACTTCGTCTTCATTAAACCTTCTGAAGTAAAGAATCAATGGTTTTCTCGAGGAGGGTTGACTCCACTGTGCCTTGACCATGTTCTGCATCTAATGTATATTGAGGGT
GACATTATAAGACGTAGAGACATGCTGGATCCAAGGAGTGGCCAACTCTCCTACTTGTTTAAAAAACTAAGCTATTTGATGGGTACATATAAAAAGAACCCCGACAGTTT
GCTTCATGATGATTATATAGTTCTTGCCTGTGTATTAGAGTTTAATATGAAATGTTTCCATGCCAAGGATAGAGCAGCTGAAGTTATCAAGTGTTTATCTCTTAGTAATT
GGACCTCATCCTGCATTATTACAATGGTGAAGTTCCAAAACATCTGTGGAGGACCTGATGAAGCGACTGTTATCTTGAGTTACTTGATTGGATGTGGTAAGGCAAGGTAT
CTCTCTAAGGAAAAAAAAGAACTTCTAGAGTTTTTCAGTGAGTTATCGCCTTCCGAGCTCTCGTTTGATTTCTCTTTCCCCCCTTGTAATAACCAGGGTGTAAAGGTCTC
TCTTTCAGCAACGACGGTTGCTGGCATCACAACTCTCGATTATGACATTTTGCACTTAATTTGGACAACAGAAAAGCTTCAGCAACAACTTGATGTGATTGACCAGTGCT
ATGATGTGTCGAAACAATCCGCACTGGCTTCTTTGAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTGAAGATCACCACAGAAAGTCGGGAAAAAGTT
GCATCTCTCTTAAACAGAGTGGAGGAAGTCCTAAATGCTGTTGCAGATGCCGAATCAACAAAAACGGTAGGTTACCATCTTCAGTTTTCTTCTTTGATAAGTGGAAGGGG
CTCTGCCCGTACCTTTGCCTCACTAAAAATGACCGGAGTTTCTGAGGCTATTCAAATTGGTGCTCGAGTAATGAAAGAACACGAGGTTAGTTGGGATCAACTCCAGCATA
GTTTGCAGGAAGTAGAAGCAAGTATTGATATACAAAAGCAAGTAGCAAGTGCTATAGATTCAGCTCCATCTGGCTCAATTCTGGAAGATGAAGATATTGAGGAGGAGTTT
AAGAAGCTTGAGTTGGAAGTAACAGCAGGCCAAAACCTCGACGTGTCGACATCAGAATCTGGGGTTAATATTGCAACTGGTGAAACAGTGGCTACAGTTTCCGACGACTC
ATTGAGCGCTGCATTATCAAATCTAAAGCTTGTTGAAGAAACAGGAAAGGAGACAGCGAACCAGAAGTCGAATTCTAAGAGTAAGTCGAAAATTATGGAGCTTAGCATTT
CTTAGGTGCATTCTCATGGAGCAGTTTACAATGTCAATGTCAGCCCAATGTAACCTTCTGCTTTCGGTTGCCTCTGTACTTTGTCGTTAAAATTCAGTGAAATTGATGTA
CAAAATTGTTACTACAAAACCGCTGCTCTTTGTTCCTGAGTCAGCTATTGTATTACTGTTATTGTTGAAAAATATAAGATTGAGATCGTTGTATAAACTTCAAAGCGTGA
TTAGTTTGAAATCTCCATTCTAAAAACAAAATGGCGGTTAAATTACTAAGTTCAAAGTGGTCGTTGGACAGGTATATTAACCCAATTAAGTTTTAAATTTT
Protein sequence
MEIAALFIGVNIANKMEKESKGSRVREFIREKFPDWDDEVVATARFKAFSGQKSDWELRYLFWRDLILTISRQFNFVFIKPSEVKNQWFSRGGLTPLCLDHVLHLMYIEG
DIIRRRDMLDPRSGQLSYLFKKLSYLMGTYKKNPDSLLHDDYIVLACVLEFNMKCFHAKDRAAEVIKCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARY
LSKEKKELLEFFSELSPSELSFDFSFPPCNNQGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQCYDVSKQSALASLKSGNKKTALKHARELKITTESREKV
ASLLNRVEEVLNAVADAESTKTVGYHLQFSSLISGRGSARTFASLKMTGVSEAIQIGARVMKEHEVSWDQLQHSLQEVEASIDIQKQVASAIDSAPSGSILEDEDIEEEF
KKLELEVTAGQNLDVSTSESGVNIATGETVATVSDDSLSAALSNLKLVEETGKETANQKSNSKSKSKIMELSIS
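
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is illustrative only: the `translate` helper is hypothetical, it assumes the standard code (NCBI translation table 1) applies to this nuclear gene, and for brevity it translates only the first 45 and last 27 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()   # drop line breaks and whitespace
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_head = "ATGGAAATTGCAGCTCTATTCATAGGTGTGAACATTGCAAACAAA"  # first 45 nt of the CDS
cds_tail = "AAAATTATGGAGCTTAGCATTTCTTAG"                    # last 27 nt of the CDS

print(translate(cds_head))  # MEIAALFIGVNIANK
print(translate(cds_tail))  # KIMELSIS
```

Both fragments agree with the start and end of the protein sequence in this record. Passing the full CDS (with line breaks, which the helper strips) would allow the same comparison over the whole protein.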