CuGenDBv2

CmaCh16G001050 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh16G001050
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Protein of unknown function (DUF819)
Genome location: Cma_Chr16:469649..472966
RNA-Seq Expression: CmaCh16G001050
Synteny: CmaCh16G001050
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008537 - Protein of unknown function DUF819
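
The genome location above is written as a sequence name followed by start..end coordinates. The following is a minimal sketch of how such a string could be parsed, assuming the coordinates are 1-based and inclusive (the page does not state the convention). The function name parse_location is hypothetical and is not part of any CuGenDB interface.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(
        r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location
    )
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("Cma_Chr16:469649..472966")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention should be confirmed before the value is used downstream.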


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576704.1 hypothetical protein SDJN03_24278, partial [Cucurbita argyrosperma subsp. sororia] (e value: 7.7e-239; % identity: 98.06)
Query:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCK+SAR SRSIAIA RPPVQPLSSSSSAAAES RRRFWNFCGTSTGNV LRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FS+GAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDA KDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
         FDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

KAG7014753.1 yjcL, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.4e-232; % identity: 94.15)
Query:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCK+SAR SRSIAIA RPPVQPLSSSSSAAAES RRRFWNFCGTSTGNV LRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAF------------GIW----SEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGS
        FS+GAF              W    SEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGS
Subjt:  FSVGAF------------GIW----SEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGS

Query:  VATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIE
        VATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISD+LGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDA KDAEIE
Subjt:  VATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIE

Query:  DSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLV
        DSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLV
Subjt:  DSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLV

Query:  QIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        QIAVHLAIILGLGKLL FDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  QIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_022922913.1 uncharacterized protein LOC111430750 [Cucurbita moschata] (e value: 5.9e-239; % identity: 98.06)
Query:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCK+SAR SRSIAIA RPPVQPLSSSSSAAAES RRRFWNFCGTSTGNV LRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FS+GAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDA KDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
         FDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_022984975.1 uncharacterized protein LOC111483083 [Cucurbita maxima] (e value: 2.1e-244; % identity: 100)
Query:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_023553523.1 uncharacterized protein LOC111810914 [Cucurbita pepo subsp. pepo] (e value: 2.2e-238; % identity: 97.84)
Query:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPL-SSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTV
        MALQSQLASKSPEVQLLCLSSCKISAR SRSIAIA RPPVQPL SSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTV
Subjt:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPL-SSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTV

Query:  LFSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVP
        LFS+GAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVP
Subjt:  LFSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVP

Query:  MQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALA
        MQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGV+SSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDS+KLPVLQSA ALA
Subjt:  MQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALA

Query:  VSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKL
        VSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGA+AMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKL
Subjt:  VSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKL

Query:  LRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        L FDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  LRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L600 Uncharacterized protein (e value: 2.4e-201; % identity: 84.12)
Query:  MALQSQLA---SKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNW
        MA QSQL    SKSPE+Q  C SS K S   SRSI++A  PP+Q L SSSS  AE    RFW+F  +S GNVQ RR+VAVRSHLKLNLPL+SP+DQWGNW
Subjt:  MALQSQLA---SKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNW

Query:  TVLFSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFL
        TVLFS+GAFGIWSEKTKVGSALSGALVS LVGLAASNFGIIASDAPAF  VLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TT+GTVVAYFL
Subjt:  TVLFSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFL

Query:  VPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATA
        VPM+SLGQDSWKIAAALMGRHIGGAVNYVAISDALGVS SVLAAGLAADNVICA YFATLFALASKVPPEPTT ++    GKDAE+E S+KLPVLQSA+A
Subjt:  VPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATA

Query:  LAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLG
        +AVSFAICK GSYLTKYFGIQGGSMPAITA+IVVLATIFPK FAYLAPSG AMA+ILMQ+FFAVVGASGNVWSVI+TAPSIFLF+ VQI+VHL II+GLG
Subjt:  LAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLG

Query:  KLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        KLLRFD K LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFGMMVLKYM
Subjt:  KLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X3 (e value: 1.1e-198; % identity: 83.81)
Query:  LASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAF
        L SKSP++QL C SS K SAR  RSI +A RPPV P+ SSSS AAE   RRFWNF   S+GN  LRR +AV+SHLKLNLPLISPHDQW NWTVLFSVGAF
Subjt:  LASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAF

Query:  GIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQD
        GIWSEKTK+GSALSGALVS LVGLAASN GIIASDAPAFP VLE LLPL++P+LLFRADLR VIKSTGTLLLAFLLGSV T IGT VAYFLVPM+SLGQD
Subjt:  GIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQD

Query:  SWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAVSFAICK
        SWKIAAALMGRHIGGAVNYVAIS ALGVS SVLAAGLAADNVICA YFATLFALASKVP EPT  +D  + GKD E E ++KLPVLQSATALAVSFAICK
Subjt:  SWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAVSFAICK

Query:  AGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKL
        AGSYLTK+FGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMA+ILMQ+FF VVGASGN+WSVI+TAPSIF+FSLVQIAVHLA+ +GLGKLLRFD KL
Subjt:  AGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKL

Query:  LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X1 (e value: 1.1e-198; % identity: 83.81)
Query:  LASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAF
        L SKSP++QL C SS K SAR  RSI +A RPPV P+ SSSS AAE   RRFWNF   S+GN  LRR +AV+SHLKLNLPLISPHDQW NWTVLFSVGAF
Subjt:  LASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAF

Query:  GIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQD
        GIWSEKTK+GSALSGALVS LVGLAASN GIIASDAPAFP VLE LLPL++P+LLFRADLR VIKSTGTLLLAFLLGSV T IGT VAYFLVPM+SLGQD
Subjt:  GIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQD

Query:  SWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAVSFAICK
        SWKIAAALMGRHIGGAVNYVAIS ALGVS SVLAAGLAADNVICA YFATLFALASKVP EPT  +D  + GKD E E ++KLPVLQSATALAVSFAICK
Subjt:  SWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAVSFAICK

Query:  AGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKL
        AGSYLTK+FGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMA+ILMQ+FF VVGASGN+WSVI+TAPSIF+FSLVQIAVHLA+ +GLGKLLRFD KL
Subjt:  AGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKL

Query:  LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1E833 uncharacterized protein LOC111430750 (e value: 2.9e-239; % identity: 98.06)
Query:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCK+SAR SRSIAIA RPPVQPLSSSSSAAAES RRRFWNFCGTSTGNV LRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FS+GAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDA KDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
         FDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1JA22 uncharacterized protein LOC111483083 (e value: 1.0e-244; % identity: 100)
Query:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  RFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

SwissProt top hits (e value, % identity, alignment)
O31634 Uncharacterized membrane protein YjcL (e value: 1.5e-35; % identity: 30.43)
Query:  LISPHDQWGNWTVLFSVGAFGIWSE-KTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSV
        LIS  D W  W  +    A  I  E + K  SA+SGA+++    +  +N G++  ++P +  V  +++PLA+P+LLF+ ++R + K +  LL  FL+ SV
Subjt:  LISPHDQWGNWTVLFSVGAFGIWSE-KTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSV

Query:  ATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALAS--------------KVPPEPTTF
         T +G+++A+FL+       D  KI   +   +IGG VN+ A++         ++A + ADN + A  F  L ++ +              KV  +  + 
Subjt:  ATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALAS--------------KVPPEPTTF

Query:  NDATDAGKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYF------GIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGAS
        N A    K  +I  S K     +  A A+     K   Y    F      G  G     +T++ V++  +FP+ F  L  S   +   L+ +FF V+G  
Subjt:  NDATDAGKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYF------GIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGAS

Query:  GNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG
         ++  +++ AP I LF  +    +LA+ L  GKL R   + +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G
Subjt:  GNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG

Arabidopsis top hits (e value, % identity, alignment)
AT5G24000.1 Protein of unknown function (DUF819) (e value: 2.2e-127; % identity: 61.64)
Query:  RRNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIK
        RR V V S L+   PLISP D W  W  LF+ GAFG+WSEKTK+GS +SGAL S L+GLAASN  +I  + P++ F +EFLLP  +P+LLFRADLR +I+
Subjt:  RRNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIK

Query:  STGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTF
        STG+LLLAFL+GSVAT +GTVVA+ LVPM+SLG D+WKIAAALMG +IGG++N+VAIS+AL +S SV+AAG+A DNVICA +F  LFALASK+PPE  + 
Subjt:  STGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTF

Query:  NDA-TDAGKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWS
        +    D  KD ++ED ++  V+ ++ AL+VSF ICKA   LT  F IQG  +PA+TAI +VLAT FP  F  LAPS   +++ILMQ+FF ++GA+G+VW+
Subjt:  NDA-TDAGKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWS

Query:  VISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLK
        VI+TAPSIFLF+ +Q+ VHLA+ L LGKL   D KLLL+ASNAN+GGPTTAC MATAKGW+S+V+PGIL+G+FG++IATFLGIG G+ VLK
Subjt:  VISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLK

AT5G52540.1 Protein of unknown function (DUF819) (e value: 1.5e-152; % identity: 72.66)
Query:  RNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKS
        R+V V S   L+ PLISP+D+WG WT LF+ GA G+WSEKTKVG+A+SGALVS LVGLAASN GII+S APAF  VL FLLPLAVP+LLFRADLR V++S
Subjt:  RNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKS

Query:  TGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVP----PEP
        TG LLLAFL+GSVATT+GT +AY+LVPM+SLG DSWKIAAALMGRHIGGAVNYVAIS+ALGV+ SVLAAGLAADNVICA YF TLFAL SK+P    P P
Subjt:  TGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVP----PEP

Query:  TTFNDATDAGKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNV
        TT     DA  +   E  +K+PVL  AT +AVS AICKAG+ LTKYFGI GGS+PAITA++V+LAT+FP  F  LAPSG AMA+ILMQ+FF VVGASGN+
Subjt:  TTFNDATDAGKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNV

Query:  WSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        WSVI+TAPSIFLF+LVQI  HLA+ILG+GKLL  + +LLL+ASNANVGGPTTA GMATAKGW+S+++PGILAGIFGIAIATF+GI FG+ VLK+M
Subjt:  WSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM


Sequences
CDS sequence
ATGGCCTTACAGTCGCAGCTCGCCTCGAAGTCGCCCGAAGTACAGCTTCTATGTTTGTCTTCCTGCAAAATCTCAGCTAGAATCTCCAGGAGCATCGCGATCGCTCTTCG
GCCACCAGTGCAACCGTTGTCGTCGTCGTCATCAGCAGCAGCTGAAAGTGCACGCCGAAGATTCTGGAACTTTTGCGGCACCAGTACTGGAAATGTTCAACTGAGACGAA
ATGTTGCTGTTAGATCTCATCTGAAATTGAATCTCCCCCTCATTTCTCCGCATGACCAGTGGGGCAACTGGACTGTTTTATTCTCCGTTGGAGCCTTCGGTATCTGGTCT
GAGAAAACGAAGGTTGGTAGTGCATTAAGTGGTGCCCTAGTGAGCGCATTGGTGGGACTTGCAGCCAGTAATTTTGGGATCATTGCATCTGATGCTCCAGCTTTTCCTTT
TGTTTTGGAGTTTTTGCTACCGTTAGCAGTTCCTATGCTGTTATTTAGGGCAGATTTGCGCAATGTAATAAAGTCAACTGGGACACTTCTCTTGGCATTTTTGTTAGGTT
CAGTTGCAACAACAATTGGAACTGTTGTGGCCTATTTTCTTGTACCAATGCAATCACTTGGTCAGGACAGTTGGAAAATTGCCGCCGCACTAATGGGAAGACATATTGGT
GGAGCTGTCAATTATGTCGCTATATCTGATGCCCTTGGTGTCTCTTCATCAGTATTAGCAGCTGGACTTGCTGCGGATAATGTAATTTGTGCAGCGTATTTTGCAACATT
GTTTGCTTTAGCATCTAAAGTACCTCCTGAACCAACGACATTTAATGATGCGACGGATGCTGGAAAGGATGCAGAAATTGAGGATAGCAGCAAGCTTCCGGTGTTACAAT
CTGCCACAGCCCTTGCTGTATCGTTTGCCATTTGTAAAGCTGGTTCCTACCTGACCAAATATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATTACCGCCATCATTGTT
GTCTTAGCAACCATTTTTCCCAAGCCGTTTGCTTACCTTGCTCCTTCTGGTGGGGCTATGGCTATGATTCTAATGCAGATTTTCTTTGCTGTAGTGGGAGCAAGTGGAAA
TGTATGGAGTGTCATCAGCACTGCGCCAAGTATCTTCTTGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCCATAATCCTTGGTCTTGGAAAGCTGCTTCGCTTCGACC
AAAAGTTGCTGCTGATAGCATCGAATGCCAACGTCGGAGGTCCCACTACAGCCTGCGGGATGGCCACGGCTAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTTGCT
GGAATTTTTGGAATCGCTATTGCAACTTTCCTAGGGATTGGATTTGGAATGATGGTCTTGAAATACATGTAA
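
The CDS above should translate to the protein sequence listed on this page. The following is a minimal sketch of that check, assuming the standard genetic code; it is illustrated on the first 24 and last 12 nucleotides of the CDS rather than the full sequence. The names CODON_TABLE and translate are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 and last 12 nucleotides of the CDS shown above.
cds_start = "ATGGCCTTACAGTCGCAGCTCGCC"
cds_end = "AAATACATGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

To check the whole record, the CDS lines would be joined into one string and the result compared with the protein sequence given at the end of the page.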
mRNA sequence
CTCCGTCCAATCCACCTCGCCGGAGATGGCCTTACAGTCGCAGCTCGCCTCGAAGTCGCCCGAAGTACAGCTTCTATGTTTGTCTTCCTGCAAAATCTCAGCTAGAATCT
CCAGGAGCATCGCGATCGCTCTTCGGCCACCAGTGCAACCGTTGTCGTCGTCGTCATCAGCAGCAGCTGAAAGTGCACGCCGAAGATTCTGGAACTTTTGCGGCACCAGT
ACTGGAAATGTTCAACTGAGACGAAATGTTGCTGTTAGATCTCATCTGAAATTGAATCTCCCCCTCATTTCTCCGCATGACCAGTGGGGCAACTGGACTGTTTTATTCTC
CGTTGGAGCCTTCGGTATCTGGTCTGAGAAAACGAAGGTTGGTAGTGCATTAAGTGGTGCCCTAGTGAGCGCATTGGTGGGACTTGCAGCCAGTAATTTTGGGATCATTG
CATCTGATGCTCCAGCTTTTCCTTTTGTTTTGGAGTTTTTGCTACCGTTAGCAGTTCCTATGCTGTTATTTAGGGCAGATTTGCGCAATGTAATAAAGTCAACTGGGACA
CTTCTCTTGGCATTTTTGTTAGGTTCAGTTGCAACAACAATTGGAACTGTTGTGGCCTATTTTCTTGTACCAATGCAATCACTTGGTCAGGACAGTTGGAAAATTGCCGC
CGCACTAATGGGAAGACATATTGGTGGAGCTGTCAATTATGTCGCTATATCTGATGCCCTTGGTGTCTCTTCATCAGTATTAGCAGCTGGACTTGCTGCGGATAATGTAA
TTTGTGCAGCGTATTTTGCAACATTGTTTGCTTTAGCATCTAAAGTACCTCCTGAACCAACGACATTTAATGATGCGACGGATGCTGGAAAGGATGCAGAAATTGAGGAT
AGCAGCAAGCTTCCGGTGTTACAATCTGCCACAGCCCTTGCTGTATCGTTTGCCATTTGTAAAGCTGGTTCCTACCTGACCAAATATTTTGGAATTCAAGGTGGTAGCAT
GCCAGCAATTACCGCCATCATTGTTGTCTTAGCAACCATTTTTCCCAAGCCGTTTGCTTACCTTGCTCCTTCTGGTGGGGCTATGGCTATGATTCTAATGCAGATTTTCT
TTGCTGTAGTGGGAGCAAGTGGAAATGTATGGAGTGTCATCAGCACTGCGCCAAGTATCTTCTTGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCCATAATCCTTGGT
CTTGGAAAGCTGCTTCGCTTCGACCAAAAGTTGCTGCTGATAGCATCGAATGCCAACGTCGGAGGTCCCACTACAGCCTGCGGGATGGCCACGGCTAAGGGTTGGAGTTC
AATGGTTATTCCTGGAATTCTTGCTGGAATTTTTGGAATCGCTATTGCAACTTTCCTAGGGATTGGATTTGGAATGATGGTCTTGAAATACATGTAATACCATTCAAATT
TAGATCATAAAGTCCTCTGCTTTTCTCCAAATTCCTGTAACTTATTTAATGGAACAACTTATTTTTGCTCAAAGGACATTGGAGGTCGCTGGCAAAGCTCAACTTGAGTA
TCTAATGGCATGTTATATGTCAATTCGAGCTGAGCTCAGTTCTTCATGGTGAGACCATA
Protein sequence
MALQSQLASKSPEVQLLCLSSCKISARISRSIAIALRPPVQPLSSSSSAAAESARRRFWNFCGTSTGNVQLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWS
EKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPFVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIG
GAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAGKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIV
VLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLRFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILA
GIFGIAIATFLGIGFGMMVLKYM