CuGenDBv2

CmoCh16G001130 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh16G001130
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF819)
Genome location: Cmo_Chr16:513050..516310
RNA-Seq Expression: CmoCh16G001130
Synteny: CmoCh16G001130
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008537 - Protein of unknown function DUF819
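
The genome location in this record follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the gene span can be computed as below. The function name `parse_location` is hypothetical and is not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("Cmo_Chr16:513050..516310")
# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 3,261 bp on Cmo_Chr16.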


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6576704.1 hypothetical protein SDJN03_24278, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 8.0e-244 | identity: 99.78%
Query:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAES RRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

KAG7014753.1 yjcL, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 2.5e-237 | identity: 95.82%
Query:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAES RRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSIGAF------------GIW----SEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGS
        FSIGAF              W    SEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGS
Subjt:  FSIGAF------------GIW----SEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGS

Query:  VATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIE
        VATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISD+LGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIE
Subjt:  VATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIE

Query:  DSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLV
        DSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLV
Subjt:  DSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLV

Query:  QIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        QIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  QIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_022922913.1 uncharacterized protein LOC111430750 [Cucurbita moschata] | e-value: 9.4e-245 | identity: 100%
Query:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_022984975.1 uncharacterized protein LOC111483083 [Cucurbita maxima] | e-value: 5.9e-239 | identity: 98.06%
Query:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCK+SAR SRSIAIA RPPVQPLSSSSSAAAES RRRFWNFCGTSTGNV LRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FS+GAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDA KDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
         FDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_023553523.1 uncharacterized protein LOC111810914 [Cucurbita pepo subsp. pepo] | e-value: 3.5e-239 | identity: 98.06%
Query:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPL-SSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTV
        MALQSQLASKSPEVQLLCLSSCK+SARFSRSIAIAPRPPVQPL SSSSSAAAES RRRFWNFCGTSTGNV LRRNVAVRSHLKLNLPLISPHDQWGNWTV
Subjt:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPL-SSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTV

Query:  LFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVP
        LFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVP
Subjt:  LFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVP

Query:  MQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALA
        MQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGV+SSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDA KDAEIEDS+KLPVLQSA ALA
Subjt:  MQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALA

Query:  VSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKL
        VSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGA+AMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKL
Subjt:  VSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKL

Query:  LGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        LGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  LGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
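
The % identity values listed for each hit can be related to the middle line of each alignment block. The sketch below assumes the usual BLAST-style midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The helper `midline_stats` is a hypothetical name, and the example uses only the final 63-column block of the XP_022984975.1 alignment, so it gives a per-block figure rather than the whole-alignment value.

```python
def midline_stats(query, midline):
    """Return (columns, identical, positives) for one alignment block.

    The midline is padded to the query length because trailing
    spaces are often lost when alignments are copied as text.
    """
    columns = len(query)
    midline = midline.ljust(columns)
    positives = midline.count("+")
    identical = sum(1 for ch in midline if ch not in " +")
    return columns, identical, positives

query = "GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM"
# In this block only the first column is a mismatch (blank in the midline).
midline = " " + query[1:]
columns, identical, positives = midline_stats(query, midline)
print(f"{identical}/{columns} identical ({100 * identical / columns:.2f}%)")
```

Summing the identical and total column counts over all blocks of a hit, rather than averaging per-block percentages, gives the whole-alignment identity.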

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L600 Uncharacterized protein | e-value: 5.8e-200 | identity: 84.12%
Query:  MALQSQLA---SKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNW
        MA QSQL    SKSPE+Q  C SS K S  FSRSI++A  PP+Q L SSSS  AE    RFW+F  +S GNV  RR+VAVRSHLKLNLPL+SP+DQWGNW
Subjt:  MALQSQLA---SKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNW

Query:  TVLFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFL
        TVLFSIGAFGIWSEKTKVGSALSGALVS LVGLAASNFGIIASDAPAF IVLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TT+GTVVAYFL
Subjt:  TVLFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFL

Query:  VPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATA
        VPM+SLGQDSWKIAAALMGRHIGGAVNYVAISDALGVS SVLAAGLAADNVICA YFATLFALASKVPPEPTT ++     KDAE+E S+KLPVLQSA+A
Subjt:  VPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATA

Query:  LAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLG
        +AVSFAICK GSYLTKYFGIQGGSMPAITA+IVVLATIFPK FAYLAPSG AMA+ILMQ+FFAVVGASGNVWSVI+TAPSIFLF+ VQI+VHL II+GLG
Subjt:  LAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLG

Query:  KLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        KLL FD K LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFGMMVLKYM
Subjt:  KLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X3 | e-value: 2.2e-199 | identity: 83.81%
Query:  LASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAF
        L SKSP++QL C SS K SARF RSI +APRPPV P+ SSSS AAE G RRFWNF   S+GN  LRR +AV+SHLKLNLPLISPHDQW NWTVLFS+GAF
Subjt:  LASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAF

Query:  GIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQD
        GIWSEKTK+GSALSGALVS LVGLAASN GIIASDAPAFP+VLE LLPL++P+LLFRADLR VIKSTGTLLLAFLLGSV T IGT VAYFLVPM+SLGQD
Subjt:  GIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQD

Query:  SWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAVSFAICK
        SWKIAAALMGRHIGGAVNYVAIS ALGVS SVLAAGLAADNVICA YFATLFALASKVP EPT  +D  +  KD E E ++KLPVLQSATALAVSFAICK
Subjt:  SWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAVSFAICK

Query:  AGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKL
        AGSYLTK+FGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMA+ILMQ+FF VVGASGN+WSVI+TAPSIF+FSLVQIAVHLA+ +GLGKLL FD KL
Subjt:  AGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKL

Query:  LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X1 | e-value: 2.2e-199 | identity: 83.81%
Query:  LASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAF
        L SKSP++QL C SS K SARF RSI +APRPPV P+ SSSS AAE G RRFWNF   S+GN  LRR +AV+SHLKLNLPLISPHDQW NWTVLFS+GAF
Subjt:  LASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAF

Query:  GIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQD
        GIWSEKTK+GSALSGALVS LVGLAASN GIIASDAPAFP+VLE LLPL++P+LLFRADLR VIKSTGTLLLAFLLGSV T IGT VAYFLVPM+SLGQD
Subjt:  GIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQD

Query:  SWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAVSFAICK
        SWKIAAALMGRHIGGAVNYVAIS ALGVS SVLAAGLAADNVICA YFATLFALASKVP EPT  +D  +  KD E E ++KLPVLQSATALAVSFAICK
Subjt:  SWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAVSFAICK

Query:  AGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKL
        AGSYLTK+FGIQGGSMPAITA+IVVLATIFPKPFAYLAPSG AMA+ILMQ+FF VVGASGN+WSVI+TAPSIF+FSLVQIAVHLA+ +GLGKLL FD KL
Subjt:  AGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKL

Query:  LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1E833 uncharacterized protein LOC111430750 | e-value: 4.6e-245 | identity: 100%
Query:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1JA22 uncharacterized protein LOC111483083 | e-value: 2.9e-239 | identity: 98.06%
Query:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL
        MALQSQLASKSPEVQLLCLSSCK+SAR SRSIAIA RPPVQPLSSSSSAAAES RRRFWNFCGTSTGNV LRRNVAVRSHLKLNLPLISPHDQWGNWTVL
Subjt:  MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVL

Query:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
        FS+GAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFP VLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM
Subjt:  FSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPM

Query:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV
        QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDA KDAEIEDSSKLPVLQSATALAV
Subjt:  QSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAV

Query:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
        SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL
Subjt:  SFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLL

Query:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
         FDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  GFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

SwissProt top hits (e-value, % identity, alignment)
O31634 Uncharacterized membrane protein YjcL | e-value: 1.7e-34 | identity: 30.18%
Query:  LISPHDQWGNWTVLFSIGAFGIWSE-KTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSV
        LIS  D W  W  +    A  I  E + K  SA+SGA+++    +  +N G++  ++P +  V  +++PLA+P+LLF+ ++R + K +  LL  FL+ SV
Subjt:  LISPHDQWGNWTVLFSIGAFGIWSE-KTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSV

Query:  ATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALAS--------------KVPPEPTTF
         T +G+++A+FL+       D  KI   +   +IGG VN+ A++         ++A + ADN + A  F  L ++ +              KV  +  + 
Subjt:  ATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALAS--------------KVPPEPTTF

Query:  NDATDAVKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYF------GIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGAS
        N A    K  +I  S K     +  A A+     K   Y    F      G  G     +T++ V++  +FP+ F  L  S   +   L+ +FF V+G  
Subjt:  NDATDAVKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYF------GIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGAS

Query:  GNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG
         ++  +++ AP I LF  +    +LA+ L  GKL     + +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G
Subjt:  GNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG

Arabidopsis top hits (e-value, % identity, alignment)
AT5G24000.1 Protein of unknown function (DUF819) | e-value: 1.5e-126 | identity: 61.38%
Query:  RRNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIK
        RR V V S L+   PLISP D W  W  LF+ GAFG+WSEKTK+GS +SGAL S L+GLAASN  +I  + P++   +EFLLP  +P+LLFRADLR +I+
Subjt:  RRNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIK

Query:  STGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTF
        STG+LLLAFL+GSVAT +GTVVA+ LVPM+SLG D+WKIAAALMG +IGG++N+VAIS+AL +S SV+AAG+A DNVICA +F  LFALASK+PPE  + 
Subjt:  STGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTF

Query:  NDA-TDAVKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWS
        +    D  KD ++ED ++  V+ ++ AL+VSF ICKA   LT  F IQG  +PA+TAI +VLAT FP  F  LAPS   +++ILMQ+FF ++GA+G+VW+
Subjt:  NDA-TDAVKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWS

Query:  VISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLK
        VI+TAPSIFLF+ +Q+ VHLA+ L LGKL   D KLLL+ASNAN+GGPTTAC MATAKGW+S+V+PGIL+G+FG++IATFLGIG G+ VLK
Subjt:  VISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLK

AT5G52540.1 Protein of unknown function (DUF819) | e-value: 1.5e-152 | identity: 72.66%
Query:  RNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKS
        R+V V S   L+ PLISP+D+WG WT LF+ GA G+WSEKTKVG+A+SGALVS LVGLAASN GII+S APAF +VL FLLPLAVP+LLFRADLR V++S
Subjt:  RNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWSEKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKS

Query:  TGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVP----PEP
        TG LLLAFL+GSVATT+GT +AY+LVPM+SLG DSWKIAAALMGRHIGGAVNYVAIS+ALGV+ SVLAAGLAADNVICA YF TLFAL SK+P    P P
Subjt:  TGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVP----PEP

Query:  TTFNDATDAVKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNV
        TT     DA  +   E  +K+PVL  AT +AVS AICKAG+ LTKYFGI GGS+PAITA++V+LAT+FP  F  LAPSG AMA+ILMQ+FF VVGASGN+
Subjt:  TTFNDATDAVKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIVVLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNV

Query:  WSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        WSVI+TAPSIFLF+LVQI  HLA+ILG+GKLL  + +LLL+ASNANVGGPTTA GMATAKGW+S+++PGILAGIFGIAIATF+GI FG+ VLK+M
Subjt:  WSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM


Sequences
CDS sequence
ATGGCCTTACAGTCGCAGCTCGCCTCGAAGTCGCCCGAAGTACAGCTTCTATGTTTGTCTTCCTGCAAACTGTCAGCTAGATTCTCCAGGAGCATCGCGATCGCTCCTCG
GCCACCGGTGCAACCGTTGTCGTCGTCGTCATCAGCAGCAGCTGAAAGTGGACGCCGGAGATTTTGGAACTTTTGCGGCACCAGTACTGGAAATGTTCTACTGAGACGAA
ATGTTGCTGTTAGATCTCATCTGAAATTGAATCTCCCCCTCATTTCTCCGCATGACCAGTGGGGCAACTGGACTGTTTTATTCTCCATTGGAGCCTTCGGTATCTGGTCT
GAGAAAACGAAGGTTGGTAGTGCATTAAGTGGTGCCCTAGTGAGCGCATTGGTGGGACTTGCAGCCAGTAATTTCGGGATCATTGCATCTGATGCTCCAGCTTTTCCTAT
TGTTTTGGAGTTTTTGCTACCGTTAGCAGTTCCTATGCTGTTATTTAGGGCAGATTTGCGCAATGTAATAAAGTCAACTGGGACACTTCTCTTGGCATTTTTGTTAGGTT
CAGTTGCAACAACAATTGGAACCGTTGTGGCCTATTTTCTTGTACCAATGCAATCACTTGGTCAGGACAGTTGGAAAATTGCCGCCGCACTAATGGGAAGACATATTGGT
GGAGCTGTCAATTATGTTGCTATATCTGATGCCCTTGGTGTCTCTTCATCAGTATTAGCAGCTGGACTTGCTGCGGATAATGTAATTTGTGCAGCGTATTTTGCAACATT
GTTTGCATTAGCATCTAAAGTACCTCCTGAACCAACGACATTTAATGATGCTACGGATGCTGTGAAGGATGCAGAAATTGAGGATAGCAGCAAGCTTCCGGTGTTACAAT
CTGCCACAGCCCTTGCTGTATCGTTTGCCATTTGTAAAGCTGGTTCCTACCTGACCAAATATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATTACCGCCATCATTGTT
GTCTTAGCAACCATTTTTCCCAAGCCGTTTGCTTACCTTGCTCCTTCTGGTGGGGCTATGGCTATGATTCTAATGCAGATTTTCTTTGCTGTAGTGGGAGCAAGTGGAAA
CGTATGGAGTGTCATCAGCACTGCACCAAGTATCTTCTTGTTTTCTCTTGTCCAGATCGCAGTCCATCTTGCCATAATCCTTGGTCTTGGAAAGCTGCTTGGCTTCGACC
AAAAGTTGCTGCTGATAGCATCGAATGCCAACGTTGGAGGTCCCACTACAGCCTGCGGGATGGCCACGGCTAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTTGCT
GGAATTTTCGGAATCGCTATTGCAACTTTCCTAGGGATTGGATTTGGAATGATGGTCTTGAAATACATGTAA
mRNA sequence
TTGGTGTTCTCTCTCCGTCCAATCCACCTCGCCGGAGATGGCCTTACAGTCGCAGCTCGCCTCGAAGTCGCCCGAAGTACAGCTTCTATGTTTGTCTTCCTGCAAACTGT
CAGCTAGATTCTCCAGGAGCATCGCGATCGCTCCTCGGCCACCGGTGCAACCGTTGTCGTCGTCGTCATCAGCAGCAGCTGAAAGTGGACGCCGGAGATTTTGGAACTTT
TGCGGCACCAGTACTGGAAATGTTCTACTGAGACGAAATGTTGCTGTTAGATCTCATCTGAAATTGAATCTCCCCCTCATTTCTCCGCATGACCAGTGGGGCAACTGGAC
TGTTTTATTCTCCATTGGAGCCTTCGGTATCTGGTCTGAGAAAACGAAGGTTGGTAGTGCATTAAGTGGTGCCCTAGTGAGCGCATTGGTGGGACTTGCAGCCAGTAATT
TCGGGATCATTGCATCTGATGCTCCAGCTTTTCCTATTGTTTTGGAGTTTTTGCTACCGTTAGCAGTTCCTATGCTGTTATTTAGGGCAGATTTGCGCAATGTAATAAAG
TCAACTGGGACACTTCTCTTGGCATTTTTGTTAGGTTCAGTTGCAACAACAATTGGAACCGTTGTGGCCTATTTTCTTGTACCAATGCAATCACTTGGTCAGGACAGTTG
GAAAATTGCCGCCGCACTAATGGGAAGACATATTGGTGGAGCTGTCAATTATGTTGCTATATCTGATGCCCTTGGTGTCTCTTCATCAGTATTAGCAGCTGGACTTGCTG
CGGATAATGTAATTTGTGCAGCGTATTTTGCAACATTGTTTGCATTAGCATCTAAAGTACCTCCTGAACCAACGACATTTAATGATGCTACGGATGCTGTGAAGGATGCA
GAAATTGAGGATAGCAGCAAGCTTCCGGTGTTACAATCTGCCACAGCCCTTGCTGTATCGTTTGCCATTTGTAAAGCTGGTTCCTACCTGACCAAATATTTTGGAATTCA
AGGTGGTAGCATGCCAGCAATTACCGCCATCATTGTTGTCTTAGCAACCATTTTTCCCAAGCCGTTTGCTTACCTTGCTCCTTCTGGTGGGGCTATGGCTATGATTCTAA
TGCAGATTTTCTTTGCTGTAGTGGGAGCAAGTGGAAACGTATGGAGTGTCATCAGCACTGCACCAAGTATCTTCTTGTTTTCTCTTGTCCAGATCGCAGTCCATCTTGCC
ATAATCCTTGGTCTTGGAAAGCTGCTTGGCTTCGACCAAAAGTTGCTGCTGATAGCATCGAATGCCAACGTTGGAGGTCCCACTACAGCCTGCGGGATGGCCACGGCTAA
GGGTTGGAGTTCAATGGTTATTCCTGGAATTCTTGCTGGAATTTTCGGAATCGCTATTGCAACTTTCCTAGGGATTGGATTTGGAATGATGGTCTTGAAATACATGTAAT
ACCATTCAAATTTAGATCATAAAATCCTCTGCTTTTCTCCAAATCAGTTACTGGAACAACTGTTTAAGATAGTTTTAAGATAGTTTTTGCTCATAGGACGTTGGAGGAGA
GCTTGTCGTCGCAGGCAAAGCTCAACTTGAGTATCT
Protein sequence
MALQSQLASKSPEVQLLCLSSCKLSARFSRSIAIAPRPPVQPLSSSSSAAAESGRRRFWNFCGTSTGNVLLRRNVAVRSHLKLNLPLISPHDQWGNWTVLFSIGAFGIWS
EKTKVGSALSGALVSALVGLAASNFGIIASDAPAFPIVLEFLLPLAVPMLLFRADLRNVIKSTGTLLLAFLLGSVATTIGTVVAYFLVPMQSLGQDSWKIAAALMGRHIG
GAVNYVAISDALGVSSSVLAAGLAADNVICAAYFATLFALASKVPPEPTTFNDATDAVKDAEIEDSSKLPVLQSATALAVSFAICKAGSYLTKYFGIQGGSMPAITAIIV
VLATIFPKPFAYLAPSGGAMAMILMQIFFAVVGASGNVWSVISTAPSIFLFSLVQIAVHLAIILGLGKLLGFDQKLLLIASNANVGGPTTACGMATAKGWSSMVIPGILA
GIFGIAIATFLGIGFGMMVLKYM
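
The CDS and protein sequences in this record can be cross-checked by translation. The following is a minimal sketch, assuming the standard nuclear genetic code applies (the record does not name a translation table); `translate` is a hypothetical helper written with the standard library only. It translates the first 30 nucleotides of the CDS, which should reproduce the first ten residues of the protein sequence.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence in this record.
cds_start = "ATGGCCTTACAGTCGCAGCTCGCCTCGAAG"
print(translate(cds_start))  # MALQSQLASK
```

Joining the CDS lines into one string and passing it to `translate` allows the same comparison against the full protein sequence.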