CuGenDBv2

IVF0008004 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0008004
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Protein of unknown function (DUF819)
Genome location: chr06:726506..736599
RNA-Seq Expression: IVF0008004
Synteny: IVF0008004
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR008537 - Protein of unknown function DUF819
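
The genome location above packs the chromosome name and the start and end coordinates into a single string. A minimal Python sketch for splitting it and computing the locus span follows. The helper name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

chrom, start, end = parse_location("chr06:726506..736599")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 10094 bp, which is far longer than the coding sequence listed under Sequences and would be consistent with a gene that contains introns.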


Homology
GenBank top hits | e value | % identity
KAG6576704.1 hypothetical protein SDJN03_24278, partial [Cucurbita argyrosperma subsp. sororia] | 7.77e-253 | 83.48
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSS-LAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNW
        MA QS LA   S SPE+Q  C SS K S  FSRSIA+AP PP+Q LSSSS  AA+    RFW+F  +S GNV  RR+VAV+SHLKLNLPL+SP DQWGNW
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSS-LAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNW

Query:  TVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFL
        TVLFSIGAFGIWSEKTK+GSALSGALVS LVGLAASNFGIIASDAPAF  VLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TT+GTVVAYFL
Subjt:  TVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFL

Query:  VPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVE--KDAEVEPSNKLPVLQSASA
        VPM+SLGQDSWKIAAALMGRHIGGAVNYVAISDALGVS SVLAAGLAADNVICA YFATLFALAS VPPE TT ++  +  KDAE+E S+KLPVLQSA+A
Subjt:  VPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVE--KDAEVEPSNKLPVLQSASA

Query:  IAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLG
        +AVSFAICK GSYLTKYFGIQGGSMPAITA+IVVLATIFPK FAYLAPSG AMA+ILMQ+FFAVVGASGNVWSVI+TAPSIF+F+ VQI+VHLAII+GLG
Subjt:  IAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLG

Query:  KLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        KLL FD K LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  KLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_004140757.1 uncharacterized protein LOC101211894 isoform X1 [Cucumis sativus] | 4.15e-301 | 95.68
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
        MASQS L ILHS SPELQ PCFSS+KFSVGFSRSI+MA HPPLQSLSSSSL A+ITCPRFWSFRKSSNGNVQSRRDVAV+SHLKLNLPLVSP DQWGNWT
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
        VLFSIGAFGIWSEKTK+GSALSGALVSTLVGLAASNFGIIASDAPAFA VLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS VPPE TTLDNGV KDAEVEPSNKLPVLQSASA+AV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
        SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIF+FAFVQISVHL IIIGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFGMMVLKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_008439290.1 PREDICTED: uncharacterized membrane protein YjcL-like [Cucumis melo] | 2.69e-313 | 99.57
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
        MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
        VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS VPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
        SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG MVLKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_023553523.1 uncharacterized protein LOC111810914 [Cucurbita pepo subsp. pepo] | 2.82e-253 | 83.3
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSS--LAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGN
        MA QS LA   S SPE+Q  C SS K S  FSRSIA+AP PP+Q LSSSS   AA+    RFW+F  +S GNVQ RR+VAV+SHLKLNLPL+SP DQWGN
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSS--LAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGN

Query:  WTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYF
        WTVLFSIGAFGIWSEKTK+GSALSGALVS LVGLAASNFGIIASDAPAF  VLEFLLPLAVP+LLFRADLR VIKSTGTLLLAFLLGSV TT+GTVVAYF
Subjt:  WTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYF

Query:  LVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVE--KDAEVEPSNKLPVLQSAS
        LVPM+SLGQDSWKIAAALMGRHIGGAVNYVAISDALGV+ SVLAAGLAADNVICA YFATLFALAS VPPE TT ++  +  KDAE+E SNKLPVLQSA 
Subjt:  LVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVE--KDAEVEPSNKLPVLQSAS

Query:  AIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGL
        A+AVSFAICK GSYLTKYFGIQGGSMPAITA+IVVLATIFPK FAYLAPSG A+A+ILMQ+FFAVVGASGNVWSVI+TAPSIF+F+ VQI+VHLAII+GL
Subjt:  AIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGL

Query:  GKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        GKLL FD K LLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
Subjt:  GKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

XP_038877446.1 uncharacterized membrane protein YjcL-like [Benincasa hispida] | 5.77e-283 | 91.59
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQ-SLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNW
        MASQS LA LHS SP  QPPC SS KFSVGFSR IAMAP P LQ SLSSSSL A+I   RFW+FRKSS GNVQ RRDVAVKSHLKLN+PL+SP DQWGNW
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQ-SLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNW

Query:  TVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFL
        TVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAF  VLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTT+GTVVAYFL
Subjt:  TVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFL

Query:  VPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIA
        VPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS VPPE+T LDN V K AEVEPSNKLPVLQSA+AIA
Subjt:  VPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIA

Query:  VSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKL
        VSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGN+WSVINTAPSIFIF+FVQI+VHLAII+GLGKL
Subjt:  VSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKL

Query:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        LRFDLK LLIASNANVGGPTTACGMATAKGWSSM+IPGILAGIFGIAIATFLGIGFGM+VLKYM
Subjt:  LRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

TrEMBL top hits | e value | % identity
A0A0A0L600 Uncharacterized protein | 7.8e-237 | 95.68
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
        MASQS L ILHS SPELQ PCFSS+KFSVGFSRSI+MA HPPLQSLSSSSL A+ITCPRFWSFRKSSNGNVQSRRDVAV+SHLKLNLPLVSP DQWGNWT
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
        VLFSIGAFGIWSEKTK+GSALSGALVSTLVGLAASNFGIIASDAPAFA VLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS VPPE TTLDNGV KDAEVEPSNKLPVLQSASA+AV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
        SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIF+FAFVQISVHL IIIGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIA+ATFLGIGFGMMVLKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A1S3AZ41 uncharacterized membrane protein YjcL-like | 4.1e-246 | 99.57
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
        MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
        VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS VPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
        SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG MVLKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X3 | 6.9e-201 | 83.15
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
        MASQ  +AIL S SP+LQ PCFSS+K S  F RSI MAP PP+  +SSSS AA+I   RFW+F  +S+GN   RR +AVKSHLKLNLPL+SP DQW NWT
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
        VLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAF  VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT +GT VAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALAS VP E T   + V KD E E +NKLPVLQSA+A+AV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
        SFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHLA+ IGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CHF3 uncharacterized protein LOC111011457 isoform X4 | 6.9e-201 | 83.15
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
        MASQ  +AIL S SP+LQ PCFSS+K S  F RSI MAP PP+  +SSSS AA+I   RFW+F  +S+GN   RR +AVKSHLKLNLPL+SP DQW NWT
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
        VLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAF  VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT +GT VAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALAS VP E T   + V KD E E +NKLPVLQSA+A+AV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
        SFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHLA+ IGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X1 | 6.9e-201 | 83.15
Query:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT
        MASQ  +AIL S SP+LQ PCFSS+K S  F RSI MAP PP+  +SSSS AA+I   RFW+F  +S+GN   RR +AVKSHLKLNLPL+SP DQW NWT
Subjt:  MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWT

Query:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV
        VLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAF  VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT +GT VAYFLV
Subjt:  VLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLV

Query:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV
        PMRSLGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALAS VP E T   + V KD E E +NKLPVLQSA+A+AV
Subjt:  PMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAV

Query:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL
        SFAICK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHLA+ IGLGKLL
Subjt:  SFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLL

Query:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        RFDLK LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIAIATFLGIGFG+M LKYM
Subjt:  RFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM

SwissProt top hits | e value | % identity
O31634 Uncharacterized membrane protein YjcL | 5.0e-39 | 31.36
Query:  LVSPQDQWGNWTVLFSIGAFGIWSE-KTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSV
        L+S  D W  W  +    A  I  E + K  SA+SGA+++    +  +N G++  ++P + TV  +++PLA+PLLLF+ ++R++ K +  LL  FL+ SV
Subjt:  LVSPQDQWGNWTVLFSIGAFGIWSE-KTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSV

Query:  GTTVGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS--------NVP-PESTTLDNGVE
        GT +G+++A+FL+       D  KI   +   +IGG VN+ A++         ++A + ADN + A+ F  L ++ +         +P  E    D    
Subjt:  GTTVGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALAS--------NVP-PESTTLDNGVE

Query:  KDAE-----VEPSNKLPVLQSASAIAVSFAICKVGSYLTKYF------GIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGN
          AE      + S K     + +A A+     KV  Y    F      G  G     +T++ V++  +FP+ F  L  S E +   L+ +FF V+G   +
Subjt:  KDAE-----VEPSNKLPVLQSASAIAVSFAICKVGSYLTKYF------GIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGN

Query:  VWSVINTAPSIFIFAFVQISVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG
        +  ++  AP I +F F+    +LA+ +  GKL R  L+ +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G
Subjt:  VWSVINTAPSIFIFAFVQISVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFG

Arabidopsis top hits | e value | % identity
AT5G24000.1 Protein of unknown function (DUF819) | 6.3e-130 | 61.93
Query:  VQSRRDVAVKSHLKLNLPLVSPQDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRR
        V SRR   VK + +L  PL+SP D W  W  LF+ GAFG+WSEKTKIGS +SGAL STL+GLAASN  +I  + P++   +EFLLP  +PLLLFRADLRR
Subjt:  VQSRRDVAVKSHLKLNLPLVSPQDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRR

Query:  VIKSTGTLLLAFLLGSVGTTVGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPES
        +I+STG+LLLAFL+GSV T VGTVVA+ LVPMRSLG D+WKIAAALMG +IGG++N+VAIS+AL +SPSV+AAG+A DNVICA++F  LFALAS +PPE+
Subjt:  VIKSTGTLLLAFLLGSVGTTVGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPES

Query:  TTL---DNGVEKDAEVEPSNKLPVLQSASAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGN
         +    D  + KD ++E  N+  V+ ++ A++VSF ICK    LT  F IQG  +PA+TA+ +VLAT FP  F  LAPS E ++LILMQVFF ++GA+G+
Subjt:  TTL---DNGVEKDAEVEPSNKLPVLQSASAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGN

Query:  VWSVINTAPSIFIFAFVQISVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLK
        VW+VINTAPSIF+FA +Q+ VHLA+ + LGKL   D+K LL+ASNAN+GGPTTAC MATAKGW+S+V+PGIL+G+FG++IATFLGIG G+ VLK
Subjt:  VWSVINTAPSIFIFAFVQISVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLK

AT5G52540.1 Protein of unknown function (DUF819) | 1.9e-155 | 74.11
Query:  RDVAVKSHLKLNLPLVSPQDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKS
        R V V S   L+ PL+SP D+WG WT LF+ GA G+WSEKTK+G+A+SGALVSTLVGLAASN GII+S APAFA VL FLLPLAVPLLLFRADLRRV++S
Subjt:  RDVAVKSHLKLNLPLVSPQDQWGNWTVLFSIGAFGIWSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKS

Query:  TGTLLLAFLLGSVGTTVGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASN-----VPPE
        TG LLLAFL+GSV TTVGT +AY+LVPM+SLG DSWKIAAALMGRHIGGAVNYVAIS+ALGV+PSVLAAGLAADNVICAVYF TLFAL S      VPP 
Subjt:  TGTLLLAFLLGSVGTTVGTVVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASN-----VPPE

Query:  STTLDNGVEKDAEVEPSNKLPVLQSASAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVW
        +T +D   E +   E  NK+PVL  A+ IAVS AICK G+ LTKYFGI GGS+PAITAV+V+LAT+FP  F  LAPSGEAMALILMQVFF VVGASGN+W
Subjt:  STTLDNGVEKDAEVEPSNKLPVLQSASAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIVVLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVW

Query:  SVINTAPSIFIFAFVQISVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM
        SVINTAPSIF+FA VQI  HLA+I+G+GKLL  +L+ LL+ASNANVGGPTTA GMATAKGW+S+++PGILAGIFGIAIATF+GI FG+ VLK+M
Subjt:  SVINTAPSIFIFAFVQISVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILAGIFGIAIATFLGIGFGMMVLKYM


Sequences
CDS sequence
ATGGCTTCACAGTCACACCTCGCGATTCTTCACTCGAACTCGCCTGAATTACAGCCTCCATGTTTTTCTTCCAGCAAATTCTCTGTTGGGTTCTCCAGGAGCATCGCGAT
GGCACCTCACCCACCGCTGCAATCGTTATCTTCCTCATCGTTGGCTGCTAAAATCACATGCCCGAGATTCTGGAGCTTTCGGAAATCCAGCAACGGAAATGTTCAATCTA
GACGAGATGTTGCTGTTAAATCTCACCTGAAATTGAATCTCCCCCTCGTTTCTCCGCAGGATCAGTGGGGCAACTGGACTGTTTTATTCTCCATAGGAGCCTTCGGTATC
TGGTCCGAGAAAACGAAGATTGGTAGTGCATTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTTGCAGCCAGTAATTTTGGGATCATTGCATCTGATGCTCCAGCTTT
TGCGACTGTTTTGGAGTTTTTGCTACCGTTAGCAGTTCCATTGCTGTTATTTAGAGCAGATTTGCGTCGTGTAATAAAGTCAACCGGGACTCTTCTCTTGGCATTTTTGT
TAGGTTCAGTTGGAACAACAGTTGGAACTGTGGTGGCCTATTTTCTTGTACCAATGCGATCACTGGGTCAAGACAGTTGGAAAATTGCCGCCGCACTGATGGGAAGACAT
ATTGGTGGAGCTGTCAATTATGTTGCCATATCTGATGCTCTGGGTGTTTCTCCATCAGTGTTAGCTGCTGGACTTGCAGCAGATAATGTTATTTGTGCAGTGTATTTTGC
AACATTGTTTGCATTAGCATCAAACGTACCTCCTGAATCTACGACATTGGATAACGGCGTTGAGAAGGATGCAGAAGTTGAGCCTAGCAACAAGCTTCCGGTGTTACAAT
CTGCCTCAGCCATCGCTGTATCATTTGCCATTTGTAAAGTTGGTTCCTACTTGACCAAATATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATTACAGCCGTCATTGTT
GTTTTAGCAACCATTTTTCCTAAGCTGTTTGCTTACCTTGCTCCTTCTGGTGAAGCTATGGCTCTGATTCTAATGCAGGTTTTCTTCGCTGTAGTGGGAGCAAGTGGAAA
TGTATGGAGTGTCATCAACACTGCACCAAGTATCTTCATATTTGCTTTTGTCCAGATTTCAGTCCATCTTGCCATAATCATTGGCCTTGGGAAGCTGCTTCGCTTCGACC
TAAAGTCGTTGCTCATAGCATCGAACGCGAACGTTGGAGGTCCCACAACAGCTTGCGGGATGGCCACAGCTAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTTGCT
GGAATTTTCGGAATCGCTATCGCAACTTTCCTAGGGATTGGATTTGGAATGATGGTCTTGAAATACATGTAG
mRNA sequence
ATGGCTTCACAGTCACACCTCGCGATTCTTCACTCGAACTCGCCTGAATTACAGCCTCCATGTTTTTCTTCCAGCAAATTCTCTGTTGGGTTCTCCAGGAGCATCGCGAT
GGCACCTCACCCACCGCTGCAATCGTTATCTTCCTCATCGTTGGCTGCTAAAATCACATGCCCGAGATTCTGGAGCTTTCGGAAATCCAGCAACGGAAATGTTCAATCTA
GACGAGATGTTGCTGTTAAATCTCACCTGAAATTGAATCTCCCCCTCGTTTCTCCGCAGGATCAGTGGGGCAACTGGACTGTTTTATTCTCCATAGGAGCCTTCGGTATC
TGGTCCGAGAAAACGAAGATTGGTAGTGCATTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTTGCAGCCAGTAATTTTGGGATCATTGCATCTGATGCTCCAGCTTT
TGCGACTGTTTTGGAGTTTTTGCTACCGTTAGCAGTTCCATTGCTGTTATTTAGAGCAGATTTGCGTCGTGTAATAAAGTCAACCGGGACTCTTCTCTTGGCATTTTTGT
TAGGTTCAGTTGGAACAACAGTTGGAACTGTGGTGGCCTATTTTCTTGTACCAATGCGATCACTGGGTCAAGACAGTTGGAAAATTGCCGCCGCACTGATGGGAAGACAT
ATTGGTGGAGCTGTCAATTATGTTGCCATATCTGATGCTCTGGGTGTTTCTCCATCAGTGTTAGCTGCTGGACTTGCAGCAGATAATGTTATTTGTGCAGTGTATTTTGC
AACATTGTTTGCATTAGCATCAAACGTACCTCCTGAATCTACGACATTGGATAACGGCGTTGAGAAGGATGCAGAAGTTGAGCCTAGCAACAAGCTTCCGGTGTTACAAT
CTGCCTCAGCCATCGCTGTATCATTTGCCATTTGTAAAGTTGGTTCCTACTTGACCAAATATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATTACAGCCGTCATTGTT
GTTTTAGCAACCATTTTTCCTAAGCTGTTTGCTTACCTTGCTCCTTCTGGTGAAGCTATGGCTCTGATTCTAATGCAGGTTTTCTTCGCTGTAGTGGGAGCAAGTGGAAA
TGTATGGAGTGTCATCAACACTGCACCAAGTATCTTCATATTTGCTTTTGTCCAGATTTCAGTCCATCTTGCCATAATCATTGGCCTTGGGAAGCTGCTTCGCTTCGACC
TAAAGTCGTTGCTCATAGCATCGAACGCGAACGTTGGAGGTCCCACAACAGCTTGCGGGATGGCCACAGCTAAGGGTTGGAGTTCAATGGTTATTCCTGGAATTCTTGCT
GGAATTTTCGGAATCGCTATCGCAACTTTCCTAGGGATTGGATTTGGAATGATGGTCTTGAAATACATGTAG
Protein sequence
MASQSHLAILHSNSPELQPPCFSSSKFSVGFSRSIAMAPHPPLQSLSSSSLAAKITCPRFWSFRKSSNGNVQSRRDVAVKSHLKLNLPLVSPQDQWGNWTVLFSIGAFGI
WSEKTKIGSALSGALVSTLVGLAASNFGIIASDAPAFATVLEFLLPLAVPLLLFRADLRRVIKSTGTLLLAFLLGSVGTTVGTVVAYFLVPMRSLGQDSWKIAAALMGRH
IGGAVNYVAISDALGVSPSVLAAGLAADNVICAVYFATLFALASNVPPESTTLDNGVEKDAEVEPSNKLPVLQSASAIAVSFAICKVGSYLTKYFGIQGGSMPAITAVIV
VLATIFPKLFAYLAPSGEAMALILMQVFFAVVGASGNVWSVINTAPSIFIFAFVQISVHLAIIIGLGKLLRFDLKSLLIASNANVGGPTTACGMATAKGWSSMVIPGILA
GIFGIAIATFLGIGFGMMVLKYM
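
As a quick consistency check between the CDS and protein sequences listed above, the Python sketch below translates the first ten codons of the CDS and compares the result with the start of the protein sequence. The function name `translate` is illustrative and not part of CuGenDB, and the use of the standard genetic code is an assumption.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored. Ambiguous
    bases such as N are not handled and raise KeyError.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt of the CDS sequence and first 10 residues of the protein sequence.
cds_start = "ATGGCTTCACAGTCACACCTCGCGATTCTT"
protein_start = "MASQSHLAIL"

print(translate(cds_start) == protein_start)  # prints True
```

The same function can be applied to the full CDS after joining its lines; the final codon TAG then translates to the `*` stop marker, which is absent from the listed protein sequence.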