; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000429 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000429
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF819)
Genome locationscaffold44:1635771..1638673
RNA-Seq ExpressionMS000429
SyntenyMS000429
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008537 - Protein of unknown function DUF819


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439290.1 PREDICTED: uncharacterized membrane protein YjcL-like [Cucumis melo]1.5e-19781.21Show/hide
Query:  MASQ--VAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWT
        MASQ  +AIL +  PE+QPPCFSS+K S  F RSI MAP PP+  LSSSS AA+I   RFW+F  +S+GN Q RR +AVKSHLKLNLPL+SP DQWGNWT
Subjt:  MASQ--VAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLV
        VLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAF  VLEFLLPL+V LLLFRADLRRVI STGTLLLAFLLGSVGTT+GT VAYFLV
Subjt:  VLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLV

Query:  PMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAV
        PMRSLGQD+WKIAAALMGRHI GAVNYVA S ALGVSPSV+AAGLAADNVI AVYFATLFALASK+P E T   + V+KDAE E +NKLPVLQSA  +AV
Subjt:  PMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAV

Query:  SFAICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLL
        SFAICK GSYLTK+FGIQGGSMPAITA +V+LATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHLAI +GLGKLL
Subjt:  SFAICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLL

Query:  RFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        RFDLK LL+ASNAN+GGPTTACGM TAKGWSSM +PGILAGIFGIAIATFLGIGFG M LKYM
Subjt:  RFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

XP_022140902.1 uncharacterized protein LOC111011457 isoform X1 [Momordica charantia]1.4e-24099.33Show/hide
Query:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
        MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTS GNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
Subjt:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM

Query:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
        RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
Subjt:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGL KLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF

Query:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGI
        DLKLLLLASNANIGGPTTACGM TAKGWSSMAVPGILAGIFGIAIATFLGI
Subjt:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGI

XP_022140903.1 uncharacterized protein LOC111011457 isoform X2 [Momordica charantia]1.4e-24099.33Show/hide
Query:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
        MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTS GNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
Subjt:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM

Query:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
        RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
Subjt:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGL KLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF

Query:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGI
        DLKLLLLASNANIGGPTTACGM TAKGWSSMAVPGILAGIFGIAIATFLGI
Subjt:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGI

XP_022140905.1 uncharacterized protein LOC111011457 isoform X3 [Momordica charantia]2.3e-22791.76Show/hide
Query:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
        MASQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS SSGNF LRR IAVKSHLKLNLPLISPHDQW NWTVL
Subjt:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM

Query:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
        RSLGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSF
Subjt:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF

Query:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLL+ASNAN+GGPTTACGM TAKGWSSM VPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

XP_022140906.1 uncharacterized protein LOC111011457 isoform X4 [Momordica charantia]2.3e-22791.76Show/hide
Query:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
        MASQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS SSGNF LRR IAVKSHLKLNLPLISPHDQW NWTVL
Subjt:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM

Query:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
        RSLGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSF
Subjt:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF

Query:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLL+ASNAN+GGPTTACGM TAKGWSSM VPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

TrEMBL top hitse value%identityAlignment
A0A1S3AZ41 uncharacterized membrane protein YjcL-like7.1e-19881.21Show/hide
Query:  MASQ--VAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWT
        MASQ  +AIL +  PE+QPPCFSS+K S  F RSI MAP PP+  LSSSS AA+I   RFW+F  +S+GN Q RR +AVKSHLKLNLPL+SP DQWGNWT
Subjt:  MASQ--VAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWT

Query:  VLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLV
        VLFS+GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAF  VLEFLLPL+V LLLFRADLRRVI STGTLLLAFLLGSVGTT+GT VAYFLV
Subjt:  VLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLV

Query:  PMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAV
        PMRSLGQD+WKIAAALMGRHI GAVNYVA S ALGVSPSV+AAGLAADNVI AVYFATLFALASK+P E T   + V+KDAE E +NKLPVLQSA  +AV
Subjt:  PMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAV

Query:  SFAICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLL
        SFAICK GSYLTK+FGIQGGSMPAITA +V+LATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHLAI +GLGKLL
Subjt:  SFAICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLL

Query:  RFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        RFDLK LL+ASNAN+GGPTTACGM TAKGWSSM +PGILAGIFGIAIATFLGIGFG M LKYM
Subjt:  RFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X31.1e-22791.76Show/hide
Query:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
        MASQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS SSGNF LRR IAVKSHLKLNLPLISPHDQW NWTVL
Subjt:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM

Query:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
        RSLGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSF
Subjt:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF

Query:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLL+ASNAN+GGPTTACGM TAKGWSSM VPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CHF3 uncharacterized protein LOC111011457 isoform X41.1e-22791.76Show/hide
Query:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
        MASQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS SSGNF LRR IAVKSHLKLNLPLISPHDQW NWTVL
Subjt:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM

Query:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
        RSLGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSF
Subjt:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF

Query:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLL+ASNAN+GGPTTACGM TAKGWSSM VPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X16.8e-24199.33Show/hide
Query:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
        MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTS GNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
Subjt:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM

Query:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
        RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
Subjt:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGL KLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF

Query:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGI
        DLKLLLLASNANIGGPTTACGM TAKGWSSMAVPGILAGIFGIAIATFLGI
Subjt:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGI

A0A6J1CJ32 uncharacterized protein LOC111011457 isoform X26.8e-24199.33Show/hide
Query:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
        MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTS GNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL
Subjt:  MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPM

Query:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
        RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF
Subjt:  RSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGL KLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRF

Query:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGI
        DLKLLLLASNANIGGPTTACGM TAKGWSSMAVPGILAGIFGIAIATFLGI
Subjt:  DLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGI

SwissProt top hitse value%identityAlignment
O31634 Uncharacterized membrane protein YjcL3.7e-3429.04Show/hide
Query:  LISPHDQWGNWTVLFSVGAFGIWSE-KTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSV
        LIS  D W  W  +    A  I  E + K  SA+SGA+++    +  +NVG++  ++P +  V  +++PL++ LLLF+ ++R++   +  LL  FL+ SV
Subjt:  LISPHDQWGNWTVLFSVGAFGIWSE-KTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSV

Query:  GTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALAS--------KLPAEPTPSSD-NVK
        GT +G+ +A+FL+       D  KI   +   +I G VN+ A +         ++A + ADN + A+ F  L ++ +         +P E    +D N  
Subjt:  GTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALAS--------KLPAEPTPSSD-NVK

Query:  KDAEAEHNNK-LPVLQSAICLAVSFAICKAGSYLTKHF----------GIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGN
          AE+    K + +   A     +FA+      ++ +F          G  G     +T+  V++  +FP+ F  L  S E +   L+ +FF V+G   +
Subjt:  KDAEAEHNNK-LPVLQSAICLAVSFAICKAGSYLTKHF----------GIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGN

Query:  IWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        +  ++  AP I +F  +    +LA+++  GKL R  L+ +LLA NA +GGPTTA  M  AKGW  +  P +L G  G  I  ++G   G  F  ++
Subjt:  IWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

Arabidopsis top hitse value%identityAlignment
AT5G24000.1 Protein of unknown function (DUF819)1.0e-12458.59Show/hide
Query:  PPVSHLSSSSPAAEIGDRRFWNFPST-SSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGI
        P   HLSSS     I  RR  + P T +       RR  VK + +L  PLISP D W  W  LF+ GAFG+WSEKTKIGS +SGAL STL+GLAASN+ +
Subjt:  PPVSHLSSSSPAAEIGDRRFWNFPST-SSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGI

Query:  IASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPS
        I  + P++   +EFLLP ++ LLLFRADLRR+I STG+LLLAFL+GSV T +GT VA+ LVPMRSLG DNWKIAAALMG +I G++N+VA S AL +SPS
Subjt:  IASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPS

Query:  VMAAGLAADNVISAVYFATLFALASKLPAEPTPSSD---NVKKDAEAEHNNKLPVLQSAICLAVSFAICKAGSYLTKHFGIQGGSMPAITAAVVILATIF
        V+AAG+A DNVI A++F  LFALASK+P E   +S    ++ KD + E  N+  V+ ++I L+VSF ICKA   LT  F IQG  +PA+TA  ++LAT F
Subjt:  VMAAGLAADNVISAVYFATLFALASKLPAEPTPSSD---NVKKDAEAEHNNKLPVLQSAICLAVSFAICKAGSYLTKHFGIQGGSMPAITAAVVILATIF

Query:  PKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVP
        P  F  LAPS E +++ILMQVFF ++GA+G++W+VINTAPSIF+F+ +Q+ VHLA+T+ LGKL   D+KLLLLASNANIGGPTTAC M TAKGW+S+ VP
Subjt:  PKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVP

Query:  GILAGIFGIAIATFLGIGFGLMFLK
        GIL+G+FG++IATFLGIG G+  LK
Subjt:  GILAGIFGIAIATFLGIGFGLMFLK

AT5G52540.1 Protein of unknown function (DUF819)5.4e-15066.38Show/hide
Query:  SSTKSSARFFRS----ITMAPRPPVSHLSSS-----SPAA----EIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFG
        +S  SS+R FR     ++    P  S  S+S     SPA+      G  RF + P +SS      R + V S   L+ PLISP+D+WG WT LF+ GA G
Subjt:  SSTKSSARFFRS----ITMAPRPPVSHLSSS-----SPAA----EIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFG

Query:  IWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDN
        +WSEKTK+G+A+SGALVSTLVGLAASN+GII+S APAF +VL FLLPL+V LLLFRADLRRV+ STG LLLAFL+GSV TT+GTA+AY+LVPM+SLG D+
Subjt:  IWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDN

Query:  WKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAE----AEHNNKLPVLQSAICLAVSFAIC
        WKIAAALMGRHI GAVNYVA S ALGV+PSV+AAGLAADNVI AVYF TLFAL SK+PAE  P    +  DAE    +E  NK+PVL  A  +AVS AIC
Subjt:  WKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAE----AEHNNKLPVLQSAICLAVSFAIC

Query:  KAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRFDLK
        KAG+ LTK+FGI GGS+PAITA VVILAT+FP  F  LAPSGEAMA+ILMQVFF VVGASGNIWSVINTAPSIF+F+LVQI  HLA+ +G+GKLL  +L+
Subjt:  KAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRFDLK

Query:  LLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        LLLLASNAN+GGPTTA GM TAKGW+S+ VPGILAGIFGIAIATF+GI FG+  LK+M
Subjt:  LLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCGCAAGTTGCAATTCTTCAAACGAAGTTGCCGGAAATCCAGCCGCCATGTTTTTCTTCCACCAAAAGCTCAGCCAGGTTCTTCAGGAGCATTACGATGGCACC
TCGGCCACCGGTGTCCCATTTATCGTCGTCATCACCAGCTGCTGAAATTGGAGATCGGAGATTCTGGAATTTTCCTAGCACTAGTTCCGGAAATTTTCAATTGCGACGGC
GTATTGCTGTAAAATCTCATTTGAAATTGAATCTCCCGCTAATTTCTCCGCATGATCAGTGGGGCAACTGGACTGTTTTATTTTCCGTAGGAGCCTTCGGTATCTGGTCC
GAGAAAACGAAGATTGGCAGTGCACTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTTGCAGCCAGTAATGTTGGGATTATTGCATCTGATGCTCCAGCTTTTCCCAT
TGTTTTGGAGTTTTTGCTACCGCTATCAGTTCTTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTAATAAATTCAACTGGCACTCTTCTCTTGGCCTTTTTGTTAGGTT
CAGTTGGAACAACAATTGGAACCGCAGTGGCCTATTTTCTCGTACCAATGCGATCGCTTGGTCAAGACAATTGGAAAATCGCTGCTGCACTAATGGGAAGACATATTTGT
GGAGCTGTCAATTATGTTGCTAGTTCTGGTGCTCTTGGTGTTTCTCCATCAGTAATGGCTGCTGGACTTGCTGCAGATAATGTAATTTCTGCAGTTTATTTTGCAACATT
GTTTGCATTAGCATCTAAACTACCTGCTGAACCTACACCATCAAGCGATAATGTCAAGAAGGATGCAGAGGCTGAGCATAACAATAAACTTCCAGTATTACAATCTGCCA
TATGCCTTGCTGTATCATTTGCCATTTGTAAGGCTGGTTCCTACCTGACCAAACATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATCACAGCTGCAGTCGTGATCTTG
GCAACCATTTTTCCTAAGCCGTTTGCTTACCTTGCACCTTCTGGTGAGGCTATGGCTGTGATTCTGATGCAGGTTTTCTTTAATGTAGTGGGAGCAAGTGGGAATATATG
GAGTGTCATCAACACTGCACCAAGTATCTTCATGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCCATAACCGTTGGTCTCGGAAAGCTGCTTCGTTTCGACCTAAAGT
TGTTGCTGCTAGCATCGAATGCGAACATTGGAGGTCCCACGACAGCGTGCGGGATGGGCACAGCAAAGGGTTGGAGTTCAATGGCTGTTCCTGGAATTCTTGCTGGAATT
TTCGGAATTGCGATTGCAACTTTCCTAGGTATTGGATTTGGATTGATGTTCTTGAAATACATG
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCGCAAGTTGCAATTCTTCAAACGAAGTTGCCGGAAATCCAGCCGCCATGTTTTTCTTCCACCAAAAGCTCAGCCAGGTTCTTCAGGAGCATTACGATGGCACC
TCGGCCACCGGTGTCCCATTTATCGTCGTCATCACCAGCTGCTGAAATTGGAGATCGGAGATTCTGGAATTTTCCTAGCACTAGTTCCGGAAATTTTCAATTGCGACGGC
GTATTGCTGTAAAATCTCATTTGAAATTGAATCTCCCGCTAATTTCTCCGCATGATCAGTGGGGCAACTGGACTGTTTTATTTTCCGTAGGAGCCTTCGGTATCTGGTCC
GAGAAAACGAAGATTGGCAGTGCACTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTTGCAGCCAGTAATGTTGGGATTATTGCATCTGATGCTCCAGCTTTTCCCAT
TGTTTTGGAGTTTTTGCTACCGCTATCAGTTCTTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTAATAAATTCAACTGGCACTCTTCTCTTGGCCTTTTTGTTAGGTT
CAGTTGGAACAACAATTGGAACCGCAGTGGCCTATTTTCTCGTACCAATGCGATCGCTTGGTCAAGACAATTGGAAAATCGCTGCTGCACTAATGGGAAGACATATTTGT
GGAGCTGTCAATTATGTTGCTAGTTCTGGTGCTCTTGGTGTTTCTCCATCAGTAATGGCTGCTGGACTTGCTGCAGATAATGTAATTTCTGCAGTTTATTTTGCAACATT
GTTTGCATTAGCATCTAAACTACCTGCTGAACCTACACCATCAAGCGATAATGTCAAGAAGGATGCAGAGGCTGAGCATAACAATAAACTTCCAGTATTACAATCTGCCA
TATGCCTTGCTGTATCATTTGCCATTTGTAAGGCTGGTTCCTACCTGACCAAACATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATCACAGCTGCAGTCGTGATCTTG
GCAACCATTTTTCCTAAGCCGTTTGCTTACCTTGCACCTTCTGGTGAGGCTATGGCTGTGATTCTGATGCAGGTTTTCTTTAATGTAGTGGGAGCAAGTGGGAATATATG
GAGTGTCATCAACACTGCACCAAGTATCTTCATGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCCATAACCGTTGGTCTCGGAAAGCTGCTTCGTTTCGACCTAAAGT
TGTTGCTGCTAGCATCGAATGCGAACATTGGAGGTCCCACGACAGCGTGCGGGATGGGCACAGCAAAGGGTTGGAGTTCAATGGCTGTTCCTGGAATTCTTGCTGGAATT
TTCGGAATTGCGATTGCAACTTTCCTAGGTATTGGATTTGGATTGATGTTCTTGAAATACATG
Protein sequenceShow/hide protein sequence
MASQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSSGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWS
EKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHIC
GAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAICKAGSYLTKHFGIQGGSMPAITAAVVIL
ATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLGKLLRFDLKLLLLASNANIGGPTTACGMGTAKGWSSMAVPGILAGI
FGIAIATFLGIGFGLMFLKYM