; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000428 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000428
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF819)
Genome locationscaffold44:1630584..1634464
RNA-Seq ExpressionMS000428
SyntenyMS000428
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008537 - Protein of unknown function DUF819


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140757.1 uncharacterized protein LOC101211894 isoform X1 [Cucumis sativus]6.8e-20383.22Show/hide
Query:  SQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFS
        SQ+ IL SKSP+LQ PCFSSTK S  F RSI+MA  PP+  +SSSS  AEI   RFW+F  +S+GN   RR +AV+SHLKLNLPL+SP+DQW NWTVLFS
Subjt:  SQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRS
        +GAFGIWSEKTK+GSALSGALVSTLVGLAASN GIIASDAPAF +VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT +GT VAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRS

Query:  LGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAI
        LGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT   + VGKD E E +NKLPVLQSA+A+AVSFAI
Subjt:  LGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDL
        CK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHL + IGLGKLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDL

Query:  KLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        K LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIA+ATFLGIGFG+M LKYM
Subjt:  KLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

XP_022140902.1 uncharacterized protein LOC111011457 isoform X1 [Momordica charantia]6.9e-248100Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

XP_022140903.1 uncharacterized protein LOC111011457 isoform X2 [Momordica charantia]1.0e-21991.35Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS S GNF LRR IAVKSHLKLNLPLISPHDQW NWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GL KLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGI
        DLKLLL+ASNAN+GGPTTACGMATAKGWSSM VPGILAGIFGIAIATFLGI
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGI

XP_022140905.1 uncharacterized protein LOC111011457 isoform X3 [Momordica charantia]6.9e-248100Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

XP_022140906.1 uncharacterized protein LOC111011457 isoform X4 [Momordica charantia]6.9e-248100Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

TrEMBL top hitse value%identityAlignment
A0A0A0L600 Uncharacterized protein3.3e-20383.22Show/hide
Query:  SQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFS
        SQ+ IL SKSP+LQ PCFSSTK S  F RSI+MA  PP+  +SSSS  AEI   RFW+F  +S+GN   RR +AV+SHLKLNLPL+SP+DQW NWTVLFS
Subjt:  SQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRS
        +GAFGIWSEKTK+GSALSGALVSTLVGLAASN GIIASDAPAF +VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT +GT VAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRS

Query:  LGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAI
        LGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT   + VGKD E E +NKLPVLQSA+A+AVSFAI
Subjt:  LGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDL
        CK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHL + IGLGKLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDL

Query:  KLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        K LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIA+ATFLGIGFG+M LKYM
Subjt:  KLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X33.4e-248100Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CHF3 uncharacterized protein LOC111011457 isoform X43.4e-248100Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X13.4e-248100Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CJ32 uncharacterized protein LOC111011457 isoform X25.1e-22091.35Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS S GNF LRR IAVKSHLKLNLPLISPHDQW NWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GL KLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGI
        DLKLLL+ASNAN+GGPTTACGMATAKGWSSM VPGILAGIFGIAIATFLGI
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGI

SwissProt top hitse value%identityAlignment
O31634 Uncharacterized membrane protein YjcL1.4e-3831.06Show/hide
Query:  LISPHDQWANWTVLFSVGAFGIWSE-KTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSV
        LIS  D W  W  +    A  I  E + K  SA+SGA+++    +  +NVG++  ++P +  V   ++PL+IPLLLF+ ++R++ K +  LL  FL+ SV
Subjt:  LISPHDQWANWTVLFSVGAFGIWSE-KTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSV

Query:  GTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALAS--------KVPAEPTPSSD-NVG
        GT++G+ +A+FL+       D  KI   +   +IGG VN+ A++         ++A + ADN + A+ F  L ++ +         +P E    +D N G
Subjt:  GTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALAS--------KVPAEPTPSSD-NVG

Query:  KDPEAEHNNKLPVLQ-----SATALAVSFAICKAGSYLTKHF------GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGN
           E+    K   L+     +  A A+     K   Y    F      G  G     +T++ V++  +FP+ F  L  S E +   L+ +FF V+G   +
Subjt:  KDPEAEHNNKLPVLQ-----SATALAVSFAICKAGSYLTKHF------GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGN

Query:  IWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        +  ++  AP I +F  +    +LA+++  GKL R  L+ +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G  F  ++
Subjt:  IWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

Arabidopsis top hitse value%identityAlignment
AT5G24000.1 Protein of unknown function (DUF819)4.1e-12962.69Show/hide
Query:  VKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTL
        VK + +L  PLISP D W+ W  LF+ GAFG+WSEKTKIGS +SGAL STL+GLAASN+ +I  + P++   +E LLP +IPLLLFRADLRR+I+STG+L
Subjt:  VKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTL

Query:  LLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSD---
        LLAFL+GSV TI+GT VA+ LVPMRSLG D+WKIAAALMG +IGG++N+VAIS AL +SPSV+AAG+A DNVICA++F  LFALASK+P E   +S    
Subjt:  LLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSD---

Query:  NVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTA
        ++ KD + E  N+  V+ ++ AL+VSF ICKA   LT  F IQG  +PA+TA+ +VLAT FP  F  LAPS E +++ILMQVFFT++GA+G++W+VINTA
Subjt:  NVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTA

Query:  PSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLK
        PSIF+F+ +Q+ VHLA+T+ LGKL   D+KLLL+ASNAN+GGPTTAC MATAKGW+S+VVPGIL+G+FG++IATFLGIG G+  LK
Subjt:  PSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLK

AT5G52540.1 Protein of unknown function (DUF819)5.6e-15566.3Show/hide
Query:  SSTKSSARFFRSIT-MAPRPPVPPVSSSSPAAEI------------GDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFG
        +S  SS+R FR  + +  R  +P  S+ S ++ +            G  RF + P +SS      R + V S   L+ PLISP+D+W  WT LF+ GA G
Subjt:  SSTKSSARFFRSIT-MAPRPPVPPVSSSSPAAEI------------GDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFG

Query:  IWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDS
        +WSEKTK+G+A+SGALVSTLVGLAASN+GII+S APAF VVL  LLPL++PLLLFRADLRRV++STG LLLAFL+GSV T +GTA+AY+LVPM+SLG DS
Subjt:  IWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDS

Query:  WKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNV---GKDPEAEHNNKLPVLQSATALAVSFAICK
        WKIAAALMGRHIGGAVNYVAIS ALGV+PSVLAAGLAADNVICAVYF TLFAL SK+PAE  P    +     +  +E  NK+PVL  AT +AVS AICK
Subjt:  WKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNV---GKDPEAEHNNKLPVLQSATALAVSFAICK

Query:  AGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKL
        AG+ LTK+FGI GGS+PAITAV+V+LAT+FP  F  LAPSGEAMA+ILMQVFFTVVGASGNIWSVINTAPSIF+F+LVQI  HLA+ +G+GKLL  +L+L
Subjt:  AGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKL

Query:  LLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        LL+ASNANVGGPTTA GMATAKGW+S++VPGILAGIFGIAIATF+GI FG+  LK+M
Subjt:  LLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCGCAAGTCGCAATTCTTCAATCGAAGTCACCGCAACTCCAGCTGCCATGTTTTTCTTCCACCAAAAGCTCAGCCAGGTTCTTCAGGAGCATCACAATGGCACC
TCGCCCACCGGTGCCCCCTGTTTCGTCATCATCACCAGCTGCTGAAATTGGAGATCGGAGATTCTGGAATTTTCCTAGCAATAGCTCCGGAAATTTTCATTTGAGACGAT
GTATTGCTGTAAAATCTCATTTGAAATTGAATCTCCCACTAATTTCTCCGCATGATCAGTGGGCCAACTGGACTGTTTTGTTTTCCGTAGGAGCCTTCGGTATCTGGTCC
GAGAAAACGAAGATTGGCAGTGCACTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTCGCAGCCAGTAATGTTGGGATTATTGCATCTGATGCTCCAGCTTTCCCTGT
TGTTTTGGAGCTTTTGCTACCGCTATCAATTCCTTTGCTGTTATTTAGAGCAGACTTGCGTCGTGTAATAAAGTCTACTGGCACACTTCTCTTGGCCTTTTTGTTAGGTT
CAGTTGGAACAATAATTGGAACCGCAGTTGCCTATTTTCTTGTACCAATGCGATCGCTTGGTCAAGACAGTTGGAAAATTGCCGCCGCACTAATGGGAAGACATATTGGT
GGAGCTGTCAATTATGTTGCTATTTCTGGTGCTCTTGGTGTTTCTCCATCGGTATTGGCTGCTGGACTTGCTGCAGATAATGTAATCTGTGCAGTTTATTTTGCAACGTT
GTTTGCATTAGCATCTAAAGTACCTGCTGAACCTACACCATCAAGCGATAATGTCGGGAAGGATCCTGAGGCTGAGCATAACAACAAGCTTCCAGTGTTGCAATCTGCCA
CAGCCCTTGCTGTATCATTTGCCATTTGTAAGGCCGGTTCCTACCTGACCAAACATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATCACAGCAGTCATCGTGGTCTTA
GCGACCATTTTTCCGAAGCCATTTGCTTACCTTGCACCTTCTGGTGAGGCCATGGCTGTTATTCTAATGCAGGTTTTCTTTACTGTAGTGGGAGCGAGTGGGAATATATG
GAGTGTCATCAACACTGCACCAAGTATCTTCATGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCTATGACCATTGGTCTCGGAAAGCTGCTTCGATTCGACCTAAAGT
TGTTGCTGATAGCATCGAATGCAAATGTCGGGGGCCCCACGACAGCGTGCGGGATGGCCACAGCAAAGGGTTGGAGTTCAATGGTTGTTCCTGGAATTCTTGCTGGAATT
TTCGGAATTGCGATTGCAACTTTCCTAGGTATTGGATTTGGATTGATGTTCTTGAAATACATG
mRNA sequenceShow/hide mRNA sequence
ATGGCATCGCAAGTCGCAATTCTTCAATCGAAGTCACCGCAACTCCAGCTGCCATGTTTTTCTTCCACCAAAAGCTCAGCCAGGTTCTTCAGGAGCATCACAATGGCACC
TCGCCCACCGGTGCCCCCTGTTTCGTCATCATCACCAGCTGCTGAAATTGGAGATCGGAGATTCTGGAATTTTCCTAGCAATAGCTCCGGAAATTTTCATTTGAGACGAT
GTATTGCTGTAAAATCTCATTTGAAATTGAATCTCCCACTAATTTCTCCGCATGATCAGTGGGCCAACTGGACTGTTTTGTTTTCCGTAGGAGCCTTCGGTATCTGGTCC
GAGAAAACGAAGATTGGCAGTGCACTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTCGCAGCCAGTAATGTTGGGATTATTGCATCTGATGCTCCAGCTTTCCCTGT
TGTTTTGGAGCTTTTGCTACCGCTATCAATTCCTTTGCTGTTATTTAGAGCAGACTTGCGTCGTGTAATAAAGTCTACTGGCACACTTCTCTTGGCCTTTTTGTTAGGTT
CAGTTGGAACAATAATTGGAACCGCAGTTGCCTATTTTCTTGTACCAATGCGATCGCTTGGTCAAGACAGTTGGAAAATTGCCGCCGCACTAATGGGAAGACATATTGGT
GGAGCTGTCAATTATGTTGCTATTTCTGGTGCTCTTGGTGTTTCTCCATCGGTATTGGCTGCTGGACTTGCTGCAGATAATGTAATCTGTGCAGTTTATTTTGCAACGTT
GTTTGCATTAGCATCTAAAGTACCTGCTGAACCTACACCATCAAGCGATAATGTCGGGAAGGATCCTGAGGCTGAGCATAACAACAAGCTTCCAGTGTTGCAATCTGCCA
CAGCCCTTGCTGTATCATTTGCCATTTGTAAGGCCGGTTCCTACCTGACCAAACATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATCACAGCAGTCATCGTGGTCTTA
GCGACCATTTTTCCGAAGCCATTTGCTTACCTTGCACCTTCTGGTGAGGCCATGGCTGTTATTCTAATGCAGGTTTTCTTTACTGTAGTGGGAGCGAGTGGGAATATATG
GAGTGTCATCAACACTGCACCAAGTATCTTCATGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCTATGACCATTGGTCTCGGAAAGCTGCTTCGATTCGACCTAAAGT
TGTTGCTGATAGCATCGAATGCAAATGTCGGGGGCCCCACGACAGCGTGCGGGATGGCCACAGCAAAGGGTTGGAGTTCAATGGTTGTTCCTGGAATTCTTGCTGGAATT
TTCGGAATTGCGATTGCAACTTTCCTAGGTATTGGATTTGGATTGATGTTCTTGAAATACATG
Protein sequenceShow/hide protein sequence
MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWS
EKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIG
GAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVL
ATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGI
FGIAIATFLGIGFGLMFLKYM