; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0114 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0114
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF819)
Genome locationMC10:783517..786403
RNA-Seq ExpressionMC10g0114
SyntenyMC10g0114
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008537 - Protein of unknown function DUF819


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008439290.1 PREDICTED: uncharacterized membrane protein YjcL-like [Cucumis melo]3.14e-25181.26Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        S +AIL +  PE+QPPCFSS+K S  F RSI MAP PP+  LSSSS AA+I   RFW+F  +S GN Q RR +AVKSHLKLNLPL+SP DQWGNWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        +GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAF  VLEFLLPL+V LLLFRADLRRVI STGTLLLAFLLGSVGTT+GT VAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQD+WKIAAALMGRHI GAVNYVA S ALGVSPSV+AAGLAADNVI AVYFATLFALASK+P E T   + V+KDAE E +NKLPVLQSA  +AVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CK GSYLTK+FGIQGGSMPAITA +V+LATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHLAI +GL KLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        K LL+ASNAN+GGPTTACGMATAKGWSSM +PGILAGIFGIAIATFLGIGFG M LKYM
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

XP_022140902.1 uncharacterized protein LOC111011457 isoform X1 [Momordica charantia]2.23e-302100Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI
        KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI

XP_022140903.1 uncharacterized protein LOC111011457 isoform X2 [Momordica charantia]3.69e-305100Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI
        KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI

XP_022140905.1 uncharacterized protein LOC111011457 isoform X3 [Momordica charantia]1.27e-28391.5Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        SQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS S GNF LRR IAVKSHLKLNLPLISPHDQW NWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GL KLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        KLLL+ASNAN+GGPTTACGMATAKGWSSM VPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

XP_022140906.1 uncharacterized protein LOC111011457 isoform X4 [Momordica charantia]3.28e-28491.5Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        SQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS S GNF LRR IAVKSHLKLNLPLISPHDQW NWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GL KLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        KLLL+ASNAN+GGPTTACGMATAKGWSSM VPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

TrEMBL top hitse value%identityAlignment
A0A1S3AZ41 uncharacterized membrane protein YjcL-like1.52e-25181.26Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        S +AIL +  PE+QPPCFSS+K S  F RSI MAP PP+  LSSSS AA+I   RFW+F  +S GN Q RR +AVKSHLKLNLPL+SP DQWGNWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        +GAFGIWSEKTKIGSALSGALVSTLVGLAASN GIIASDAPAF  VLEFLLPL+V LLLFRADLRRVI STGTLLLAFLLGSVGTT+GT VAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQD+WKIAAALMGRHI GAVNYVA S ALGVSPSV+AAGLAADNVI AVYFATLFALASK+P E T   + V+KDAE E +NKLPVLQSA  +AVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CK GSYLTK+FGIQGGSMPAITA +V+LATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHLAI +GL KLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        K LL+ASNAN+GGPTTACGMATAKGWSSM +PGILAGIFGIAIATFLGIGFG M LKYM
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X36.17e-28491.5Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        SQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS S GNF LRR IAVKSHLKLNLPLISPHDQW NWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GL KLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        KLLL+ASNAN+GGPTTACGMATAKGWSSM VPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CHF3 uncharacterized protein LOC111011457 isoform X41.59e-28491.5Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        SQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS S GNF LRR IAVKSHLKLNLPLISPHDQW NWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GL KLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        KLLL+ASNAN+GGPTTACGMATAKGWSSM VPGILAGIFGIAIATFLGIGFGLMFLKYM
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X11.08e-302100Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI
        KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI

A0A6J1CJ32 uncharacterized protein LOC111011457 isoform X21.79e-305100Show/hide
Query:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
        SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS
Subjt:  SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
        VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRS

Query:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
        LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI
Subjt:  LGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
        CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDL

Query:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI
        KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI
Subjt:  KLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGI

SwissProt top hitse value%identityAlignment
O31634 Uncharacterized membrane protein YjcL1.4e-3329.04Show/hide
Query:  LISPHDQWGNWTVLFSVGAFGIWSE-KTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSV
        LIS  D W  W  +    A  I  E + K  SA+SGA+++    +  +NVG++  ++P +  V  +++PL++ LLLF+ ++R++   +  LL  FL+ SV
Subjt:  LISPHDQWGNWTVLFSVGAFGIWSE-KTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSV

Query:  GTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALAS--------KLPAEPTPSSD-NVK
        GT +G+ +A+FL+       D  KI   +   +I G VN+ A +         ++A + ADN + A+ F  L ++ +         +P E    +D N  
Subjt:  GTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALAS--------KLPAEPTPSSD-NVK

Query:  KDAEAEHNNK-LPVLQSAICLAVSFAICKAGSYLTKHF----------GIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGN
          AE+    K + +   A     +FA+      ++ +F          G  G     +T+  V++  +FP+ F  L  S E +   L+ +FF V+G   +
Subjt:  KDAEAEHNNK-LPVLQSAICLAVSFAICKAGSYLTKHF----------GIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGN

Query:  IWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDLKLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        +  ++  AP I +F  +    +LA+++   KL R  L+ +LLA NA +GGPTTA  MA AKGW  +  P +L G  G  I  ++G   G  F  ++
Subjt:  IWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDLKLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM

Arabidopsis top hitse value%identityAlignment
AT5G24000.1 Protein of unknown function (DUF819)3.0e-12461.03Show/hide
Query:  RRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINS
        RR  VK + +L  PLISP D W  W  LF+ GAFG+WSEKTKIGS +SGAL STL+GLAASN+ +I  + P++   +EFLLP ++ LLLFRADLRR+I S
Subjt:  RRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINS

Query:  TGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSS
        TG+LLLAFL+GSV T +GT VA+ LVPMRSLG DNWKIAAALMG +I G++N+VA S AL +SPSV+AAG+A DNVI A++F  LFALASK+P E   +S
Subjt:  TGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSS

Query:  D---NVKKDAEAEHNNKLPVLQSAICLAVSFAICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSV
            ++ KD + E  N+  V+ ++I L+VSF ICKA   LT  F IQG  +PA+TA  ++LAT FP  F  LAPS E +++ILMQVFF ++GA+G++W+V
Subjt:  D---NVKKDAEAEHNNKLPVLQSAICLAVSFAICKAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSV

Query:  INTAPSIFMFSLVQIAVHLAITVGLRKLLRFDLKLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLK
        INTAPSIF+F+ +Q+ VHLA+T+ L KL   D+KLLLLASNANIGGPTTAC MATAKGW+S+ VPGIL+G+FG++IATFLGIG G+  LK
Subjt:  INTAPSIFMFSLVQIAVHLAITVGLRKLLRFDLKLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLK

AT5G52540.1 Protein of unknown function (DUF819)1.6e-14966.16Show/hide
Query:  SSTKSSARFFRS----ITMAPRPPVSHLSSS-----SPAA----EIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFG
        +S  SS+R FR     ++    P  S  S+S     SPA+      G  RF +  S+S       R + V S   L+ PLISP+D+WG WT LF+ GA G
Subjt:  SSTKSSARFFRS----ITMAPRPPVSHLSSS-----SPAA----EIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFG

Query:  IWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDN
        +WSEKTK+G+A+SGALVSTLVGLAASN+GII+S APAF +VL FLLPL+V LLLFRADLRRV+ STG LLLAFL+GSV TT+GTA+AY+LVPM+SLG D+
Subjt:  IWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDN

Query:  WKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAE----AEHNNKLPVLQSAICLAVSFAIC
        WKIAAALMGRHI GAVNYVA S ALGV+PSV+AAGLAADNVI AVYF TLFAL SK+PAE  P    +  DAE    +E  NK+PVL  A  +AVS AIC
Subjt:  WKIAAALMGRHICGAVNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAE----AEHNNKLPVLQSAICLAVSFAIC

Query:  KAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDLK
        KAG+ LTK+FGI GGS+PAITA VVILAT+FP  F  LAPSGEAMA+ILMQVFF VVGASGNIWSVINTAPSIF+F+LVQI  HLA+ +G+ KLL  +L+
Subjt:  KAGSYLTKHFGIQGGSMPAITAAVVILATIFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDLK

Query:  LLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM
        LLLLASNAN+GGPTTA GMATAKGW+S+ VPGILAGIFGIAIATF+GI FG+  LK+M
Subjt:  LLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFGIAIATFLGIGFGLMFLKYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCGCAAGTTGCAATTCTTCAAACGAAGTTGCCGGAAATCCAGCCGCCATGTTTTTCTTCCACCAAAAGCTCAGCCAGGTTCTTCAGGAGCATTACGATGGCACCTCGGCC
ACCGGTGTCCCATTTATCGTCGTCATCACCAGCTGCTGAAATTGGAGATCGGAGATTCTGGAATTTTCCTAGCACTAGTTTCGGAAATTTTCAATTGCGACGGCGTATTG
CTGTAAAATCTCATTTGAAATTGAATCTCCCGCTAATTTCTCCGCATGATCAGTGGGGCAACTGGACTGTTTTATTTTCCGTAGGAGCCTTCGGTATCTGGTCCGAGAAA
ACGAAGATTGGCAGTGCACTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTTGCAGCCAGTAATGTTGGGATTATTGCATCTGATGCTCCAGCTTTTCCCATTGTTTT
GGAGTTTTTGCTACCGCTATCAGTTCTTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTAATAAATTCAACTGGCACTCTTCTCTTGGCCTTTTTGTTAGGTTCAGTTG
GAACAACAATTGGAACCGCAGTGGCCTATTTTCTCGTACCAATGCGATCGCTTGGTCAAGACAATTGGAAAATCGCTGCTGCACTAATGGGAAGACATATTTGTGGAGCT
GTCAATTATGTTGCTAGTTCTGGTGCTCTTGGTGTTTCTCCATCAGTAATGGCTGCTGGACTTGCTGCAGATAATGTAATTTCTGCAGTTTATTTTGCAACATTGTTTGC
ATTAGCATCTAAACTACCTGCTGAACCTACACCATCAAGCGATAATGTCAAGAAGGATGCAGAGGCTGAGCATAACAATAAACTTCCAGTATTACAATCTGCCATATGCC
TTGCTGTATCATTTGCCATTTGTAAGGCTGGTTCCTACCTGACCAAACATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATCACAGCTGCAGTCGTGATCTTGGCAACC
ATTTTTCCTAAGCCGTTTGCTTACCTTGCACCTTCTGGTGAGGCTATGGCTGTGATTCTGATGCAGGTTTTCTTTAATGTAGTGGGAGCAAGTGGGAATATATGGAGTGT
CATCAACACTGCACCAAGTATCTTCATGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCCATAACCGTTGGTCTCAGAAAGCTGCTTCGTTTCGACCTAAAGTTGTTGC
TGCTAGCATCGAATGCGAACATTGGAGGTCCCACGACAGCGTGCGGGATGGCCACAGCAAAGGGTTGGAGTTCAATGGCTGTTCCTGGAATTCTTGCTGGAATTTTCGGA
ATCGCGATTGCAACTTTCCTAGGTATTGGATTTGGATTGATGTTCTTGAAATACATG
mRNA sequenceShow/hide mRNA sequence
TCGCAAGTTGCAATTCTTCAAACGAAGTTGCCGGAAATCCAGCCGCCATGTTTTTCTTCCACCAAAAGCTCAGCCAGGTTCTTCAGGAGCATTACGATGGCACCTCGGCC
ACCGGTGTCCCATTTATCGTCGTCATCACCAGCTGCTGAAATTGGAGATCGGAGATTCTGGAATTTTCCTAGCACTAGTTTCGGAAATTTTCAATTGCGACGGCGTATTG
CTGTAAAATCTCATTTGAAATTGAATCTCCCGCTAATTTCTCCGCATGATCAGTGGGGCAACTGGACTGTTTTATTTTCCGTAGGAGCCTTCGGTATCTGGTCCGAGAAA
ACGAAGATTGGCAGTGCACTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTTGCAGCCAGTAATGTTGGGATTATTGCATCTGATGCTCCAGCTTTTCCCATTGTTTT
GGAGTTTTTGCTACCGCTATCAGTTCTTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTAATAAATTCAACTGGCACTCTTCTCTTGGCCTTTTTGTTAGGTTCAGTTG
GAACAACAATTGGAACCGCAGTGGCCTATTTTCTCGTACCAATGCGATCGCTTGGTCAAGACAATTGGAAAATCGCTGCTGCACTAATGGGAAGACATATTTGTGGAGCT
GTCAATTATGTTGCTAGTTCTGGTGCTCTTGGTGTTTCTCCATCAGTAATGGCTGCTGGACTTGCTGCAGATAATGTAATTTCTGCAGTTTATTTTGCAACATTGTTTGC
ATTAGCATCTAAACTACCTGCTGAACCTACACCATCAAGCGATAATGTCAAGAAGGATGCAGAGGCTGAGCATAACAATAAACTTCCAGTATTACAATCTGCCATATGCC
TTGCTGTATCATTTGCCATTTGTAAGGCTGGTTCCTACCTGACCAAACATTTTGGAATTCAAGGTGGTAGCATGCCAGCAATCACAGCTGCAGTCGTGATCTTGGCAACC
ATTTTTCCTAAGCCGTTTGCTTACCTTGCACCTTCTGGTGAGGCTATGGCTGTGATTCTGATGCAGGTTTTCTTTAATGTAGTGGGAGCAAGTGGGAATATATGGAGTGT
CATCAACACTGCACCAAGTATCTTCATGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCCATAACCGTTGGTCTCAGAAAGCTGCTTCGTTTCGACCTAAAGTTGTTGC
TGCTAGCATCGAATGCGAACATTGGAGGTCCCACGACAGCGTGCGGGATGGCCACAGCAAAGGGTTGGAGTTCAATGGCTGTTCCTGGAATTCTTGCTGGAATTTTCGGA
ATCGCGATTGCAACTTTCCTAGGTATTGGATTTGGATTGATGTTCTTGAAATACATG
Protein sequenceShow/hide protein sequence
SQVAILQTKLPEIQPPCFSSTKSSARFFRSITMAPRPPVSHLSSSSPAAEIGDRRFWNFPSTSFGNFQLRRRIAVKSHLKLNLPLISPHDQWGNWTVLFSVGAFGIWSEK
TKIGSALSGALVSTLVGLAASNVGIIASDAPAFPIVLEFLLPLSVLLLLFRADLRRVINSTGTLLLAFLLGSVGTTIGTAVAYFLVPMRSLGQDNWKIAAALMGRHICGA
VNYVASSGALGVSPSVMAAGLAADNVISAVYFATLFALASKLPAEPTPSSDNVKKDAEAEHNNKLPVLQSAICLAVSFAICKAGSYLTKHFGIQGGSMPAITAAVVILAT
IFPKPFAYLAPSGEAMAVILMQVFFNVVGASGNIWSVINTAPSIFMFSLVQIAVHLAITVGLRKLLRFDLKLLLLASNANIGGPTTACGMATAKGWSSMAVPGILAGIFG
IAIATFLGIGFGLMFLKYM