; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0115 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0115
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtein of unknown function (DUF819)
Genome locationMC10:787551..791819
RNA-Seq ExpressionMC10g0115
SyntenyMC10g0115
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008537 - Protein of unknown function DUF819


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140757.1 uncharacterized protein LOC101211894 isoform X1 [Cucumis sativus]3.49e-25783.22Show/hide
Query:  SQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFS
        SQ+ IL SKSP+LQ PCFSSTK S  F RSI+MA  PP+  +SSSS  AEI   RFW+F  +S+GN   RR +AV+SHLKLNLPL+SP+DQW NWTVLFS
Subjt:  SQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRS
        +GAFGIWSEKTK+GSALSGALVSTLVGLAASN GIIASDAPAF +VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT +GT VAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRS

Query:  LGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAI
        LGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT   + VGKD E E +NKLPVLQSA+A+AVSFAI
Subjt:  LGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDL
        CK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHL + IGLGKLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDL

Query:  KLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        K LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIA+ATFLGIGFG+M LKYM
Subjt:  KLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

XP_022140902.1 uncharacterized protein LOC111011457 isoform X1 [Momordica charantia]0.0100Show/hide
Query:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
        GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
Subjt:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP

Query:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
        AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
Subjt:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL

Query:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
        ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
Subjt:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI

Query:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
        CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
Subjt:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA

Query:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
        VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
Subjt:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL

Query:  GIGFGLMFLKYM
        GIGFGLMFLKYM
Subjt:  GIGFGLMFLKYM

XP_022140903.1 uncharacterized protein LOC111011457 isoform X2 [Momordica charantia]3.46e-27491.35Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS S GNF LRR IAVKSHLKLNLPLISPHDQW NWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GL KLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGI
        DLKLLL+ASNAN+GGPTTACGMATAKGWSSM VPGILAGIFGIAIATFLGI
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGI

XP_022140905.1 uncharacterized protein LOC111011457 isoform X3 [Momordica charantia]0.0100Show/hide
Query:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
        GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
Subjt:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP

Query:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
        AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
Subjt:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL

Query:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
        ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
Subjt:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI

Query:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
        CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
Subjt:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA

Query:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
        VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
Subjt:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL

Query:  GIGFGLMFLKYM
        GIGFGLMFLKYM
Subjt:  GIGFGLMFLKYM

XP_022140906.1 uncharacterized protein LOC111011457 isoform X4 [Momordica charantia]0.0100Show/hide
Query:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
        GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
Subjt:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP

Query:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
        AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
Subjt:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL

Query:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
        ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
Subjt:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI

Query:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
        CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
Subjt:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA

Query:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
        VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
Subjt:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL

Query:  GIGFGLMFLKYM
        GIGFGLMFLKYM
Subjt:  GIGFGLMFLKYM

TrEMBL top hitse value%identityAlignment
A0A0A0L600 Uncharacterized protein1.69e-25783.22Show/hide
Query:  SQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFS
        SQ+ IL SKSP+LQ PCFSSTK S  F RSI+MA  PP+  +SSSS  AEI   RFW+F  +S+GN   RR +AV+SHLKLNLPL+SP+DQW NWTVLFS
Subjt:  SQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFS

Query:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRS
        +GAFGIWSEKTK+GSALSGALVSTLVGLAASN GIIASDAPAF +VLE LLPL++PLLLFRADLRRVIKSTGTLLLAFLLGSVGT +GT VAYFLVPMRS
Subjt:  VGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRS

Query:  LGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAI
        LGQDSWKIAAALMGRHIGGAVNYVAIS ALGVSPSVLAAGLAADNVICAVYFATLFALASKVP EPT   + VGKD E E +NKLPVLQSA+A+AVSFAI
Subjt:  LGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAI

Query:  CKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDL
        CK GSYLTK+FGIQGGSMPAITAVIVVLATIFPK FAYLAPSGEAMA+ILMQVFF VVGASGN+WSVINTAPSIF+F+ VQI+VHL + IGLGKLLRFDL
Subjt:  CKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDL

Query:  KLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        K LLIASNANVGGPTTACGMATAKGWSSMV+PGILAGIFGIA+ATFLGIGFG+M LKYM
Subjt:  KLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

A0A6J1CGH1 uncharacterized protein LOC111011457 isoform X30.0100Show/hide
Query:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
        GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
Subjt:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP

Query:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
        AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
Subjt:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL

Query:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
        ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
Subjt:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI

Query:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
        CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
Subjt:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA

Query:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
        VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
Subjt:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL

Query:  GIGFGLMFLKYM
        GIGFGLMFLKYM
Subjt:  GIGFGLMFLKYM

A0A6J1CHF3 uncharacterized protein LOC111011457 isoform X40.0100Show/hide
Query:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
        GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
Subjt:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP

Query:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
        AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
Subjt:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL

Query:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
        ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
Subjt:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI

Query:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
        CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
Subjt:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA

Query:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
        VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
Subjt:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL

Query:  GIGFGLMFLKYM
        GIGFGLMFLKYM
Subjt:  GIGFGLMFLKYM

A0A6J1CIC3 uncharacterized protein LOC111011457 isoform X10.0100Show/hide
Query:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
        GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP
Subjt:  GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSP

Query:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
        AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL
Subjt:  AAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVL

Query:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
        ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI
Subjt:  ELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVI

Query:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
        CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA
Subjt:  CAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMA

Query:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
        VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL
Subjt:  VILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFL

Query:  GIGFGLMFLKYM
        GIGFGLMFLKYM
Subjt:  GIGFGLMFLKYM

A0A6J1CJ32 uncharacterized protein LOC111011457 isoform X21.68e-27491.35Show/hide
Query:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL
        MASQVAILQ+K P++Q PCFSSTKSSARFFRSITMAPRPPV  +SSSSPAAEIGDRRFWNFPS S GNF LRR IAVKSHLKLNLPLISPHDQW NWTVL
Subjt:  MASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVL

Query:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM
        FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFP+VLE LLPLS+ LLLFRADLRRVI STGTLLLAFLLGSVGT IGTAVAYFLVPM
Subjt:  FSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPM

Query:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF
        RSLGQD+WKIAAALMGRHI GAVNYVA SGALGVSPSV+AAGLAADNVI AVYFATLFALASK+PAEPTPSSDNV KD EAEHNNKLPVLQSA  LAVSF
Subjt:  RSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDPEAEHNNKLPVLQSATALAVSF

Query:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF
        AICKAGSYLTKHFGIQGGSMPAITA +V+LATIFPKPFAYLAPSGEAMAVILMQVFF VVGASGNIWSVINTAPSIFMFSLVQIAVHLA+T+GL KLLRF
Subjt:  AICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRF

Query:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGI
        DLKLLL+ASNAN+GGPTTACGMATAKGWSSM VPGILAGIFGIAIATFLGI
Subjt:  DLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGI

SwissProt top hitse value%identityAlignment
O31634 Uncharacterized membrane protein YjcL1.6e-3831.06Show/hide
Query:  LISPHDQWANWTVLFSVGAFGIWSE-KTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSV
        LIS  D W  W  +    A  I  E + K  SA+SGA+++    +  +NVG++  ++P +  V   ++PL+IPLLLF+ ++R++ K +  LL  FL+ SV
Subjt:  LISPHDQWANWTVLFSVGAFGIWSE-KTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSV

Query:  GTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALAS--------KVPAEPTPSSD-NVG
        GT++G+ +A+FL+       D  KI   +   +IGG VN+ A++         ++A + ADN + A+ F  L ++ +         +P E    +D N G
Subjt:  GTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALAS--------KVPAEPTPSSD-NVG

Query:  KDPEAEHNNKLPVLQ-----SATALAVSFAICKAGSYLTKHF------GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGN
           E+    K   L+     +  A A+     K   Y    F      G  G     +T++ V++  +FP+ F  L  S E +   L+ +FF V+G   +
Subjt:  KDPEAEHNNKLPVLQ-----SATALAVSFAICKAGSYLTKHF------GIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGN

Query:  IWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        +  ++  AP I +F  +    +LA+++  GKL R  L+ +L+A NA VGGPTTA  MA AKGW  +V P +L G  G  I  ++G   G  F  ++
Subjt:  IWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM

Arabidopsis top hitse value%identityAlignment
AT5G24000.1 Protein of unknown function (DUF819)4.5e-12962.69Show/hide
Query:  VKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTL
        VK + +L  PLISP D W+ W  LF+ GAFG+WSEKTKIGS +SGAL STL+GLAASN+ +I  + P++   +E LLP +IPLLLFRADLRR+I+STG+L
Subjt:  VKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTL

Query:  LLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSD---
        LLAFL+GSV TI+GT VA+ LVPMRSLG D+WKIAAALMG +IGG++N+VAIS AL +SPSV+AAG+A DNVICA++F  LFALASK+P E   +S    
Subjt:  LLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSD---

Query:  NVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTA
        ++ KD + E  N+  V+ ++ AL+VSF ICKA   LT  F IQG  +PA+TA+ +VLAT FP  F  LAPS E +++ILMQVFFT++GA+G++W+VINTA
Subjt:  NVGKDPEAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTA

Query:  PSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLK
        PSIF+F+ +Q+ VHLA+T+ LGKL   D+KLLL+ASNAN+GGPTTAC MATAKGW+S+VVPGIL+G+FG++IATFLGIG G+  LK
Subjt:  PSIFMFSLVQIAVHLAMTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLK

AT5G52540.1 Protein of unknown function (DUF819)6.2e-15566.3Show/hide
Query:  SSTKSSARFFRSIT-MAPRPPVPPVSSSSPAAEI------------GDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFG
        +S  SS+R FR  + +  R  +P  S+ S ++ +            G  RF + P +SS      R + V S   L+ PLISP+D+W  WT LF+ GA G
Subjt:  SSTKSSARFFRSIT-MAPRPPVPPVSSSSPAAEI------------GDRRFWNFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFG

Query:  IWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDS
        +WSEKTK+G+A+SGALVSTLVGLAASN+GII+S APAF VVL  LLPL++PLLLFRADLRRV++STG LLLAFL+GSV T +GTA+AY+LVPM+SLG DS
Subjt:  IWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRVIKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDS

Query:  WKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNV---GKDPEAEHNNKLPVLQSATALAVSFAICK
        WKIAAALMGRHIGGAVNYVAIS ALGV+PSVLAAGLAADNVICAVYF TLFAL SK+PAE  P    +     +  +E  NK+PVL  AT +AVS AICK
Subjt:  WKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNV---GKDPEAEHNNKLPVLQSATALAVSFAICK

Query:  AGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKL
        AG+ LTK+FGI GGS+PAITAV+V+LAT+FP  F  LAPSGEAMA+ILMQVFFTVVGASGNIWSVINTAPSIF+F+LVQI  HLA+ +G+GKLL  +L+L
Subjt:  AGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLAMTIGLGKLLRFDLKL

Query:  LLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM
        LL+ASNANVGGPTTA GMATAKGW+S++VPGILAGIFGIAIATF+GI FG+  LK+M
Subjt:  LLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGCAGCACGCGCGAAGTATTTTTCTTTAAAAATAATATTCCCGGGCTCTGGGGGAGGGAACCCACGTTTTTACATTGTTTGTATCACTGTATTTTACTGAGTGATGAAGT
CCATTGTCGGTGTTCTCTCTCCGTTCAATCCATCTCGCCGGAGATGGCATCGCAAGTCGCAATTCTTCAATCGAAGTCACCGCAACTCCAGCTGCCATGTTTTTCTTCCA
CCAAAAGCTCAGCCAGGTTCTTCAGGAGCATCACAATGGCACCTCGCCCACCGGTGCCCCCTGTTTCGTCATCATCACCAGCTGCTGAAATTGGAGATCGGAGATTCTGG
AATTTTCCTAGCAATAGCTCCGGAAATTTTCATTTGAGACGATGTATTGCTGTAAAATCTCATTTGAAATTGAATCTCCCACTAATTTCTCCGCATGATCAGTGGGCCAA
CTGGACTGTTTTGTTTTCCGTAGGAGCCTTCGGTATCTGGTCCGAGAAAACGAAGATTGGCAGTGCACTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTCGCAGCCA
GTAATGTTGGGATTATTGCATCTGATGCTCCAGCTTTCCCTGTTGTTTTGGAGCTTTTGCTACCGCTATCAATTCCTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTA
ATAAAGTCTACTGGCACACTTCTCTTGGCCTTTTTGTTAGGTTCAGTTGGAACAATAATTGGAACCGCAGTTGCCTATTTTCTTGTACCAATGCGATCGCTTGGTCAAGA
CAGTTGGAAAATTGCCGCCGCACTAATGGGAAGACATATTGGTGGAGCTGTCAATTATGTTGCTATTTCTGGTGCTCTTGGTGTTTCTCCATCGGTATTGGCTGCTGGAC
TTGCTGCAGATAATGTAATCTGTGCAGTTTATTTTGCAACGTTGTTTGCATTAGCATCTAAAGTACCTGCTGAACCTACACCATCAAGCGATAATGTCGGGAAGGATCCT
GAGGCTGAGCATAACAACAAGCTTCCAGTGTTGCAATCTGCCACAGCCCTTGCTGTATCATTTGCCATTTGTAAGGCCGGTTCCTACCTGACCAAACATTTTGGAATTCA
AGGTGGTAGCATGCCAGCAATCACAGCAGTCATCGTGGTCTTAGCCACCATTTTTCCGAAGCCATTTGCTTACCTTGCACCTTCTGGTGAGGCCATGGCTGTTATTCTAA
TGCAGGTTTTCTTTACTGTAGTGGGAGCGAGTGGGAATATATGGAGTGTCATCAACACTGCACCAAGTATCTTCATGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCT
ATGACCATTGGTCTCGGAAAGCTGCTTCGATTCGACCTAAAGTTGTTGCTGATAGCATCGAATGCAAATGTCGGGGGCCCCACGACAGCGTGCGGGATGGCCACAGCAAA
GGGTTGGAGTTCAATGGTTGTTCCTGGAATTCTTGCTGGAATTTTCGGAATTGCGATTGCAACTTTCCTAGGTATTGGATTTGGATTGATGTTCTTGAAATACATGTAA
mRNA sequenceShow/hide mRNA sequence
GGCAGCACGCGCGAAGTATTTTTCTTTAAAAATAATATTCCCGGGCTCTGGGGGAGGGAACCCACGTTTTTACATTGTTTGTATCACTGTATTTTACTGAGTGATGAAGT
CCATTGTCGGTGTTCTCTCTCCGTTCAATCCATCTCGCCGGAGATGGCATCGCAAGTCGCAATTCTTCAATCGAAGTCACCGCAACTCCAGCTGCCATGTTTTTCTTCCA
CCAAAAGCTCAGCCAGGTTCTTCAGGAGCATCACAATGGCACCTCGCCCACCGGTGCCCCCTGTTTCGTCATCATCACCAGCTGCTGAAATTGGAGATCGGAGATTCTGG
AATTTTCCTAGCAATAGCTCCGGAAATTTTCATTTGAGACGATGTATTGCTGTAAAATCTCATTTGAAATTGAATCTCCCACTAATTTCTCCGCATGATCAGTGGGCCAA
CTGGACTGTTTTGTTTTCCGTAGGAGCCTTCGGTATCTGGTCCGAGAAAACGAAGATTGGCAGTGCACTAAGTGGTGCCTTAGTGAGCACATTGGTAGGACTCGCAGCCA
GTAATGTTGGGATTATTGCATCTGATGCTCCAGCTTTCCCTGTTGTTTTGGAGCTTTTGCTACCGCTATCAATTCCTTTGCTGTTATTTAGAGCAGATTTGCGTCGTGTA
ATAAAGTCTACTGGCACACTTCTCTTGGCCTTTTTGTTAGGTTCAGTTGGAACAATAATTGGAACCGCAGTTGCCTATTTTCTTGTACCAATGCGATCGCTTGGTCAAGA
CAGTTGGAAAATTGCCGCCGCACTAATGGGAAGACATATTGGTGGAGCTGTCAATTATGTTGCTATTTCTGGTGCTCTTGGTGTTTCTCCATCGGTATTGGCTGCTGGAC
TTGCTGCAGATAATGTAATCTGTGCAGTTTATTTTGCAACGTTGTTTGCATTAGCATCTAAAGTACCTGCTGAACCTACACCATCAAGCGATAATGTCGGGAAGGATCCT
GAGGCTGAGCATAACAACAAGCTTCCAGTGTTGCAATCTGCCACAGCCCTTGCTGTATCATTTGCCATTTGTAAGGCCGGTTCCTACCTGACCAAACATTTTGGAATTCA
AGGTGGTAGCATGCCAGCAATCACAGCAGTCATCGTGGTCTTAGCCACCATTTTTCCGAAGCCATTTGCTTACCTTGCACCTTCTGGTGAGGCCATGGCTGTTATTCTAA
TGCAGGTTTTCTTTACTGTAGTGGGAGCGAGTGGGAATATATGGAGTGTCATCAACACTGCACCAAGTATCTTCATGTTTTCTCTTGTCCAGATTGCAGTCCATCTTGCT
ATGACCATTGGTCTCGGAAAGCTGCTTCGATTCGACCTAAAGTTGTTGCTGATAGCATCGAATGCAAATGTCGGGGGCCCCACGACAGCGTGCGGGATGGCCACAGCAAA
GGGTTGGAGTTCAATGGTTGTTCCTGGAATTCTTGCTGGAATTTTCGGAATTGCGATTGCAACTTTCCTAGGTATTGGATTTGGATTGATGTTCTTGAAATACATGTAAA
ATCATTCAGTCTCAGATCATAAAACTGCTGGTTTCTTCACAAGTGTTTGTATAACTATTTTGAGGGAGTACCTTTTCCTTCTTAATTGTTTTCTCGAAACGTATTAGAGA
GCTCATTTTCTTGGTGTGTATTTGGCATGCCATTCAATGAAAATCATCTGGGTTAGTCCAAAAATTTAGAGAGAATGAATTCGATGGGTTCAAAG
Protein sequenceShow/hide protein sequence
GSTREVFFFKNNIPGLWGREPTFLHCLYHCILLSDEVHCRCSLSVQSISPEMASQVAILQSKSPQLQLPCFSSTKSSARFFRSITMAPRPPVPPVSSSSPAAEIGDRRFW
NFPSNSSGNFHLRRCIAVKSHLKLNLPLISPHDQWANWTVLFSVGAFGIWSEKTKIGSALSGALVSTLVGLAASNVGIIASDAPAFPVVLELLLPLSIPLLLFRADLRRV
IKSTGTLLLAFLLGSVGTIIGTAVAYFLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISGALGVSPSVLAAGLAADNVICAVYFATLFALASKVPAEPTPSSDNVGKDP
EAEHNNKLPVLQSATALAVSFAICKAGSYLTKHFGIQGGSMPAITAVIVVLATIFPKPFAYLAPSGEAMAVILMQVFFTVVGASGNIWSVINTAPSIFMFSLVQIAVHLA
MTIGLGKLLRFDLKLLLIASNANVGGPTTACGMATAKGWSSMVVPGILAGIFGIAIATFLGIGFGLMFLKYM