CuGenDBv2

Sed0002503 (gene) of Chayote v1 genome

Gene ID: Sed0002503
Organism: Sechium edule (Chayote v1)
Description: Polyketide cyclase/dehydrase and lipid transport superfamily protein
Genome location: LG01:66895815..66899186
RNA-Seq Expression: Sed0002503
Synteny: Sed0002503
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0008289 - lipid binding (molecular function)
InterPro domains: IPR002913 - START domain
                  IPR023393 - START-like domain superfamily
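
The genome location above uses a "sequence:start..end" notation. The following is a minimal sketch of how such a string might be split into its parts and the genomic span computed. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names parse_location and span_length are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG01:66895815..66899186")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 3372 bp. This is the genomic extent of the gene model, so it includes any introns and is not the length of the CDS or mRNA.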


Homology
GenBank top hits | e value | % identity | Alignment
XP_004145876.1 uncharacterized protein LOC101209462 [Cucumis sativus] | 4.7e-203 | 83.57
Query:  VDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCS-VVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPT
        ++ AEA T+YDLVFK LMF+ARLWVG+IVGVLVGWIWKP+WA+  ++LF SSK+KDNLPS S  VGSIS LNSL  QLPRCLN+TS+C DE E L + PT
Subjt:  VDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCS-VVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPT

Query:  AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ
         +ISSSS  EGEK A+LTEEDLK LYRLVEEKDGGPSWIQMMDRST+TM+YQAWRRD +TGPPQYRSRTVFEDATPQMVRDFFWDDEFR+KWDDMLIS Q
Subjt:  AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ

Query:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW
        TLEDC  TGTM VRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKG+PCSS+PRQNKPKRVDLYYSSWCIRAVES+KGN QLTACEV+LFH+EDMGIPW
Subjt:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW

Query:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIALACSLDHG
        EIAKLGVR+GMWGAVKK+DPALR+YQKHRA+EAPLSNCA +ANINTKVS D LRCSEDASDDS++ K+L EP EKPAGKNL K+L+VGGAIALACSLDHG
Subjt:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIALACSLDHG

Query:  LLTKAVVFGVARRFSNIGKR
        LLTKAVVFGVARRFSNIGKR
Subjt:  LLTKAVVFGVARRFSNIGKR

XP_008437458.1 PREDICTED: uncharacterized protein LOC103482874 [Cucumis melo] | 1.5e-201 | 83.57
Query:  VDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCS-VVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPT
        ++ AEA T YDLVFK LMF+ARLWVG+IVGVLVGWIWKP+WA+  + LF SSK+K+NLPS S  VGSIS LNSL IQLPRCLN+TS+C DE EVL + PT
Subjt:  VDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCS-VVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPT

Query:  AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ
         +ISSSS  EGEK A+LTEEDLK LYRLVEEKDGGPSWIQMMDRST+TM+YQAWRRD +TGPPQYRSRTVFEDATPQMVRDFFWDDEFR KWDDMLIS Q
Subjt:  AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ

Query:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW
        TLEDC  TGTM VRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSS+PRQNKPKRVDLYYSSWCIRAVES+KGNGQ+TACEV+LFH+EDMGIPW
Subjt:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW

Query:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIALACSLDHG
        EIAKLGVR+GMWGAVKK+DPALRAYQKHRA+EAPLSN A +ANINTK+S D LRCSEDASDDS++ K+L EP EKPA KNL K+L+VGGAIALACSLDHG
Subjt:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIALACSLDHG

Query:  LLTKAVVFGVARRFSNIGKR
        LLTKAVVFGVARRFSNIGKR
Subjt:  LLTKAVVFGVARRFSNIGKR

XP_022159578.1 uncharacterized protein LOC111025953 isoform X1 [Momordica charantia] | 3.3e-204 | 83.61
Query:  DLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP--EPPT
        + AE+PT YDLVFK LMFIARLWVG+IVGVLVGWIWKP+WAD  ++LF SSK+KDNLPS    GSISSLNSL IQLPRCLN+TS C  E EVL   +PPT
Subjt:  DLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP--EPPT

Query:  AAI--SSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIS
        AA   SSSS  EGEKP QLTEED K LY+LVEEKDGGPSWIQMMDRSTSTM+YQAWRRD +TGPPQYRSRTVFEDATPQMVRDFFWDDEFR KWDDML+S
Subjt:  AAI--SSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIS

Query:  TQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGI
         QTLEDCP+TG MTV WVRKFPFFCSDREYVIGRRIWESGQSYYCVTKG+PCS VPRQNKPKRVDLYYSSWCIRAVES+KG GQLTACEV+LFHYEDMGI
Subjt:  TQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGI

Query:  PWEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDH
        PWEIAKLGVR+GMWGAVKK+DPALR YQKHR ++APLSNCA +ANINTK+S+D LRCSED SDDS EVKSLEP+EKPAGK+L K+L++GGAIALACSLDH
Subjt:  PWEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDH

Query:  GLLTKAVVFGVARRFSNIGKR
        GLLTKAVVFGVARRFSNIGKR
Subjt:  GLLTKAVVFGVARRFSNIGKR

XP_022159579.1 uncharacterized protein LOC111025953 isoform X2 [Momordica charantia] | 6.6e-205 | 84.05
Query:  DLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP--EPPT
        + AE+PT YDLVFK LMFIARLWVG+IVGVLVGWIWKP+WAD  ++LF SSK+KDNLPS    GSISSLNSL IQLPRCLN+TS C  E EVL   +PPT
Subjt:  DLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP--EPPT

Query:  -AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIST
         AAISSSS  EGEKP QLTEED K LY+LVEEKDGGPSWIQMMDRSTSTM+YQAWRRD +TGPPQYRSRTVFEDATPQMVRDFFWDDEFR KWDDML+S 
Subjt:  -AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIST

Query:  QTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIP
        QTLEDCP+TG MTV WVRKFPFFCSDREYVIGRRIWESGQSYYCVTKG+PCS VPRQNKPKRVDLYYSSWCIRAVES+KG GQLTACEV+LFHYEDMGIP
Subjt:  QTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIP

Query:  WEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHG
        WEIAKLGVR+GMWGAVKK+DPALR YQKHR ++APLSNCA +ANINTK+S+D LRCSED SDDS EVKSLEP+EKPAGK+L K+L++GGAIALACSLDHG
Subjt:  WEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHG

Query:  LLTKAVVFGVARRFSNIGKR
        LLTKAVVFGVARRFSNIGKR
Subjt:  LLTKAVVFGVARRFSNIGKR

XP_038876163.1 uncharacterized protein LOC120068456 [Benincasa hispida] | 8.4e-208 | 86.43
Query:  VDLAEAPT-VYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPT
        ++ AEAPT  YDLVFK LMFIARLWVG+IVGVLVGWIWKP+WA+  ++LF SSK+K NLPS   VGSIS LNSL IQLPRCLN+TS+C DE E+L   PT
Subjt:  VDLAEAPT-VYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPT

Query:  AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ
        A+ISSSS  EGEKPAQLTEEDLK LYRLVEEKDGGPSWIQMMDRST+TM+YQAWRRD E GPPQYRSRTVFEDATPQMVRDFFWDDEFR KWDDMLIS Q
Subjt:  AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ

Query:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW
         LEDC TTGTMTV WVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSS+PRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEV+LFHYEDMGIPW
Subjt:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW

Query:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIALACSLDHG
        EIAKLGVR+GMWGAVKK+DPAL AYQKHRASEAPLSNCA +ANINTKVSTD LRCSEDASDDS++ KSL EPSEKP GKNL K+LMVGGAIALACSLDHG
Subjt:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIALACSLDHG

Query:  LLTKAVVFGVARRFSNIGKR
        LLTKAVVFGVARRFSNIGKR
Subjt:  LLTKAVVFGVARRFSNIGKR

TrEMBL top hits | e value | % identity | Alignment
A0A1S3AUN5 uncharacterized protein LOC103482874 | 7.4e-202 | 83.57
Query:  VDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCS-VVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPT
        ++ AEA T YDLVFK LMF+ARLWVG+IVGVLVGWIWKP+WA+  + LF SSK+K+NLPS S  VGSIS LNSL IQLPRCLN+TS+C DE EVL + PT
Subjt:  VDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCS-VVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPT

Query:  AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ
         +ISSSS  EGEK A+LTEEDLK LYRLVEEKDGGPSWIQMMDRST+TM+YQAWRRD +TGPPQYRSRTVFEDATPQMVRDFFWDDEFR KWDDMLIS Q
Subjt:  AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ

Query:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW
        TLEDC  TGTM VRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSS+PRQNKPKRVDLYYSSWCIRAVES+KGNGQ+TACEV+LFH+EDMGIPW
Subjt:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW

Query:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIALACSLDHG
        EIAKLGVR+GMWGAVKK+DPALRAYQKHRA+EAPLSN A +ANINTK+S D LRCSEDASDDS++ K+L EP EKPA KNL K+L+VGGAIALACSLDHG
Subjt:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIALACSLDHG

Query:  LLTKAVVFGVARRFSNIGKR
        LLTKAVVFGVARRFSNIGKR
Subjt:  LLTKAVVFGVARRFSNIGKR

A0A5D3C494 Polyketide cyclase/dehydrase and lipid transport superfamily protein | 1.1e-200 | 82.9
Query:  MAVD-----LAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCS-VVGSISSLNSLMIQLPRCLNMTSSCKDEIE
        MAVD      AEA T YDLVFK LMF+ARLWVG+IVGVLVGWIWKP+WA+  + LF SSK+K+NLPS S  VGSIS LNSL IQLPRCLN+TS+C DE E
Subjt:  MAVD-----LAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCS-VVGSISSLNSLMIQLPRCLNMTSSCKDEIE

Query:  VLPEPPTAAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWD
        VL + PT +ISSSS  EGEK A+LTEEDLK LYRLVEEKDGGPSWIQMMDRST+TM+YQAWRRD +TGPPQYRSRTVFEDATPQMVRDFFWDDEFR KWD
Subjt:  VLPEPPTAAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWD

Query:  DMLISTQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHY
        DMLIS QTLEDC  TGTM VRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSS+PRQNKPKRVDLYYSSWCIRAVES+KGNGQ+TACEV+LFH+
Subjt:  DMLISTQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHY

Query:  EDMGIPWEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIAL
        EDMGIPWEIAKLGVR+GMWGAVKK+DPALRAYQKHRA+EAPLSN A +ANINTK+S D LRCSEDASDDS++ K+L EP EKPA KNL K+L+VGGAIAL
Subjt:  EDMGIPWEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSL-EPSEKPAGKNLGKMLMVGGAIAL

Query:  ACSLDHGLLTKAVVFGVARRFSNIGKR
        ACSLD GLLTKAVVFGVARRFSNIGKR
Subjt:  ACSLDHGLLTKAVVFGVARRFSNIGKR

A0A6J1DZ71 uncharacterized protein LOC111025953 isoform X1 | 1.6e-204 | 83.61
Query:  DLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP--EPPT
        + AE+PT YDLVFK LMFIARLWVG+IVGVLVGWIWKP+WAD  ++LF SSK+KDNLPS    GSISSLNSL IQLPRCLN+TS C  E EVL   +PPT
Subjt:  DLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP--EPPT

Query:  AAI--SSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIS
        AA   SSSS  EGEKP QLTEED K LY+LVEEKDGGPSWIQMMDRSTSTM+YQAWRRD +TGPPQYRSRTVFEDATPQMVRDFFWDDEFR KWDDML+S
Subjt:  AAI--SSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIS

Query:  TQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGI
         QTLEDCP+TG MTV WVRKFPFFCSDREYVIGRRIWESGQSYYCVTKG+PCS VPRQNKPKRVDLYYSSWCIRAVES+KG GQLTACEV+LFHYEDMGI
Subjt:  TQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGI

Query:  PWEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDH
        PWEIAKLGVR+GMWGAVKK+DPALR YQKHR ++APLSNCA +ANINTK+S+D LRCSED SDDS EVKSLEP+EKPAGK+L K+L++GGAIALACSLDH
Subjt:  PWEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDH

Query:  GLLTKAVVFGVARRFSNIGKR
        GLLTKAVVFGVARRFSNIGKR
Subjt:  GLLTKAVVFGVARRFSNIGKR

A0A6J1E2R8 uncharacterized protein LOC111025953 isoform X2 | 3.2e-205 | 84.05
Query:  DLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP--EPPT
        + AE+PT YDLVFK LMFIARLWVG+IVGVLVGWIWKP+WAD  ++LF SSK+KDNLPS    GSISSLNSL IQLPRCLN+TS C  E EVL   +PPT
Subjt:  DLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP--EPPT

Query:  -AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIST
         AAISSSS  EGEKP QLTEED K LY+LVEEKDGGPSWIQMMDRSTSTM+YQAWRRD +TGPPQYRSRTVFEDATPQMVRDFFWDDEFR KWDDML+S 
Subjt:  -AAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIST

Query:  QTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIP
        QTLEDCP+TG MTV WVRKFPFFCSDREYVIGRRIWESGQSYYCVTKG+PCS VPRQNKPKRVDLYYSSWCIRAVES+KG GQLTACEV+LFHYEDMGIP
Subjt:  QTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIP

Query:  WEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHG
        WEIAKLGVR+GMWGAVKK+DPALR YQKHR ++APLSNCA +ANINTK+S+D LRCSED SDDS EVKSLEP+EKPAGK+L K+L++GGAIALACSLDHG
Subjt:  WEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHG

Query:  LLTKAVVFGVARRFSNIGKR
        LLTKAVVFGVARRFSNIGKR
Subjt:  LLTKAVVFGVARRFSNIGKR

A0A6J1H5S2 uncharacterized protein LOC111459823 | 1.1e-197 | 82.2
Query:  MAVD-----LAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEV
        MAVD      AEA T YDL FK LMFI RLWVGVIVGVLVGWIWKP+WADF  +LF SSK+K+NL SCS VGSIS LN + IQLPRCLN+ S+C D  EV
Subjt:  MAVD-----LAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEV

Query:  LPEPPTAAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDD
        L +PPT A+SSSS  EGE  AQL EEDLK LYRLVEEKDGG SWIQMMDRST+TM+YQAWRRD ETGPPQYRSRT+FEDATPQMVRDFFWDDEFRQKWDD
Subjt:  LPEPPTAAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDD

Query:  MLISTQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYE
        ML+S QTLE+CPTTGTMTV WVRKFPFFCS+REYVIGRRIWESGQSYYCVTKGVPCSSVPRQNK KRVDLYYSSWCIRAVESK GNGQL+ACEV LFHYE
Subjt:  MLISTQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYE

Query:  DMGIPWEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCS-EDASDDSTEV-KSLEPSEKPAGKNLGKMLMVGGAIAL
        DMGIPWEIAKLGVR+GMWGAVKKMDPALR YQKHRA+EAP+S C  + NINTKVST+ LRCS E+ASDD TEV K +EPSEK AGKN  K+LMVGGAIAL
Subjt:  DMGIPWEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCS-EDASDDSTEV-KSLEPSEKPAGKNLGKMLMVGGAIAL

Query:  ACSLDHGLLTKAVVFGVARRFSNIGKR
        AC+LDHGLLTKAVVFGVARRFSNIGKR
Subjt:  ACSLDHGLLTKAVVFGVARRFSNIGKR

SwissProt top hits | e value | % identity | Alignment
P53808 Phosphatidylcholine transfer protein | 4.6e-07 | 23.62
Query:  GPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRR-
        G  W  +++ S  T+ Y+    D  +G  +Y+   V E  +P ++ D + D ++R++WD  +   + L +  +   M   W  K+PF  S+R+YV  R+ 
Subjt:  GPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRR-

Query:  ---IWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMG--IPWEIAKLGVRKGMWGAVKKMDPALRAYQK
             +  + Y  + + +     P ++   RV  Y  S  I + + KKG+       V ++++++ G  IP  +     + G+   +K M  A + Y K
Subjt:  ---IWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMG--IPWEIAKLGVRKGMWGAVKKMDPALRAYQK

Q9UKL6 Phosphatidylcholine transfer protein | 4.1e-08 | 24.86
Query:  DLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRR----IWESGQSYYCVTKGVPCSS
        D +TG  +Y+   V ED +P ++ D + D ++R++WD  +   + L +    G   V W  K+PF  S+R+YV  R+      E  + +  + +      
Subjt:  DLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRR----IWESGQSYYCVTKGVPCSS

Query:  VPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMG--IPWEIAKLGVRKGMWGAVKKMDPALRAYQK
        +  ++   RV  Y  S  I + + KKG+      +V ++++++ G  IP  +     + G+   +K M  A + Y K
Subjt:  VPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMG--IPWEIAKLGVRKGMWGAVKKMDPALRAYQK

Arabidopsis top hits | e value | % identity | Alignment
AT1G64720.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein | 4.1e-136 | 59.45
Query:  LMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPTAAISSSSIPEGEKPAQL
        ++F+A LW+ V  GVLVGW+W+P+WA      +  SK+  N P             L +QLP  +  TSS                 + S+   EK   +
Subjt:  LMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPTAAISSSSIPEGEKPAQL

Query:  TEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVR
        T++D + L++LVE KDGGP WIQMMDRST T +YQAWRRD E GPPQYRSRTVFEDATP+MVRDFFWDDEFR KWDDML+ + TLE C  TGTM V+WVR
Subjt:  TEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVR

Query:  KFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPWEIAKLGVRKGMWGAVKK
        KFPFFCSDREY+IGRRIW++G+ +YC+TKGV   SVPRQNKP+RVDLYYSSWCIRAVESK+G+G++T+CEV+LFH+EDMGIPWEIAKLGVR+GMWGAVKK
Subjt:  KFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPWEIAKLGVRKGMWGAVKK

Query:  MDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHGLLTKAVVFGVARRFSNIG
        ++P LRAYQ+ +A+ A LS  A +A+INTKVS +       +  + T        +KP GKN+ K+L+VGGAIALAC+LD GLLTKAV+FGVARRF+ +G
Subjt:  MDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHGLLTKAVVFGVARRFSNIG

Query:  KR
        KR
Subjt:  KR

AT3G23080.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein | 1.7e-97 | 45.93
Query:  VDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSS-KMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP-EPP
        +D    P+V +     L+    +WV V++G+L+GW W+PRW   V   F S  +    +P       I    + +     C     S  +     P E  
Subjt:  VDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSS-KMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP-EPP

Query:  TAAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIST
          + +S  + E EK   +TE+DL+ L  L+E  +    W  MMD+ST  M+YQAWR + E GP  YRSRTVFED TP +VRDFFWDDEFR KWD ML   
Subjt:  TAAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIST

Query:  QTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIP
        +TLE+ P TGT  V W++KFPFFCSDREY+IGRRIWESG+ YY VTKGVP  ++ +++KP+RV+LY+SSW I AVES+KG+GQ+TACEV L HYEDMGIP
Subjt:  QTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIP

Query:  WEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLG---KMLMVGGAIALACSL
         ++AKLGVR GMWGAVKK++  LRAYQ  R     LS  A++A+I TK++ D +  S   ++D    +++E + K      G   K ++VGG +ALAC L
Subjt:  WEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLG---KMLMVGGAIALACSL

Query:  DHGLLTKAVVFGVARRFS
            + KA++ G  +R +
Subjt:  DHGLLTKAVVFGVARRFS

AT3G23080.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein | 1.7e-94 | 47.31
Query:  IVGVLVGWIWKPRWADFVKNLFGSS-KMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP-EPPTAAISSSSIPEGEKPAQLTEEDLKRLY
        ++G+L+GW W+PRW   V   F S  +    +P       I    + +     C     S  +     P E    + +S  + E EK   +TE+DL+ L 
Subjt:  IVGVLVGWIWKPRWADFVKNLFGSS-KMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLP-EPPTAAISSSSIPEGEKPAQLTEEDLKRLY

Query:  RLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVRKFPFFCSDR
         L+E  +    W  MMD+ST  M+YQAWR + E GP  YRSRTVFED TP +VRDFFWDDEFR KWD ML   +TLE+ P TGT  V W++KFPFFCSDR
Subjt:  RLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVRKFPFFCSDR

Query:  EYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPWEIAKLGVRKGMWGAVKKMDPALRAYQ
        EY+IGRRIWESG+ YY VTKGVP  ++ +++KP+RV+LY+SSW I AVES+KG+GQ+TACEV L HYEDMGIP ++AKLGVR GMWGAVKK++  LRAYQ
Subjt:  EYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPWEIAKLGVRKGMWGAVKKMDPALRAYQ

Query:  KHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLG---KMLMVGGAIALACSLDHGLLTKAVVFGVARRFS
          R     LS  A++A+I TK++ D +  S   ++D    +++E + K      G   K ++VGG +ALAC L    + KA++ G  +R +
Subjt:  KHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLG---KMLMVGGAIALACSLDHGLLTKAVVFGVARRFS

AT4G14500.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein | 2.3e-99 | 46.51
Query:  LMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMK------------------DNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPP
        L+    +W+ V++G+L+GW W+PRW   +   F  SK++                    L + SV  +I S N          + T S   +  V     
Subjt:  LMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMK------------------DNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPP

Query:  TAAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIST
        +   S  S         +TE DL+ L +L+E  +    W  MMD++T  M+YQAWR + ETGP  YRSRTVFEDATP +VRDFFWDDEFR KWD ML + 
Subjt:  TAAISSSSIPEGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLIST

Query:  QTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIP
        +TLE+   TGTM V+W +KFPFFCSDREY+IGRRIWESG+ YYCVTKGVP  ++P+++KP+RV+LY+SSW IRAVES+KG+GQ TACEV L HYEDMGIP
Subjt:  QTLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIP

Query:  WEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHG
         ++AKLGVR GMWGAVKK++  LRAYQ  R S++ LS  A++A I TK++ DS   +E +S D    +++E + +     +    +V G +ALAC L  G
Subjt:  WEIAKLGVRKGMWGAVKKMDPALRAYQKHRASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHG

Query:  LLTKAVVFGVARRFS
        ++ KA++ G  +R +
Subjt:  LLTKAVVFGVARRFS

AT5G54170.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein | 2.1e-108 | 49.41
Query:  WVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPR-------------CLNMTSSCKDEIE-VLPEPPTAAISSSS---
        W+  ++G+++GW WKPRW     N     K++ + P    +   SS  S ++  P              C   T + + + + V P+  +++  SS    
Subjt:  WVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPR-------------CLNMTSSCKDEIE-VLPEPPTAAISSSS---

Query:  --IPEGEK-----PAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ
          +   +K     P  +TE DL+ L +LVE KDGG +WIQMMDR T  M YQAW R+ + GP +YRSRTVFEDATP M+RDFFWDDEFR  WD ML ++ 
Subjt:  --IPEGEK-----PAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQ

Query:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW
        T+E+CP+TGTM VRW+RKFPFFCSDREYVIGRRIW  G SYYCVTKGV   S+P  NK KRVDL+YSSWCIR VES++ +G  +ACEV+LFH+EDMGIP 
Subjt:  TLEDCPTTGTMTVRWVRKFPFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPW

Query:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASE--APLSNCARIANINTKVSTDSLRCSEDASDDSTEVK-SLEPSEKPAGKNLGKMLMVGGAIALACSLD
        EIAKLGV++GMWGAVKKM+P LRAYQ HR S+    LS  A +A INTK++ D L    + +   TE   +L    + A  NL K+L++GGA+A+ CSL 
Subjt:  EIAKLGVRKGMWGAVKKMDPALRAYQKHRASE--APLSNCARIANINTKVSTDSLRCSEDASDDSTEVK-SLEPSEKPAGKNLGKMLMVGGAIALACSLD

Query:  HG-LLTKAVVFGVARRFSNIGKR
         G  +  A + G  +RF N G++
Subjt:  HG-LLTKAVVFGVARRFSNIGKR
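
In the alignments above, the unlabeled middle line of each block marks identical residues with the residue letter and conservative substitutions with "+". The following is a rough sketch of how identities and positives could be tallied from such a midline. The function name midline_stats is hypothetical, and the choice of denominator (the full midline length, gaps included) is an assumption; the % identity values reported in this record come from the search tool over the whole alignment and may be computed differently.

```python
def midline_stats(midline: str) -> dict:
    """Count identities and positives in a BLAST-style alignment midline.

    Letters mark identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps.
    """
    length = len(midline)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    pct = 100.0 * identities / length if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": pct,
    }


# Final block of the first GenBank hit in this record: fully conserved.
print(midline_stats("LLTKAVVFGVARRFSNIGKR"))

# Hypothetical toy midline (not taken from this record) with one '+'
# and one mismatch.
print(midline_stats("MK+ L"))
```

A per-block figure like this only describes that block. To compare with the reported % identity, the midlines of all blocks of a hit would need to be concatenated first, with their original spacing preserved.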


Sequences
CDS sequence
ATGGCGGTGGATTTGGCGGAGGCTCCCACTGTTTATGATTTAGTGTTTAAGGGTTTGATGTTTATTGCTCGTTTGTGGGTGGGGGTTATTGTGGGGGTGTTGGTGGGATG
GATTTGGAAGCCTAGATGGGCGGATTTTGTTAAGAATCTGTTTGGTTCTTCAAAGATGAAGGATAATTTGCCTTCGTGTAGCGTTGTCGGATCGATTTCTAGCTTGAATT
CGTTGATGATTCAACTACCCAGATGCTTGAATATGACTTCAAGTTGTAAGGATGAAATTGAAGTTCTACCTGAGCCACCTACTGCTGCCATCTCCAGTTCATCAATCCCC
GAAGGCGAAAAACCAGCCCAGTTAACTGAAGAGGATTTGAAGCGTTTATACAGGCTTGTTGAGGAGAAAGATGGAGGACCCTCATGGATTCAGATGATGGATCGCTCTAC
TTCTACCATGAACTATCAAGCTTGGCGGCGAGACCTTGAGACGGGGCCACCACAATATCGAAGCCGAACGGTTTTTGAGGATGCAACGCCTCAGATGGTGAGGGACTTCT
TTTGGGATGATGAATTTAGACAGAAATGGGATGACATGCTTATAAGTACTCAAACTTTAGAAGATTGTCCTACCACAGGGACGATGACGGTCCGCTGGGTGCGCAAGTTC
CCTTTCTTCTGTAGTGATCGAGAATACGTGATCGGACGAAGAATTTGGGAATCGGGACAGTCATATTACTGTGTGACTAAGGGTGTACCTTGTTCCTCGGTACCTAGACA
AAACAAGCCAAAACGTGTCGATCTCTACTATTCAAGTTGGTGCATCCGTGCAGTTGAGTCGAAAAAGGGGAATGGCCAGTTAACTGCATGTGAGGTGGTACTGTTTCACT
ATGAAGACATGGGTATTCCATGGGAAATCGCAAAGCTCGGAGTCAGGAAGGGCATGTGGGGAGCCGTGAAGAAGATGGATCCCGCTTTACGTGCATATCAAAAGCATAGA
GCATCCGAAGCCCCGCTTTCCAACTGCGCACGAATCGCCAATATCAACACAAAGGTCAGTACCGACTCCCTGAGATGCTCAGAAGATGCATCCGATGATTCCACAGAGGT
CAAAAGCTTGGAGCCCTCTGAAAAACCAGCAGGAAAGAACTTGGGAAAGATGCTTATGGTCGGTGGAGCAATCGCTCTAGCGTGCAGTCTCGATCATGGTCTGTTAACCA
AAGCAGTTGTATTCGGAGTTGCTCGAAGATTTTCGAACATTGGAAAGAGATGA
mRNA sequence
AAAATTTCACAGAGTTGAAAGCAAACAAGAAGAAGAAAAAAAAAAGTTTGAAACGTGATAAATACAAACGCGGTGATAAAAAACGCGAAACCCTATCTTGGAAATAGAAA
TTGTCCATTTCAATGGCGATTTAGAGAGCGACCGACCAGACCCATTTGGCCTTTCACCCTAAGATCGGCGAAAACCCTTTAATCCCTTTTGGGATTTTGAGAGATCCCCT
TATTTGTTTCTGTATTTAAGCAACGAATTGAGAAGTTTTTTTTGAGTTCATCAGCCCCAAAATTTGTCCATTACCAGAGAAAATTTATCTGGTTCTTGTTAGGGTTCGTC
TTTGTTTGCTCTGTTATGGTTCCTTTGCGTTTCTCAGTTGTTTGATTCTCGGTTCTGTTTGTTGTTTGTTTGCCCTGTATACCCAATTCGTTTGATCTTATAGCTCTTTG
GATTTGTGGGTTGTTTTGATTTTTTGATTTTGGGGGCTGATGGCGGTGGATTTGGCGGAGGCTCCCACTGTTTATGATTTAGTGTTTAAGGGTTTGATGTTTATTGCTCG
TTTGTGGGTGGGGGTTATTGTGGGGGTGTTGGTGGGATGGATTTGGAAGCCTAGATGGGCGGATTTTGTTAAGAATCTGTTTGGTTCTTCAAAGATGAAGGATAATTTGC
CTTCGTGTAGCGTTGTCGGATCGATTTCTAGCTTGAATTCGTTGATGATTCAACTACCCAGATGCTTGAATATGACTTCAAGTTGTAAGGATGAAATTGAAGTTCTACCT
GAGCCACCTACTGCTGCCATCTCCAGTTCATCAATCCCCGAAGGCGAAAAACCAGCCCAGTTAACTGAAGAGGATTTGAAGCGTTTATACAGGCTTGTTGAGGAGAAAGA
TGGAGGACCCTCATGGATTCAGATGATGGATCGCTCTACTTCTACCATGAACTATCAAGCTTGGCGGCGAGACCTTGAGACGGGGCCACCACAATATCGAAGCCGAACGG
TTTTTGAGGATGCAACGCCTCAGATGGTGAGGGACTTCTTTTGGGATGATGAATTTAGACAGAAATGGGATGACATGCTTATAAGTACTCAAACTTTAGAAGATTGTCCT
ACCACAGGGACGATGACGGTCCGCTGGGTGCGCAAGTTCCCTTTCTTCTGTAGTGATCGAGAATACGTGATCGGACGAAGAATTTGGGAATCGGGACAGTCATATTACTG
TGTGACTAAGGGTGTACCTTGTTCCTCGGTACCTAGACAAAACAAGCCAAAACGTGTCGATCTCTACTATTCAAGTTGGTGCATCCGTGCAGTTGAGTCGAAAAAGGGGA
ATGGCCAGTTAACTGCATGTGAGGTGGTACTGTTTCACTATGAAGACATGGGTATTCCATGGGAAATCGCAAAGCTCGGAGTCAGGAAGGGCATGTGGGGAGCCGTGAAG
AAGATGGATCCCGCTTTACGTGCATATCAAAAGCATAGAGCATCCGAAGCCCCGCTTTCCAACTGCGCACGAATCGCCAATATCAACACAAAGGTCAGTACCGACTCCCT
GAGATGCTCAGAAGATGCATCCGATGATTCCACAGAGGTCAAAAGCTTGGAGCCCTCTGAAAAACCAGCAGGAAAGAACTTGGGAAAGATGCTTATGGTCGGTGGAGCAA
TCGCTCTAGCGTGCAGTCTCGATCATGGTCTGTTAACCAAAGCAGTTGTATTCGGAGTTGCTCGAAGATTTTCGAACATTGGAAAGAGATGATGGCTGCAACAAATCTCA
CAATGAGATTGAGATTAGCATTTGTTTCTTTGACATACAACCTATTTGAAGCAAGTGTTGGAAGGTGGTAAATGTTATATTTTTCCCTCAATAGTTGTAAAGTGAAGGTT
TAGAATGGGGATGAGATTAGGTAGGCTTTATCACCTTGAACTTTATGGTTGTATCTCATATCAGTAACAAGATAAATGTGATTGCAAAAGGAGCCTAAGACTTGGTAAGG
TCCAATATACATCTGTCACTGGATATATCATAATAAACTTTGTAAAGGACAGGAGAAAAAAAGCATTAAAGTTCAAGATTGTTATCATTATAAACTTAAAATTTTAAGTT
GATCCATTCGTTCCCTAAACTTTGTCAAGGGTTGTTGTAGTTAATTTTTAAAACTGTTTGATAA
Protein sequence
MAVDLAEAPTVYDLVFKGLMFIARLWVGVIVGVLVGWIWKPRWADFVKNLFGSSKMKDNLPSCSVVGSISSLNSLMIQLPRCLNMTSSCKDEIEVLPEPPTAAISSSSIP
EGEKPAQLTEEDLKRLYRLVEEKDGGPSWIQMMDRSTSTMNYQAWRRDLETGPPQYRSRTVFEDATPQMVRDFFWDDEFRQKWDDMLISTQTLEDCPTTGTMTVRWVRKF
PFFCSDREYVIGRRIWESGQSYYCVTKGVPCSSVPRQNKPKRVDLYYSSWCIRAVESKKGNGQLTACEVVLFHYEDMGIPWEIAKLGVRKGMWGAVKKMDPALRAYQKHR
ASEAPLSNCARIANINTKVSTDSLRCSEDASDDSTEVKSLEPSEKPAGKNLGKMLMVGGAIALACSLDHGLLTKAVVFGVARRFSNIGKR
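
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated in this record. The function name translate is a hypothetical helper; only the first 30 bases and the last 12 bases of the CDS are used here to keep the example short.

```python
# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon.

    Unrecognized codons (e.g. containing N) become 'X'; a trailing
    partial codon is ignored.
    """
    seq = cds.upper().replace("\n", "").replace(" ", "")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")
        for i in range(0, len(seq) - len(seq) % 3, 3)
    )


# First 30 bases of the CDS listed in this record.
print(translate("ATGGCGGTGGATTTGGCGGAGGCTCCCACT"))
# Last 12 bases of the CDS, ending in the stop codon.
print(translate("GGAAAGAGATGA"))
```

The first call yields MAVDLAEAPT and the second yields GKR*, matching the start and end of the protein sequence listed above. Translating the full CDS the same way should reproduce the whole protein followed by a single terminal stop.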