CuGenDBv2

Carg04104 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg04104
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like
Genome location: Carg_Chr19:7665784..7673285
RNA-Seq Expression: Carg04104
Synteny: Carg04104
Gene Ontology terms:
GO:0006869 - lipid transport (biological process)
GO:0009706 - chloroplast inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0005319 - lipid transporter activity (molecular function)
GO:0005543 - phospholipid binding (molecular function)
InterPro domains:
IPR003399 - Mce/MlaD
IPR039342 - Protein TRIGALACTOSYLDIACYLGLYCEROL 2-like
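
The Genome location field packs a sequence name and a coordinate range into one string. The following is a minimal Python sketch of how such a string could be split for downstream use. The function names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Assumed layout: "<sequence name>:<start>..<end>", as in the Genome location field.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Carg_Chr19:7665784..7673285")
print(seqid, start, end, span_length(start, end))
```

If the coordinates turn out to follow a different convention (for example 0-based, half-open), only `span_length` would need to change.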


Homology
GenBank top hits (e value, % identity, alignment)
KAG6572181.1 hypothetical protein SDJN03_28909, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.9e-181; % identity: 100)
Query:  MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE
        MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE
Subjt:  MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE

Query:  WDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCY
        WDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCY
Subjt:  WDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCY

Query:  VVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF
        VVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF
Subjt:  VVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF

Query:  DGSQINVIQELLADNVQKSVRTESELK
        DGSQINVIQELLADNVQKSVRTESELK
Subjt:  DGSQINVIQELLADNVQKSVRTESELK

KAG7011820.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 100)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIEPWLEKAMDDDGHLLSYHWEALTGFFVRISMAFASSSSVICQNRALSS
        DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIEPWLEKAMDDDGHLLSYHWEALTGFFVRISMAFASSSSVICQNRALSS
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIEPWLEKAMDDDGHLLSYHWEALTGFFVRISMAFASSSSVICQNRALSS

Query:  SVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELP
        SVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELP
Subjt:  SVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELP

Query:  EFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELE
        EFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELE
Subjt:  EFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELE

Query:  HCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQK
        HCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQK
Subjt:  HCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQK

Query:  SVRTESELKEISIANETPA
        SVRTESELKEISIANETPA
Subjt:  SVRTESELKEISIANETPA

XP_022952170.1 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X1 [Cucurbita moschata] (e value: 6.2e-187; % identity: 98.02)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        MVGDARVQVAAFSVALPSSLVTLPH+SSSRLSY LPFGLKSKVKKIKATSADAGHSQPPSSSER NPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIET+VEVEDDKIIIPLNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPS GPLDSECIKEGLILCDKHKMKG QGVSLDALVGIFTRLGREAEEIGL+NTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
        DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE

XP_022969129.1 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X1 [Cucurbita maxima] (e value: 3.2e-183; % identity: 96.32)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        MVGD  VQVA FSVALPSSLVTLPHRSS+RLSY LPFGLKSKVKKIKATSA AGHSQPPSSSERRNPLALFLDVPRT+WRRTL PLSNFGFGRRSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPS GPLDS+CIKEGLILCDKHKMKG QGVSLDALVGIFTRLGREAEEIGL+NTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
        DSGL+KEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE

XP_023554510.1 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.5e-188; % identity: 99.15)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSY LPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKG QGVSLDALVGIFTRLGREAEEIGL+NTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
        DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BRQ6 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic isoform X1 (e value: 1.0e-174; % identity: 92.07)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        MVGD RVQV   SV LPSSLVTLPHRSS+RLSY LP G KSKVK+IKATSADAGHSQPPSSSERRNPL+LFLDVPRTVWR+TLRPLSNFGFGRRSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQA GIS GTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIP NSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPSAGPLD ECI+EGLILCDK KMKG QGVSLDALVGIFTRLGREAEEIGL+NTF LAQRVALV+EEA+PLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
        DSGLLKEVE+LT SLSHATEDLRSV ASI+TPENTELLQKS+YTLIHTLKNIE
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE

A0A6J1GJI3 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X1 (e value: 3.0e-187; % identity: 98.02)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        MVGDARVQVAAFSVALPSSLVTLPH+SSSRLSY LPFGLKSKVKKIKATSADAGHSQPPSSSER NPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIET+VEVEDDKIIIPLNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPS GPLDSECIKEGLILCDKHKMKG QGVSLDALVGIFTRLGREAEEIGL+NTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
        DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE

A0A6J1GKV0 uncharacterized protein LOC111454926 (e value: 8.8e-179; % identity: 98.78)
Query:  MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE
        MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNR LSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE
Subjt:  MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE

Query:  WDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCY
        WDGYGA+FSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGN IGSNDEVTAFAYQRSGCY
Subjt:  WDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCY

Query:  VVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF
        VV+WPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF
Subjt:  VVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF

Query:  DGSQINVIQELLADNVQKSVRTESELK
        DGSQINVIQELLADNVQKSVRTESELK
Subjt:  DGSQINVIQELLADNVQKSVRTESELK

A0A6J1HVH4 uncharacterized protein LOC111468217 (e value: 4.4e-178; % identity: 98.17)
Query:  MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE
        MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNR LS+NIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE
Subjt:  MAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGE

Query:  WDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCY
        WDGYGA+FSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERN GNGIGSNDEVTAFAYQRSGCY
Subjt:  WDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCY

Query:  VVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF
        VV+WPVKV GSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF
Subjt:  VVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARF

Query:  DGSQINVIQELLADNVQKSVRTESELK
        DGSQINVIQELLADNVQKSVRTESELK
Subjt:  DGSQINVIQELLADNVQKSVRTESELK

A0A6J1HZ36 protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic-like isoform X1 (e value: 1.6e-183; % identity: 96.32)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        MVGD  VQVA FSVALPSSLVTLPHRSS+RLSY LPFGLKSKVKKIKATSA AGHSQPPSSSERRNPLALFLDVPRT+WRRTL PLSNFGFGRRSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDITPRDPIPVPS GPLDS+CIKEGLILCDKHKMKG QGVSLDALVGIFTRLGREAEEIGL+NTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
        DSGL+KEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE

SwissProt top hits (e value, % identity, alignment)
P46315 Uncharacterized protein ycf22 (e value: 1.2e-10; % identity: 29.25)
Query:  KFRK---YLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETMIDITPRDPI---PVPSAGPLDSE
        K++K   Y    EF  A GI  GT V +RGV +G +  +  +   +   + ++ +KI+IP NS++E NQ+ L   T+IDI P + I    +      +  
Subjt:  KFRK---YLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETMIDITPRDPI---PVPSAGPLDSE

Query:  CIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSL
        C  +  I C+   + G +G++ D L+   TR+ +  ++    N F L
Subjt:  CIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSL

P51372 Uncharacterized protein ycf22 (e value: 2.7e-15; % identity: 35.04)
Query:  SKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETMIDITPRDPIPVP----SAGPLDSEC
        SK + Y    EF  A GI  GT VR+RG+ +G V+ ++ S   I T +E++    IIP  SL+E NQ+GLL +T+IDI P   +         GPL   C
Subjt:  SKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETMIDITPRDPIPVP----SAGPLDSEC

Query:  IKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEE
             I+C  + ++G +G++ D L+   TR+ +  ++
Subjt:  IKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEE

Q1XDB5 Uncharacterized protein ycf22 (e value: 7.5e-18; % identity: 35.19)
Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        +G   +    LL++SL W        K   Y A  EF  A GI  GT VR+RG+ VG V+ ++ S   I T +E++    IIP  SL+E NQ+GLL +T+
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVP----SAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEE
        IDI P   + +      AGPL   C     I+C+ + +KG +G++ D L+   TR+ +  ++
Subjt:  IDITPRDPIPVP----SAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEE

Q9LTR2 Protein TRIGALACTOSYLDIACYLGLYCEROL 2, chloroplastic (e value: 3.2e-125; % identity: 67.14)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        M+G+  +QV + S+   SS++  P  S + + Y  P      +    A+++DA H Q PSS   +NPL + LDVPR +WR+TL+PLS+FGFG+RSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLF+VSGA LL LS AWLRGFQ+RSKFRKY  VFE + A GI TGTPVRIRGVTVG +IRVNPSL+ IE V E+EDDKIIIP NSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDI PR+PIP PS GPL  EC KEGLI+CD+  +KG QGVSLD LVGIFTR+GRE E IG++NT+SLA+R A V+EEARPLL KIQAMAED QPLL+E R
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
        DSGLLKEVE LT SL+ A++DLR V++SIMTPENTEL+QKSIYTL++TLKN+E
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE

Arabidopsis top hits (e value, % identity, alignment)
AT3G20320.1 trigalactosyldiacylglycerol2 (e value: 2.3e-126; % identity: 67.14)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        M+G+  +QV + S+   SS++  P  S + + Y  P      +    A+++DA H Q PSS   +NPL + LDVPR +WR+TL+PLS+FGFG+RSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLF+VSGA LL LS AWLRGFQ+RSKFRKY  VFE + A GI TGTPVRIRGVTVG +IRVNPSL+ IE V E+EDDKIIIP NSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR
        IDI PR+PIP PS GPL  EC KEGLI+CD+  +KG QGVSLD LVGIFTR+GRE E IG++NT+SLA+R A V+EEARPLL KIQAMAED QPLL+E R
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVR

Query:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE
        DSGLLKEVE LT SL+ A++DLR V++SIMTPENTEL+QKSIYTL++TLKN+E
Subjt:  DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE

AT3G20320.2 trigalactosyldiacylglycerol2 (e value: 2.4e-96; % identity: 64.79)
Query:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG
        M+G+  +QV + S+   SS++  P  S + + Y  P      +    A+++DA H Q PSS   +NPL + LDVPR +WR+TL+PLS+FGFG+RSIWEGG
Subjt:  MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGG

Query:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM
        VGLF+VSGA LL LS AWLRGFQ+RSKFRKY  VFE + A GI TGTPVRIRGVTVG +IRVNPSL+ IE V E+EDDKIIIP NSLVEVNQSGLLMETM
Subjt:  VGLFLVSGAILLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETM

Query:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLK
        IDI PR+PIP PS GPL  EC KEGLI+CD+  +KG QGVSLD LVGIFTR+GRE E IG++NT+SLA+R A V+EEARPLL K
Subjt:  IDITPRDPIPVPSAGPLDSECIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLK

AT4G38225.1 unknown protein (e value: 8.7e-78; % identity: 53.55)
Query:  PLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAE
        P SS++      ++S  P S  A+++   A  QSQ +     + WS+FA+NVSGEWDG+GA+F+  G P+ELPE VVP+A+REWEVKVFDWQTQCPTLA+
Subjt:  PLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAE

Query:  PEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRV-EGTRL
        P   SF+YK+IKLLPTVGCEADAATRYSID+R +G G  S     AF+Y  +G YV +WP++       +E+EHCL++P+D+ESRVR+ QVV + E T +
Subjt:  PEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRV-EGTRL

Query:  VLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQKSVRTESEL
         LQS+KVFCEQWYGPFR+G+QLGGCAIR S FA+T    AS V GSW+  ++   F  S    IQ++  + V + VR E++L
Subjt:  VLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQKSVRTESEL

AT4G38225.2 unknown protein (e value: 2.0e-74; % identity: 55.6)
Query:  PLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAE
        P SS++      ++S  P S  A+++   A  QSQ +     + WS+FA+NVSGEWDG+GA+F+  G P+ELPE VVP+A+REWEVKVFDWQTQCPTLA+
Subjt:  PLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAE

Query:  PEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRV-EGTRL
        P   SF+YK+IKLLPTVGCEADAATRYSID+R +G G  S     AF+Y  +G YV +WP++       +E+EHCL++P+D+ESRVR+ QVV + E T +
Subjt:  PEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRV-EGTRL

Query:  VLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGS
         LQS+KVFCEQWYGPFR+G+QLGGCAIR S FA+T    AS V GSW+  ++   F  S
Subjt:  VLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGS

AT4G38225.3 unknown protein (e value: 8.7e-78; % identity: 53.55)
Query:  PLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAE
        P SS++      ++S  P S  A+++   A  QSQ +     + WS+FA+NVSGEWDG+GA+F+  G P+ELPE VVP+A+REWEVKVFDWQTQCPTLA+
Subjt:  PLSSNIGINAAPSLSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAE

Query:  PEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRV-EGTRL
        P   SF+YK+IKLLPTVGCEADAATRYSID+R +G G  S     AF+Y  +G YV +WP++       +E+EHCL++P+D+ESRVR+ QVV + E T +
Subjt:  PEKPSFMYKTIKLLPTVGCEADAATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRV-EGTRL

Query:  VLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQKSVRTESEL
         LQS+KVFCEQWYGPFR+G+QLGGCAIR S FA+T    AS V GSW+  ++   F  S    IQ++  + V + VR E++L
Subjt:  VLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFASTAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQKSVRTESEL
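
In the alignments above, the unlabelled row between each Query and Subjt row appears to follow the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. Under that assumption, the percent identity of a single alignment segment can be estimated as sketched below in Python. The function name `percent_identity` is hypothetical, and the rows are assumed to have had their `Query:` label and leading padding removed. A per-segment figure need not equal the hit-level % identity listed for the whole alignment.

```python
def percent_identity(query_row, match_row):
    """Percent of aligned columns whose match-row character is a letter.

    Assumes BLAST-style match rows: letter = identical residue,
    '+' = conservative substitution, blank = mismatch or gap.
    """
    if not query_row:
        raise ValueError("empty alignment row")
    # Trailing blanks in the match row are easily lost when copying; pad them back.
    match_row = match_row.ljust(len(query_row))
    identical = sum(1 for ch in match_row[:len(query_row)] if ch.isalpha())
    return 100.0 * identical / len(query_row)


# Final segment of the XP_022969129.1 alignment (labels stripped).
query = "DSGLLKEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE"
match = "DSGL+KEVENLTSSLSHATEDLRSVHASIMTPENTELLQKSIYTLIHTLKNIE"
print(round(percent_identity(query, match), 2))
```

To approximate a hit-level value, the identical-column counts and segment lengths would be summed across all segments of the hit before dividing.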


Sequences
CDS sequence
ATGGTGGGAGATGCTCGTGTGCAGGTTGCAGCATTTTCTGTGGCGTTACCCTCATCCTTGGTTACCCTTCCACATAGATCTTCAAGTAGGTTGTCTTATCGTCTCCCATT
CGGTCTAAAATCCAAAGTAAAGAAGATAAAGGCTACATCTGCTGATGCTGGACATAGTCAGCCACCCTCATCTTCAGAAAGAAGGAATCCGCTCGCCCTTTTTCTCGATG
TTCCTCGTACTGTATGGAGGAGAACCTTGCGTCCATTGAGTAATTTCGGGTTCGGTCGAAGGAGCATATGGGAAGGTGGGGTTGGGTTGTTCCTTGTGTCAGGTGCTATA
CTTCTTACACTCAGTTTGGCTTGGTTGAGAGGCTTTCAATTACGGTCTAAATTTAGAAAATACTTGGCTGTCTTTGAGTTTGCTCAGGCTTGTGGTATTTCGACTGGAAC
TCCTGTAAGAATTAGAGGAGTTACTGTTGGCAATGTCATTCGTGTTAACCCTTCCTTGAGATGTATTGAAACTGTTGTTGAGGTTGAAGATGATAAAATCATTATACCCC
TTAATTCATTGGTTGAAGTAAACCAGTCTGGTCTTCTTATGGAGACGATGATTGACATTACCCCCAGGGATCCTATTCCAGTGCCTTCAGCGGGACCACTCGACTCCGAA
TGTATTAAAGAAGGCTTAATATTATGTGATAAGCATAAAATGAAGGGACGTCAAGGGGTAAGTTTGGATGCATTAGTTGGAATATTCACTCGGCTTGGACGCGAAGCGGA
GGAAATAGGGCTTTCTAATACGTTTTCTTTAGCCCAACGAGTTGCTTTGGTTGTTGAAGAAGCAAGGCCTTTGCTTTTAAAGATTCAAGCCATGGCTGAAGATGTTCAAC
CTTTGCTTGCTGAGGTTCGTGACAGTGGTCTTCTAAAGGAGGTTGAAAACTTAACTAGCAGTCTTTCACATGCCACAGAAGATTTAAGAAGCGTGCATGCGTCGATTATG
ACCCCGGAAAACACAGAGCTTCTTCAGAAGTCCATATATACGCTAATTCATACTTTGAAGAACATAGAGCCTTGGCTGGAAAAAGCTATGGATGATGATGGCCATTTACT
TTCCTATCATTGGGAAGCGCTTACAGGTTTCTTTGTTAGGATTTCAATGGCGTTCGCTTCTTCTAGTAGTGTTATCTGTCAGAACAGAGCCTTGTCGTCCTCCGTCGTTT
CTTCTCCGGGACTTTTACACCATCGCTGCTTCTCACGGCTTCAATCGCAGCGTATTCTTCATTGCAATCGTCCTTTGTCTTCGAACATCGGGATAAACGCCGCTCCGTCT
CTTTCTTCGGCGCCCTCTTCCGTCGTCGCTAAAACTGCTCTATCCGATGCTCATGTTCAAAGTCAGAGCTCCAGTTCTGCTCCTGGTAGTGGGTGGTCTGATTTTGCCAA
AAACGTCTCTGGCGAATGGGATGGATATGGTGCGGAATTTTCTTCTGGAGGAACGCCAATTGAACTTCCAGAATTCGTTGTCCCCGATGCTTATAGGGAATGGGAGGTTA
AGGTTTTCGACTGGCAGACTCAGTGCCCCACTCTTGCGGAACCTGAGAAGCCCTCTTTCATGTACAAGACAATAAAGCTACTTCCTACAGTGGGATGTGAAGCCGATGCT
GCAACCCGTTACAGCATTGATGAGAGAAATGTTGGAAATGGAATTGGTTCAAATGATGAAGTGACTGCCTTTGCGTATCAACGTAGTGGATGTTATGTAGTTCTTTGGCC
GGTTAAGGTTGTGGGTTCTTATAAGTTAATGGAGTTGGAGCATTGCCTGGTTAGTCCTCAAGATCGTGAATCCCGTGTGAGGGTTGTTCAGGTTGTCCGAGTCGAAGGCA
CACGGCTAGTGTTGCAGAGTATCAAAGTTTTCTGCGAGCAGTGGTATGGACCATTCAGAAACGGAGAACAGCTTGGTGGATGCGCCATCCGAGACTCATCATTTGCTTCT
ACAGCTGCCTTGAAAGCTTCTGAGGTTGTTGGTTCATGGCAGGGTCCTGTCTCTGTTGCCCGTTTTGATGGTTCTCAGATTAATGTTATACAAGAACTTTTGGCTGACAA
TGTGCAAAAGTCGGTGAGAACTGAATCAGAACTCAAGGAAATATCGATTGCAAATGAGACCCCTGCTTAA
mRNA sequence
ATGGTGGGAGATGCTCGTGTGCAGGTTGCAGCATTTTCTGTGGCGTTACCCTCATCCTTGGTTACCCTTCCACATAGATCTTCAAGTAGGTTGTCTTATCGTCTCCCATT
CGGTCTAAAATCCAAAGTAAAGAAGATAAAGGCTACATCTGCTGATGCTGGACATAGTCAGCCACCCTCATCTTCAGAAAGAAGGAATCCGCTCGCCCTTTTTCTCGATG
TTCCTCGTACTGTATGGAGGAGAACCTTGCGTCCATTGAGTAATTTCGGGTTCGGTCGAAGGAGCATATGGGAAGGTGGGGTTGGGTTGTTCCTTGTGTCAGGTGCTATA
CTTCTTACACTCAGTTTGGCTTGGTTGAGAGGCTTTCAATTACGGTCTAAATTTAGAAAATACTTGGCTGTCTTTGAGTTTGCTCAGGCTTGTGGTATTTCGACTGGAAC
TCCTGTAAGAATTAGAGGAGTTACTGTTGGCAATGTCATTCGTGTTAACCCTTCCTTGAGATGTATTGAAACTGTTGTTGAGGTTGAAGATGATAAAATCATTATACCCC
TTAATTCATTGGTTGAAGTAAACCAGTCTGGTCTTCTTATGGAGACGATGATTGACATTACCCCCAGGGATCCTATTCCAGTGCCTTCAGCGGGACCACTCGACTCCGAA
TGTATTAAAGAAGGCTTAATATTATGTGATAAGCATAAAATGAAGGGACGTCAAGGGGTAAGTTTGGATGCATTAGTTGGAATATTCACTCGGCTTGGACGCGAAGCGGA
GGAAATAGGGCTTTCTAATACGTTTTCTTTAGCCCAACGAGTTGCTTTGGTTGTTGAAGAAGCAAGGCCTTTGCTTTTAAAGATTCAAGCCATGGCTGAAGATGTTCAAC
CTTTGCTTGCTGAGGTTCGTGACAGTGGTCTTCTAAAGGAGGTTGAAAACTTAACTAGCAGTCTTTCACATGCCACAGAAGATTTAAGAAGCGTGCATGCGTCGATTATG
ACCCCGGAAAACACAGAGCTTCTTCAGAAGTCCATATATACGCTAATTCATACTTTGAAGAACATAGAGCCTTGGCTGGAAAAAGCTATGGATGATGATGGCCATTTACT
TTCCTATCATTGGGAAGCGCTTACAGGTTTCTTTGTTAGGATTTCAATGGCGTTCGCTTCTTCTAGTAGTGTTATCTGTCAGAACAGAGCCTTGTCGTCCTCCGTCGTTT
CTTCTCCGGGACTTTTACACCATCGCTGCTTCTCACGGCTTCAATCGCAGCGTATTCTTCATTGCAATCGTCCTTTGTCTTCGAACATCGGGATAAACGCCGCTCCGTCT
CTTTCTTCGGCGCCCTCTTCCGTCGTCGCTAAAACTGCTCTATCCGATGCTCATGTTCAAAGTCAGAGCTCCAGTTCTGCTCCTGGTAGTGGGTGGTCTGATTTTGCCAA
AAACGTCTCTGGCGAATGGGATGGATATGGTGCGGAATTTTCTTCTGGAGGAACGCCAATTGAACTTCCAGAATTCGTTGTCCCCGATGCTTATAGGGAATGGGAGGTTA
AGGTTTTCGACTGGCAGACTCAGTGCCCCACTCTTGCGGAACCTGAGAAGCCCTCTTTCATGTACAAGACAATAAAGCTACTTCCTACAGTGGGATGTGAAGCCGATGCT
GCAACCCGTTACAGCATTGATGAGAGAAATGTTGGAAATGGAATTGGTTCAAATGATGAAGTGACTGCCTTTGCGTATCAACGTAGTGGATGTTATGTAGTTCTTTGGCC
GGTTAAGGTTGTGGGTTCTTATAAGTTAATGGAGTTGGAGCATTGCCTGGTTAGTCCTCAAGATCGTGAATCCCGTGTGAGGGTTGTTCAGGTTGTCCGAGTCGAAGGCA
CACGGCTAGTGTTGCAGAGTATCAAAGTTTTCTGCGAGCAGTGGTATGGACCATTCAGAAACGGAGAACAGCTTGGTGGATGCGCCATCCGAGACTCATCATTTGCTTCT
ACAGCTGCCTTGAAAGCTTCTGAGGTTGTTGGTTCATGGCAGGGTCCTGTCTCTGTTGCCCGTTTTGATGGTTCTCAGATTAATGTTATACAAGAACTTTTGGCTGACAA
TGTGCAAAAGTCGGTGAGAACTGAATCAGAACTCAAGGAAATATCGATTGCAAATGAGACCCCTGCTTAA
Protein sequence
MVGDARVQVAAFSVALPSSLVTLPHRSSSRLSYRLPFGLKSKVKKIKATSADAGHSQPPSSSERRNPLALFLDVPRTVWRRTLRPLSNFGFGRRSIWEGGVGLFLVSGAI
LLTLSLAWLRGFQLRSKFRKYLAVFEFAQACGISTGTPVRIRGVTVGNVIRVNPSLRCIETVVEVEDDKIIIPLNSLVEVNQSGLLMETMIDITPRDPIPVPSAGPLDSE
CIKEGLILCDKHKMKGRQGVSLDALVGIFTRLGREAEEIGLSNTFSLAQRVALVVEEARPLLLKIQAMAEDVQPLLAEVRDSGLLKEVENLTSSLSHATEDLRSVHASIM
TPENTELLQKSIYTLIHTLKNIEPWLEKAMDDDGHLLSYHWEALTGFFVRISMAFASSSSVICQNRALSSSVVSSPGLLHHRCFSRLQSQRILHCNRPLSSNIGINAAPS
LSSAPSSVVAKTALSDAHVQSQSSSSAPGSGWSDFAKNVSGEWDGYGAEFSSGGTPIELPEFVVPDAYREWEVKVFDWQTQCPTLAEPEKPSFMYKTIKLLPTVGCEADA
ATRYSIDERNVGNGIGSNDEVTAFAYQRSGCYVVLWPVKVVGSYKLMELEHCLVSPQDRESRVRVVQVVRVEGTRLVLQSIKVFCEQWYGPFRNGEQLGGCAIRDSSFAS
TAALKASEVVGSWQGPVSVARFDGSQINVIQELLADNVQKSVRTESELKEISIANETPA
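
The protein sequence listed here should correspond to a translation of the CDS sequence. A minimal Python sketch for checking that relationship follows. It assumes the standard nuclear genetic code, which the record does not state explicitly, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 24 nucleotides of the CDS sequence listed above.
print(translate("ATGGTGGGAGATGCTCGTGTGCAG"))
```

Passing the full CDS (with line breaks left in) and removing the trailing `*` from the result gives a string that can be compared directly with the protein sequence.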