; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg05582 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg05582
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome locationCarg_Chr17:2407226..2410078
RNA-Seq ExpressionCarg05582
SyntenyCarg05582
Gene Ontology termsGO:0001522 - pseudouridine synthesis (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575159.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.5e-266100Show/hide
Query:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
        MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
Subjt:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV

Query:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
        QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF
        PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF
Subjt:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF

Query:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
        GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
Subjt:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF

Query:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
        SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
Subjt:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN

XP_022959177.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita moschata]1.7e-26298.09Show/hide
Query:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
        MM+KLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPS+TPSHGFSLQRVFSIPFSDSGSATLLGQFNV
Subjt:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV

Query:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
        QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF
        PSSLVIDLGSADTDSGLSYHLSMHHNAGSPS+SGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFL+NP VSLSGIIGAVATSYF
Subjt:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF

Query:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
        GDISAGSAAEGSLQEF+GLY+Q SRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
Subjt:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF

Query:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
        SFQQQIAGPVSFRADSGVAIDLSK GWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
Subjt:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN

XP_023006570.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita maxima]1.2e-25896.82Show/hide
Query:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
        MM+KLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
Subjt:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV

Query:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
        QKFVSSLKKSGFGEMGQS+SSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSD+LRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF
        PSSLVIDLGSA TDSGLSYHLSMHHNAGSPS+SGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNP VSLSGIIGAVATSYF
Subjt:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF

Query:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
         DIS GSAAEGSLQEFKGL++Q SRIRSTVFADVFASISFSAQYGMFQ NFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
Subjt:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF

Query:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
        SFQQQIAGPVSFRAD+GVAIDLSK GWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQ EFMVELRFYEN
Subjt:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN

XP_023548884.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo]2.5e-26197.88Show/hide
Query:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
        MM+KLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
Subjt:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV

Query:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
        QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF
        PSSLVIDLGSA TDSGLSYHLSMHHNAGSPS+SGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNP VSLSGIIGAVATSYF
Subjt:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF

Query:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
        GDISAGSAAEGSLQEFKGLY+Q SRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
Subjt:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF

Query:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
        SFQQQIAGPVSFRAD+GVAIDLSK GWGSLQVEEPTFALEYAL+ LGSAKAIAWYSPKQREFMVELRFYEN
Subjt:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN

XP_038875869.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]3.8e-22584.68Show/hide
Query:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
        MM+KLRW M+GQ FWDLDVSTPRTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ FMAAPFVPSY+PSHGFSLQRVFSIPFSDSGS TLLGQFN+
Subjt:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV

Query:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
        QKF+SSLKKSG G+MGQS+SS LQ IGRHL  RSLYA GISSDILL PDD+L+ISFDGYGD++I+RTKAV HHKFLHHDLTMEA SPGLFVDKSGKYWDV
Subjt:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF
        PS+LV+DLGSA ++SGLSYHLSMH N GSPS+SGSEQ   +P CLLPGLSAKAAFA KKNLEIWRSNAKKLK VQPYDIFLS P VSLSGIIGAVAT+YF
Subjt:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF

Query:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
        GD S  SAA+ SL EFKGLY+Q SRIRSTVFADVFASISFSAQYGMFQR +LDLT FS R DFHSGSKFLSGAMLLI+DLSNS+HPRTESV+ATLP+ARF
Subjt:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF

Query:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE
        S QQQIAGPVSFRADSGVAIDL+K GWG L V+EPTFALEYAL  LGSAKAIAWYSPK REFMVELRFYE
Subjt:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE

TrEMBL top hitse value%identityAlignment
A0A0A0K824 Uncharacterized protein2.4e-21781.88Show/hide
Query:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        M+KLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPSY+PSHGFSLQRVFS+PFSDSGS TLLGQFN+Q
Subjt:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G GEM QS SSLLQ IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLT+EALSPGLF++K G+YWDVP
Subjt:  KFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYFG
        SSLV+DLGS  +DSGLSYHLSMH NAG PS+ GSE T  APFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFLS P VSLS IIGAVATSYFG
Subjt:  SSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYFG

Query:  DISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFS
        D  A SAA+ SL++FKG Y+++SRIRSTVFAD+F SISFSAQYGMFQ+ +LDLTRFS   DFHSGSKFLSG+MLLI+DLSNS+HP+TESVKATLPNARFS
Subjt:  DISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFS

Query:  FQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE
         QQQIAGPVSFRAD+GVAIDL+K GW  L+VEEPTFALEYAL+ LGSAKAIAWYSPK REFMVELRFYE
Subjt:  FQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE

A0A1S3C837 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.7e-21581.66Show/hide
Query:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        M+KLRW M+G  FWDLDVST RTLDGSASPVP+   LLPLGLSRGVRLSRAKQIDFMQ FM APFVPSY+PSHGFSLQRVFSIPFSDSGS TLLGQFN+Q
Subjt:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G GEMGQS SS +Q IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLTMEALSPGLF+DKSG+YWDVP
Subjt:  KFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYFG
        SSLV+DLGSA +DSGLSYHLSMH N G PS  GSE T  APFCL PGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFLS P VSLS IIGAVATSYFG
Subjt:  SSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYFG

Query:  DISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFS
        D    SAA+GSL EFKG Y+Q SRIRST+FAD+F SISFSAQYGMFQ+ +LDLTRFS   DFHSGSKFLSG+MLLI+DLSNS+HP+TE+VKATLPNARFS
Subjt:  DISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFS

Query:  FQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE
         QQQIAGPVSFRADSGVAIDL+K GW  L+V+EPTFALEYAL  LGSAKAIAWYSPK REFMVELRFYE
Subjt:  FQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE

A0A5D3BY40 Protein TRIGALACTOSYLDIACYLGLYCEROL 41.7e-21581.66Show/hide
Query:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        M+KLRW M+G  FWDLDVST RTLDGSASPVP+   LLPLGLSRGVRLSRAKQIDFMQ FM APFVPSY+PSHGFSLQRVFSIPFSDSGS TLLGQFN+Q
Subjt:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVP
        KF+SSL K+G GEMGQS SS +Q IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLTMEALSPGLF+DKSG+YWDVP
Subjt:  KFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVP

Query:  SSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYFG
        SSLV+DLGSA +DSGLSYHLSMH N G PS  GSE T  APFCL PGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFLS P VSLS IIGAVATSYFG
Subjt:  SSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYFG

Query:  DISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFS
        D    SAA+GSL EFKG Y+Q SRIRST+FAD+F SISFSAQYGMFQ+ +LDLTRFS   DFHSGSKFLSG+MLLI+DLSNS+HP+TE+VKATLPNARFS
Subjt:  DISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFS

Query:  FQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE
         QQQIAGPVSFRADSGVAIDL+K GW  L+V+EPTFALEYAL  LGSAKAIAWYSPK REFMVELRFYE
Subjt:  FQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE

A0A6J1H3U0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic8.4e-26398.09Show/hide
Query:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
        MM+KLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPS+TPSHGFSLQRVFSIPFSDSGSATLLGQFNV
Subjt:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV

Query:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
        QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF
        PSSLVIDLGSADTDSGLSYHLSMHHNAGSPS+SGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFL+NP VSLSGIIGAVATSYF
Subjt:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF

Query:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
        GDISAGSAAEGSLQEF+GLY+Q SRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
Subjt:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF

Query:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
        SFQQQIAGPVSFRADSGVAIDLSK GWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
Subjt:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN

A0A6J1KW75 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic5.6e-25996.82Show/hide
Query:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
        MM+KLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV
Subjt:  MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNV

Query:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
        QKFVSSLKKSGFGEMGQS+SSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSD+LRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV
Subjt:  QKFVSSLKKSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDV

Query:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF
        PSSLVIDLGSA TDSGLSYHLSMHHNAGSPS+SGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNP VSLSGIIGAVATSYF
Subjt:  PSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYF

Query:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
         DIS GSAAEGSLQEFKGL++Q SRIRSTVFADVFASISFSAQYGMFQ NFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF
Subjt:  GDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARF

Query:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN
        SFQQQIAGPVSFRAD+GVAIDLSK GWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQ EFMVELRFYEN
Subjt:  SFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYEN

SwissProt top hitse value%identityAlignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.8e-12249.38Show/hide
Query:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  EG   WDLD+STP TL+G+A  VP D   LPLGLSRG RLSR KQ++F  +FMA+P +PS++P           GFSLQRV ++PFS++   
Subjt:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNVQKFVSSLKKS-GFGEMGQS-ISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGY-GDSD-ILRTKAVLHHKFLHHDLTMEALSPG
        +LLGQF+VQ+FV+ + K+  FG    S ++S L  IG+HL+D+SLYA G  S+ LL+PDD LL+S+D Y GD D   R KA+ +H+F  H+LT EA+ PG
Subjt:  TLLGQFNVQKFVSSLKKS-GFGEMGQS-ISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGY-GDSD-ILRTKAVLHHKFLHHDLTMEALSPG

Query:  LFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSL
        LFVDK G+YWDVP S+ IDL S   +SG SYHL +HHN+GSP +  S+   + P  LLPGLS K+A + + N+++WR    KL+  +PYD+FLS+P V++
Subjt:  LFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSL

Query:  SGIIGAVATSYFGDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRT
        SGIIG+V T+ FG+ S  S  E   +   G  +    + S   AD     S +AQYG FQ+ F DLTRF  R DF  G +FL+GA  + +DL NS+ P  
Subjt:  SGIIGAVATSYFGDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRT

Query:  ESVKATLPNARFSFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE
        E+ +   P    S QQQI GP SF+ +SG+ IDL + G   + V++  FA+EYAL  L SAKA+  YSPKQ EFMVELRF+E
Subjt:  ESVKATLPNARFSFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE

Arabidopsis top hitse value%identityAlignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown8.0e-6432.63Show/hide
Query:  FWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSH-----GFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLK
        FWD +VS+P+TL+G+A  VP   +  PL  +R  R  R +Q+  +++      +PS  P+       FSL  +   P S++    L+GQF  +K  + + 
Subjt:  FWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSH-----GFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLK

Query:  KSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVPSSLVIDL
        K+      +    +++   +H+ D+SLY+ G+ + I L    +LL+S +  GD + LR K +L H    HDLT+EA  P LF+D  G++WDVP SL +D+
Subjt:  KSGFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVPSSLVIDL

Query:  GSADTDSGLSYHLSMHHNAGSP---SRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQ-------PYDIFLSNPRVSLSGIIGAVATS
         S   +SG+ Y   +H + G+P   + +G E    AP  L+PGL AKAA + K N ++WR   K+    +       PYD+ L  P  ++SGI+G+   +
Subjt:  GSADTDSGLSYHLSMHHNAGSP---SRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQ-------PYDIFLSNPRVSLSGIIGAVATS

Query:  YFGDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNA
        +                 +G+ V   + RS + ADVF S  ++ Q G F + + DLTR   R D       L  A  L + L ++    ++    + P  
Subjt:  YFGDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNA

Query:  RFSFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE
           FQQQ+AGP+ F+ DS   +       G+ ++E+  ++L Y+L  L S K +AWYSPK++E M+ELR +E
Subjt:  RFSFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE

AT3G06960.1 pigment defective 3202.0e-12349.38Show/hide
Query:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  EG   WDLD+STP TL+G+A  VP D   LPLGLSRG RLSR KQ++F  +FMA+P +PS++P           GFSLQRV ++PFS++   
Subjt:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNVQKFVSSLKKS-GFGEMGQS-ISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGY-GDSD-ILRTKAVLHHKFLHHDLTMEALSPG
        +LLGQF+VQ+FV+ + K+  FG    S ++S L  IG+HL+D+SLYA G  S+ LL+PDD LL+S+D Y GD D   R KA+ +H+F  H+LT EA+ PG
Subjt:  TLLGQFNVQKFVSSLKKS-GFGEMGQS-ISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGY-GDSD-ILRTKAVLHHKFLHHDLTMEALSPG

Query:  LFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSL
        LFVDK G+YWDVP S+ IDL S   +SG SYHL +HHN+GSP +  S+   + P  LLPGLS K+A + + N+++WR    KL+  +PYD+FLS+P V++
Subjt:  LFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSL

Query:  SGIIGAVATSYFGDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRT
        SGIIG+V T+ FG+ S  S  E   +   G  +    + S   AD     S +AQYG FQ+ F DLTRF  R DF  G +FL+GA  + +DL NS+ P  
Subjt:  SGIIGAVATSYFGDISAGSAAEGSLQEFKGLYVQASRIRSTVFADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRT

Query:  ESVKATLPNARFSFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE
        E+ +   P    S QQQI GP SF+ +SG+ IDL + G   + V++  FA+EYAL  L SAKA+  YSPKQ EFMVELRF+E
Subjt:  ESVKATLPNARFSFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALEYALYALGSAKAIAWYSPKQREFMVELRFYE

AT3G06960.2 pigment defective 3201.1e-8151.3Show/hide
Query:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  EG   WDLD+STP TL+G+A  VP D   LPLGLSRG RLSR KQ++F  +FMA+P +PS++P           GFSLQRV ++PFS++   
Subjt:  MRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNVQKFVSSLKKS-GFGEMGQS-ISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGY-GDSD-ILRTKAVLHHKFLHHDLTMEALSPG
        +LLGQF+VQ+FV+ + K+  FG    S ++S L  IG+HL+D+SLYA G  S+ LL+PDD LL+S+D Y GD D   R KA+ +H+F  H+LT EA+ PG
Subjt:  TLLGQFNVQKFVSSLKKS-GFGEMGQS-ISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGY-GDSD-ILRTKAVLHHKFLHHDLTMEALSPG

Query:  LFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSL
        LFVDK G+YWDVP S+ IDL S   +SG SYHL +HHN+GSP +  S+   + P  LLPGLS K+A + + N+++WR    KL+  +PYD+FLS+P V++
Subjt:  LFVDKSGKYWDVPSSLVIDLGSADTDSGLSYHLSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSL

Query:  SGIIGAVA
        SGIIG ++
Subjt:  SGIIGAVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAGGAAGCTAAGATGGACTATGGAGGGTCAAAGCTTTTGGGATTTGGATGTTTCAACGCCTAGAACACTCGATGGCTCGGCCTCCCCTGTTCCTACTGATTTGCA
GCTGCTTCCCTTGGGATTGTCCAGAGGTGTTCGGCTTTCCAGGGCCAAGCAGATCGATTTCATGCAGCAGTTCATGGCTGCTCCTTTTGTTCCTTCTTACACCCCTTCCC
ATGGCTTCTCTCTCCAGCGCGTCTTCTCCATCCCCTTTTCGGACTCTGGGTCCGCTACTCTTTTAGGTCAGTTCAATGTGCAGAAATTCGTGTCCTCTCTTAAGAAATCT
GGTTTTGGAGAGATGGGTCAGTCGATTTCCTCATTGCTGCAAGGCATTGGAAGGCACCTCCGCGACCGATCTTTGTATGCGTTTGGTATTTCTTCTGATATCTTGTTAAC
TCCTGATGATGCCCTGTTGATCAGCTTTGATGGATATGGTGACAGTGACATACTTAGAACAAAAGCAGTACTCCACCACAAGTTCTTACATCATGATCTAACAATGGAGG
CGCTTTCTCCAGGGCTTTTTGTAGACAAATCTGGTAAATATTGGGATGTGCCTTCTTCGTTAGTCATTGATCTAGGGTCTGCGGATACCGACTCGGGTCTGAGTTATCAC
TTGTCTATGCACCACAATGCTGGGTCTCCCTCACGATCTGGAAGTGAACAAACCTGTATGGCTCCTTTCTGTTTACTTCCTGGTTTATCAGCCAAGGCTGCGTTTGCCTT
AAAGAAGAATTTGGAAATTTGGAGAAGCAACGCCAAGAAGTTGAAGAGGGTGCAACCATATGACATTTTCCTATCAAATCCTCGCGTTTCATTGTCAGGGATCATTGGTG
CTGTTGCTACTAGCTACTTCGGAGACATTTCGGCTGGATCGGCAGCAGAAGGCAGTCTTCAGGAGTTTAAAGGACTTTACGTGCAGGCTTCTCGAATAAGATCTACTGTT
TTTGCAGATGTGTTTGCTTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGGAACTTTCTGGATCTTACCCGTTTTTCTGGACGTTTCGATTTCCATTCTGGCTC
CAAATTCCTTTCTGGAGCCATGCTCTTGATAGAGGATCTTTCCAATTCCCAGCACCCAAGAACCGAATCCGTGAAAGCGACCTTGCCGAATGCGAGATTTTCCTTCCAGC
AGCAGATTGCTGGACCTGTAAGCTTTAGAGCAGATTCAGGAGTTGCAATAGATTTGAGTAAAGAAGGATGGGGGAGTTTACAAGTGGAGGAGCCTACATTTGCGTTGGAA
TATGCGTTGTACGCCCTTGGTTCAGCTAAAGCCATCGCTTGGTATTCACCAAAGCAAAGAGAATTTATGGTAGAGCTCCGTTTCTATGAGAACTGA
mRNA sequenceShow/hide mRNA sequence
TTGGAATTGGAATCCCGAAATGAGTAGAGGAGGAGGAGGAGGATAAGGTGTGGCGGAATTGCTTCAGAATCAGAACCTTAACCTGGTTGTGTTGTTTGTGAAGGTGTTCA
GGATGATGAGGAAGCTAAGATGGACTATGGAGGGTCAAAGCTTTTGGGATTTGGATGTTTCAACGCCTAGAACACTCGATGGCTCGGCCTCCCCTGTTCCTACTGATTTG
CAGCTGCTTCCCTTGGGATTGTCCAGAGGTGTTCGGCTTTCCAGGGCCAAGCAGATCGATTTCATGCAGCAGTTCATGGCTGCTCCTTTTGTTCCTTCTTACACCCCTTC
CCATGGCTTCTCTCTCCAGCGCGTCTTCTCCATCCCCTTTTCGGACTCTGGGTCCGCTACTCTTTTAGGTCAGTTCAATGTGCAGAAATTCGTGTCCTCTCTTAAGAAAT
CTGGTTTTGGAGAGATGGGTCAGTCGATTTCCTCATTGCTGCAAGGCATTGGAAGGCACCTCCGCGACCGATCTTTGTATGCGTTTGGTATTTCTTCTGATATCTTGTTA
ACTCCTGATGATGCCCTGTTGATCAGCTTTGATGGATATGGTGACAGTGACATACTTAGAACAAAAGCAGTACTCCACCACAAGTTCTTACATCATGATCTAACAATGGA
GGCGCTTTCTCCAGGGCTTTTTGTAGACAAATCTGGTAAATATTGGGATGTGCCTTCTTCGTTAGTCATTGATCTAGGGTCTGCGGATACCGACTCGGGTCTGAGTTATC
ACTTGTCTATGCACCACAATGCTGGGTCTCCCTCACGATCTGGAAGTGAACAAACCTGTATGGCTCCTTTCTGTTTACTTCCTGGTTTATCAGCCAAGGCTGCGTTTGCC
TTAAAGAAGAATTTGGAAATTTGGAGAAGCAACGCCAAGAAGTTGAAGAGGGTGCAACCATATGACATTTTCCTATCAAATCCTCGCGTTTCATTGTCAGGGATCATTGG
TGCTGTTGCTACTAGCTACTTCGGAGACATTTCGGCTGGATCGGCAGCAGAAGGCAGTCTTCAGGAGTTTAAAGGACTTTACGTGCAGGCTTCTCGAATAAGATCTACTG
TTTTTGCAGATGTGTTTGCTTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGGAACTTTCTGGATCTTACCCGTTTTTCTGGACGTTTCGATTTCCATTCTGGC
TCCAAATTCCTTTCTGGAGCCATGCTCTTGATAGAGGATCTTTCCAATTCCCAGCACCCAAGAACCGAATCCGTGAAAGCGACCTTGCCGAATGCGAGATTTTCCTTCCA
GCAGCAGATTGCTGGACCTGTAAGCTTTAGAGCAGATTCAGGAGTTGCAATAGATTTGAGTAAAGAAGGATGGGGGAGTTTACAAGTGGAGGAGCCTACATTTGCGTTGG
AATATGCGTTGTACGCCCTTGGTTCAGCTAAAGCCATCGCTTGGTATTCACCAAAGCAAAGAGAATTTATGGTAGAGCTCCGTTTCTATGAGAACTGA
Protein sequenceShow/hide protein sequence
MMRKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTDLQLLPLGLSRGVRLSRAKQIDFMQQFMAAPFVPSYTPSHGFSLQRVFSIPFSDSGSATLLGQFNVQKFVSSLKKS
GFGEMGQSISSLLQGIGRHLRDRSLYAFGISSDILLTPDDALLISFDGYGDSDILRTKAVLHHKFLHHDLTMEALSPGLFVDKSGKYWDVPSSLVIDLGSADTDSGLSYH
LSMHHNAGSPSRSGSEQTCMAPFCLLPGLSAKAAFALKKNLEIWRSNAKKLKRVQPYDIFLSNPRVSLSGIIGAVATSYFGDISAGSAAEGSLQEFKGLYVQASRIRSTV
FADVFASISFSAQYGMFQRNFLDLTRFSGRFDFHSGSKFLSGAMLLIEDLSNSQHPRTESVKATLPNARFSFQQQIAGPVSFRADSGVAIDLSKEGWGSLQVEEPTFALE
YALYALGSAKAIAWYSPKQREFMVELRFYEN