; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031711 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031711
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome locationchr11:12538863..12543120
RNA-Seq ExpressionLag0031711
SyntenyLag0031711
Gene Ontology termsGO:0001522 - pseudouridine synthesis (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575159.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.9e-22586.14Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        M+KLRWTMEGQSFWDLDVSTPRTLDGSASPVPT   LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPSY PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KF+SSLKKSG GEM QS SSLLQ IGRHL  RSLYA GISSDILLTPDD+LLISFDGYGDS++LRTKAV HHKFLHHDLTMEALSPGLFVDKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVID+GSA   SG SYHLSMHHNAGSPS+SGSEQT  APFCLLPGLS KAAF+ KKN+EIWRSNAKKLK VQPYDIFLSNP VSLSGIIGAVAT+YFG
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D S  SAA+ SLQEFKGLY+Q +RI+STVFADVFASISFSAQYGMFQR FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGP+SFRADSGV IDL++  WG +QVEEPTFALE+AL  LGSAKAIAWYSPK REFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

XP_022959177.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita moschata]4.5e-22786.57Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT   LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPS+ PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KF+SSLKKSG GEM QS SSLLQ IGRHL  RSLYA GISSDILLTPDD+LLISFDGYGDS++LRTKAV HHKFLHHDLTMEALSPGLFVDKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVID+GSA   SG SYHLSMHHNAGSPSQSGSEQT  APFCLLPGLS KAAF+ KKN+EIWRSNAKKLK VQPYDIFL+NPHVSLSGIIGAVAT+YFG
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D S  SAA+ SLQEF+GLYMQT+RI+STVFADVFASISFSAQYGMFQR FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGP+SFRADSGV IDL++  WG +QVEEPTFALE+AL  LGSAKAIAWYSPK REFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

XP_023006570.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita maxima]2.9e-22686.78Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT   LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPSY PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KF+SSLKKSG GEM QS SSLLQ IGRHL  RSLYA GISSDILLTPDD+LLISFDGYGDS+VLRTKAV HHKFLHHDLTMEALSPGLFVDKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVID+GSAA  SG SYHLSMHHNAGSPSQSGSEQT  APFCLLPGLS KAAF+ KKN+EIWRSNAKKLK VQPYDIFLSNPHVSLSGIIGAVAT+YF 
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D SV SAA+ SLQEFKGL+MQT+RI+STVFADVFASISFSAQYGMFQ  FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGP+SFRAD+GV IDL++  WG +QVEEPTFALE+AL  LGSAKAIAWYSPK  EFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

XP_023548884.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo]3.7e-22987.42Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT   LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPSY PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KF+SSLKKSG GEM QS SSLLQ IGRHL  RSLYA GISSDILLTPDD+LLISFDGYGDS++LRTKAV HHKFLHHDLTMEALSPGLFVDKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVID+GSAA  SG SYHLSMHHNAGSPSQSGSEQT  APFCLLPGLS KAAF+ KKN+EIWRSNAKKLK VQPYDIFLSNPHVSLSGIIGAVAT+YFG
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D S  SAA+ SLQEFKGLYMQT+RI+STVFADVFASISFSAQYGMFQR FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGP+SFRAD+GV IDL++  WG +QVEEPTFALE+AL VLGSAKAIAWYSPK REFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

XP_038875869.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]8.2e-22985.93Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT--GLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRW M+GQ FWDLDVSTPRTLDGSASPVP+   LLPLGLSRGVRLSRAKQIDFMQ FMAAPFVPSY+PSHGFSLQRVF+IPFSDSGS T LGQFNLQ
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT--GLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KF+SSLKKSG G+M QS SS LQ IGRHLC RSLYALGISSDILL PDDSL+ISFDGYGD+E++RTKAVFHHKFLHHDLTMEA SPGLFVDKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        S+LV+D+GSAA  SG SYHLSMH N GSPSQSGSEQ R++P CLLPGLS KAAF+FKKN+EIWRSNAKKLKMVQPYDIFLS PHVSLSGIIGAVATTYFG
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        DNS+RSAAQ+SL EFKGLY+QT+RI+STVFADVFASISFSAQYGMFQR +LDLT  SARMDFHSGSKFLSGAMLLI+DLSNSRHPRTESVRATLP+ARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
        +QQQIAGP+SFRADSGV IDLN+  WG + V+EPTFALE+ALQVLGSAKAIAWYSPKHREFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

TrEMBL top hitse value%identityAlignment
A0A0A0K824 Uncharacterized protein1.4e-21882.09Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT--GLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRW M+GQ FWDLDVST RTLDGSASPVP+   LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSY+PSHGFSLQRVF++PFSDSGS T LGQFNLQ
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT--GLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KFMSSL K+GSGEM QS SSLLQ IGRHL QRSLYA+GIS+DILL PDDSL+ISFDGYGDS+++RTKAVFH KFLHHDLT+EALSPGLF++K G YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLV+D+GS A  SG SYHLSMH NAG PSQ GSE T +APFCLLPGLS KAAF+FKKN EIWRSNAKKLKMVQPYDIFLS PHVSLS IIGAVAT+YFG
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D+  RSAAQ+SL++FKG YM+++RI+STVFAD+F SISFSAQYGMFQ+ +LDLTR SA MDFHSGSKFLSG+MLLI+DLSNSRHP+TESV+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
        IQQQIAGP+SFRAD+GV IDLN+  W  ++VEEPTFALE+AL VLGSAKAIAWYSPKHREFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

A0A1S3C837 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.3e-21682.09Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT--GLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRW M+G  FWDLDVST RTLDGSASPVP+   LLPLGLSRGVRLSRAKQIDFMQ FM APFVPSY+PSHGFSLQRVF+IPFSDSGS T LGQFNLQ
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT--GLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KFMSSL K+GSGEM QS SS +Q IGRHL QRSLYA+GIS+DILL PDDSL+ISFDGYGDS+++RTKAVFH KFLHHDLTMEALSPGLF+DKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLV+D+GSAA  SG SYHLSMH N G PS  GSE T +APFCL PGLS KAAF+FKKN EIWRSNAKKLKMVQPYDIFLS PHVSLS IIGAVAT+YFG
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D+ VRSAAQ SL EFKG YMQT+RI+ST+FAD+F SISFSAQYGMFQ+ +LDLTR SA MDFHSGSKFLSG+MLLI+DLSNSRHP+TE+V+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
        IQQQIAGP+SFRADSGV IDLN+  W  ++V+EPTFALE+ALQVLGSAKAIAWYSPKHREFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

A0A5D3BY40 Protein TRIGALACTOSYLDIACYLGLYCEROL 41.3e-21682.09Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT--GLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRW M+G  FWDLDVST RTLDGSASPVP+   LLPLGLSRGVRLSRAKQIDFMQ FM APFVPSY+PSHGFSLQRVF+IPFSDSGS T LGQFNLQ
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT--GLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KFMSSL K+GSGEM QS SS +Q IGRHL QRSLYA+GIS+DILL PDDSL+ISFDGYGDS+++RTKAVFH KFLHHDLTMEALSPGLF+DKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLV+D+GSAA  SG SYHLSMH N G PS  GSE T +APFCL PGLS KAAF+FKKN EIWRSNAKKLKMVQPYDIFLS PHVSLS IIGAVAT+YFG
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D+ VRSAAQ SL EFKG YMQT+RI+ST+FAD+F SISFSAQYGMFQ+ +LDLTR SA MDFHSGSKFLSG+MLLI+DLSNSRHP+TE+V+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
        IQQQIAGP+SFRADSGV IDLN+  W  ++V+EPTFALE+ALQVLGSAKAIAWYSPKHREFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

A0A6J1H3U0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.2e-22786.57Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT   LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPS+ PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KF+SSLKKSG GEM QS SSLLQ IGRHL  RSLYA GISSDILLTPDD+LLISFDGYGDS++LRTKAV HHKFLHHDLTMEALSPGLFVDKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVID+GSA   SG SYHLSMHHNAGSPSQSGSEQT  APFCLLPGLS KAAF+ KKN+EIWRSNAKKLK VQPYDIFL+NPHVSLSGIIGAVAT+YFG
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D S  SAA+ SLQEF+GLYMQT+RI+STVFADVFASISFSAQYGMFQR FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGP+SFRADSGV IDL++  WG +QVEEPTFALE+AL  LGSAKAIAWYSPK REFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

A0A6J1KW75 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.4e-22686.78Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPT   LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPSY PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTG--LLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP
        KF+SSLKKSG GEM QS SSLLQ IGRHL  RSLYA GISSDILLTPDD+LLISFDGYGDS+VLRTKAV HHKFLHHDLTMEALSPGLFVDKSG YWDVP
Subjt:  KFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVP

Query:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG
        SSLVID+GSAA  SG SYHLSMHHNAGSPSQSGSEQT  APFCLLPGLS KAAF+ KKN+EIWRSNAKKLK VQPYDIFLSNPHVSLSGIIGAVAT+YF 
Subjt:  SSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLSGIIGAVATTYFG

Query:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D SV SAA+ SLQEFKGL+MQT+RI+STVFADVFASISFSAQYGMFQ  FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGP+SFRAD+GV IDL++  WG +QVEEPTFALE+AL  LGSAKAIAWYSPK  EFMVELRFYE
Subjt:  IQQQIAGPISFRADSGVTIDLNEPWWG-IQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

SwissProt top hitse value%identityAlignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.1e-12749.79Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF
        M ++RW  EG   WDLD+STP TL+G+A  VP   LPLGLSRG RLSR KQ++F  RFMA+P +PS++P           GFSLQRV T+PFS++   + 
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF

Query:  LGQFNLQKFMSSLKKS---GSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGY-GD-SEVLRTKAVFHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ + K+   G G    + +S L  IG+HL  +SLYALG  S+ LL+PDD+LL+S+D Y GD  +  R KA+F+H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFMSSLKKS---GSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGY-GD-SEVLRTKAVFHHKFLHHDLTMEALSPGL

Query:  FVDKSGNYWDVPSSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK G YWDVP S+ ID+ S    SGPSYHL +HHN+GSP +  S+     P  LLPGLS+K+A S++ N+++WR    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKSGNYWDVPSSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLS

Query:  GIIGAVATTYFGDNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE
        GIIG+V T  FG+NS+RS  +N  +   G  +    + S   AD     S +AQYG FQ+ F DLTR  AR+DF  G +FL+GA  + +DL NSR P  E
Subjt:  GIIGAVATTYFGDNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE

Query:  SVRATLPNARFSIQQQIAGPISFRADSGVTIDLNEPWWGIQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
        + +   P    S+QQQI GP SF+ +SG+ IDL      + V++  FA+E+ALQVL SAKA+  YSPK  EFMVELRF+E
Subjt:  SVRATLPNARFSIQQQIAGPISFRADSGVTIDLNEPWWGIQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

Arabidopsis top hitse value%identityAlignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown8.9e-6432.62Show/hide
Query:  FWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSH-----GFSLQRVFTIPFSDSGSATFLGQFNLQKFMSSLKKS
        FWD +VS+P+TL+G+A  VP    PL  +R  R  R +Q+  ++       +PS AP+       FSL  +   P S++     +GQF  +K  + +K  
Subjt:  FWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAPSH-----GFSLQRVFTIPFSDSGSATFLGQFNLQKFMSSLKKS

Query:  GSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVPSSLVIDVGS
         S    +    +++   +H+  +SLY++G+ + I L    SLL+S +  GD   LR K +  H    HDLT+EA  P LF+D  G +WDVP SL +DV S
Subjt:  GSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVLRTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVPSSLVIDVGS

Query:  AAFGSGPSYHLSMHHNAGSP---SQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKK-------LKMVQPYDIFLSNPHVSLSGIIGAVATTYF
            SG  Y   +H + G+P   + +G E    AP  L+PGL  KAA S+K N ++WR   K+         +  PYD+ L  PH ++SGI+G+    + 
Subjt:  AAFGSGPSYHLSMHHNAGSP---SQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKK-------LKMVQPYDIFLSNPHVSLSGIIGAVATTYF

Query:  GDNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARF
            +               +   + +S + ADVF S  ++ Q G F + + DLTRV AR+D       L  A  L + L ++    ++    + P    
Subjt:  GDNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARF

Query:  SIQQQIAGPISFRADSGVTIDLNEPWWGIQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
          QQQ+AGPI F+ DS   +         ++E+  ++L ++L++L S K +AWYSPK +E M+ELR +E
Subjt:  SIQQQIAGPISFRADSGVTIDLNEPWWGIQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

AT3G06960.1 pigment defective 3207.9e-12949.79Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF
        M ++RW  EG   WDLD+STP TL+G+A  VP   LPLGLSRG RLSR KQ++F  RFMA+P +PS++P           GFSLQRV T+PFS++   + 
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF

Query:  LGQFNLQKFMSSLKKS---GSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGY-GD-SEVLRTKAVFHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ + K+   G G    + +S L  IG+HL  +SLYALG  S+ LL+PDD+LL+S+D Y GD  +  R KA+F+H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFMSSLKKS---GSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGY-GD-SEVLRTKAVFHHKFLHHDLTMEALSPGL

Query:  FVDKSGNYWDVPSSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK G YWDVP S+ ID+ S    SGPSYHL +HHN+GSP +  S+     P  LLPGLS+K+A S++ N+++WR    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKSGNYWDVPSSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLS

Query:  GIIGAVATTYFGDNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE
        GIIG+V T  FG+NS+RS  +N  +   G  +    + S   AD     S +AQYG FQ+ F DLTR  AR+DF  G +FL+GA  + +DL NSR P  E
Subjt:  GIIGAVATTYFGDNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE

Query:  SVRATLPNARFSIQQQIAGPISFRADSGVTIDLNEPWWGIQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE
        + +   P    S+QQQI GP SF+ +SG+ IDL      + V++  FA+E+ALQVL SAKA+  YSPK  EFMVELRF+E
Subjt:  SVRATLPNARFSIQQQIAGPISFRADSGVTIDLNEPWWGIQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYE

AT3G06960.2 pigment defective 3205.0e-8351.14Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF
        M ++RW  EG   WDLD+STP TL+G+A  VP   LPLGLSRG RLSR KQ++F  RFMA+P +PS++P           GFSLQRV T+PFS++   + 
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF

Query:  LGQFNLQKFMSSLKKS---GSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGY-GD-SEVLRTKAVFHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ + K+   G G    + +S L  IG+HL  +SLYALG  S+ LL+PDD+LL+S+D Y GD  +  R KA+F+H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFMSSLKKS---GSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGY-GD-SEVLRTKAVFHHKFLHHDLTMEALSPGL

Query:  FVDKSGNYWDVPSSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK G YWDVP S+ ID+ S    SGPSYHL +HHN+GSP +  S+     P  LLPGLS+K+A S++ N+++WR    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKSGNYWDVPSSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQPYDIFLSNPHVSLS

Query:  GIIGAVA
        GIIG ++
Subjt:  GIIGAVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAGCAATAGCAGGTTCAGTCAGAAGTCACAAGACAACACAGCAGAAGAACAACGGAGACCCCTCGAGGAAGTTATAGTTCATAGAATAGGGTTTAGCATTTCGAC
AATTCGATTCGAGCTGCTAATTGGTTTAGGGTTTAGAGTTGTTCTGCTCGTGAAGGTTGGAGGGATGAAGAAGCTAAGATGGACAATGGAGGGGCAAAGCTTTTGGGATC
TCGATGTTTCAACGCCTAGAACGCTGGATGGGTCGGCCTCCCCTGTTCCCACTGGCTTACTTCCCTTGGGATTGTCCAGAGGCGTCAGGCTTTCCAGGGCCAAGCAGATC
GATTTCATGCAGCGCTTCATGGCTGCACCTTTTGTCCCTTCTTATGCCCCTTCCCATGGCTTCTCTCTCCAGCGCGTCTTTACCATCCCCTTTTCCGACTCCGGGTCCGC
CACTTTTCTAGGTCAGTTCAATTTGCAGAAGTTCATGTCCTCTCTTAAGAAATCTGGTTCCGGAGAGATGCCTCAGTCGGCGTCCTCATTGCTGCAACGCATTGGAAGGC
ACCTCTGCCAACGGTCTTTGTATGCCCTTGGTATCTCTTCTGATATCTTGTTAACTCCTGATGATTCGCTGTTGATCAGCTTCGACGGATATGGCGACAGTGAAGTACTT
CGAACAAAAGCAGTGTTCCATCATAAGTTCCTACATCATGATCTGACAATGGAGGCACTTTCTCCAGGACTTTTTGTGGACAAATCTGGTAACTACTGGGATGTGCCTTC
TTCATTAGTCATTGATGTAGGTTCTGCTGCTTTTGGCTCGGGTCCAAGTTATCACTTGTCCATGCACCACAATGCCGGGTCTCCCTCACAATCTGGAAGTGAACAAACCC
GTGCGGCCCCTTTCTGTCTGCTTCCTGGTCTTTCAATGAAGGCTGCTTTTTCCTTTAAGAAGAACATTGAAATCTGGAGAAGCAACGCCAAGAAGTTAAAGATGGTGCAA
CCGTATGACATTTTCCTATCAAATCCTCATGTTTCATTGTCAGGGATCATTGGTGCTGTTGCTACTACCTACTTTGGAGACAATTCAGTGAGGTCAGCAGCACAGAACAG
TCTTCAGGAATTTAAAGGACTTTACATGCAGACTGCTAGAATAAAATCTACTGTTTTTGCGGATGTATTCGCTTCCATTTCTTTTTCTGCTCAGTATGGGATGTTTCAAA
GGCCATTTCTGGATCTTACCCGTGTTTCTGCACGTATGGATTTCCATTCTGGCTCCAAGTTCCTTTCGGGAGCCATGCTTTTGATAGAAGATCTTTCCAATTCCCGGCAC
CCAAGAACTGAATCTGTGAGAGCGACCTTGCCTAATGCTAGATTTTCCATTCAGCAGCAGATCGCTGGACCCATCAGCTTTAGAGCAGATTCAGGAGTTACAATAGATTT
GAATGAACCATGGTGGGGTATACAAGTGGAGGAGCCTACATTTGCCTTAGAGCATGCGCTGCAGGTCCTTGGTTCAGCTAAAGCCATTGCTTGGTATTCACCAAAGCACA
GAGAATTTATGGTAGAGCTTCGTTTCTACGAGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTAGCAATAGCAGGTTCAGTCAGAAGTCACAAGACAACACAGCAGAAGAACAACGGAGACCCCTCGAGGAAGTTATAGTTCATAGAATAGGGTTTAGCATTTCGAC
AATTCGATTCGAGCTGCTAATTGGTTTAGGGTTTAGAGTTGTTCTGCTCGTGAAGGTTGGAGGGATGAAGAAGCTAAGATGGACAATGGAGGGGCAAAGCTTTTGGGATC
TCGATGTTTCAACGCCTAGAACGCTGGATGGGTCGGCCTCCCCTGTTCCCACTGGCTTACTTCCCTTGGGATTGTCCAGAGGCGTCAGGCTTTCCAGGGCCAAGCAGATC
GATTTCATGCAGCGCTTCATGGCTGCACCTTTTGTCCCTTCTTATGCCCCTTCCCATGGCTTCTCTCTCCAGCGCGTCTTTACCATCCCCTTTTCCGACTCCGGGTCCGC
CACTTTTCTAGGTCAGTTCAATTTGCAGAAGTTCATGTCCTCTCTTAAGAAATCTGGTTCCGGAGAGATGCCTCAGTCGGCGTCCTCATTGCTGCAACGCATTGGAAGGC
ACCTCTGCCAACGGTCTTTGTATGCCCTTGGTATCTCTTCTGATATCTTGTTAACTCCTGATGATTCGCTGTTGATCAGCTTCGACGGATATGGCGACAGTGAAGTACTT
CGAACAAAAGCAGTGTTCCATCATAAGTTCCTACATCATGATCTGACAATGGAGGCACTTTCTCCAGGACTTTTTGTGGACAAATCTGGTAACTACTGGGATGTGCCTTC
TTCATTAGTCATTGATGTAGGTTCTGCTGCTTTTGGCTCGGGTCCAAGTTATCACTTGTCCATGCACCACAATGCCGGGTCTCCCTCACAATCTGGAAGTGAACAAACCC
GTGCGGCCCCTTTCTGTCTGCTTCCTGGTCTTTCAATGAAGGCTGCTTTTTCCTTTAAGAAGAACATTGAAATCTGGAGAAGCAACGCCAAGAAGTTAAAGATGGTGCAA
CCGTATGACATTTTCCTATCAAATCCTCATGTTTCATTGTCAGGGATCATTGGTGCTGTTGCTACTACCTACTTTGGAGACAATTCAGTGAGGTCAGCAGCACAGAACAG
TCTTCAGGAATTTAAAGGACTTTACATGCAGACTGCTAGAATAAAATCTACTGTTTTTGCGGATGTATTCGCTTCCATTTCTTTTTCTGCTCAGTATGGGATGTTTCAAA
GGCCATTTCTGGATCTTACCCGTGTTTCTGCACGTATGGATTTCCATTCTGGCTCCAAGTTCCTTTCGGGAGCCATGCTTTTGATAGAAGATCTTTCCAATTCCCGGCAC
CCAAGAACTGAATCTGTGAGAGCGACCTTGCCTAATGCTAGATTTTCCATTCAGCAGCAGATCGCTGGACCCATCAGCTTTAGAGCAGATTCAGGAGTTACAATAGATTT
GAATGAACCATGGTGGGGTATACAAGTGGAGGAGCCTACATTTGCCTTAGAGCATGCGCTGCAGGTCCTTGGTTCAGCTAAAGCCATTGCTTGGTATTCACCAAAGCACA
GAGAATTTATGGTAGAGCTTCGTTTCTACGAGAAGTGA
Protein sequenceShow/hide protein sequence
MGSNSRFSQKSQDNTAEEQRRPLEEVIVHRIGFSISTIRFELLIGLGFRVVLLVKVGGMKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPTGLLPLGLSRGVRLSRAKQI
DFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQKFMSSLKKSGSGEMPQSASSLLQRIGRHLCQRSLYALGISSDILLTPDDSLLISFDGYGDSEVL
RTKAVFHHKFLHHDLTMEALSPGLFVDKSGNYWDVPSSLVIDVGSAAFGSGPSYHLSMHHNAGSPSQSGSEQTRAAPFCLLPGLSMKAAFSFKKNIEIWRSNAKKLKMVQ
PYDIFLSNPHVSLSGIIGAVATTYFGDNSVRSAAQNSLQEFKGLYMQTARIKSTVFADVFASISFSAQYGMFQRPFLDLTRVSARMDFHSGSKFLSGAMLLIEDLSNSRH
PRTESVRATLPNARFSIQQQIAGPISFRADSGVTIDLNEPWWGIQVEEPTFALEHALQVLGSAKAIAWYSPKHREFMVELRFYEK