CuGenDBv2

Cucsat.G5658 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G5658
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome location: ctg1402:197841..201057
RNA-Seq Expression: Cucsat.G5658
Synteny: Cucsat.G5658
Gene Ontology terms:
GO:0001522 - pseudouridine synthesis (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
GO:0070300 - phosphatidic acid binding (molecular function)
GO:0106029 - tRNA pseudouridine synthase activity (molecular function)
InterPro domains: IPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like
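
The genome location above uses a `contig:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the span of the gene on the contig can be computed as follows. The function names `parse_location` and `span_length` are illustrative and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("ctg1402:197841..201057")
print(contig, span_length(start, end))
```

Under that assumption the gene spans 3217 bp of ctg1402, which is longer than the CDS listed further down the record.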


Homology
GenBank top hits (hit | e value | % identity, each followed by its alignment)
TYK02989.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa] | e value: 9.61e-316 | % identity: 93.62
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRWAMDG  FWDLDVSTSRTLDGSASPVPSP HLLPLGLSRGVRLSRAKQIDFMQ FM APFVPSYSPSHGFSLQRVFS+PFSDSGSITLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KFMSSLMKTGSGEM QS++S +Q IGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT+EALSPGLFM+K GRYWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLVVDLGS ASDSGLSYHLSMHQN GFPS LGSEPTHSAPFCL PGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        DDL RSAAQ SL +FKGFYM++SRIRST+FADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTE+VKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
        IQQQIAGPVSFRAD+GVAIDLNKAGWDLLRV+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

XP_004138517.2 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic isoform X1 [Cucumis sativus] | e value: 0.0 | % identity: 99.79
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KFMSSLMKTGSGEMCQSY+SLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
        IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

XP_008458282.1 PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo] | e value: 0.0 | % identity: 93.62
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRWAMDG  FWDLDVSTSRTLDGSASPVPSP HLLPLGLSRGVRLSRAKQIDFMQ FM APFVPSYSPSHGFSLQRVFS+PFSDSGSITLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KFMSSLMKTGSGEM QS++S +Q IGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT+EALSPGLFM+K GRYWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLVVDLGS ASDSGLSYHLSMHQN GFPS LGSEPTHSAPFCL PGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        DDL RSAAQ SL +FKGFYM++SRIRST+FADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTE+VKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
        IQQQIAGPVSFRAD+GVAIDLNKAGWDLLRV+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

XP_023548884.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo] | e value: 9.89e-283 | % identity: 83.4
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPSY+PSHGFSLQRVFS+PFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KF+SSL K+G GEM QS +SLLQ IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLT+EALSPGLF++K G+YWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLV+DLGS A+DSGLSYHLSMH NAG PSQ GSE T  APFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFLS PHVSLS IIGAVATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        D  A SAA+ SL++FKG YM++SRIRSTVFAD+F SISFSAQYGMFQ+ +LDLTRFS   DFHSGSKFLSG+MLLI+DLSNS+HP+TESVKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
         QQQIAGPVSFRADTGVAIDL+KAGW  L+VEEPTFALEYALHVLGSAKAIAWYSPK REFMVELRFYE 
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

XP_038875869.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida] | e value: 5.31e-294 | % identity: 85.96
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRWAMDGQGFWDLDVST RTLDGSASPVPS LHLLPLGLSRGVRLSRAKQIDFMQ FMAAPFVPSYSPSHGFSLQRVFS+PFSDSGS+TLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KF+SSL K+G G+M QS +S LQ IGRHL  RSLYA+GIS+DILLPPDDSLMISFDGYGD++IVRTKAVFH KFLHHDLT+EA SPGLF++K G+YWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        S+LVVDLGS AS+SGLSYHLSMHQN G PSQ GSE   S+P CLLPGLSAKAAFAFKKN EIWRSNAKKLKMVQPYDIFLSTPHVSLS IIGAVAT+YFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        D+  RSAAQDSL +FKG Y+++SRIRSTVFAD+F SISFSAQYGMFQ+KYLDLT FSA MDFHSGSKFLSG+MLLIDDLSNSRHP+TESV+ATLP+ARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRAD+GVAIDLNKAGW LL V+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

TrEMBL top hits (hit | e value | % identity, each followed by its alignment)
A0A0A0K824 Uncharacterized protein | e value: 0.0 | % identity: 99.79
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KFMSSLMKTGSGEMCQSY+SLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
        IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

A0A1S3C837 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | e value: 0.0 | % identity: 93.62
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRWAMDG  FWDLDVSTSRTLDGSASPVPSP HLLPLGLSRGVRLSRAKQIDFMQ FM APFVPSYSPSHGFSLQRVFS+PFSDSGSITLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KFMSSLMKTGSGEM QS++S +Q IGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT+EALSPGLFM+K GRYWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLVVDLGS ASDSGLSYHLSMHQN GFPS LGSEPTHSAPFCL PGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        DDL RSAAQ SL +FKGFYM++SRIRST+FADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTE+VKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
        IQQQIAGPVSFRAD+GVAIDLNKAGWDLLRV+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

A0A5D3BY40 Protein TRIGALACTOSYLDIACYLGLYCEROL 4 | e value: 4.65e-316 | % identity: 93.62
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRWAMDG  FWDLDVSTSRTLDGSASPVPSP HLLPLGLSRGVRLSRAKQIDFMQ FM APFVPSYSPSHGFSLQRVFS+PFSDSGSITLLGQFNLQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KFMSSLMKTGSGEM QS++S +Q IGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT+EALSPGLFM+K GRYWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLVVDLGS ASDSGLSYHLSMHQN GFPS LGSEPTHSAPFCL PGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        DDL RSAAQ SL +FKGFYM++SRIRST+FADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTE+VKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
        IQQQIAGPVSFRAD+GVAIDLNKAGWDLLRV+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

A0A6J1H3U0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | e value: 1.77e-278 | % identity: 81.91
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPS++PSHGFSLQRVFS+PFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KF+SSL K+G GEM QS +SLLQ IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLT+EALSPGLF++K G+YWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLV+DLGS  +DSGLSYHLSMH NAG PSQ GSE T  APFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFL+ PHVSLS IIGAVATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        D  A SAA+ SL++F+G YM++SRIRSTVFAD+F SISFSAQYGMFQ+ +LDLTRFS   DFHSGSKFLSG+MLLI+DLSNS+HP+TESVKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
         QQQIAGPVSFRAD+GVAIDL+KAGW  L+VEEPTFALEYAL+ LGSAKAIAWYSPK REFMVELRFYE 
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

A0A6J1KW75 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | e value: 1.45e-277 | % identity: 81.91
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ
        MKKLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPSY+PSHGFSLQRVFS+PFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQ

Query:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP
        KF+SSL K+G GEM QS +SLLQ IGRHL  RSLYA GIS+DILL PDD+L+ISFDGYGDSD++RTKAV H KFLHHDLT+EALSPGLF++K G+YWDVP
Subjt:  KFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG
        SSLV+DLGS A+DSGLSYHLSMH NAG PSQ GSE T  APFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFLS PHVSLS IIGAVATSYF 
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFG

Query:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS
        D    SAA+ SL++FKG +M++SRIRSTVFAD+F SISFSAQYGMFQ  +LDLTRFS   DFHSGSKFLSG+MLLI+DLSNS+HP+TESVKATLPNARFS
Subjt:  DDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFS

Query:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
         QQQIAGPVSFRADTGVAIDL+KAGW  L+VEEPTFALEYAL+ LGSAKAIAWYSPK  EFMVELRFYE 
Subjt:  IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

SwissProt top hits (hit | e value | % identity, each followed by its alignment)
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | e value: 6.5e-119 | % identity: 47.31
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSP---------SHGFSLQRVFSVPFSDSGSI
        M ++RW  +G   WDLD+ST  TL+G+A  VP     LPLGLSRG RLSR KQ++F  RFMA+P +PS+SP           GFSLQRV ++PFS++  +
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSP---------SHGFSLQRVFSVPFSDSGSI

Query:  TLLGQFNLQKFMSSLMKT---GSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSP
        +LLGQF++Q+F++ + KT   G G    +  S L  IG+HL  +SLYA+G  ++ LL PDD+L++S+D Y GD D   R KA+F+ +F  H+LT EA+ P
Subjt:  TLLGQFNLQKFMSSLMKT---GSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSP

Query:  GLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVS
        GLF++K G YWDVP S+ +DL S+ ++SG SYHL +H N+G P +L S+     P  LLPGLS K+A +++ N ++WR    KL+  +PYD+FLS+PHV+
Subjt:  GLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVS

Query:  LSAIIGAVATSYFGDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPK
        +S IIG+V T+ FG++  RS  ++  E   GF +    + S   AD     S +AQYG FQK + DLTRF A +DF  G +FL+G+  +  DL NSR P 
Subjt:  LSAIIGAVATSYFGDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPK

Query:  TESVKATLPNARFSIQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
         E+ +   P    S+QQQI GP SF+ ++G+ IDL + G + + V++  FA+EYAL VL SAKA+  YSPK  EFMVELRF+ET
Subjt:  TESVKATLPNARFSIQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

Arabidopsis top hits (hit | e value | % identity, each followed by its alignment)
AT2G44640.1 FUNCTIONS IN: molecular_function unknown | e value: 1.1e-60 | % identity: 31.89
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSH-----GFSLQRVFSVPFSDSGSITLLG
        M  L  A+D   FWD +VS+ +TL+G+A  VP      PL  +R  R  R +Q+  ++       +PS +P+       FSL  +   P S++  + L+G
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSH-----GFSLQRVFSVPFSDSGSITLLG

Query:  QFNLQKFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGR
        QF  +K  + + K       +    +++   +H+  +SLY++G+   I L    SL++S +  GD + +R K +       HDLTVEA  P LF++  GR
Subjt:  QFNLQKFMSSLMKTGSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGR

Query:  YWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQL---GSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKK-------LKMVQPYDIFLSTPHV
        +WDVP SL VD+ S+  +SG+ Y   +H++ G P  +   G E    AP  L+PGL AKAA ++K N ++WR   K+         +  PYD+ L  PH 
Subjt:  YWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQL---GSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKK-------LKMVQPYDIFLSTPHV

Query:  SLSAIIGAVATSYFGDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHP
        ++S I+G+   ++                 +G  +   + RS + AD+F S  ++ Q G F K Y DLTR  A +D  S   F     L         H 
Subjt:  SLSAIIGAVATSYFGDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHP

Query:  KTESVKATL--PNARFSIQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYE
         + +   TL  P      QQQ+AGP+ F+ D+   +          R+E+  ++L Y+L +L S K +AWYSPK +E M+ELR +E
Subjt:  KTESVKATL--PNARFSIQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYE

AT3G06960.1 pigment defective 320 | e value: 4.6e-120 | % identity: 47.31
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSP---------SHGFSLQRVFSVPFSDSGSI
        M ++RW  +G   WDLD+ST  TL+G+A  VP     LPLGLSRG RLSR KQ++F  RFMA+P +PS+SP           GFSLQRV ++PFS++  +
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSP---------SHGFSLQRVFSVPFSDSGSI

Query:  TLLGQFNLQKFMSSLMKT---GSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSP
        +LLGQF++Q+F++ + KT   G G    +  S L  IG+HL  +SLYA+G  ++ LL PDD+L++S+D Y GD D   R KA+F+ +F  H+LT EA+ P
Subjt:  TLLGQFNLQKFMSSLMKT---GSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSP

Query:  GLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVS
        GLF++K G YWDVP S+ +DL S+ ++SG SYHL +H N+G P +L S+     P  LLPGLS K+A +++ N ++WR    KL+  +PYD+FLS+PHV+
Subjt:  GLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVS

Query:  LSAIIGAVATSYFGDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPK
        +S IIG+V T+ FG++  RS  ++  E   GF +    + S   AD     S +AQYG FQK + DLTRF A +DF  G +FL+G+  +  DL NSR P 
Subjt:  LSAIIGAVATSYFGDDLARSAAQDSLEKFKGFYMKSSRIRSTVFADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPK

Query:  TESVKATLPNARFSIQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET
         E+ +   P    S+QQQI GP SF+ ++G+ IDL + G + + V++  FA+EYAL VL SAKA+  YSPK  EFMVELRF+ET
Subjt:  TESVKATLPNARFSIQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET

AT3G06960.2 pigment defective 320 | e value: 4.8e-77 | % identity: 47.9
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSP---------SHGFSLQRVFSVPFSDSGSI
        M ++RW  +G   WDLD+ST  TL+G+A  VP     LPLGLSRG RLSR KQ++F  RFMA+P +PS+SP           GFSLQRV ++PFS++  +
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSP---------SHGFSLQRVFSVPFSDSGSI

Query:  TLLGQFNLQKFMSSLMKT---GSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSP
        +LLGQF++Q+F++ + KT   G G    +  S L  IG+HL  +SLYA+G  ++ LL PDD+L++S+D Y GD D   R KA+F+ +F  H+LT EA+ P
Subjt:  TLLGQFNLQKFMSSLMKT---GSGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSP

Query:  GLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVS
        GLF++K G YWDVP S+ +DL S+ ++SG SYHL +H N+G P +L S+     P  LLPGLS K+A +++ N ++WR    KL+  +PYD+FLS+PHV+
Subjt:  GLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVS

Query:  LSAIIGAVA
        +S IIG ++
Subjt:  LSAIIGAVA
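
In the alignment blocks above, the unlabelled middle line appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, the identity of one block can be recomputed from its Query line and middle line. The helper name `identity_stats` is hypothetical, and the strings are the final 70-residue block of the first GenBank hit.

```python
def identity_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    Assumes the BLAST-style midline convention: a letter equal to the query
    residue is an identity, '+' is a conservative substitution, and a space
    is a mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "IQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEYALHVLGSAKAIAWYSPKHREFMVELRFYET"
midline = "IQQQIAGPVSFRAD+GVAIDLNKAGWDLLRV+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET"

identities, positives, length = identity_stats(query, midline)
print(f"{identities}/{length} identical ({100 * identities / length:.2f}%)")
```

Summing identities and lengths over all blocks of a hit should give a value comparable to the reported % identity. For instance, 469 identical positions out of 470 corresponds to 99.79, although the exact denominator the database uses is not stated on this page.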


Sequences
CDS sequence
ATGAAGAAGCTAAGATGGGCAATGGACGGGCAAGGCTTCTGGGATCTCGATGTTTCAACGTCTAGAACACTCGATGGCTCTGCTTCCCCGGTTCCTTCTCCTTTACACTT
ACTTCCTTTGGGATTATCCAGAGGCGTTCGTCTTTCCAGAGCCAAGCAGATCGATTTTATGCAGCGTTTCATGGCTGCTCCTTTTGTTCCTTCTTATTCCCCCTCCCATG
GCTTCTCTCTTCAACGCGTCTTCTCCGTACCCTTTTCAGATTCTGGCTCCATTACTCTCTTAGGCCAGTTCAATTTGCAGAAATTCATGTCCTCTCTGATGAAAACTGGT
TCTGGAGAGATGTGTCAGTCGTATACCTCACTTCTGCAATATATTGGAAGGCATCTTTACCAACGGTCTTTGTATGCTGTTGGTATCTCTGCCGATATCTTGTTACCACC
CGATGATTCGCTGATGATTAGCTTCGATGGATATGGTGACAGTGATATAGTTCGAACAAAAGCAGTATTTCACCGCAAGTTCCTACATCATGATCTAACAGTGGAGGCAC
TTTCTCCAGGACTTTTTATGGAGAAATGTGGTAGATATTGGGATGTGCCTTCTTCATTGGTTGTTGATCTAGGTTCTGTTGCTTCCGATTCGGGTTTGAGTTACCATTTG
TCTATGCACCAGAATGCTGGGTTTCCCTCACAATTGGGAAGTGAACCGACCCATTCTGCTCCTTTCTGTTTACTTCCTGGTCTATCAGCCAAGGCTGCGTTTGCCTTTAA
GAAGAACTTTGAAATTTGGAGAAGTAATGCCAAGAAGTTAAAGATGGTGCAACCTTATGACATTTTTCTATCAACTCCTCATGTTTCATTGTCAGCGATCATTGGTGCTG
TAGCTACTTCCTACTTTGGAGACGATTTGGCTAGATCAGCAGCACAAGACAGTCTTGAGAAATTCAAAGGATTTTACATGAAGAGTTCTAGAATAAGATCGACTGTTTTC
GCAGATTTATTCACTTCTATTTCTTTTTCAGCTCAGTATGGGATGTTTCAGAAGAAATATCTGGATCTTACCCGATTTTCTGCTTGTATGGATTTCCATTCTGGCTCCAA
GTTTCTTTCAGGATCCATGCTTTTGATAGATGATCTTTCCAACTCCCGGCACCCAAAAACTGAATCTGTGAAAGCGACCTTGCCTAATGCAAGATTTTCCATTCAGCAAC
AGATTGCTGGACCTGTGAGCTTTAGAGCAGATACAGGAGTTGCAATAGATTTGAATAAAGCAGGGTGGGATTTATTACGAGTGGAAGAGCCTACATTTGCCTTGGAATAT
GCGTTGCATGTCCTTGGTTCAGCTAAGGCAATTGCTTGGTATTCACCAAAGCATAGAGAATTTATGGTAGAGCTTCGTTTCTACGAGACCTGA
mRNA sequence
ATGAAGAAGCTAAGATGGGCAATGGACGGGCAAGGCTTCTGGGATCTCGATGTTTCAACGTCTAGAACACTCGATGGCTCTGCTTCCCCGGTTCCTTCTCCTTTACACTT
ACTTCCTTTGGGATTATCCAGAGGCGTTCGTCTTTCCAGAGCCAAGCAGATCGATTTTATGCAGCGTTTCATGGCTGCTCCTTTTGTTCCTTCTTATTCCCCCTCCCATG
GCTTCTCTCTTCAACGCGTCTTCTCCGTACCCTTTTCAGATTCTGGCTCCATTACTCTCTTAGGCCAGTTCAATTTGCAGAAATTCATGTCCTCTCTGATGAAAACTGGT
TCTGGAGAGATGTGTCAGTCGTATACCTCACTTCTGCAATATATTGGAAGGCATCTTTACCAACGGTCTTTGTATGCTGTTGGTATCTCTGCCGATATCTTGTTACCACC
CGATGATTCGCTGATGATTAGCTTCGATGGATATGGTGACAGTGATATAGTTCGAACAAAAGCAGTATTTCACCGCAAGTTCCTACATCATGATCTAACAGTGGAGGCAC
TTTCTCCAGGACTTTTTATGGAGAAATGTGGTAGATATTGGGATGTGCCTTCTTCATTGGTTGTTGATCTAGGTTCTGTTGCTTCCGATTCGGGTTTGAGTTACCATTTG
TCTATGCACCAGAATGCTGGGTTTCCCTCACAATTGGGAAGTGAACCGACCCATTCTGCTCCTTTCTGTTTACTTCCTGGTCTATCAGCCAAGGCTGCGTTTGCCTTTAA
GAAGAACTTTGAAATTTGGAGAAGTAATGCCAAGAAGTTAAAGATGGTGCAACCTTATGACATTTTTCTATCAACTCCTCATGTTTCATTGTCAGCGATCATTGGTGCTG
TAGCTACTTCCTACTTTGGAGACGATTTGGCTAGATCAGCAGCACAAGACAGTCTTGAGAAATTCAAAGGATTTTACATGAAGAGTTCTAGAATAAGATCGACTGTTTTC
GCAGATTTATTCACTTCTATTTCTTTTTCAGCTCAGTATGGGATGTTTCAGAAGAAATATCTGGATCTTACCCGATTTTCTGCTTGTATGGATTTCCATTCTGGCTCCAA
GTTTCTTTCAGGATCCATGCTTTTGATAGATGATCTTTCCAACTCCCGGCACCCAAAAACTGAATCTGTGAAAGCGACCTTGCCTAATGCAAGATTTTCCATTCAGCAAC
AGATTGCTGGACCTGTGAGCTTTAGAGCAGATACAGGAGTTGCAATAGATTTGAATAAAGCAGGGTGGGATTTATTACGAGTGGAAGAGCCTACATTTGCCTTGGAATAT
GCGTTGCATGTCCTTGGTTCAGCTAAGGCAATTGCTTGGTATTCACCAAAGCATAGAGAATTTATGGTAGAGCTTCGTTTCTACGAGACCTGA
Protein sequence
MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQRFMAAPFVPSYSPSHGFSLQRVFSVPFSDSGSITLLGQFNLQKFMSSLMKTG
SGEMCQSYTSLLQYIGRHLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFMEKCGRYWDVPSSLVVDLGSVASDSGLSYHL
SMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGAVATSYFGDDLARSAAQDSLEKFKGFYMKSSRIRSTVF
ADLFTSISFSAQYGMFQKKYLDLTRFSACMDFHSGSKFLSGSMLLIDDLSNSRHPKTESVKATLPNARFSIQQQIAGPVSFRADTGVAIDLNKAGWDLLRVEEPTFALEY
ALHVLGSAKAIAWYSPKHREFMVELRFYET
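
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` function is illustrative, and the two fragments are the first 36 and last 24 nucleotides of the CDS shown in this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGAAGAAGCTAAGATGGGCAATGGACGGGCAAGGC"  # first 36 nt of the CDS
cds_end = "GAGCTTCGTTTCTACGAGACCTGA"  # last 24 nt, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first twelve and last seven residues of the protein sequence listed above. Passing the complete CDS, with line breaks removed, to `translate` should reproduce the full protein if the record is internally consistent.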