CuGenDBv2

PI0000043 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0000043
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome location: chr11:3356901..3359950
RNA-Seq Expression: PI0000043
Synteny: PI0000043
Gene Ontology terms:
GO:0001522 - pseudouridine synthesis (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domains: IPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like
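
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, the Python below splits such a string and reports the span it covers. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB interface.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:3356901..3359950")
print(chrom, start, end, span_length(start, end))
# prints: chr11 3356901 3359950 3050
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, one base shorter.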


Homology
GenBank top hits (e value, % identity, alignment)
TYK02989.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa]; e value: 1.1e-253; % identity: 94.47
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRWAMD  GFWDLDVSTSRTLDGSASPVPSP HLLPLGLSRGVRLSRAKQIDFMQ+FM APFVPSYSPSHGFSLQRVFSIPFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KFMSSLMKTGSGEMGQSFSSF+Q IG  LYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT+EALSPGLF+DKSGRYWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLVVDLGS ASDSGLSYHLSMHQN GFPS LGSEPTHSAPFCL PGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIG VATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        DDLVRSAAQ SL EFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKFLSG+M LIDDLSNSRHPKT++VKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

XP_004138517.2 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic isoform X1 [Cucumis sativus]; e value: 3.2e-256; % identity: 94.47
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQ FMAAPFVPSYSPSHGFSLQRVFS+PFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KFMSSLMKTGSGEM QS+SS LQYIG  LYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLF++K GRYWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIG VATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        DDL RSAAQDSLE+FKGFYM++SRIRST+FADLFTSISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKFLSG+M LIDDLSNSRHPKT+SVKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRAD+GVAIDLNKAGWDLLRV+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

XP_008458282.1 PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo]; e value: 1.1e-253; % identity: 94.47
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRWAMD  GFWDLDVSTSRTLDGSASPVPSP HLLPLGLSRGVRLSRAKQIDFMQ+FM APFVPSYSPSHGFSLQRVFSIPFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KFMSSLMKTGSGEMGQSFSSF+Q IG  LYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT+EALSPGLF+DKSGRYWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLVVDLGS ASDSGLSYHLSMHQN GFPS LGSEPTHSAPFCL PGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIG VATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        DDLVRSAAQ SL EFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKFLSG+M LIDDLSNSRHPKT++VKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

XP_023548884.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo]; e value: 2.3e-222; % identity: 83.8
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ FMAAPFVPSY+PSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KF+SSL K+G GEMGQS SS LQ IG  L  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLT+EALSPGLFVDKSG+YWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLV+DLGS A+DSGLSYHLSMH NAG PSQ GSE T  APFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFLS PHVSLS IIG VATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        D    SAA+ SL+EFKG YMQTSRIRST+FAD+F SISFSAQYGMFQR +LDLTRFS   DFHSGSKFLSG M LI+DLSNS+HP+T+SVKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGPVSFRAD+GVAIDL+KAGW  L+V+EPTFALEYAL VLGSAKAIAWYSPK REFMVELRFYE
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE

XP_038875869.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]; e value: 3.1e-235; % identity: 87.66
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRWAMDGQGFWDLDVST RTLDGSASPVPS LHLLPLGLSRGVRLSRAKQIDFMQ+FMAAPFVPSYSPSHGFSLQRVFSIPFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KF+SSL K+G G+MGQS SSFLQ IG  L  RSLYA+GIS+DILLPPDDSLMISFDGYGD++IVRTKAVFH KFLHHDLT+EA SPGLFVDKSG+YWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        S+LVVDLGS AS+SGLSYHLSMHQN G PSQ GSE   S+P CLLPGLSAKAAFAFKKN EIWRSNAKKLKMVQPYDIFLSTPHVSLS IIG VAT+YFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        D+ +RSAAQDSL EFKG Y+QTSRIRST+FAD+F SISFSAQYGMFQRKYLDLT FSA MDFHSGSKFLSG M LIDDLSNSRHP+T+SV+ATLP+ARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        LQQQIAGPVSFRADSGVAIDLNKAGW LL VDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K824 Uncharacterized protein; e value: 1.5e-256; % identity: 94.47
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQ FMAAPFVPSYSPSHGFSLQRVFS+PFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KFMSSLMKTGSGEM QS+SS LQYIG  LYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLF++K GRYWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIG VATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        DDL RSAAQDSLE+FKGFYM++SRIRST+FADLFTSISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKFLSG+M LIDDLSNSRHPKT+SVKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRAD+GVAIDLNKAGWDLLRV+EPTFALEYAL VLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

A0A1S3C837 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic; e value: 5.5e-254; % identity: 94.47
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRWAMD  GFWDLDVSTSRTLDGSASPVPSP HLLPLGLSRGVRLSRAKQIDFMQ+FM APFVPSYSPSHGFSLQRVFSIPFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KFMSSLMKTGSGEMGQSFSSF+Q IG  LYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT+EALSPGLF+DKSGRYWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLVVDLGS ASDSGLSYHLSMHQN GFPS LGSEPTHSAPFCL PGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIG VATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        DDLVRSAAQ SL EFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKFLSG+M LIDDLSNSRHPKT++VKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

A0A5D3BY40 Protein TRIGALACTOSYLDIACYLGLYCEROL 4; e value: 5.5e-254; % identity: 94.47
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRWAMD  GFWDLDVSTSRTLDGSASPVPSP HLLPLGLSRGVRLSRAKQIDFMQ+FM APFVPSYSPSHGFSLQRVFSIPFSDSGS TLLGQFN+Q
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KFMSSLMKTGSGEMGQSFSSF+Q IG  LYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLT+EALSPGLF+DKSGRYWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLVVDLGS ASDSGLSYHLSMHQN GFPS LGSEPTHSAPFCL PGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIG VATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        DDLVRSAAQ SL EFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQ+KYLDLTRFSACMDFHSGSKFLSG+M LIDDLSNSRHPKT++VKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        +QQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

A0A6J1H3U0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic; e value: 1.4e-220; % identity: 82.94
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ FMAAPFVPS++PSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KF+SSL K+G GEMGQS SS LQ IG  L  RSLYA GIS+DILL PDD+L+ISFDGYGDSDI+RTKAV H KFLHHDLT+EALSPGLFVDKSG+YWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLV+DLGS  +DSGLSYHLSMH NAG PSQ GSE T  APFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFL+ PHVSLS IIG VATSYFG
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        D    SAA+ SL+EF+G YMQTSRIRST+FAD+F SISFSAQYGMFQR +LDLTRFS   DFHSGSKFLSG M LI+DLSNS+HP+T+SVKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGPVSFRADSGVAIDL+KAGW  L+V+EPTFALEYAL  LGSAKAIAWYSPK REFMVELRFYE
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE

A0A6J1KW75 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic; e value: 1.5e-219; % identity: 82.73
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
        MKKLRW M+GQ FWDLDVST RTLDGSASPVP+ L LLPLGLSRGVRLSRAKQIDFMQ FMAAPFVPSY+PSHGFSLQRVFSIPFSDSGSATLLGQFNVQ
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQ

Query:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP
        KF+SSL K+G GEMGQS SS LQ IG  L  RSLYA GIS+DILL PDD+L+ISFDGYGDSD++RTKAV H KFLHHDLT+EALSPGLFVDKSG+YWDVP
Subjt:  KFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVP

Query:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG
        SSLV+DLGS A+DSGLSYHLSMH NAG PSQ GSE T  APFCLLPGLSAKAAFA KKN EIWRSNAKKLK VQPYDIFLS PHVSLS IIG VATSYF 
Subjt:  SSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFG

Query:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS
        D  V SAA+ SL+EFKG +MQTSRIRST+FAD+F SISFSAQYGMFQ  +LDLTRFS   DFHSGSKFLSG M LI+DLSNS+HP+T+SVKATLPNARFS
Subjt:  DDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFS

Query:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE
         QQQIAGPVSFRAD+GVAIDL+KAGW  L+V+EPTFALEYAL  LGSAKAIAWYSPK  EFMVELRFYE
Subjt:  LQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE

SwissProt top hits (e value, % identity, alignment)
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic; e value: 2.0e-120; % identity: 48.03
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  +G   WDLD+ST  TL+G+A  VP     LPLGLSRG RLSR KQ++F   FMA+P +PS+SP           GFSLQRV ++PFS++   
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNVQKFMSSLMKTGSGEMGQS--FSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSPG
        +LLGQF+VQ+F++ + KT +   G S   +S L  IG  L  +SLYA+G  ++ LL PDD+L++S+D Y GD D   R KA+F+ +F  H+LT EA+ PG
Subjt:  TLLGQFNVQKFMSSLMKTGSGEMGQS--FSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSPG

Query:  LFVDKSGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSL
        LFVDK G YWDVP S+ +DL S+ ++SG SYHL +H N+G P +L S+     P  LLPGLS K+A +++ N ++WR    KL+  +PYD+FLS+PHV++
Subjt:  LFVDKSGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSL

Query:  SAIIGTVATSYFGDDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKT
        S IIG+V T+ FG++ +RS  ++  E   GF +    + S   AD     S +AQYG FQ+ + DLTRF A +DF  G +FL+G   +  DL NSR P  
Subjt:  SAIIGTVATSYFGDDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKT

Query:  DSVKATLPNARFSLQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        ++ +   P    SLQQQI GP SF+ +SG+ IDL + G + + VD+  FA+EYALQVL SAKA+  YSPK  EFMVELRF+ET
Subjt:  DSVKATLPNARFSLQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

Arabidopsis top hits (e value, % identity, alignment)
AT2G44640.1 FUNCTIONS IN: molecular_function unknown; e value: 1.2e-59; % identity: 31.89
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSH-----GFSLQRVFSIPFSDSGSATLLG
        M  L  A+D   FWD +VS+ +TL+G+A  VP      PL  +R  R  R +Q+  ++       +PS +P+       FSL  +   P S++    L+G
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSH-----GFSLQRVFSIPFSDSGSATLLG

Query:  QFNVQKFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGR
        QF  +K  + + K       +     ++     +  +SLY++G+   I L    SL++S +  GD + +R K +       HDLTVEA  P LF+D  GR
Subjt:  QFNVQKFMSSLMKTGSGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGR

Query:  YWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQL---GSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKK-------LKMVQPYDIFLSTPHV
        +WDVP SL VD+ S+  +SG+ Y   +H++ G P  +   G E    AP  L+PGL AKAA ++K N ++WR   K+         +  PYD+ L  PH 
Subjt:  YWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQL---GSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKK-------LKMVQPYDIFLSTPHV

Query:  SLSAIIGTVATSYFGDDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHP
        ++S I+G+   ++                 +G  +   + RS I AD+F S  ++ Q G F + Y DLTR  A +D  S         H   + S+    
Subjt:  SLSAIIGTVATSYFGDDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHP

Query:  KTDSVKATLPNARFSL--QQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE
               TL + R +L  QQQ+AGP+ F+ DS   +          R+++  ++L Y+L++L S K +AWYSPK +E M+ELR +E
Subjt:  KTDSVKATLPNARFSL--QQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYE

AT3G06960.1 pigment defective 320; e value: 1.4e-121; % identity: 48.03
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  +G   WDLD+ST  TL+G+A  VP     LPLGLSRG RLSR KQ++F   FMA+P +PS+SP           GFSLQRV ++PFS++   
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNVQKFMSSLMKTGSGEMGQS--FSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSPG
        +LLGQF+VQ+F++ + KT +   G S   +S L  IG  L  +SLYA+G  ++ LL PDD+L++S+D Y GD D   R KA+F+ +F  H+LT EA+ PG
Subjt:  TLLGQFNVQKFMSSLMKTGSGEMGQS--FSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSPG

Query:  LFVDKSGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSL
        LFVDK G YWDVP S+ +DL S+ ++SG SYHL +H N+G P +L S+     P  LLPGLS K+A +++ N ++WR    KL+  +PYD+FLS+PHV++
Subjt:  LFVDKSGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSL

Query:  SAIIGTVATSYFGDDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKT
        S IIG+V T+ FG++ +RS  ++  E   GF +    + S   AD     S +AQYG FQ+ + DLTRF A +DF  G +FL+G   +  DL NSR P  
Subjt:  SAIIGTVATSYFGDDLVRSAAQDSLEEFKGFYMQTSRIRSTIFADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKT

Query:  DSVKATLPNARFSLQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET
        ++ +   P    SLQQQI GP SF+ +SG+ IDL + G + + VD+  FA+EYALQVL SAKA+  YSPK  EFMVELRF+ET
Subjt:  DSVKATLPNARFSLQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFYET

AT3G06960.2 pigment defective 320; e value: 1.8e-76; % identity: 48.38
Query:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSP---------SHGFSLQRVFSIPFSDSGSA
        M ++RW  +G   WDLD+ST  TL+G+A  VP     LPLGLSRG RLSR KQ++F   FMA+P +PS+SP           GFSLQRV ++PFS++   
Subjt:  MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSP---------SHGFSLQRVFSIPFSDSGSA

Query:  TLLGQFNVQKFMSSLMKTGSGEMGQS--FSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSPG
        +LLGQF+VQ+F++ + KT +   G S   +S L  IG  L  +SLYA+G  ++ LL PDD+L++S+D Y GD D   R KA+F+ +F  H+LT EA+ PG
Subjt:  TLLGQFNVQKFMSSLMKTGSGEMGQS--FSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGY-GDSD-IVRTKAVFHRKFLHHDLTVEALSPG

Query:  LFVDKSGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSL
        LFVDK G YWDVP S+ +DL S+ ++SG SYHL +H N+G P +L S+     P  LLPGLS K+A +++ N ++WR    KL+  +PYD+FLS+PHV++
Subjt:  LFVDKSGRYWDVPSSLVVDLGSVASDSGLSYHLSMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSL

Query:  SAIIGTVA
        S IIG ++
Subjt:  SAIIGTVA
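
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts those marks for one block, taking the column offset from the `Query:` line. The function name `midline_counts` is hypothetical, and whether the % identity figures on this page are derived in exactly this way is an assumption, not something the page states.

```python
def midline_counts(query_line: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, aligned_columns) for one alignment block.

    query_line is a 'Query:  SEQUENCE' line; midline is the line printed
    directly beneath it, including its leading indentation.
    """
    sequence = query_line.split(maxsplit=1)[1].strip()
    offset = query_line.index(sequence)  # column where the residues start
    # Pad with spaces in case trailing whitespace was trimmed from the midline.
    columns = midline[offset:offset + len(sequence)].ljust(len(sequence))
    identical = sum(ch.isalpha() for ch in columns)
    positive = identical + columns.count("+")
    return identical, positive, len(sequence)


# Final block of the AT3G06960.2 alignment shown above.
identical, positive, length = midline_counts(
    "Query:  SAIIGTVA",
    "        S IIG ++",
)
print(identical, positive, length)
# prints: 4 6 8
```

Summing `identical` and `length` over every block of a hit and dividing the totals gives a percent identity over the aligned columns.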


Sequences
CDS sequence
ATGAAGAAGCTAAGATGGGCAATGGACGGGCAAGGCTTCTGGGATCTCGATGTTTCAACGTCTAGAACACTCGATGGCTCTGCTTCCCCTGTTCCTTCTCCTTTACACTT
ACTTCCTTTGGGATTATCCAGAGGCGTTCGTCTTTCCAGAGCCAAGCAGATCGATTTCATGCAGAATTTCATGGCTGCTCCTTTTGTTCCTTCTTATTCCCCCTCCCATG
GCTTCTCTCTTCAACGCGTCTTCTCCATACCCTTTTCAGATTCTGGCTCCGCTACTCTCTTAGGCCAGTTCAATGTGCAGAAATTCATGTCCTCTCTGATGAAAACTGGT
TCTGGAGAGATGGGTCAGTCGTTTTCCTCATTTCTGCAATATATTGGAATGCGTCTTTACCAACGGTCCTTGTATGCTGTTGGTATCTCTGCTGATATCTTGTTACCACC
CGACGATTCCCTGATGATTAGCTTCGATGGATATGGTGACAGTGATATAGTTCGAACAAAAGCAGTATTTCACCGCAAGTTCCTACATCATGATCTAACAGTGGAGGCAC
TTTCTCCAGGACTTTTTGTGGACAAATCTGGAAGATATTGGGATGTGCCTTCTTCATTGGTTGTTGATCTAGGTTCTGTTGCTTCCGATTCGGGTTTGAGTTACCATTTG
TCTATGCACCAGAATGCCGGGTTTCCCTCACAATTGGGAAGTGAACCGACCCATTCTGCTCCTTTCTGTTTACTTCCTGGTCTATCAGCCAAGGCTGCTTTTGCCTTTAA
GAAGAACTTTGAAATTTGGAGAAGCAATGCCAAGAAGTTAAAGATGGTGCAACCGTATGACATTTTTCTATCAACTCCTCATGTTTCGTTGTCAGCGATCATTGGTACTG
TAGCTACTTCCTACTTTGGAGACGATTTGGTTAGATCAGCAGCACAAGACAGTCTTGAGGAATTCAAAGGATTTTACATGCAGACATCTAGAATAAGATCTACTATTTTT
GCAGATTTATTCACTTCTATTTCCTTTTCAGCTCAGTATGGGATGTTTCAGAGGAAATATCTTGATCTTACCCGATTTTCCGCTTGCATGGATTTCCATTCTGGCTCCAA
GTTTCTTTCAGGAACCATGCATTTGATAGATGATCTTTCCAACTCCCGGCACCCAAAAACTGACTCTGTGAAAGCGACCTTGCCTAATGCAAGATTTTCCCTTCAGCAAC
AGATTGCTGGACCTGTCAGCTTTAGAGCAGATTCAGGAGTTGCAATAGATTTGAATAAAGCAGGGTGGGATTTATTACGAGTGGATGAGCCTACATTTGCCTTGGAATAT
GCGTTGCAAGTCCTTGGTTCAGCTAAGGCCATTGCTTGGTATTCACCAAAGCATAGAGAATTTATGGTAGAGCTTCGTTTCTACGAGACCTGA
mRNA sequence
ATGAAGAAGCTAAGATGGGCAATGGACGGGCAAGGCTTCTGGGATCTCGATGTTTCAACGTCTAGAACACTCGATGGCTCTGCTTCCCCTGTTCCTTCTCCTTTACACTT
ACTTCCTTTGGGATTATCCAGAGGCGTTCGTCTTTCCAGAGCCAAGCAGATCGATTTCATGCAGAATTTCATGGCTGCTCCTTTTGTTCCTTCTTATTCCCCCTCCCATG
GCTTCTCTCTTCAACGCGTCTTCTCCATACCCTTTTCAGATTCTGGCTCCGCTACTCTCTTAGGCCAGTTCAATGTGCAGAAATTCATGTCCTCTCTGATGAAAACTGGT
TCTGGAGAGATGGGTCAGTCGTTTTCCTCATTTCTGCAATATATTGGAATGCGTCTTTACCAACGGTCCTTGTATGCTGTTGGTATCTCTGCTGATATCTTGTTACCACC
CGACGATTCCCTGATGATTAGCTTCGATGGATATGGTGACAGTGATATAGTTCGAACAAAAGCAGTATTTCACCGCAAGTTCCTACATCATGATCTAACAGTGGAGGCAC
TTTCTCCAGGACTTTTTGTGGACAAATCTGGAAGATATTGGGATGTGCCTTCTTCATTGGTTGTTGATCTAGGTTCTGTTGCTTCCGATTCGGGTTTGAGTTACCATTTG
TCTATGCACCAGAATGCCGGGTTTCCCTCACAATTGGGAAGTGAACCGACCCATTCTGCTCCTTTCTGTTTACTTCCTGGTCTATCAGCCAAGGCTGCTTTTGCCTTTAA
GAAGAACTTTGAAATTTGGAGAAGCAATGCCAAGAAGTTAAAGATGGTGCAACCGTATGACATTTTTCTATCAACTCCTCATGTTTCGTTGTCAGCGATCATTGGTACTG
TAGCTACTTCCTACTTTGGAGACGATTTGGTTAGATCAGCAGCACAAGACAGTCTTGAGGAATTCAAAGGATTTTACATGCAGACATCTAGAATAAGATCTACTATTTTT
GCAGATTTATTCACTTCTATTTCCTTTTCAGCTCAGTATGGGATGTTTCAGAGGAAATATCTTGATCTTACCCGATTTTCCGCTTGCATGGATTTCCATTCTGGCTCCAA
GTTTCTTTCAGGAACCATGCATTTGATAGATGATCTTTCCAACTCCCGGCACCCAAAAACTGACTCTGTGAAAGCGACCTTGCCTAATGCAAGATTTTCCCTTCAGCAAC
AGATTGCTGGACCTGTCAGCTTTAGAGCAGATTCAGGAGTTGCAATAGATTTGAATAAAGCAGGGTGGGATTTATTACGAGTGGATGAGCCTACATTTGCCTTGGAATAT
GCGTTGCAAGTCCTTGGTTCAGCTAAGGCCATTGCTTGGTATTCACCAAAGCATAGAGAATTTATGGTAGAGCTTCGTTTCTACGAGACCTGA
Protein sequence
MKKLRWAMDGQGFWDLDVSTSRTLDGSASPVPSPLHLLPLGLSRGVRLSRAKQIDFMQNFMAAPFVPSYSPSHGFSLQRVFSIPFSDSGSATLLGQFNVQKFMSSLMKTG
SGEMGQSFSSFLQYIGMRLYQRSLYAVGISADILLPPDDSLMISFDGYGDSDIVRTKAVFHRKFLHHDLTVEALSPGLFVDKSGRYWDVPSSLVVDLGSVASDSGLSYHL
SMHQNAGFPSQLGSEPTHSAPFCLLPGLSAKAAFAFKKNFEIWRSNAKKLKMVQPYDIFLSTPHVSLSAIIGTVATSYFGDDLVRSAAQDSLEEFKGFYMQTSRIRSTIF
ADLFTSISFSAQYGMFQRKYLDLTRFSACMDFHSGSKFLSGTMHLIDDLSNSRHPKTDSVKATLPNARFSLQQQIAGPVSFRADSGVAIDLNKAGWDLLRVDEPTFALEY
ALQVLGSAKAIAWYSPKHREFMVELRFYET
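
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The sketch below is one way to do that in plain Python. It assumes the standard nuclear code applies, and `translate` is a hypothetical helper rather than part of any CuGenDB tool. The example input is the final line of the CDS sequence, which starts on a codon boundary because the twelve lines before it hold 110 bases each.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Ambiguous bases such as N
    raise KeyError in this sketch.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


last_cds_line = (
    "GCGTTGCAAGTCCTTGGTTCAGCTAAGGCCATTGCTTGGTATTCACCAAAG"
    "CATAGAGAATTTATGGTAGAGCTTCGTTTCTACGAGACCTGA"
)
print(translate(last_cds_line))
# prints: ALQVLGSAKAIAWYSPKHREFMVELRFYET
```

The printed residues match the last line of the protein sequence above.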