; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019427 (gene) of Snake gourd v1 genome

Gene IDTan0019427
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome locationLG06:80348804..80351407
RNA-Seq ExpressionTan0019427
SyntenyTan0019427
Gene Ontology termsGO:0001522 - pseudouridine synthesis (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575159.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.4e-21683.16Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        M+KLRWTMEGQSFWDLDVSTPRTLDGSASPVP D  +LPLGL RGVRLSRA+QIDFMQ+FMAAPFVPSY PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KF+SSLKKSG G+M QS SSLLQ IG HL  RSLYA GISSD LLTPDD LLISFDGYGDS++LRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        SS VID+GS+  DSG SYHLSMHHNAGSPS+SGSEQ   APF LLPGL+ KAAF+ KKN EIW+SNAKKLK VQPYDIFLSNP VSLSGI+GAVA SY G
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D S  SAA+ +LQEFKGL++Q SRIRST+ ADVFASISFSAQYGMFQR FLDLTRFS R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE
         QQQIAGP+SF ADSGVAIDL+K   G +QVEEPTFALEYAL  LGSAKAIAWYSPK +EFMVELRFYE
Subjt:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE

XP_022959177.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita moschata]5.8e-21883.58Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVP D  +LPLGL RGVRLSRA+QIDFMQ+FMAAPFVPS+ PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KF+SSLKKSG G+M QS SSLLQ IG HL  RSLYA GISSD LLTPDD LLISFDGYGDS++LRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        SS VID+GS+  DSG SYHLSMHHNAGSPSQSGSEQ   APF LLPGL+ KAAF+ KKN EIW+SNAKKLK VQPYDIFL+NPHVSLSGI+GAVA SY G
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D S  SAA+ +LQEF+GL+MQTSRIRST+ ADVFASISFSAQYGMFQR FLDLTRFS R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE
         QQQIAGP+SF ADSGVAIDL+K   G +QVEEPTFALEYAL  LGSAKAIAWYSPK +EFMVELRFYE
Subjt:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE

XP_023006570.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita maxima]9.0e-21984.43Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVP D  +LPLGL RGVRLSRA+QIDFMQ+FMAAPFVPSY PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KF+SSLKKSG G+M QS SSLLQ IG HL  RSLYA GISSD LLTPDD LLISFDGYGDS+VLRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        SS VID+GS+A DSG SYHLSMHHNAGSPSQSGSEQ   APF LLPGL+ KAAF+ KKN EIW+SNAKKLK VQPYDIFLSNPHVSLSGI+GAVA SY  
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D SV SAA+ +LQEFKGLHMQTSRIRST+ ADVFASISFSAQYGMFQ  FLDLTRFS R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE
         QQQIAGP+SF AD+GVAIDL+K   G +QVEEPTFALEYAL  LGSAKAIAWYSPK  EFMVELRFYE
Subjt:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE

XP_023548884.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo]4.7e-22084.43Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVP D  +LPLGL RGVRLSRA+QIDFMQ+FMAAPFVPSY PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KF+SSLKKSG G+M QS SSLLQ IG HL  RSLYA GISSD LLTPDD LLISFDGYGDS++LRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        SS VID+GS+A DSG SYHLSMHHNAGSPSQSGSEQ   APF LLPGL+ KAAF+ KKN EIW+SNAKKLK VQPYDIFLSNPHVSLSGI+GAVA SY G
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D S  SAA+ +LQEFKGL+MQTSRIRST+ ADVFASISFSAQYGMFQR FLDLTRFS R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE
         QQQIAGP+SF AD+GVAIDL+K   G +QVEEPTFALEYAL VLGSAKAIAWYSPK +EFMVELRFYE
Subjt:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE

XP_038875869.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]1.5e-21882.34Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPA--DVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRW M+GQ FWDLDVSTPRTLDGSASPVP+   +LPLGL RGVRLSRA+QIDFMQ FMAAPFVPSY+PSHGFSLQRVF+IPFSDSGS T LGQFNLQ
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPA--DVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KF+SSLKKSG G M QS SS LQ IG HLCHRSLYALGISSD LL PDD L+ISFDGYGD+E++RTKAV HHKFLHHDLTMEA SPGLFVDK G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        S+ V+D+GS+A +SG SYHLSMH N GSPSQSGSEQ  ++P  LLPGL+ KAAF+FKKN EIW+SNAKKLKMVQPYDIFLS PHVSLSGI+GAVA +Y G
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        DNS++SAAQD+L EFKGL++QTSRIRST+ ADVFASISFSAQYGMFQR +LDLT FSARMDFHSGSKFLSGAMLLI+DLSNSRHPRTESVRATLP+ARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPKGG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET
        LQQQIAGP+SF ADSGVAIDLNK   G + V+EPTFALEYALQVLGSAKAIAWYSPKH+EFMVELRFYET
Subjt:  LQQQIAGPISFIADSGVAIDLNKPKGG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET

TrEMBL top hitse value%identityAlignment
A0A0A0K824 Uncharacterized protein1.5e-20878.72Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPA--DVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRW M+GQ FWDLDVST RTLDGSASPVP+   +LPLGL RGVRLSRA+QIDFMQRFMAAPFVPSY+PSHGFSLQRVF++PFSDSGS T LGQFNLQ
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPA--DVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KFMSSL K+GSG+M QS SSLLQ IG HL  RSLYA+GIS+D LL PDD L+ISFDGYGDS+++RTKAV H KFLHHDLT+EALSPGLF++K G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        SS V+D+GS A DSG SYHLSMH NAG PSQ GSE   +APF LLPGL+ KAAF+FKKNFEIW+SNAKKLKMVQPYDIFLS PHVSLS I+GAVA SY G
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D+  +SAAQD+L++FKG +M++SRIRST+ AD+F SISFSAQYGMFQ+ +LDLTRFSA MDFHSGSKFLSG+MLLI+DLSNSRHP+TESV+ATLPNARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPKGG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET
        +QQQIAGP+SF AD+GVAIDLNK     ++VEEPTFALEYAL VLGSAKAIAWYSPKH+EFMVELRFYET
Subjt:  LQQQIAGPISFIADSGVAIDLNKPKGG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET

A0A5D3BY40 Protein TRIGALACTOSYLDIACYLGLYCEROL 49.4e-20678.3Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPA--DVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRW M+G  FWDLDVST RTLDGSASPVP+   +LPLGL RGVRLSRA+QIDFMQ FM APFVPSY+PSHGFSLQRVF+IPFSDSGS T LGQFNLQ
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPA--DVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KFMSSL K+GSG+M QS SS +Q IG HL  RSLYA+GIS+D LL PDD L+ISFDGYGDS+++RTKAV H KFLHHDLTMEALSPGLF+DK G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        SS V+D+GS+A DSG SYHLSMH N G PS  GSE   +APF L PGL+ KAAF+FKKNFEIW+SNAKKLKMVQPYDIFLS PHVSLS I+GAVA SY G
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D+ V+SAAQ +L EFKG +MQTSRIRST+ AD+F SISFSAQYGMFQ+ +LDLTRFSA MDFHSGSKFLSG+MLLI+DLSNSRHP+TE+V+ATLPNARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPKGG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET
        +QQQIAGP+SF ADSGVAIDLNK     ++V+EPTFALEYALQVLGSAKAIAWYSPKH+EFMVELRFYET
Subjt:  LQQQIAGPISFIADSGVAIDLNKPKGG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET

A0A6J1E1S3 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.2e-20880.43Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPA--DVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTM+GQ FW+LDVSTP TLDG+ASPVPA   +LPLGL RG RLSRA+QIDFMQRFMAAPFVPSYAPSHGFSLQRVF  PF   GS T LGQFNLQ
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPA--DVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPD--DRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWD
        KF+SS  +  SG M  S SS LQAIG HLCH S YALG+SSD  L PD  D LLISFDGYG++ +LRTKA+LHHKFLHHDLTMEALSPGLFVD+ GNYWD
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPD--DRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWD

Query:  VPSSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISY
        VPSS +ID+GS+A DSGPSYHLSMHHNAGSPSQSG+E+    PF LLPGL++KAAFSFK NF+IW+SNAKKLKMVQPYDIFLSNPHVSLSGI+GAVA +Y
Subjt:  VPSSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISY

Query:  IGDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNAR
         GDNSV+SAAQD+LQEFKGL+MQTS+IRST+LADVFASISFSAQYGMFQR FLDLTRFSA MDFHSGSKF+SGA LLIEDLSNS  PRTE+V+A LP+AR
Subjt:  IGDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNAR

Query:  FSLQQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE
        FSLQQQIAGPISF ADS V IDLNK   G+QVEEPTFALEYALQVLGSAKAIAWYSPK +EFMVELRFYE
Subjt:  FSLQQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE

A0A6J1H3U0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.8e-21883.58Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVP D  +LPLGL RGVRLSRA+QIDFMQ+FMAAPFVPS+ PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KF+SSLKKSG G+M QS SSLLQ IG HL  RSLYA GISSD LLTPDD LLISFDGYGDS++LRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        SS VID+GS+  DSG SYHLSMHHNAGSPSQSGSEQ   APF LLPGL+ KAAF+ KKN EIW+SNAKKLK VQPYDIFL+NPHVSLSGI+GAVA SY G
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D S  SAA+ +LQEF+GL+MQTSRIRST+ ADVFASISFSAQYGMFQR FLDLTRFS R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE
         QQQIAGP+SF ADSGVAIDL+K   G +QVEEPTFALEYAL  LGSAKAIAWYSPK +EFMVELRFYE
Subjt:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE

A0A6J1KW75 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic4.3e-21984.43Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ
        MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVP D  +LPLGL RGVRLSRA+QIDFMQ+FMAAPFVPSY PSHGFSLQRVF+IPFSDSGSAT LGQFN+Q
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPAD--VLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQ

Query:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP
        KF+SSLKKSG G+M QS SSLLQ IG HL  RSLYA GISSD LLTPDD LLISFDGYGDS+VLRTKAVLHHKFLHHDLTMEALSPGLFVDK G YWDVP
Subjt:  KFMSSLKKSGSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVP

Query:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG
        SS VID+GS+A DSG SYHLSMHHNAGSPSQSGSEQ   APF LLPGL+ KAAF+ KKN EIW+SNAKKLK VQPYDIFLSNPHVSLSGI+GAVA SY  
Subjt:  SSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIG

Query:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS
        D SV SAA+ +LQEFKGLHMQTSRIRST+ ADVFASISFSAQYGMFQ  FLDLTRFS R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV+ATLPNARFS
Subjt:  DNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFS

Query:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE
         QQQIAGP+SF AD+GVAIDL+K   G +QVEEPTFALEYAL  LGSAKAIAWYSPK  EFMVELRFYE
Subjt:  LQQQIAGPISFIADSGVAIDLNKPK-GGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE

SwissProt top hitse value%identityAlignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic3.2e-12648.86Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF
        M ++RW  EG   WDLD+STP TL+G+A  VP D LPLGL RG RLSR +Q++F  RFMA+P +PS++P           GFSLQRV T+PFS++   + 
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF

Query:  LGQFNLQKFMSSLKKS---GSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGY-GD-SEVLRTKAVLHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ + K+   G G    + +S L  IG HL  +SLYALG  S+ LL+PDD LL+S+D Y GD  +  R KA+ +H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFMSSLKKS---GSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGY-GD-SEVLRTKAVLHHKFLHHDLTMEALSPGL

Query:  FVDKYGNYWDVPSSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK+G YWDVP S  ID+ S   +SGPSYHL +HHN+GSP +  S+ +   P SLLPGL++K+A S++ N ++W+    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKYGNYWDVPSSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLS

Query:  GILGAVAISYIGDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE
        GI+G+V  +  G+NS++S  +++ +   G  +    + S  +AD     S +AQYG FQ+ F DLTRF AR+DF  G +FL+GA  + +DL NSR P  E
Subjt:  GILGAVAISYIGDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE

Query:  SVRATLPNARFSLQQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET
        + +   P    SLQQQI GP SF  +SG+ IDL      + V++  FA+EYALQVL SAKA+  YSPK  EFMVELRF+ET
Subjt:  SVRATLPNARFSLQQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET

Arabidopsis top hitse value%identityAlignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown2.1e-6433.97Show/hide
Query:  FWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSH-----GFSLQRVFTIPFSDSGSATFLGQFNLQKFMSSLKKS
        FWD +VS+P+TL+G+A  VP +  PL   R  R  R QQ+  ++       +PS AP+       FSL  +   P S++     +GQF  +K  + +K  
Subjt:  FWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSH-----GFSLQRVFTIPFSDSGSATFLGQFNLQKFMSSLKKS

Query:  GSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVPSSFVIDVGS
         S    +    +++    H+  +SLY++G+ +   L     LL+S +  GD   LR K +L H    HDLT+EA  P LF+D  G +WDVP S  +DV S
Subjt:  GSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVPSSFVIDVGS

Query:  SALDSGPSYHLSMHHNAGSP---SQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKK-------LKMVQPYDIFLSNPHVSLSGILGAVAISYI
           +SG  Y   +H + G+P   + +G E    AP SL+PGL  KAA S+K N ++W+   K+         +  PYD+ L  PH ++SGI+G+   ++I
Subjt:  SALDSGPSYHLSMHHNAGSP---SQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKK-------LKMVQPYDIFLSNPHVSLSGILGAVAISYI

Query:  GDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARF
            +               +   + RS + ADVF S  ++ Q G F + + DLTR  AR+D       L  A  L + L    H  + +   TL + R 
Subjt:  GDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARF

Query:  SL--QQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE
        +L  QQQ+AGPI F  DS   +      G  ++E+  ++L Y+L++L S K +AWYSPK KE M+ELR +E
Subjt:  SL--QQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYE

AT3G06960.1 pigment defective 3202.3e-12748.86Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF
        M ++RW  EG   WDLD+STP TL+G+A  VP D LPLGL RG RLSR +Q++F  RFMA+P +PS++P           GFSLQRV T+PFS++   + 
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF

Query:  LGQFNLQKFMSSLKKS---GSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGY-GD-SEVLRTKAVLHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ + K+   G G    + +S L  IG HL  +SLYALG  S+ LL+PDD LL+S+D Y GD  +  R KA+ +H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFMSSLKKS---GSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGY-GD-SEVLRTKAVLHHKFLHHDLTMEALSPGL

Query:  FVDKYGNYWDVPSSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK+G YWDVP S  ID+ S   +SGPSYHL +HHN+GSP +  S+ +   P SLLPGL++K+A S++ N ++W+    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKYGNYWDVPSSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLS

Query:  GILGAVAISYIGDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE
        GI+G+V  +  G+NS++S  +++ +   G  +    + S  +AD     S +AQYG FQ+ F DLTRF AR+DF  G +FL+GA  + +DL NSR P  E
Subjt:  GILGAVAISYIGDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLADVFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE

Query:  SVRATLPNARFSLQQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET
        + +   P    SLQQQI GP SF  +SG+ IDL      + V++  FA+EYALQVL SAKA+  YSPK  EFMVELRF+ET
Subjt:  SVRATLPNARFSLQQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQVLGSAKAIAWYSPKHKEFMVELRFYET

AT3G06960.2 pigment defective 3207.1e-8149.84Show/hide
Query:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF
        M ++RW  EG   WDLD+STP TL+G+A  VP D LPLGL RG RLSR +Q++F  RFMA+P +PS++P           GFSLQRV T+PFS++   + 
Subjt:  MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAP---------SHGFSLQRVFTIPFSDSGSATF

Query:  LGQFNLQKFMSSLKKS---GSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGY-GD-SEVLRTKAVLHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ + K+   G G    + +S L  IG HL  +SLYALG  S+ LL+PDD LL+S+D Y GD  +  R KA+ +H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFMSSLKKS---GSGKMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGY-GD-SEVLRTKAVLHHKFLHHDLTMEALSPGL

Query:  FVDKYGNYWDVPSSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK+G YWDVP S  ID+ S   +SGPSYHL +HHN+GSP +  S+ +   P SLLPGL++K+A S++ N ++W+    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKYGNYWDVPSSFVIDVGSSALDSGPSYHLSMHHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLS

Query:  GILGAVA
        GI+G ++
Subjt:  GILGAVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGCTAAGATGGACAATGGAGGGGCAAAGCTTTTGGGACCTGGATGTTTCAACGCCTAGAACACTGGATGGATCGGCCTCCCCTGTTCCCGCGGACGTTCTTCC
CTTGGGATTGTTCAGAGGCGTTAGGCTTTCCAGGGCCCAACAGATCGATTTCATGCAGCGCTTCATGGCTGCACCTTTTGTCCCTTCTTATGCCCCCTCCCATGGCTTTT
CTCTCCAGCGCGTCTTTACCATCCCCTTTTCCGACTCCGGGTCCGCCACTTTTTTAGGTCAGTTTAATTTGCAGAAGTTCATGTCCTCTCTTAAGAAATCTGGTTCTGGA
AAGATGCGTCAGTCCGCGTCCTCATTGCTGCAAGCCATTGGAAGCCACCTCTGCCACCGATCTTTGTATGCTCTTGGTATCTCTTCTGATACCCTGTTAACTCCCGATGA
TAGGTTGTTGATCAGCTTCGACGGATATGGCGACAGTGAAGTACTTCGAACAAAAGCAGTACTCCACCACAAGTTCCTACATCATGATCTAACAATGGAAGCACTTTCTC
CAGGACTTTTTGTGGACAAATATGGTAACTACTGGGATGTGCCTTCTTCATTTGTCATTGATGTAGGTTCTTCTGCTCTTGACTCGGGTCCAAGTTATCACTTGTCTATG
CACCACAATGCCGGGTCTCCCTCACAATCTGGAAGTGAACAACTCCTTACGGCTCCTTTCTCTCTACTTCCTGGTCTAGCTGTCAAGGCTGCTTTTTCCTTTAAGAAGAA
CTTTGAAATCTGGAAAAGCAACGCCAAGAAGTTAAAGATGGTGCAACCATATGACATTTTCCTATCAAATCCTCATGTTTCATTGTCAGGGATCCTTGGTGCTGTAGCTA
TTTCCTACATTGGAGACAATTCAGTTAAATCAGCAGCACAAGACAATCTTCAGGAATTTAAAGGACTTCACATGCAGACTTCTAGAATAAGATCTACCCTTTTAGCGGAT
GTATTTGCTTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGACCATTTCTGGATCTTACCCGTTTTTCTGCACGTATGGATTTCCATTCTGGCTCCAAGTTCCT
TTCAGGAGCCATGCTTTTGATAGAAGATCTTTCCAATTCCCGGCACCCAAGAACTGAATCTGTGAGAGCGACCTTGCCTAATGCTAGATTTTCCCTTCAGCAGCAGATTG
CCGGACCCATCAGCTTTATAGCAGATTCAGGAGTTGCAATAGATTTGAATAAACCAAAGGGAGGTATACAAGTTGAGGAGCCTACATTTGCCTTGGAATATGCGTTGCAG
GTCCTTGGTTCAGCTAAAGCCATCGCTTGGTATTCACCAAAGCACAAAGAATTTATGGTAGAGCTCCGTTTCTATGAGACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAGCTAAGATGGACAATGGAGGGGCAAAGCTTTTGGGACCTGGATGTTTCAACGCCTAGAACACTGGATGGATCGGCCTCCCCTGTTCCCGCGGACGTTCTTCC
CTTGGGATTGTTCAGAGGCGTTAGGCTTTCCAGGGCCCAACAGATCGATTTCATGCAGCGCTTCATGGCTGCACCTTTTGTCCCTTCTTATGCCCCCTCCCATGGCTTTT
CTCTCCAGCGCGTCTTTACCATCCCCTTTTCCGACTCCGGGTCCGCCACTTTTTTAGGTCAGTTTAATTTGCAGAAGTTCATGTCCTCTCTTAAGAAATCTGGTTCTGGA
AAGATGCGTCAGTCCGCGTCCTCATTGCTGCAAGCCATTGGAAGCCACCTCTGCCACCGATCTTTGTATGCTCTTGGTATCTCTTCTGATACCCTGTTAACTCCCGATGA
TAGGTTGTTGATCAGCTTCGACGGATATGGCGACAGTGAAGTACTTCGAACAAAAGCAGTACTCCACCACAAGTTCCTACATCATGATCTAACAATGGAAGCACTTTCTC
CAGGACTTTTTGTGGACAAATATGGTAACTACTGGGATGTGCCTTCTTCATTTGTCATTGATGTAGGTTCTTCTGCTCTTGACTCGGGTCCAAGTTATCACTTGTCTATG
CACCACAATGCCGGGTCTCCCTCACAATCTGGAAGTGAACAACTCCTTACGGCTCCTTTCTCTCTACTTCCTGGTCTAGCTGTCAAGGCTGCTTTTTCCTTTAAGAAGAA
CTTTGAAATCTGGAAAAGCAACGCCAAGAAGTTAAAGATGGTGCAACCATATGACATTTTCCTATCAAATCCTCATGTTTCATTGTCAGGGATCCTTGGTGCTGTAGCTA
TTTCCTACATTGGAGACAATTCAGTTAAATCAGCAGCACAAGACAATCTTCAGGAATTTAAAGGACTTCACATGCAGACTTCTAGAATAAGATCTACCCTTTTAGCGGAT
GTATTTGCTTCCATTTCTTTTTCAGCTCAGTATGGGATGTTTCAAAGACCATTTCTGGATCTTACCCGTTTTTCTGCACGTATGGATTTCCATTCTGGCTCCAAGTTCCT
TTCAGGAGCCATGCTTTTGATAGAAGATCTTTCCAATTCCCGGCACCCAAGAACTGAATCTGTGAGAGCGACCTTGCCTAATGCTAGATTTTCCCTTCAGCAGCAGATTG
CCGGACCCATCAGCTTTATAGCAGATTCAGGAGTTGCAATAGATTTGAATAAACCAAAGGGAGGTATACAAGTTGAGGAGCCTACATTTGCCTTGGAATATGCGTTGCAG
GTCCTTGGTTCAGCTAAAGCCATCGCTTGGTATTCACCAAAGCACAAAGAATTTATGGTAGAGCTCCGTTTCTATGAGACCTGA
Protein sequenceShow/hide protein sequence
MKKLRWTMEGQSFWDLDVSTPRTLDGSASPVPADVLPLGLFRGVRLSRAQQIDFMQRFMAAPFVPSYAPSHGFSLQRVFTIPFSDSGSATFLGQFNLQKFMSSLKKSGSG
KMRQSASSLLQAIGSHLCHRSLYALGISSDTLLTPDDRLLISFDGYGDSEVLRTKAVLHHKFLHHDLTMEALSPGLFVDKYGNYWDVPSSFVIDVGSSALDSGPSYHLSM
HHNAGSPSQSGSEQLLTAPFSLLPGLAVKAAFSFKKNFEIWKSNAKKLKMVQPYDIFLSNPHVSLSGILGAVAISYIGDNSVKSAAQDNLQEFKGLHMQTSRIRSTLLAD
VFASISFSAQYGMFQRPFLDLTRFSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVRATLPNARFSLQQQIAGPISFIADSGVAIDLNKPKGGIQVEEPTFALEYALQ
VLGSAKAIAWYSPKHKEFMVELRFYET