; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0027195 (gene) of Chayote v1 genome

Gene IDSed0027195
OrganismSechium edule (Chayote v1)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome locationLG09:6670615..6680317
RNA-Seq ExpressionSed0027195
SyntenySed0027195
Gene Ontology termsGO:0001522 - pseudouridine synthesis (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0009982 - pseudouridine synthase activity (molecular function)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575159.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.2e-21682.13Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        M+KLRWTM+GQSFWDLDVSTPRTLDGSASPVP D  LLPLGL RGVRLSRAKQIDFMQ+FMAAPFVPS++ SH GFSLQR FS+PFSDSGSAT LGQFN+
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKF+SS+K SG G+M +S SS L GIG HL  RSLYA GISSD++LTPDD L ISFDGYGDSD+LRTKAVLHHKFLHHDLTMEALSPGLFVDKSG +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PSSLVID+GSA  DSG SYHLSMHHNAGSPS++GSEQT   PFCLLPGL+ KAAF+ KKN EIWRSNAKKLK VQPYDIFLSNP VSLS I+GAVAT+YF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
        GD S  SAA  SLQEFKGL++Q S IRSTV ADVFASISFSAQYGMFQR FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV  TLPNARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE
        S QQQI GP+SFRADSGVAIDL+K  WG +QVEEPTFALEYAL  LGSAKAIAWYSPK REFMVELRF+E
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE

XP_022959177.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita moschata]2.6e-21882.77Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRWTM+GQSFWDLDVSTPRTLDGSASPVP D  LLPLGL RGVRLSRAKQIDFMQ+FMAAPFVPSF+ SH GFSLQR FS+PFSDSGSAT LGQFN+
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKF+SS+K SG G+M +S SS L GIG HL  RSLYA GISSD++LTPDD L ISFDGYGDSD+LRTKAVLHHKFLHHDLTMEALSPGLFVDKSG +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PSSLVID+GSA  DSG SYHLSMHHNAGSPSQ+GSEQT   PFCLLPGL+ KAAF+ KKN EIWRSNAKKLK VQPYDIFL+NPHVSLS I+GAVAT+YF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
        GD S  SAA  SLQEF+GL++QTS IRSTV ADVFASISFSAQYGMFQR FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV  TLPNARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE
        S QQQI GP+SFRADSGVAIDL+K  WG +QVEEPTFALEYAL  LGSAKAIAWYSPK REFMVELRF+E
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE

XP_023006570.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita maxima]5.8e-21882.98Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRWTM+GQSFWDLDVSTPRTLDGSASPVP D  LLPLGL RGVRLSRAKQIDFMQ+FMAAPFVPS++ SH GFSLQR FS+PFSDSGSAT LGQFN+
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKF+SS+K SG G+M +S SS L GIG HL  RSLYA GISSD++LTPDD L ISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSG +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PSSLVID+GSAA DSG SYHLSMHHNAGSPSQ+GSEQT   PFCLLPGL+ KAAF+ KKN EIWRSNAKKLK VQPYDIFLSNPHVSLS I+GAVAT+YF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
         D SV SAA  SLQEFKGLH+QTS IRSTV ADVFASISFSAQYGMFQ  FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV  TLPNARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE
        S QQQI GP+SFRAD+GVAIDL+K  WG +QVEEPTFALEYAL  LGSAKAIAWYSPK  EFMVELRF+E
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE

XP_023548884.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo]1.4e-21983.19Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRWTM+GQSFWDLDVSTPRTLDGSASPVP D  LLPLGL RGVRLSRAKQIDFMQ+FMAAPFVPS++ SH GFSLQR FS+PFSDSGSAT LGQFN+
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKF+SS+K SG G+M +S SS L GIG HL  RSLYA GISSD++LTPDD L ISFDGYGDSD+LRTKAVLHHKFLHHDLTMEALSPGLFVDKSG +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PSSLVID+GSAA DSG SYHLSMHHNAGSPSQ+GSEQT   PFCLLPGL+ KAAF+ KKN EIWRSNAKKLK VQPYDIFLSNPHVSLS I+GAVAT+YF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
        GD S  SAA  SLQEFKGL++QTS IRSTV ADVFASISFSAQYGMFQR FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV  TLPNARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE
        S QQQI GP+SFRAD+GVAIDL+K  WG +QVEEPTFALEYAL VLGSAKAIAWYSPK REFMVELRF+E
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE

XP_038875869.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]8.1e-22082.59Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPA--DLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRW MDGQ FWDLDVSTPRTLDGSASPVP+   LLPLGL RGVRLSRAKQIDFMQ FMAAPFVPS+S SH GFSLQR FS+PFSDSGS T LGQFNL
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPA--DLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKFISS+K SG G M +S SSFL  IG HLCHRSLYALGISSD++L PDD+L ISFDGYGD++++RTKAV HHKFLHHDLTMEA SPGLFVDKSG +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PS+LV+D+GSAA +SG SYHLSMH N GSPSQ+GSEQ    P CLLPGL+ KAAF+FKKN EIWRSNAKKLKMVQPYDIFLS PHVSLS I+GAVATTYF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
        GDNS+RSAA DSL EFKGL+LQTS IRSTV ADVFASISFSAQYGMFQR +LDLT  SARMDFHSGSKFLSGAMLLI+DLSNSRHPRTESV  TLP+ARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET
        SLQQQI GP+SFRADSGVAIDLNK  WG + V+EPTFALEYALQVLGSAKAIAWYSPKHREFMVELRF+ET
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET

TrEMBL top hitse value%identityAlignment
A0A0A0K824 Uncharacterized protein3.1e-20978.34Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPA--DLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRW MDGQ FWDLDVST RTLDGSASPVP+   LLPLGL RGVRLSRAKQIDFMQRFMAAPFVPS+S SH GFSLQR FS+PFSDSGS T LGQFNL
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPA--DLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKF+SS+  +GSG+M +S SS L  IG HL  RSLYA+GIS+D++L PDD+L ISFDGYGDSD++RTKAV H KFLHHDLT+EALSPGLF++K G +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PSSLV+D+GS A DSG SYHLSMH NAG PSQ GSE T   PFCLLPGL+ KAAF+FKKNFEIWRSNAKKLKMVQPYDIFLS PHVSLSAI+GAVAT+YF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
        GD+  RSAA DSL++FKG ++++S IRSTV AD+F SISFSAQYGMFQ+ +LDLTR SA MDFHSGSKFLSG+MLLI+DLSNSRHP+TESV  TLPNARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET
        S+QQQI GP+SFRAD+GVAIDLNK  W  ++VEEPTFALEYAL VLGSAKAIAWYSPKHREFMVELRF+ET
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET

A0A5D3BY40 Protein TRIGALACTOSYLDIACYLGLYCEROL 41.3e-20778.34Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPA--DLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRW MDG  FWDLDVST RTLDGSASPVP+   LLPLGL RGVRLSRAKQIDFMQ FM APFVPS+S SH GFSLQR FS+PFSDSGS T LGQFNL
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPA--DLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKF+SS+  +GSG+M +S SSF+  IG HL  RSLYA+GIS+D++L PDD+L ISFDGYGDSD++RTKAV H KFLHHDLTMEALSPGLF+DKSG +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PSSLV+D+GSAA DSG SYHLSMH N G PS  GSE T   PFCL PGL+ KAAF+FKKNFEIWRSNAKKLKMVQPYDIFLS PHVSLSAI+GAVAT+YF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
        GD+ VRSAA  SL EFKG ++QTS IRST+ AD+F SISFSAQYGMFQ+ +LDLTR SA MDFHSGSKFLSG+MLLI+DLSNSRHP+TE+V  TLPNARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET
        S+QQQI GP+SFRADSGVAIDLNK  W  ++V+EPTFALEYALQVLGSAKAIAWYSPKHREFMVELRF+ET
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET

A0A6J1E1S3 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic4.4e-21181.1Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPA--DLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRWTMDGQ FW+LDVSTP TLDG+ASPVPA   LLPLGL RG RLSRAKQIDFMQRFMAAPFVPS++ SH GFSLQR F  PF   GS T LGQFNL
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPA--DLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPD--DTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFW
        QKFISS   + SG M  S SSFL  IG HLCH S YALG+SSD+ L PD  D+L ISFDGYG++ +LRTKA+LHHKFLHHDLTMEALSPGLFVD+SGN+W
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPD--DTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFW

Query:  DVPSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATT
        DVPSSL+ID+GSAA DSGPSYHLSMHHNAGSPSQ+G+E+T   PFCLLPGL++KAAFSFK NF+IWRSNAKKLKMVQPYDIFLSNPHVSLS I+GAVATT
Subjt:  DVPSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATT

Query:  YFGDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNA
        YFGDNSVRSAA DSLQEFKGL++QTS IRSTVLADVFASISFSAQYGMFQR FLDLTR SA MDFHSGSKF+SGA LLIEDLSNS  PRTE+V   LP+A
Subjt:  YFGDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNA

Query:  RFSLQQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE
        RFSLQQQI GPISFRADS V IDLNK  WG+QVEEPTFALEYALQVLGSAKAIAWYSPK REFMVELRF+E
Subjt:  RFSLQQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE

A0A6J1H3U0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.3e-21882.77Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRWTM+GQSFWDLDVSTPRTLDGSASPVP D  LLPLGL RGVRLSRAKQIDFMQ+FMAAPFVPSF+ SH GFSLQR FS+PFSDSGSAT LGQFN+
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKF+SS+K SG G+M +S SS L GIG HL  RSLYA GISSD++LTPDD L ISFDGYGDSD+LRTKAVLHHKFLHHDLTMEALSPGLFVDKSG +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PSSLVID+GSA  DSG SYHLSMHHNAGSPSQ+GSEQT   PFCLLPGL+ KAAF+ KKN EIWRSNAKKLK VQPYDIFL+NPHVSLS I+GAVAT+YF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
        GD S  SAA  SLQEF+GL++QTS IRSTV ADVFASISFSAQYGMFQR FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV  TLPNARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE
        S QQQI GP+SFRADSGVAIDL+K  WG +QVEEPTFALEYAL  LGSAKAIAWYSPK REFMVELRF+E
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE

A0A6J1KW75 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.8e-21882.98Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL
        MKKLRWTM+GQSFWDLDVSTPRTLDGSASPVP D  LLPLGL RGVRLSRAKQIDFMQ+FMAAPFVPS++ SH GFSLQR FS+PFSDSGSAT LGQFN+
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPAD--LLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNL

Query:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV
        QKF+SS+K SG G+M +S SS L GIG HL  RSLYA GISSD++LTPDD L ISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSG +WDV
Subjt:  QKFISSVKNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDV

Query:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF
        PSSLVID+GSAA DSG SYHLSMHHNAGSPSQ+GSEQT   PFCLLPGL+ KAAF+ KKN EIWRSNAKKLK VQPYDIFLSNPHVSLS I+GAVAT+YF
Subjt:  PSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
         D SV SAA  SLQEFKGLH+QTS IRSTV ADVFASISFSAQYGMFQ  FLDLTR S R DFHSGSKFLSGAMLLIEDLSNS+HPRTESV  TLPNARF
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE
        S QQQI GP+SFRAD+GVAIDL+K  WG +QVEEPTFALEYAL  LGSAKAIAWYSPK  EFMVELRF+E
Subjt:  SLQQQIVGPISFRADSGVAIDLNKPRWG-IQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE

SwissProt top hitse value%identityAlignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.6e-12849.9Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFS--------HSHGGFSLQRAFSLPFSDSGSATF
        M ++RW  +G   WDLD+STP TL+G+A  VP D LPLGL RG RLSR KQ++F  RFMA+P +PSFS           GGFSLQR  +LPFS++   + 
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFS--------HSHGGFSLQRAFSLPFSDSGSATF

Query:  LGQFNLQKFISSV---KNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGY-GDSDV-LRTKAVLHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ +   K  G G    + +S L+ IG HL  +SLYALG  S+ +L+PDDTL +S+D Y GD D   R KA+ +H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFISSV---KNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGY-GDSDV-LRTKAVLHHKFLHHDLTMEALSPGL

Query:  FVDKSGNFWDVPSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK G +WDVP S+ ID+ S   +SGPSYHL +HHN+GSP +  S+    PP  LLPGL++K+A S++ N ++WR    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKSGNFWDVPSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLS

Query:  AIVGAVATTYFGDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE
         I+G+V T  FG+NS+RS   +  +   G  L    + S  +AD     S +AQYG FQ+ F DLTR  AR+DF  G +FL+GA  + +DL NSR P  E
Subjt:  AIVGAVATTYFGDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE

Query:  SVNETLPNARFSLQQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET
        +  +  P    SLQQQIVGP SF+ +SG+ IDL      + V++  FA+EYALQVL SAKA+  YSPK  EFMVELRFFET
Subjt:  SVNETLPNARFSLQQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET

Arabidopsis top hitse value%identityAlignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown4.6e-6433.33Show/hide
Query:  FWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSH----GGFSLQRAFSLPFSDSGSATFLGQFNLQKFISSVKNS
        FWD +VS+P+TL+G+A  VP +  PL   R  R  R +Q+  ++       +PS + +     G FSL      P S++     +GQF  +K  + +K  
Subjt:  FWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSH----GGFSLQRAFSLPFSDSGSATFLGQFNLQKFISSVKNS

Query:  GSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDVPSSLVIDVGS
         S    E     +     H+  +SLY++G+ + + L    +L +S +  GD + LR K +L H    HDLT+EA  P LF+D  G FWDVP SL +DV S
Subjt:  GSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDVPSSLVIDVGS

Query:  AALDSGPSYHLSMHHNAGSP---SQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKK-------LKMVQPYDIFLSNPHVSLSAIVGAVATTYF
           +SG  Y   +H + G+P   +  G E     P  L+PGL  KAA S+K N ++WR   K+         +  PYD+ L  PH ++S IVG+    + 
Subjt:  AALDSGPSYHLSMHHNAGSP---SQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKK-------LKMVQPYDIFLSNPHVSLSAIVGAVATTYF

Query:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF
            +               L     RS + ADVF S  ++ Q G F + + DLTR+ AR+D       L  A  L + L    H  + + ++TL + R 
Subjt:  GDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARF

Query:  SL--QQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE
        +L  QQQ+ GPI F+ DS   +         ++E+  ++L Y+L++L S K +AWYSPK +E M+ELR FE
Subjt:  SL--QQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFE

AT3G06960.1 pigment defective 3201.9e-12949.9Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFS--------HSHGGFSLQRAFSLPFSDSGSATF
        M ++RW  +G   WDLD+STP TL+G+A  VP D LPLGL RG RLSR KQ++F  RFMA+P +PSFS           GGFSLQR  +LPFS++   + 
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFS--------HSHGGFSLQRAFSLPFSDSGSATF

Query:  LGQFNLQKFISSV---KNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGY-GDSDV-LRTKAVLHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ +   K  G G    + +S L+ IG HL  +SLYALG  S+ +L+PDDTL +S+D Y GD D   R KA+ +H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFISSV---KNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGY-GDSDV-LRTKAVLHHKFLHHDLTMEALSPGL

Query:  FVDKSGNFWDVPSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK G +WDVP S+ ID+ S   +SGPSYHL +HHN+GSP +  S+    PP  LLPGL++K+A S++ N ++WR    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKSGNFWDVPSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLS

Query:  AIVGAVATTYFGDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE
         I+G+V T  FG+NS+RS   +  +   G  L    + S  +AD     S +AQYG FQ+ F DLTR  AR+DF  G +FL+GA  + +DL NSR P  E
Subjt:  AIVGAVATTYFGDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLADVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTE

Query:  SVNETLPNARFSLQQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET
        +  +  P    SLQQQIVGP SF+ +SG+ IDL      + V++  FA+EYALQVL SAKA+  YSPK  EFMVELRFFET
Subjt:  SVNETLPNARFSLQQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYALQVLGSAKAIAWYSPKHREFMVELRFFET

AT3G06960.2 pigment defective 3209.3e-8149.84Show/hide
Query:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFS--------HSHGGFSLQRAFSLPFSDSGSATF
        M ++RW  +G   WDLD+STP TL+G+A  VP D LPLGL RG RLSR KQ++F  RFMA+P +PSFS           GGFSLQR  +LPFS++   + 
Subjt:  MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFS--------HSHGGFSLQRAFSLPFSDSGSATF

Query:  LGQFNLQKFISSV---KNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGY-GDSDV-LRTKAVLHHKFLHHDLTMEALSPGL
        LGQF++Q+F++ +   K  G G    + +S L+ IG HL  +SLYALG  S+ +L+PDDTL +S+D Y GD D   R KA+ +H+F  H+LT EA+ PGL
Subjt:  LGQFNLQKFISSV---KNSGSGKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGY-GDSDV-LRTKAVLHHKFLHHDLTMEALSPGL

Query:  FVDKSGNFWDVPSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLS
        FVDK G +WDVP S+ ID+ S   +SGPSYHL +HHN+GSP +  S+    PP  LLPGL++K+A S++ N ++WR    KL+  +PYD+FLS+PHV++S
Subjt:  FVDKSGNFWDVPSSLVIDVGSAALDSGPSYHLSMHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLS

Query:  AIVGAVA
         I+G ++
Subjt:  AIVGAVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAGCTAAGATGGACGATGGACGGGCAAAGCTTCTGGGATCTGGACGTTTCCACGCCGAGAACACTGGACGGGTCGGCCTCCCCTGTTCCCGCCGATCTTCTTCC
GTTGGGATTGTTCAGAGGCGTTAGGCTTTCCAGAGCCAAGCAGATCGATTTCATGCAGCGGTTCATGGCCGCGCCTTTCGTCCCTTCCTTCTCCCACTCCCATGGCGGCT
TCTCCCTCCAGCGCGCCTTTTCCCTCCCCTTCTCCGACTCCGGGTCTGCCACTTTCTTAGGTCAGTTCAATTTGCAGAAGTTTATCTCGTCTGTGAAGAATTCTGGTTCT
GGGAAGATGTTTGAATCGGCCTCTTCGTTTCTGCATGGAATTGGAAGCCACCTCTGCCACCGATCTTTGTATGCGCTTGGTATCTCTTCTGATGTAATGTTAACTCCAGA
TGATACGCTGTTCATTAGCTTTGATGGGTATGGCGATAGTGATGTACTTCGAACAAAAGCAGTACTCCACCACAAGTTTCTACATCATGATCTAACTATGGAAGCACTTT
CTCCAGGACTTTTTGTGGACAAATCTGGTAACTTCTGGGATGTGCCTTCTTCACTAGTCATTGATGTTGGTTCTGCTGCTTTGGACTCTGGTCCAAGTTATCACTTGTCT
ATGCACCACAATGCTGGGTCTCCCTCTCAAACTGGAAGTGAACAAACCCTTAAGCCTCCTTTCTGTTTACTTCCTGGTCTAGCAGTCAAGGCTGCCTTTTCCTTTAAGAA
GAACTTCGAAATCTGGAGAAGCAACGCCAAGAAGTTGAAGATGGTGCAACCATATGACATTTTCCTATCAAATCCTCATGTTTCGTTGTCGGCGATCGTCGGTGCTGTAG
CTACTACCTACTTTGGAGACAATTCAGTTAGATCAGCAGCACATGACAGTCTTCAGGAATTTAAAGGACTTCACCTGCAGACTTCTGGAATAAGATCTACGGTCTTAGCG
GATGTATTCGCTTCCATTTCTTTTTCAGCTCAGTACGGCATGTTTCAAAGGCCGTTTCTGGATCTTACCCGTTTGTCTGCACGTATGGATTTCCATTCTGGCTCCAAGTT
CCTTTCAGGAGCCATGCTTTTGATAGAAGATCTTTCCAATTCCCGACACCCAAGAACTGAATCCGTGAACGAGACCTTGCCGAATGCTAGATTTTCTCTTCAGCAGCAGA
TTGTTGGACCCATCAGCTTCAGAGCAGATTCAGGAGTTGCAATAGATTTGAACAAACCAAGGTGGGGCATACAAGTGGAGGAGCCTACATTTGCCTTGGAATATGCTTTG
CAGGTTCTTGGTTCGGCGAAAGCCATCGCTTGGTATTCACCGAAGCACAGAGAATTTATGGTAGAGCTTCGTTTCTTCGAGACCTGA
mRNA sequenceShow/hide mRNA sequence
GTTAAAGAAACAACCCCTATAGAAGTCTGGAAACAAAAACTTTTCAAAAAAAAGAAAAAAGAAAGAGTTTATGGGACAATTATGGTTTCTAGGGTTTGTCATCTTGATAA
TTCGAGCTGCAAAATTGGAGTGGAGTAGAAGTAGAGGATAAGGCGAGACAACGCAGTAGAAGCAAGAACAAATCTCTTACAACATCGTGTTTGGATTCAGGTTGTAGGGT
TTTGCGAGATGAAGAAGCTAAGATGGACGATGGACGGGCAAAGCTTCTGGGATCTGGACGTTTCCACGCCGAGAACACTGGACGGGTCGGCCTCCCCTGTTCCCGCCGAT
CTTCTTCCGTTGGGATTGTTCAGAGGCGTTAGGCTTTCCAGAGCCAAGCAGATCGATTTCATGCAGCGGTTCATGGCCGCGCCTTTCGTCCCTTCCTTCTCCCACTCCCA
TGGCGGCTTCTCCCTCCAGCGCGCCTTTTCCCTCCCCTTCTCCGACTCCGGGTCTGCCACTTTCTTAGGTCAGTTCAATTTGCAGAAGTTTATCTCGTCTGTGAAGAATT
CTGGTTCTGGGAAGATGTTTGAATCGGCCTCTTCGTTTCTGCATGGAATTGGAAGCCACCTCTGCCACCGATCTTTGTATGCGCTTGGTATCTCTTCTGATGTAATGTTA
ACTCCAGATGATACGCTGTTCATTAGCTTTGATGGGTATGGCGATAGTGATGTACTTCGAACAAAAGCAGTACTCCACCACAAGTTTCTACATCATGATCTAACTATGGA
AGCACTTTCTCCAGGACTTTTTGTGGACAAATCTGGTAACTTCTGGGATGTGCCTTCTTCACTAGTCATTGATGTTGGTTCTGCTGCTTTGGACTCTGGTCCAAGTTATC
ACTTGTCTATGCACCACAATGCTGGGTCTCCCTCTCAAACTGGAAGTGAACAAACCCTTAAGCCTCCTTTCTGTTTACTTCCTGGTCTAGCAGTCAAGGCTGCCTTTTCC
TTTAAGAAGAACTTCGAAATCTGGAGAAGCAACGCCAAGAAGTTGAAGATGGTGCAACCATATGACATTTTCCTATCAAATCCTCATGTTTCGTTGTCGGCGATCGTCGG
TGCTGTAGCTACTACCTACTTTGGAGACAATTCAGTTAGATCAGCAGCACATGACAGTCTTCAGGAATTTAAAGGACTTCACCTGCAGACTTCTGGAATAAGATCTACGG
TCTTAGCGGATGTATTCGCTTCCATTTCTTTTTCAGCTCAGTACGGCATGTTTCAAAGGCCGTTTCTGGATCTTACCCGTTTGTCTGCACGTATGGATTTCCATTCTGGC
TCCAAGTTCCTTTCAGGAGCCATGCTTTTGATAGAAGATCTTTCCAATTCCCGACACCCAAGAACTGAATCCGTGAACGAGACCTTGCCGAATGCTAGATTTTCTCTTCA
GCAGCAGATTGTTGGACCCATCAGCTTCAGAGCAGATTCAGGAGTTGCAATAGATTTGAACAAACCAAGGTGGGGCATACAAGTGGAGGAGCCTACATTTGCCTTGGAAT
ATGCTTTGCAGGTTCTTGGTTCGGCGAAAGCCATCGCTTGGTATTCACCGAAGCACAGAGAATTTATGGTAGAGCTTCGTTTCTTCGAGACCTGAAACCATGGTTTGTTC
GATTTTATTTTGGAGCAATACATGTTCTTTCATTTATCTCTGACAGAAACACGATCAACGTTTCTCTTTTATTTGTTGCCTGGCTAAGAGATTTTTAGTTAAATGTGGAG
GTCGAGATTCAAAATTCCGACATCGTCGAAGTATATGTCTTAACCAATCAAGTTATATTTAGGGCTAAAAGCTTAACTACGTTTGATTGGGCTATGTGAATTTAAAACAA
TCTTTAGCAACTGGAAATCAAGTTTTTTTAGCTAATTGTTTTGTATGCCATTTTTTTTTTTGGGTAAAATTTCTTAGCCAATTTGAG
Protein sequenceShow/hide protein sequence
MKKLRWTMDGQSFWDLDVSTPRTLDGSASPVPADLLPLGLFRGVRLSRAKQIDFMQRFMAAPFVPSFSHSHGGFSLQRAFSLPFSDSGSATFLGQFNLQKFISSVKNSGS
GKMFESASSFLHGIGSHLCHRSLYALGISSDVMLTPDDTLFISFDGYGDSDVLRTKAVLHHKFLHHDLTMEALSPGLFVDKSGNFWDVPSSLVIDVGSAALDSGPSYHLS
MHHNAGSPSQTGSEQTLKPPFCLLPGLAVKAAFSFKKNFEIWRSNAKKLKMVQPYDIFLSNPHVSLSAIVGAVATTYFGDNSVRSAAHDSLQEFKGLHLQTSGIRSTVLA
DVFASISFSAQYGMFQRPFLDLTRLSARMDFHSGSKFLSGAMLLIEDLSNSRHPRTESVNETLPNARFSLQQQIVGPISFRADSGVAIDLNKPRWGIQVEEPTFALEYAL
QVLGSAKAIAWYSPKHREFMVELRFFET