; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G002890 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G002890
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome locationCG_Chr05:2773082..2777590
RNA-Seq ExpressionClCG05G002890
SyntenyClCG05G002890
Gene Ontology termsGO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049109.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa]3.2e-22782.16Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MA LRTAMDSAFWD N+SSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLRL  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVAR  LDKS YTYG+CSQFSPSPFSSVYVSTE HGERKG RHKAMFY +LP HDINVDAAWPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSS+KS+SGLRYRVGLHKNGG+PRALNS NNDDPP  LMPGLCAKAAFS EK +YLWRV+E+KQD  +KT +GE     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GIVGGTF SWFGGSN V SNGDGNLTMG +KRSPLNADLFGS+CYTFQ G F K +GDLTRIDA+LDISSASGFAKRVFHGFKKSVDD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        I QQ            QV GPIVFR+DSKL+LDSASGK GPHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVEL+LFEF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

XP_004133963.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic isoform X1 [Cucumis sativus]7.3e-21678.42Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASR LRIQQ+S LGNGFPLGIIPSY PTA KELGSFSLQSLL  +P+  WW GLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLISSIKA++SA + LEL  LKD+A  FLDKSLYTYG+CSQFS  PFSSVYVSTE  GERKGHRHKAMFY +LP HDINVDAAWPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSSLKSESGLRYRVGLHKNGG+PRALNS N+DDPP  L+PGLCAKAAFS EKN+ LWR    ++++     +     +P+YDVRL EPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GI+GGT  SWFGGS+TV SNGDGNLTMG +KRSPLNADLFGSICYT+QHG+F   + DLTRIDARL ISSASGFAKRVFH FKKSVDD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        I QQ            QV GPIVFR++SKLLLDSASGK GPHVEDTI SL YSF  L S KAVFWYSPKRKEGMVEL+L+EF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

XP_008438274.1 PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo]2.2e-22882.57Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MA LRTAMDSAFWD N+SSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLRL  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVAR  LDKS YTYG+CSQFSPSPFSSVYVSTE HGERKG RHKAMFY +LP HDINVDAAWPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSS+KS+SGLRYRVGLHKNGG+PRALNS NNDDPP  LMPGLCAKAAFS EK +YLWRV+E+KQD  EKT +GE     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GIVGGTF SWFGGSNTV SNGDGNLTMG +KRSPLNADLFGS+CYTFQ G F K +GDLTRIDA+LDISSASGFAKRVFHGFKKSVDD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        I QQ            QV GPIVFR+DSKL+LDSASGK GPHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVEL+LFEF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

XP_022146920.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Momordica charantia]2.3e-23082.99Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MAYLRTAMDSAF DLN+SSPQTLAGTAKAVPG+PFPLDGARASR LR+QQISLLGNGFPLGIIPSYSPT  KELGSFSLQSLLL+LPAA+WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLISSIKAELSA DSLELPVLKDVA QFLDKSLYTYGLCSQFSPSPFSS++ STE HGE+KG RHKAMFY KLP+HDI ++AAWPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSSLKSESGLRYR GLHKNGG+PRAL+  N D+PP ALMPGLCAKAAFSFEKN+YLWRV+ERK+D+MEKTDKGE  W+ SYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GIVGGTF +WF GS T+ SNGDGN      KRSPLNADLFGSICYTFQ GRFRKQ+GDLTRIDARLDISSASGFAKRVF+ FK+S+DD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        I QQ            QV GPIVFRVDS L+LD  SG+  PHVEDTIYSLNYSFRLL+SGKAVFWYSPKRKEGMVEL+LFEF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

XP_038875801.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]2.8e-25289Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASR+LRIQQISLLGNGFPLGIIPSYSP++QKELGSFSLQSLL RLPAA+WWVGL+GQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSP+PFSSVYVSTE HGERKG RHKAMFY KLPHHDINVDAAWPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNS N++DPP ALMPGLCAKAAFSFEKN+YLWRVKERKQDL+EKTDK EWYWKPSYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GI+GGTF SWFGG++T  SNGDGNLTMG +KRSPLNADLFGSICYTFQHGRF+KQ+GDLTRIDARLDISSASGFAKRVF GFKKSVDD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        + QQ            QV GPIVFRVDS+L+LDSASGK GPH+E+TIYSLNYSFRLL+SGKAVFWYSP+RKEGMVEL+LFEF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

TrEMBL top hitse value%identityAlignment
A0A0A0L4I9 Uncharacterized protein3.5e-21678.42Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASR LRIQQ+S LGNGFPLGIIPSY PTA KELGSFSLQSLL  +P+  WW GLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLISSIKA++SA + LEL  LKD+A  FLDKSLYTYG+CSQFS  PFSSVYVSTE  GERKGHRHKAMFY +LP HDINVDAAWPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSSLKSESGLRYRVGLHKNGG+PRALNS N+DDPP  L+PGLCAKAAFS EKN+ LWR    ++++     +     +P+YDVRL EPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GI+GGT  SWFGGS+TV SNGDGNLTMG +KRSPLNADLFGSICYT+QHG+F   + DLTRIDARL ISSASGFAKRVFH FKKSVDD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        I QQ            QV GPIVFR++SKLLLDSASGK GPHVEDTI SL YSF  L S KAVFWYSPKRKEGMVEL+L+EF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

A0A1S3AWM5 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.1e-22882.57Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MA LRTAMDSAFWD N+SSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLRL  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVAR  LDKS YTYG+CSQFSPSPFSSVYVSTE HGERKG RHKAMFY +LP HDINVDAAWPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSS+KS+SGLRYRVGLHKNGG+PRALNS NNDDPP  LMPGLCAKAAFS EK +YLWRV+E+KQD  EKT +GE     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GIVGGTF SWFGGSNTV SNGDGNLTMG +KRSPLNADLFGS+CYTFQ G F K +GDLTRIDA+LDISSASGFAKRVFHGFKKSVDD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        I QQ            QV GPIVFR+DSKL+LDSASGK GPHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVEL+LFEF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

A0A5D3D2D9 Protein TRIGALACTOSYLDIACYLGLYCEROL 41.5e-22782.16Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MA LRTAMDSAFWD N+SSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLRL  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVAR  LDKS YTYG+CSQFSPSPFSSVYVSTE HGERKG RHKAMFY +LP HDINVDAAWPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSS+KS+SGLRYRVGLHKNGG+PRALNS NNDDPP  LMPGLCAKAAFS EK +YLWRV+E+KQD  +KT +GE     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GIVGGTF SWFGGSN V SNGDGNLTMG +KRSPLNADLFGS+CYTFQ G F K +GDLTRIDA+LDISSASGFAKRVFHGFKKSVDD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        I QQ            QV GPIVFR+DSKL+LDSASGK GPHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVEL+LFEF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

A0A6J1CYP7 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.1e-23082.99Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MAYLRTAMDSAF DLN+SSPQTLAGTAKAVPG+PFPLDGARASR LR+QQISLLGNGFPLGIIPSYSPT  KELGSFSLQSLLL+LPAA+WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKLISSIKAELSA DSLELPVLKDVA QFLDKSLYTYGLCSQFSPSPFSS++ STE HGE+KG RHKAMFY KLP+HDI ++AAWPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS
        PESISLDLSSLKSESGLRYR GLHKNGG+PRAL+  N D+PP ALMPGLCAKAAFSFEKN+YLWRV+ERK+D+MEKTDKGE  W+ SYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAIS

Query:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL
        GIVGGTF +WF GS T+ SNGDGN      KRSPLNADLFGSICYTFQ GRFRKQ+GDLTRIDARLDISSASGFAKRVF+ FK+S+DD ERS+SSPRLNL
Subjt:  GIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNL

Query:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        I QQ            QV GPIVFRVDS L+LD  SG+  PHVEDTIYSLNYSFRLL+SGKAVFWYSPKRKEGMVEL+LFEF
Subjt:  ILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

A0A6J1IIJ0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X16.9e-21276.03Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MA+LRTAMDSAFW+ ++SS QTL GTAKAVPGEPFPLDGARASR LRIQQ+S LGNGFPLGI+PS+SPTA KELGSFSLQSLLL+ PAA+WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAEL-SAADSLE-LPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYW
        PKK+IS+IK +L S  D+LE LP LKDVA  FLDK+LY+YGLCSQFSP+PFSSV+ STE HG+RKG RHKAMFY +LPHHDIN++AAWPELFIDHKGQYW
Subjt:  PKKLISSIKAEL-SAADSLE-LPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYW

Query:  DVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAA
        +VPES+SLDLSSLKSESGLRYRVGLHKNGG+PRAL   +  +PP  LMPGLCAKAAFS EKN+YLW  KE+KQ L E TD+ E    PSYDVRLK+PHAA
Subjt:  DVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAA

Query:  ISGIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRL
        ISGIVGGTF SWFGGS+TV +NGDGNL +   KRSPLNADLFGS+CYT+QHG FRK + DLTR+DARLDISS S FAKRVF+GFKKS+DD ERS+S+PRL
Subjt:  ISGIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRL

Query:  NLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        NLI QQ            Q+ GPIVFRVDS+L+L S S K GPHVEDTI SLNYSF+LL SGKAVFW+SPKRKEGMVEL+LFEF
Subjt:  NLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

SwissProt top hitse value%identityAlignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.2e-7535.91Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAANWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + NW V L+
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAANWWVGLV

Query:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGH-GE-RKGHRHKAMFYQKLPHHDINVDAAWPELFI
        GQF  ++ ++ I   KA    + S     L  + +   DKSLY  G CS+F  SP  ++ +S + + G+  K  R KA+F  + P H++  +A WP LF+
Subjt:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGH-GE-RKGHRHKAMFYQKLPHHDINVDAAWPELFI

Query:  DHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVR
        D  G+YWDVP S+++DL+SL +ESG  Y + LH N G P+ L+S   + PPP+L+PGL  K+A S+  N  LWR    K +            KP YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVR

Query:  LKEPHAAISGIVGGTFRSWFGGSNTVR------SNGDGNLTMG-QRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLD-------ISSASGFAKR
        L  PH A+SGI+G    + F G N++R      S G G  ++      S   AD  G    T Q+G F+K + DLTR  ARLD       ++ A+  A+ 
Subjt:  LKEPHAAISGIVGGTFRSWFGGSNTVR------SNGDGNLTMG-QRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLD-------ISSASGFAKR

Query:  VFHGFKKSVDDHERSQSSPRLNLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVEL
        + +  + S++  ++    P + + LQQ            Q+VGP  F+V+S + +D  +G     V+ T++++ Y+ ++L S KAV  YSPK+ E MVEL
Subjt:  VFHGFKKSVDDHERSQSSPRLNLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVEL

Query:  QLFE
        + FE
Subjt:  QLFE

Arabidopsis top hitse value%identityAlignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown6.5e-14655.67Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR
        MA L +A+DS FWD N+SSPQTL GTA++VPGEPFPLDGARASR+ RIQQ+SLL  GFPLGIIPS +P + K LGSFSL SLLL   + NWW+GLVGQF+
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV
        PKKL + IKA++S A+  +L V+KD A+  +DKSLY+ GL +Q +    SS+ +STE  G++ G R+K M    L  HD+ V+AAWP+LF+D+KG++WDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSA---NNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHA
        PES+++D+SSL  ESG+RYR GLHK+ G P+ +N+A   +  D P +LMPGLCAKAA S++ N+ LWR +E K+   E+ DK  +     YD+RLKEPHA
Subjt:  PESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSA---NNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHA

Query:  AISGIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPR
        AISGIVG +  +W          G G L  G +KRSP++AD+FGS CYTFQ GRF K YGDLTR+DAR+D+ SA   AK++FH    + DD   +  SPR
Subjt:  AISGIVGGTFRSWFGGSNTVRSNGDGNLTMGQRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPR

Query:  LNLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF
        LNLI QQ            QV GPIVF+VDS+  + +A       +ED IYSLNYS RLL SGK V WYSPKRKEGM+EL++FEF
Subjt:  LNLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF

AT3G06960.1 pigment defective 3208.4e-7735.91Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAANWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + NW V L+
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAANWWVGLV

Query:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGH-GE-RKGHRHKAMFYQKLPHHDINVDAAWPELFI
        GQF  ++ ++ I   KA    + S     L  + +   DKSLY  G CS+F  SP  ++ +S + + G+  K  R KA+F  + P H++  +A WP LF+
Subjt:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGH-GE-RKGHRHKAMFYQKLPHHDINVDAAWPELFI

Query:  DHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVR
        D  G+YWDVP S+++DL+SL +ESG  Y + LH N G P+ L+S   + PPP+L+PGL  K+A S+  N  LWR    K +            KP YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVR

Query:  LKEPHAAISGIVGGTFRSWFGGSNTVR------SNGDGNLTMG-QRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLD-------ISSASGFAKR
        L  PH A+SGI+G    + F G N++R      S G G  ++      S   AD  G    T Q+G F+K + DLTR  ARLD       ++ A+  A+ 
Subjt:  LKEPHAAISGIVGGTFRSWFGGSNTVR------SNGDGNLTMG-QRKRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLD-------ISSASGFAKR

Query:  VFHGFKKSVDDHERSQSSPRLNLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVEL
        + +  + S++  ++    P + + LQQ            Q+VGP  F+V+S + +D  +G     V+ T++++ Y+ ++L S KAV  YSPK+ E MVEL
Subjt:  VFHGFKKSVDDHERSQSSPRLNLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRGPHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVEL

Query:  QLFE
        + FE
Subjt:  QLFE

AT3G06960.2 pigment defective 3204.5e-5438.98Show/hide
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAANWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + NW V L+
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAANWWVGLV

Query:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGH-GE-RKGHRHKAMFYQKLPHHDINVDAAWPELFI
        GQF  ++ ++ I   KA    + S     L  + +   DKSLY  G CS+F  SP  ++ +S + + G+  K  R KA+F  + P H++  +A WP LF+
Subjt:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGH-GE-RKGHRHKAMFYQKLPHHDINVDAAWPELFI

Query:  DHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVR
        D  G+YWDVP S+++DL+SL +ESG  Y + LH N G P+ L+S   + PPP+L+PGL  K+A S+  N  LWR    K +            KP YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSESGLRYRVGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVR

Query:  LKEPHAAISGIVG
        L  PH A+SGI+G
Subjt:  LKEPHAAISGIVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTACCTCAGGACTGCCATGGATTCCGCCTTCTGGGACTTGAACATTTCTTCTCCTCAAACCCTCGCCGGAACCGCCAAGGCCGTCCCCGGCGAGCCATTTCCCCT
CGATGGAGCTCGAGCCAGCCGCGCCCTGCGGATTCAGCAAATCTCCCTCCTCGGCAATGGATTTCCACTCGGAATTATTCCTTCCTACTCCCCCACTGCTCAGAAGGAGT
TAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAGGTTGCCCGCCGCCAATTGGTGGGTTGGATTGGTTGGCCAATTTCGTCCAAAGAAACTGATATCTTCAATAAAAGCC
GAACTTTCTGCTGCGGACAGCCTTGAGCTCCCTGTCTTGAAAGACGTTGCTAGACAGTTTCTGGACAAGTCGCTCTATACATATGGACTATGCTCTCAGTTTTCTCCTAG
TCCATTTTCATCTGTATATGTCAGCACAGAAGGGCATGGTGAGAGGAAAGGGCATCGCCACAAGGCGATGTTTTATCAAAAGCTTCCTCATCATGATATAAACGTGGATG
CGGCTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGACGTGCCCGAGTCTATATCTTTGGATCTTTCATCTCTTAAGTCTGAATCTGGGTTGCGATACCGA
GTTGGGTTGCATAAGAATGGTGGCATTCCTCGGGCTCTTAATTCTGCCAATAATGATGACCCGCCTCCTGCTCTTATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTT
TGAAAAGAACAAGTACCTTTGGAGGGTAAAAGAAAGGAAGCAAGATTTGATGGAGAAGACAGACAAGGGGGAATGGTATTGGAAGCCATCATATGACGTGCGCCTTAAAG
AACCTCATGCTGCCATATCTGGAATCGTCGGTGGCACCTTTCGCTCTTGGTTTGGAGGCAGCAACACGGTCAGGTCCAATGGAGATGGAAACTTAACTATGGGTCAGAGG
AAAAGAAGTCCATTGAATGCTGATCTTTTTGGCTCAATTTGCTATACTTTCCAACATGGGAGATTTAGAAAGCAATATGGTGACCTAACGAGGATAGATGCTCGGTTAGA
TATTTCATCGGCTTCAGGGTTTGCCAAAAGAGTTTTTCATGGTTTCAAGAAATCTGTTGATGATCATGAGAGATCACAATCTTCCCCCAGACTTAATTTGATCCTTCAAC
AACAGGTAAAAGTACCTTTTAGCTTTGGGATTCCAATGCAGGTCGTTGGCCCGATTGTCTTCCGTGTAGATTCCAAGCTTTTGCTCGATTCTGCGTCTGGCAAGCGTGGT
CCCCATGTTGAGGACACAATTTACAGCTTAAATTATTCATTTAGGCTTCTTCGATCAGGCAAAGCTGTTTTCTGGTATTCTCCCAAAAGGAAAGAGGGAATGGTCGAGTT
GCAACTGTTTGAGTTTTGA
mRNA sequenceShow/hide mRNA sequence
GTTCCAAAAAATGCAAGAGACAGCAACAGATAAGGGGAGAAGGTAAACCCACAGCCAGCCTACGCTCTTCCCAATTCTTTCTGAGCTTCCAAGAAACGCATCAATGGCGT
ACCTCAGGACTGCCATGGATTCCGCCTTCTGGGACTTGAACATTTCTTCTCCTCAAACCCTCGCCGGAACCGCCAAGGCCGTCCCCGGCGAGCCATTTCCCCTCGATGGA
GCTCGAGCCAGCCGCGCCCTGCGGATTCAGCAAATCTCCCTCCTCGGCAATGGATTTCCACTCGGAATTATTCCTTCCTACTCCCCCACTGCTCAGAAGGAGTTAGGTTC
CTTTTCTCTTCAGTCGCTCTTGCTCAGGTTGCCCGCCGCCAATTGGTGGGTTGGATTGGTTGGCCAATTTCGTCCAAAGAAACTGATATCTTCAATAAAAGCCGAACTTT
CTGCTGCGGACAGCCTTGAGCTCCCTGTCTTGAAAGACGTTGCTAGACAGTTTCTGGACAAGTCGCTCTATACATATGGACTATGCTCTCAGTTTTCTCCTAGTCCATTT
TCATCTGTATATGTCAGCACAGAAGGGCATGGTGAGAGGAAAGGGCATCGCCACAAGGCGATGTTTTATCAAAAGCTTCCTCATCATGATATAAACGTGGATGCGGCTTG
GCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGACGTGCCCGAGTCTATATCTTTGGATCTTTCATCTCTTAAGTCTGAATCTGGGTTGCGATACCGAGTTGGGT
TGCATAAGAATGGTGGCATTCCTCGGGCTCTTAATTCTGCCAATAATGATGACCCGCCTCCTGCTCTTATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTTTGAAAAG
AACAAGTACCTTTGGAGGGTAAAAGAAAGGAAGCAAGATTTGATGGAGAAGACAGACAAGGGGGAATGGTATTGGAAGCCATCATATGACGTGCGCCTTAAAGAACCTCA
TGCTGCCATATCTGGAATCGTCGGTGGCACCTTTCGCTCTTGGTTTGGAGGCAGCAACACGGTCAGGTCCAATGGAGATGGAAACTTAACTATGGGTCAGAGGAAAAGAA
GTCCATTGAATGCTGATCTTTTTGGCTCAATTTGCTATACTTTCCAACATGGGAGATTTAGAAAGCAATATGGTGACCTAACGAGGATAGATGCTCGGTTAGATATTTCA
TCGGCTTCAGGGTTTGCCAAAAGAGTTTTTCATGGTTTCAAGAAATCTGTTGATGATCATGAGAGATCACAATCTTCCCCCAGACTTAATTTGATCCTTCAACAACAGGT
AAAAGTACCTTTTAGCTTTGGGATTCCAATGCAGGTCGTTGGCCCGATTGTCTTCCGTGTAGATTCCAAGCTTTTGCTCGATTCTGCGTCTGGCAAGCGTGGTCCCCATG
TTGAGGACACAATTTACAGCTTAAATTATTCATTTAGGCTTCTTCGATCAGGCAAAGCTGTTTTCTGGTATTCTCCCAAAAGGAAAGAGGGAATGGTCGAGTTGCAACTG
TTTGAGTTTTGACTTCGACATCATTAGATTCGGTTAGAGTTCAGTTGATGCATTCAATTCTTAGCTTTTTGACAACGAAATCGGCTTATATAGAGTTAGTTTAGCACTTG
ATGAGGCCTTTCTTCCCATCTATTTAGTATTGCACAACTCTTGCAGATGTAAAATATACTAGTGCTGTACTTGTTTATGTCGAACATTTGTTGCTCAAAAAATAAGGGCA
TTTGAAATCATTGAATCTGCAGCAAAACTTGG
Protein sequenceShow/hide protein sequence
MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAANWWVGLVGQFRPKKLISSIKA
ELSAADSLELPVLKDVARQFLDKSLYTYGLCSQFSPSPFSSVYVSTEGHGERKGHRHKAMFYQKLPHHDINVDAAWPELFIDHKGQYWDVPESISLDLSSLKSESGLRYR
VGLHKNGGIPRALNSANNDDPPPALMPGLCAKAAFSFEKNKYLWRVKERKQDLMEKTDKGEWYWKPSYDVRLKEPHAAISGIVGGTFRSWFGGSNTVRSNGDGNLTMGQR
KRSPLNADLFGSICYTFQHGRFRKQYGDLTRIDARLDISSASGFAKRVFHGFKKSVDDHERSQSSPRLNLILQQQVKVPFSFGIPMQVVGPIVFRVDSKLLLDSASGKRG
PHVEDTIYSLNYSFRLLRSGKAVFWYSPKRKEGMVELQLFEF