; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005665 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005665
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome locationscaffold254:639868..642485
RNA-Seq ExpressionMS005665
SyntenyMS005665
Gene Ontology termsGO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR022244 - Protein of unknown function DUF3769
IPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049109.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa]6.4e-21779.79Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MA LRTAMDSAF D NLSSPQTLAGTAK+VPG+PFPL+GARASR LR+QQ+SLLG+GFPLGIIPSYSPT HKELGSFSLQSLLL+L  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVA   LDKS YTYG+CSQFSPSPFSS++ STE+HGE+KGRRHKAMFYH+LP HDI ++AAWPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
        PESISLDLSS+KS+SGLRYR GLHKNGG+PRAL+ TN D+PPL LMPGLCAKAAFS EK RYLWRV+E+K+D  +KT +GE    +SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL
        GIVGGTFS+WF GS  +GSNGDGN      KRSPLNADLFGS+CYTFQ+G F K FGDLTRIDA+LDISSASGFAKRVF+ FK+S+DDLERSKSSPRLNL
Subjt:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL

Query:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        IFQQQVAGPIVFR+DS LMLD  SGK  PHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

XP_008438274.1 PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo]4.4e-21880.21Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MA LRTAMDSAF D NLSSPQTLAGTAK+VPG+PFPL+GARASR LR+QQ+SLLG+GFPLGIIPSYSPT HKELGSFSLQSLLL+L  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVA   LDKS YTYG+CSQFSPSPFSS++ STE+HGE+KGRRHKAMFYH+LP HDI ++AAWPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
        PESISLDLSS+KS+SGLRYR GLHKNGG+PRAL+ TN D+PPL LMPGLCAKAAFS EK RYLWRV+E+K+D  EKT +GE    +SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL
        GIVGGTFS+WF GS T+GSNGDGN      KRSPLNADLFGS+CYTFQ+G F K FGDLTRIDA+LDISSASGFAKRVF+ FK+S+DDLERSKSSPRLNL
Subjt:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL

Query:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        IFQQQVAGPIVFR+DS LMLD  SGK  PHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

XP_022146920.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Momordica charantia]2.9e-27099.78Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
        PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
Subjt:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
        PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFRGSGTIGSNGDGNKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQV
        GIVGGTFSTWFRGSGTIGSNGDGNKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQV
Subjt:  GIVGGTFSTWFRGSGTIGSNGDGNKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQV

Query:  AGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        AGPIVFRVDSSLMLDPPSG+YRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
Subjt:  AGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

XP_022974759.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X1 [Cucurbita maxima]2.8e-21278.56Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MA+LRTAMDSAF + ++SS QTL GTAKAVPG+PFPLDGARASRTLR+QQ+S LGNGFPLGI+PS+SPT HKELGSFSLQSLLLK PAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAEL-SAVDSLE-LPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYW
        PKK+IS+IK +L S +D+LE LP LKDVA  FLDK+LY+YGLCSQFSP+PFSS+F STEEHG++KGRRHKAMFYH+LP+HDI LEAAWPELF+DHKGQYW
Subjt:  PKKLISSIKAEL-SAVDSLE-LPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYW

Query:  DVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAA
        +VPES+SLDLSSLKSESGLRYR GLHKNGG+PRAL  T+G  PPL LMPGLCAKAAFS EKNRYLW  +E+K+ + E TD+ E     SYDVRLK+PHAA
Subjt:  DVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAA

Query:  ISGIVGGTFSTWFRGSGTIGSNGDG-----NKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLN
        ISGIVGGTFS+WF GS T+G+NGDG     NKRSPLNADLFGS+CYT+Q G FRK F DLTR+DARLDISS S FAKRVFN FK+SIDDLERSKS+PRLN
Subjt:  ISGIVGGTFSTWFRGSGTIGSNGDG-----NKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLN

Query:  LIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVFRVDS LML   S K+ PHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  LIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

XP_038875801.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]5.2e-23585.32Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MAYLRTAMDSAF DLN+SSPQTLAGTAKAVPG+PFPLDGARASR+LR+QQISLLGNGFPLGIIPSYSP+  KELGSFSLQSLL +LPAADWWVGL+GQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
        PKKLISSIKAELSA DSLELPVLKDVA QFLDKSLYTYGLCSQFSP+PFSS++ STE HGE+KG RHKAMFYHKLP+HDI ++AAWPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
        PESISLDLSSLKSESGLRYR GLHKNGG+PRAL+ TN ++PPLALMPGLCAKAAFSFEKNRYLWRV+ERK+D++EKTDK E  W+ SYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL
        GI+GGTFS+WF G+ T GSNGDGN      KRSPLNADLFGSICYTFQ GRF+KQFGDLTRIDARLDISSASGFAKRVF  FK+S+DDLERSKSSPRLNL
Subjt:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL

Query:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        +FQQQVAGPIVFRVDS LMLD  SGK+ PH+E+TIYSLNYSFRLL+SGKAVFWYSP+RKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

TrEMBL top hitse value%identityAlignment
A0A1S3AWM5 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.1e-21880.21Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MA LRTAMDSAF D NLSSPQTLAGTAK+VPG+PFPL+GARASR LR+QQ+SLLG+GFPLGIIPSYSPT HKELGSFSLQSLLL+L  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVA   LDKS YTYG+CSQFSPSPFSS++ STE+HGE+KGRRHKAMFYH+LP HDI ++AAWPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
        PESISLDLSS+KS+SGLRYR GLHKNGG+PRAL+ TN D+PPL LMPGLCAKAAFS EK RYLWRV+E+K+D  EKT +GE    +SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL
        GIVGGTFS+WF GS T+GSNGDGN      KRSPLNADLFGS+CYTFQ+G F K FGDLTRIDA+LDISSASGFAKRVF+ FK+S+DDLERSKSSPRLNL
Subjt:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL

Query:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        IFQQQVAGPIVFR+DS LMLD  SGK  PHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

A0A5D3D2D9 Protein TRIGALACTOSYLDIACYLGLYCEROL 43.1e-21779.79Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MA LRTAMDSAF D NLSSPQTLAGTAK+VPG+PFPL+GARASR LR+QQ+SLLG+GFPLGIIPSYSPT HKELGSFSLQSLLL+L  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVA   LDKS YTYG+CSQFSPSPFSS++ STE+HGE+KGRRHKAMFYH+LP HDI ++AAWPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
        PESISLDLSS+KS+SGLRYR GLHKNGG+PRAL+ TN D+PPL LMPGLCAKAAFS EK RYLWRV+E+K+D  +KT +GE    +SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL
        GIVGGTFS+WF GS  +GSNGDGN      KRSPLNADLFGS+CYTFQ+G F K FGDLTRIDA+LDISSASGFAKRVF+ FK+S+DDLERSKSSPRLNL
Subjt:  GIVGGTFSTWFRGSGTIGSNGDGN------KRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNL

Query:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        IFQQQVAGPIVFR+DS LMLD  SGK  PHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1CYP7 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.4e-27099.78Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
        PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
Subjt:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
        PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFRGSGTIGSNGDGNKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQV
        GIVGGTFSTWFRGSGTIGSNGDGNKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQV
Subjt:  GIVGGTFSTWFRGSGTIGSNGDGNKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQV

Query:  AGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        AGPIVFRVDSSLMLDPPSG+YRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
Subjt:  AGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1FCB0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.9e-21178.56Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MA+LRTAMDSAF D ++SS QTL GTAKAVPG PFPLDGARASRTLR+QQ+S LGNGFPLGI+PS+SPT HKELGSFSLQSLLLK PAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAE-LSAVDSLE-LPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYW
        PKK+ISSIK + +S +D+LE LP LKDVA   LDK+LY+YGLCSQFSP+PFSS+F STEEHG++KGRRHKAMFYH+LP+HDI LEAAWPELF+DHKGQYW
Subjt:  PKKLISSIKAE-LSAVDSLE-LPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYW

Query:  DVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAA
        +VPES+SLDLSSLKSESGLRYR GLHKNGG+PRAL  T+G +PPL LMPGLCAKAAFS EKNRYLW  +E+K+ + E TD+ E     SYDVRLK+PHAA
Subjt:  DVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAA

Query:  ISGIVGGTFSTWFRGSGTIGSNGDG-----NKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLN
        ISGIVGGTFS WF GS T+G+NGDG     NKRSPLNADLFGS+CYT+Q G FRK F DLTR+DARLDISS S FAKRVFN FK+SIDDLERSKS+PRLN
Subjt:  ISGIVGGTFSTWFRGSGTIGSNGDG-----NKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLN

Query:  LIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVFRVDS LML   S K  PHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  LIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1IIJ0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X11.3e-21278.56Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MA+LRTAMDSAF + ++SS QTL GTAKAVPG+PFPLDGARASRTLR+QQ+S LGNGFPLGI+PS+SPT HKELGSFSLQSLLLK PAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAEL-SAVDSLE-LPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYW
        PKK+IS+IK +L S +D+LE LP LKDVA  FLDK+LY+YGLCSQFSP+PFSS+F STEEHG++KGRRHKAMFYH+LP+HDI LEAAWPELF+DHKGQYW
Subjt:  PKKLISSIKAEL-SAVDSLE-LPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYW

Query:  DVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAA
        +VPES+SLDLSSLKSESGLRYR GLHKNGG+PRAL  T+G  PPL LMPGLCAKAAFS EKNRYLW  +E+K+ + E TD+ E     SYDVRLK+PHAA
Subjt:  DVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAA

Query:  ISGIVGGTFSTWFRGSGTIGSNGDG-----NKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLN
        ISGIVGGTFS+WF GS T+G+NGDG     NKRSPLNADLFGS+CYT+Q G FRK F DLTR+DARLDISS S FAKRVFN FK+SIDDLERSKS+PRLN
Subjt:  ISGIVGGTFSTWFRGSGTIGSNGDG-----NKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLN

Query:  LIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVFRVDS LML   S K+ PHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  LIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

SwissProt top hitse value%identityAlignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.4e-7336.38Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSP----TGHKELGSFSLQSLLLKLPAADWWVGLV
        M  +R   +    DL++S+P TL GTA+AVP DP PL  +R +R  R +Q+          +IPS+SP    TG    G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSP----TGHKELGSFSLQSLLLKLPAADWWVGLV

Query:  GQFRPKKLISSI---KAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEH-GE-KKGRRHKAMFYHKLPNHDIFLEAAWPELFL
        GQF  ++ ++ I   KA      S     L  +     DKSLY  G CS+F  SP  +L  S + + G+  K  R KA+F H+ P H++  EA WP LF+
Subjt:  GQFRPKKLISSI---KAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEH-GE-KKGRRHKAMFYHKLPNHDIFLEAAWPELFL

Query:  DHKGQYWDVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVR
        D  G+YWDVP S+++DL+SL +ESG  Y   LH N G P+ L     + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVR

Query:  LKEPHAAISGIVGGTFSTWFRGSGTIGSNGDGNK-------------RSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLD-------ISSASGFAKR
        L  PH A+SGI+G   +  F G  +I S  + +               S   AD  G    T Q G F+K F DLTR  ARLD       ++ A+  A+ 
Subjt:  LKEPHAAISGIVGGTFSTWFRGSGTIGSNGDGNK-------------RSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLD-------ISSASGFAKR

Query:  VFNCFKRSIDDLERSKSSPRLNLIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFE
        + N  + S++  +  K  P + +  QQQ+ GP  F+V+S + +D  +G     V+ T++++ Y+ ++L S KAV  YSPK+ E MVELR FE
Subjt:  VFNCFKRSIDDLERSKSSPRLNLIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFE

Arabidopsis top hitse value%identityAlignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown3.3e-14756.96Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR
        MA L +A+DS F D N+SSPQTL GTA++VPG+PFPLDGARASR+ R+QQ+SLL  GFPLGIIPS +P   K LGSFSL SLLL   + +WW+GLVGQF+
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV
        PKKL + IKA++S  +  +L V+KD A   +DKSLY+ GL +Q +    SSL  STE  G+K G R+K M  H L  HD+ +EAAWP+LFLD+KG++WDV
Subjt:  PKKLISSIKAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDV

Query:  PESISLDLSSLKSESGLRYRAGLHKNGGLPR---ALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHA
        PES+++D+SSL  ESG+RYR GLHK+ G P+   A    +G + P +LMPGLCAKAA S++ NR LWR QE++ +    T++ ++     YD+RLKEPHA
Subjt:  PESISLDLSSLKSESGLRYRAGLHKNGGLPR---ALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHA

Query:  AISGIVGGTFSTWFRGSGTIGSNGDGNKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQ
        AISGIVG + + W  G G +    +G KRSP++AD+FGS CYTFQ+GRF K +GDLTR+DAR+D+ SA   AK++F+    + DD   +  SPRLNLIFQ
Subjt:  AISGIVGGTFSTWFRGSGTIGSNGDGNKRSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQ

Query:  QQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF
        QQVAGPIVF+VDS   +          +ED IYSLNYS RLL+SGK V WYSPKRKEGM+ELR+FEF
Subjt:  QQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFEF

AT3G06960.1 pigment defective 3201.7e-7436.38Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSP----TGHKELGSFSLQSLLLKLPAADWWVGLV
        M  +R   +    DL++S+P TL GTA+AVP DP PL  +R +R  R +Q+          +IPS+SP    TG    G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSP----TGHKELGSFSLQSLLLKLPAADWWVGLV

Query:  GQFRPKKLISSI---KAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEH-GE-KKGRRHKAMFYHKLPNHDIFLEAAWPELFL
        GQF  ++ ++ I   KA      S     L  +     DKSLY  G CS+F  SP  +L  S + + G+  K  R KA+F H+ P H++  EA WP LF+
Subjt:  GQFRPKKLISSI---KAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEH-GE-KKGRRHKAMFYHKLPNHDIFLEAAWPELFL

Query:  DHKGQYWDVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVR
        D  G+YWDVP S+++DL+SL +ESG  Y   LH N G P+ L     + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVR

Query:  LKEPHAAISGIVGGTFSTWFRGSGTIGSNGDGNK-------------RSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLD-------ISSASGFAKR
        L  PH A+SGI+G   +  F G  +I S  + +               S   AD  G    T Q G F+K F DLTR  ARLD       ++ A+  A+ 
Subjt:  LKEPHAAISGIVGGTFSTWFRGSGTIGSNGDGNK-------------RSPLNADLFGSICYTFQQGRFRKQFGDLTRIDARLD-------ISSASGFAKR

Query:  VFNCFKRSIDDLERSKSSPRLNLIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFE
        + N  + S++  +  K  P + +  QQQ+ GP  F+V+S + +D  +G     V+ T++++ Y+ ++L S KAV  YSPK+ E MVELR FE
Subjt:  VFNCFKRSIDDLERSKSSPRLNLIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLKSGKAVFWYSPKRKEGMVELRLFE

AT3G06960.2 pigment defective 3202.2e-5038.66Show/hide
Query:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSP----TGHKELGSFSLQSLLLKLPAADWWVGLV
        M  +R   +    DL++S+P TL GTA+AVP DP PL  +R +R  R +Q+          +IPS+SP    TG    G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSP----TGHKELGSFSLQSLLLKLPAADWWVGLV

Query:  GQFRPKKLISSI---KAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEH-GE-KKGRRHKAMFYHKLPNHDIFLEAAWPELFL
        GQF  ++ ++ I   KA      S     L  +     DKSLY  G CS+F  SP  +L  S + + G+  K  R KA+F H+ P H++  EA WP LF+
Subjt:  GQFRPKKLISSI---KAELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEH-GE-KKGRRHKAMFYHKLPNHDIFLEAAWPELFL

Query:  DHKGQYWDVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVR
        D  G+YWDVP S+++DL+SL +ESG  Y   LH N G P+ L     + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSESGLRYRAGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVR

Query:  LKEPHAAISGIVG
        L  PH A+SGI+G
Subjt:  LKEPHAAISGIVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTACCTCAGGACCGCCATGGATTCTGCCTTTGGAGATTTGAACCTTTCATCCCCTCAAACCCTCGCCGGCACCGCCAAGGCCGTCCCCGGCGACCCATTCCCCCT
CGACGGAGCTCGGGCCAGCCGCACACTGCGGGTTCAGCAAATCTCCCTCCTCGGCAATGGATTTCCCCTCGGCATTATTCCTTCTTACTCGCCCACCGGACACAAGGAGT
TAGGTTCATTTTCTCTTCAGTCGCTCTTGCTCAAGTTGCCTGCCGCTGATTGGTGGGTTGGGTTAGTCGGCCAATTCCGTCCGAAGAAACTAATATCTTCTATAAAAGCC
GAACTTTCTGCTGTAGATAGCCTTGAGCTCCCTGTCTTGAAAGATGTTGCTCTACAGTTTCTGGACAAGTCACTCTATACATATGGTTTATGCTCTCAGTTTTCTCCTAG
TCCCTTTTCGTCTCTATTTTTCAGCACTGAAGAGCATGGTGAGAAGAAGGGACGTCGCCACAAAGCAATGTTTTATCACAAGCTTCCAAATCATGACATTTTTCTGGAAG
CAGCTTGGCCTGAGCTCTTCCTTGATCATAAAGGGCAATATTGGGATGTACCCGAGTCTATATCTTTGGATCTTTCATCTCTCAAGTCAGAATCTGGTTTGCGATACCGA
GCTGGGCTGCATAAGAATGGTGGCCTTCCCCGGGCTCTTAGTCCTACCAATGGCGACAACCCACCTCTTGCTCTCATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTT
TGAAAAGAACAGGTACCTTTGGAGGGTACAAGAACGGAAGGAAGACATGATGGAGAAGACGGACAAGGGGGAACAGTCTTGGAGGTCATCATACGATGTGCGTCTTAAAG
AACCTCATGCAGCCATATCTGGAATTGTTGGTGGCACCTTTAGCACCTGGTTCAGAGGCAGCGGCACGATTGGGTCCAATGGAGATGGAAACAAAAGAAGTCCACTGAAC
GCTGATCTTTTTGGCTCAATTTGCTATACTTTCCAACAAGGGAGATTTAGAAAGCAATTTGGTGACCTTACAAGGATAGATGCTCGGTTAGACATTTCGTCGGCTTCAGG
GTTTGCCAAAAGAGTTTTTAATTGTTTCAAGAGATCCATTGATGATCTAGAGCGATCGAAATCTTCCCCCCGTCTCAATTTGATCTTTCAACAGCAGGTTGCTGGCCCGA
TTGTCTTCCGTGTAGATTCCAGCCTTATGCTCGATCCTCCCTCTGGCAAGTACCGTCCCCATGTCGAGGACACAATATACAGCCTGAATTATTCATTTAGGCTTCTTAAA
TCAGGCAAAGCCGTTTTCTGGTATTCTCCCAAAAGGAAAGAGGGGATGGTCGAGTTGCGCCTGTTTGAGTTT
mRNA sequenceShow/hide mRNA sequence
ATGGCGTACCTCAGGACCGCCATGGATTCTGCCTTTGGAGATTTGAACCTTTCATCCCCTCAAACCCTCGCCGGCACCGCCAAGGCCGTCCCCGGCGACCCATTCCCCCT
CGACGGAGCTCGGGCCAGCCGCACACTGCGGGTTCAGCAAATCTCCCTCCTCGGCAATGGATTTCCCCTCGGCATTATTCCTTCTTACTCGCCCACCGGACACAAGGAGT
TAGGTTCATTTTCTCTTCAGTCGCTCTTGCTCAAGTTGCCTGCCGCTGATTGGTGGGTTGGGTTAGTCGGCCAATTCCGTCCGAAGAAACTAATATCTTCTATAAAAGCC
GAACTTTCTGCTGTAGATAGCCTTGAGCTCCCTGTCTTGAAAGATGTTGCTCTACAGTTTCTGGACAAGTCACTCTATACATATGGTTTATGCTCTCAGTTTTCTCCTAG
TCCCTTTTCGTCTCTATTTTTCAGCACTGAAGAGCATGGTGAGAAGAAGGGACGTCGCCACAAAGCAATGTTTTATCACAAGCTTCCAAATCATGACATTTTTCTGGAAG
CAGCTTGGCCTGAGCTCTTCCTTGATCATAAAGGGCAATATTGGGATGTACCCGAGTCTATATCTTTGGATCTTTCATCTCTCAAGTCAGAATCTGGTTTGCGATACCGA
GCTGGGCTGCATAAGAATGGTGGCCTTCCCCGGGCTCTTAGTCCTACCAATGGCGACAACCCACCTCTTGCTCTCATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTT
TGAAAAGAACAGGTACCTTTGGAGGGTACAAGAACGGAAGGAAGACATGATGGAGAAGACGGACAAGGGGGAACAGTCTTGGAGGTCATCATACGATGTGCGTCTTAAAG
AACCTCATGCAGCCATATCTGGAATTGTTGGTGGCACCTTTAGCACCTGGTTCAGAGGCAGCGGCACGATTGGGTCCAATGGAGATGGAAACAAAAGAAGTCCACTGAAC
GCTGATCTTTTTGGCTCAATTTGCTATACTTTCCAACAAGGGAGATTTAGAAAGCAATTTGGTGACCTTACAAGGATAGATGCTCGGTTAGACATTTCGTCGGCTTCAGG
GTTTGCCAAAAGAGTTTTTAATTGTTTCAAGAGATCCATTGATGATCTAGAGCGATCGAAATCTTCCCCCCGTCTCAATTTGATCTTTCAACAGCAGGTTGCTGGCCCGA
TTGTCTTCCGTGTAGATTCCAGCCTTATGCTCGATCCTCCCTCTGGCAAGTACCGTCCCCATGTCGAGGACACAATATACAGCCTGAATTATTCATTTAGGCTTCTTAAA
TCAGGCAAAGCCGTTTTCTGGTATTCTCCCAAAAGGAAAGAGGGGATGGTCGAGTTGCGCCTGTTTGAGTTT
Protein sequenceShow/hide protein sequence
MAYLRTAMDSAFGDLNLSSPQTLAGTAKAVPGDPFPLDGARASRTLRVQQISLLGNGFPLGIIPSYSPTGHKELGSFSLQSLLLKLPAADWWVGLVGQFRPKKLISSIKA
ELSAVDSLELPVLKDVALQFLDKSLYTYGLCSQFSPSPFSSLFFSTEEHGEKKGRRHKAMFYHKLPNHDIFLEAAWPELFLDHKGQYWDVPESISLDLSSLKSESGLRYR
AGLHKNGGLPRALSPTNGDNPPLALMPGLCAKAAFSFEKNRYLWRVQERKEDMMEKTDKGEQSWRSSYDVRLKEPHAAISGIVGGTFSTWFRGSGTIGSNGDGNKRSPLN
ADLFGSICYTFQQGRFRKQFGDLTRIDARLDISSASGFAKRVFNCFKRSIDDLERSKSSPRLNLIFQQQVAGPIVFRVDSSLMLDPPSGKYRPHVEDTIYSLNYSFRLLK
SGKAVFWYSPKRKEGMVELRLFEF