; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg06332 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg06332
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome locationCarg_Chr06:7796343..7800132
RNA-Seq ExpressionCarg06332
SyntenyCarg06332
Gene Ontology termsGO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597249.1 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]6.7e-275100Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
        EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI

Query:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
        VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
Subjt:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ

Query:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
Subjt:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

XP_022936098.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita moschata]2.8e-27399.57Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
        EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI

Query:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
        VGGTFSAWFGGSDTVGTNGDGN AIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
Subjt:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ

Query:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        QQIAGPIVFRVDSRLMLGSTSVK GPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
Subjt:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

XP_022974759.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X1 [Cucurbita maxima]5.5e-26997.86Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MAHLRTAMDSAFW+FDVSSSQTLVGTAKAVPG PFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKKVIS+IKED+ISDLDNLELLPALKDVATM LDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
        EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGG+PPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQG+TETTDEAEPSYDVRLKDPHAAISGI
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI

Query:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
        VGGTFS+WFGGSDTVGTNGDGN AIHNKRSPLNADLFGSLCYTYQHGSFRKDF DLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
Subjt:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ

Query:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
Subjt:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

XP_023538749.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo]5.7e-26697.22Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPG PFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKKVISSIKED++SDLDNLELLPALKDVATM LDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
        EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALY+TDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTET D+AEPSYDVRLKDPHAAISGI
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI

Query:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
        VGGTFSAWFGGSDTVGTNGDGN AIHNKRSPLNADLFGSLC TYQHGSFRKDF DLTRLDARLDISSGSAF+KRVFNGFKKSIDDLERSKSTPRLNLIFQ
Subjt:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ

Query:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        QQIAGPIVFRVDSRLML STSVK GPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
Subjt:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

XP_038875801.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]1.4e-21978.6Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MA+LRTAMDSAFWD ++SS QTL GTAKAVPG PFPLDGARASR+LRIQQ+S LGNGFPLGI+PS+SP++ KELGSFSLQSLL + PAADWWVGL+GQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKK+ISSIK + +S  D+LE LP LKDVA   LDK+LY+YGLCSQFSP PFSSV+ STE HG+RKG RHKAMFYH+LPHHDIN++AAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA
        +VPES+SLDLSSLKSESGLRYRVGLHKNGG+PRAL  T+  DPPL LMPGLCAKAAFS EKNRYLW  KE+KQ + E TD+ E    PSYDVRLK+PHAA
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA

Query:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAI-HNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRL
        ISGI+GGTFS+WFGG+DT G+NGDGN  + H KRSPLNADLFGS+CYT+QHG F+K F DLTR+DARLDISS S FAKRVF GFKKS+DDLERSKS+PRL
Subjt:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAI-HNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRL

Query:  NLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        NL+FQQQ+AGPIVFRVDSRLML S S KHGPH+E+TI SLNYSF+LL+SGKAVFW+SP+RKEGMVELRLFEF
Subjt:  NLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

TrEMBL top hitse value%identityAlignment
A0A1S3AWM5 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.8e-21277.75Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MA+LRTAMDSAFWDF++SS QTL GTAK+VPG PFPL+GARASR LRIQQ+S LG+GFPLGI+PS+SPTAHKELGSFSLQSLLL+   A WWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKK+IS +K   +SD D  E L  LKDVA ++LDK+ Y+YG+CSQFSP+PFSSV+ STE+HG+RKGRRHKAMFYHRLP HDIN++AAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA
        +VPES+SLDLSS+KS+SGLRYRVGLHKNGGVPRAL  T+  DPPLTLMPGLCAKAAFS+EK RYLW  +E+KQ  TE T E E     SYD+RLK+PHAA
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA

Query:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAI-HNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRL
        ISGIVGGTFS+WFGGS+TVG+NGDGN  + H KRSPLNADLFGS+CYT+Q GSF KDF DLTR+DA+LDISS S FAKRVF+GFKKS+DDLERSKS+PRL
Subjt:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAI-HNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRL

Query:  NLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        NLIFQQQ+AGPIVFR+DS+LML S S K GPHVEDTI SL YSFKLL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  NLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

A0A5D3D2D9 Protein TRIGALACTOSYLDIACYLGLYCEROL 42.5e-21177.33Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MA+LRTAMDSAFWDF++SS QTL GTAK+VPG PFPL+GARASR LRIQQ+S LG+GFPLGI+PS+SPTAHKELGSFSLQSLLL+   A WWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKK+IS +K   +SD D  E L  LKDVA ++LDK+ Y+YG+CSQFSP+PFSSV+ STE+HG+RKGRRHKAMFYHRLP HDIN++AAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA
        +VPES+SLDLSS+KS+SGLRYRVGLHKNGGVPRAL  T+  DPPLTLMPGLCAKAAFS+EK RYLW  +E+KQ  T+ T E E     SYD+RLK+PHAA
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA

Query:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAI-HNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRL
        ISGIVGGTFS+WFGGS+ VG+NGDGN  + H KRSPLNADLFGS+CYT+Q GSF KDF DLTR+DA+LDISS S FAKRVF+GFKKS+DDLERSKS+PRL
Subjt:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAI-HNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRL

Query:  NLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        NLIFQQQ+AGPIVFR+DS+LML S S K GPHVEDTI SL YSFKLL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  NLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

A0A6J1CYP7 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.3e-21278.34Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MA+LRTAMDSAF D ++SS QTL GTAKAVPG PFPLDGARASRTLR+QQ+S LGNGFPLGI+PS+SPT HKELGSFSLQSLLLK PAADWWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKK+ISSIK + +S +D+LE LP LKDVA   LDK+LY+YGLCSQFSP+PFSS+F STEEHG++KGRRHKAMFYH+LP+HDI LEAAWPELF+DHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA
        +VPES+SLDLSSLKSESGLRYR GLHKNGG+PRAL  T+G +PPL LMPGLCAKAAFS EKNRYLW  +E+K+ + E TD+ E     SYDVRLK+PHAA
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAE----PSYDVRLKDPHAA

Query:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLN
        ISGIVGGTFS WF GS T+G+NGDG     NKRSPLNADLFGS+CYT+Q G FRK F DLTR+DARLDISS S FAKRVFN FK+SIDDLERSKS+PRLN
Subjt:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLN

Query:  LIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVFRVDS LML   S ++ PHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  LIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

A0A6J1FCB0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.4e-27399.57Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
        EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI

Query:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
        VGGTFSAWFGGSDTVGTNGDGN AIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
Subjt:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ

Query:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        QQIAGPIVFRVDSRLMLGSTSVK GPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
Subjt:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

A0A6J1IIJ0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X12.7e-26997.86Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MAHLRTAMDSAFW+FDVSSSQTLVGTAKAVPG PFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKKVIS+IKED+ISDLDNLELLPALKDVATM LDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI
        EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGG+PPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQG+TETTDEAEPSYDVRLKDPHAAISGI
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGI

Query:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
        VGGTFS+WFGGSDTVGTNGDGN AIHNKRSPLNADLFGSLCYTYQHGSFRKDF DLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ
Subjt:  VGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQ

Query:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
Subjt:  QQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

SwissProt top hitse value%identityAlignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.0e-7536.34Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSP----TAHKELGSFSLQSLLLKFPAADWWVGLV
        M  +R   +   WD D+S+  TL GTA+AVP  P PL  +R +R  R +QV F        ++PSFSP    T     G FSLQ +L    + +W V L+
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSP----TAHKELGSFSLQSLLLKFPAADWWVGLV

Query:  GQFRPKKVISSI-KEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEH-GD-RKGRRHKAMFYHRLPHHDINLEAAWPELFI
        GQF  ++ ++ I K        +  +   L  +   + DK+LY+ G CS+F  +P  ++  S + + GD  K  R KA+F H  P H++  EA WP LF+
Subjt:  GQFRPKKVISSI-KEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEH-GD-RKGRRHKAMFYHRLPHHDINLEAAWPELFI

Query:  DHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDP
        D  G+YW+VP S+++DL+SL +ESG  Y + LH N G P+ L+      PP +L+PGL  K+A S   N  LW      +G T   +  +P YDV L  P
Subjt:  DHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDP

Query:  HAAISGIVGGTFSAWFGGSDTVG-----TNGDGNFAIH--NKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAF-------AKRVFNGF
        H A+SGI+G   +A FG +         + G G F++H  +  S   AD  G    T Q+G+F+K F DLTR  ARLD   G  F       A+ + N  
Subjt:  HAAISGIVGGTFSAWFGGSDTVG-----TNGDGNFAIH--NKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAF-------AKRVFNGF

Query:  KKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFE
        + S++  +  K  P + +  QQQI GP  F+V+S + +   +  +   V+ T+ ++ Y+ ++L S KAV  +SPK+ E MVELR FE
Subjt:  KKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFE

Arabidopsis top hitse value%identityAlignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown8.3e-13854.78Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR
        MA+L +A+DS FWD +VSS QTL GTA++VPG PFPLDGARASR+ RIQQ+S L  GFPLGI+PS +P + K LGSFSL SLLL   + +WW+GLVGQF+
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFR

Query:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW
        PKK+ + IK D IS+ +  + L  +KD A  ++DK+LYS GL +Q +    SS+  STE  GD+ G R+K M  H L  HD+ +EAAWP+LF+D+KG++W
Subjt:  PKKVISSIKEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYW

Query:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPR---ALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDE-AEPSYDVRLKDPHAA
        +VPESL++D+SSL  ESG+RYR GLHK+ G P+   A     G D P +LMPGLCAKAA S + NR LW  +E K+G TE  D+     YD+RLK+PHAA
Subjt:  EVPESLSLDLSSLKSESGLRYRVGLHKNGGVPR---ALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDE-AEPSYDVRLKDPHAA

Query:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLN
        ISGIVG + +AW          G G      KRSP++AD+FGS CYT+Q G F K + DLTR+DAR+D+ S  A AK++F+    + DD   +  +PRLN
Subjt:  ISGIVGGTFSAWFGGSDTVGTNGDGNFAIHNKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLN

Query:  LIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVF+VDS+  +G+        +ED I SLNYS +LLESGK V W+SPKRKEGM+ELR+FEF
Subjt:  LIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFEF

AT3G06960.1 pigment defective 3201.4e-7636.34Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSP----TAHKELGSFSLQSLLLKFPAADWWVGLV
        M  +R   +   WD D+S+  TL GTA+AVP  P PL  +R +R  R +QV F        ++PSFSP    T     G FSLQ +L    + +W V L+
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSP----TAHKELGSFSLQSLLLKFPAADWWVGLV

Query:  GQFRPKKVISSI-KEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEH-GD-RKGRRHKAMFYHRLPHHDINLEAAWPELFI
        GQF  ++ ++ I K        +  +   L  +   + DK+LY+ G CS+F  +P  ++  S + + GD  K  R KA+F H  P H++  EA WP LF+
Subjt:  GQFRPKKVISSI-KEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEH-GD-RKGRRHKAMFYHRLPHHDINLEAAWPELFI

Query:  DHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDP
        D  G+YW+VP S+++DL+SL +ESG  Y + LH N G P+ L+      PP +L+PGL  K+A S   N  LW      +G T   +  +P YDV L  P
Subjt:  DHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDP

Query:  HAAISGIVGGTFSAWFGGSDTVG-----TNGDGNFAIH--NKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAF-------AKRVFNGF
        H A+SGI+G   +A FG +         + G G F++H  +  S   AD  G    T Q+G+F+K F DLTR  ARLD   G  F       A+ + N  
Subjt:  HAAISGIVGGTFSAWFGGSDTVG-----TNGDGNFAIH--NKRSPLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAF-------AKRVFNGF

Query:  KKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFE
        + S++  +  K  P + +  QQQI GP  F+V+S + +   +  +   V+ T+ ++ Y+ ++L S KAV  +SPK+ E MVELR FE
Subjt:  KKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFKLLESGKAVFWFSPKRKEGMVELRLFE

AT3G06960.2 pigment defective 3202.6e-5137.86Show/hide
Query:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSP----TAHKELGSFSLQSLLLKFPAADWWVGLV
        M  +R   +   WD D+S+  TL GTA+AVP  P PL  +R +R  R +QV F        ++PSFSP    T     G FSLQ +L    + +W V L+
Subjt:  MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSP----TAHKELGSFSLQSLLLKFPAADWWVGLV

Query:  GQFRPKKVISSI-KEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEH-GD-RKGRRHKAMFYHRLPHHDINLEAAWPELFI
        GQF  ++ ++ I K        +  +   L  +   + DK+LY+ G CS+F  +P  ++  S + + GD  K  R KA+F H  P H++  EA WP LF+
Subjt:  GQFRPKKVISSI-KEDIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEH-GD-RKGRRHKAMFYHRLPHHDINLEAAWPELFI

Query:  DHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDP
        D  G+YW+VP S+++DL+SL +ESG  Y + LH N G P+ L+      PP +L+PGL  K+A S   N  LW      +G T   +  +P YDV L  P
Subjt:  DHKGQYWEVPESLSLDLSSLKSESGLRYRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDP

Query:  HAAISGIVG
        H A+SGI+G
Subjt:  HAAISGIVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGCACCTCAGGACCGCCATGGATTCCGCCTTCTGGGATTTCGACGTTTCCTCCTCTCAAACCCTCGTCGGAACCGCCAAGGCTGTCCCCGGCGGACCATTCCCTCT
CGACGGAGCTCGAGCCAGCCGCACCTTGCGGATTCAGCAAGTCTCCTTCCTCGGAAATGGATTTCCCCTCGGAATTCTTCCTTCCTTCTCCCCCACTGCACACAAGGAGT
TAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAAGTTTCCCGCCGCCGACTGGTGGGTTGGATTGGTTGGCCAATTCCGTCCGAAGAAAGTGATATCTTCTATAAAAGAA
GACATTATTTCTGATCTAGACAACCTTGAGCTCCTCCCTGCCTTGAAAGATGTTGCTACCATGGTTCTGGACAAGACACTCTATTCATATGGATTATGCTCTCAGTTTTC
TCCTACTCCCTTTTCATCTGTATTTGCCAGCACGGAAGAGCACGGTGACAGGAAAGGACGTCGCCACAAAGCGATGTTTTATCACAGGCTTCCTCATCATGATATAAATC
TGGAAGCAGCTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGAAGTGCCCGAGTCTCTGTCTTTGGATCTGTCGTCTCTTAAGTCTGAATCTGGTTTGCGT
TACCGGGTCGGGTTGCATAAGAATGGTGGCGTTCCCCGGGCTCTTTATCATACTGATGGTGGCGACCCACCTCTTACTCTTATGCCTGGATTATGTGCAAAGGCTGCATT
CTCTTTAGAAAAGAATAGGTACCTTTGGGGGGGAAAAGAACAGAAACAAGGCGTAACTGAGACGACAGACGAGGCCGAACCATCATACGATGTGCGCCTTAAAGATCCTC
ATGCAGCCATATCTGGAATTGTTGGTGGCACCTTTAGCGCTTGGTTCGGAGGCAGTGACACGGTTGGGACGAACGGAGATGGAAACTTTGCTATCCATAACAAAAGAAGT
CCACTGAATGCTGACCTTTTTGGCTCACTTTGCTATACTTACCAACACGGGTCATTTAGAAAGGATTTTTGTGACCTCACGAGGTTAGATGCTCGGCTAGATATTTCGTC
GGGTTCAGCCTTTGCCAAAAGAGTTTTCAATGGGTTCAAGAAATCTATTGATGATCTGGAGAGATCAAAATCTACCCCTAGGCTCAATTTGATCTTCCAACAGCAGATTG
CAGGCCCGATCGTTTTCCGAGTAGATTCCCGGCTTATGCTCGGCTCTACCTCCGTCAAGCACGGACCCCATGTCGAGGACACAATATTAAGCTTAAACTATTCATTCAAG
CTTCTTGAATCAGGAAAAGCTGTTTTCTGGTTTTCTCCCAAAAGAAAAGAAGGGATGGTCGAGTTGCGCCTGTTCGAGTTTTGA
mRNA sequenceShow/hide mRNA sequence
AAAAATGCAAGAGGCAGGCAGCAGCAGATAAGGGAAGAAGGTAAACTCAGCGCCAGCCTACACCCTAAACCCATTTCATTCTGAGCTTCCAAGAAAAACAAGAAACGCAT
CAATGGCGCACCTCAGGACCGCCATGGATTCCGCCTTCTGGGATTTCGACGTTTCCTCCTCTCAAACCCTCGTCGGAACCGCCAAGGCTGTCCCCGGCGGACCATTCCCT
CTCGACGGAGCTCGAGCCAGCCGCACCTTGCGGATTCAGCAAGTCTCCTTCCTCGGAAATGGATTTCCCCTCGGAATTCTTCCTTCCTTCTCCCCCACTGCACACAAGGA
GTTAGGTTCCTTTTCTCTTCAGTCGCTCTTGCTCAAGTTTCCCGCCGCCGACTGGTGGGTTGGATTGGTTGGCCAATTCCGTCCGAAGAAAGTGATATCTTCTATAAAAG
AAGACATTATTTCTGATCTAGACAACCTTGAGCTCCTCCCTGCCTTGAAAGATGTTGCTACCATGGTTCTGGACAAGACACTCTATTCATATGGATTATGCTCTCAGTTT
TCTCCTACTCCCTTTTCATCTGTATTTGCCAGCACGGAAGAGCACGGTGACAGGAAAGGACGTCGCCACAAAGCGATGTTTTATCACAGGCTTCCTCATCATGATATAAA
TCTGGAAGCAGCTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGAAGTGCCCGAGTCTCTGTCTTTGGATCTGTCGTCTCTTAAGTCTGAATCTGGTTTGC
GTTACCGGGTCGGGTTGCATAAGAATGGTGGCGTTCCCCGGGCTCTTTATCATACTGATGGTGGCGACCCACCTCTTACTCTTATGCCTGGATTATGTGCAAAGGCTGCA
TTCTCTTTAGAAAAGAATAGGTACCTTTGGGGGGGAAAAGAACAGAAACAAGGCGTAACTGAGACGACAGACGAGGCCGAACCATCATACGATGTGCGCCTTAAAGATCC
TCATGCAGCCATATCTGGAATTGTTGGTGGCACCTTTAGCGCTTGGTTCGGAGGCAGTGACACGGTTGGGACGAACGGAGATGGAAACTTTGCTATCCATAACAAAAGAA
GTCCACTGAATGCTGACCTTTTTGGCTCACTTTGCTATACTTACCAACACGGGTCATTTAGAAAGGATTTTTGTGACCTCACGAGGTTAGATGCTCGGCTAGATATTTCG
TCGGGTTCAGCCTTTGCCAAAAGAGTTTTCAATGGGTTCAAGAAATCTATTGATGATCTGGAGAGATCAAAATCTACCCCTAGGCTCAATTTGATCTTCCAACAGCAGAT
TGCAGGCCCGATCGTTTTCCGAGTAGATTCCCGGCTTATGCTCGGCTCTACCTCCGTCAAGCACGGACCCCATGTCGAGGACACAATATTAAGCTTAAACTATTCATTCA
AGCTTCTTGAATCAGGAAAAGCTGTTTTCTGGTTTTCTCCCAAAAGAAAAGAAGGGATGGTCGAGTTGCGCCTGTTCGAGTTTTGACTTCGATATCGTTTAATTCTGTTT
TAGTTCAGTTGATGCGTTCAGTTTCGTAGATTTTTGACAACGAAATCGGCTCTACAGACTTAGTATAGCACTTGAGGCTCTTGCAGATGTAATATATAGGAGTGGCGTCC
TTGTTTATGGCATAGAGCTGAGGCTTTAAACTGTTGTTGCTCAGAAAATAAGGGCATTTGTGATGAATTTGTAGTTATTGTCTAGAAATCATTGAAACTTGCAGCTAAAC
ATGGGCTGTTCATTGCTCCAAATCTCATCTGTTTGAGAGATCTTTACATCCTTATACTGGTTCCCTTCTCGAATCGACATGGGACCTGATACTTCCCTTCTCCAATCAAC
GTGCGACCCCCAAAATCACTCCTTTGGGGCTAGCTTC
Protein sequenceShow/hide protein sequence
MAHLRTAMDSAFWDFDVSSSQTLVGTAKAVPGGPFPLDGARASRTLRIQQVSFLGNGFPLGILPSFSPTAHKELGSFSLQSLLLKFPAADWWVGLVGQFRPKKVISSIKE
DIISDLDNLELLPALKDVATMVLDKTLYSYGLCSQFSPTPFSSVFASTEEHGDRKGRRHKAMFYHRLPHHDINLEAAWPELFIDHKGQYWEVPESLSLDLSSLKSESGLR
YRVGLHKNGGVPRALYHTDGGDPPLTLMPGLCAKAAFSLEKNRYLWGGKEQKQGVTETTDEAEPSYDVRLKDPHAAISGIVGGTFSAWFGGSDTVGTNGDGNFAIHNKRS
PLNADLFGSLCYTYQHGSFRKDFCDLTRLDARLDISSGSAFAKRVFNGFKKSIDDLERSKSTPRLNLIFQQQIAGPIVFRVDSRLMLGSTSVKHGPHVEDTILSLNYSFK
LLESGKAVFWFSPKRKEGMVELRLFEF