; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020923 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020923
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome locationchr7:3189738..3192929
RNA-Seq ExpressionLag0020923
SyntenyLag0020923
Gene Ontology termsGO:0034196 - acylglycerol transport (biological process)
GO:1990052 - ER to chloroplast lipid transport (biological process)
GO:0009941 - chloroplast envelope (cellular component)
GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domainsIPR022244 - Protein of unknown function DUF3769
IPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049109.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4 [Cucumis melo var. makuwa]6.3e-22883.09Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MA LRTAMDSAFWD NLSSPQTL+GTAK+VPGEPFPL+GARASR LRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLRL  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV
        PKKLIS +KA+LS  D  E   LKDVAR  LDK+ YTYG C+QFSPSPFSSV+VSTE+HGERKGRRHKAMFYH+LP+HDIN+DAAWPELFIDHKGQYW+V
Subjt:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV

Query:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS
        PESISLDLSSVKS+SGLRYRVGLHKNGG+PRALNSTN DDPPL LMPGLCAKAAFS EK +YLWRV+E+KQD  +KT +G+     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR
        GIVGGTF +WFGGS+ VGS   NGDGNL + HKKRSPLNADLFGS+CYTFQ G F K FGDLTRID +LDISSASGFAKRVF+GFKKSVDDLERS+SSPR
Subjt:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR

Query:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LNLIFQQQVAGPIVFR+DS+LMLDSASGK GPHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

XP_008438274.1 PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo]4.3e-22983.51Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MA LRTAMDSAFWD NLSSPQTL+GTAK+VPGEPFPL+GARASR LRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLRL  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV
        PKKLIS +KA+LS  D  E   LKDVAR  LDK+ YTYG C+QFSPSPFSSV+VSTE+HGERKGRRHKAMFYH+LP+HDIN+DAAWPELFIDHKGQYW+V
Subjt:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV

Query:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS
        PESISLDLSSVKS+SGLRYRVGLHKNGG+PRALNSTN DDPPL LMPGLCAKAAFS EK +YLWRV+E+KQD  EKT +G+     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR
        GIVGGTF +WFGGS+TVGS   NGDGNL + HKKRSPLNADLFGS+CYTFQ G F K FGDLTRID +LDISSASGFAKRVF+GFKKSVDDLERS+SSPR
Subjt:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR

Query:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LNLIFQQQVAGPIVFR+DS+LMLDSASGK GPHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

XP_022146920.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Momordica charantia]5.5e-24087.08Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MAYLRTAMDSAF DLNLSSPQTL+GTAKAVPG+PFPLDGARASRTLR+QQISLLGNGFPLGIIPSYSPT  KELGSFSLQSLLL+LPAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV
        PKKLISSIKAELSAVDSLE PVLKDVA QFLDK+LYTYG C+QFSPSPFSS+F STE+HGE+KGRRHKAMFYHKLP HDI L+AAWPELF+DHKGQYW+V
Subjt:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV

Query:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS
        PESISLDLSS+KSESGLRYR GLHKNGG+PRAL+ TNGD+PPLALMPGLCAKAAFSFEKN+YLWRV+ERK+D++EKTDKG+  WR SYDVRLKEPHAAIS
Subjt:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPRL
        GIVGGTF TWF GS T+GS   NGDGN     KRSPLNADLFGSICYTFQ GRFRKQFGDLTRID RLDISSASGFAKRVFN FK+S+DDLERS+SSPRL
Subjt:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPRL

Query:  NLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        NLIFQQQVAGPIVFRVDS LMLD  SG++ PHVEDTIYSLNYSFRLL+SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  NLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

XP_022974759.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X1 [Cucurbita maxima]2.6e-22681.43Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MA+LRTAMDSAFW+ ++SS QTL GTAKAVPGEPFPLDGARASRTLRIQQ+S LGNGFPLGI+PS+SPTA KELGSFSLQSLLL+ PAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAEL-SAVDSLE-PPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYW
        PKK+IS+IK +L S +D+LE  P LKDVA  FLDKTLY+YG C+QFSP+PFSSVF STE+HG+RKGRRHKAMFYH+LP HDINL+AAWPELFIDHKGQYW
Subjt:  PKKLISSIKAEL-SAVDSLE-PPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYW

Query:  EVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAA
        EVPES+SLDLSS+KSESGLRYRVGLHKNGG+PRAL  T+G +PPL LMPGLCAKAAFS EKN+YLW  KE+KQ L E TD+ +    PSYDVRLK+PHAA
Subjt:  EVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAA

Query:  ISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSP
        ISGIVGGTF +WFGGSDTV   GTNGDGNLAIH KRSPLNADLFGS+CYT+QHG FRK F DLTR+D RLDISS S FAKRVFNGFKKS+DDLERS+S+P
Subjt:  ISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSP

Query:  RLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        RLNLIFQQQ+AGPIVFRVDSRLML S S KHGPHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  RLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

XP_038875801.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida]7.8e-25590.49Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MAYLRTAMDSAFWDLN+SSPQTL+GTAKAVPGEPFPLDGARASR+LRIQQISLLGNGFPLGIIPSYSP++QKELGSFSLQSLL RLPAADWWVGL+GQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV
        PKKLISSIKAELSA DSLE PVLKDVARQFLDK+LYTYG C+QFSP+PFSSV+VSTE HGERKG RHKAMFYHKLP HDIN+DAAWPELFIDHKGQYW+V
Subjt:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV

Query:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS
        PESISLDLSS+KSESGLRYRVGLHKNGGIPRALNSTN +DPPLALMPGLCAKAAFSFEKN+YLWRVKERKQDLIEKTDK +WYW+PSYDVRLKEPHAAIS
Subjt:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR
        GI+GGTF +WFGG+DT GS   NGDGNL + HKKRSPLNADLFGSICYTFQHGRF+KQFGDLTRID RLDISSASGFAKRVF GFKKSVDDLERS+SSPR
Subjt:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR

Query:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LNL+FQQQVAGPIVFRVDSRLMLDSASGKHGPH+E+TIYSLNYSFRLLQSGKAVFWYSP+RKEGMVELRLFEF
Subjt:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

TrEMBL top hitse value%identityAlignment
A0A1S3AWM5 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.1e-22983.51Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MA LRTAMDSAFWD NLSSPQTL+GTAK+VPGEPFPL+GARASR LRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLRL  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV
        PKKLIS +KA+LS  D  E   LKDVAR  LDK+ YTYG C+QFSPSPFSSV+VSTE+HGERKGRRHKAMFYH+LP+HDIN+DAAWPELFIDHKGQYW+V
Subjt:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV

Query:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS
        PESISLDLSSVKS+SGLRYRVGLHKNGG+PRALNSTN DDPPL LMPGLCAKAAFS EK +YLWRV+E+KQD  EKT +G+     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR
        GIVGGTF +WFGGS+TVGS   NGDGNL + HKKRSPLNADLFGS+CYTFQ G F K FGDLTRID +LDISSASGFAKRVF+GFKKSVDDLERS+SSPR
Subjt:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR

Query:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LNLIFQQQVAGPIVFR+DS+LMLDSASGK GPHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

A0A5D3D2D9 Protein TRIGALACTOSYLDIACYLGLYCEROL 43.0e-22883.09Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MA LRTAMDSAFWD NLSSPQTL+GTAK+VPGEPFPL+GARASR LRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLRL  A WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV
        PKKLIS +KA+LS  D  E   LKDVAR  LDK+ YTYG C+QFSPSPFSSV+VSTE+HGERKGRRHKAMFYH+LP+HDIN+DAAWPELFIDHKGQYW+V
Subjt:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV

Query:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS
        PESISLDLSSVKS+SGLRYRVGLHKNGG+PRALNSTN DDPPL LMPGLCAKAAFS EK +YLWRV+E+KQD  +KT +G+     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR
        GIVGGTF +WFGGS+ VGS   NGDGNL + HKKRSPLNADLFGS+CYTFQ G F K FGDLTRID +LDISSASGFAKRVF+GFKKSVDDLERS+SSPR
Subjt:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAI-HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPR

Query:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LNLIFQQQVAGPIVFR+DS+LMLDSASGK GPHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  LNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1CYP7 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic2.6e-24087.08Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MAYLRTAMDSAF DLNLSSPQTL+GTAKAVPG+PFPLDGARASRTLR+QQISLLGNGFPLGIIPSYSPT  KELGSFSLQSLLL+LPAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV
        PKKLISSIKAELSAVDSLE PVLKDVA QFLDK+LYTYG C+QFSPSPFSS+F STE+HGE+KGRRHKAMFYHKLP HDI L+AAWPELF+DHKGQYW+V
Subjt:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV

Query:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS
        PESISLDLSS+KSESGLRYR GLHKNGG+PRAL+ TNGD+PPLALMPGLCAKAAFSFEKN+YLWRV+ERK+D++EKTDKG+  WR SYDVRLKEPHAAIS
Subjt:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPRL
        GIVGGTF TWF GS T+GS   NGDGN     KRSPLNADLFGSICYTFQ GRFRKQFGDLTRID RLDISSASGFAKRVFN FK+S+DDLERS+SSPRL
Subjt:  GIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPRL

Query:  NLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        NLIFQQQVAGPIVFRVDS LMLD  SG++ PHVEDTIYSLNYSFRLL+SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  NLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1FCB0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.2e-22481.01Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MA+LRTAMDSAFWD ++SS QTL GTAKAVPG PFPLDGARASRTLRIQQ+S LGNGFPLGI+PS+SPTA KELGSFSLQSLLL+ PAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAE-LSAVDSLE-PPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYW
        PKK+ISSIK + +S +D+LE  P LKDVA   LDKTLY+YG C+QFSP+PFSSVF STE+HG+RKGRRHKAMFYH+LP HDINL+AAWPELFIDHKGQYW
Subjt:  PKKLISSIKAE-LSAVDSLE-PPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYW

Query:  EVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAA
        EVPES+SLDLSS+KSESGLRYRVGLHKNGG+PRAL  T+G DPPL LMPGLCAKAAFS EKN+YLW  KE+KQ + E TD+ +    PSYDVRLK+PHAA
Subjt:  EVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAA

Query:  ISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSP
        ISGIVGGTF  WFGGSDTV   GTNGDGNLAIH KRSPLNADLFGS+CYT+QHG FRK F DLTR+D RLDISS S FAKRVFNGFKKS+DDLERS+S+P
Subjt:  ISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSP

Query:  RLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        RLNLIFQQQ+AGPIVFRVDSRLML S S K GPHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  RLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1IIJ0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X11.3e-22681.43Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MA+LRTAMDSAFW+ ++SS QTL GTAKAVPGEPFPLDGARASRTLRIQQ+S LGNGFPLGI+PS+SPTA KELGSFSLQSLLL+ PAADWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAEL-SAVDSLE-PPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYW
        PKK+IS+IK +L S +D+LE  P LKDVA  FLDKTLY+YG C+QFSP+PFSSVF STE+HG+RKGRRHKAMFYH+LP HDINL+AAWPELFIDHKGQYW
Subjt:  PKKLISSIKAEL-SAVDSLE-PPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYW

Query:  EVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAA
        EVPES+SLDLSS+KSESGLRYRVGLHKNGG+PRAL  T+G +PPL LMPGLCAKAAFS EKN+YLW  KE+KQ L E TD+ +    PSYDVRLK+PHAA
Subjt:  EVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAA

Query:  ISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSP
        ISGIVGGTF +WFGGSDTV   GTNGDGNLAIH KRSPLNADLFGS+CYT+QHG FRK F DLTR+D RLDISS S FAKRVFNGFKKS+DDLERS+S+P
Subjt:  ISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSP

Query:  RLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        RLNLIFQQQ+AGPIVFRVDSRLML S S KHGPHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  RLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

SwissProt top hitse value%identityAlignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic1.6e-7735.23Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAADWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAADWWVGLV

Query:  GQFRPKKLISSI---KAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQH-GE-RKGRRHKAMFYHKLPRHDINLDAAWPELFI
        GQF  ++ ++ I   KA      S     L  + +   DK+LY  GFC++F  SP  ++ +S + + G+  K  R KA+F H+ P H++  +A WP LF+
Subjt:  GQFRPKKLISSI---KAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQH-GE-RKGRRHKAMFYHKLPRHDINLDAAWPELFI

Query:  DHKGQYWEVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVR
        D  G+YW+VP S+++DL+S+ +ESG  Y + LH N G P+ L+S   + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWEVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVR

Query:  LKEPHAAISGIVGGTFGTWFGGSDTVGSL--GTNGDGNLAIH--KKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLD-------ISSASGFAKRV
        L  PH A+SGI+G      FG +         + G G  ++H     S   AD  G    T Q+G F+K F DLTR   RLD       ++ A+  A+ +
Subjt:  LKEPHAAISGIVGGTFGTWFGGSDTVGSL--GTNGDGNLAIH--KKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLD-------ISSASGFAKRV

Query:  FNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFE
         N  + S++  ++    P + +  QQQ+ GP  F+V+S + +D  +G +   V+ T++++ Y+ ++L S KAV  YSPK+ E MVELR FE
Subjt:  FNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFE

Arabidopsis top hitse value%identityAlignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown4.0e-14855.37Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR
        MA L +A+DS FWD N+SSPQTL GTA++VPGEPFPLDGARASR+ RIQQ+SLL  GFPLGIIPS +P + K LGSFSL SLLL   + +WW+GLVGQF+
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFR

Query:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV
        PKKL + IKA++S  +  +  V+KD A+  +DK+LY+ G   Q +    SS+ +STE+ G++ G R+K M  H L +HD+ ++AAWP+LF+D+KG++W+V
Subjt:  PKKLISSIKAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEV

Query:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNST---NGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHA
        PES+++D+SS+  ESG+RYR GLHK+ G P+ +N+    +G D P +LMPGLCAKAA S++ N+ LWR +E K+   E+ DK  +     YD+RLKEPHA
Subjt:  PESISLDLSSVKSESGLRYRVGLHKNGGIPRALNST---NGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHA

Query:  AISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESS
        AISGIVG +   W             G G L   KKRSP++AD+FGS CYTFQ GRF K +GDLTR+D R+D+ SA   AK++F+    + DD   +  S
Subjt:  AISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAIHKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESS

Query:  PRLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        PRLNLIFQQQVAGPIVF+VDS+  + +A       +ED IYSLNYS RLL+SGK V WYSPKRKEGM+ELR+FEF
Subjt:  PRLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

AT3G06960.1 pigment defective 3201.1e-7835.23Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAADWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAADWWVGLV

Query:  GQFRPKKLISSI---KAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQH-GE-RKGRRHKAMFYHKLPRHDINLDAAWPELFI
        GQF  ++ ++ I   KA      S     L  + +   DK+LY  GFC++F  SP  ++ +S + + G+  K  R KA+F H+ P H++  +A WP LF+
Subjt:  GQFRPKKLISSI---KAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQH-GE-RKGRRHKAMFYHKLPRHDINLDAAWPELFI

Query:  DHKGQYWEVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVR
        D  G+YW+VP S+++DL+S+ +ESG  Y + LH N G P+ L+S   + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWEVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVR

Query:  LKEPHAAISGIVGGTFGTWFGGSDTVGSL--GTNGDGNLAIH--KKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLD-------ISSASGFAKRV
        L  PH A+SGI+G      FG +         + G G  ++H     S   AD  G    T Q+G F+K F DLTR   RLD       ++ A+  A+ +
Subjt:  LKEPHAAISGIVGGTFGTWFGGSDTVGSL--GTNGDGNLAIH--KKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLD-------ISSASGFAKRV

Query:  FNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFE
         N  + S++  ++    P + +  QQQ+ GP  F+V+S + +D  +G +   V+ T++++ Y+ ++L S KAV  YSPK+ E MVELR FE
Subjt:  FNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFE

AT3G06960.2 pigment defective 3204.9e-5337.06Show/hide
Query:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAADWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRLPAADWWVGLV

Query:  GQFRPKKLISSI---KAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQH-GE-RKGRRHKAMFYHKLPRHDINLDAAWPELFI
        GQF  ++ ++ I   KA      S     L  + +   DK+LY  GFC++F  SP  ++ +S + + G+  K  R KA+F H+ P H++  +A WP LF+
Subjt:  GQFRPKKLISSI---KAELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQH-GE-RKGRRHKAMFYHKLPRHDINLDAAWPELFI

Query:  DHKGQYWEVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVR
        D  G+YW+VP S+++DL+S+ +ESG  Y + LH N G P+ L+S   + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWEVPESISLDLSSVKSESGLRYRVGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVR

Query:  LKEPHAAISGIVG
        L  PH A+SGI+G
Subjt:  LKEPHAAISGIVG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTACCTCAGGACCGCCATGGATTCCGCCTTCTGGGACTTGAACCTCTCCTCCCCTCAAACCCTCTCCGGAACCGCCAAGGCCGTCCCCGGCGAACCGTTTCCTCT
CGACGGTGCTCGAGCCAGCCGCACCTTGCGGATTCAGCAAATCTCCCTCCTCGGTAATGGATTTCCGCTCGGAATTATTCCTTCCTACTCTCCCACTGCACAGAAGGAGT
TAGGTTCATTTTCTCTTCAGTCGCTCTTGCTCAGGCTGCCCGCCGCCGATTGGTGGGTTGGATTGGTTGGTCAATTCCGTCCAAAGAAACTGATATCTTCTATAAAAGCC
GAACTTTCTGCTGTGGATAGCCTCGAACCCCCTGTCTTGAAAGATGTTGCTAGACAGTTTCTGGACAAGACACTCTATACGTATGGATTTTGCGCTCAGTTTTCTCCTAG
TCCATTTTCATCCGTATTTGTCAGCACAGAACAGCATGGTGAGAGGAAAGGACGTCGCCACAAAGCAATGTTTTATCACAAGCTTCCTCGTCATGATATAAATCTGGATG
CAGCTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGAAGTGCCTGAGTCCATATCTTTGGATCTTTCATCTGTTAAGTCTGAATCTGGTTTGCGATACCGA
GTTGGGTTGCATAAGAATGGTGGCATTCCCCGGGCTCTTAACTCTACCAATGGCGACGACCCACCTCTTGCTCTTATGCCTGGACTATGTGCAAAGGCTGCATTTTCTTT
TGAAAAGAACAAGTACCTTTGGAGGGTAAAAGAGAGGAAACAAGACCTGATTGAGAAGACAGACAAGGGAGACTGGTATTGGAGGCCATCATACGATGTGCGTCTTAAAG
AACCTCATGCAGCCATATCTGGAATTGTCGGTGGCACCTTTGGCACTTGGTTTGGAGGCAGTGACACGGTTGGGAGTCTTGGGACCAATGGAGATGGAAACTTAGCTATC
CATAAGAAAAGAAGTCCATTGAATGCTGACCTTTTTGGCTCAATTTGCTATACTTTCCAACATGGGAGATTCAGAAAGCAATTTGGTGACCTTACGAGGATAGATACTCG
GTTAGATATTTCGTCGGCTTCAGGGTTTGCTAAAAGAGTTTTCAATGGTTTCAAGAAATCTGTTGATGATCTAGAGAGATCAGAATCTTCCCCCAGACTTAATTTGATCT
TTCAACAACAGGTGGCTGGCCCGATTGTCTTCCGTGTAGATTCCAGGCTTATGCTCGATTCTGCCTCTGGCAAGCACGGTCCCCATGTCGAGGACACAATATACAGCCTA
AACTATTCATTTAGGCTTCTTCAATCAGGCAAAGCCGTTTTCTGGTATTCTCCCAAAAGGAAAGAGGGAATGGTCGAGTTGCGCCTGTTTGAGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTACCTCAGGACCGCCATGGATTCCGCCTTCTGGGACTTGAACCTCTCCTCCCCTCAAACCCTCTCCGGAACCGCCAAGGCCGTCCCCGGCGAACCGTTTCCTCT
CGACGGTGCTCGAGCCAGCCGCACCTTGCGGATTCAGCAAATCTCCCTCCTCGGTAATGGATTTCCGCTCGGAATTATTCCTTCCTACTCTCCCACTGCACAGAAGGAGT
TAGGTTCATTTTCTCTTCAGTCGCTCTTGCTCAGGCTGCCCGCCGCCGATTGGTGGGTTGGATTGGTTGGTCAATTCCGTCCAAAGAAACTGATATCTTCTATAAAAGCC
GAACTTTCTGCTGTGGATAGCCTCGAACCCCCTGTCTTGAAAGATGTTGCTAGACAGTTTCTGGACAAGACACTCTATACGTATGGATTTTGCGCTCAGTTTTCTCCTAG
TCCATTTTCATCCGTATTTGTCAGCACAGAACAGCATGGTGAGAGGAAAGGACGTCGCCACAAAGCAATGTTTTATCACAAGCTTCCTCGTCATGATATAAATCTGGATG
CAGCTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGAAGTGCCTGAGTCCATATCTTTGGATCTTTCATCTGTTAAGTCTGAATCTGGTTTGCGATACCGA
GTTGGGTTGCATAAGAATGGTGGCATTCCCCGGGCTCTTAACTCTACCAATGGCGACGACCCACCTCTTGCTCTTATGCCTGGACTATGTGCAAAGGCTGCATTTTCTTT
TGAAAAGAACAAGTACCTTTGGAGGGTAAAAGAGAGGAAACAAGACCTGATTGAGAAGACAGACAAGGGAGACTGGTATTGGAGGCCATCATACGATGTGCGTCTTAAAG
AACCTCATGCAGCCATATCTGGAATTGTCGGTGGCACCTTTGGCACTTGGTTTGGAGGCAGTGACACGGTTGGGAGTCTTGGGACCAATGGAGATGGAAACTTAGCTATC
CATAAGAAAAGAAGTCCATTGAATGCTGACCTTTTTGGCTCAATTTGCTATACTTTCCAACATGGGAGATTCAGAAAGCAATTTGGTGACCTTACGAGGATAGATACTCG
GTTAGATATTTCGTCGGCTTCAGGGTTTGCTAAAAGAGTTTTCAATGGTTTCAAGAAATCTGTTGATGATCTAGAGAGATCAGAATCTTCCCCCAGACTTAATTTGATCT
TTCAACAACAGGTGGCTGGCCCGATTGTCTTCCGTGTAGATTCCAGGCTTATGCTCGATTCTGCCTCTGGCAAGCACGGTCCCCATGTCGAGGACACAATATACAGCCTA
AACTATTCATTTAGGCTTCTTCAATCAGGCAAAGCCGTTTTCTGGTATTCTCCCAAAAGGAAAGAGGGAATGGTCGAGTTGCGCCTGTTTGAGTTTTGA
Protein sequenceShow/hide protein sequence
MAYLRTAMDSAFWDLNLSSPQTLSGTAKAVPGEPFPLDGARASRTLRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRLPAADWWVGLVGQFRPKKLISSIKA
ELSAVDSLEPPVLKDVARQFLDKTLYTYGFCAQFSPSPFSSVFVSTEQHGERKGRRHKAMFYHKLPRHDINLDAAWPELFIDHKGQYWEVPESISLDLSSVKSESGLRYR
VGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNKYLWRVKERKQDLIEKTDKGDWYWRPSYDVRLKEPHAAISGIVGGTFGTWFGGSDTVGSLGTNGDGNLAI
HKKRSPLNADLFGSICYTFQHGRFRKQFGDLTRIDTRLDISSASGFAKRVFNGFKKSVDDLERSESSPRLNLIFQQQVAGPIVFRVDSRLMLDSASGKHGPHVEDTIYSL
NYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF