CuGenDBv2

Tan0012303 (gene) of Snake gourd v1 genome

Gene ID: Tan0012303
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic
Genome location: LG10:3743052..3746779
RNA-Seq Expression: Tan0012303
Synteny: Tan0012303
Gene Ontology terms:
  GO:0034196 - acylglycerol transport (biological process)
  GO:1990052 - ER to chloroplast lipid transport (biological process)
  GO:0009941 - chloroplast envelope (cellular component)
  GO:0070300 - phosphatidic acid binding (molecular function)
InterPro domains:
  IPR022244 - Protein of unknown function DUF3769
  IPR044160 - Protein TRIGALACTOSYLDIACYLGLYCEROL 4-like
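
The Genome location field above packs a linkage group and a coordinate range into one string. A minimal sketch of how that string could be split and the locus span computed is given below. The function names are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts.

    Hypothetical helper; the format is inferred from the single
    example in this record (LG10:3743052..3746779).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG10:3743052..3746779")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 3728 bp on LG10. If the database uses half-open coordinates instead, the span would be one base shorter.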


Homology
GenBank top hits (accession and description | e value | % identity), each followed by its alignment
XP_008438274.1 PREDICTED: protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucumis melo] | e value: 2.0e-226 | % identity: 82.13
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MA LRTAMDSAFWD N+SSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLR     WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVAR  LDK+ +TYG+CSQFSPSPFSS+++STE+HGERKG RHKAMFYH+LP HDIN++A WPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS
        PESISLDLSS+KS SGLRYR GLHKNGG+PRALNSTN DDPPL LMPGLCAKAAFS EK RYLWRV+E+KQD  EKT +GE     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFGGSDTVGTNGDGNLAI-HKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNL
        GIVGGTFS+WFGGS+TVG+NGDGNL + HKKRSPLNADLFGS+CYT Q G F K FGDLTR+DA+LDISSAS FAKRVF+GFKKSVDD+ERS+SSPRLNL
Subjt:  GIVGGTFSTWFGGSDTVGTNGDGNLAI-HKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNL

Query:  IFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        IFQQQVAGPIVFR+DS+ MLDSASGK  PHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

XP_022146920.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Momordica charantia] | e value: 3.5e-239 | % identity: 87.21
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MAYLRTAMDSAF DLN+SSPQTLAGTAKAVPG+PFPLDGARASR LR+QQISLLGNGFPLGIIPSYSPT  KELGSFSLQSLLL+ PA DWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV
        PKKLISSIKAELSA DSLELPVLKDVA QFLDK+L+TYGLCSQFSPSPFSSLF STEEHGE+KG RHKAMFYHKLP+HDI +EA WPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS
        PESISLDLSSLKS SGLRYR GLHKNGG+PRAL+ TNGD+PPLALMPGLCAKAAFSFEKNRYLWRV+ERK+DM+EKTDKGE  WR SYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNLI
        GIVGGTFSTWF GS T+G+NGDGN     KRSPLNADLFGSICYT Q GRFRKQFGDLTR+DARLDISSAS FAKRVFN FK+S+DD+ERS+SSPRLNLI
Subjt:  GIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNLI

Query:  FQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        FQQQVAGPIVFRVDS  MLD  SG+  PHVEDTIYSLNYSFRLL+SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  FQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

XP_022974759.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X1 [Cucurbita maxima] | e value: 2.6e-226 | % identity: 81.74
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MA+LRTAMDSAFW+ ++SS QTL GTAKAVPGEPFPLDGARASR LRIQQ+S LGNGFPLGI+PS+SPTA KELGSFSLQSLLL+FPA DWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAEL-SAADSLE-LPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYW
        PKK+IS+IK +L S  D+LE LP LKDVA  FLDKTL++YGLCSQFSP+PFSS+F STEEHG+RKG RHKAMFYH+LPHHDIN+EA WPELFIDHKGQYW
Subjt:  PKKLISSIKAEL-SAADSLE-LPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYW

Query:  DVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAA
        +VPES+SLDLSSLKS SGLRYR GLHKNGG+PRAL  T+G +PPL LMPGLCAKAAFS EKNRYLW  KE+KQ + E TD+ E    PSYDVRLK+PHAA
Subjt:  DVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAA

Query:  ISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLN
        ISGIVGGTFS+WFGGSDTVGTNGDGNLAIH KRSPLNADLFGS+CYT QHG FRK F DLTRLDARLDISS SAFAKRVFNGFKKS+DD+ERS+S+PRLN
Subjt:  ISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLN

Query:  LIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVFRVDSR ML S S K  PHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  LIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

XP_023538749.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Cucurbita pepo subsp. pepo] | e value: 9.0e-227 | % identity: 82.17
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MA+LRTAMDSAFWD ++SS QTL GTAKAVPGEPFPLDGARASR LRIQQ+S LGNGFPLGI+PS+SPTA KELGSFSLQSLLL+FPA DWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAEL-SAADSLE-LPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYW
        PKK+ISSIK +L S  D+LE LP LKDVA  FLDKTL++YGLCSQFSP+PFSS+F STEEHG+RKG RHKAMFYH+LPHHDIN+EA WPELFIDHKGQYW
Subjt:  PKKLISSIKAEL-SAADSLE-LPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYW

Query:  DVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAA
        +VPES+SLDLSSLKS SGLRYR GLHKNGG+PRAL +T+G DPPL LMPGLCAKAAFS EKNRYLW  KE+KQ + E  D  E    PSYDVRLK+PHAA
Subjt:  DVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAA

Query:  ISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLN
        ISGIVGGTFS WFGGSDTVGTNGDGNLAIH KRSPLNADLFGS+C T QHG FRK F DLTRLDARLDISS SAF+KRVFNGFKKS+DD+ERS+S+PRLN
Subjt:  ISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLN

Query:  LIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVFRVDSR MLDS S KR PHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  LIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

XP_038875801.1 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic [Benincasa hispida] | e value: 2.5e-253 | % identity: 90
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASR+LRIQQISLLGNGFPLGIIPSYSP++QKELGSFSLQSLL R PA DWWVGL+GQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV
        PKKLISSIKAELSAADSLELPVLKDVARQFLDK+L+TYGLCSQFSP+PFSS+++STE HGERKG RHKAMFYHKLPHHDIN++A WPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS
        PESISLDLSSLKS SGLRYR GLHKNGGIPRALNSTN +DPPLALMPGLCAKAAFSFEKNRYLWRVKERKQD+IEKTDK EWYW+PSYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFGGSDTVGTNGDGNLAI-HKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNL
        GI+GGTFS+WFGG+DT G+NGDGNL + HKKRSPLNADLFGSICYT QHGRF+KQFGDLTR+DARLDISSAS FAKRVF GFKKSVDD+ERS+SSPRLNL
Subjt:  GIVGGTFSTWFGGSDTVGTNGDGNLAI-HKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNL

Query:  IFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        +FQQQVAGPIVFRVDSR MLDSASGK  PH+E+TIYSLNYSFRLLQSGKAVFWYSP+RKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

TrEMBL top hits (accession and description | e value | % identity), each followed by its alignment
A0A1S3AWM5 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | e value: 9.7e-227 | % identity: 82.13
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MA LRTAMDSAFWD N+SSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLR     WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVAR  LDK+ +TYG+CSQFSPSPFSS+++STE+HGERKG RHKAMFYH+LP HDIN++A WPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS
        PESISLDLSS+KS SGLRYR GLHKNGG+PRALNSTN DDPPL LMPGLCAKAAFS EK RYLWRV+E+KQD  EKT +GE     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFGGSDTVGTNGDGNLAI-HKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNL
        GIVGGTFS+WFGGS+TVG+NGDGNL + HKKRSPLNADLFGS+CYT Q G F K FGDLTR+DA+LDISSAS FAKRVF+GFKKSVDD+ERS+SSPRLNL
Subjt:  GIVGGTFSTWFGGSDTVGTNGDGNLAI-HKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNL

Query:  IFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        IFQQQVAGPIVFR+DS+ MLDSASGK  PHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

A0A5D3D2D9 Protein TRIGALACTOSYLDIACYLGLYCEROL 4 | e value: 1.4e-225 | % identity: 81.7
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MA LRTAMDSAFWD N+SSPQTLAGTAK+VPGEPFPL+GARASRALRIQQ+SLLG+GFPLGIIPSYSPTA KELGSFSLQSLLLR     WWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV
        PKKLIS +KA+LS  D  EL  LKDVAR  LDK+ +TYG+CSQFSPSPFSS+++STE+HGERKG RHKAMFYH+LP HDIN++A WPELFIDHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS
        PESISLDLSS+KS SGLRYR GLHKNGG+PRALNSTN DDPPL LMPGLCAKAAFS EK RYLWRV+E+KQD  +KT +GE     SYD+RLKEPHAAIS
Subjt:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFGGSDTVGTNGDGNLAI-HKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNL
        GIVGGTFS+WFGGS+ VG+NGDGNL + HKKRSPLNADLFGS+CYT Q G F K FGDLTR+DA+LDISSAS FAKRVF+GFKKSVDD+ERS+SSPRLNL
Subjt:  GIVGGTFSTWFGGSDTVGTNGDGNLAI-HKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNL

Query:  IFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        IFQQQVAGPIVFR+DS+ MLDSASGK  PHVEDTIYSL YSF+LL SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  IFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1CYP7 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | e value: 1.7e-239 | % identity: 87.21
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MAYLRTAMDSAF DLN+SSPQTLAGTAKAVPG+PFPLDGARASR LR+QQISLLGNGFPLGIIPSYSPT  KELGSFSLQSLLL+ PA DWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV
        PKKLISSIKAELSA DSLELPVLKDVA QFLDK+L+TYGLCSQFSPSPFSSLF STEEHGE+KG RHKAMFYHKLP+HDI +EA WPELF+DHKGQYWDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS
        PESISLDLSSLKS SGLRYR GLHKNGG+PRAL+ TNGD+PPLALMPGLCAKAAFSFEKNRYLWRV+ERK+DM+EKTDKGE  WR SYDVRLKEPHAAIS
Subjt:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAIS

Query:  GIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNLI
        GIVGGTFSTWF GS T+G+NGDGN     KRSPLNADLFGSICYT Q GRFRKQFGDLTR+DARLDISSAS FAKRVFN FK+S+DD+ERS+SSPRLNLI
Subjt:  GIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNLI

Query:  FQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        FQQQVAGPIVFRVDS  MLD  SG+  PHVEDTIYSLNYSFRLL+SGKAVFWYSPKRKEGMVELRLFEF
Subjt:  FQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1FCB0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | e value: 2.2e-226 | % identity: 81.95
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MA+LRTAMDSAFWD ++SS QTL GTAKAVPG PFPLDGARASR LRIQQ+S LGNGFPLGI+PS+SPTA KELGSFSLQSLLL+FPA DWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAE-LSAADSLE-LPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYW
        PKK+ISSIK + +S  D+LE LP LKDVA   LDKTL++YGLCSQFSP+PFSS+F STEEHG+RKG RHKAMFYH+LPHHDIN+EA WPELFIDHKGQYW
Subjt:  PKKLISSIKAE-LSAADSLE-LPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYW

Query:  DVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAA
        +VPES+SLDLSSLKS SGLRYR GLHKNGG+PRAL  T+G DPPL LMPGLCAKAAFS EKNRYLW  KE+KQ + E TD+ E    PSYDVRLK+PHAA
Subjt:  DVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAA

Query:  ISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLN
        ISGIVGGTFS WFGGSDTVGTNGDGNLAIH KRSPLNADLFGS+CYT QHG FRK F DLTRLDARLDISS SAFAKRVFNGFKKS+DD+ERS+S+PRLN
Subjt:  ISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLN

Query:  LIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVFRVDSR ML S S KR PHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  LIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

A0A6J1IIJ0 protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic-like isoform X1 | e value: 1.3e-226 | % identity: 81.74
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MA+LRTAMDSAFW+ ++SS QTL GTAKAVPGEPFPLDGARASR LRIQQ+S LGNGFPLGI+PS+SPTA KELGSFSLQSLLL+FPA DWWVGLVGQFR
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAEL-SAADSLE-LPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYW
        PKK+IS+IK +L S  D+LE LP LKDVA  FLDKTL++YGLCSQFSP+PFSS+F STEEHG+RKG RHKAMFYH+LPHHDIN+EA WPELFIDHKGQYW
Subjt:  PKKLISSIKAEL-SAADSLE-LPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYW

Query:  DVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAA
        +VPES+SLDLSSLKS SGLRYR GLHKNGG+PRAL  T+G +PPL LMPGLCAKAAFS EKNRYLW  KE+KQ + E TD+ E    PSYDVRLK+PHAA
Subjt:  DVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAA

Query:  ISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLN
        ISGIVGGTFS+WFGGSDTVGTNGDGNLAIH KRSPLNADLFGS+CYT QHG FRK F DLTRLDARLDISS SAFAKRVFNGFKKS+DD+ERS+S+PRLN
Subjt:  ISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLN

Query:  LIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        LIFQQQ+AGPIVFRVDSR ML S S K  PHVEDTI SLNYSF+LL+SGKAVFW+SPKRKEGMVELRLFEF
Subjt:  LIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

SwissProt top hits (accession and description | e value | % identity), each followed by its alignment
Q9M903 Protein TRIGALACTOSYLDIACYLGLYCEROL 4, chloroplastic | e value: 2.1e-77 | % identity: 35.85
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRFPATDWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRFPATDWWVGLV

Query:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEH-GE-RKGHRHKAMFYHKLPHHDINMEAGWPELFI
        GQF  ++ ++ I   KA    + S     L  + +   DK+L+  G CS+F  SP  +L +S + + G+  K  R KA+F H+ P H++  EA WP LF+
Subjt:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEH-GE-RKGHRHKAMFYHKLPHHDINMEAGWPELFI

Query:  DHKGQYWDVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVR
        D  G+YWDVP S+++DL+SL + SG  Y   LH N G P+ L+S   + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVR

Query:  LKEPHAAISGIVGGTFSTWFGGSDTVG-----TNGDGNLAIH--KKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLD-------ISSASAFAKRV
        L  PH A+SGI+G   +  FG +         + G G  ++H     S   AD  G    T Q+G F+K F DLTR  ARLD       ++ A++ A+ +
Subjt:  LKEPHAAISGIVGGTFSTWFGGSDTVG-----TNGDGNLAIH--KKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLD-------ISSASAFAKRV

Query:  FNGFKKSVDDMERSESSPRLNLIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFE
         N  + S++  ++    P + +  QQQ+ GP  F+V+S   +D  +G     V+ T++++ Y+ ++L S KAV  YSPK+ E MVELR FE
Subjt:  FNGFKKSVDDMERSESSPRLNLIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFE

Arabidopsis top hits (accession and description | e value | % identity), each followed by its alignment
AT2G44640.1 FUNCTIONS IN: molecular_function unknown | e value: 1.3e-151 | % identity: 57.42
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR
        MA L +A+DS FWD N+SSPQTL GTA++VPGEPFPLDGARASR+ RIQQ+SLL  GFPLGIIPS +P + K LGSFSL SLLL   + +WW+GLVGQF+
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFR

Query:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV
        PKKL + IKA++S A+  +L V+KD A+  +DK+L++ GL +Q +    SSL +STE  G++ G R+K M  H L  HD+ +EA WP+LF+D+KG++WDV
Subjt:  PKKLISSIKAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDV

Query:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNST---NGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHA
        PES+++D+SSL   SG+RYRFGLHK+ G P+ +N+    +G D P +LMPGLCAKAA S++ NR LWR +E K+   E+ DK  +     YD+RLKEPHA
Subjt:  PESISLDLSSLKSASGLRYRFGLHKNGGIPRALNST---NGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHA

Query:  AISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRL
        AISGIVG + + W          G G L   KKRSP++AD+FGS CYT Q GRF K +GDLTR+DAR+D+ SA A AK++F+    + DD   +  SPRL
Subjt:  AISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRL

Query:  NLIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF
        NLIFQQQVAGPIVF+VDS+F + +A       +ED IYSLNYS RLL+SGK V WYSPKRKEGM+ELR+FEF
Subjt:  NLIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFEF

AT3G06960.1 pigment defective 320 | e value: 1.5e-78 | % identity: 35.85
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRFPATDWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRFPATDWWVGLV

Query:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEH-GE-RKGHRHKAMFYHKLPHHDINMEAGWPELFI
        GQF  ++ ++ I   KA    + S     L  + +   DK+L+  G CS+F  SP  +L +S + + G+  K  R KA+F H+ P H++  EA WP LF+
Subjt:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEH-GE-RKGHRHKAMFYHKLPHHDINMEAGWPELFI

Query:  DHKGQYWDVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVR
        D  G+YWDVP S+++DL+SL + SG  Y   LH N G P+ L+S   + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVR

Query:  LKEPHAAISGIVGGTFSTWFGGSDTVG-----TNGDGNLAIH--KKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLD-------ISSASAFAKRV
        L  PH A+SGI+G   +  FG +         + G G  ++H     S   AD  G    T Q+G F+K F DLTR  ARLD       ++ A++ A+ +
Subjt:  LKEPHAAISGIVGGTFSTWFGGSDTVG-----TNGDGNLAIH--KKRSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLD-------ISSASAFAKRV

Query:  FNGFKKSVDDMERSESSPRLNLIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFE
         N  + S++  ++    P + +  QQQ+ GP  F+V+S   +D  +G     V+ T++++ Y+ ++L S KAV  YSPK+ E MVELR FE
Subjt:  FNGFKKSVDDMERSESSPRLNLIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYSFRLLQSGKAVFWYSPKRKEGMVELRLFE

AT3G06960.2 pigment defective 320 | e value: 3.1e-52 | % identity: 37.7
Query:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRFPATDWWVGLV
        M  +R   +   WDL++S+P TL GTA+AVP +P PL  +R +R  R +Q+          +IPS+SP    T     G FSLQ +L    + +W V L+
Subjt:  MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSP----TAQKELGSFSLQSLLLRFPATDWWVGLV

Query:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEH-GE-RKGHRHKAMFYHKLPHHDINMEAGWPELFI
        GQF  ++ ++ I   KA    + S     L  + +   DK+L+  G CS+F  SP  +L +S + + G+  K  R KA+F H+ P H++  EA WP LF+
Subjt:  GQFRPKKLISSI---KAELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEH-GE-RKGHRHKAMFYHKLPHHDINMEAGWPELFI

Query:  DHKGQYWDVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVR
        D  G+YWDVP S+++DL+SL + SG  Y   LH N G P+ L+S   + PP +L+PGL  K+A S+  N  LWR    K +  +            YDV 
Subjt:  DHKGQYWDVPESISLDLSSLKSASGLRYRFGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVR

Query:  LKEPHAAISGIVG
        L  PH A+SGI+G
Subjt:  LKEPHAAISGIVG


Sequences
CDS sequence
ATGGCGTATCTCAGGACCGCCATGGATTCCGCCTTCTGGGATTTGAACATTTCTTCCCCTCAAACCCTCGCCGGAACCGCCAAGGCCGTCCCCGGCGAACCATTTCCCCT
CGACGGAGCTCGAGCCAGCCGCGCCTTGCGGATTCAGCAAATTTCCCTCCTCGGCAATGGTTTTCCGCTCGGAATTATTCCTTCCTACTCTCCTACTGCACAGAAGGAGT
TAGGTTCCTTTTCTCTCCAGTCGCTCTTGCTCAGGTTTCCCGCCACTGATTGGTGGGTTGGATTGGTTGGCCAATTCCGTCCAAAGAAACTGATATCTTCTATAAAAGCC
GAACTTTCTGCTGCGGATAGCCTTGAGCTCCCTGTCTTGAAAGATGTTGCTAGACAGTTTCTGGACAAGACACTCTTTACATATGGATTATGCTCTCAGTTTTCTCCTAG
TCCCTTTTCATCTTTATTTATCAGCACAGAAGAGCATGGTGAGAGGAAAGGACATCGCCACAAAGCAATGTTTTACCACAAGCTTCCTCATCATGATATAAATATGGAAG
CAGGTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGACGTGCCTGAGTCTATATCTTTGGATCTTTCATCTCTTAAGTCTGCATCTGGTCTGCGATACCGG
TTTGGGTTGCATAAGAATGGTGGCATTCCCCGGGCTCTTAATTCTACCAATGGCGATGACCCACCTCTTGCTCTTATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTT
TGAAAAGAACAGGTACCTTTGGAGGGTAAAAGAAAGGAAACAAGACATGATTGAGAAAACAGACAAGGGAGAATGGTATTGGAGGCCATCATACGACGTGCGCCTTAAAG
AACCCCATGCAGCCATATCCGGAATCGTCGGTGGCACCTTTAGCACTTGGTTCGGAGGCAGTGACACGGTTGGGACCAATGGAGATGGAAACTTAGCTATCCATAAGAAA
AGAAGTCCATTGAATGCTGACCTTTTTGGCTCAATTTGCTATACTTTGCAACATGGGAGATTTAGAAAGCAATTTGGTGACCTCACGAGGTTAGATGCTCGGTTAGATAT
TTCGTCGGCTTCAGCATTTGCCAAAAGAGTTTTTAATGGTTTCAAGAAATCTGTTGATGATATGGAGAGATCAGAATCTTCCCCCAGACTGAATTTGATCTTTCAACAAC
AGGTCGCTGGCCCGATTGTCTTCCGTGTAGATTCCAGGTTTATGCTCGACTCTGCCTCCGGCAAGCGCAGTCCCCATGTCGAGGACACAATATACAGCCTAAACTATTCA
TTTAGGCTTCTTCAATCAGGCAAAGCGGTTTTCTGGTATTCTCCTAAAAGGAAAGAGGGGATGGTCGAGTTGCGCCTGTTCGAGTTTTGA
mRNA sequence
AGGCAACTCCCAAAAATGCAAGAGACAGCAACAGATAAGGGAAGAAGGTAAACGCACGGCCAGCCTACACTCCTTCCCAATTCCTTCTGAGCTTCCAAGAAAGGCATCAA
TGGCGTATCTCAGGACCGCCATGGATTCCGCCTTCTGGGATTTGAACATTTCTTCCCCTCAAACCCTCGCCGGAACCGCCAAGGCCGTCCCCGGCGAACCATTTCCCCTC
GACGGAGCTCGAGCCAGCCGCGCCTTGCGGATTCAGCAAATTTCCCTCCTCGGCAATGGTTTTCCGCTCGGAATTATTCCTTCCTACTCTCCTACTGCACAGAAGGAGTT
AGGTTCCTTTTCTCTCCAGTCGCTCTTGCTCAGGTTTCCCGCCACTGATTGGTGGGTTGGATTGGTTGGCCAATTCCGTCCAAAGAAACTGATATCTTCTATAAAAGCCG
AACTTTCTGCTGCGGATAGCCTTGAGCTCCCTGTCTTGAAAGATGTTGCTAGACAGTTTCTGGACAAGACACTCTTTACATATGGATTATGCTCTCAGTTTTCTCCTAGT
CCCTTTTCATCTTTATTTATCAGCACAGAAGAGCATGGTGAGAGGAAAGGACATCGCCACAAAGCAATGTTTTACCACAAGCTTCCTCATCATGATATAAATATGGAAGC
AGGTTGGCCAGAGCTCTTCATTGATCATAAAGGTCAATATTGGGACGTGCCTGAGTCTATATCTTTGGATCTTTCATCTCTTAAGTCTGCATCTGGTCTGCGATACCGGT
TTGGGTTGCATAAGAATGGTGGCATTCCCCGGGCTCTTAATTCTACCAATGGCGATGACCCACCTCTTGCTCTTATGCCTGGATTATGTGCAAAGGCTGCATTCTCTTTT
GAAAAGAACAGGTACCTTTGGAGGGTAAAAGAAAGGAAACAAGACATGATTGAGAAAACAGACAAGGGAGAATGGTATTGGAGGCCATCATACGACGTGCGCCTTAAAGA
ACCCCATGCAGCCATATCCGGAATCGTCGGTGGCACCTTTAGCACTTGGTTCGGAGGCAGTGACACGGTTGGGACCAATGGAGATGGAAACTTAGCTATCCATAAGAAAA
GAAGTCCATTGAATGCTGACCTTTTTGGCTCAATTTGCTATACTTTGCAACATGGGAGATTTAGAAAGCAATTTGGTGACCTCACGAGGTTAGATGCTCGGTTAGATATT
TCGTCGGCTTCAGCATTTGCCAAAAGAGTTTTTAATGGTTTCAAGAAATCTGTTGATGATATGGAGAGATCAGAATCTTCCCCCAGACTGAATTTGATCTTTCAACAACA
GGTCGCTGGCCCGATTGTCTTCCGTGTAGATTCCAGGTTTATGCTCGACTCTGCCTCCGGCAAGCGCAGTCCCCATGTCGAGGACACAATATACAGCCTAAACTATTCAT
TTAGGCTTCTTCAATCAGGCAAAGCGGTTTTCTGGTATTCTCCTAAAAGGAAAGAGGGGATGGTCGAGTTGCGCCTGTTCGAGTTTTGACTTCGACATCATTAAATTCAG
TTCTAGTTCAGTTGATGCATTCAATTCTTAGCTTTTTGACAACGAAATCGGCTATATAGAGTTAGTTTAGCACTTTGAAGCCTTTTTTCCCCATTATTTAGTGTTGCACA
ACTCTTGCAGATGTTAATATATAGGGGTGCTGTACTTGTTTATGTCACAGAGCTGAGGCTTCAAACAGTTGTTGCTCAAAAAATAAGGGCATTTGTTATGAATGTTTGGT
TATTTATCTATAAATGATTTTGAACTTGC
Protein sequence
MAYLRTAMDSAFWDLNISSPQTLAGTAKAVPGEPFPLDGARASRALRIQQISLLGNGFPLGIIPSYSPTAQKELGSFSLQSLLLRFPATDWWVGLVGQFRPKKLISSIKA
ELSAADSLELPVLKDVARQFLDKTLFTYGLCSQFSPSPFSSLFISTEEHGERKGHRHKAMFYHKLPHHDINMEAGWPELFIDHKGQYWDVPESISLDLSSLKSASGLRYR
FGLHKNGGIPRALNSTNGDDPPLALMPGLCAKAAFSFEKNRYLWRVKERKQDMIEKTDKGEWYWRPSYDVRLKEPHAAISGIVGGTFSTWFGGSDTVGTNGDGNLAIHKK
RSPLNADLFGSICYTLQHGRFRKQFGDLTRLDARLDISSASAFAKRVFNGFKKSVDDMERSESSPRLNLIFQQQVAGPIVFRVDSRFMLDSASGKRSPHVEDTIYSLNYS
FRLLQSGKAVFWYSPKRKEGMVELRLFEF
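
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS. The sketch below is one way to do that, assuming the standard nuclear genetic code applies to this gene; the function name translate is hypothetical and the input is only the first 30 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Hypothetical helper; assumes an unambiguous A/C/G/T sequence and
    the standard genetic code. Trailing bases that do not fill a
    complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS listed in this record.
cds_start = "ATGGCGTATCTCAGGACCGCCATGGATTCC"
print(translate(cds_start))  # MAYLRTAMDS
```

The printed peptide matches the first ten residues of the protein sequence in this record. To check the whole entry, the CDS lines would need to be joined into one string before being passed to translate.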