; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028935 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028935
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAcyl-acyl carrier protein thioesterase ATL3
Genome locationtig00153210:1762646..1780946
RNA-Seq ExpressionSgr028935
SyntenySgr028935
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006016 - UspA
IPR006683 - Thioesterase domain
IPR012438 - Protein of unknown function DUF1639
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
IPR029069 - HotDog domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034568.1 acyl-acyl carrier protein thioesterase ATL3 [Cucumis melo var. makuwa]3.7e-15658.72Show/hide
Query:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL
        MVCADS   ++  +++MAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSD        D+PP  DRR S A +FNC KF      T++P  FKDS KR 
Subjt:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL

Query:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGED-DDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAPID
        R SKSKI   HDNY+GDE IAAVREKLMIDLKTAAD+MKVAFWRDGVV+ +D DD DMT  EK++PAA  +AS  A    +ELRPW+LRVR+ ASKAPID
Subjt:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGED-DDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAPID

Query:  A---SAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLL
              GG GG K LKI    EK P RNSPLRSGDGG   +S GRR+ TEKKEREKFSVSLSKKEIEEDFM M+ RRPPRRPKKRPRIVQN MD      
Subjt:  A---SAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLL

Query:  LCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGI
                  T F  L+   I                      P   +V    E+G+R                          W S +P       + I
Subjt:  LCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGI

Query:  PICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSS-SSQTFDRKGSKGMDGFLD
             +   ++P+ +   + + DD +   AMLQASI P +SPPP+   KT G P P  +V+P  R STSLRL+S PVTRSS SS +FD K  KGMD FLD
Subjt:  PICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSS-SSQTFDRKGSKGMDGFLD

Query:  VELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME
        +ELKVRDYELDQFGVVNNAVYASYCQHGRHELLE+IGLSPDAVAR G ALALSELSLKFLAPLRSGD+FV+KVRISG+S ARFYIDHFIFKLPNME
Subjt:  VELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME

KAE8648792.1 hypothetical protein Csa_009266 [Cucumis sativus]4.3e-15758.17Show/hide
Query:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL
        MVCADS   ++  ++SMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSD        D+PP  DRR S A +FNC KF      T++P  FKDS KR 
Subjt:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL

Query:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVV---EGEDDDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAP
        R SKSKI   HDNY+GDE IAAVREKLMIDLKTAAD+MKVAFWRDGVV   +G+ DD D+T  EK++PAAA +AS +A    +EL+PW+LRVR+AA KA 
Subjt:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVV---EGEDDDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAP

Query:  ID------ASAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDV
        ID         GG GG K LKI    EK   RNSPLRSGDGG   +S GRR+ TEKKEREKFSVSLSKKEIEEDFM M+ RRPPRRPKKRPRIVQN MD 
Subjt:  ID------ASAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDV

Query:  GVLLLLCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELH
                       T F  L+   I                      P   +V    E+G+                     + R+R      P     
Subjt:  GVLLLLCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELH

Query:  EQDGIPICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMD
                +N     +P+ IL  SA  DD +   AMLQ+SI P +SP P+   KT G P+P  +V+PI RPSTSLRLRS P TRSS S +FD K  KGMD
Subjt:  EQDGIPICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMD

Query:  GFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME
         FLD+ELKVRDYELDQFGVVNNAVYASYCQHGRHELLE+IGLSPDAVAR G ALALSELSLKFLAPLRSGD+FV+KVRISG+S ARFYIDH IFKLPNME
Subjt:  GFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME

TYK09120.1 acyl-acyl carrier protein thioesterase ATL3 [Cucumis melo var. makuwa]3.7e-15658.72Show/hide
Query:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL
        MVCADS   ++  +++MAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSD        D+PP  DRR S A +FNC KF      T++P  FKDS KR 
Subjt:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL

Query:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGED-DDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAPID
        R SKSKI   HDNY+GDE IAAVREKLMIDLKTAAD+MKVAFWRDGVV+ +D DD DMT  EK++PAA  +AS  A    +ELRPW+LRVR+ ASKAPID
Subjt:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGED-DDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAPID

Query:  A---SAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLL
              GG GG K LKI    EK P RNSPLRSGDGG   +S GRR+ TEKKEREKFSVSLSKKEIEEDFM M+ RRPPRRPKKRPRIVQN MD      
Subjt:  A---SAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLL

Query:  LCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGI
                  T F  L+   I                      P   +V    E+G+R                          W S +P       + I
Subjt:  LCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGI

Query:  PICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSS-SSQTFDRKGSKGMDGFLD
             +   ++P+ +   + + DD +   AMLQASI P +SPPP+   KT G P P  +V+P  R STSLRL+S PVTRSS SS +FD K  KGMD FLD
Subjt:  PICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSS-SSQTFDRKGSKGMDGFLD

Query:  VELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME
        +ELKVRDYELDQFGVVNNAVYASYCQHGRHELLE+IGLSPDAVAR G ALALSELSLKFLAPLRSGD+FV+KVRISG+S ARFYIDHFIFKLPNME
Subjt:  VELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME

XP_008446606.1 PREDICTED: uncharacterized protein LOC103489289 [Cucumis melo]2.4e-13981.68Show/hide
Query:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQEIVDDSDSDDEFSVDDSASS
        ME+PVINRFSGLESGMSSLPNPTLLPQIL S SGF+T+S+SLDLWKWSAVIIAVVATFSGVINR+KLLFVIFRRQKRILQEIV DSDSD E+SVDDSA+S
Subjt:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQEIVDDSDSDDEFSVDDSASS

Query:  VSSSWSEFEED--DEPATSSR-SWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDHRQSDANEIR
        VSS WSEFE+D  DEPA+SS  SWD  DRDF VRGSDYY+ND +TKRTLRLR +RSFHNQDG+ +QFSW DF GGKSVVKLWDNLR EFDH  SD NEIR
Subjt:  VSSSWSEFEED--DEPATSSR-SWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDHRQSDANEIR

Query:  VYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPL
        V+DVIKEQ IGSI+AG+SQIAP+LTST+LLS   +VS K SVN+WDTRVG QIPALIAEWKPV GKV GV F+ DQKVYIR+EDAGK+TVGDVRNVKSPL
Subjt:  VYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPL

Query:  ENLTAADVETWWDADAVLVSAE
        ENLTAAD+ETW+DADAV+VSAE
Subjt:  ENLTAADVETWWDADAVLVSAE

XP_011655777.1 uncharacterized protein LOC105435585 [Cucumis sativus]1.3e-13780.75Show/hide
Query:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQEIVDDSDSDDEFSVDDSASS
        ME+PVINRFSGLESGMSSLPNPTLLPQIL S SGF+TLS+SLDLWKWSAVIIAVVATFSGVINR+KLLFVIFRRQKRILQEIV DSDSD E+SVDDSASS
Subjt:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQEIVDDSDSDDEFSVDDSASS

Query:  VSSSWSEFEED--DEPATSSR-SWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDHRQSDANEIR
        VSS WSEFEED  DEPA+SS  SWD  D+DF VRGSDYY+ND +TK+TLRLR +RSFHNQDG+G+QFSW DF GGKSVVKLWDNLR EFDH  SD NEIR
Subjt:  VSSSWSEFEED--DEPATSSR-SWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDHRQSDANEIR

Query:  VYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPL
        V+DVIKEQ IGSILAG+SQIAP+LTST+LLS   +VS K SVN+WDTR+G QIPALIAEWKPV GKV GV F+G+QKVY+R+E   K+TVGDVRN+KSPL
Subjt:  VYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPL

Query:  ENLTAADVETWWDADAVLVSAE
        ENLTAAD+ETW+DADAV+VSAE
Subjt:  ENLTAADVETWWDADAVLVSAE

TrEMBL top hitse value%identityAlignment
A0A0A0KT80 4HBT domain-containing protein2.3e-14056.69Show/hide
Query:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL
        MVCADS   ++  ++SMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSD        D+PP  DRR S A +FNC KF      T++P  FKDS KR 
Subjt:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL

Query:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVV---EGEDDDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAP
        R SKSKI   HDNY+GDE IAAVREKLMIDLKTAAD+MKVAFWRDGVV   +G+ DD D+T  EK++PAAA +AS +A    +EL+PW+LRVR+AA KA 
Subjt:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVV---EGEDDDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAP

Query:  ID------ASAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDV
        ID         GG GG K LKI    EK   RNSPLRSGDGG   +S GRR+ TEKKEREKFSVSLSKKEIEEDFM M+ RRPPRRPKKRPRIVQN MD 
Subjt:  ID------ASAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDV

Query:  GVLLLLCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELH
                       T F  L+   I                      P   +V    E+G+                     + R+R      P     
Subjt:  GVLLLLCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELH

Query:  EQDGIPICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMD
                +N     +P+ IL  SA  DD +   AMLQ+SI P +SP P+   KT G P+P  +V+PI RPSTSLRLRS P TRSS S +FD K  KGMD
Subjt:  EQDGIPICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMD

Query:  GFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLR
         FLD+ELKVRDYELDQFGVVNNAVYASYCQHGRHELLE+IGLSPDAVAR G ALALSELSLKFLAPLR
Subjt:  GFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLR

A0A1S3BFG2 uncharacterized protein LOC1034892891.2e-13981.68Show/hide
Query:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQEIVDDSDSDDEFSVDDSASS
        ME+PVINRFSGLESGMSSLPNPTLLPQIL S SGF+T+S+SLDLWKWSAVIIAVVATFSGVINR+KLLFVIFRRQKRILQEIV DSDSD E+SVDDSA+S
Subjt:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQEIVDDSDSDDEFSVDDSASS

Query:  VSSSWSEFEED--DEPATSSR-SWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDHRQSDANEIR
        VSS WSEFE+D  DEPA+SS  SWD  DRDF VRGSDYY+ND +TKRTLRLR +RSFHNQDG+ +QFSW DF GGKSVVKLWDNLR EFDH  SD NEIR
Subjt:  VSSSWSEFEED--DEPATSSR-SWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDHRQSDANEIR

Query:  VYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPL
        V+DVIKEQ IGSI+AG+SQIAP+LTST+LLS   +VS K SVN+WDTRVG QIPALIAEWKPV GKV GV F+ DQKVYIR+EDAGK+TVGDVRNVKSPL
Subjt:  VYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPL

Query:  ENLTAADVETWWDADAVLVSAE
        ENLTAAD+ETW+DADAV+VSAE
Subjt:  ENLTAADVETWWDADAVLVSAE

A0A5A7STM5 Uncharacterized protein1.2e-13981.68Show/hide
Query:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQEIVDDSDSDDEFSVDDSASS
        ME+PVINRFSGLESGMSSLPNPTLLPQIL S SGF+T+S+SLDLWKWSAVIIAVVATFSGVINR+KLLFVIFRRQKRILQEIV DSDSD E+SVDDSA+S
Subjt:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQEIVDDSDSDDEFSVDDSASS

Query:  VSSSWSEFEED--DEPATSSR-SWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDHRQSDANEIR
        VSS WSEFE+D  DEPA+SS  SWD  DRDF VRGSDYY+ND +TKRTLRLR +RSFHNQDG+ +QFSW DF GGKSVVKLWDNLR EFDH  SD NEIR
Subjt:  VSSSWSEFEED--DEPATSSR-SWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDHRQSDANEIR

Query:  VYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPL
        V+DVIKEQ IGSI+AG+SQIAP+LTST+LLS   +VS K SVN+WDTRVG QIPALIAEWKPV GKV GV F+ DQKVYIR+EDAGK+TVGDVRNVKSPL
Subjt:  VYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPL

Query:  ENLTAADVETWWDADAVLVSAE
        ENLTAAD+ETW+DADAV+VSAE
Subjt:  ENLTAADVETWWDADAVLVSAE

A0A5A7SVA0 Acyl-acyl carrier protein thioesterase ATL31.8e-15658.72Show/hide
Query:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL
        MVCADS   ++  +++MAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSD        D+PP  DRR S A +FNC KF      T++P  FKDS KR 
Subjt:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL

Query:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGED-DDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAPID
        R SKSKI   HDNY+GDE IAAVREKLMIDLKTAAD+MKVAFWRDGVV+ +D DD DMT  EK++PAA  +AS  A    +ELRPW+LRVR+ ASKAPID
Subjt:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGED-DDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAPID

Query:  A---SAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLL
              GG GG K LKI    EK P RNSPLRSGDGG   +S GRR+ TEKKEREKFSVSLSKKEIEEDFM M+ RRPPRRPKKRPRIVQN MD      
Subjt:  A---SAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLL

Query:  LCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGI
                  T F  L+   I                      P   +V    E+G+R                          W S +P       + I
Subjt:  LCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGI

Query:  PICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSS-SSQTFDRKGSKGMDGFLD
             +   ++P+ +   + + DD +   AMLQASI P +SPPP+   KT G P P  +V+P  R STSLRL+S PVTRSS SS +FD K  KGMD FLD
Subjt:  PICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSS-SSQTFDRKGSKGMDGFLD

Query:  VELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME
        +ELKVRDYELDQFGVVNNAVYASYCQHGRHELLE+IGLSPDAVAR G ALALSELSLKFLAPLRSGD+FV+KVRISG+S ARFYIDHFIFKLPNME
Subjt:  VELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME

A0A5D3CCR4 Acyl-acyl carrier protein thioesterase ATL31.8e-15658.72Show/hide
Query:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL
        MVCADS   ++  +++MAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSD        D+PP  DRR S A +FNC KF      T++P  FKDS KR 
Subjt:  MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDG------GDVPPHADRR-SPAVKFNCSKFQRRDSETERPVFFKDSGKRL

Query:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGED-DDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAPID
        R SKSKI   HDNY+GDE IAAVREKLMIDLKTAAD+MKVAFWRDGVV+ +D DD DMT  EK++PAA  +AS  A    +ELRPW+LRVR+ ASKAPID
Subjt:  RGSKSKIERSHDNYEGDEGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGED-DDDDMTKTEKELPAAAASASASA----EELRPWNLRVRRAASKAPID

Query:  A---SAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLL
              GG GG K LKI    EK P RNSPLRSGDGG   +S GRR+ TEKKEREKFSVSLSKKEIEEDFM M+ RRPPRRPKKRPRIVQN MD      
Subjt:  A---SAGGKGGSKALKI----EKTPNRNSPLRSGDGG--ARSPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLL

Query:  LCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGI
                  T F  L+   I                      P   +V    E+G+R                          W S +P       + I
Subjt:  LCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEVGLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGI

Query:  PICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSS-SSQTFDRKGSKGMDGFLD
             +   ++P+ +   + + DD +   AMLQASI P +SPPP+   KT G P P  +V+P  R STSLRL+S PVTRSS SS +FD K  KGMD FLD
Subjt:  PICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSS-SSQTFDRKGSKGMDGFLD

Query:  VELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME
        +ELKVRDYELDQFGVVNNAVYASYCQHGRHELLE+IGLSPDAVAR G ALALSELSLKFLAPLRSGD+FV+KVRISG+S ARFYIDHFIFKLPNME
Subjt:  VELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME

SwissProt top hitse value%identityAlignment
F4HX80 Acyl-acyl carrier protein thioesterase ATL4, chloroplastic3.2e-3859.48Show/hide
Query:  VSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSSSSQTF-DRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVA
        V+ T    + +  P  W   R    L LRS+   ++    TF D KG K M  F +VELKVRDYELDQFGVVNNAVYA+YCQHG HE LE+IG++ D VA
Subjt:  VSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSSSSQTF-DRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVA

Query:  RSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNMEV
        RSG ALA+SEL++ FLAPLRSGD+FV+KV IS  SAAR Y DH I KLPN EV
Subjt:  RSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNMEV

Q8W583 Acyl-acyl carrier protein thioesterase ATL3, chloroplastic9.7e-4377.27Show/hide
Query:  FDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYID
        FD KG KGM  F +VELKVRDYELDQFGVVNNAVYA+YCQHGRHE LE+IG++ D VARSG ALA+SEL++KFL+PLRSGD+FV+K RISG SAAR Y D
Subjt:  FDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYID

Query:  HFIFKLPNME
        HFIFKLPN E
Subjt:  HFIFKLPNME

Q9C7I5 Acyl-acyl carrier protein thioesterase ATL1, chloroplastic1.2e-3759.33Show/hide
Query:  KTTGLPIPS-PLVWPI--TRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARS
        K TG   P+  +V+P   +RP   L LRS+   +  S   F ++G KGM+G  ++ELKVRDYELDQFGVVNNAVYA+YCQHG+HE +ETIG++ D V+RS
Subjt:  KTTGLPIPS-PLVWPI--TRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARS

Query:  GAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME
        G ALA+SEL++KFLAPLRSG +FV+K RISG S  R Y + FIFKLPN E
Subjt:  GAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME

Q9C7I8 Acyl-acyl carrier protein thioesterase ATL2, chloroplastic1.0e-3661.11Show/hide
Query:  LRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFV
        L LRS+ + +  +    + +GS G+ GF ++ELKVRDYELDQFGVVNNAVYA+YCQHGRHE +++IG++ + V+RSG ALA+ EL++KFLAPLRSG RFV
Subjt:  LRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFV

Query:  IKVRISGMSAARFYIDHFIFKLPNME
        +K RISG+S  R Y + FIFKLPN E
Subjt:  IKVRISGMSAARFYIDHFIFKLPNME

S4TE15 Acyl-acyl carrier protein thioesterase TE3, chloroplastic1.6e-4570.71Show/hide
Query:  SPLVWPITRPSTSLRLRSSP-VTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELS
        S + +P+TR     RLR  P   R  S+  FD +G KGM  F +VELKVRDYELDQ+GVVNNAVYASYCQHGRHELLE+ GLS DAVAR+G ALALSELS
Subjt:  SPLVWPITRPSTSLRLRSSP-VTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELS

Query:  LKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME
        LKFLAPLRSGD+FV+KVR+SG SAAR Y DH IFKLPN E
Subjt:  LKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME

Arabidopsis top hitse value%identityAlignment
AT1G35250.1 Thioesterase superfamily protein7.3e-3861.11Show/hide
Query:  LRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFV
        L LRS+ + +  +    + +GS G+ GF ++ELKVRDYELDQFGVVNNAVYA+YCQHGRHE +++IG++ + V+RSG ALA+ EL++KFLAPLRSG RFV
Subjt:  LRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFV

Query:  IKVRISGMSAARFYIDHFIFKLPNME
        +K RISG+S  R Y + FIFKLPN E
Subjt:  IKVRISGMSAARFYIDHFIFKLPNME

AT1G35290.1 Thioesterase superfamily protein8.7e-3959.33Show/hide
Query:  KTTGLPIPS-PLVWPI--TRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARS
        K TG   P+  +V+P   +RP   L LRS+   +  S   F ++G KGM+G  ++ELKVRDYELDQFGVVNNAVYA+YCQHG+HE +ETIG++ D V+RS
Subjt:  KTTGLPIPS-PLVWPI--TRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARS

Query:  GAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME
        G ALA+SEL++KFLAPLRSG +FV+K RISG S  R Y + FIFKLPN E
Subjt:  GAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNME

AT1G68260.1 Thioesterase superfamily protein6.9e-4477.27Show/hide
Query:  FDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYID
        FD KG KGM  F +VELKVRDYELDQFGVVNNAVYA+YCQHGRHE LE+IG++ D VARSG ALA+SEL++KFL+PLRSGD+FV+K RISG SAAR Y D
Subjt:  FDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYID

Query:  HFIFKLPNME
        HFIFKLPN E
Subjt:  HFIFKLPNME

AT1G68280.1 Thioesterase superfamily protein2.3e-3959.48Show/hide
Query:  VSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSSSSQTF-DRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVA
        V+ T    + +  P  W   R    L LRS+   ++    TF D KG K M  F +VELKVRDYELDQFGVVNNAVYA+YCQHG HE LE+IG++ D VA
Subjt:  VSRTKTTGLPIPSPLVWPITRPSTSLRLRSSPVTRSSSSQTF-DRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVA

Query:  RSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNMEV
        RSG ALA+SEL++ FLAPLRSGD+FV+KV IS  SAAR Y DH I KLPN EV
Subjt:  RSGAALALSELSLKFLAPLRSGDRFVIKVRISGMSAARFYIDHFIFKLPNMEV

AT1G68440.1 unknown protein7.6e-3535.03Show/hide
Query:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQE--IVDDSDSDDEFSVDDSA
        MEVPVINR    E G++S+ +P+ L + + + SG   L Q+   WKW A+IIA +A F+  +++L  L V  R+    +    + DD DSD     D S 
Subjt:  MEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRILQE--IVDDSDSDDEFSVDDSA

Query:  SSVSSSWSEFEEDDEPATSSRSWD--FNDR----DFLVRGSDYYMNDLETKRT-----LRLRQQRSFHNQDGDGDQFSWTDFG--GGKSVVKLWDNLRLE
        SS  SS  E +E+DE        D  FN R     F VRGSDYY +D +         +  R   SF      GD FSW D G  G   VVKLWD+L ++
Subjt:  SSVSSSWSEFEEDDEPATSSRSWD--FNDR----DFLVRGSDYYMNDLETKRT-----LRLRQQRSFHNQDGDGDQFSWTDFG--GGKSVVKLWDNLRLE

Query:  -FDHRQSDANEIRVYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKP---VEGKVHGVHFAGDQKVYIRDED
          DH    A  ++ Y+                   + +S    +A       V V   D R G ++PAL+AEW+    + G + GV   G +KVY+RD+ 
Subjt:  -FDHRQSDANEIRVYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKP---VEGKVHGVHFAGDQKVYIRDED

Query:  AGKLTVGDVRNVKSPLENLTAADVETWWDADAVL
        +G++ VGD+R     L +LT  + ETWWDAD ++
Subjt:  AGKLTVGDVRNVKSPLENLTAADVETWWDADAVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTGTGCCGATTCGTCCAAGGAGAAACGAAGAGTAGTAGACTCCATGGCCATGGGACTCGAGAGATCGAAGCCGCTCCACAATTTCACGCTCCCGTTCTTGAAGTG
GGGAAATCAGAGGTACCTCCGCTGTATGAAATTGGACTCCGATGGCGGTGACGTACCGCCGCACGCTGACCGTCGATCACCTGCCGTCAAGTTCAACTGTTCGAAGTTTC
AGCGGCGAGATTCTGAGACGGAGAGGCCTGTGTTCTTTAAGGATTCGGGGAAAAGGCTGAGAGGTTCGAAGTCGAAGATCGAGAGGAGCCACGATAATTACGAAGGCGAT
GAAGGGATAGCGGCGGTGAGAGAGAAACTCATGATTGATCTAAAAACCGCGGCGGATAAGATGAAGGTTGCATTTTGGAGAGATGGGGTGGTCGAGGGAGAAGACGACGA
CGACGACATGACCAAAACGGAGAAGGAATTGCCTGCGGCGGCAGCGTCGGCGTCGGCGTCGGCGGAGGAGTTAAGACCCTGGAATTTGAGGGTAAGGAGGGCGGCGTCGA
AGGCTCCGATTGATGCAAGTGCAGGAGGTAAAGGTGGTAGTAAAGCCTTGAAGATCGAGAAGACACCGAATCGCAATTCGCCATTGAGGAGTGGTGACGGCGGCGCAAGG
TCGCCAGGGCGGCGAGTGGGAACGGAAAAGAAGGAAAGAGAGAAGTTCTCGGTATCACTTTCCAAAAAGGAGATCGAGGAGGATTTCATGGTCATGATGGGACGCAGGCC
GCCGAGGAGGCCCAAGAAGAGGCCCAGGATTGTACAGAATCACATGGATGTAGGGGTTCTTCTTCTTCTGTGTGACCTCTTTTTTACCTTCTTTGAAACTGGGTTTTCAA
CTCTTTATCCCTTCTCCATTGCAGACGCTTTTCCCCGGGCTGTGGTTGACGGAGATCACCGCCGATCTCTACGACGTGCCGGAAATCCCCCAAAACGGAAAGTTGAGGTT
GGTTTGGAAGATGGTGAGCGGCGGTCCAGGATGGGTTCGGGAGGAGAGGTTAAGTTGGAGGAGCCTTCAGCCATGGATTCAATGCTTCGTTACAGATCGTGGGTCAGTGT
TTCTCCAGAAGCAGAATTACACGAGCAGGACGGAATACCCATCTGCATTAATGTTCACGCCAACGTTTCCCCGGTTCCCATTCTCATCTCATCCGCCGCCGACGACGACA
GAAGCACAAGTTCCGCGATGTTGCAGGCTTCCATCAATCCGTTCAAATCTCCGCCTCCGGTCTCACGTACCAAGACTACCGGACTCCCAATCCCTTCACCACTTGTCTGG
CCCATCACCCGCCCGTCGACATCGTTACGCCTACGATCGTCCCCAGTGACGAGAAGCTCTAGCTCCCAAACATTTGATCGCAAGGGCAGCAAAGGGATGGATGGCTTCCT
AGATGTTGAGCTCAAAGTTCGTGATTATGAGTTGGATCAGTTCGGTGTCGTGAACAATGCTGTTTATGCAAGTTATTGCCAACATGGTCGTCATGAACTGCTGGAAACTA
TAGGCCTCAGTCCTGATGCAGTTGCTCGCAGTGGGGCTGCATTAGCACTATCAGAGTTGTCTTTGAAGTTCCTTGCACCACTTAGAAGTGGAGACAGATTTGTTATTAAG
GTGAGGATCTCAGGCATGTCAGCGGCTCGCTTCTACATTGATCATTTTATTTTCAAGCTTCCAAATATGGAGGTTGGTAATGGAAACATTAATTTGCATCTCAGTTTAAT
AACCTTGCCTGGAAAGCCTGGTAACTTCATAATCCACTCTGTCTTATTTTCTTTCCCTAGCCAATCTTGGAGGCAAAGTGTACAGCAGTTTGGCTTAATGAAAATTACCG
TCCCGTGCGTATCCCACAAGAATTCATGTCCAAATTTGCTCAATTTCTTCTGGATCACGAGTCGAAGTGATCAACTTTGGGGCGATTCTGTTATGTTCTGGAGAAGGCAA
AAGAGTTCCTCCTCTTGTTCCCTTAGCAGAATGGCGGAAGCAAAATTAAACCCAGCAGTCGAGAAGAGGGTGATGGTGGCCATAGACGAGAGTGAGTGTAGCTACTATGC
GCTAATCTGGGTGCTCGAAAATCTTAAAGAATCCATAGCCGAGTCCCCCCTTTTCGTCTTCACGGCTCTACCTCCGCCCACCAATTATACCTTCGGTGCTGGTGCATCTC
TTGGCCTTGTACGCACGTATTGCGCTGTTCCATCCAATTCGGAGTTAGGTAATTCGATCCAAGAGAATGATAAGAAAGTTAGATGCGCCCTCCTCGAGAAAGCAAAGGCT
ATATGTGCTGAAAGAGGGGTGGCTGCTATATCCATCACAGAGGTTGGGGATCCTGGAACAACCATATGTGATACAGTTGAAACGCTCAATATAAATTTGCTTGTTTTAGG
TGATCGTGGCCTTGGGAGAATTAAGAGACGCAGGTTCACTACTGGTGCACGGGAGAAGAAATATTTATCTGCAAAGCAATTTTTTAAGTTAAAAACTCTTCTGCCCAAAG
AACAACTCAACCAAGCTTTGTATGCAGTCGACCTTCTAATTTTGCTTCCTGGTGTACCAGTGAACGTCAATCATGTTTCCTCCCCTGTCTGTCGTAAAGCCCACATGCAG
AGGCTGCATTTTACACTGAGAAGGTTGAGTTTGGTAGCTCAGAAGCTGGGAAGTATAGTTGTTAATGGCCCCTGCAACGCACCGTCCTATCTCTCCAACTCCATCTTTGC
AGTCCCCTTAAGATTGAGATCCCAACTGCAAACGCTCGCTAAATCACCAATTGCGGAGGCTGGCAACAGTTGCTGCACCGCATCTGCTGCTGCTCTGCTCAACCGTGACT
GTTGCAGCAAATCAAAGACCAAACTGATCTTCATCAAAGAGGGTGCAAAGCCAAAAGGCCAAAGTGGGGGGACTCCTCCTAGTACCGGCCACCACCACCGTCAAACAAAC
ATATCAATCATGCAGGTGGGTCCCCACCAGAGCTTAGAAGAACCGACGGATCACTGCGCCCGCGATGATCTGTCCCCACCCAGTACTGAGGTTGAGATGGAGGTGCCTGT
GATTAACAGGTTTAGTGGGTTGGAATCGGGGATGAGTTCCTTGCCGAACCCGACTCTCTTGCCCCAGATTTTGGGTTCGCGCTCTGGATTTGAAACCCTTTCTCAATCCC
TCGATCTCTGGAAATGGAGTGCTGTGATTATCGCCGTTGTTGCTACTTTTAGTGGCGTTATCAATCGGCTTAAGCTTTTGTTCGTGATTTTCCGACGCCAGAAACGTATT
CTTCAAGAGATTGTTGATGATTCCGACTCCGACGATGAGTTCTCCGTTGATGATTCTGCCAGCTCCGTTTCGTCGTCGTGGTCTGAGTTCGAGGAAGACGACGAGCCCGC
CACGTCTTCCCGAAGTTGGGACTTCAATGACCGGGATTTCCTCGTCAGAGGCTCAGATTATTACATGAATGATCTCGAAACGAAACGGACTCTGAGACTCAGGCAGCAGC
GGAGCTTCCACAACCAAGACGGAGATGGCGACCAGTTCTCGTGGACCGATTTTGGCGGAGGCAAGAGCGTTGTGAAGTTGTGGGACAACCTTAGATTGGAGTTCGATCAC
CGTCAGTCCGACGCGAACGAGATTCGTGTCTACGATGTAATCAAAGAACAAAAAATAGGCTCCATTCTCGCCGGAGAATCACAGATAGCACCAGCGTTGACGTCGACAGT
ATTGTTGTCGGCGGCGGCGGATGTTTCAGGTAAGGTGTCCGTAAATCTCTGGGACACACGCGTGGGTTCTCAAATTCCGGCATTAATAGCCGAGTGGAAGCCGGTGGAAG
GCAAGGTCCACGGGGTCCATTTTGCCGGCGACCAAAAGGTTTATATCAGAGACGAAGACGCAGGAAAGCTAACGGTCGGTGACGTGAGAAACGTGAAGTCGCCGCTGGAG
AATTTAACGGCGGCGGATGTGGAGACTTGGTGGGACGCCGACGCGGTTCTGGTTTCGGCGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGTGTGCCGATTCGTCCAAGGAGAAACGAAGAGTAGTAGACTCCATGGCCATGGGACTCGAGAGATCGAAGCCGCTCCACAATTTCACGCTCCCGTTCTTGAAGTG
GGGAAATCAGAGGTACCTCCGCTGTATGAAATTGGACTCCGATGGCGGTGACGTACCGCCGCACGCTGACCGTCGATCACCTGCCGTCAAGTTCAACTGTTCGAAGTTTC
AGCGGCGAGATTCTGAGACGGAGAGGCCTGTGTTCTTTAAGGATTCGGGGAAAAGGCTGAGAGGTTCGAAGTCGAAGATCGAGAGGAGCCACGATAATTACGAAGGCGAT
GAAGGGATAGCGGCGGTGAGAGAGAAACTCATGATTGATCTAAAAACCGCGGCGGATAAGATGAAGGTTGCATTTTGGAGAGATGGGGTGGTCGAGGGAGAAGACGACGA
CGACGACATGACCAAAACGGAGAAGGAATTGCCTGCGGCGGCAGCGTCGGCGTCGGCGTCGGCGGAGGAGTTAAGACCCTGGAATTTGAGGGTAAGGAGGGCGGCGTCGA
AGGCTCCGATTGATGCAAGTGCAGGAGGTAAAGGTGGTAGTAAAGCCTTGAAGATCGAGAAGACACCGAATCGCAATTCGCCATTGAGGAGTGGTGACGGCGGCGCAAGG
TCGCCAGGGCGGCGAGTGGGAACGGAAAAGAAGGAAAGAGAGAAGTTCTCGGTATCACTTTCCAAAAAGGAGATCGAGGAGGATTTCATGGTCATGATGGGACGCAGGCC
GCCGAGGAGGCCCAAGAAGAGGCCCAGGATTGTACAGAATCACATGGATGTAGGGGTTCTTCTTCTTCTGTGTGACCTCTTTTTTACCTTCTTTGAAACTGGGTTTTCAA
CTCTTTATCCCTTCTCCATTGCAGACGCTTTTCCCCGGGCTGTGGTTGACGGAGATCACCGCCGATCTCTACGACGTGCCGGAAATCCCCCAAAACGGAAAGTTGAGGTT
GGTTTGGAAGATGGTGAGCGGCGGTCCAGGATGGGTTCGGGAGGAGAGGTTAAGTTGGAGGAGCCTTCAGCCATGGATTCAATGCTTCGTTACAGATCGTGGGTCAGTGT
TTCTCCAGAAGCAGAATTACACGAGCAGGACGGAATACCCATCTGCATTAATGTTCACGCCAACGTTTCCCCGGTTCCCATTCTCATCTCATCCGCCGCCGACGACGACA
GAAGCACAAGTTCCGCGATGTTGCAGGCTTCCATCAATCCGTTCAAATCTCCGCCTCCGGTCTCACGTACCAAGACTACCGGACTCCCAATCCCTTCACCACTTGTCTGG
CCCATCACCCGCCCGTCGACATCGTTACGCCTACGATCGTCCCCAGTGACGAGAAGCTCTAGCTCCCAAACATTTGATCGCAAGGGCAGCAAAGGGATGGATGGCTTCCT
AGATGTTGAGCTCAAAGTTCGTGATTATGAGTTGGATCAGTTCGGTGTCGTGAACAATGCTGTTTATGCAAGTTATTGCCAACATGGTCGTCATGAACTGCTGGAAACTA
TAGGCCTCAGTCCTGATGCAGTTGCTCGCAGTGGGGCTGCATTAGCACTATCAGAGTTGTCTTTGAAGTTCCTTGCACCACTTAGAAGTGGAGACAGATTTGTTATTAAG
GTGAGGATCTCAGGCATGTCAGCGGCTCGCTTCTACATTGATCATTTTATTTTCAAGCTTCCAAATATGGAGGTTGGTAATGGAAACATTAATTTGCATCTCAGTTTAAT
AACCTTGCCTGGAAAGCCTGGTAACTTCATAATCCACTCTGTCTTATTTTCTTTCCCTAGCCAATCTTGGAGGCAAAGTGTACAGCAGTTTGGCTTAATGAAAATTACCG
TCCCGTGCGTATCCCACAAGAATTCATGTCCAAATTTGCTCAATTTCTTCTGGATCACGAGTCGAAGTGATCAACTTTGGGGCGATTCTGTTATGTTCTGGAGAAGGCAA
AAGAGTTCCTCCTCTTGTTCCCTTAGCAGAATGGCGGAAGCAAAATTAAACCCAGCAGTCGAGAAGAGGGTGATGGTGGCCATAGACGAGAGTGAGTGTAGCTACTATGC
GCTAATCTGGGTGCTCGAAAATCTTAAAGAATCCATAGCCGAGTCCCCCCTTTTCGTCTTCACGGCTCTACCTCCGCCCACCAATTATACCTTCGGTGCTGGTGCATCTC
TTGGCCTTGTACGCACGTATTGCGCTGTTCCATCCAATTCGGAGTTAGGTAATTCGATCCAAGAGAATGATAAGAAAGTTAGATGCGCCCTCCTCGAGAAAGCAAAGGCT
ATATGTGCTGAAAGAGGGGTGGCTGCTATATCCATCACAGAGGTTGGGGATCCTGGAACAACCATATGTGATACAGTTGAAACGCTCAATATAAATTTGCTTGTTTTAGG
TGATCGTGGCCTTGGGAGAATTAAGAGACGCAGGTTCACTACTGGTGCACGGGAGAAGAAATATTTATCTGCAAAGCAATTTTTTAAGTTAAAAACTCTTCTGCCCAAAG
AACAACTCAACCAAGCTTTGTATGCAGTCGACCTTCTAATTTTGCTTCCTGGTGTACCAGTGAACGTCAATCATGTTTCCTCCCCTGTCTGTCGTAAAGCCCACATGCAG
AGGCTGCATTTTACACTGAGAAGGTTGAGTTTGGTAGCTCAGAAGCTGGGAAGTATAGTTGTTAATGGCCCCTGCAACGCACCGTCCTATCTCTCCAACTCCATCTTTGC
AGTCCCCTTAAGATTGAGATCCCAACTGCAAACGCTCGCTAAATCACCAATTGCGGAGGCTGGCAACAGTTGCTGCACCGCATCTGCTGCTGCTCTGCTCAACCGTGACT
GTTGCAGCAAATCAAAGACCAAACTGATCTTCATCAAAGAGGGTGCAAAGCCAAAAGGCCAAAGTGGGGGGACTCCTCCTAGTACCGGCCACCACCACCGTCAAACAAAC
ATATCAATCATGCAGGTGGGTCCCCACCAGAGCTTAGAAGAACCGACGGATCACTGCGCCCGCGATGATCTGTCCCCACCCAGTACTGAGGTTGAGATGGAGGTGCCTGT
GATTAACAGGTTTAGTGGGTTGGAATCGGGGATGAGTTCCTTGCCGAACCCGACTCTCTTGCCCCAGATTTTGGGTTCGCGCTCTGGATTTGAAACCCTTTCTCAATCCC
TCGATCTCTGGAAATGGAGTGCTGTGATTATCGCCGTTGTTGCTACTTTTAGTGGCGTTATCAATCGGCTTAAGCTTTTGTTCGTGATTTTCCGACGCCAGAAACGTATT
CTTCAAGAGATTGTTGATGATTCCGACTCCGACGATGAGTTCTCCGTTGATGATTCTGCCAGCTCCGTTTCGTCGTCGTGGTCTGAGTTCGAGGAAGACGACGAGCCCGC
CACGTCTTCCCGAAGTTGGGACTTCAATGACCGGGATTTCCTCGTCAGAGGCTCAGATTATTACATGAATGATCTCGAAACGAAACGGACTCTGAGACTCAGGCAGCAGC
GGAGCTTCCACAACCAAGACGGAGATGGCGACCAGTTCTCGTGGACCGATTTTGGCGGAGGCAAGAGCGTTGTGAAGTTGTGGGACAACCTTAGATTGGAGTTCGATCAC
CGTCAGTCCGACGCGAACGAGATTCGTGTCTACGATGTAATCAAAGAACAAAAAATAGGCTCCATTCTCGCCGGAGAATCACAGATAGCACCAGCGTTGACGTCGACAGT
ATTGTTGTCGGCGGCGGCGGATGTTTCAGGTAAGGTGTCCGTAAATCTCTGGGACACACGCGTGGGTTCTCAAATTCCGGCATTAATAGCCGAGTGGAAGCCGGTGGAAG
GCAAGGTCCACGGGGTCCATTTTGCCGGCGACCAAAAGGTTTATATCAGAGACGAAGACGCAGGAAAGCTAACGGTCGGTGACGTGAGAAACGTGAAGTCGCCGCTGGAG
AATTTAACGGCGGCGGATGTGGAGACTTGGTGGGACGCCGACGCGGTTCTGGTTTCGGCGGAATAG
Protein sequenceShow/hide protein sequence
MVCADSSKEKRRVVDSMAMGLERSKPLHNFTLPFLKWGNQRYLRCMKLDSDGGDVPPHADRRSPAVKFNCSKFQRRDSETERPVFFKDSGKRLRGSKSKIERSHDNYEGD
EGIAAVREKLMIDLKTAADKMKVAFWRDGVVEGEDDDDDMTKTEKELPAAAASASASAEELRPWNLRVRRAASKAPIDASAGGKGGSKALKIEKTPNRNSPLRSGDGGAR
SPGRRVGTEKKEREKFSVSLSKKEIEEDFMVMMGRRPPRRPKKRPRIVQNHMDVGVLLLLCDLFFTFFETGFSTLYPFSIADAFPRAVVDGDHRRSLRRAGNPPKRKVEV
GLEDGERRSRMGSGGEVKLEEPSAMDSMLRYRSWVSVSPEAELHEQDGIPICINVHANVSPVPILISSAADDDRSTSSAMLQASINPFKSPPPVSRTKTTGLPIPSPLVW
PITRPSTSLRLRSSPVTRSSSSQTFDRKGSKGMDGFLDVELKVRDYELDQFGVVNNAVYASYCQHGRHELLETIGLSPDAVARSGAALALSELSLKFLAPLRSGDRFVIK
VRISGMSAARFYIDHFIFKLPNMEVGNGNINLHLSLITLPGKPGNFIIHSVLFSFPSQSWRQSVQQFGLMKITVPCVSHKNSCPNLLNFFWITSRSDQLWGDSVMFWRRQ
KSSSSCSLSRMAEAKLNPAVEKRVMVAIDESECSYYALIWVLENLKESIAESPLFVFTALPPPTNYTFGAGASLGLVRTYCAVPSNSELGNSIQENDKKVRCALLEKAKA
ICAERGVAAISITEVGDPGTTICDTVETLNINLLVLGDRGLGRIKRRRFTTGAREKKYLSAKQFFKLKTLLPKEQLNQALYAVDLLILLPGVPVNVNHVSSPVCRKAHMQ
RLHFTLRRLSLVAQKLGSIVVNGPCNAPSYLSNSIFAVPLRLRSQLQTLAKSPIAEAGNSCCTASAAALLNRDCCSKSKTKLIFIKEGAKPKGQSGGTPPSTGHHHRQTN
ISIMQVGPHQSLEEPTDHCARDDLSPPSTEVEMEVPVINRFSGLESGMSSLPNPTLLPQILGSRSGFETLSQSLDLWKWSAVIIAVVATFSGVINRLKLLFVIFRRQKRI
LQEIVDDSDSDDEFSVDDSASSVSSSWSEFEEDDEPATSSRSWDFNDRDFLVRGSDYYMNDLETKRTLRLRQQRSFHNQDGDGDQFSWTDFGGGKSVVKLWDNLRLEFDH
RQSDANEIRVYDVIKEQKIGSILAGESQIAPALTSTVLLSAAADVSGKVSVNLWDTRVGSQIPALIAEWKPVEGKVHGVHFAGDQKVYIRDEDAGKLTVGDVRNVKSPLE
NLTAADVETWWDADAVLVSAE