CuGenDBv2

CmaCh09G001020 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh09G001020
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: AP complex subunit sigma
Genome location: Cma_Chr09:452012..455376
RNA-Seq Expression: CmaCh09G001020
Synteny: CmaCh09G001020
Gene Ontology terms:
GO:0005975 - carbohydrate metabolic process (biological process)
GO:0006886 - intracellular protein transport (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0030122 - AP-2 adaptor complex (cellular component)
GO:0016301 - kinase activity (molecular function)
GO:0016773 - phosphotransferase activity, alcohol group as acceptor (molecular function)
GO:0035615 - clathrin adaptor activity (molecular function)
GO:0036094 - small molecule binding (molecular function)
InterPro domains:
IPR011012 - Longin-like domain superfamily
IPR016635 - Adaptor protein complex, sigma subunit
IPR022775 - AP complex, mu/sigma subunit
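
The genome location in the summary above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this) and using a hypothetical helper `parse_location` that is not part of CuGenDB, the string can be split and the gene span computed like this:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Cma_Chr09:452012..455376")
# Assumption: both endpoints are included in the feature.
span = end - start + 1
print(seqid, start, end, span)  # Cma_Chr09 452012 455376 3365
```

Under that assumption the gene spans 3365 bp on chromosome 9.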


Homology
GenBank top hits (e value, % identity, alignment)
KAG6591267.1 AP-2 complex subunit sigma, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.6e-90; % identity: 50.43)
Query:  MIRFILLQNRQGKTRLAKYYVPLEESEKHKVEYEIGS--IAKPLPLPLLLLSRVCNSDLNSLFSVLLPPHRFASSDPPFSPLDFGFLAISSSPVFSVGIV
        MIRFILLQNRQGKTRLAKYYVPLEES+KHKVE++I    + +       +LS   NS L+S+  + L  + +  + P  SP  F                
Subjt:  MIRFILLQNRQGKTRLAKYYVPLEESEKHKVEYEIGS--IAKPLPLPLLLLSRVCNSDLNSLFSVLLPPHRFASSDPPFSPLDFGFLAISSSPVFSVGIV

Query:  HNIISSDANVLFVDSVWRFSLVPRFQVSIGIWDCLRSLSHESTTDALCLKRGSEGFFNTWRMVFEGGLESVLFFQLNRIGEMRLQTRPVNKK--TSLPAP
                         R  L P+F +                   LC+   S           +  L   +   LN            N K  ++L AP
Subjt:  HNIISSDANVLFVDSVWRFSLVPRFQVSIGIWDCLRSLSHESTTDALCLKRGSEGFFNTWRMVFEGGLESVLFFQLNRIGEMRLQTRPVNKK--TSLPAP

Query:  FCFGIVCSSAFTFISTQTLKTFQHFVHRIQPPIINWIWIRRSLRHFKEWSGGLFILVILKG--KTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIY
        +      S     + +    +F   V R              L HF +  G      +L         K+KVECEVHRLVVNSDPNF+  VEFRTHKVIY
Subjt:  FCFGIVCSSAFTFISTQTLKTFQHFVHRIQPPIINWIWIRRSLRHFKEWSGGLFILVILKG--KTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIY

Query:  RQYAGLFFSICVDKTDNYLESVRLFVEVLDQFFSNVYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAG
        RQYAGLFFS+CV +TDNYLESVRLFVE+LD FFS+VYLILDEFILAGKLQETSKKAPQN LEPSA DIRLYEDHV Q  V+SLIDMEM LRTSKTKLLAG
Subjt:  RQYAGLFFSICVDKTDNYLESVRLFVEVLDQFFSNVYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAG

Query:  IDLIRRLKVSALETENESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVI
        IDLIR+LKVSALETEN S+LKSKRSELEFRCQM EN+LGIL+KLKSS   SNEKWKDGV+
Subjt:  IDLIRRLKVSALETENESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVI

KAG7024150.1 AP-2 complex subunit sigma, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.4e-65; % identity: 80.45)
Query:  RTHKVIYRQYAGLFFSICVDKTDNYLESVRLFVEVLDQFFSN------------VYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVE
        RTHKVIYRQYAGLFFS+CV +TDNYLESVRLFVE+LD FFS+            VYLILDEFILAGKLQETSKKAPQN LEPSA DIRLYEDHV Q  V+
Subjt:  RTHKVIYRQYAGLFFSICVDKTDNYLESVRLFVEVLDQFFSN------------VYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVE

Query:  SLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETENESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVI
        SLIDMEM LRTSKTKLLAGIDLIR+ KVSALETEN S+LKSKRSELEFRCQM EN+LGILEKLKSS   SNEKWKDGV+
Subjt:  SLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETENESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVI

XP_015969838.1 AP-2 complex subunit sigma isoform X2 [Arachis duranensis] (e value: 7.5e-32; % identity: 73.15)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSNVYLILDEFILAGK
        +GKTRLAKY          KVE EVHRLVVN DP + N VEFRTHKVIYR+YAGLFFS+CVD TDN   YLE + LFVE+LD FFSNVYLILDEFILAG+
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSNVYLILDEFILAGK

Query:  LQETSKKA
        LQETSKKA
Subjt:  LQETSKKA

XP_022937104.1 AP-2 complex subunit sigma isoform X1 [Cucurbita moschata] (e value: 2.0e-64; % identity: 74.61)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V
        +GKTRLAKY          KVE EVHRLVVN DP F N VEFRTHKVIYR+YAGLFFS+CVD TDN   YLE + LFVE+LD FFSN            V
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V

Query:  YLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETENESKLKSKRSELEF
        YLILDEFILAGKLQETSKKAPQNQLEPSA DIRLYEDHV QVKV+SLIDMEM LRTSKTKLLAGIDLIRRLKVSALE ENE+KLKSKRS  ++
Subjt:  YLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETENESKLKSKRSELEF

XP_022975655.1 uncharacterized protein LOC111475461 [Cucurbita maxima] (e value: 1.6e-135; % identity: 95.77)
Query:  STQTLKTFQHFVHRIQPPIINWIWIRRSLRHFKEWSGGLFILVILKGKTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTD
        STQTLKTFQHFVHRIQPPIINWIWIRRSLRHFKEW          +GKTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTD
Subjt:  STQTLKTFQHFVHRIQPPIINWIWIRRSLRHFKEWSGGLFILVILKGKTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTD

Query:  NYLESVRLFVEVLDQFFSNVYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETEN
        NYLESVRLFVEVLDQFFSNVYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETEN
Subjt:  NYLESVRLFVEVLDQFFSNVYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETEN

Query:  ESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVIEAPVRRLSWTLIYYKP
        ESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVIEAPVRRLSWTLIYYKP
Subjt:  ESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVIEAPVRRLSWTLIYYKP

TrEMBL top hits (e value, % identity, alignment)
A0A1R3IXZ7 Phosphotransferase (e value: 5.3e-31; % identity: 61.7)
Query:  LRHFKEWSG-GLFILVIL---KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVE
        + H K+ SG G  +L      +GKTRLAKY          KVE EVHRLVVN DP F N VEFRTHKVIYR+YAGLFFS+CVD TDN   YLES+ LFVE
Subjt:  LRHFKEWSG-GLFILVIL---KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVE

Query:  VLDQFFSN------------VYLILDEFILAGKLQETSKKA
        +LD FFSN            VYLILDEFILAG+LQETSKKA
Subjt:  VLDQFFSN------------VYLILDEFILAGKLQETSKKA

A0A6J1FA77 AP-2 complex subunit sigma isoform X1 (e value: 9.5e-65; % identity: 74.61)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V
        +GKTRLAKY          KVE EVHRLVVN DP F N VEFRTHKVIYR+YAGLFFS+CVD TDN   YLE + LFVE+LD FFSN            V
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V

Query:  YLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETENESKLKSKRSELEF
        YLILDEFILAGKLQETSKKAPQNQLEPSA DIRLYEDHV QVKV+SLIDMEM LRTSKTKLLAGIDLIRRLKVSALE ENE+KLKSKRS  ++
Subjt:  YLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETENESKLKSKRSELEF

A0A6J1IHB5 uncharacterized protein LOC111475461 (e value: 7.9e-136; % identity: 95.77)
Query:  STQTLKTFQHFVHRIQPPIINWIWIRRSLRHFKEWSGGLFILVILKGKTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTD
        STQTLKTFQHFVHRIQPPIINWIWIRRSLRHFKEW          +GKTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTD
Subjt:  STQTLKTFQHFVHRIQPPIINWIWIRRSLRHFKEWSGGLFILVILKGKTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTD

Query:  NYLESVRLFVEVLDQFFSNVYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETEN
        NYLESVRLFVEVLDQFFSNVYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETEN
Subjt:  NYLESVRLFVEVLDQFFSNVYLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETEN

Query:  ESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVIEAPVRRLSWTLIYYKP
        ESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVIEAPVRRLSWTLIYYKP
Subjt:  ESKLKSKRSELEFRCQMVENVLGILEKLKSSGNDSNEKWKDGVIEAPVRRLSWTLIYYKP

A0A6P4DRG5 AP complex subunit sigma (e value: 3.6e-32; % identity: 73.15)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSNVYLILDEFILAGK
        +GKTRLAKY          KVE EVHRLVVN DP + N VEFRTHKVIYR+YAGLFFS+CVD TDN   YLE + LFVE+LD FFSNVYLILDEFILAG+
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSNVYLILDEFILAGK

Query:  LQETSKKA
        LQETSKKA
Subjt:  LQETSKKA

M8B540 AP complex subunit sigma (e value: 1.5e-30; % identity: 66.13)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V
        +GKTRLAKY          KVE EVHRLVVN DP F N VEFRTHKVIYR+YAGLFFSICVD TDN   YLE + LFVE+LD FFSN            V
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V

Query:  YLILDEFILAGKLQETSKKAPQNQ
        YLILDEFILAG+LQETSKK  Q Q
Subjt:  YLILDEFILAGKLQETSKKAPQNQ

SwissProt top hits (e value, % identity, alignment)
O50016 AP-2 complex subunit sigma (e value: 1.3e-31; % identity: 65.29)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSNV------------
        +GKTRLAKY          KVE EVHRLVVN DP F N VEFRTHKVIYR+YAGLFFSICVD TDN   YLE + LFVE+LD FFSNV            
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSNV------------

Query:  --YLILDEFILAGKLQETSKK
          YLILDEFILAG+LQETSK+
Subjt:  --YLILDEFILAGKLQETSKK

Q4WS49 AP-2 complex subunit sigma (e value: 6.2e-21; % identity: 49.63)
Query:  FILV-ILKGKTRLAKY----------KVECEVHRLVVNSDPNFR-NSVEF-RTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN----
        FILV   +GKTRLAK+          K++ EVHRLV   D  ++ N VEF R+ K++YR+YAGLFF +CVD TDN   YLE++  FVEVLDQFF N    
Subjt:  FILV-ILKGKTRLAKY----------KVECEVHRLVVNSDPNFR-NSVEF-RTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN----

Query:  --------VYLILDEFILAGKLQETSKKAPQNQLE
                VY ILDE  LAG+++ETSK+    +LE
Subjt:  --------VYLILDEFILAGKLQETSKKAPQNQLE

Q54H39 AP-2 complex subunit sigma (e value: 6.2e-21; % identity: 45.24)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V
        +GKTRL+K+          K+  E+H++V + +  F N VEFRTH+++YR+YAGLFFS+CVD TDN    LE++ LFVEVLD +F N            V
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V

Query:  YLILDEFILAGKLQETSKKAPQNQLE
        Y I+DE  LAG+L E SK     ++E
Subjt:  YLILDEFILAGKLQETSKKAPQNQLE

Q7SAQ1 AP-2 complex subunit sigma (e value: 9.6e-22; % identity: 48.82)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFR-NSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------
        +GKTRLAK+          K++ E+HRLV   D  ++ N VEFR HKV+YR+YAGLFF  CVD  DN   YLE++  FVEVLD FF N            
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFR-NSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------

Query:  VYLILDEFILAGKLQETSKKAPQNQLE
        VY ILDE  LAG+++ETSK+    +LE
Subjt:  VYLILDEFILAGKLQETSKKAPQNQLE

Q84WL9 AP-2 complex subunit sigma (e value: 7.8e-32; % identity: 65.83)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V
        +GKTRLAKY          KVE EVHRLVVN D  F N VEFRTHKVIYR+YAGLFFS+CVD TDN   YLES+ LFVE+LD FFSN            V
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V

Query:  YLILDEFILAGKLQETSKKA
        YLILDEFILAG+LQETSK+A
Subjt:  YLILDEFILAGKLQETSKKA

Arabidopsis top hits (e value, % identity, alignment)
AT1G47830.1 SNARE-like superfamily protein (e value: 5.6e-33; % identity: 65.83)
Query:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V
        +GKTRLAKY          KVE EVHRLVVN D  F N VEFRTHKVIYR+YAGLFFS+CVD TDN   YLES+ LFVE+LD FFSN            V
Subjt:  KGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSN------------V

Query:  YLILDEFILAGKLQETSKKA
        YLILDEFILAG+LQETSK+A
Subjt:  YLILDEFILAGKLQETSKKA

AT2G17380.1 associated protein 19 (e value: 2.8e-16; % identity: 37.9)
Query:  ILVILKGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDNYLESVRL---FVEVLDQFFSNV-------
        +LV  +GK RL K+          KV  E+  +++N  P   N +E+R +KV+Y++YA L+F +C+D+ DN LE + +   +VE+LD++F +V       
Subjt:  ILVILKGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDNYLESVRL---FVEVLDQFFSNV-------

Query:  -----YLILDEFILAGKLQETSKK
             Y ILDE ++AG+LQE+SKK
Subjt:  -----YLILDEFILAGKLQETSKK

AT2G19790.1 SNARE-like superfamily protein (e value: 1.5e-09; % identity: 34.4)
Query:  FILVILK-GKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSNV-----
        FIL++ K G+TRLA+Y           +E E+ R  +  +    + VE R +K++YR+YA LFF + VD  +N    LE + L VE +D+ F NV     
Subjt:  FILVILK-GKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDN---YLESVRLFVEVLDQFFSNV-----

Query:  -------YLILDEFILAGKLQETSK
               + +L+E ++ G + ETSK
Subjt:  -------YLILDEFILAGKLQETSK

AT4G35410.1 Clathrin adaptor complex small chain family protein (e value: 6.0e-11; % identity: 37.63)
Query:  ILVILKGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDNYLESVRL---FVEVLDQFFSNV
        +LV  +GK RL K+          KV  E+  +++N  P   N VE+R +KV+Y++YA L+F +C+D+ DN LE + +   +VE+LD++F +V
Subjt:  ILVILKGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDNYLESVRL---FVEVLDQFFSNV

AT4G35410.2 Clathrin adaptor complex small chain family protein (e value: 2.8e-16; % identity: 38.71)
Query:  ILVILKGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDNYLESVRL---FVEVLDQFFSNV-------
        +LV  +GK RL K+          KV  E+  +++N  P   N VE+R +KV+Y++YA L+F +C+D+ DN LE + +   +VE+LD++F +V       
Subjt:  ILVILKGKTRLAKY----------KVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDNYLESVRL---FVEVLDQFFSNV-------

Query:  -----YLILDEFILAGKLQETSKK
             Y ILDE ++AG+LQE+SKK
Subjt:  -----YLILDEFILAGKLQETSKK
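
Each alignment block above is laid out in the usual BLAST style: a query line, a midline, and a subject line. In the midline a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following sketch uses a hypothetical helper `midline_identity` (an illustration, not the routine that produced the % identity values on this page) to count identical columns in a single block:

```python
def midline_identity(midline: str, width: int) -> float:
    """Percent of aligned columns whose midline shows an identical residue.

    `width` is the number of aligned columns in the block (the length of the
    query segment); trailing spaces lost from the midline are padded back.
    """
    padded = midline.ljust(width)
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / width


# Final block of the AT1G47830.1 alignment listed above.
query = "YLILDEFILAGKLQETSKKA"
midline = "YLILDEFILAG+LQETSK+A"
print(midline_identity(midline, len(query)))  # 90.0
```

The per-hit % identity reported in the tables covers the full alignment and may use a different denominator, so a single block will generally not reproduce it.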


Sequences
CDS sequence
ATGATCCGATTCATACTTTTGCAAAATAGGCAGGGCAAGACCCGTCTGGCTAAGTATTACGTTCCTCTCGAGGAATCCGAAAAGCACAAGGTCGAGTACGAGATTGGTAG
TATTGCCAAACCACTCCCTCTTCCTCTCCTACTTCTTTCTCGCGTCTGCAACTCAGACCTAAATTCCCTCTTCTCTGTATTGCTACCACCTCACAGGTTTGCTTCTTCAG
ATCCCCCTTTTTCCCCTCTCGATTTCGGGTTTCTTGCAATTTCTTCCTCCCCAGTTTTTAGTGTTGGGATTGTGCATAATATTATCAGTTCCGATGCGAATGTTTTGTTT
GTTGATTCAGTCTGGAGGTTTTCGTTGGTACCTCGATTCCAAGTGTCAATTGGGATTTGGGATTGTCTGAGATCTCTTAGTCATGAGTCCACGACCGATGCCCTGTGCTT
GAAGAGAGGATCGGAAGGGTTTTTCAACACATGGAGGATGGTTTTTGAAGGAGGTCTGGAGAGCGTCTTATTCTTCCAATTGAATAGGATTGGAGAAATGAGATTGCAAA
CACGCCCAGTCAACAAAAAGACAAGCTTGCCTGCCCCTTTTTGTTTTGGAATCGTTTGCTCATCTGCATTCACATTCATCTCAACACAAACTTTAAAGACATTTCAGCAC
TTTGTGCACCGAATTCAGCCTCCGATAATCAATTGGATTTGGATTCGTCGATCCCTTCGTCATTTTAAAGAATGGTCTGGCGGCCTTTTCATTTTGGTGATCTTAAAGGG
CAAGACCCGTCTGGCGAAGTATAAGGTCGAGTGCGAGGTTCATCGATTGGTGGTGAATAGTGATCCCAATTTCAGAAATTCTGTTGAGTTCCGAACACACAAGGTCATCT
ACAGACAATATGCAGGATTATTTTTCTCCATTTGTGTGGACAAAACAGACAACTATCTTGAGAGTGTTCGTCTGTTTGTGGAGGTTCTGGATCAATTTTTCAGCAATGTC
TATCTGATACTTGATGAATTTATTCTTGCTGGAAAACTCCAAGAAACGAGCAAAAAGGCGCCACAAAATCAGCTCGAACCAAGCGCCACAGATATTCGTTTATATGAAGA
TCATGTTTGTCAGGTAAAGGTTGAGTCGCTGATAGATATGGAGATGCCACTTCGAACCTCGAAAACAAAACTGCTAGCTGGAATAGATCTGATCCGCCGACTGAAGGTAT
CTGCACTGGAGACCGAGAATGAGAGTAAATTGAAGAGCAAGCGGTCGGAGCTTGAATTTAGGTGCCAAATGGTAGAGAATGTACTGGGCATATTGGAAAAATTAAAAAGC
TCCGGCAATGACAGCAATGAGAAGTGGAAGGATGGTGTCATTGAGGCCCCTGTGCGTAGACTTTCCTGGACTCTGATATATTATAAACCCTAA
mRNA sequence
TAATCAACGCAACTTTTTGCACAGAATTCGGCCTCCGACAATCAACGCAACTCTGTGCACCAAAATTCGTCCTCCGATAATCAACGCAACTTTGTGCACCAAAATTCGGC
CTCCGACAATCAATTGGAGTTGGATTCGTCGATCTTCCTTCGTCATTTTCAAGAATGATCCGATTCATACTTTTGCAAAATAGGCAGGGCAAGACCCGTCTGGCTAAGTA
TTACGTTCCTCTCGAGGAATCCGAAAAGCACAAGGTCGAGTACGAGATTGGTAGTATTGCCAAACCACTCCCTCTTCCTCTCCTACTTCTTTCTCGCGTCTGCAACTCAG
ACCTAAATTCCCTCTTCTCTGTATTGCTACCACCTCACAGGTTTGCTTCTTCAGATCCCCCTTTTTCCCCTCTCGATTTCGGGTTTCTTGCAATTTCTTCCTCCCCAGTT
TTTAGTGTTGGGATTGTGCATAATATTATCAGTTCCGATGCGAATGTTTTGTTTGTTGATTCAGTCTGGAGGTTTTCGTTGGTACCTCGATTCCAAGTGTCAATTGGGAT
TTGGGATTGTCTGAGATCTCTTAGTCATGAGTCCACGACCGATGCCCTGTGCTTGAAGAGAGGATCGGAAGGGTTTTTCAACACATGGAGGATGGTTTTTGAAGGAGGTC
TGGAGAGCGTCTTATTCTTCCAATTGAATAGGATTGGAGAAATGAGATTGCAAACACGCCCAGTCAACAAAAAGACAAGCTTGCCTGCCCCTTTTTGTTTTGGAATCGTT
TGCTCATCTGCATTCACATTCATCTCAACACAAACTTTAAAGACATTTCAGCACTTTGTGCACCGAATTCAGCCTCCGATAATCAATTGGATTTGGATTCGTCGATCCCT
TCGTCATTTTAAAGAATGGTCTGGCGGCCTTTTCATTTTGGTGATCTTAAAGGGCAAGACCCGTCTGGCGAAGTATAAGGTCGAGTGCGAGGTTCATCGATTGGTGGTGA
ATAGTGATCCCAATTTCAGAAATTCTGTTGAGTTCCGAACACACAAGGTCATCTACAGACAATATGCAGGATTATTTTTCTCCATTTGTGTGGACAAAACAGACAACTAT
CTTGAGAGTGTTCGTCTGTTTGTGGAGGTTCTGGATCAATTTTTCAGCAATGTCTATCTGATACTTGATGAATTTATTCTTGCTGGAAAACTCCAAGAAACGAGCAAAAA
GGCGCCACAAAATCAGCTCGAACCAAGCGCCACAGATATTCGTTTATATGAAGATCATGTTTGTCAGGTAAAGGTTGAGTCGCTGATAGATATGGAGATGCCACTTCGAA
CCTCGAAAACAAAACTGCTAGCTGGAATAGATCTGATCCGCCGACTGAAGGTATCTGCACTGGAGACCGAGAATGAGAGTAAATTGAAGAGCAAGCGGTCGGAGCTTGAA
TTTAGGTGCCAAATGGTAGAGAATGTACTGGGCATATTGGAAAAATTAAAAAGCTCCGGCAATGACAGCAATGAGAAGTGGAAGGATGGTGTCATTGAGGCCCCTGTGCG
TAGACTTTCCTGGACTCTGATATATTATAAACCCTAA
Protein sequence
MIRFILLQNRQGKTRLAKYYVPLEESEKHKVEYEIGSIAKPLPLPLLLLSRVCNSDLNSLFSVLLPPHRFASSDPPFSPLDFGFLAISSSPVFSVGIVHNIISSDANVLF
VDSVWRFSLVPRFQVSIGIWDCLRSLSHESTTDALCLKRGSEGFFNTWRMVFEGGLESVLFFQLNRIGEMRLQTRPVNKKTSLPAPFCFGIVCSSAFTFISTQTLKTFQH
FVHRIQPPIINWIWIRRSLRHFKEWSGGLFILVILKGKTRLAKYKVECEVHRLVVNSDPNFRNSVEFRTHKVIYRQYAGLFFSICVDKTDNYLESVRLFVEVLDQFFSNV
YLILDEFILAGKLQETSKKAPQNQLEPSATDIRLYEDHVCQVKVESLIDMEMPLRTSKTKLLAGIDLIRRLKVSALETENESKLKSKRSELEFRCQMVENVLGILEKLKS
SGNDSNEKWKDGVIEAPVRRLSWTLIYYKP
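
The CDS and protein sequences above should correspond under the standard genetic code. As a minimal sketch, assuming the standard codon table and a hypothetical helper `translate` (not a CuGenDB tool), the first and last codons of the CDS can be checked against the protein sequence:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGATCCGATTCATACTTTTGCAAAATAGG"))  # MIRFILLQNR
print(translate("TATAAACCCTAA"))                    # YKP
```

Both fragments agree with the start (`MIRFILLQNR...`) and end (`...YKP`) of the protein sequence. Translating the full CDS in the same way would need the wrapped lines joined into one string first.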