; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022943 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022943
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionEukaryotic translation initiation factor 3 subunit L
Genome locationtig00000729:1213796..1232177
RNA-Seq ExpressionSgr022943
SyntenySgr022943
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004150281.1 uncharacterized protein LOC101204402 [Cucumis sativus]3.6e-11680.14Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ
        MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVM LVALIGLAIIVCT+ISLNIVFPDDI NRPFC+DRRLQPLP MNG GGESD  LGAFYLTNQ+
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ

Query:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI
        IVDYYWMLVFIPSV+AF ASA+YLVAGIVVAYSAP+RHGCLKVVENSYCASRR                                    GGVRCLTILN+
Subjt:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI

Query:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        VFAIIFGL+ALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRK+ATIL EGDLG RN GLEMLVANP EITP
Subjt:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

XP_008445033.1 PREDICTED: uncharacterized protein LOC103488196 [Cucumis melo]1.1e-11781.21Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ
        MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKP+VM LVALIGLAIIVCTTISLNIVFPDDI NRPFC+DRRLQPLP MNG GGESD  LGAFYLTNQ+
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ

Query:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI
        IVDYYWMLVFIPSV+AFLASAVYLVAGIVVAYSAP+RHGCLKVVENSYCASRR                                    GGVRCLTILN+
Subjt:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI

Query:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        VFAIIFGL+ALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRK+ATIL EGDLGGRN GLEMLVANP EITP
Subjt:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

XP_022132184.1 uncharacterized protein LOC111005105 [Momordica charantia]2.9e-11880.43Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI
        MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVM LVALIGLAIIVCTTISLNIVFP+DIANRPFCNDRRLQPLP+NG GGESDF+LGAFYLTNQ+I
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI

Query:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV
        VDYYWMLVFIPSV+AF AS VYLVAGI+VAYS PTRHGCLKVVENSYCASRR                                    GGVRCLTILN+V
Subjt:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV

Query:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        FAIIFGL+ALFLGSSLLTLGGSC+VPLFWCYEI+SWGLVILYGGTAFFLRRKAATILDEGDL G+N GLEMLVANP EITP
Subjt:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

XP_022962315.1 uncharacterized protein LOC111462802 [Cucurbita moschata]2.8e-11377.22Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI
        MAIIGDALRQAFMPKHEYENLREEEKAWGK+QKPLVM  V LIGLAIIVC+TISL IVFP DI NRPFC+DRRLQPLPMNG GG+SD  +GAFYLTNQ+I
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI

Query:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV
        VDY+WMLVFIPS  AFLAS VYL+AGI VAYSAPTRHGCLKVVENSYCASRR                                    GGVRCLTILN++
Subjt:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV

Query:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        FAIIFGL+ALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATIL EGDLGGRN GLEMLVANP EI+P
Subjt:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

XP_038886522.1 uncharacterized protein LOC120076695 [Benincasa hispida]7.2e-11780.07Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI
        MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPL+M LVALIG+ IIVCT ISLNIVFPDDI NRPFC+DRRLQPLPMNG  GESD LLGAFYLTNQ+I
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI

Query:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV
        VDYYWMLVFIPSV+AFLASAVYLVAGIVVAYSAPTRH CLKVVENSYCASRR                                    GGVRCLTILN++
Subjt:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV

Query:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        FAIIFGL+ALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATIL EG+LGGRN GLEMLVANP EITP
Subjt:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

TrEMBL top hitse value%identityAlignment
A0A0A0LS39 Uncharacterized protein1.7e-11680.14Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ
        MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVM LVALIGLAIIVCT+ISLNIVFPDDI NRPFC+DRRLQPLP MNG GGESD  LGAFYLTNQ+
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ

Query:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI
        IVDYYWMLVFIPSV+AF ASA+YLVAGIVVAYSAP+RHGCLKVVENSYCASRR                                    GGVRCLTILN+
Subjt:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI

Query:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        VFAIIFGL+ALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRK+ATIL EGDLG RN GLEMLVANP EITP
Subjt:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

A0A1S3BBR1 uncharacterized protein LOC1034881965.4e-11881.21Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ
        MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKP+VM LVALIGLAIIVCTTISLNIVFPDDI NRPFC+DRRLQPLP MNG GGESD  LGAFYLTNQ+
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ

Query:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI
        IVDYYWMLVFIPSV+AFLASAVYLVAGIVVAYSAP+RHGCLKVVENSYCASRR                                    GGVRCLTILN+
Subjt:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI

Query:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        VFAIIFGL+ALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRK+ATIL EGDLGGRN GLEMLVANP EITP
Subjt:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

A0A5A7VBN1 Uncharacterized protein5.4e-11881.21Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ
        MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKP+VM LVALIGLAIIVCTTISLNIVFPDDI NRPFC+DRRLQPLP MNG GGESD  LGAFYLTNQ+
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLP-MNGTGGESDFLLGAFYLTNQQ

Query:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI
        IVDYYWMLVFIPSV+AFLASAVYLVAGIVVAYSAP+RHGCLKVVENSYCASRR                                    GGVRCLTILN+
Subjt:  IVDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNI

Query:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        VFAIIFGL+ALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRK+ATIL EGDLGGRN GLEMLVANP EITP
Subjt:  VFAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

A0A6J1BSC6 uncharacterized protein LOC1110051051.4e-11880.43Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI
        MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVM LVALIGLAIIVCTTISLNIVFP+DIANRPFCNDRRLQPLP+NG GGESDF+LGAFYLTNQ+I
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI

Query:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV
        VDYYWMLVFIPSV+AF AS VYLVAGI+VAYS PTRHGCLKVVENSYCASRR                                    GGVRCLTILN+V
Subjt:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV

Query:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        FAIIFGL+ALFLGSSLLTLGGSC+VPLFWCYEI+SWGLVILYGGTAFFLRRKAATILDEGDL G+N GLEMLVANP EITP
Subjt:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

A0A6J1HEP5 uncharacterized protein LOC1114628021.4e-11377.22Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI
        MAIIGDALRQAFMPKHEYENLREEEKAWGK+QKPLVM  V LIGLAIIVC+TISL IVFP DI NRPFC+DRRLQPLPMNG GG+SD  +GAFYLTNQ+I
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI

Query:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV
        VDY+WMLVFIPS  AFLAS VYL+AGI VAYSAPTRHGCLKVVENSYCASRR                                    GGVRCLTILN++
Subjt:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV

Query:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP
        FAIIFGL+ALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATIL EGDLGGRN GLEMLVANP EI+P
Subjt:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G32750.1 unknown protein3.2e-9162.06Show/hide
Query:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI
        MAIIGDALRQAFMPK EYE+LREE++AW KLQ+P ++ +VA +   I  CT +SL IVFP ++  RPFC+D +LQPLP+ G   +SD   GAFYLT+Q+ 
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQI

Query:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV
        VDYYWM+VF+PS I FL S+VYLVAGI VAYSAP RHG LKVVEN+YCASRR                                    GGVRCL+ILN+V
Subjt:  VDYYWMLVFIPSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIV

Query:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRN-RGLEMLVANPSEITP
        FAII+GL+A+FLGSSLLTLG SCSVPLFWCYEISSWGLVILY GTAF LRR+AA  +DEG+ G RN +GLEML ANP E TP
Subjt:  FAIIFGLMALFLGSSLLTLGGSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRN-RGLEMLVANPSEITP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTATTGGAGACGCGCTGCGTCAGGCGTTCATGCCAAAGCACGAGTATGAGAACCTTCGCGAAGAAGAAAAAGCTTGGGGCAAGCTTCAGAAACCGCTGGTAAT
GGTTCTTGTGGCTTTGATAGGGCTCGCAATTATTGTATGCACTACTATTAGTCTGAACATAGTCTTTCCCGACGACATCGCAAATAGACCATTCTGCAACGACAGGAGGC
TTCAGCCTCTCCCCATGAATGGGACAGGAGGCGAATCTGATTTTCTTCTTGGAGCTTTTTATCTCACCAATCAACAAATCGTGGACTATTATTGGATGCTCGTGTTCATT
CCCTCAGTAATCGCATTTCTTGCATCGGCTGTCTATCTTGTTGCAGGCATAGTTGTTGCTTATTCTGCTCCAACAAGACACGGTTGCTTGAAGGTAGTTGAAAATAGCTA
CTGTGCATCCAGAAGAGGTAAAGCCCTATCTTCTCCCGTGATCAGCTGCTTTAAGTTCTTGTGCTGTTTGTCATCATTGGTTATGGATGTTATAAAAGAAATTTATCTTA
TGTGGCTATCTGTAGGTGGAGTTCGCTGTCTTACCATCCTGAATATTGTTTTTGCCATCATCTTTGGCCTCATGGCATTATTTCTTGGTTCAAGTCTTCTCACCTTGGGT
GGCAGCTGCTCTGTGCCCCTGTTTTGGTGCTATGAGATCTCATCATGGGGACTAGTTATTCTCTATGGAGGAACTGCCTTCTTTTTAAGGAGAAAAGCAGCTACAATTCT
TGATGAAGGAGACCTCGGCGGTAGAAACCGCGGGCTGGAAATGTTGGTAGCAAATCCTTCGGAAATCACTCCGATGTGGAAAGGCGTGTCAATGAAGGATTCAAGGCTTG
GATGGGATCTTCTCTCTTATCGTCTGATGAAGAAGATGAACCTGATAGCTACGATGAAGTGGAAACCGGAGCAGCTGCAATGGAAGTTTCCGCGACCACAAGAGTTCGTC
TCTACCTCTCTACGTCTCTCGTTGCAGGTTTCCGGCATTTTCGCAGTTTCGAACTCCGTTTCCTTGTTCGTGCGTGATCCGTACGAGTTTATCTTCGTCTTCGTGATGCA
GCATCAGCCATTCAATATTTCAGCTAAAGGAAAACGAACTGAAAGTGTTACTCTGGGATGGCTGTCGCTGAGGAATTTACTGAACCTCCTGAGCATTCGCACACGCCGCG
GCAGTGTGCTCACCTACCTGTTTCATCAGCCCAATTGCATCGTAGCTCCTATCACCTGGGCTCATAAAGACAGTTTGCAAGTGCACATGGATCGTCAGATGATAATTTTG
AAGGGAAGTCTTGATCTAGTAAAGCTGTTGAGGCTCTACACTCAATTTCAAATACAAGAGCGAGTCATTCATATCACATTCAACAAGGATACGAGAAGGCTGTTGGACCT
GTACAGAGGAGATAAGAATTCTCTATGGCGTTTCTTCAAAATTGATGGAGTATTAGTGAAAGTTTTAGCAGCACTTTTCAACACAGCATCAGGACTATCATCTCGGGAAG
GAATGGAACATCCAAGCTTGGAAACCGAGGAGGCTCATAACAAAGAGCTCCAAGGTCCTGCTGATCTGAATAGGCACTCAATCCGACAGAAGTTTGATGTGAAACAGTCT
CTACTTCCTGAGGTCCAATTGCTTGAAATTGTGACAACTCCTGACCTAATTCTAGTGCAGAAATATTTGGTAACTCGTATAAATTATATTGGTTATCCCCAGTTGAAGTT
TTGCAATCCTGTGAGAAATTGTGACTGCTATCATCTATACTGCTCTGCACCCTTGAAGATGAAAGCATGGGTAGGCTTGATTGCACTGCATGATGCAGGGGTTGGTACTG
TTCAAGGAGTACAAAGCGTTGGATCACCTTCCATAGTTCCTGGACAGGCAGTAGATCCAGATGGATGCAGAGTGCTTCCTCTCCAAAAATCTCCTCATGGCATCCAGCAA
AATTGTAGCAGAACTGGCATAGGACACAGCTTTCACCCTTCAAGCAGCTCCCACCTTTTGAGCTTTATTGGGCGACCTCGATCGTTGTTGTTCCCCCTCGACCGAGCATG
CTTTCCACCTCTGTTGAACTTACCACCACCCCTACAGGATAGCAAGCAGCAGGAAACCATGGAAGATTATGCAAGAGACAAGCAAAAATCCAGTAGGAATTTCAGAAACA
ATTCAAAGAACTATTCGTCCACCTTCTGTGTTGCCCTTAAAATCGCGATGATGCTTTTCTTGATTGCTGCGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCAATTATTGGAGACGCGCTGCGTCAGGCGTTCATGCCAAAGCACGAGTATGAGAACCTTCGCGAAGAAGAAAAAGCTTGGGGCAAGCTTCAGAAACCGCTGGTAAT
GGTTCTTGTGGCTTTGATAGGGCTCGCAATTATTGTATGCACTACTATTAGTCTGAACATAGTCTTTCCCGACGACATCGCAAATAGACCATTCTGCAACGACAGGAGGC
TTCAGCCTCTCCCCATGAATGGGACAGGAGGCGAATCTGATTTTCTTCTTGGAGCTTTTTATCTCACCAATCAACAAATCGTGGACTATTATTGGATGCTCGTGTTCATT
CCCTCAGTAATCGCATTTCTTGCATCGGCTGTCTATCTTGTTGCAGGCATAGTTGTTGCTTATTCTGCTCCAACAAGACACGGTTGCTTGAAGGTAGTTGAAAATAGCTA
CTGTGCATCCAGAAGAGGTAAAGCCCTATCTTCTCCCGTGATCAGCTGCTTTAAGTTCTTGTGCTGTTTGTCATCATTGGTTATGGATGTTATAAAAGAAATTTATCTTA
TGTGGCTATCTGTAGGTGGAGTTCGCTGTCTTACCATCCTGAATATTGTTTTTGCCATCATCTTTGGCCTCATGGCATTATTTCTTGGTTCAAGTCTTCTCACCTTGGGT
GGCAGCTGCTCTGTGCCCCTGTTTTGGTGCTATGAGATCTCATCATGGGGACTAGTTATTCTCTATGGAGGAACTGCCTTCTTTTTAAGGAGAAAAGCAGCTACAATTCT
TGATGAAGGAGACCTCGGCGGTAGAAACCGCGGGCTGGAAATGTTGGTAGCAAATCCTTCGGAAATCACTCCGATGTGGAAAGGCGTGTCAATGAAGGATTCAAGGCTTG
GATGGGATCTTCTCTCTTATCGTCTGATGAAGAAGATGAACCTGATAGCTACGATGAAGTGGAAACCGGAGCAGCTGCAATGGAAGTTTCCGCGACCACAAGAGTTCGTC
TCTACCTCTCTACGTCTCTCGTTGCAGGTTTCCGGCATTTTCGCAGTTTCGAACTCCGTTTCCTTGTTCGTGCGTGATCCGTACGAGTTTATCTTCGTCTTCGTGATGCA
GCATCAGCCATTCAATATTTCAGCTAAAGGAAAACGAACTGAAAGTGTTACTCTGGGATGGCTGTCGCTGAGGAATTTACTGAACCTCCTGAGCATTCGCACACGCCGCG
GCAGTGTGCTCACCTACCTGTTTCATCAGCCCAATTGCATCGTAGCTCCTATCACCTGGGCTCATAAAGACAGTTTGCAAGTGCACATGGATCGTCAGATGATAATTTTG
AAGGGAAGTCTTGATCTAGTAAAGCTGTTGAGGCTCTACACTCAATTTCAAATACAAGAGCGAGTCATTCATATCACATTCAACAAGGATACGAGAAGGCTGTTGGACCT
GTACAGAGGAGATAAGAATTCTCTATGGCGTTTCTTCAAAATTGATGGAGTATTAGTGAAAGTTTTAGCAGCACTTTTCAACACAGCATCAGGACTATCATCTCGGGAAG
GAATGGAACATCCAAGCTTGGAAACCGAGGAGGCTCATAACAAAGAGCTCCAAGGTCCTGCTGATCTGAATAGGCACTCAATCCGACAGAAGTTTGATGTGAAACAGTCT
CTACTTCCTGAGGTCCAATTGCTTGAAATTGTGACAACTCCTGACCTAATTCTAGTGCAGAAATATTTGGTAACTCGTATAAATTATATTGGTTATCCCCAGTTGAAGTT
TTGCAATCCTGTGAGAAATTGTGACTGCTATCATCTATACTGCTCTGCACCCTTGAAGATGAAAGCATGGGTAGGCTTGATTGCACTGCATGATGCAGGGGTTGGTACTG
TTCAAGGAGTACAAAGCGTTGGATCACCTTCCATAGTTCCTGGACAGGCAGTAGATCCAGATGGATGCAGAGTGCTTCCTCTCCAAAAATCTCCTCATGGCATCCAGCAA
AATTGTAGCAGAACTGGCATAGGACACAGCTTTCACCCTTCAAGCAGCTCCCACCTTTTGAGCTTTATTGGGCGACCTCGATCGTTGTTGTTCCCCCTCGACCGAGCATG
CTTTCCACCTCTGTTGAACTTACCACCACCCCTACAGGATAGCAAGCAGCAGGAAACCATGGAAGATTATGCAAGAGACAAGCAAAAATCCAGTAGGAATTTCAGAAACA
ATTCAAAGAACTATTCGTCCACCTTCTGTGTTGCCCTTAAAATCGCGATGATGCTTTTCTTGATTGCTGCGGAGTAG
Protein sequenceShow/hide protein sequence
MAIIGDALRQAFMPKHEYENLREEEKAWGKLQKPLVMVLVALIGLAIIVCTTISLNIVFPDDIANRPFCNDRRLQPLPMNGTGGESDFLLGAFYLTNQQIVDYYWMLVFI
PSVIAFLASAVYLVAGIVVAYSAPTRHGCLKVVENSYCASRRGKALSSPVISCFKFLCCLSSLVMDVIKEIYLMWLSVGGVRCLTILNIVFAIIFGLMALFLGSSLLTLG
GSCSVPLFWCYEISSWGLVILYGGTAFFLRRKAATILDEGDLGGRNRGLEMLVANPSEITPMWKGVSMKDSRLGWDLLSYRLMKKMNLIATMKWKPEQLQWKFPRPQEFV
STSLRLSLQVSGIFAVSNSVSLFVRDPYEFIFVFVMQHQPFNISAKGKRTESVTLGWLSLRNLLNLLSIRTRRGSVLTYLFHQPNCIVAPITWAHKDSLQVHMDRQMIIL
KGSLDLVKLLRLYTQFQIQERVIHITFNKDTRRLLDLYRGDKNSLWRFFKIDGVLVKVLAALFNTASGLSSREGMEHPSLETEEAHNKELQGPADLNRHSIRQKFDVKQS
LLPEVQLLEIVTTPDLILVQKYLVTRINYIGYPQLKFCNPVRNCDCYHLYCSAPLKMKAWVGLIALHDAGVGTVQGVQSVGSPSIVPGQAVDPDGCRVLPLQKSPHGIQQ
NCSRTGIGHSFHPSSSSHLLSFIGRPRSLLFPLDRACFPPLLNLPPPLQDSKQQETMEDYARDKQKSSRNFRNNSKNYSSTFCVALKIAMMLFLIAAE