CuGenDBv2

MS001225 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001225
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Transmembrane protein
Genome location: scaffold36:2434385..2437118
RNA-Seq Expression: MS001225
Synteny: MS001225
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
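The Genome location value packs a sequence name and a coordinate range into one string. A minimal sketch for splitting it, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` is a hypothetical helper name, not part of CuGenDBv2:

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


print(parse_location("scaffold36:2434385..2437118"))
# → ('scaffold36', 2434385, 2437118, 2734)
```

Under that assumption the gene spans 2734 bases on scaffold36.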


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG7029724.1 hypothetical protein SDJN02_08066 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 8.5e-143 | % identity: 88.66
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
        MAIIGDALRQAFMPKHEYENLREEEK WGK+QKPLVMA   LIGLAIIVCTTISL IVFP DI NRPFC+DRRLQPLP+NGKGG+SD  +GAFYLTNQEI
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI

Query:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
        VDY+WMLVFIPS  AF ASVVYL+AGI VAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNV+FAIIFGLLALFLGSSLLTLGGSC+VPLFWCYEI+SW
Subjt:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW

Query:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHI
        GLVILYGGTAFFLRRKAATIL EGDL G+NLGLEMLVANPLEI+PDVERRV+EGFKAWMGSSLLSSDEEDE DSY+ VTSH NH NSNRHI
Subjt:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHI

XP_008445033.1 PREDICTED: uncharacterized protein LOC103488196 [Cucumis melo] | e value: 5.9e-144 | % identity: 90.44
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLP-VNGKGGESDFVLGAFYLTNQE
        MAIIGDALRQAFMPKHEYENLREEEK WGKLQKP+VMALVALIGLAIIVCTTISLNIVFP+DI NRPFC+DRRLQPLP +NGKGGESD  LGAFYLTNQE
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLP-VNGKGGESDFVLGAFYLTNQE

Query:  IVDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITS
        IVDYYWMLVFIPSVVAF AS VYLVAGI+VAYSAP+RHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSC+VPLFWCYEI+S
Subjt:  IVDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITS

Query:  WGLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTN-SNRHI
        WGLVILYGGTAFFLRRK+ATIL EGDL G+NLGLEMLVANP+EITPD+ERRV+EGFKAWMGSSLLSSDEEDE DSYE VTSHMNH N +N HI
Subjt:  WGLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTN-SNRHI

XP_022132184.1 uncharacterized protein LOC111005105 [Momordica charantia] | e value: 5.0e-159 | % identity: 99.32
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
        MAIIGDALRQAFMPKHEYENLREEEK WGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI

Query:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
        VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYS PTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
Subjt:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW

Query:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHIA
        GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHIA
Subjt:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHIA

XP_023547204.1 uncharacterized protein LOC111806085 [Cucurbita pepo subsp. pepo] | e value: 2.9e-143 | % identity: 88.66
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
        MAIIGDALRQAFMPKHEYENLREEEK WGK+QKPLVMA V LIG AIIVCTTISL IVFP DI NRPFC+DRRLQPLP+NGKGG+SD  +GAFYLTNQEI
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI

Query:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
        VDY+WMLVFIPS  AF ASVVYL+AGI VAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNV+FAIIFGLLALFLGSSLLTLGGSC+VPLFWCYEI+SW
Subjt:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW

Query:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHI
        GLVILYGGTAFFLRRKAATIL EGDL G+NLGLEMLVANPLEI+PDVERRV+EGFKAWMGSSLLSSDEEDE DSY+ V SHMNH NSNRHI
Subjt:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHI

XP_038886522.1 uncharacterized protein LOC120076695 [Benincasa hispida] | e value: 1.2e-144 | % identity: 90.03
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
        MAIIGDALRQAFMPKHEYENLREEEK WGKLQKPL+MALVALIG+ IIVCT ISLNIVFP+DI NRPFC+DRRLQPLP+NGK GESD +LGAFYLTNQEI
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI

Query:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
        VDYYWMLVFIPSVVAF AS VYLVAGI+VAYSAPTRH CLKVVENSYCASRRGGVRCLTILNV+FAIIFGLLALFLGSSLLTLGGSC+VPLFWCYEI+SW
Subjt:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW

Query:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHI
        GLVILYGGTAFFLRRKAATIL EG+L G+NLGLEMLVANP+EITPDVERRV+EGFKAWMGSSLLSSDEEDE DSYE VTSHMNH NSNR I
Subjt:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHI

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LS39 Uncharacterized protein | e value: 5.4e-143 | % identity: 89.76
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLP-VNGKGGESDFVLGAFYLTNQE
        MAIIGDALRQAFMPKHEYENLREEEK WGKLQKPLVMALVALIGLAIIVCT+ISLNIVFP+DI NRPFC+DRRLQPLP +NGKGGESD  LGAFYLTNQE
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLP-VNGKGGESDFVLGAFYLTNQE

Query:  IVDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITS
        IVDYYWMLVFIPSVVAF AS +YLVAGI+VAYSAP+RHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSC+VPLFWCYEI+S
Subjt:  IVDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITS

Query:  WGLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTN-SNRHI
        WGLVILYGGTAFFLRRK+ATIL EGDL  +NLGLEMLVANP+EITPDVERRV+EGFKAWMGSSLLSSDEEDE DSYE VTSH+NH N +N HI
Subjt:  WGLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTN-SNRHI

A0A1S3BBR1 uncharacterized protein LOC103488196 | e value: 2.9e-144 | % identity: 90.44
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLP-VNGKGGESDFVLGAFYLTNQE
        MAIIGDALRQAFMPKHEYENLREEEK WGKLQKP+VMALVALIGLAIIVCTTISLNIVFP+DI NRPFC+DRRLQPLP +NGKGGESD  LGAFYLTNQE
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLP-VNGKGGESDFVLGAFYLTNQE

Query:  IVDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITS
        IVDYYWMLVFIPSVVAF AS VYLVAGI+VAYSAP+RHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSC+VPLFWCYEI+S
Subjt:  IVDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITS

Query:  WGLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTN-SNRHI
        WGLVILYGGTAFFLRRK+ATIL EGDL G+NLGLEMLVANP+EITPD+ERRV+EGFKAWMGSSLLSSDEEDE DSYE VTSHMNH N +N HI
Subjt:  WGLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTN-SNRHI

A0A5A7VBN1 Uncharacterized protein | e value: 2.9e-144 | % identity: 90.44
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLP-VNGKGGESDFVLGAFYLTNQE
        MAIIGDALRQAFMPKHEYENLREEEK WGKLQKP+VMALVALIGLAIIVCTTISLNIVFP+DI NRPFC+DRRLQPLP +NGKGGESD  LGAFYLTNQE
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLP-VNGKGGESDFVLGAFYLTNQE

Query:  IVDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITS
        IVDYYWMLVFIPSVVAF AS VYLVAGI+VAYSAP+RHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSC+VPLFWCYEI+S
Subjt:  IVDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITS

Query:  WGLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTN-SNRHI
        WGLVILYGGTAFFLRRK+ATIL EGDL G+NLGLEMLVANP+EITPD+ERRV+EGFKAWMGSSLLSSDEEDE DSYE VTSHMNH N +N HI
Subjt:  WGLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTN-SNRHI

A0A6J1BSC6 uncharacterized protein LOC111005105 | e value: 2.4e-159 | % identity: 99.32
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
        MAIIGDALRQAFMPKHEYENLREEEK WGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI

Query:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
        VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYS PTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
Subjt:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW

Query:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHIA
        GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHIA
Subjt:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHIA

A0A6J1HEP5 uncharacterized protein LOC111462802 | e value: 7.1e-143 | % identity: 88.32
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
        MAIIGDALRQAFMPKHEYENLREEEK WGK+QKPLVMA V LIGLAIIVC+TISL IVFP DI NRPFC+DRRLQPLP+NGKGG+SD  +GAFYLTNQEI
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI

Query:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
        VDY+WMLVFIPS  AF ASVVYL+AGI VAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNV+FAIIFGLLALFLGSSLLTLGGSC+VPLFWCYEI+SW
Subjt:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW

Query:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHI
        GLVILYGGTAFFLRRKAATIL EGDL G+NLGLEMLVANPLEI+PDVERRV+EGFKAWMGSSLLSSDEEDE DSY+ V SH NH NSNRHI
Subjt:  GLVILYGGTAFFLRRKAATILDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHI

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT4G32750.1 unknown protein | e value: 3.2e-111 | % identity: 68.62
Query:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI
        MAIIGDALRQAFMPK EYE+LREE++ W KLQ+P ++++VA +   I  CT +SL IVFP+++  RPFC+D +LQPLP+ GK  +SD   GAFYLT+QE 
Subjt:  MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEI

Query:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW
        VDYYWM+VF+PS + F  S VYLVAGI VAYSAP RHG LKVVEN+YCASRRGGVRCL+ILNVVFAII+GLLA+FLGSSLLTLG SC+VPLFWCYEI+SW
Subjt:  VDYYWMLVFIPSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSW

Query:  GLVILYGGTAFFLRRKAATILDEGDLSGQN-LGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNR
        GLVILY GTAF LRR+AA  +DEG+   +N  GLEML ANPLE TPDVERRV+EGFKAWMG SLLSSDEE++   +     ++ HT S+R
Subjt:  GLVILYGGTAFFLRRKAATILDEGDLSGQN-LGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNR
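In the alignments above, the line between Query and Subjt follows the usual BLAST midline convention: a letter marks an identical residue, + marks a conservative substitution, and a blank marks a mismatch or gap. A rough sketch of recomputing % identity from a midline, assuming identity is defined as identical positions divided by alignment length; `percent_identity` is a hypothetical helper, and the toy strings are illustrative, not taken from this record:

```python
def percent_identity(midline, alignment_length):
    """Estimate % identity from a BLAST-style midline.

    Assumption: every alphabetic character in the midline is an
    identical position; '+' and blanks are not counted as identities.
    alignment_length is the number of aligned columns, gaps included.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / alignment_length


# Toy example: 6 aligned columns, 4 identical, 1 conservative, 1 mismatch.
toy_midline = "MA+I G"
print(round(percent_identity(toy_midline, 6), 2))
# → 66.67
```

For a hit split across several blocks, the midlines of all blocks would be concatenated before counting.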


Sequences
CDS sequence
ATGGCAATTATTGGAGACGCACTTCGCCAGGCGTTCATGCCAAAGCACGAGTATGAGAACCTCCGCGAAGAAGAAAAAACTTGGGGCAAGCTTCAGAAACCGTTGGTTAT
GGCTCTCGTGGCTTTGATAGGGCTCGCAATTATTGTATGCACTACTATTAGTCTGAACATAGTCTTCCCCAACGACATCGCAAATAGACCATTCTGCAACGACAGGAGGC
TTCAGCCTCTCCCCGTAAATGGAAAAGGAGGCGAATCTGATTTTGTTCTTGGAGCTTTTTATCTCACGAATCAAGAAATCGTGGACTATTATTGGATGCTCGTGTTCATC
CCCTCAGTAGTCGCATTTGCTGCATCGGTTGTCTATCTTGTTGCGGGCATAATTGTTGCTTATTCTGCTCCAACTAGACACGGTTGCTTGAAAGTAGTTGAGAATAGCTA
CTGTGCATCCAGAAGAGGTGGAGTTCGCTGTCTTACCATCTTGAATGTTGTTTTTGCCATCATCTTTGGCCTTTTGGCGCTATTTCTTGGTTCAAGTCTTCTCACCTTGG
GTGGGAGCTGCGCAGTGCCCCTGTTTTGGTGTTATGAGATCACATCATGGGGTCTAGTTATTCTCTATGGAGGAACTGCATTCTTTTTAAGGAGAAAAGCAGCTACCATT
CTTGACGAGGGAGACCTCAGTGGTCAAAACCTCGGGCTGGAAATGTTGGTAGCAAATCCCTTGGAGATCACTCCCGATGTGGAAAGGCGTGTCAGTGAAGGATTCAAGGC
TTGGATGGGATCTTCTCTCTTATCCTCTGATGAAGAAGATGAATCTGATAGCTACGAAGGCGTAACATCCCATATGAACCATACTAACTCTAACAGGCACATAGCT
mRNA sequence
ATGGCAATTATTGGAGACGCACTTCGCCAGGCGTTCATGCCAAAGCACGAGTATGAGAACCTCCGCGAAGAAGAAAAAACTTGGGGCAAGCTTCAGAAACCGTTGGTTAT
GGCTCTCGTGGCTTTGATAGGGCTCGCAATTATTGTATGCACTACTATTAGTCTGAACATAGTCTTCCCCAACGACATCGCAAATAGACCATTCTGCAACGACAGGAGGC
TTCAGCCTCTCCCCGTAAATGGAAAAGGAGGCGAATCTGATTTTGTTCTTGGAGCTTTTTATCTCACGAATCAAGAAATCGTGGACTATTATTGGATGCTCGTGTTCATC
CCCTCAGTAGTCGCATTTGCTGCATCGGTTGTCTATCTTGTTGCGGGCATAATTGTTGCTTATTCTGCTCCAACTAGACACGGTTGCTTGAAAGTAGTTGAGAATAGCTA
CTGTGCATCCAGAAGAGGTGGAGTTCGCTGTCTTACCATCTTGAATGTTGTTTTTGCCATCATCTTTGGCCTTTTGGCGCTATTTCTTGGTTCAAGTCTTCTCACCTTGG
GTGGGAGCTGCGCAGTGCCCCTGTTTTGGTGTTATGAGATCACATCATGGGGTCTAGTTATTCTCTATGGAGGAACTGCATTCTTTTTAAGGAGAAAAGCAGCTACCATT
CTTGACGAGGGAGACCTCAGTGGTCAAAACCTCGGGCTGGAAATGTTGGTAGCAAATCCCTTGGAGATCACTCCCGATGTGGAAAGGCGTGTCAGTGAAGGATTCAAGGC
TTGGATGGGATCTTCTCTCTTATCCTCTGATGAAGAAGATGAATCTGATAGCTACGAAGGCGTAACATCCCATATGAACCATACTAACTCTAACAGGCACATAGCT
Protein sequence
MAIIGDALRQAFMPKHEYENLREEEKTWGKLQKPLVMALVALIGLAIIVCTTISLNIVFPNDIANRPFCNDRRLQPLPVNGKGGESDFVLGAFYLTNQEIVDYYWMLVFI
PSVVAFAASVVYLVAGIIVAYSAPTRHGCLKVVENSYCASRRGGVRCLTILNVVFAIIFGLLALFLGSSLLTLGGSCAVPLFWCYEITSWGLVILYGGTAFFLRRKAATI
LDEGDLSGQNLGLEMLVANPLEITPDVERRVSEGFKAWMGSSLLSSDEEDESDSYEGVTSHMNHTNSNRHIA
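The protein sequence above should correspond to a translation of the CDS with the standard genetic code. A small sketch for checking this, assuming the standard code (NCBI translation table 1) applies and that the CDS is pasted as one string without line breaks; `translate` is a hypothetical helper name:

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1.

    Unknown codons (for example ones containing N) become 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, usable, 3)
    )


# First 33 bases of the CDS sequence listed above.
cds_prefix = "ATGGCAATTATTGGAGACGCACTTCGCCAGGCG"
print(translate(cds_prefix))
# → MAIIGDALRQA
```

The printed prefix matches the start of the protein sequence above; the same call on the full CDS string can be compared against the full protein.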