; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018621 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018621
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCBS domain-containing protein CBSX1, chloroplastic-like
Genome locationChr04:5949492..5956574
RNA-Seq ExpressionHG10018621
SyntenyHG10018621
Gene Ontology termsGO:0045454 - cell redox homeostasis (biological process)
InterPro domainsIPR000644 - CBS domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK06762.1 CBS domain-containing protein CBSX2 [Cucumis melo var. makuwa]5.6e-8282.84Show/hide
Query:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS
        MASISTPYVPS   NSRL  TQFR  +AG   +S PSSLFRSP +ALAFSGHRVASS PFR +GSYTVGDFM KKGNL VLKPSTS++EALEVLVEK++S
Subjt:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS

Query:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDF
        GFPVVDDDWKLVGVVSDYDLLALDSISGVGGG+  NIFPDVN SW+SFKLIQ LLSKKNGE+VGDLMTPAPLVV E MN ENAARLLLETKFHRLPVVD 
Subjt:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDF

Query:  EGKL
        EGKL
Subjt:  EGKL

XP_004136971.1 CBS domain-containing protein CBSX2, chloroplastic [Cucumis sativus]3.6e-8182.27Show/hide
Query:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSG
        MASISTPYVPSV PNSRL  TQ R  +AG          +RSP VALAFSGHRV+SS PFR+GSY VGDFM KKGNLQVLKPSTSV+EALEVLVEK+LSG
Subjt:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSG

Query:  FPVVDDDWKLVGVVSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFE
        FPVVDDDWKLVGVVSDYDLLALDSISGVGGG+  NIFPDVN SW+SFKLIQ LLSKKNGEVVGDLMTPAPLVV E MN ENAARLLLETKFHRLPVVD E
Subjt:  FPVVDDDWKLVGVVSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFE

Query:  GKL
        GKL
Subjt:  GKL

XP_008455519.1 PREDICTED: CBS domain-containing protein CBSX2, chloroplastic [Cucumis melo]4.6e-7683.16Show/hide
Query:  NSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGV
        NSRL  TQFR  +AG   +S PSSLFRSP +ALAFSGHRVASS PFR +GSYTVGDFM KKGNL VLKPSTS++EALEVLVEK++SGFPVVDDDWKLVGV
Subjt:  NSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGV

Query:  VSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKL
        VSDYDLLALDSISGVGGG+  NIFPDVN SW+SFKLIQ LLSKKNGE+VGDLMTPAPLVV E MN ENAARLLLETKFHRLPVVD EGKL
Subjt:  VSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKL

XP_022155650.1 CBS domain-containing protein CBSX1, chloroplastic-like [Momordica charantia]6.6e-7575.24Show/hide
Query:  MASISTPYV-PSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS
        MASISTPYV P++LPNSR LQ +FR + A +     PSS  RSP+V  A SGHR+A+SAP RSGSYTVGDFM KKGNLQV+KPST++DEALEVLVE  LS
Subjt:  MASISTPYV-PSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS

Query:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGG--ETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVD
        GFPVVD DWKLVGVVSDYDLLA+DSISGVGGG  ETNIFPDVN SWKSFK IQ L+SK NGEV+GDLMTPAPLVVRE  +LENAARLLLETKFH LPVVD
Subjt:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGG--ETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVD

Query:  FEGKLCRHLS
         +GKL  +++
Subjt:  FEGKLCRHLS

XP_038888405.1 CBS domain-containing protein CBSX1, chloroplastic-like [Benincasa hispida]8.8e-9692.57Show/hide
Query:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSG
        MASISTPYVPSVLPNSRLLQTQFRLTYAGAG+NS PSSLFRSPAVALAFSGHRVASS+ FR GSYTVGDFM KKGNLQVLKPSTSVDEALEVLVEK+LSG
Subjt:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSG

Query:  FPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEG
        FPVVDDDWKLVGVVSDYDLLALDSISGVG  E NIFPDVNSSWKSFKLIQ LLSKKNGEVVGDLMTPAPLVVRE MNLE+AARLLLETKFH LPVVD EG
Subjt:  FPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEG

Query:  KL
        KL
Subjt:  KL

TrEMBL top hitse value%identityAlignment
A0A0A0K7X9 Uncharacterized protein1.7e-8182.27Show/hide
Query:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSG
        MASISTPYVPSV PNSRL  TQ R  +AG          +RSP VALAFSGHRV+SS PFR+GSY VGDFM KKGNLQVLKPSTSV+EALEVLVEK+LSG
Subjt:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSG

Query:  FPVVDDDWKLVGVVSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFE
        FPVVDDDWKLVGVVSDYDLLALDSISGVGGG+  NIFPDVN SW+SFKLIQ LLSKKNGEVVGDLMTPAPLVV E MN ENAARLLLETKFHRLPVVD E
Subjt:  FPVVDDDWKLVGVVSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFE

Query:  GKL
        GKL
Subjt:  GKL

A0A1S3C189 CBS domain-containing protein CBSX2, chloroplastic2.2e-7683.16Show/hide
Query:  NSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGV
        NSRL  TQFR  +AG   +S PSSLFRSP +ALAFSGHRVASS PFR +GSYTVGDFM KKGNL VLKPSTS++EALEVLVEK++SGFPVVDDDWKLVGV
Subjt:  NSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGV

Query:  VSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKL
        VSDYDLLALDSISGVGGG+  NIFPDVN SW+SFKLIQ LLSKKNGE+VGDLMTPAPLVV E MN ENAARLLLETKFHRLPVVD EGKL
Subjt:  VSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKL

A0A5A7SKP3 CBS domain-containing protein CBSX23.3e-7276.85Show/hide
Query:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS
        MASISTPYVPS   NSRL  TQFR  +AG   +S PSSLFRSP +ALAFSGHRVASS PFR +GSYTVGDFM KKGNL VLKPSTS++EALEVLVEK++S
Subjt:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS

Query:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFE
        GFPVVDDDWKLVGVVSDYDLLALDSIS        +   +   W+SFKLIQ LLSKKNGE+VGDLMTPAPLVV E MN ENAARLLLETKFHRLPVVD E
Subjt:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFE

Query:  GKL
        GKL
Subjt:  GKL

A0A5D3C646 CBS domain-containing protein CBSX22.7e-8282.84Show/hide
Query:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS
        MASISTPYVPS   NSRL  TQFR  +AG   +S PSSLFRSP +ALAFSGHRVASS PFR +GSYTVGDFM KKGNL VLKPSTS++EALEVLVEK++S
Subjt:  MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFR-SGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS

Query:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDF
        GFPVVDDDWKLVGVVSDYDLLALDSISGVGGG+  NIFPDVN SW+SFKLIQ LLSKKNGE+VGDLMTPAPLVV E MN ENAARLLLETKFHRLPVVD 
Subjt:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGGE-TNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDF

Query:  EGKL
        EGKL
Subjt:  EGKL

A0A6J1DN09 CBS domain-containing protein CBSX1, chloroplastic-like3.2e-7575.24Show/hide
Query:  MASISTPYV-PSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS
        MASISTPYV P++LPNSR LQ +FR + A +     PSS  RSP+V  A SGHR+A+SAP RSGSYTVGDFM KKGNLQV+KPST++DEALEVLVE  LS
Subjt:  MASISTPYV-PSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLS

Query:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGG--ETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVD
        GFPVVD DWKLVGVVSDYDLLA+DSISGVGGG  ETNIFPDVN SWKSFK IQ L+SK NGEV+GDLMTPAPLVVRE  +LENAARLLLETKFH LPVVD
Subjt:  GFPVVDDDWKLVGVVSDYDLLALDSISGVGGG--ETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVD

Query:  FEGKLCRHLS
         +GKL  +++
Subjt:  FEGKLCRHLS

SwissProt top hitse value%identityAlignment
O23193 CBS domain-containing protein CBSX1, chloroplastic2.3e-5456.22Show/hide
Query:  ASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGF
        +S S+PY+  +LP    +Q   + T++     S PS   R P+ + A     + +S+  RSG YTVG+FM KK +L V+KP+T+VDEALE+LVE  ++GF
Subjt:  ASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGF

Query:  PVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGK
        PV+D+DWKLVG+VSDYDLLALDSISG G  E ++FP+V+S+WK+F  +Q LLSK NG++VGDLMTPAPLVV E  NLE+AA++LLETK+ RLPVVD +GK
Subjt:  PVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGK

Query:  L
        L
Subjt:  L

O58045 Inosine-5'-monophosphate dehydrogenase2.3e-0627.64Show/hide
Query:  LKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLE
        + P  +VD AL ++ +  + G PVV+D+ K+VG+++  D+ A                                  + G++V +LMT   + V E++ +E
Subjt:  LKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLE

Query:  NAARLLLETKFHRLPVVDFEGKL
         A ++++E +  RLPVVD  GKL
Subjt:  NAARLLLETKFHRLPVVDFEGKL

O67820 Inosine-5'-monophosphate dehydrogenase2.3e-0629.92Show/hide
Query:  NLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVREN
        N   +KP T V EAL+++ +  +SG PVVD++ KL+G++++ DL               I P+  S     K +   ++K+N       +  AP    E 
Subjt:  NLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVREN

Query:  MNLENAARLLLETKFHRLPVVDFEGKL
        + L+ A  +  + K  +LP+VD EGK+
Subjt:  MNLENAARLLLETKFHRLPVVDFEGKL

Q9C5D0 CBS domain-containing protein CBSX2, chloroplastic5.6e-5368.71Show/hide
Query:  SSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSK
        +S P ++G YTVGDFM  + NL V+KPSTSVD+ALE+LVEK ++G PV+DD+W LVGVVSDYDLLALDSISG    +TN+FPDV+S+WK+F  +Q L+SK
Subjt:  SSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSK

Query:  KNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKL
          G+VVGDLMTP+PLVVR++ NLE+AARLLLETKF RLPVVD +GKL
Subjt:  KNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKL

Q9UY49 Inosine-5'-monophosphate dehydrogenase4.7e-0728.46Show/hide
Query:  LKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLE
        + P  +VD AL ++ + ++ G PVV+++ K+VG++S  D+ A                                  + G++V +LMT   + V EN+ +E
Subjt:  LKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLE

Query:  NAARLLLETKFHRLPVVDFEGKL
         A ++++E +  RLPVVD EG+L
Subjt:  NAARLLLETKFHRLPVVDFEGKL

Arabidopsis top hitse value%identityAlignment
AT4G34120.1 Cystathionine beta-synthase (CBS) family protein4.0e-5468.71Show/hide
Query:  SSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSK
        +S P ++G YTVGDFM  + NL V+KPSTSVD+ALE+LVEK ++G PV+DD+W LVGVVSDYDLLALDSISG    +TN+FPDV+S+WK+F  +Q L+SK
Subjt:  SSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSK

Query:  KNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKL
          G+VVGDLMTP+PLVVR++ NLE+AARLLLETKF RLPVVD +GKL
Subjt:  KNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKL

AT4G36910.1 Cystathionine beta-synthase (CBS) family protein1.6e-5556.22Show/hide
Query:  ASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGF
        +S S+PY+  +LP    +Q   + T++     S PS   R P+ + A     + +S+  RSG YTVG+FM KK +L V+KP+T+VDEALE+LVE  ++GF
Subjt:  ASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGF

Query:  PVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGK
        PV+D+DWKLVG+VSDYDLLALDSISG G  E ++FP+V+S+WK+F  +Q LLSK NG++VGDLMTPAPLVV E  NLE+AA++LLETK+ RLPVVD +GK
Subjt:  PVVDDDWKLVGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGK

Query:  L
        L
Subjt:  L


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCGATTTCGACGCCTTACGTTCCCTCTGTTCTTCCTAATTCACGATTGCTACAGACGCAATTCCGGCTGACCTACGCCGGCGCCGGGCTTAATTCATTGCCGTC
GAGTCTGTTTCGGTCTCCGGCTGTCGCTTTGGCATTTTCCGGCCACCGTGTGGCCAGTTCTGCACCGTTCAGGAGTGGATCGTATACGGTTGGGGATTTCATGATGAAGA
AGGGGAATCTGCAGGTTCTTAAGCCTTCTACGAGTGTTGACGAAGCCTTGGAGGTTCTGGTGGAGAAGAATCTATCTGGATTTCCTGTAGTTGATGACGACTGGAAACTG
GTTGGCGTTGTCTCAGATTATGATTTGTTAGCATTGGACTCAATTTCAGGTGTGGGTGGTGGTGAGACTAATATATTTCCAGATGTCAACAGTAGCTGGAAAAGCTTCAA
GTTGATACAGACATTGTTGAGCAAGAAGAATGGGGAAGTTGTAGGAGATTTAATGACACCTGCTCCATTGGTTGTTCGTGAAAACATGAACTTGGAAAATGCTGCCAGGT
TGTTGCTTGAAACAAAGTTCCACCGTTTACCGGTGGTAGACTTTGAAGGGAAGCTGTGTCGTCATCTCTCAACAGTTTCGCCGCCGCTGCCGCCGCCAGGGTTCGTTCGA
CGCAGATCCGACGACCCAGATCCGCTGCTGCCGCCGCTAGGTTTCGTTCAACGCAGATCCGCCGCCGCCGCCGCCAGATTTCGTTCGACGCAGATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCGATTTCGACGCCTTACGTTCCCTCTGTTCTTCCTAATTCACGATTGCTACAGACGCAATTCCGGCTGACCTACGCCGGCGCCGGGCTTAATTCATTGCCGTC
GAGTCTGTTTCGGTCTCCGGCTGTCGCTTTGGCATTTTCCGGCCACCGTGTGGCCAGTTCTGCACCGTTCAGGAGTGGATCGTATACGGTTGGGGATTTCATGATGAAGA
AGGGGAATCTGCAGGTTCTTAAGCCTTCTACGAGTGTTGACGAAGCCTTGGAGGTTCTGGTGGAGAAGAATCTATCTGGATTTCCTGTAGTTGATGACGACTGGAAACTG
GTTGGCGTTGTCTCAGATTATGATTTGTTAGCATTGGACTCAATTTCAGGTGTGGGTGGTGGTGAGACTAATATATTTCCAGATGTCAACAGTAGCTGGAAAAGCTTCAA
GTTGATACAGACATTGTTGAGCAAGAAGAATGGGGAAGTTGTAGGAGATTTAATGACACCTGCTCCATTGGTTGTTCGTGAAAACATGAACTTGGAAAATGCTGCCAGGT
TGTTGCTTGAAACAAAGTTCCACCGTTTACCGGTGGTAGACTTTGAAGGGAAGCTGTGTCGTCATCTCTCAACAGTTTCGCCGCCGCTGCCGCCGCCAGGGTTCGTTCGA
CGCAGATCCGACGACCCAGATCCGCTGCTGCCGCCGCTAGGTTTCGTTCAACGCAGATCCGCCGCCGCCGCCGCCAGATTTCGTTCGACGCAGATCTAA
Protein sequenceShow/hide protein sequence
MASISTPYVPSVLPNSRLLQTQFRLTYAGAGLNSLPSSLFRSPAVALAFSGHRVASSAPFRSGSYTVGDFMMKKGNLQVLKPSTSVDEALEVLVEKNLSGFPVVDDDWKL
VGVVSDYDLLALDSISGVGGGETNIFPDVNSSWKSFKLIQTLLSKKNGEVVGDLMTPAPLVVRENMNLENAARLLLETKFHRLPVVDFEGKLCRHLSTVSPPLPPPGFVR
RRSDDPDPLLPPLGFVQRRSAAAAARFRSTQI