CuGenDBv2

CsGy3G029533 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy3G029533
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: heme-binding-like protein At3g10130, chloroplastic
Genome location: Gy14Chr3:29987840..29988579
RNA-Seq Expression: CsGy3G029533
Synteny: CsGy3G029533
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR006917 - SOUL haem-binding protein
InterPro domains: IPR011256 - Regulatory factor, effector binding domain superfamily
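
The genome location listed above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; this page does not state the convention), the string can be split into its parts as shown here. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

# Hypothetical helper; assumes "seqid:start..end" with 1-based inclusive coordinates.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")

def parse_location(text):
    """Split a location string into (seqid, start, end, span length)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start = int(match.group("start"))
    end = int(match.group("end"))
    # Under the inclusive-coordinate assumption the span covers end - start + 1 bases.
    return match.group("seqid"), start, end, end - start + 1

print(parse_location("Gy14Chr3:29987840..29988579"))
# ('Gy14Chr3', 29987840, 29988579, 740)
```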


Homology
GenBank top hits (e value, % identity, alignment)
XP_016899693.1 PREDICTED: heme-binding-like protein At3g10130, chloroplastic [Cucumis melo] (e value: 3.42e-133; % identity: 97.98)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLILGKISVETPKYEL+QSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGG E
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVE+LKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

XP_022934955.1 heme-binding-like protein At3g10130, chloroplastic [Cucurbita moschata] (e value: 5.01e-126; % identity: 92.93)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLI GKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPAD  V I+EEGERK+AVVRFSGIATEGVVA+KVEKLKKSLEKDG KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

XP_023527547.1 heme-binding-like protein At3g10130, chloroplastic [Cucurbita pepo subsp. pepo] (e value: 1.38e-123; % identity: 91.41)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLI GKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQN+KSEKVAMTAPVITKSEKISMTAPVVT G GGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPAD  V I+EEGERK+AVVRFSGIATEGVVA+KVE LKKSLEKDG KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

XP_031737956.1 heme-binding-like protein At3g10130, chloroplastic [Cucumis sativus] (e value: 6.16e-136; % identity: 100)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

XP_038906231.1 heme-binding-like protein At3g10130, chloroplastic [Benincasa hispida] (e value: 5.23e-128; % identity: 92.93)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLILGKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIG+PQNIKSEKVAMTAPVITKSEKI MTAPVVT GGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKP+TMQFVLPSKYKKAEEAPKP DE VVI+EEGERKLAVVRFSGIATEGVVA+KVE LKKSLEKDGHK+IGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LFF2 Uncharacterized protein (e value: 2.98e-136; % identity: 100)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

A0A1S4DUQ0 heme-binding-like protein At3g10130, chloroplastic (e value: 1.66e-133; % identity: 97.98)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLILGKISVETPKYEL+QSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGG E
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVE+LKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

A0A5A7TLQ2 Heme-binding-like protein (e value: 1.66e-133; % identity: 97.98)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLILGKISVETPKYEL+QSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGG E
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVE+LKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

A0A6J1F979 heme-binding-like protein At3g10130, chloroplastic (e value: 2.43e-126; % identity: 92.93)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLI GKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPAD  V I+EEGERK+AVVRFSGIATEGVVA+KVEKLKKSLEKDG KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

A0A6J1J5Z2 heme-binding-like protein At3g10130, chloroplastic (e value: 9.47e-124; % identity: 91.41)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE
        MGLI GKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVI+KSEKISMTAPVVT G GGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPAD  V I+EEGERK+AVVRFSGIATEGVVA+KVE LKKSLEKDG KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

SwissProt top hits (e value, % identity, alignment)
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic (e value: 1.7e-20; % identity: 35.68)
Query:  VETPKYELVQSTSDYEIRKYEPSVVAE-VAYDPTQFRG-NKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVTEGGGGEGK
        +ET  + ++  T  YEIR+ EP  VAE +    T F        F VLA+Y+      +N   EK+ MT PV+T+      EK+ MT PV+T     + +
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAE-VAYDPTQFRG-NKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVTEGGGGEGK

Query:  PVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGD---YVLARYNPPWTLPSLRTNEVMIPVE
           M FV+PSKY      P P D  V I++   + +AVV FSG  T+  +  +  +L+++L+ D    + D   + +A+YNPP+TLP +R NEV + VE
Subjt:  PVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGD---YVLARYNPPWTLPSLRTNEVMIPVE

Arabidopsis top hits (e value, % identity, alignment)
AT1G17100.1 SOUL heme-binding family protein (e value: 4.0e-09; % identity: 30.2)
Query:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGEGKPVTMQFV
        +E P YELV S + YEIR+Y  +V   V+ +P       D   T   +    I + +N   +K+ MTAPVI++         V    G       T+ F 
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGEGKPVTMQFV

Query:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLE-----------KDGHKVIGD--YVLARYNPPWTLPSLRTNEVMIP
        +P   KK +  P P+ E + I++   R +AV +FSG  ++  + E+   L  SL+           K+   V  D  Y +A+YN P+   S R NE+ +P
Subjt:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLE-----------KDGHKVIGD--YVLARYNPPWTLPSLRTNEVMIP

Query:  VE
         E
Subjt:  VE

AT2G37970.1 SOUL heme-binding family protein (e value: 1.6e-74; % identity: 65.58)
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK---------------SEK
        MG++ GKI+VETPKY + +S   YEIR+Y P+V AEV YD ++F+G+KDGGF +LAKYIG  G+P+N K EK+AMTAPVITK               SEK
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK---------------SEK

Query:  ISMTAPVVTEGGGGEG--KPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPW
        I MT+PVVT+ GGGEG  K VTMQF+LPS YKKAEEAP+P DERVVIKEEG RK  V++FSGIA+E VV+EKV+KL   LEKDG K+ GD+VLARYNPPW
Subjt:  ISMTAPVVTEGGGGEG--KPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPW

Query:  TLPSLRTNEVMIPVE
        TLP  RTNEVMIPVE
Subjt:  TLPSLRTNEVMIPVE

AT3G10130.1 SOUL heme-binding family protein (e value: 1.2e-21; % identity: 35.68)
Query:  VETPKYELVQSTSDYEIRKYEPSVVAE-VAYDPTQFRG-NKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVTEGGGGEGK
        +ET  + ++  T  YEIR+ EP  VAE +    T F        F VLA+Y+      +N   EK+ MT PV+T+      EK+ MT PV+T     + +
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAE-VAYDPTQFRG-NKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVTEGGGGEGK

Query:  PVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGD---YVLARYNPPWTLPSLRTNEVMIPVE
           M FV+PSKY      P P D  V I++   + +AVV FSG  T+  +  +  +L+++L+ D    + D   + +A+YNPP+TLP +R NEV + VE
Subjt:  PVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGD---YVLARYNPPWTLPSLRTNEVMIPVE

AT5G20140.1 SOUL heme-binding family protein (e value: 3.3e-19; % identity: 35.26)
Query:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGEGKPVTMQFV
        +ETPKY++++ T++YE+R YEP +V E   D    + +   GF  +A YI      +N   EK+ MT PV T++    +++             V++Q V
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGEGKPVTMQFV

Query:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPW-TLPSLRTNEVMIPVE
        +PS  K     P P +E+V +K+      A V+FSG  TE VV  K  +L+ SL KDG +     +LARYN P  T   +  NEV+I +E
Subjt:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPW-TLPSLRTNEVMIPVE

AT5G20140.2 SOUL heme-binding family protein (e value: 6.2e-18; % identity: 35.26)
Query:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGEGKPVTMQFV
        +ETPKY++++ T++YE+R YEP +V E   D    + +   GF  +A YI      +N   EK+ MT PV T++    +++             V++Q V
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGEGKPVTMQFV

Query:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPP
        +PS  K     P P +E+V +K+      A V+FSG  TE VV  K  +L+ SL KDG +     +LARYN P
Subjt:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPP
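
In the alignments above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how a % identity value relates to one aligned block, the identical positions can be counted from that middle row. The function name `percent_identity` is hypothetical, and the sketch assumes the query and middle rows have equal length (trailing spaces on the middle row may have been lost when the page was copied).

```python
def percent_identity(query, midline):
    """Percent identity of one aligned block, from its query and middle rows."""
    if len(query) != len(midline):
        raise ValueError("query and midline rows must have the same length")
    # Letters on the midline mark identical residues; '+' and spaces do not count.
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / len(query), 2)

# Last 12 columns of the first block of the Cucumis melo GenBank hit.
print(percent_identity("TAPVVTEGGGGE", "TAPVVT GGG E"))  # 83.33
```

For a whole hit, the identical positions and aligned columns would be summed over all blocks before dividing.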


Sequences
CDS sequence
ATGGGTTTGATTCTCGGAAAGATCAGTGTTGAAACACCCAAATACGAGCTCGTTCAATCCACTTCCGACTACGAAATCCGCAAATACGAGCCATCGGTGGTCGCCGAAGT
CGCCTACGATCCGACCCAGTTCAGAGGCAACAAAGACGGTGGCTTCACCGTATTAGCAAAATACATTGGAGCCATTGGTGAGCCACAGAACATCAAGTCCGAGAAAGTGG
CCATGACGGCGCCGGTGATCACCAAATCGGAGAAGATTTCGATGACAGCTCCGGTGGTGACCGAAGGTGGCGGCGGCGAAGGGAAGCCGGTGACGATGCAGTTTGTGCTA
CCTAGCAAGTACAAGAAGGCAGAGGAGGCTCCTAAACCGGCGGATGAAAGGGTTGTGATAAAGGAAGAAGGGGAGAGAAAACTAGCCGTCGTGAGATTTAGCGGAATTGC
GACGGAGGGAGTGGTGGCGGAGAAGGTGGAGAAGCTAAAGAAAAGCTTGGAGAAAGATGGACACAAGGTGATTGGGGATTATGTATTGGCAAGATATAACCCACCTTGGA
CATTGCCTTCTTTGAGAACCAATGAAGTTATGATACCAGTAGAGTAA
mRNA sequence
CGTCTTTCTTCTCAATGTATAAAATCAGCTACCCCTAGACTATCTATATCATAAGTTTCCAAACATGGGTTTGATTCTCGGAAAGATCAGTGTTGAAACACCCAAATACG
AGCTCGTTCAATCCACTTCCGACTACGAAATCCGCAAATACGAGCCATCGGTGGTCGCCGAAGTCGCCTACGATCCGACCCAGTTCAGAGGCAACAAAGACGGTGGCTTC
ACCGTATTAGCAAAATACATTGGAGCCATTGGTGAGCCACAGAACATCAAGTCCGAGAAAGTGGCCATGACGGCGCCGGTGATCACCAAATCGGAGAAGATTTCGATGAC
AGCTCCGGTGGTGACCGAAGGTGGCGGCGGCGAAGGGAAGCCGGTGACGATGCAGTTTGTGCTACCTAGCAAGTACAAGAAGGCAGAGGAGGCTCCTAAACCGGCGGATG
AAAGGGTTGTGATAAAGGAAGAAGGGGAGAGAAAACTAGCCGTCGTGAGATTTAGCGGAATTGCGACGGAGGGAGTGGTGGCGGAGAAGGTGGAGAAGCTAAAGAAAAGC
TTGGAGAAAGATGGACACAAGGTGATTGGGGATTATGTATTGGCAAGATATAACCCACCTTGGACATTGCCTTCTTTGAGAACCAATGAAGTTATGATACCAGTAGAGTA
AAGTGAATTGATTTGATTTCTAGTGTTTTTAGACATTCTGTGTTTGACATTTTGATGTACTACTGTCATTTATAGGCTCT
Protein sequence
MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTEGGGGEGKPVTMQFVL
PSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
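
The CDS and protein sequences above are related by translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 1; the function name `translate` is hypothetical. It is shown on the first 30 bases of the CDS only, and is meant as an illustration rather than the pipeline used to produce this page.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS sequence listed above.
print(translate("ATGGGTTTGATTCTCGGAAAGATCAGTGTT"))  # MGLILGKISV
```

Passing the full CDS (with its line breaks) should work the same way, since whitespace is removed before translation.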