; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G31690 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G31690
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptionheme-binding-like protein At3g10130, chloroplastic
Genome locationChr3:28583978..28584574
RNA-Seq ExpressionCSPI03G31690
SyntenyCSPI03G31690
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017250.1 Heme-binding-like protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]4.1e-9692.39Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLI GKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPV
        GKPVTMQFVLPSKYKKAEEAPKPAD  V I+EEGERK+AVVRFSGIATEGVVA+KVE LKKSLEKDG KVIGD+VLARYNPPWTLPSLRTNEVMIP+
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPV

XP_016899693.1 PREDICTED: heme-binding-like protein At3g10130, chloroplastic [Cucumis melo]7.7e-10398.48Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLILGKISVETPKYEL+QSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGG E
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVE+LKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

XP_022934955.1 heme-binding-like protein At3g10130, chloroplastic [Cucurbita moschata]2.2e-9793.43Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLI GKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPAD  V I+EEGERK+AVVRFSGIATEGVVA+KVEKLKKSLEKDG KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

XP_031737956.1 heme-binding-like protein At3g10130, chloroplastic [Cucumis sativus]3.5e-10399.49Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

XP_038906231.1 heme-binding-like protein At3g10130, chloroplastic [Benincasa hispida]6.7e-9993.43Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLILGKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIG+PQNIKSEKVAMTAPVITKSEKI MTAPVVTGGGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKP+TMQFVLPSKYKKAEEAPKP DE VVI+EEGERKLAVVRFSGIATEGVVA+KVE LKKSLEKDGHK+IGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

TrEMBL top hitse value%identityAlignment
A0A0A0LFF2 Uncharacterized protein1.7e-10399.49Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVT GGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

A0A1S4DUQ0 heme-binding-like protein At3g10130, chloroplastic3.7e-10398.48Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLILGKISVETPKYEL+QSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGG E
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVE+LKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

A0A5A7TLQ2 Heme-binding-like protein3.7e-10398.48Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLILGKISVETPKYEL+QSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGG E
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVE+LKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

A0A6J1F979 heme-binding-like protein At3g10130, chloroplastic1.0e-9793.43Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLI GKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPAD  V I+EEGERK+AVVRFSGIATEGVVA+KVEKLKKSLEKDG KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

A0A6J1J5Z2 heme-binding-like protein At3g10130, chloroplastic9.8e-9691.92Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE
        MGLI GKISVETPKYEL+QST+DYEIRKYEPSVVAEV YDPTQFRGN+DGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVI+KSEKISMTAPVVTGG GGE
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGE

Query:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE
        GKPVTMQFVLPSKYKKAEEAPKPAD  V I+EEGERK+AVVRFSGIATEGVVA+KVE LKKSLEKDG KVIGD+VLARYNPPWTLPSLRTNEVMIPVE
Subjt:  GKPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic1.7e-2035.68Show/hide
Query:  VETPKYELVQSTSDYEIRKYEPSVVAE-VAYDPTQFRG-NKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVTGGGGGEGK
        +ET  + ++  T  YEIR+ EP  VAE +    T F        F VLA+Y+      +N   EK+ MT PV+T+      EK+ MT PV+T     + +
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAE-VAYDPTQFRG-NKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVTGGGGGEGK

Query:  PVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGD---YVLARYNPPWTLPSLRTNEVMIPVE
           M FV+PSKY      P P D  V I++   + +AVV FSG  T+  +  +  +L+++L+ D    + D   + +A+YNPP+TLP +R NEV + VE
Subjt:  PVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGD---YVLARYNPPWTLPSLRTNEVMIPVE

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein4.0e-0930.2Show/hide
Query:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGEGKPVTMQFV
        +E P YELV S + YEIR+Y  +V   V+ +P       D   T   +    I + +N   +K+ MTAPVI++         V    G       T+ F 
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGEGKPVTMQFV

Query:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLE-----------KDGHKVIGD--YVLARYNPPWTLPSLRTNEVMIP
        +P   KK +  P P+ E + I++   R +AV +FSG  ++  + E+   L  SL+           K+   V  D  Y +A+YN P+   S R NE+ +P
Subjt:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLE-----------KDGHKVIGD--YVLARYNPPWTLPSLRTNEVMIP

Query:  VE
         E
Subjt:  VE

AT2G37970.1 SOUL heme-binding family protein4.7e-7465.58Show/hide
Query:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK---------------SEK
        MG++ GKI+VETPKY + +S   YEIR+Y P+V AEV YD ++F+G+KDGGF +LAKYIG  G+P+N K EK+AMTAPVITK               SEK
Subjt:  MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK---------------SEK

Query:  ISMTAPVVTGGGGGEG--KPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPW
        I MT+PVVT  GGGEG  K VTMQF+LPS YKKAEEAP+P DERVVIKEEG RK  V++FSGIA+E VV+EKV+KL   LEKDG K+ GD+VLARYNPPW
Subjt:  ISMTAPVVTGGGGGEG--KPVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPW

Query:  TLPSLRTNEVMIPVE
        TLP  RTNEVMIPVE
Subjt:  TLPSLRTNEVMIPVE

AT3G10130.1 SOUL heme-binding family protein1.2e-2135.68Show/hide
Query:  VETPKYELVQSTSDYEIRKYEPSVVAE-VAYDPTQFRG-NKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVTGGGGGEGK
        +ET  + ++  T  YEIR+ EP  VAE +    T F        F VLA+Y+      +N   EK+ MT PV+T+      EK+ MT PV+T     + +
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAE-VAYDPTQFRG-NKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITK-----SEKISMTAPVVTGGGGGEGK

Query:  PVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGD---YVLARYNPPWTLPSLRTNEVMIPVE
           M FV+PSKY      P P D  V I++   + +AVV FSG  T+  +  +  +L+++L+ D    + D   + +A+YNPP+TLP +R NEV + VE
Subjt:  PVTMQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGD---YVLARYNPPWTLPSLRTNEVMIPVE

AT5G20140.1 SOUL heme-binding family protein2.5e-1935.26Show/hide
Query:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGEGKPVTMQFV
        +ETPKY++++ T++YE+R YEP +V E   D    + +   GF  +A YI      +N   EK+ MT PV T++    +++             V++Q V
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGEGKPVTMQFV

Query:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPW-TLPSLRTNEVMIPVE
        +PS  K     P P +E+V +K+      A V+FSG  TE VV  K  +L+ SL KDG +     +LARYN P  T   +  NEV+I +E
Subjt:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPW-TLPSLRTNEVMIPVE

AT5G20140.2 SOUL heme-binding family protein4.8e-1835.26Show/hide
Query:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGEGKPVTMQFV
        +ETPKY++++ T++YE+R YEP +V E   D    + +   GF  +A YI      +N   EK+ MT PV T++    +++             V++Q V
Subjt:  VETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGEGKPVTMQFV

Query:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPP
        +PS  K     P P +E+V +K+      A V+FSG  TE VV  K  +L+ SL KDG +     +LARYN P
Subjt:  LPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTGATTCTCGGAAAGATCAGTGTTGAAACCCCCAAATACGAGCTCGTTCAATCCACTTCCGACTACGAAATCCGCAAATACGAGCCATCGGTGGTCGCC
GAAGTCGCCTACGATCCGACCCAGTTCAGAGGCAACAAAGACGGTGGCTTCACCGTATTAGCAAAATACATTGGAGCCATTGGTGAGCCACAGAACATCAAGTCC
GAGAAAGTGGCCATGACGGCGCCGGTGATCACCAAATCGGAGAAGATTTCGATGACAGCTCCGGTGGTGACCGGAGGTGGCGGCGGCGAAGGGAAGCCGGTGACG
ATGCAGTTTGTGCTACCTAGCAAGTACAAGAAGGCAGAGGAGGCTCCTAAACCGGCGGATGAAAGGGTTGTGATAAAGGAAGAAGGGGAGAGAAAACTAGCCGTC
GTGAGATTTAGCGGAATTGCGACGGAGGGAGTGGTGGCGGAGAAGGTGGAGAAGCTAAAGAAAAGCTTGGAGAAAGATGGACACAAGGTGATTGGGGATTATGTA
TTGGCAAGATATAACCCACCTTGGACATTGCCTTCTTTGAGAACCAATGAAGTTATGATACCAGTAGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTGATTCTCGGAAAGATCAGTGTTGAAACCCCCAAATACGAGCTCGTTCAATCCACTTCCGACTACGAAATCCGCAAATACGAGCCATCGGTGGTCGCC
GAAGTCGCCTACGATCCGACCCAGTTCAGAGGCAACAAAGACGGTGGCTTCACCGTATTAGCAAAATACATTGGAGCCATTGGTGAGCCACAGAACATCAAGTCC
GAGAAAGTGGCCATGACGGCGCCGGTGATCACCAAATCGGAGAAGATTTCGATGACAGCTCCGGTGGTGACCGGAGGTGGCGGCGGCGAAGGGAAGCCGGTGACG
ATGCAGTTTGTGCTACCTAGCAAGTACAAGAAGGCAGAGGAGGCTCCTAAACCGGCGGATGAAAGGGTTGTGATAAAGGAAGAAGGGGAGAGAAAACTAGCCGTC
GTGAGATTTAGCGGAATTGCGACGGAGGGAGTGGTGGCGGAGAAGGTGGAGAAGCTAAAGAAAAGCTTGGAGAAAGATGGACACAAGGTGATTGGGGATTATGTA
TTGGCAAGATATAACCCACCTTGGACATTGCCTTCTTTGAGAACCAATGAAGTTATGATACCAGTAGAGTAA
Protein sequenceShow/hide protein sequence
MGLILGKISVETPKYELVQSTSDYEIRKYEPSVVAEVAYDPTQFRGNKDGGFTVLAKYIGAIGEPQNIKSEKVAMTAPVITKSEKISMTAPVVTGGGGGEGKPVT
MQFVLPSKYKKAEEAPKPADERVVIKEEGERKLAVVRFSGIATEGVVAEKVEKLKKSLEKDGHKVIGDYVLARYNPPWTLPSLRTNEVMIPVE