; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G1865 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G1865
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionprotein HEAT INTOLERANT 4
Genome locationctg1002:3426513..3430305
RNA-Seq ExpressionCucsat.G1865
SyntenyCucsat.G1865
Gene Ontology termsGO:1900034 - regulation of cellular response to heat (biological process)
InterPro domainsIPR039313 - Protein HEAT INTOLERANT 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647582.1 hypothetical protein Csa_003773 [Cucumis sativus]2.12e-14397.21Show/hide
Query:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
        MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
Subjt:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK

Query:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
        VYLFGSTEPQFVPFRGEYKVIYIPVVVA      VVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
Subjt:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN

Query:  RLKIVRIKKYEYCLP
        RLKIVRIKKYEYCLP
Subjt:  RLKIVRIKKYEYCLP

KAE8649164.1 hypothetical protein Csa_015023 [Cucumis sativus]3.28e-9970.67Show/hide
Query:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
        M KGT   AA  E  +  E +  K      AP+RAKR +  KP S PEYF+DKRN+EDLWK  FPVGTEWD LDSVYQ+NWNFSNLE+AFE GGKLYGEK
Subjt:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK

Query:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
        VYLFG TEPQ VPF+GE KVI IPVVVA      V SPFPPSDKIGINSVQRE EEIIPMK MKMDWVPYIPLE R+ +VDKLK QIFILSCTQRRAAL 
Subjt:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN

Query:  RLKIVRIKKYEYCLPRKLSSIYFYE
         LKI R+KKYEYCLP      YFY+
Subjt:  RLKIVRIKKYEYCLPRKLSSIYFYE

XP_011658000.1 protein HEAT INTOLERANT 4 [Cucumis sativus]1.35e-14397.21Show/hide
Query:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
        MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
Subjt:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK

Query:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
        VYLFGSTEPQFVPFRGEYKVIYIPVVVA      VVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
Subjt:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN

Query:  RLKIVRIKKYEYCLP
        RLKIVRIKKYEYCLP
Subjt:  RLKIVRIKKYEYCLP

XP_022133379.1 uncharacterized protein LOC111005966 [Momordica charantia]8.44e-11274.26Show/hide
Query:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
        M KGT   A++NE+ +F E ENRKE T+ TA TRAKRV+ASKP S PEYFQDKRNLEDLWK  FPVGTEWD LDSVYQYNWNFSNLE+AFE GGKLYGEK
Subjt:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK

Query:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
        VYLFGSTEPQ V F+GE +VI IPVVVA      VVSPFPPSDKIGINSVQRE EEIIPMK MKM WVPYIPLEDR+S+VDKLK QIFILSCTQRRAAL 
Subjt:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN

Query:  RLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLGESAK
         LKI R+KKYEYCLP      YFY+  +E +L +S +
Subjt:  RLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLGESAK

XP_023883507.1 protein HEAT INTOLERANT 4-like [Quercus suber]5.82e-9973.81Show/hide
Query:  SRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEYKVIYIPVVV
        S+ A +RAKRV+ASKP + PEYF+DKRNLEDLWKEVFPVGTEWD LD+VYQ+NWNFSNLE+AFE  GKLYG+KVYLFG TEPQ V F+GE KVI IPVVV
Subjt:  SRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEYKVIYIPVVV

Query:  ALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKLSSIYFYE--
        A      VVSPFPPSDKIGINSVQRE EEIIPMK MKMDWVPYIPLEDR+SQVD+LK QI+IL CTQRRAAL  LKI R+KKYEYCLP      YFY+  
Subjt:  ALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKLSSIYFYE--

Query:  EEAQLGESAK
        +E +L +S +
Subjt:  EEAQLGESAK

TrEMBL top hitse value%identityAlignment
A0A0A0KGX2 Uncharacterized protein1.36e-16097.45Show/hide
Query:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
        MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
Subjt:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK

Query:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
        VYLFGSTEPQFVPFRGEYKVIYIPVVVA      VVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
Subjt:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN

Query:  RLKIVRIKKYEYCLPRKLSSIYFYEEEAQLGESAK
        RLKIVRIKKYEYCLPRKLSSIYFYEEEAQLGESAK
Subjt:  RLKIVRIKKYEYCLPRKLSSIYFYEEEAQLGESAK

A0A6A3AE34 Protein CPR-5-like isoform X19.44e-9966.53Show/hide
Query:  MSKGTMGIAA--ENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYG
        M KG    AA  + E+ Q   ++  +    + AP +AKR + SKP S PEYF+DKRNLEDLWKE FPVGTEWD LD+VYQYNWNFSNLE+AFE GGKLYG
Subjt:  MSKGTMGIAA--ENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYG

Query:  EKVYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAA
        +KVYLFG TEPQ VPF+GE KVI IPVVVA      VVSPFPPSDKIGINSVQRE EEI+PMK MKMDWVPYIPLE+R+SQVD+LK +IFILSCTQRR A
Subjt:  EKVYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAA

Query:  LNRLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLGESAK
        L ++KI R+KKYEYCLP      YFY+  +E +L +S +
Subjt:  LNRLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLGESAK

A0A6J1BVU0 uncharacterized protein LOC1110059664.09e-11274.26Show/hide
Query:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
        M KGT   A++NE+ +F E ENRKE T+ TA TRAKRV+ASKP S PEYFQDKRNLEDLWK  FPVGTEWD LDSVYQYNWNFSNLE+AFE GGKLYGEK
Subjt:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK

Query:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
        VYLFGSTEPQ V F+GE +VI IPVVVA      VVSPFPPSDKIGINSVQRE EEIIPMK MKM WVPYIPLEDR+S+VDKLK QIFILSCTQRRAAL 
Subjt:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN

Query:  RLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLGESAK
         LKI R+KKYEYCLP      YFY+  +E +L +S +
Subjt:  RLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLGESAK

A0A6J1C502 uncharacterized protein LOC1110084157.53e-9971.11Show/hide
Query:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK
        M KGT    A  E  + +E + RKE     AP+RAKR +  KP S PEYF+DKRNLEDLWK  FPVGTEWD LD+VYQ+NWNFSNLE+AFE GGKLYGEK
Subjt:  MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEK

Query:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN
        VYLFG TEPQ VPF+GE KVI IP VVA      VVSPFPPSDKIGINSVQRE EEI+PMK MKMDWVPYIPLE R S+VDKLK QIFILSCTQRRAAL 
Subjt:  VYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALN

Query:  RLKIVRIKKYEYCLPRKLSSIYFYE
         LKI R+KKYEYCLP      YFY+
Subjt:  RLKIVRIKKYEYCLPRKLSSIYFYE

A0A6P4AAF2 protein HEAT INTOLERANT 41.78e-9875.36Show/hide
Query:  ENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEY
        +N N K+  S+ A TRAKRV+ASKP + PEYF+DKRNLEDLWK  FPVGTEWD LD VYQ+NWNFSNLE+AFE GG LYG+KVYLFG TEPQ VP +GE 
Subjt:  ENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEY

Query:  KVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKL
        KVI IPVVVA      VVSPFPPSDKIGINSVQRE EEIIPMK MKMDWVPYIPLEDR+SQVD+LK QIFILSCTQRRAAL  LKI R+KKYEYCLP   
Subjt:  KVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKL

Query:  SSIYFYE
           YFY+
Subjt:  SSIYFYE

SwissProt top hitse value%identityAlignment
A2RVJ8 Protein HEAT INTOLERANT 42.1e-6559.31Show/hide
Query:  RAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAI
        +AK+  A+K    P YF++KR+LEDLWK  FPVGTEWD LD++Y++NW+F NLEEA E GGKLYG+KVY+FG TEPQ VP++G  K++++P VV      
Subjt:  RAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAI

Query:  QVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLG
         + SPFPPSDKIGI SVQREVEEIIPMK MKMDW+PYIP+E R+ QVDK+  QIF L CTQRR+AL  +K  ++KK+EYCLP      YFY+  +E +L 
Subjt:  QVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLG

Query:  ESAK
        +S +
Subjt:  ESAK

Arabidopsis top hitse value%identityAlignment
AT5G10010.1 unknown protein1.5e-6659.31Show/hide
Query:  RAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAI
        +AK+  A+K    P YF++KR+LEDLWK  FPVGTEWD LD++Y++NW+F NLEEA E GGKLYG+KVY+FG TEPQ VP++G  K++++P VV      
Subjt:  RAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEYKVIYIPVVVALIYAI

Query:  QVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLG
         + SPFPPSDKIGI SVQREVEEIIPMK MKMDW+PYIP+E R+ QVDK+  QIF L CTQRR+AL  +K  ++KK+EYCLP      YFY+  +E +L 
Subjt:  QVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKLSSIYFYE--EEAQLG

Query:  ESAK
        +S +
Subjt:  ESAK

AT5G64910.1 unknown protein3.4e-5555.33Show/hide
Query:  ENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEY
        + E  K+ ++  A  RAK  +A    S PEYF++KRNLEDLWK  F VGTEWD  D++ ++NW+F+NLEEA E GG+LYG++VY+FG TE   V ++ E 
Subjt:  ENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEY

Query:  KVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLP
        K + +PVVV       + SP PPSD+IG+ SVQ EV EII MK MKM WVPYIPLE R+ QVD   F IFIL CTQRR+AL  L   R+KK+ YCLP
Subjt:  KVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLP

AT5G64910.2 unknown protein2.4e-5354.82Show/hide
Query:  ENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEY
        + E  K+ ++  A  RAK  +A    S PEYF++KRNLEDLWK  F VGTEWD  D++ ++NW+F+NLEEA E GG+LYG++VY+FG TE     ++ E 
Subjt:  ENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQFVPFRGEY

Query:  KVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLP
        K + +PVVV       + SP PPSD+IG+ SVQ EV EII MK MKM WVPYIPLE R+ QVD   F IFIL CTQRR+AL  L   R+KK+ YCLP
Subjt:  KVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAAAGGAACTATGGGGATAGCTGCTGAGAACGAAAATGTTCAATTTCTGGAGAATGAGAACCGTAAAGAATACACCTCCAGAACTGCACCCACTCGAGCCAAGCG
AGTCGAGGCTTCCAAACCTCTTTCAGTACCGGAATACTTCCAGGATAAGCGCAATTTGGAAGACTTGTGGAAGGAGGTATTCCCCGTGGGAACAGAGTGGGATCTACTAG
ATTCTGTTTATCAGTACAATTGGAATTTCTCAAATCTTGAGGAGGCATTTGAAGCAGGTGGAAAACTGTACGGGGAGAAAGTCTATCTATTTGGGTCTACAGAACCACAA
TTTGTTCCCTTCAGGGGAGAATACAAAGTTATCTACATTCCTGTGGTTGTGGCTTTGATTTATGCCATACAGGTTGTCTCACCTTTCCCACCTTCTGATAAAATTGGGAT
CAACTCTGTTCAACGGGAGGTTGAAGAGATCATACCCATGAAACTGATGAAAATGGATTGGGTGCCCTACATTCCCCTTGAGGACAGGAATAGCCAAGTTGATAAGCTGA
AATTTCAAATATTTATATTAAGTTGCACTCAACGAAGGGCTGCTCTGAACCGTTTGAAGATAGTTCGTATAAAGAAATACGAATATTGTTTGCCTCGAAAACTTAGTTCC
ATTTACTTCTATGAGGAGGAAGCCCAATTAGGAGAATCGGCTAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAAAGGAACTATGGGGATAGCTGCTGAGAACGAAAATGTTCAATTTCTGGAGAATGAGAACCGTAAAGAATACACCTCCAGAACTGCACCCACTCGAGCCAAGCG
AGTCGAGGCTTCCAAACCTCTTTCAGTACCGGAATACTTCCAGGATAAGCGCAATTTGGAAGACTTGTGGAAGGAGGTATTCCCCGTGGGAACAGAGTGGGATCTACTAG
ATTCTGTTTATCAGTACAATTGGAATTTCTCAAATCTTGAGGAGGCATTTGAAGCAGGTGGAAAACTGTACGGGGAGAAAGTCTATCTATTTGGGTCTACAGAACCACAA
TTTGTTCCCTTCAGGGGAGAATACAAAGTTATCTACATTCCTGTGGTTGTGGCTTTGATTTATGCCATACAGGTTGTCTCACCTTTCCCACCTTCTGATAAAATTGGGAT
CAACTCTGTTCAACGGGAGGTTGAAGAGATCATACCCATGAAACTGATGAAAATGGATTGGGTGCCCTACATTCCCCTTGAGGACAGGAATAGCCAAGTTGATAAGCTGA
AATTTCAAATATTTATATTAAGTTGCACTCAACGAAGGGCTGCTCTGAACCGTTTGAAGATAGTTCGTATAAAGAAATACGAATATTGTTTGCCTCGAAAACTTAGTTCC
ATTTACTTCTATGAGGAGGAAGCCCAATTAGGAGAATCGGCTAAATAG
Protein sequenceShow/hide protein sequence
MSKGTMGIAAENENVQFLENENRKEYTSRTAPTRAKRVEASKPLSVPEYFQDKRNLEDLWKEVFPVGTEWDLLDSVYQYNWNFSNLEEAFEAGGKLYGEKVYLFGSTEPQ
FVPFRGEYKVIYIPVVVALIYAIQVVSPFPPSDKIGINSVQREVEEIIPMKLMKMDWVPYIPLEDRNSQVDKLKFQIFILSCTQRRAALNRLKIVRIKKYEYCLPRKLSS
IYFYEEEAQLGESAK