CuGenDBv2

CmoCh02G017160 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh02G017160
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: ethylene-responsive transcription factor-like protein isoform X1
Genome location: Cmo_Chr02:9823517..9827546
RNA-Seq Expression: CmoCh02G017160
Synteny: CmoCh02G017160
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily
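
The genome location in this record is written as sequence:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page itself does not say so), the string can be parsed and the span length computed as below. The function names parse_location and span_length are illustrative and not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end = parse_location("Cmo_Chr02:9823517..9827546")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4030 bp on Cmo_Chr02.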


Homology
GenBank top hits (e value, % identity, alignment)
KAG6606307.1 Ethylene-responsive transcription factor-like protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.8e-122, % identity: 98.26)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
        IV NDDLNKRQDEF DL APEDIEPLALKF
Subjt:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF

KAG7036247.1 Ethylene-responsive transcription factor-like protein [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.7e-122, % identity: 98.26)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
        IV NDDLNKRQDEF DL+APEDIEPLALKF
Subjt:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF

XP_022930940.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita moschata] (e value: 2.8e-125, % identity: 100)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
        IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
Subjt:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF

XP_022930943.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucurbita moschata] (e value: 2.7e-123, % identity: 99.57)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEI ENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
        IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
Subjt:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF

XP_023534153.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.3e-114, % identity: 93.91)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGS DAPVSK SENSTAEDLEH +  ISVHPICSNEFNEIE NPVANLETESSRVSVLDTSKEKSDEPFAEPPVK RKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
        ++GNDDLNKRQDEF DLSAPEDIE LA+KF
Subjt:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCE7 AP2/ERF domain-containing protein (e value: 9.9e-92, % identity: 77.06)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEK----SDEPFAEPPVKRRKRHR
        MVSLRRRKLLGL +GK SF APV K SEN TAED  HCT+F+ V+PICS++ N+IEENP AN+E ESS VSVLDTSKE+    +DEP A+PPVKRRKRHR
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEK----SDEPFAEPPVKRRKRHR

Query:  RKQFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKN
        RK FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPE EK+ELRKFNWDEFLAMTR  I N+KQKR+SPESK 
Subjt:  RKQFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKN

Query:  SKLPIVGNDDLNKRQDEFIDLSAPEDIEPLA
        S+L   GNDD NKR D+FID S  ED+EP+A
Subjt:  SKLPIVGNDDLNKRQDEFIDLSAPEDIEPLA

A0A6J1DZ33 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 (e value: 2.1e-86, % identity: 73.25)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETE-SSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQ
        MVSLRRRKLLG C+GKGSF APV K SEN T E+  HCTNF+SVHPICS++ N+I+ENP+AN E E SSRV+VLDTSKEK++E  A+PPV+ RKRH RK+
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETE-SSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQ

Query:  FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKL
        FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGR+PNFELPE EK+ELRK NWD+FLA+TR  I N+KQKR+SPES  SKL
Subjt:  FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKL

Query:  PIVGNDDLNKRQDEFIDLSAPEDIEPLA
        P  GN D +KR  +F +LS  ED++P A
Subjt:  PIVGNDDLNKRQDEFIDLSAPEDIEPLA

A0A6J1ESB0 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 (e value: 1.4e-125, % identity: 100)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
        IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
Subjt:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF

A0A6J1EWY2 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 (e value: 1.3e-123, % identity: 99.57)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEI ENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
        IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
Subjt:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF

A0A6J1K7F1 ethylene-responsive transcription factor-like protein At4g13040 (e value: 1.5e-111, % identity: 91.3)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE+ P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF
        I+G+DDLNKRQDEF DLSAPEDIEPLALKF
Subjt:  IVGNDDLNKRQDEFIDLSAPEDIEPLALKF

SwissProt top hits (e value, % identity, alignment)
Q56XP9 Ethylene-responsive transcription factor-like protein At4g13040 (e value: 2.2e-35, % identity: 47.93)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP

Arabidopsis top hits (e value, % identity, alignment)
AT4G13040.1 Integrase-type DNA-binding superfamily protein (e value: 1.5e-36, % identity: 47.93)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP

AT4G13040.2 Integrase-type DNA-binding superfamily protein (e value: 1.6e-33, % identity: 63.33)
Query:  AEPPVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIA
        ++ P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I 
Subjt:  AEPPVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIA

Query:  NKKQK-RVSPESK--NSKLP
        NKK K R+  E    N+ +P
Subjt:  NKKQK-RVSPESK--NSKLP

AT4G13040.3 Integrase-type DNA-binding superfamily protein (e value: 1.5e-36, % identity: 47.93)
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP
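
The % identity values listed for the hits above can be checked against the unlabeled middle line of each alignment block. The sketch below assumes that line follows the usual BLAST pairwise convention: a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The function name percent_identity is illustrative. The aligned length is taken from the Query line because trailing spaces in the middle line may have been lost.

```python
def percent_identity(midline, aligned_length=None):
    """Percent identity from a BLAST-style midline.

    Letters count as identical positions; '+' and spaces do not.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length

# Final block of the first GenBank hit (KAG6606307.1)
query = "IVGNDDLNKRQDEFIDLSAPEDIEPLALKF"
midline = "IV NDDLNKRQDEF DL APEDIEPLALKF"
print(round(percent_identity(midline, len(query)), 2))  # 90.0
```

Counting the same way across all three blocks of that hit gives 226 identical positions out of 230, which agrees with the listed 98.26.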


Sequences
CDS sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTCGGACTTTGCACTGGCAAAGGCTCATTTGATGCTCCAGTTTCAAAGTCTTCTGAAAATTCGACTGCTGAAGATCTC
GAGCACTGTACGAACTTCATTAGTGTTCATCCCATCTGTTCGAACGAATTCAACGAGATAGAGGAGAATCCCGTTGCAAATTTAGAGACCGAATCGTCGAGGGTA
TCTGTTTTGGACACATCAAAGGAGAAAAGTGATGAGCCATTTGCAGAACCGCCCGTAAAACGTAGAAAACGACACCGGAGAAAGCAGTTTCCAGAAGAATGTTTC
TTAATGAGAGGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCTATTAAGGTTGACAAGAAACAAATACACTTGGGAACTGTTGAATCACAAGAAGAAGCTGCT
CATTTGTATGACAGAGCTGCTTTCATGTGCGGAAGGGAACCCAACTTCGAGCTCCCAGAGGCGGAGAAGAAAGAACTGAGAAAGTTTAATTGGGACGAGTTCTTA
GCAATGACTCGGCTCGCAATCGCTAATAAAAAACAAAAGAGGGTCAGCCCAGAATCAAAGAACTCGAAACTTCCTATTGTAGGGAACGACGACTTGAACAAGAGA
CAGGATGAGTTCATTGACCTCTCAGCTCCAGAAGATATTGAGCCACTTGCCTTGAAATTTTGA
mRNA sequence
CATAGAAGTCACGTTCTCTCTCTCATTAGTATTTCTCTCTCTAAAACCAATTTGATTCTCTCTGCCTCCATTGCAGACCACCACCTGGTTCTCGTCATCAGTCCA
AACAGAACAAATTTACAAAACGAGGGATTTCAAAAAGGTGGTAGAGAAACAGATGATGCATGAGTTCAGAATTTGCGTATAATCCTTTCTCCTCGTAATCTCAAC
CCGCTTTTGTTTTTCGATCGTTTTTCTACGCCTCCTCCGCCGCCGCGACCTCAGATGTGAGGTGATCCAGGAAAAACAACCAAATAGTACCAGCGATAACGCGAT
CGAAGCTAATTATGGTGAGCTTAAGAAGGCGTAAACTCCTCGGACTTTGCACTGGCAAAGGCTCATTTGATGCTCCAGTTTCAAAGTCTTCTGAAAATTCGACTG
CTGAAGATCTCGAGCACTGTACGAACTTCATTAGTGTTCATCCCATCTGTTCGAACGAATTCAACGAGATAGAGGAGAATCCCGTTGCAAATTTAGAGACCGAAT
CGTCGAGGGTATCTGTTTTGGACACATCAAAGGAGAAAAGTGATGAGCCATTTGCAGAACCGCCCGTAAAACGTAGAAAACGACACCGGAGAAAGCAGTTTCCAG
AAGAATGTTTCTTAATGAGAGGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCTATTAAGGTTGACAAGAAACAAATACACTTGGGAACTGTTGAATCACAAG
AAGAAGCTGCTCATTTGTATGACAGAGCTGCTTTCATGTGCGGAAGGGAACCCAACTTCGAGCTCCCAGAGGCGGAGAAGAAAGAACTGAGAAAGTTTAATTGGG
ACGAGTTCTTAGCAATGACTCGGCTCGCAATCGCTAATAAAAAACAAAAGAGGGTCAGCCCAGAATCAAAGAACTCGAAACTTCCTATTGTAGGGAACGACGACT
TGAACAAGAGACAGGATGAGTTCATTGACCTCTCAGCTCCAGAAGATATTGAGCCACTTGCCTTGAAATTTTGATGGAGATATGCAGTTTTGATTTCTTTTATTA
AAAGGCCATTGAAATTTTGGAGCATTGGATGTACATCAGTTTAGTGATTTAGGCAAATCCTTCGGGCATTTCATGAACAATGATCTTGCAACAATTTAGACATGA
ACAAGTCAGGAATTTTAGTACATTCATAGTAAGAATGATAATTTACATGTATTGAGTTCAGCAAATCGCTACTAGTTCGAACACAAATCAG
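
Because the mRNA sequence contains the CDS plus flanking untranslated regions, the UTR lengths can be estimated by locating the CDS inside the mRNA. This is a minimal sketch, assuming both sequences have had their line breaks removed and the CDS occurs as one contiguous substring. The function name utr_lengths and the toy sequences are hypothetical and are not taken from this record.

```python
def utr_lengths(mrna, cds):
    """Return (5' UTR length, 3' UTR length), or None if cds is not in mrna."""
    # Remove whitespace and line breaks, normalize case.
    mrna = "".join(mrna.split()).upper()
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)
    if start == -1:
        return None
    return start, len(mrna) - start - len(cds)

# Toy sequences for illustration only
print(utr_lengths("CCATGAAATGAGG", "ATGAAATGA"))  # (2, 2)
```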
Protein sequence
MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQFPEECF
LMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLPIVGNDDLNKR
QDEFIDLSAPEDIEPLALKF
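
The protein sequence should correspond to a translation of the CDS sequence. The sketch below assumes the standard genetic code (NCBI translation table 1) and builds the codon table from the conventional TCAG ordering. The function name translate is illustrative, and only the first 30 bases of the CDS are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order (standard code)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

# First 30 bases of the CDS sequence in this record
print(translate("ATGGTGAGCTTAAGAAGGCGTAAACTCCTC"))  # MVSLRRRKLL
```

The output matches the first ten residues of the protein sequence. Running the full CDS through the same function and stripping the trailing "*" gives a string that can be compared with the listed protein.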