CuGenDBv2

Csor.00g013380 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene ID: Csor.00g013380
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: ethylene-responsive transcription factor-like protein isoform X1
Genome location: Csor_Chr02:9812263..9813876
RNA-Seq Expression: Csor.00g013380
Synteny: Csor.00g013380
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6606307.1 Ethylene-responsive transcription factor-like protein, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.37e-160 | identity: 100%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF
        IVENDDLNKRQDEFSDLLAPEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF

KAG7036247.1 Ethylene-responsive transcription factor-like protein [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 1.95e-159 | identity: 99.57%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF
        IVENDDLNKRQDEFSDL APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF

XP_022930940.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita moschata] | e-value: 1.54e-156 | identity: 98.26%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF
        IV NDDLNKRQDEF DL APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF

XP_022930943.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucurbita moschata] | e-value: 5.79e-154 | identity: 97.83%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIE NPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF
        IV NDDLNKRQDEF DL APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF

XP_023534153.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita pepo subsp. pepo] | e-value: 2.34e-146 | identity: 93.91%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGS DAPVSK SENSTAEDLEH +  ISVHPICSNEFNEIE NPVANLETESSRVSVLDTSKEKSDEPFAEPPVK RKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF
        ++ NDDLNKRQDEFSDL APEDIE LA+KF
Subjt:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LCE7 AP2/ERF domain-containing protein | e-value: 1.85e-112 | identity: 75.32%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEK----SDEPFAEPPVKRRKRHR
        MVSLRRRKLLGL +GK SF APV K SEN TAED  HCT+F+ V+PICS++ N+IEENP AN+E ESS VSVLDTSKE+    +DEP A+PPVKRRKRHR
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEK----SDEPFAEPPVKRRKRHR

Query:  RKQFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKN
        RK FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPE EK+ELRK NWDEFLAMTR  I N+KQKR+SPESK 
Subjt:  RKQFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKN

Query:  SKLPIVENDDLNKRQDEFSDLLAPEDIEPLA
        S+L    NDD NKR D+F D    ED+EP+A
Subjt:  SKLPIVENDDLNKRQDEFSDLLAPEDIEPLA

A0A6J1DZ33 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 | e-value: 1.51e-108 | identity: 72.81%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESS-RVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQ
        MVSLRRRKLLG C+GKGSF APV K SEN T E+  HCTNF+SVHPICS++ N+I+ENP+AN E ESS RV+VLDTSKEK++E  A+PPV+ RKRH RK+
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESS-RVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQ

Query:  FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKL
        FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGR+PNFELPE EK+ELRK+NWD+FLA+TR  I N+KQKR+SPES  SKL
Subjt:  FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKL

Query:  PIVENDDLNKRQDEFSDLLAPEDIEPLA
        P   N D +KR  +FS+L   ED++P A
Subjt:  PIVENDDLNKRQDEFSDLLAPEDIEPLA

A0A6J1ESB0 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 | e-value: 7.44e-157 | identity: 98.26%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF
        IV NDDLNKRQDEF DL APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF

A0A6J1EWY2 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 | e-value: 2.80e-154 | identity: 97.83%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIE NPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF
        IV NDDLNKRQDEF DL APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF

A0A6J1K7F1 ethylene-responsive transcription factor-like protein At4g13040 | e-value: 2.75e-140 | identity: 90.43%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE+ P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF
        I+ +DDLNKRQDEFSDL APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLLAPEDIEPLALKF

SwissProt top hits (e-value, % identity, alignment)
Q56XP9 Ethylene-responsive transcription factor-like protein At4g13040 | e-value: 2.2e-35 | identity: 47.93%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP

Arabidopsis top hits (e-value, % identity, alignment)
AT4G13040.1 Integrase-type DNA-binding superfamily protein | e-value: 1.5e-36 | identity: 47.93%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP

AT4G13040.2 Integrase-type DNA-binding superfamily protein | e-value: 1.6e-33 | identity: 63.33%
Query:  AEPPVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIA
        ++ P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I 
Subjt:  AEPPVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIA

Query:  NKKQK-RVSPESK--NSKLP
        NKK K R+  E    N+ +P
Subjt:  NKKQK-RVSPESK--NSKLP

AT4G13040.3 Integrase-type DNA-binding superfamily protein | e-value: 1.5e-36 | identity: 47.93%
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP


Sequences
CDS sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTCGGACTTTGCACTGGCAAAGGCTCATTTGATGCTCCAGTTTCAAAGTCTTCTGAAAATTCGACTGCTGAAGATCTC
GAGCACTGTACGAACTTCATTAGTGTTCATCCCATCTGTTCGAACGAATTCAACGAGATAGAGGAGAATCCCGTTGCAAATTTAGAGACCGAATCGTCGAGGGTA
TCTGTTTTGGACACATCAAAGGAGAAAAGTGATGAGCCATTTGCAGAACCGCCCGTAAAACGTAGAAAACGACACCGGAGAAAGCAGTTTCCAGAAGAATGTTTC
TTAATGAGAGGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCTATTAAGGTTGACAAGAAACAAATACACTTGGGAACTGTTGAATCACAAGAAGAAGCTGCT
CATTTGTATGACAGAGCTGCTTTCATGTGTGGAAGGGAACCCAACTTCGAGCTCCCAGAGGCGGAGAAGAAAGAACTGAGAAAGGTTAATTGGGACGAGTTTTTA
GCAATGACTCGGCTCGCAATCGCTAATAAAAAACAAAAGAGGGTCAGCCCAGAATCAAAGAACTCTAAACTTCCTATTGTAGAGAACGACGACTTGAACAAGAGA
CAGGATGAGTTCAGTGACCTCTTAGCTCCAGAAGATATTGAGCCACTTGCCTTGAAATTTTGA
mRNA sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTCGGACTTTGCACTGGCAAAGGCTCATTTGATGCTCCAGTTTCAAAGTCTTCTGAAAATTCGACTGCTGAAGATCTC
GAGCACTGTACGAACTTCATTAGTGTTCATCCCATCTGTTCGAACGAATTCAACGAGATAGAGGAGAATCCCGTTGCAAATTTAGAGACCGAATCGTCGAGGGTA
TCTGTTTTGGACACATCAAAGGAGAAAAGTGATGAGCCATTTGCAGAACCGCCCGTAAAACGTAGAAAACGACACCGGAGAAAGCAGTTTCCAGAAGAATGTTTC
TTAATGAGAGGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCTATTAAGGTTGACAAGAAACAAATACACTTGGGAACTGTTGAATCACAAGAAGAAGCTGCT
CATTTGTATGACAGAGCTGCTTTCATGTGTGGAAGGGAACCCAACTTCGAGCTCCCAGAGGCGGAGAAGAAAGAACTGAGAAAGGTTAATTGGGACGAGTTTTTA
GCAATGACTCGGCTCGCAATCGCTAATAAAAAACAAAAGAGGGTCAGCCCAGAATCAAAGAACTCTAAACTTCCTATTGTAGAGAACGACGACTTGAACAAGAGA
CAGGATGAGTTCAGTGACCTCTTAGCTCCAGAAGATATTGAGCCACTTGCCTTGAAATTTTGA
Protein sequence
MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQFPEECF
LMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLPIVENDDLNKR
QDEFSDLLAPEDIEPLALKF