; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg08894 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg08894
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Descriptionethylene-responsive transcription factor-like protein isoform X1
Genome locationCarg_Chr02:9346764..9350736
RNA-Seq ExpressionCarg08894
SyntenyCarg08894
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606307.1 Ethylene-responsive transcription factor-like protein, partial [Cucurbita argyrosperma subsp. sororia]7.0e-12499.57Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF
        IVENDDLNKRQDEFSDL APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF

KAG7036247.1 Ethylene-responsive transcription factor-like protein [Cucurbita argyrosperma subsp. argyrosperma]1.4e-124100Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF
        IVENDDLNKRQDEFSDLTAPEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF

XP_022930940.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita moschata]5.0e-12298.26Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF
        IV NDDLNKRQDEF DL+APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF

XP_022930943.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucurbita moschata]4.7e-12097.83Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEI ENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF
        IV NDDLNKRQDEF DL+APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF

XP_023534153.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita pepo subsp. pepo]3.8e-11493.91Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGS DAPVSK SENSTAEDLEH +  ISVHPICSNEFNEIE NPVANLETESSRVSVLDTSKEKSDEPFAEPPVK RKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF
        ++ NDDLNKRQDEFSDL+APEDIE LA+KF
Subjt:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF

TrEMBL top hitse value%identityAlignment
A0A0A0LCE7 AP2/ERF domain-containing protein1.8e-8875.32Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEK----SDEPFAEPPVKRRKRHR
        MVSLRRRKLLGL +GK SF APV K SEN TAED  HCT+F+ V+PICS++ N+IEENP AN+E ESS VSVLDTSKE+    +DEP A+PPVKRRKRHR
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEK----SDEPFAEPPVKRRKRHR

Query:  RKQFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKN
        RK FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPE EK+ELRK NWDEFLAMTR  I N+KQKR+SPESK 
Subjt:  RKQFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKN

Query:  SKLPIVENDDLNKRQDEFSDLTAPEDIEPLA
        S+L    NDD NKR D+F D +  ED+EP+A
Subjt:  SKLPIVENDDLNKRQDEFSDLTAPEDIEPLA

A0A6J1DZ33 ethylene-responsive transcription factor-like protein At4g13040 isoform X11.8e-8572.81Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETE-SSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQ
        MVSLRRRKLLG C+GKGSF APV K SEN T E+  HCTNF+SVHPICS++ N+I+ENP+AN E E SSRV+VLDTSKEK++E  A+PPV+ RKRH RK+
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETE-SSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQ

Query:  FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKL
        FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGR+PNFELPE EK+ELRK+NWD+FLA+TR  I N+KQKR+SPES  SKL
Subjt:  FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKL

Query:  PIVENDDLNKRQDEFSDLTAPEDIEPLA
        P   N D +KR  +FS+L+  ED++P A
Subjt:  PIVENDDLNKRQDEFSDLTAPEDIEPLA

A0A6J1ESB0 ethylene-responsive transcription factor-like protein At4g13040 isoform X12.4e-12298.26Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF
        IV NDDLNKRQDEF DL+APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF

A0A6J1EWY2 ethylene-responsive transcription factor-like protein At4g13040 isoform X22.3e-12097.83Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEI ENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF
        IV NDDLNKRQDEF DL+APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF

A0A6J1K7F1 ethylene-responsive transcription factor-like protein At4g130401.1e-10990.43Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE+ P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF
        I+ +DDLNKRQDEFSDL+APEDIEPLALKF
Subjt:  IVENDDLNKRQDEFSDLTAPEDIEPLALKF

SwissProt top hitse value%identityAlignment
Q56XP9 Ethylene-responsive transcription factor-like protein At4g130402.2e-3547.93Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP

Arabidopsis top hitse value%identityAlignment
AT4G13040.1 Integrase-type DNA-binding superfamily protein1.5e-3647.93Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP

AT4G13040.2 Integrase-type DNA-binding superfamily protein1.6e-3363.33Show/hide
Query:  AEPPVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIA
        ++ P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I 
Subjt:  AEPPVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIA

Query:  NKKQK-RVSPESK--NSKLP
        NKK K R+  E    N+ +P
Subjt:  NKKQK-RVSPESK--NSKLP

AT4G13040.3 Integrase-type DNA-binding superfamily protein1.5e-3647.93Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP
        MVSLRRR+LLGLC G   +  P+   +       + +     + +P  +        E   IEE    +  T S     R    D S   SD   P  +P
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSN-------EFNEIEENPVANLETESS----RVSVLDTSKEKSD--EPFAEP

Query:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK
        P KRRK+HRRK+   +E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK
Subjt:  PVKRRKRHRRKQ-FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKK

Query:  QK-RVSPESK--NSKLP
         K R+  E    N+ +P
Subjt:  QK-RVSPESK--NSKLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTCGGACTTTGCACTGGCAAAGGCTCATTTGATGCTCCAGTTTCAAAGTCTTCTGAAAATTCGACTGCTGAAGATCTCGAGCA
CTGTACGAACTTCATTAGTGTTCATCCCATCTGTTCGAACGAATTCAACGAGATAGAGGAGAATCCCGTTGCAAATTTAGAGACCGAATCGTCGAGGGTATCTGTTTTGG
ACACATCAAAGGAGAAAAGTGATGAGCCATTTGCAGAACCGCCCGTAAAACGTAGAAAACGACACCGGAGAAAGCAGTTTCCAGAAGAATGTTTCTTAATGAGAGGTGTT
TATTTCAAGAACATGAAATGGCAGGCTGCTATTAAGGTTGACAAGAAACAAATACACTTGGGAACTGTTGAATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGCTGC
TTTCATGTGTGGAAGGGAACCCAACTTCGAGCTCCCAGAGGCGGAGAAGAAAGAACTGAGAAAGGTTAATTGGGACGAGTTTTTAGCAATGACTCGGCTCGCAATCGCTA
ATAAAAAACAAAAGAGGGTCAGCCCAGAATCAAAGAACTCTAAACTTCCTATTGTAGAGAACGACGACTTGAACAAGAGACAGGATGAGTTCAGTGACCTCACAGCTCCA
GAAGATATTGAGCCACTTGCCTTGAAATTTTGA
mRNA sequenceShow/hide mRNA sequence
TAGAAGTCACGTTCTCTCTCTCTCATTAGTATTTCTCTCTCTAAAACCAATTTTATTCTCTCTGCCTCCATTGCAGACCACCACCTGGTTCTCGTCATCAGTCCAAACAG
AACAAATTTACAAAACGAGGGATTTCAAAAAGGTGGTAGAGAAACAGATGATGCATGAGTTCAGAATTTGCGTATAATCCTTTCTCCTCGTAATCTCAACCCGCTTTTGT
TGTTCGATCGTTTTTCTACGCCTCCTCCGCCGCCGAGACCTCAGATGTGAGGTGATCCAGGAAAAACAACCAAATAGTACCAGCGATAACGCGATCGAAGCTAATTATGG
TGAGCTTAAGAAGGCGTAAACTCCTCGGACTTTGCACTGGCAAAGGCTCATTTGATGCTCCAGTTTCAAAGTCTTCTGAAAATTCGACTGCTGAAGATCTCGAGCACTGT
ACGAACTTCATTAGTGTTCATCCCATCTGTTCGAACGAATTCAACGAGATAGAGGAGAATCCCGTTGCAAATTTAGAGACCGAATCGTCGAGGGTATCTGTTTTGGACAC
ATCAAAGGAGAAAAGTGATGAGCCATTTGCAGAACCGCCCGTAAAACGTAGAAAACGACACCGGAGAAAGCAGTTTCCAGAAGAATGTTTCTTAATGAGAGGTGTTTATT
TCAAGAACATGAAATGGCAGGCTGCTATTAAGGTTGACAAGAAACAAATACACTTGGGAACTGTTGAATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGCTGCTTTC
ATGTGTGGAAGGGAACCCAACTTCGAGCTCCCAGAGGCGGAGAAGAAAGAACTGAGAAAGGTTAATTGGGACGAGTTTTTAGCAATGACTCGGCTCGCAATCGCTAATAA
AAAACAAAAGAGGGTCAGCCCAGAATCAAAGAACTCTAAACTTCCTATTGTAGAGAACGACGACTTGAACAAGAGACAGGATGAGTTCAGTGACCTCACAGCTCCAGAAG
ATATTGAGCCACTTGCCTTGAAATTTTGATGGAGATATGCAGTTTTGATTTCTTTTATTAAAAGGCCATTGAAATTTTGGAGCATTGGATGTACATCAGTTTAGTGATTT
AGGCAAATCCTTCGGGCATTTCATGAACAATGATCTTGCAACAATTTAGACATGAACAAGTCAGGAATTTTAGTACTTTCAGG
Protein sequenceShow/hide protein sequence
MVSLRRRKLLGLCTGKGSFDAPVSKSSENSTAEDLEHCTNFISVHPICSNEFNEIEENPVANLETESSRVSVLDTSKEKSDEPFAEPPVKRRKRHRRKQFPEECFLMRGV
YFKNMKWQAAIKVDKKQIHLGTVESQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKVNWDEFLAMTRLAIANKKQKRVSPESKNSKLPIVENDDLNKRQDEFSDLTAP
EDIEPLALKF