; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh02G016710 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh02G016710
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Descriptionethylene-responsive transcription factor-like protein isoform X2
Genome locationCma_Chr02:9443279..9447313
RNA-Seq ExpressionCmaCh02G016710
SyntenyCmaCh02G016710
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606307.1 Ethylene-responsive transcription factor-like protein, partial [Cucurbita argyrosperma subsp. sororia]3.3e-11090.43Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE+ P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF
        I+ +DDLNKRQDEFSDL APEDIEPLALKF
Subjt:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF

KAG7036247.1 Ethylene-responsive transcription factor-like protein [Cucurbita argyrosperma subsp. argyrosperma]1.5e-11090.43Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE+ P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRK NWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF
        I+ +DDLNKRQDEFSDL+APEDIEPLALKF
Subjt:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF

XP_022930940.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita moschata]6.1e-11291.3Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE+ P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF
        I+G+DDLNKRQDEF DLSAPEDIEPLALKF
Subjt:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF

XP_022930943.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucurbita moschata]8.8e-11191.3Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE  P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF
        I+G+DDLNKRQDEF DLSAPEDIEPLALKF
Subjt:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF

XP_022995968.1 ethylene-responsive transcription factor-like protein At4g13040 [Cucurbita maxima]3.7e-125100Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHFPEE
        MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHFPEE
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHFPEE

Query:  CFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLPILG
        CFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLPILG
Subjt:  CFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLPILG

Query:  SDDLNKRQDEFSDLSAPEDIEPLALKF
        SDDLNKRQDEFSDLSAPEDIEPLALKF
Subjt:  SDDLNKRQDEFSDLSAPEDIEPLALKF

TrEMBL top hitse value%identityAlignment
A0A0A0LCE7 AP2/ERF domain-containing protein2.6e-8471.86Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSEN---SDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEK----SDEPFEEPPVKHRKQHR
        MVSLRRRKLLGL +GK SF APV KFSEN    D  HC++F+ V+PI S++ N+IE+ P AN+E E   VSV DTSKE+    +DEP  +PPVK RK+HR
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSEN---SDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEK----SDEPFEEPPVKHRKQHR

Query:  RKHFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKN
        RKHFP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPE EK+ELRKFNWDEFLAMTR  I N+KQKR+SPESK 
Subjt:  RKHFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKN

Query:  SKLPILGSDDLNKRQDEFSDLSAPEDIEPLA
        S+L   G+DD NKR D+F D S  ED+EP+A
Subjt:  SKLPILGSDDLNKRQDEFSDLSAPEDIEPLA

A0A6J1DZ33 ethylene-responsive transcription factor-like protein At4g13040 isoform X11.0e-8069.74Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLE---HCSNFISVHPIFSNEFNEIEKIPAANLETE-PLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKH
        MVSLRRRKLLG C+GKGSF APV KFSEN   E   HC+NF+SVHPI S++ N+I++ P AN E E   RV+V DTSKEK++E   +PPV+ RK+H RK 
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLE---HCSNFISVHPIFSNEFNEIEKIPAANLETE-PLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKH

Query:  FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKL
        FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPE EK+ELRK NWD+FLA+TR  I N+KQKR+SPES  SKL
Subjt:  FPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKL

Query:  PILGSDDLNKRQDEFSDLSAPEDIEPLA
        P  G+ D +KR  +FS+LS  ED++P A
Subjt:  PILGSDDLNKRQDEFSDLSAPEDIEPLA

A0A6J1ESB0 ethylene-responsive transcription factor-like protein At4g13040 isoform X12.9e-11291.3Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE+ P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF
        I+G+DDLNKRQDEF DLSAPEDIEPLALKF
Subjt:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF

A0A6J1EWY2 ethylene-responsive transcription factor-like protein At4g13040 isoform X24.2e-11191.3Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF
        MVSLRRRKLLGLCTGKGSFDAPVSK SENS   DLEHC+NFISVHPI SNEFNEIE  P ANLETE  RVSV DTSKEKSDEPF EPPVK RK+HRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENS---DLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHF

Query:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
        PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP
Subjt:  PEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLP

Query:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF
        I+G+DDLNKRQDEF DLSAPEDIEPLALKF
Subjt:  ILGSDDLNKRQDEFSDLSAPEDIEPLALKF

A0A6J1K7F1 ethylene-responsive transcription factor-like protein At4g130401.8e-125100Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHFPEE
        MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHFPEE
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHFPEE

Query:  CFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLPILG
        CFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLPILG
Subjt:  CFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLPILG

Query:  SDDLNKRQDEFSDLSAPEDIEPLALKF
        SDDLNKRQDEFSDLSAPEDIEPLALKF
Subjt:  SDDLNKRQDEFSDLSAPEDIEPLALKF

SwissProt top hitse value%identityAlignment
Q56XP9 Ethylene-responsive transcription factor-like protein At4g130403.3e-3647.44Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSN-----------FISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSD--EPFEEPPVK
        MVSLRRR+LLGLC G   +  P+   +    +    N             +V  +   +    E+      +    R    D S   SD   P  +PP K
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSN-----------FISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSD--EPFEEPPVK

Query:  HRKQHRRK--HFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQK
         RKQHRRK  H  E C LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK K
Subjt:  HRKQHRRK--HFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQK

Query:  -RVSPESK--NSKLP
         R+  E    N+ +P
Subjt:  -RVSPESK--NSKLP

Arabidopsis top hitse value%identityAlignment
AT4G13040.1 Integrase-type DNA-binding superfamily protein2.3e-3747.44Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSN-----------FISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSD--EPFEEPPVK
        MVSLRRR+LLGLC G   +  P+   +    +    N             +V  +   +    E+      +    R    D S   SD   P  +PP K
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSN-----------FISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSD--EPFEEPPVK

Query:  HRKQHRRK--HFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQK
         RKQHRRK  H  E C LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK K
Subjt:  HRKQHRRK--HFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQK

Query:  -RVSPESK--NSKLP
         R+  E    N+ +P
Subjt:  -RVSPESK--NSKLP

AT4G13040.2 Integrase-type DNA-binding superfamily protein5.4e-3465Show/hide
Query:  EPPVKHRKQHRRK--HFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIA
        + P K RKQHRRK  H  E C LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I 
Subjt:  EPPVKHRKQHRRK--HFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIA

Query:  NKKQK-RVSPESK--NSKLP
        NKK K R+  E    N+ +P
Subjt:  NKKQK-RVSPESK--NSKLP

AT4G13040.3 Integrase-type DNA-binding superfamily protein2.3e-3747.44Show/hide
Query:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSN-----------FISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSD--EPFEEPPVK
        MVSLRRR+LLGLC G   +  P+   +    +    N             +V  +   +    E+      +    R    D S   SD   P  +PP K
Subjt:  MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSN-----------FISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSD--EPFEEPPVK

Query:  HRKQHRRK--HFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQK
         RKQHRRK  H  E C LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  I NKK K
Subjt:  HRKQHRRK--HFPEECFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQK

Query:  -RVSPESK--NSKLP
         R+  E    N+ +P
Subjt:  -RVSPESK--NSKLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTCGGACTTTGCACTGGCAAAGGCTCATTTGATGCTCCAGTTTCAAAGTTTTCTGAAAATTCAGATCTCGAGCACTGT
TCGAACTTCATTAGTGTTCATCCCATCTTTTCGAACGAATTCAACGAGATAGAGAAGATTCCCGCTGCAAATTTAGAGACTGAACCATTGAGGGTATCAGTTTTT
GACACATCAAAGGAGAAAAGTGATGAGCCATTTGAAGAACCGCCCGTAAAACATAGAAAACAACACCGGAGAAAGCATTTTCCAGAAGAATGTTTCTTAATGAGA
GGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCTATTAAGGTTGACAAGAAACAAATACACTTGGGAACTGTTGGATCACAAGAAGAAGCTGCTCATTTGTAT
GACAGAGCTGCTTTCATGTGTGGAAGGGAACCCAACTTCGAGCTCCCAGAGGCGGAGAAGAAAGAACTGAGAAAGTTTAATTGGGATGAGTTTTTAGCAATGACT
CGACTCGCAATCGCTAATAAAAAACAAAAGAGGGTCAGCCCAGAATCAAAGAACTCTAAACTTCCTATTCTGGGGAGCGACGACTTGAACAAGAGACAGGATGAG
TTCAGTGACCTCTCAGCTCCAGAAGATATTGAACCACTTGCCTTGAAATTTTGA
mRNA sequenceShow/hide mRNA sequence
TAGAAGTCACGTTCTCTCTCTCATTAGTATTTCTCTCTCTAAAACCAAATTTATTCTCTCTGCCTCCATTGCAGACCACCACCTGGTTATGGTCATCAGTCCAAA
CGGAACAAATTTACAAAACGAGGGATTTCAAAAAGGTGGTAGAGAAACAGATGATGCATGAGTTCAGAATTTGCGTATAATCCTTTCTCCTCGTAATCTCAACCC
GCTTTTGTTTTTCGATCGTTTTTCTACGCCTCCGCCGCTGTGACCTCAGATGTGAGGTGATCCAGGAGAAACAACCAAATAGTACCAGCGATAACGCTATCAAAG
CTAATTATGGTGAGCTTAAGAAGGCGTAAACTCCTCGGACTTTGCACTGGCAAAGGCTCATTTGATGCTCCAGTTTCAAAGTTTTCTGAAAATTCAGATCTCGAG
CACTGTTCGAACTTCATTAGTGTTCATCCCATCTTTTCGAACGAATTCAACGAGATAGAGAAGATTCCCGCTGCAAATTTAGAGACTGAACCATTGAGGGTATCA
GTTTTTGACACATCAAAGGAGAAAAGTGATGAGCCATTTGAAGAACCGCCCGTAAAACATAGAAAACAACACCGGAGAAAGCATTTTCCAGAAGAATGTTTCTTA
ATGAGAGGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCTATTAAGGTTGACAAGAAACAAATACACTTGGGAACTGTTGGATCACAAGAAGAAGCTGCTCAT
TTGTATGACAGAGCTGCTTTCATGTGTGGAAGGGAACCCAACTTCGAGCTCCCAGAGGCGGAGAAGAAAGAACTGAGAAAGTTTAATTGGGATGAGTTTTTAGCA
ATGACTCGACTCGCAATCGCTAATAAAAAACAAAAGAGGGTCAGCCCAGAATCAAAGAACTCTAAACTTCCTATTCTGGGGAGCGACGACTTGAACAAGAGACAG
GATGAGTTCAGTGACCTCTCAGCTCCAGAAGATATTGAACCACTTGCCTTGAAATTTTGATGGAGATATGCAGTTTTGATTTCTTTTATTAAAAGGCCATTGAAA
TTTTGGATCATTGGATGTACATCAGTTTAGTGATTTAGGCAAATCCTTCGGGCATTTCATGAACAATGATCTTGCAACAATTTAGACATGAACAAGTCAGGAATT
TAGTACATTCAGGTGAAAATCTACTTTAGATTCATAGTAAATCATTTACATGTATTGAGTTCAGCAAATCGCTACTAGTTTGAACACAGAACAGGAACGAC
Protein sequenceShow/hide protein sequence
MVSLRRRKLLGLCTGKGSFDAPVSKFSENSDLEHCSNFISVHPIFSNEFNEIEKIPAANLETEPLRVSVFDTSKEKSDEPFEEPPVKHRKQHRRKHFPEECFLMR
GVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEAEKKELRKFNWDEFLAMTRLAIANKKQKRVSPESKNSKLPILGSDDLNKRQDE
FSDLSAPEDIEPLALKF