; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G09030 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G09030
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionEthylene-responsive transcription factor-like protein
Genome locationClcChr09:7774491..7778357
RNA-Seq ExpressionClc09G09030
SyntenyClc09G09030
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0050790 - regulation of catalytic activity (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0005094 - Rho GDP-dissociation inhibitor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK11601.1 ethylene-responsive transcription factor-like protein [Cucumis melo var. makuwa]8.4e-10087.1Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFVAPLPKFS++ TAPENPSPSNKL+TLHP SSDDV I DRSSIV+MESSF NAL+W+S +EQSNQPISGHPLKHRKRHRR+NSHNQELSIMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNKQ
        KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELS EEKQELQKFKWEDFLAMTRHAITNKKHKRLS   G+SPKKL ASSLQI+N +
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNKQ

Query:  LKQKFNESPVTEDIYFT
        LK +FNESP+ EDI FT
Subjt:  LKQKFNESPVTEDIYFT

XP_004149780.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucumis sativus]1.7e-10088.02Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFVAPLPKFS++ TA ENPSPSN L+TLHP SSDDV I DRSSIV+MESSF NAL+WTS NEQSNQPISGHPLKHRKRHRR+NSHNQELSIMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRL--STGSSPKKLGASSLQIDNKQ
        KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELS+EEKQELQKFKWEDFLAMTRHAITNKKHKRL  S GSSPKKLGASSLQIDN +
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRL--STGSSPKKLGASSLQIDNKQ

Query:  LKQKFNESPVTEDIYFT
        LK +FNE+P+ EDI FT
Subjt:  LKQKFNESPVTEDIYFT

XP_008458053.1 PREDICTED: ethylene-responsive transcription factor-like protein At4g13040 [Cucumis melo]1.4e-9987.1Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFVAPLPKFS++ TAPENPSPSNKL+TLHP SSDDV I DRSSIV+MESSF NAL+WTS +EQSNQPISG+PLKHRKRHRR+NSHNQELSIMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNKQ
        KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELS EEKQELQKFKWEDFLAMTRHAITNKKHKRLS   G+SPKKL ASSLQI+N +
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNKQ

Query:  LKQKFNESPVTEDIYFT
        LK +FNESP+ EDI FT
Subjt:  LKQKFNESPVTEDIYFT

XP_038899850.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Benincasa hispida]1.9e-10490.7Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFV PLPKFSENGT PENPSPSNKLVTLHP SSDDV ITDRSSI +MESSFANALDWTS NEQSNQPISGHPLKHRKRHRR+NSHNQELSIMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLSTGSSPKKLGASSLQIDNKQLK
        KNMKWQAAIKVDKKQIHLGTFGSQEEAA LYDRAA+VCGREPNFELS+EEKQELQKFKWEDFLAMTRHAITNKKHKRL TGSSPKKLGASSLQIDN + K
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLSTGSSPKKLGASSLQIDNKQLK

Query:  QKFNESPVTEDIYFT
         +FNE PV EDI FT
Subjt:  QKFNESPVTEDIYFT

XP_038899853.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Benincasa hispida]1.9e-10490.7Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFV PLPKFSENGT PENPSPSNKLVTLHP SSDDV ITDRSSI +MESSFANALDWTS NEQSNQPISGHPLKHRKRHRR+NSHNQELSIMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLSTGSSPKKLGASSLQIDNKQLK
        KNMKWQAAIKVDKKQIHLGTFGSQEEAA LYDRAA+VCGREPNFELS+EEKQELQKFKWEDFLAMTRHAITNKKHKRL TGSSPKKLGASSLQIDN + K
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLSTGSSPKKLGASSLQIDNKQLK

Query:  QKFNESPVTEDIYFT
         +FNE PV EDI FT
Subjt:  QKFNESPVTEDIYFT

TrEMBL top hitse value%identityAlignment
A0A0A0K8E3 AP2/ERF domain-containing protein8.2e-10188.02Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFVAPLPKFS++ TA ENPSPSN L+TLHP SSDDV I DRSSIV+MESSF NAL+WTS NEQSNQPISGHPLKHRKRHRR+NSHNQELSIMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRL--STGSSPKKLGASSLQIDNKQ
        KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELS+EEKQELQKFKWEDFLAMTRHAITNKKHKRL  S GSSPKKLGASSLQIDN +
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRL--STGSSPKKLGASSLQIDNKQ

Query:  LKQKFNESPVTEDIYFT
        LK +FNE+P+ EDI FT
Subjt:  LKQKFNESPVTEDIYFT

A0A1S3C6H0 ethylene-responsive transcription factor-like protein At4g130406.9e-10087.1Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFVAPLPKFS++ TAPENPSPSNKL+TLHP SSDDV I DRSSIV+MESSF NAL+WTS +EQSNQPISG+PLKHRKRHRR+NSHNQELSIMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNKQ
        KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELS EEKQELQKFKWEDFLAMTRHAITNKKHKRLS   G+SPKKL ASSLQI+N +
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNKQ

Query:  LKQKFNESPVTEDIYFT
        LK +FNESP+ EDI FT
Subjt:  LKQKFNESPVTEDIYFT

A0A5A7SPL9 Ethylene-responsive transcription factor-like protein1.0e-9886.7Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISG-HPLKHRKRHRRRNSHNQELSIMRGVY
        GKSSFVAPLPKFS++ TAPENPSPSNKL+TLHP SSDDV I DRSSIV+MESSF NAL+W+S +EQSNQPISG HPLKHRKRHRR+NSHNQELSIMRGVY
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISG-HPLKHRKRHRRRNSHNQELSIMRGVY

Query:  FKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNK
        FKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELS EEKQELQKFKWEDFLAMTRHAITNKKHKRLS   G+SPKKL ASSLQI+N 
Subjt:  FKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNK

Query:  QLKQKFNESPVTEDIYFT
        +LK +FNESP+ EDI FT
Subjt:  QLKQKFNESPVTEDIYFT

A0A5D3CIE2 Ethylene-responsive transcription factor-like protein4.0e-10087.1Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFVAPLPKFS++ TAPENPSPSNKL+TLHP SSDDV I DRSSIV+MESSF NAL+W+S +EQSNQPISGHPLKHRKRHRR+NSHNQELSIMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNKQ
        KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELS EEKQELQKFKWEDFLAMTRHAITNKKHKRLS   G+SPKKL ASSLQI+N +
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLS--TGSSPKKLGASSLQIDNKQ

Query:  LKQKFNESPVTEDIYFT
        LK +FNESP+ EDI FT
Subjt:  LKQKFNESPVTEDIYFT

A0A6J1E2N6 ethylene-responsive transcription factor-like protein At4g13040 isoform X19.7e-9484.36Show/hide
Query:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF
        GKSSFVAPLPKFS+NGTAPENPSPSNKLVT+HPMSSDDV +TDRSS+V+ME +FA+A D TS  EQSNQP SGHPLKHRKRHRR+NSHNQEL+IMRGVYF
Subjt:  GKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYF

Query:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLSTGSSPKKLGASSLQIDNKQLK
        KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFEL ++EKQELQKFKWEDFLAMTRHAITNKKHKRL TG SPKK GASSL ID    K
Subjt:  KNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHKRLSTGSSPKKLGASSLQIDNKQLK

Query:  QKFNESPVTED
         +FN  P  ED
Subjt:  QKFNESPVTED

SwissProt top hitse value%identityAlignment
Q56XP9 Ethylene-responsive transcription factor-like protein At4g130401.8e-3648.94Show/hide
Query:  GKSSFVAPLPKF----------SENGTAPENPSPSNKLVT--LHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSH
        G + +V PLP            + N  A  NP P+  + T  +   + ++     RS  +   S ++   D +  N  S  P    P K RK+HRR+  H
Subjt:  GKSSFVAPLPKF----------SENGTAPENPSPSNKLVT--LHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSH

Query:  NQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHK
        NQE  +MRGVY+KNMKWQAAIKV+KKQIHLGTF SQEEAA LYDRAAF+CGREPNFELS+E  +EL++  WE+FL  TR  ITNKK K
Subjt:  NQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHK

Arabidopsis top hitse value%identityAlignment
AT4G13040.1 Integrase-type DNA-binding superfamily protein1.3e-3748.94Show/hide
Query:  GKSSFVAPLPKF----------SENGTAPENPSPSNKLVT--LHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSH
        G + +V PLP            + N  A  NP P+  + T  +   + ++     RS  +   S ++   D +  N  S  P    P K RK+HRR+  H
Subjt:  GKSSFVAPLPKF----------SENGTAPENPSPSNKLVT--LHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSH

Query:  NQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHK
        NQE  +MRGVY+KNMKWQAAIKV+KKQIHLGTF SQEEAA LYDRAAF+CGREPNFELS+E  +EL++  WE+FL  TR  ITNKK K
Subjt:  NQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHK

AT4G13040.2 Integrase-type DNA-binding superfamily protein1.7e-3771.7Show/hide
Query:  ISGHPLKHRKRHRRRNSHNQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAI
        IS  P K RK+HRR+  HNQE  +MRGVY+KNMKWQAAIKV+KKQIHLGTF SQEEAA LYDRAAF+CGREPNFELS+E  +EL++  WE+FL  TR  I
Subjt:  ISGHPLKHRKRHRRRNSHNQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAI

Query:  TNKKHK
        TNKK K
Subjt:  TNKKHK

AT4G13040.3 Integrase-type DNA-binding superfamily protein1.3e-3748.94Show/hide
Query:  GKSSFVAPLPKF----------SENGTAPENPSPSNKLVT--LHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSH
        G + +V PLP            + N  A  NP P+  + T  +   + ++     RS  +   S ++   D +  N  S  P    P K RK+HRR+  H
Subjt:  GKSSFVAPLPKF----------SENGTAPENPSPSNKLVT--LHPMSSDDVKITDRSSIVDMESSFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSH

Query:  NQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHK
        NQE  +MRGVY+KNMKWQAAIKV+KKQIHLGTF SQEEAA LYDRAAF+CGREPNFELS+E  +EL++  WE+FL  TR  ITNKK K
Subjt:  NQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDFLAMTRHAITNKKHK

AT4G37750.1 Integrase-type DNA-binding superfamily protein9.2e-0443.86Show/hide
Query:  RRRNSHNQELSIMRGV--YFKNMKWQAAI--KVDKKQIHLGTFGSQEEAAHLYDRAA
        R+ +  ++  SI RGV  + ++ +WQA I      K ++LGTFG+QEEAA  YD AA
Subjt:  RRRNSHNQELSIMRGV--YFKNMKWQAAI--KVDKKQIHLGTFGSQEEAAHLYDRAA

AT5G57390.1 AINTEGUMENTA-like 57.0e-0430.77Show/hide
Query:  ALDWTSPNEQSNQPISGH-----PLKHRKR-------HRRRNSHNQELSIMRGV--YFKNMKWQAAI--KVDKKQIHLGTFGSQEEAAHLYDRAAFV---
        AL +  P   +N PIS +      +KH  R        R+ +  ++  S+ RGV  + ++ +WQA I      K ++LGTF +QEEAA  YD AA     
Subjt:  ALDWTSPNEQSNQPISGH-----PLKHRKR-------HRRRNSHNQELSIMRGV--YFKNMKWQAAI--KVDKKQIHLGTFGSQEEAAHLYDRAAFV---

Query:  CGREPNFELSKEEKQEL
             NF++S+ + + +
Subjt:  CGREPNFELSKEEKQEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTGTGGGGAGATCCCTTCGTTTTATCAATTTGCATCAGTTGTAGAAGCTGGAAAAATGCTTTTGGCCTCTAATCATGATGCTATCTTGATGATTTTATCAGTGGG
GCGGATTCATGATAAACAATTGGTTGCATACATTGTGCACGACGTGAAGATCAACATCAACTATGGGAAAAGTTCATTTGTAGCTCCACTTCCTAAGTTTTCAGAAAATG
GAACTGCTCCCGAAAATCCTAGTCCGAGTAACAAACTTGTCACTTTGCATCCCATGTCTTCTGATGATGTTAAAATCACGGATAGGAGCTCTATTGTAGATATGGAATCC
AGTTTCGCAAATGCATTGGATTGGACCTCGCCGAATGAACAGTCTAATCAGCCTATTTCAGGGCATCCACTAAAACATAGAAAGAGACATAGGAGAAGGAACTCTCATAA
TCAAGAACTATCAATCATGAGGGGTGTCTACTTTAAGAACATGAAATGGCAGGCTGCAATTAAGGTCGACAAAAAACAGATACATTTAGGAACATTTGGTTCACAAGAAG
AGGCTGCACATTTGTATGACAGAGCTGCTTTTGTGTGTGGAAGGGAACCAAATTTTGAACTCTCCAAGGAAGAGAAGCAAGAACTCCAAAAGTTCAAATGGGAAGACTTC
CTTGCTATGACTCGACACGCAATCACAAACAAAAAGCACAAAAGACTCAGCACAGGATCATCGCCAAAGAAGCTTGGAGCTTCGTCGCTGCAAATTGACAACAAGCAACT
CAAGCAAAAATTCAACGAGTCTCCGGTAACAGAGGACATTTACTTCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTTGTGGGGAGATCCCTTCGTTTTATCAATTTGCATCAGTTGTAGAAGCTGGAAAAATGCTTTTGGCCTCTAATCATGATGCTATCTTGATGATTTTATCAGTGGG
GCGGATTCATGATAAACAATTGGTTGCATACATTGTGCACGACGTGAAGATCAACATCAACTATGGGAAAAGTTCATTTGTAGCTCCACTTCCTAAGTTTTCAGAAAATG
GAACTGCTCCCGAAAATCCTAGTCCGAGTAACAAACTTGTCACTTTGCATCCCATGTCTTCTGATGATGTTAAAATCACGGATAGGAGCTCTATTGTAGATATGGAATCC
AGTTTCGCAAATGCATTGGATTGGACCTCGCCGAATGAACAGTCTAATCAGCCTATTTCAGGGCATCCACTAAAACATAGAAAGAGACATAGGAGAAGGAACTCTCATAA
TCAAGAACTATCAATCATGAGGGGTGTCTACTTTAAGAACATGAAATGGCAGGCTGCAATTAAGGTCGACAAAAAACAGATACATTTAGGAACATTTGGTTCACAAGAAG
AGGCTGCACATTTGTATGACAGAGCTGCTTTTGTGTGTGGAAGGGAACCAAATTTTGAACTCTCCAAGGAAGAGAAGCAAGAACTCCAAAAGTTCAAATGGGAAGACTTC
CTTGCTATGACTCGACACGCAATCACAAACAAAAAGCACAAAAGACTCAGCACAGGATCATCGCCAAAGAAGCTTGGAGCTTCGTCGCTGCAAATTGACAACAAGCAACT
CAAGCAAAAATTCAACGAGTCTCCGGTAACAGAGGACATTTACTTCACTTGAGCTAGCTTTTAGCTCAAGAAGAATGGTAAATTTTTCTAGGTCCAACAGGTTAGGCCAA
GTTGCTTGAACCTCCATTGAAATTCCTCCCTCCCAAGTAAATGTGTTTTTAGGCTAGAATACATTGCGTCTTCATGTGGTAAGTTCATAATTTTTTTCTTTTTCAATTGA
AGCTTGTAATATGTTCCAATATAATTTACTGTTAATTTCTTTTTAGCACAAGCATATTGAATCGATATTGTAATGAAACAGAAGTTTGTAATTATTTTCATTAC
Protein sequenceShow/hide protein sequence
MPCGEIPSFYQFASVVEAGKMLLASNHDAILMILSVGRIHDKQLVAYIVHDVKININYGKSSFVAPLPKFSENGTAPENPSPSNKLVTLHPMSSDDVKITDRSSIVDMES
SFANALDWTSPNEQSNQPISGHPLKHRKRHRRRNSHNQELSIMRGVYFKNMKWQAAIKVDKKQIHLGTFGSQEEAAHLYDRAAFVCGREPNFELSKEEKQELQKFKWEDF
LAMTRHAITNKKHKRLSTGSSPKKLGASSLQIDNKQLKQKFNESPVTEDIYFT