; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G001530 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G001530
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionethylene-responsive transcription factor ERF039-like
Genome locationCmo_Chr08:893386..893970
RNA-Seq ExpressionCmoCh08G001530
SyntenyCmoCh08G001530
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592942.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. sororia]7.1e-6998.66Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRG-TTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
        MNTNSHHSQPTTTSSASSSASSSGENGRKRPRG TT SNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRG-TTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK

Query:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEE
        GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEE
Subjt:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEE

KAG7025350.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-9796.46Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRG-TTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
        MNTNSHHSQPTTTSSASSSASSSGEN RKRPRG TT SNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRG-TTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK

Query:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMA---EEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP
        GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMA   EEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQ+SWHWDSYSHTILP
Subjt:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMA---EEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP

XP_022959732.1 ethylene-responsive transcription factor ERF039-like [Cucurbita moschata]1.3e-102100Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG
        MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG

Query:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP
        CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP
Subjt:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP

XP_023004375.1 ethylene-responsive transcription factor ERF039-like [Cucurbita maxima]1.8e-9695.38Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGT-TMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
        MNTNSHHSQPTTTSSASSS SSSG+N +KRPRGT T SNIESESESSH TRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGT-TMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK

Query:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP
        GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPM EEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQ+SWHWDSYSHTILP
Subjt:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP

XP_023515152.1 ethylene-responsive transcription factor ERF039-like [Cucurbita pepo subsp. pepo]5.2e-9696.45Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRG-TTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
        MNTNSHHSQPTTTSSASSSASSSGEN RKRPRG TT SNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRG-TTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK

Query:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMA--EEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP
        GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMA  EEEEEEEETWFDLPDLVVGGSDGLLLA YDSPWQFGGDQQ SWHWDSYSHTILP
Subjt:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMA--EEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP

TrEMBL top hitse value%identityAlignment
A0A0A0KAN0 AP2/ERF domain-containing protein2.7e-5060.91Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG
        M TNSHHS  TT+SSA S  +++    RKR R +T S  ESES+   ST K+RGVR RAWGKWVSEIREPRKKSRIWLGTY TAEMAARAHD AALAIKG
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG

Query:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAE---------EEEEEEETWFDLPDLVVGGSDGLLL-------------AGYDSPWQF---
          AFLNFP+LKHQLPRPASLSAKDIQAAAA AA LK              EEEEEE TWFDLPDL+V  S+G LL             +   S WQF   
Subjt:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAE---------EEEEEEETWFDLPDLVVGGSDGLLL-------------AGYDSPWQF---

Query:  -GGDQQHSWHWDSYSHTILP
           D    WH D +SHTI P
Subjt:  -GGDQQHSWHWDSYSHTILP

A0A1S3CBH6 ethylene-responsive transcription factor ERF039-like2.3e-4961.68Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG
        M TNSHHS  TT+SSA++S        RKR R +T S  ESES+   ST K+RGVR R WGKWVSEIREPRKKSRIWLGTY TAEMAARAHD AALAIKG
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG

Query:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLK--PPMAE--EEEEEEETWFDLPDLVVGGSDGLLL------------AGYDSPWQF----GGDQQ
          AFLNFP+LKHQLPRPASLSAKDIQAAAA AA LK  P  A     EEEE TWFDLPDL+V  S+G LL            +   + WQF      D  
Subjt:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLK--PPMAE--EEEEEEETWFDLPDLVVGGSDGLLL------------AGYDSPWQF----GGDQQ

Query:  HSWHWDSYSHTILP
          WH D +SHTI P
Subjt:  HSWHWDSYSHTILP

A0A5A7TVV7 Ethylene-responsive transcription factor ERF039-like protein2.3e-4961.68Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG
        M TNSHHS  TT+SSA++S        RKR R +T S  ESES+   ST K+RGVR R WGKWVSEIREPRKKSRIWLGTY TAEMAARAHD AALAIKG
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG

Query:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLK--PPMAE--EEEEEEETWFDLPDLVVGGSDGLLL------------AGYDSPWQF----GGDQQ
          AFLNFP+LKHQLPRPASLSAKDIQAAAA AA LK  P  A     EEEE TWFDLPDL+V  S+G LL            +   + WQF      D  
Subjt:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLK--PPMAE--EEEEEEETWFDLPDLVVGGSDGLLL------------AGYDSPWQF----GGDQQ

Query:  HSWHWDSYSHTILP
          WH D +SHTI P
Subjt:  HSWHWDSYSHTILP

A0A6J1H5C9 ethylene-responsive transcription factor ERF039-like6.2e-103100Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG
        MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKG

Query:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP
        CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP
Subjt:  CGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP

A0A6J1KUE1 ethylene-responsive transcription factor ERF039-like8.7e-9795.38Show/hide
Query:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGT-TMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
        MNTNSHHSQPTTTSSASSS SSSG+N +KRPRGT T SNIESESESSH TRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK
Subjt:  MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGT-TMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIK

Query:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP
        GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPM EEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQ+SWHWDSYSHTILP
Subjt:  GCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP

SwissProt top hitse value%identityAlignment
O80654 Ethylene-responsive transcription factor ERF0371.5e-2978.57Show/hide
Query:  YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATL
        YRGVR R WGKWVSEIREPRKKSRIWLGT+ST EMAARAHDAAAL IKG  A LNFPEL   LPRPAS S +D+QAAAA AA +
Subjt:  YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATL

Q52QU1 Ethylene-responsive transcription factor ERF0423.3e-2966.34Show/hide
Query:  TMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAAT
        T  + + + E +     YRG R R+WGKWVSEIREPRKKSRIWLGT+ TAEMAARAHD AAL+IKG  A LNFPEL   LPRP SLS +DIQAAAA+AA 
Subjt:  TMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAAT

Query:  L
        +
Subjt:  L

Q8LBQ7 Ethylene-responsive transcription factor ERF0349.7e-2955.56Show/hide
Query:  SQPTTTSSASSSASSS---GENGRKRPRGTTMSNIESESESSHST------RK---------YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAAR
        S PTT+SS+SSS +S+    +N +++    ++S++ S  +           RK         YRGVR R+WGKWVSEIREPRKKSRIWLGTY TAEMAAR
Subjt:  SQPTTTSSASSSASSS---GENGRKRPRGTTMSNIESESESSHST------RK---------YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAAR

Query:  AHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAA
        AHD AALAIKG  A+LNFP+L  +LPRP + S KDIQAAA+ AA
Subjt:  AHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAA

Q9M210 Ethylene-responsive transcription factor ERF0351.8e-3042.04Show/hide
Query:  TTTSSASSSASSSGENGRKRPRGTTMSN-------IESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGA
        T T S+SS  +SS ++     R     N         + S+ + +   YRGVR R+WGKWVSEIREPRKKSRIWLGTY TAEMAARAHD AALAIKG   
Subjt:  TTTSSASSSASSSGENGRKRPRGTTMSN-------IESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGA

Query:  FLNFPELKHQLPRPASLSAKDIQAA---AADAATLKPPMAE------------------------------------EEEEEEETWFDLPDLVVGG----
        FLNFPEL   LPRP S S KDIQAA   AA+A T   P+ +                                    ++E  EET FDLPDL   G    
Subjt:  FLNFPELKHQLPRPASLSAKDIQAA---AADAATLKPPMAE------------------------------------EEEEEEETWFDLPDLVVGG----

Query:  SDGLLLAGYDSPWQFGGDQQHSWHWD
        +D   L      WQ  G++   + ++
Subjt:  SDGLLLAGYDSPWQFGGDQQHSWHWD

Q9ZQP3 Ethylene-responsive transcription factor ERF0382.0e-2980.49Show/hide
Query:  YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAA
        +RGVR R WGKWVSEIREP+KKSRIWLGT+STAEMAARAHD AALAIKG  A LNFPEL + LPRPAS   KDIQAAAA AA
Subjt:  YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAA

Arabidopsis top hitse value%identityAlignment
AT1G77200.1 Integrase-type DNA-binding superfamily protein1.1e-3078.57Show/hide
Query:  YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATL
        YRGVR R WGKWVSEIREPRKKSRIWLGT+ST EMAARAHDAAAL IKG  A LNFPEL   LPRPAS S +D+QAAAA AA +
Subjt:  YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAATL

AT2G25820.1 Integrase-type DNA-binding superfamily protein2.4e-3066.34Show/hide
Query:  TMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAAT
        T  + + + E +     YRG R R+WGKWVSEIREPRKKSRIWLGT+ TAEMAARAHD AAL+IKG  A LNFPEL   LPRP SLS +DIQAAAA+AA 
Subjt:  TMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAAT

Query:  L
        +
Subjt:  L

AT2G35700.1 ERF family protein 381.4e-3080.49Show/hide
Query:  YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAA
        +RGVR R WGKWVSEIREP+KKSRIWLGT+STAEMAARAHD AALAIKG  A LNFPEL + LPRPAS   KDIQAAAA AA
Subjt:  YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAA

AT2G44940.1 Integrase-type DNA-binding superfamily protein6.9e-3055.56Show/hide
Query:  SQPTTTSSASSSASSS---GENGRKRPRGTTMSNIESESESSHST------RK---------YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAAR
        S PTT+SS+SSS +S+    +N +++    ++S++ S  +           RK         YRGVR R+WGKWVSEIREPRKKSRIWLGTY TAEMAAR
Subjt:  SQPTTTSSASSSASSS---GENGRKRPRGTTMSNIESESESSHST------RK---------YRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAAR

Query:  AHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAA
        AHD AALAIKG  A+LNFP+L  +LPRP + S KDIQAAA+ AA
Subjt:  AHDAAALAIKGCGAFLNFPELKHQLPRPASLSAKDIQAAAADAA

AT3G60490.1 Integrase-type DNA-binding superfamily protein1.3e-3142.04Show/hide
Query:  TTTSSASSSASSSGENGRKRPRGTTMSN-------IESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGA
        T T S+SS  +SS ++     R     N         + S+ + +   YRGVR R+WGKWVSEIREPRKKSRIWLGTY TAEMAARAHD AALAIKG   
Subjt:  TTTSSASSSASSSGENGRKRPRGTTMSN-------IESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGA

Query:  FLNFPELKHQLPRPASLSAKDIQAA---AADAATLKPPMAE------------------------------------EEEEEEETWFDLPDLVVGG----
        FLNFPEL   LPRP S S KDIQAA   AA+A T   P+ +                                    ++E  EET FDLPDL   G    
Subjt:  FLNFPELKHQLPRPASLSAKDIQAA---AADAATLKPPMAE------------------------------------EEEEEEETWFDLPDLVVGG----

Query:  SDGLLLAGYDSPWQFGGDQQHSWHWD
        +D   L      WQ  G++   + ++
Subjt:  SDGLLLAGYDSPWQFGGDQQHSWHWD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACACAAACTCCCACCATTCTCAACCCACCACCACCTCCTCCGCCTCTTCTTCCGCCTCTTCCTCCGGCGAGAACGGCCGAAAGAGGCCGAGGGGTACGACGATGAG
CAACATAGAGAGCGAGAGCGAATCGAGCCACTCAACGAGGAAGTACAGAGGGGTTCGTCGGAGAGCGTGGGGAAAATGGGTGTCCGAAATAAGGGAGCCAAGGAAGAAAT
CAAGGATATGGCTCGGTACATACTCGACCGCGGAGATGGCGGCACGAGCACACGACGCGGCGGCTCTTGCCATCAAAGGCTGTGGTGCATTCCTCAACTTCCCTGAACTC
AAACACCAACTCCCCCGCCCTGCCTCCCTCTCCGCCAAGGACATCCAGGCCGCCGCCGCAGACGCAGCCACCCTCAAACCACCAATGGCGGAGGAGGAGGAGGAGGAGGA
AGAGACGTGGTTTGATTTGCCGGATCTTGTGGTCGGAGGGAGTGATGGGCTGCTGCTGGCCGGCTATGATTCGCCGTGGCAGTTCGGTGGAGATCAACAACATTCATGGC
ATTGGGACTCTTATTCTCATACTATTCTTCCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACACAAACTCCCACCATTCTCAACCCACCACCACCTCCTCCGCCTCTTCTTCCGCCTCTTCCTCCGGCGAGAACGGCCGAAAGAGGCCGAGGGGTACGACGATGAG
CAACATAGAGAGCGAGAGCGAATCGAGCCACTCAACGAGGAAGTACAGAGGGGTTCGTCGGAGAGCGTGGGGAAAATGGGTGTCCGAAATAAGGGAGCCAAGGAAGAAAT
CAAGGATATGGCTCGGTACATACTCGACCGCGGAGATGGCGGCACGAGCACACGACGCGGCGGCTCTTGCCATCAAAGGCTGTGGTGCATTCCTCAACTTCCCTGAACTC
AAACACCAACTCCCCCGCCCTGCCTCCCTCTCCGCCAAGGACATCCAGGCCGCCGCCGCAGACGCAGCCACCCTCAAACCACCAATGGCGGAGGAGGAGGAGGAGGAGGA
AGAGACGTGGTTTGATTTGCCGGATCTTGTGGTCGGAGGGAGTGATGGGCTGCTGCTGGCCGGCTATGATTCGCCGTGGCAGTTCGGTGGAGATCAACAACATTCATGGC
ATTGGGACTCTTATTCTCATACTATTCTTCCTTGA
Protein sequenceShow/hide protein sequence
MNTNSHHSQPTTTSSASSSASSSGENGRKRPRGTTMSNIESESESSHSTRKYRGVRRRAWGKWVSEIREPRKKSRIWLGTYSTAEMAARAHDAAALAIKGCGAFLNFPEL
KHQLPRPASLSAKDIQAAAADAATLKPPMAEEEEEEEETWFDLPDLVVGGSDGLLLAGYDSPWQFGGDQQHSWHWDSYSHTILP