; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g29570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g29570
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionethylene-responsive transcription factor ERF022-like
Genome locationchr11:21593298..21598734
RNA-Seq ExpressionMoc11g29570
SyntenyMoc11g29570
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOY17091.1 Integrase-type DNA-binding superfamily protein [Theobroma cacao]1.3e-5261.76Show/hide
Query:  TAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK-----RTPDAA
        ++ +RGVRKRKWGKWVSEIREPGKKTRIWLGS++TPEMAAAAYDVAALHLRG +ARLNFP+L+D LPRP SSN EDI+  AQ AAL ++      + +A 
Subjt:  TAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK-----RTPDAA

Query:  GACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD
        G+CS+   G  P+ V LSP+QIQAIN+SPLDSP MWMQM+E L  +E  +F  ++ + +W+D+++ S+WD
Subjt:  GACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD

KAG2710117.1 hypothetical protein I3760_04G009600 [Carya illinoinensis]9.7e-5367.27Show/hide
Query:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACSAVSV
        +RGVRKRKWGKWVSEIR PGKKTRIWLGSY+ PEMAAAAYDVAALHLRG    LNFP+LVD+LPRPASSN ED+Q AAQ AAL L+R         +   
Subjt:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACSAVSV

Query:  GAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLY--DHFDEQWEDVRHQSIWDS
         A P RVGLSPSQIQAIN+SPLDSP MWM++A AL  EE+S+ LY  DH   +W+ ++++SIWDS
Subjt:  GAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLY--DHFDEQWEDVRHQSIWDS

XP_017980907.1 PREDICTED: ethylene-responsive transcription factor ERF021 [Theobroma cacao]1.3e-5261.76Show/hide
Query:  TAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK-----RTPDAA
        ++ +RGVRKRKWGKWVSEIREPGKKTRIWLGS++TPEMAAAAYDVAALHLRG +ARLNFP+L+D LPRP SSN EDI+  AQ AAL ++      + +A 
Subjt:  TAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK-----RTPDAA

Query:  GACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD
        G+CS+   G  P+ V LSP+QIQAIN+SPLDSP MWMQM+E L  +E  +F  ++ + +W+D+++ S+WD
Subjt:  GACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD

XP_022141863.1 ethylene-responsive transcription factor ERF022-like [Momordica charantia]4.5e-90100Show/hide
Query:  TTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGA
        TTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGA
Subjt:  TTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGA

Query:  CSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWDS
        CSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWDS
Subjt:  CSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWDS

XP_024173792.1 ethylene-responsive transcription factor ERF021 [Rosa chinensis]1.1e-5361.98Show/hide
Query:  PSRSNRLITTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYL
        P+ S+  +    T+A+RGVRKRKWGKWVSEIREPGKKTRIWLGS++TPEMAAAAYDVAALH RG +ARLNFP+LV+ LPRPASS+P+DI+ AA  AAL L
Subjt:  PSRSNRLITTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYL

Query:  K-------RTPDAAGAC---SAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDE-----QWEDVRHQSIWD
        +        +  AAG C   S+ S G AP+ V LSPSQIQAIN+SPLDSP MWMQM+EAL  EE  +F  D  DE     QWE+++  S+WD
Subjt:  K-------RTPDAAGAC---SAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDE-----QWEDVRHQSIWD

TrEMBL top hitse value%identityAlignment
A0A061FJ37 Integrase-type DNA-binding superfamily protein6.1e-5361.76Show/hide
Query:  TAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK-----RTPDAA
        ++ +RGVRKRKWGKWVSEIREPGKKTRIWLGS++TPEMAAAAYDVAALHLRG +ARLNFP+L+D LPRP SSN EDI+  AQ AAL ++      + +A 
Subjt:  TAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK-----RTPDAA

Query:  GACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD
        G+CS+   G  P+ V LSP+QIQAIN+SPLDSP MWMQM+E L  +E  +F  ++ + +W+D+++ S+WD
Subjt:  GACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD

A0A2I4FHF9 ethylene-responsive transcription factor ERF022-like1.4e-5262.78Show/hide
Query:  PSRSNRLITTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYL
        PS  N       + A+RGVRKRKWGKWVSEIR PGKKTRIWLGSY+ PEMAAAAYDVAALHLRG    LNFP+LVD+LPRPASS+ ED+Q AAQ AAL L
Subjt:  PSRSNRLITTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYL

Query:  KRTPDAAGACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLY--DHFDEQWEDVRHQSIWDS
        +R         +    A P RVGLSPSQIQAIN+SPLDSP MWM++A AL  EE+++ LY  DH   +W+ ++++SIWDS
Subjt:  KRTPDAAGACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLY--DHFDEQWEDVRHQSIWDS

A0A2N9I738 AP2/ERF domain-containing protein2.3e-5269.7Show/hide
Query:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACSAVSV
        +RGVRKRKWGKWVSEIREPGKKTRIWLGSY+ PEMAAAAYDVAALHLRG  ARLNFP+LVD+LPRPASS  ED+Q AAQ AAL  +R P   G+  A   
Subjt:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACSAVSV

Query:  GAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDE--QWEDVRHQSIWDS
           PIRVGLSP QIQAIN+SPLDSP MWM++A AL  EED + LYD+  E  +WE+++++SIWDS
Subjt:  GAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDE--QWEDVRHQSIWDS

A0A2P6PFS1 Putative transcription factor AP2-EREBP family5.6e-5461.98Show/hide
Query:  PSRSNRLITTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYL
        P+ S+  +    T+A+RGVRKRKWGKWVSEIREPGKKTRIWLGS++TPEMAAAAYDVAALH RG +ARLNFP+LV+ LPRPASS+P+DI+ AA  AAL L
Subjt:  PSRSNRLITTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYL

Query:  K-------RTPDAAGAC---SAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDE-----QWEDVRHQSIWD
        +        +  AAG C   S+ S G AP+ V LSPSQIQAIN+SPLDSP MWMQM+EAL  EE  +F  D  DE     QWE+++  S+WD
Subjt:  K-------RTPDAAGAC---SAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDE-----QWEDVRHQSIWD

A0A6J1CLS1 ethylene-responsive transcription factor ERF022-like2.2e-90100Show/hide
Query:  TTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGA
        TTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGA
Subjt:  TTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGA

Query:  CSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWDS
        CSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWDS
Subjt:  CSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWDS

SwissProt top hitse value%identityAlignment
Q1ECI2 Ethylene-responsive transcription factor ERF0231.8e-2561.29Show/hide
Query:  TTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK
        +TT    + GVRKR+WGKWVSEIREP KK+RIWLGS+  PEMAA AYDVAA  L+G  A+LNFP+ ++ LPRP++  P DIQ AA  AA  +K
Subjt:  TTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK

Q9C9I2 Ethylene-responsive transcription factor ERF0213.0e-3652.41Show/hide
Query:  AAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAAL--YLKRTPDAAGACS
        +A+RGVRKRKWGKWVSEIREPG K RIWLGS++TPEMAA AYDVAA H RG +ARLNFP+L  +LPRPA S+ + I+ A   A L    + T  A    S
Subjt:  AAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAAL--YLKRTPDAAGACS

Query:  AVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD
        + S   AP  V LSP +IQAIN+S L SPT  M      S+ +   F  D     WE  +   +WD
Subjt:  AVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD

Q9LQ28 Ethylene-responsive transcription factor ERF0223.3e-4360.69Show/hide
Query:  AFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACS--A
        ++RG+R+RKWGKWVSEIREPGKKTRIWLGSY+T EMAAAAYD AALHLRG    LNFP+LVD+ PRP SS+ E IQ AAQ AAL  K      G  S  A
Subjt:  AFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACS--A

Query:  VSVGAAPIRVGLSPSQIQAINDSPLDSPTM-WMQMAEALSSEEDSIFLYDHF------DEQWEDVRHQSIWDS
        +  G    RVGLSP QIQAIN+SPLDSP M WMQ  E    EE    LY  F      DE +E  + QSIW+S
Subjt:  VSVGAAPIRVGLSPSQIQAINDSPLDSPTM-WMQMAEALSSEEDSIFLYDHF------DEQWEDVRHQSIWDS

Q9LYD3 Dehydration-responsive element-binding protein 31.8e-2559.6Show/hide
Query:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACSAVS
        +RGVR R WGKWVSEIREP KK+RIWLG++ TPEMAA A+DVAAL ++G  A LNFP+L D+ PRP S +P DIQTAA L A +++ T   + + S+ S
Subjt:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACSAVS

Q9SUK8 Ethylene-responsive transcription factor ERF0392.1e-2663.74Show/hide
Query:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDA
        FRGVR R+WGKWVSEIREP KK+RIWLG++ TPEMAA A+DVAAL ++G  A LNFP+L   LPRPAS++P+DIQ AA  AA    + P++
Subjt:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDA

Arabidopsis top hitse value%identityAlignment
AT1G01250.1 Integrase-type DNA-binding superfamily protein1.3e-2661.29Show/hide
Query:  TTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK
        +TT    + GVRKR+WGKWVSEIREP KK+RIWLGS+  PEMAA AYDVAA  L+G  A+LNFP+ ++ LPRP++  P DIQ AA  AA  +K
Subjt:  TTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLK

AT1G33760.1 Integrase-type DNA-binding superfamily protein2.3e-4460.69Show/hide
Query:  AFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACS--A
        ++RG+R+RKWGKWVSEIREPGKKTRIWLGSY+T EMAAAAYD AALHLRG    LNFP+LVD+ PRP SS+ E IQ AAQ AAL  K      G  S  A
Subjt:  AFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACS--A

Query:  VSVGAAPIRVGLSPSQIQAINDSPLDSPTM-WMQMAEALSSEEDSIFLYDHF------DEQWEDVRHQSIWDS
        +  G    RVGLSP QIQAIN+SPLDSP M WMQ  E    EE    LY  F      DE +E  + QSIW+S
Subjt:  VSVGAAPIRVGLSPSQIQAINDSPLDSPTM-WMQMAEALSSEEDSIFLYDHF------DEQWEDVRHQSIWDS

AT1G71450.1 Integrase-type DNA-binding superfamily protein2.1e-3752.41Show/hide
Query:  AAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAAL--YLKRTPDAAGACS
        +A+RGVRKRKWGKWVSEIREPG K RIWLGS++TPEMAA AYDVAA H RG +ARLNFP+L  +LPRPA S+ + I+ A   A L    + T  A    S
Subjt:  AAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAAL--YLKRTPDAAGACS

Query:  AVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD
        + S   AP  V LSP +IQAIN+S L SPT  M      S+ +   F  D     WE  +   +WD
Subjt:  AVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWD

AT4G16750.1 Integrase-type DNA-binding superfamily protein1.5e-2763.74Show/hide
Query:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDA
        FRGVR R+WGKWVSEIREP KK+RIWLG++ TPEMAA A+DVAAL ++G  A LNFP+L   LPRPAS++P+DIQ AA  AA    + P++
Subjt:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDA

AT5G11590.1 Integrase-type DNA-binding superfamily protein1.3e-2659.6Show/hide
Query:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACSAVS
        +RGVR R WGKWVSEIREP KK+RIWLG++ TPEMAA A+DVAAL ++G  A LNFP+L D+ PRP S +P DIQTAA L A +++ T   + + S+ S
Subjt:  FRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSNPEDIQTAAQLAALYLKRTPDAAGACSAVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGGAAGACCTGCACAAGAGGACAAAATCTCCGACGCTCAGATCGAATCTGTCGGGTCACTCAGGTTGAATCCGAGCAGGTCGAACCGTTTAATAACCACCACCTG
CACAGCCGCCTTTAGAGGCGTCCGTAAGCGTAAATGGGGCAAATGGGTTTCCGAGATTCGCGAGCCGGGAAAAAAGACCAGAATCTGGCTCGGCAGCTATGACACCCCAG
AGATGGCTGCTGCAGCCTACGACGTTGCTGCCCTCCACCTCCGCGGCCCCGACGCCCGCCTCAACTTCCCTGACCTCGTCGACGCCCTCCCGAGGCCCGCCAGCTCCAAT
CCCGAAGACATTCAGACCGCTGCCCAGCTCGCCGCATTGTACCTCAAGAGGACCCCCGACGCCGCAGGAGCATGCTCCGCCGTTAGCGTCGGGGCCGCGCCGATTCGGGT
GGGGCTGTCGCCGAGCCAAATTCAGGCCATCAATGACTCGCCGTTGGATTCTCCGACCATGTGGATGCAGATGGCTGAGGCGCTTAGTTCGGAGGAAGACTCCATTTTTT
TATATGATCATTTTGATGAACAGTGGGAAGACGTGCGCCATCAATCCATTTGGGATTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGGAAGACCTGCACAAGAGGACAAAATCTCCGACGCTCAGATCGAATCTGTCGGGTCACTCAGGTTGAATCCGAGCAGGTCGAACCGTTTAATAACCACCACCTG
CACAGCCGCCTTTAGAGGCGTCCGTAAGCGTAAATGGGGCAAATGGGTTTCCGAGATTCGCGAGCCGGGAAAAAAGACCAGAATCTGGCTCGGCAGCTATGACACCCCAG
AGATGGCTGCTGCAGCCTACGACGTTGCTGCCCTCCACCTCCGCGGCCCCGACGCCCGCCTCAACTTCCCTGACCTCGTCGACGCCCTCCCGAGGCCCGCCAGCTCCAAT
CCCGAAGACATTCAGACCGCTGCCCAGCTCGCCGCATTGTACCTCAAGAGGACCCCCGACGCCGCAGGAGCATGCTCCGCCGTTAGCGTCGGGGCCGCGCCGATTCGGGT
GGGGCTGTCGCCGAGCCAAATTCAGGCCATCAATGACTCGCCGTTGGATTCTCCGACCATGTGGATGCAGATGGCTGAGGCGCTTAGTTCGGAGGAAGACTCCATTTTTT
TATATGATCATTTTGATGAACAGTGGGAAGACGTGCGCCATCAATCCATTTGGGATTCTTAA
Protein sequenceShow/hide protein sequence
MRGRPAQEDKISDAQIESVGSLRLNPSRSNRLITTTCTAAFRGVRKRKWGKWVSEIREPGKKTRIWLGSYDTPEMAAAAYDVAALHLRGPDARLNFPDLVDALPRPASSN
PEDIQTAAQLAALYLKRTPDAAGACSAVSVGAAPIRVGLSPSQIQAINDSPLDSPTMWMQMAEALSSEEDSIFLYDHFDEQWEDVRHQSIWDS