CuGenDBv2

HG10008811 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10008811
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Ethylene-responsive transcription factor
Genome location: Chr10:26237191..26237814
RNA-Seq Expression: HG10008811
Synteny: HG10008811
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0009873 - ethylene-activated signaling pathway (biological process)
  GO:0003677 - DNA binding (molecular function)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
  IPR001471 - AP2/ERF domain
  IPR016177 - DNA-binding domain superfamily
  IPR036955 - AP2/ERF domain superfamily
  IPR044808 - Ethylene-responsive transcription factor
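The genome location string can be turned into a span length with a few lines of Python. This is a minimal sketch, not part of the database: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr10:26237191..26237814")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 624 bp, which equals the length of the coding sequence listed for this gene (207 residues plus a stop codon, times three).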


Homology
GenBank top hits (e-value, % identity, alignment)
EOY22656.1 Integrase-type DNA-binding superfamily protein, putative [Theobroma cacao] | e-value 2.1e-39 | identity 63.46%
Query:  KINKVDPKRGKRPLCSDES---EEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQ-PADHEVQVKREIRHYRG
        K  KVDP++GKRPL  DES   EE++ FP+YSARS+ DM+AMV AL +VI   + S ++ +       D  P  QS +  QQ  +  +  V+R  RHYRG
Subjt:  KINKVDPKRGKRPLCSDES---EEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQ-PADHEVQVKREIRHYRG

Query:  VRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER
        VRQRPWGKWAAEIRDPKKAARVWLGTF+TAEAAALAYDEAALRFKG+KAKLNFPER
Subjt:  VRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER

KGN63431.1 hypothetical protein Csa_013280 [Cucumis sativus] | e-value 1.9e-56 | identity 71.82%
Query:  KVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPA-----DHEVQVKREIRHYRGVR
        KVDPKRGKR LCSDESEEENPFPIYSARSEYD SAMVSAL +VI SGSGSGS               S+S S  ++PA     D+E  VKRE RHYRGVR
Subjt:  KVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPA-----DHEVQVKREIRHYRGVR

Query:  QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQD
        QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER TTPPSY Y           +  HHQD
Subjt:  QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQD

RDX89871.1 Ethylene-responsive transcription factor ERF114, partial [Mucuna pruriens] | e-value 2.1e-39 | identity 56.74%
Query:  KVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWG
        KVD K GKR L  +E EEENPFP+YS RS+ DMSA+VSAL +VI    G+   + P  V  +  +      ++  QP  H+   +RE   YRGVRQRPWG
Subjt:  KVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWG

Query:  KWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQDHK
        KWAAEIRDPKKAARVWLGTF TAEAAALAYDEAALRFKG+KAKLNFPER    P + Y        L + T H+ +H+
Subjt:  KWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQDHK

XP_022921378.1 ethylene-responsive transcription factor ERF114-like, partial [Cucurbita moschata] | e-value 3.4e-53 | identity 68.25%
Query:  ARVVKINKVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQ--SQSQSQQPADHEVQVKREIRHY
        A+  K  KVDP RGKRP  SDE EEENPFPIYSARS++D SAMVSAL +VI   SGSGSSRSP     ADY P+SQ  +  + Q        V RE RHY
Subjt:  ARVVKINKVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQ--SQSQSQQPADHEVQVKREIRHY

Query:  RGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTT---PPSYAYHHPTFILNLHSSTSHHQDH
        RGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER TT   PP YAYHH            HH +H
Subjt:  RGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTT---PPSYAYHHPTFILNLHSSTSHHQDH

XP_038879952.1 ethylene-responsive transcription factor ERF114-like [Benincasa hispida] | e-value 2.8e-55 | identity 85.92%
Query:  MSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDE
        MSAMV ALGEVIR  SGSGSSRSP  V+A    P    QSQSQQP DHE QVKRE RHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDE
Subjt:  MSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDE

Query:  AALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHH
        AALRFKGTKAKLNFPER TTPPSYAYHHPT++LNLHSSTSHH
Subjt:  AALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHH

TrEMBL top hits (e-value, % identity, alignment)
A0A061FZL7 Integrase-type DNA-binding superfamily protein, putative | e-value 1.0e-39 | identity 63.46%
Query:  KINKVDPKRGKRPLCSDES---EEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQ-PADHEVQVKREIRHYRG
        K  KVDP++GKRPL  DES   EE++ FP+YSARS+ DM+AMV AL +VI   + S ++ +       D  P  QS +  QQ  +  +  V+R  RHYRG
Subjt:  KINKVDPKRGKRPLCSDES---EEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQ-PADHEVQVKREIRHYRG

Query:  VRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER
        VRQRPWGKWAAEIRDPKKAARVWLGTF+TAEAAALAYDEAALRFKG+KAKLNFPER
Subjt:  VRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER

A0A0A0LTV8 AP2/ERF domain-containing protein | e-value 9.3e-57 | identity 71.82%
Query:  KVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPA-----DHEVQVKREIRHYRGVR
        KVDPKRGKR LCSDESEEENPFPIYSARSEYD SAMVSAL +VI SGSGSGS               S+S S  ++PA     D+E  VKRE RHYRGVR
Subjt:  KVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPA-----DHEVQVKREIRHYRGVR

Query:  QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQD
        QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER TTPPSY Y           +  HHQD
Subjt:  QRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQD

A0A371GH55 Ethylene-responsive transcription factor ERF114 (Fragment) | e-value 1.0e-39 | identity 56.74%
Query:  KVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWG
        KVD K GKR L  +E EEENPFP+YS RS+ DMSA+VSAL +VI    G+   + P  V  +  +      ++  QP  H+   +RE   YRGVRQRPWG
Subjt:  KVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWG

Query:  KWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQDHK
        KWAAEIRDPKKAARVWLGTF TAEAAALAYDEAALRFKG+KAKLNFPER    P + Y        L + T H+ +H+
Subjt:  KWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQDHK

A0A6J1E3R5 ethylene-responsive transcription factor ERF114-like | e-value 1.7e-53 | identity 68.25%
Query:  ARVVKINKVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQ--SQSQSQQPADHEVQVKREIRHY
        A+  K  KVDP RGKRP  SDE EEENPFPIYSARS++D SAMVSAL +VI   SGSGSSRSP     ADY P+SQ  +  + Q        V RE RHY
Subjt:  ARVVKINKVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQ--SQSQSQQPADHEVQVKREIRHY

Query:  RGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTT---PPSYAYHHPTFILNLHSSTSHHQDH
        RGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER TT   PP YAYHH            HH +H
Subjt:  RGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTT---PPSYAYHHPTFILNLHSSTSHHQDH

A0A6J1JGC2 ethylene-responsive transcription factor ERF115-like | e-value 3.3e-38 | identity 68.03%
Query:  MVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQ--SQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEA
        MVSAL +VI SGSGSGSSRSP     ADY P+SQ  +  + Q        V RE RHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEA
Subjt:  MVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQ--SQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEA

Query:  ALRFKGTKAKLNFPERFTT---PPSYAYHHPTFILNLHSSTSHHQDH
        A RFKGTKAKLNFPER TT   PP YAYHH            HH +H
Subjt:  ALRFKGTKAKLNFPERFTT---PPSYAYHHPTFILNLHSSTSHHQDH

SwissProt top hits (e-value, % identity, alignment)
P93007 Ethylene-responsive transcription factor ERF112 | e-value 2.8e-26 | identity 48.75%
Query:  GKRPLCSD----ESEEENPFPIYSARSEYDMSAMVSAL-GEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGK
        GKRPL  +      EE+      S  SE D+S  VS L G+ I           P+++D        Q +S S+Q            R+YRGVRQRPWGK
Subjt:  GKRPLCSD----ESEEENPFPIYSARSEYDMSAMVSAL-GEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGK

Query:  WAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHP
        WAAEIRDP KAARVWLGTFDTAE AALAYD+AA  F+G KAKLNFPE     P+  Y  P
Subjt:  WAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHP

Q70II3 Ethylene-responsive transcription factor ERF110 | e-value 7.4e-27 | identity 52.11%
Query:  MSAMVSALGEVI------------RSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEV------QVKREIRHYRGVRQRPWGKWAAEIRDPKKAAR
        MSAMVSAL +V+             S S +G  R    +D+A           S  P +  +      + + + R YRGVRQRPWGKWAAEIRDP +AAR
Subjt:  MSAMVSALGEVI------------RSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEV------QVKREIRHYRGVRQRPWGKWAAEIRDPKKAAR

Query:  VWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPE--RFTTPP
        VWLGTFDTAEAAA AYDEAALRF+G KAKLNFPE  R   PP
Subjt:  VWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPE--RFTTPP

Q9FH54 Ethylene-responsive transcription factor ERF114 | e-value 3.2e-38 | identity 59.38%
Query:  GKRPLCSDESEE----ENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAAD--YLPESQSQSQSQQPA-DHEVQVKREIRHYRGVRQRPW
        GKRP   DESEE    EN FP++SARS++DM  MVSAL +VI    G+  S+S   + + D  Y      Q  +QQ A  H+ Q     RHYRGVRQRPW
Subjt:  GKRPLCSDESEE----ENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAAD--YLPESQSQSQSQQPA-DHEVQVKREIRHYRGVRQRPW

Query:  GKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYH
        GKWAAEIRDPKKAARVWLGTF+TAE+AALAYDEAAL+FKG+KAKLNFPER     +  Y+
Subjt:  GKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYH

Q9LY29 Ethylene-responsive transcription factor ERF115 | e-value 7.9e-37 | identity 55.95%
Query:  GKRPLCSDESEE-------ENPFPIYSARSEYDMSAMVSALGEVI--RSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQR
        GKRP   DES+E       EN FP +SARS+YDM AMVSAL +VI  +S S   +   P   +  D  P +        P   +  + R+ RHYRGVRQR
Subjt:  GKRPLCSDESEE-------ENPFPIYSARSEYDMSAMVSALGEVI--RSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQR

Query:  PWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER---------FTTPPSY
        PWGKWAAEIRDP+KAARVWLGTF+TAEAAALAYD AAL+FKG+KAKLNFPER          T PP+Y
Subjt:  PWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER---------FTTPPSY

Q9LYU3 Ethylene-responsive transcription factor ERF113 | e-value 4.4e-27 | identity 60.17%
Query:  MVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAAL
        MVSAL  VI + +                 P  Q   +S Q    + Q +R  RHYRGVRQRPWGKWAAEIRDPKKAARVWLGTF+TAE AALAYD AAL
Subjt:  MVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAAL

Query:  RFKGTKAKLNFPERFTTP
        +FKGTKAKLNFPER   P
Subjt:  RFKGTKAKLNFPERFTTP

Arabidopsis top hits (e-value, % identity, alignment)
AT2G33710.1 Integrase-type DNA-binding superfamily protein | e-value 2.0e-27 | identity 48.75%
Query:  GKRPLCSD----ESEEENPFPIYSARSEYDMSAMVSAL-GEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGK
        GKRPL  +      EE+      S  SE D+S  VS L G+ I           P+++D        Q +S S+Q            R+YRGVRQRPWGK
Subjt:  GKRPLCSD----ESEEENPFPIYSARSEYDMSAMVSAL-GEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGK

Query:  WAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHP
        WAAEIRDP KAARVWLGTFDTAE AALAYD+AA  F+G KAKLNFPE     P+  Y  P
Subjt:  WAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHP

AT5G07310.1 Integrase-type DNA-binding superfamily protein | e-value 5.6e-38 | identity 55.95%
Query:  GKRPLCSDESEE-------ENPFPIYSARSEYDMSAMVSALGEVI--RSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQR
        GKRP   DES+E       EN FP +SARS+YDM AMVSAL +VI  +S S   +   P   +  D  P +        P   +  + R+ RHYRGVRQR
Subjt:  GKRPLCSDESEE-------ENPFPIYSARSEYDMSAMVSALGEVI--RSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQR

Query:  PWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER---------FTTPPSY
        PWGKWAAEIRDP+KAARVWLGTF+TAEAAALAYD AAL+FKG+KAKLNFPER          T PP+Y
Subjt:  PWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPER---------FTTPPSY

AT5G13330.1 related to AP2 6l | e-value 3.1e-28 | identity 60.17%
Query:  MVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAAL
        MVSAL  VI + +                 P  Q   +S Q    + Q +R  RHYRGVRQRPWGKWAAEIRDPKKAARVWLGTF+TAE AALAYD AAL
Subjt:  MVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAAL

Query:  RFKGTKAKLNFPERFTTP
        +FKGTKAKLNFPER   P
Subjt:  RFKGTKAKLNFPERFTTP

AT5G50080.1 ethylene response factor 110 | e-value 5.3e-28 | identity 52.11%
Query:  MSAMVSALGEVI------------RSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEV------QVKREIRHYRGVRQRPWGKWAAEIRDPKKAAR
        MSAMVSAL +V+             S S +G  R    +D+A           S  P +  +      + + + R YRGVRQRPWGKWAAEIRDP +AAR
Subjt:  MSAMVSALGEVI------------RSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEV------QVKREIRHYRGVRQRPWGKWAAEIRDPKKAAR

Query:  VWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPE--RFTTPP
        VWLGTFDTAEAAA AYDEAALRF+G KAKLNFPE  R   PP
Subjt:  VWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPE--RFTTPP

AT5G61890.1 Integrase-type DNA-binding superfamily protein | e-value 2.3e-39 | identity 59.38%
Query:  GKRPLCSDESEE----ENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAAD--YLPESQSQSQSQQPA-DHEVQVKREIRHYRGVRQRPW
        GKRP   DESEE    EN FP++SARS++DM  MVSAL +VI    G+  S+S   + + D  Y      Q  +QQ A  H+ Q     RHYRGVRQRPW
Subjt:  GKRPLCSDESEE----ENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAAD--YLPESQSQSQSQQPA-DHEVQVKREIRHYRGVRQRPW

Query:  GKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYH
        GKWAAEIRDPKKAARVWLGTF+TAE+AALAYDEAAL+FKG+KAKLNFPER     +  Y+
Subjt:  GKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYH
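The % identity reported for each hit can be recomputed from the match line printed between `Query:` and `Subjt:`. The sketch below uses the AT5G13330.1 alignment from the Arabidopsis hits. It assumes the match line follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); the function name `identity_stats` is hypothetical and not part of any database tool.

```python
# Query and match lines copied from the two blocks of the AT5G13330.1 hit.
query_blocks = [
    "MVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEVQVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAAL",
    "RFKGTKAKLNFPERFTTP",
]
match_blocks = [
    "MVSAL  VI + +                 P  Q   +S Q    + Q +R  RHYRGVRQRPWGKWAAEIRDPKKAARVWLGTF+TAE AALAYD AAL",
    "+FKGTKAKLNFPER   P",
]


def identity_stats(queries: list[str], matches: list[str]) -> tuple[int, int, int]:
    """Return (identities, positives, aligned length) for one hit."""
    # Alignment length is taken from the query lines, so it does not
    # depend on how whitespace in the match line survived copying.
    aligned_length = sum(len(q) for q in queries)
    # Letters on the match line mark identical residues.
    identities = sum(ch.isalpha() for m in matches for ch in m)
    # '+' marks conservative substitutions; positives include identities.
    positives = identities + sum(m.count("+") for m in matches)
    return identities, positives, aligned_length


identities, positives, aligned_length = identity_stats(query_blocks, match_blocks)
percent_identity = round(100 * identities / aligned_length, 2)
print(identities, positives, aligned_length, percent_identity)
```

For this hit the count is 71 identical positions over 118 aligned columns, which rounds to the 60.17 listed for AT5G13330.1.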


Sequences
CDS sequence
ATGTTGGTAAAGAAAACGGGTATCCATGCTGGTTTGAAAGGGAGGGAAGGTGTAATAACCAAAGCAAGGGTGGTGAAAATTAACAAAGTGGATCCAAAGCGAGGGAAGAG
ACCCCTTTGTTCCGATGAATCGGAGGAAGAAAATCCGTTCCCGATCTACTCAGCTAGATCTGAATATGACATGTCAGCCATGGTTTCTGCGTTAGGTGAAGTGATAAGGA
GTGGGAGTGGAAGTGGGAGTAGCAGAAGCCCAGCAGCAGTAGATGCTGCAGATTATTTGCCGGAATCGCAATCGCAATCGCAGTCGCAGCAACCAGCTGATCATGAAGTT
CAAGTGAAAAGAGAAATTCGACATTATCGAGGAGTAAGACAGAGACCGTGGGGAAAATGGGCAGCTGAGATTCGTGATCCAAAAAAGGCAGCACGAGTATGGCTAGGCAC
CTTCGACACTGCTGAGGCTGCGGCCCTTGCTTATGATGAAGCTGCCCTTAGATTCAAAGGAACAAAAGCCAAGCTCAACTTCCCTGAGAGATTCACCACTCCACCTTCCT
ATGCCTATCATCATCCCACCTTTATCCTCAATCTCCACTCCTCAACTTCTCATCATCAGGATCACAAAAATTAA
mRNA sequence
ATGTTGGTAAAGAAAACGGGTATCCATGCTGGTTTGAAAGGGAGGGAAGGTGTAATAACCAAAGCAAGGGTGGTGAAAATTAACAAAGTGGATCCAAAGCGAGGGAAGAG
ACCCCTTTGTTCCGATGAATCGGAGGAAGAAAATCCGTTCCCGATCTACTCAGCTAGATCTGAATATGACATGTCAGCCATGGTTTCTGCGTTAGGTGAAGTGATAAGGA
GTGGGAGTGGAAGTGGGAGTAGCAGAAGCCCAGCAGCAGTAGATGCTGCAGATTATTTGCCGGAATCGCAATCGCAATCGCAGTCGCAGCAACCAGCTGATCATGAAGTT
CAAGTGAAAAGAGAAATTCGACATTATCGAGGAGTAAGACAGAGACCGTGGGGAAAATGGGCAGCTGAGATTCGTGATCCAAAAAAGGCAGCACGAGTATGGCTAGGCAC
CTTCGACACTGCTGAGGCTGCGGCCCTTGCTTATGATGAAGCTGCCCTTAGATTCAAAGGAACAAAAGCCAAGCTCAACTTCCCTGAGAGATTCACCACTCCACCTTCCT
ATGCCTATCATCATCCCACCTTTATCCTCAATCTCCACTCCTCAACTTCTCATCATCAGGATCACAAAAATTAA
Protein sequence
MLVKKTGIHAGLKGREGVITKARVVKINKVDPKRGKRPLCSDESEEENPFPIYSARSEYDMSAMVSALGEVIRSGSGSGSSRSPAAVDAADYLPESQSQSQSQQPADHEV
QVKREIRHYRGVRQRPWGKWAAEIRDPKKAARVWLGTFDTAEAAALAYDEAALRFKGTKAKLNFPERFTTPPSYAYHHPTFILNLHSSTSHHQDHKN
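The relationship between the CDS and protein sequences can be checked by translating the CDS with the standard genetic code. The following is a minimal sketch: the codon table is built from the standard NCBI translation table 1 layout, the helper name `translate` is hypothetical, and only the first 60 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G and the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 60 nucleotides of the CDS sequence listed above.
cds_start = "ATGTTGGTAAAGAAAACGGGTATCCATGCTGGTTTGAAAGGGAGGGAAGGTGTAATAACC"
protein_start = translate(cds_start)
print(protein_start)
```

The output corresponds to the first 20 residues of the protein sequence (`MLVKKTGIHAGLKGREGVIT`). Passing the full CDS would also translate the terminal `TAA` stop codon as `*`, which the listed protein sequence omits.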