CuGenDBv2

MS016242 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS016242
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: ethylene-responsive transcription factor ERF003-like
Genome location: scaffold9_2:1381236..1382007
RNA-Seq Expression: MS016242
Synteny: MS016242
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003677 - DNA binding (molecular function)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
  IPR001471 - AP2/ERF domain
  IPR016177 - DNA-binding domain superfamily
  IPR036955 - AP2/ERF domain superfamily
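
The genome location above can be split into its parts programmatically. The following is a minimal sketch, assuming the `scaffold:start..end` notation uses 1-based inclusive coordinates (an assumption, not stated on this page); `parse_location` is a hypothetical helper name.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

seqid, start, end = parse_location("scaffold9_2:1381236..1382007")
# Span length under the assumed 1-based inclusive convention.
span = end - start + 1
print(seqid, start, end, span)  # scaffold9_2 1381236 1382007 772
```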


Homology

GenBank top hits (e-value, % identity, alignment)
KAG6606826.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.3e-59 | identity: 70.17%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQ +YRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAARIMCCPNNL +NHNAP     S SA  P S LLTPTLI+KL+KCR ASF
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP
        QM KK+ P KTH  D     PSRR     D G    GV  PET          QE  RLEDDHVEQMIQELLDLGSF FCP
Subjt:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP

KAG7036532.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 3.3e-59 | identity: 70.17%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQ +YRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAARIMCCPNNL +NHNAP     S SA  P S LLTPTLI+KL+KCR ASF
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP
        QM KK+ P KTH  D     PSRR     D G    GV  PET          QE  RLEDDHVEQMIQELLDLGSF FCP
Subjt:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP

XP_022153447.1 ethylene-responsive transcription factor ERF003-like, partial [Momordica charantia] | e-value: 2.0e-85 | identity: 96.91%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAARIMCCPNNLSSNHNAPPQYSSSAS  GPVSKLLTPTLIDKLEKCRMASF
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAKKKLPTKTHLYDPSRRPAAKLDGGGVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP
        QMAKKKLPTKTHL D SRRPAAKLDGGGVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP
Subjt:  QMAKKKLPTKTHLYDPSRRPAAKLDGGGVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP

XP_022949239.1 ethylene-responsive transcription factor ERF003-like [Cucurbita moschata] | e-value: 3.3e-59 | identity: 70.17%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQ +YRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAARIMCCPNNL +NHNAP     S SA  P S LLTPTLI+KL+KCR ASF
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP
        QM KK+ P KTH  D     PSRR     D G    GV  PET          QE  RLEDDHVEQMIQELLDLGSF FCP
Subjt:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP

XP_023525763.1 ethylene-responsive transcription factor ERF003-like [Cucurbita pepo subsp. pepo] | e-value: 8.0e-58 | identity: 69.06%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQ +YRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAARIMCCPNNL +NHN+P     S SA  P S LLTPTLI+KL+KCR ASF
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP
        QM KK+ P KTH  D     PSR     L+ G    GV  PET          QE  RLEDDHVEQMIQELLDLGSF FCP
Subjt:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KVJ4 AP2/ERF domain-containing protein | e-value: 1.9e-52 | identity: 67.25%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCC-PNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMAS
        M RPQ +YRGVRQRHWGSWVSEIRHP+LKTRIWLGTFETAEDAARAY+EAA IMCC PNNLS+NH+A   + SSA  S   SKLLTPT I+KL++CRMAS
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCC-PNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMAS

Query:  FQMAKKKLPTKTHLYDPSRRPAAKLDGG--------GVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP
         QM K +   +    DP++ PA KLDGG         V S E +E  RLED+HVEQMIQELLDLGSF FCP
Subjt:  FQMAKKKLPTKTHLYDPSRRPAAKLDGG--------GVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP

A0A1S3BY29 ethylene-responsive transcription factor ERF003-like | e-value: 2.0e-54 | identity: 67.25%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCC-PNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMAS
        M RPQ +YRGVRQRHWGSWVSEIRHP+LKTRIWLGTFETAEDAARAY+EAA IMCC PNNLS+NH+AP       S + P SKLLTPTLI+KL++CRMAS
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCC-PNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMAS

Query:  FQMAKKKLPTKTHLYDPSRRPAAKLDGG--------GVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP
         QM K +   +    DP++ PA KL+GG         V S E +E  RLEDDHVEQMIQELLDLGSF FCP
Subjt:  FQMAKKKLPTKTHLYDPSRRPAAKLDGG--------GVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP

A0A6J1DIX5 ethylene-responsive transcription factor ERF003-like | e-value: 9.8e-86 | identity: 96.91%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAARIMCCPNNLSSNHNAPPQYSSSAS  GPVSKLLTPTLIDKLEKCRMASF
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAKKKLPTKTHLYDPSRRPAAKLDGGGVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP
        QMAKKKLPTKTHL D SRRPAAKLDGGGVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP
Subjt:  QMAKKKLPTKTHLYDPSRRPAAKLDGGGVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP

A0A6J1GC95 ethylene-responsive transcription factor ERF003-like | e-value: 1.6e-59 | identity: 70.17%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQ +YRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAARIMCCPNNL +NHNAP     S SA  P S LLTPTLI+KL+KCR ASF
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP
        QM KK+ P KTH  D     PSRR     D G    GV  PET          QE  RLEDDHVEQMIQELLDLGSF FCP
Subjt:  QMAKKKLPTKTHLYD-----PSRRPAAKLDGG----GVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP

A0A6J1KDM8 ethylene-responsive transcription factor ERF003-like | e-value: 9.6e-57 | identity: 69.06%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQ +YRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAARIMCCPNNL +N NA P +S SA    P S LLTPTLI+KL+KCR ASF
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAKKKLPTKTHLYD-----PSRRPAAKLD----GGGVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP
        QM KK+ P KTH  D     PSR     LD      GV  PET          QE  RLEDDHVEQMIQELLDLGSF FCP
Subjt:  QMAKKKLPTKTHLYD-----PSRRPAAKLD----GGGVQSPET----------QEFHRLEDDHVEQMIQELLDLGSFHFCP

SwissProt top hits (e-value, % identity, alignment)
P16146 Protein PPLZ02 | e-value: 4.0e-28 | identity: 63.46%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF
        MARPQ +YRG RQRHWGSWVSEIRH +LKTRIW GTFE+AEDAARAY+EAAR+MC      +  N P   ++S S+S   SKLL+ TLI KL +C MAS 
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASF

Query:  QMAK
        QM +
Subjt:  QMAK

Q3E958 Ethylene-responsive transcription factor SHINE 3 | e-value: 2.5e-17 | identity: 51.65%
Query:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM
        K+RGVRQR WGSWVSEIRHPLLK R+WLGTF+TAE AARAY++AA +M   N  S+  N P   S+ +++    S L +P  + +L   ++
Subjt:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM

Q94AW5 Ethylene-responsive transcription factor ERF003 | e-value: 3.3e-38 | identity: 51.61%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCP---NNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM
        MARPQ ++RGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAAR+MC P    N   N NA P  S         SKLL+ TL  KL KC M
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCP---NNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM

Query:  ASFQMAKKKLPTKTHLYDPSRRPAAKLDGGGVQSPET----------------------QEFHRLEDDHVEQMIQELLDLGSFHFC
        AS QM K+   T+T     + R +   D  GV + E+                      Q F  LE+DH+EQMI+ELL  GS   C
Subjt:  ASFQMAKKKLPTKTHLYDPSRRPAAKLDGGGVQSPET----------------------QEFHRLEDDHVEQMIQELLDLGSFHFC

Q9LFN7 Ethylene-responsive transcription factor SHINE 2 | e-value: 7.1e-17 | identity: 51.61%
Query:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSAS--ASGPVSKLLTPTLIDKLEKCRM
        K+RGVRQR WGSWVSEIRHPLLK R+WLGTFETAE AARAY++AA +M   N  ++  N P   S   S       S L++P  + +L   ++
Subjt:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSAS--ASGPVSKLLTPTLIDKLEKCRM

Q9XI33 Ethylene-responsive transcription factor WIN1 | e-value: 3.1e-20 | identity: 46.43%
Query:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPN---NLSSNHNAPPQYS--------SSASASGPVSKLLTPTLIDKLEK-
        K+RGVRQRHWGSWV+EIRHPLLK RIWLGTFETAE+AARAY+EAA +M   N   N   N+N   + S        SS  +S   S  L+  L  KL K 
Subjt:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPN---NLSSNHNAPPQYS--------SSASASGPVSKLLTPTLIDKLEK-

Query:  CRMASFQMAKKKLPT-KTHLYDPSRRPAAKLDGGGVQSPE
        C+  S  +   +L T  +H+    +R  +K D   V + E
Subjt:  CRMASFQMAKKKLPT-KTHLYDPSRRPAAKLDGGGVQSPE

Arabidopsis top hits (e-value, % identity, alignment)
AT1G15360.1 Integrase-type DNA-binding superfamily protein | e-value: 2.2e-21 | identity: 46.43%
Query:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPN---NLSSNHNAPPQYS--------SSASASGPVSKLLTPTLIDKLEK-
        K+RGVRQRHWGSWV+EIRHPLLK RIWLGTFETAE+AARAY+EAA +M   N   N   N+N   + S        SS  +S   S  L+  L  KL K 
Subjt:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPN---NLSSNHNAPPQYS--------SSASASGPVSKLLTPTLIDKLEK-

Query:  CRMASFQMAKKKLPT-KTHLYDPSRRPAAKLDGGGVQSPE
        C+  S  +   +L T  +H+    +R  +K D   V + E
Subjt:  CRMASFQMAKKKLPT-KTHLYDPSRRPAAKLDGGGVQSPE

AT5G11190.1 Integrase-type DNA-binding superfamily protein | e-value: 5.1e-18 | identity: 51.61%
Query:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSAS--ASGPVSKLLTPTLIDKLEKCRM
        K+RGVRQR WGSWVSEIRHPLLK R+WLGTFETAE AARAY++AA +M   N  ++  N P   S   S       S L++P  + +L   ++
Subjt:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSAS--ASGPVSKLLTPTLIDKLEKCRM

AT5G25190.1 Integrase-type DNA-binding superfamily protein | e-value: 2.3e-39 | identity: 51.61%
Query:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCP---NNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM
        MARPQ ++RGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAY+EAAR+MC P    N   N NA P  S         SKLL+ TL  KL KC M
Subjt:  MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCP---NNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM

Query:  ASFQMAKKKLPTKTHLYDPSRRPAAKLDGGGVQSPET----------------------QEFHRLEDDHVEQMIQELLDLGSFHFC
        AS QM K+   T+T     + R +   D  GV + E+                      Q F  LE+DH+EQMI+ELL  GS   C
Subjt:  ASFQMAKKKLPTKTHLYDPSRRPAAKLDGGGVQSPET----------------------QEFHRLEDDHVEQMIQELLDLGSFHFC

AT5G25390.1 Integrase-type DNA-binding superfamily protein | e-value: 8.1e-16 | identity: 49.45%
Query:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM
        K+RGVRQR WGSWVSEIRHPLL   +WLGTF+TAE AARAY++AA +M   N  S+  N P   S+ +++    S L +P  + +L   ++
Subjt:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM

AT5G25390.2 Integrase-type DNA-binding superfamily protein | e-value: 1.7e-18 | identity: 51.65%
Query:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM
        K+RGVRQR WGSWVSEIRHPLLK R+WLGTF+TAE AARAY++AA +M   N  S+  N P   S+ +++    S L +P  + +L   ++
Subjt:  KYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRM


Sequences

CDS sequence
ATGGCGAGACCTCAACACAAGTACAGAGGAGTCCGGCAGCGCCATTGGGGCTCTTGGGTTTCTGAGATTCGCCACCCTTTGCTGAAGACAAGGATATGGCTGGGAACTTT
CGAAACGGCCGAAGACGCCGCTCGAGCTTACGAGGAAGCAGCCAGAATAATGTGTTGCCCTAATAATCTCTCCTCCAATCACAATGCACCACCTCAGTACTCCTCGTCCG
CATCGGCGTCGGGGCCGGTGTCGAAGCTTCTCACTCCGACTTTGATCGATAAATTGGAGAAATGTCGAATGGCTTCCTTCCAAATGGCCAAAAAGAAACTCCCCACCAAA
ACGCACCTATACGACCCGAGTCGCCGTCCGGCGGCGAAACTCGACGGCGGAGGAGTTCAATCGCCGGAGACTCAGGAGTTTCACCGGCTTGAAGACGACCACGTAGAGCA
AATGATACAAGAGTTGCTTGATCTTGGCTCCTTTCACTTTTGCCCT
mRNA sequence
ATGGCGAGACCTCAACACAAGTACAGAGGAGTCCGGCAGCGCCATTGGGGCTCTTGGGTTTCTGAGATTCGCCACCCTTTGCTGAAGACAAGGATATGGCTGGGAACTTT
CGAAACGGCCGAAGACGCCGCTCGAGCTTACGAGGAAGCAGCCAGAATAATGTGTTGCCCTAATAATCTCTCCTCCAATCACAATGCACCACCTCAGTACTCCTCGTCCG
CATCGGCGTCGGGGCCGGTGTCGAAGCTTCTCACTCCGACTTTGATCGATAAATTGGAGAAATGTCGAATGGCTTCCTTCCAAATGGCCAAAAAGAAACTCCCCACCAAA
ACGCACCTATACGACCCGAGTCGCCGTCCGGCGGCGAAACTCGACGGCGGAGGAGTTCAATCGCCGGAGACTCAGGAGTTTCACCGGCTTGAAGACGACCACGTAGAGCA
AATGATACAAGAGTTGCTTGATCTTGGCTCCTTTCACTTTTGCCCT
Protein sequence
MARPQHKYRGVRQRHWGSWVSEIRHPLLKTRIWLGTFETAEDAARAYEEAARIMCCPNNLSSNHNAPPQYSSSASASGPVSKLLTPTLIDKLEKCRMASFQMAKKKLPTK
THLYDPSRRPAAKLDGGGVQSPETQEFHRLEDDHVEQMIQELLDLGSFHFCP
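
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper name, and only the first 30 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper().replace("\n", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 30 nt of the CDS listed above.
prefix = "ATGGCGAGACCTCAACACAAGTACAGAGGA"
print(translate(prefix))  # MARPQHKYRG
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the listed protein sequence would extend the same check to the whole record.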