CuGenDBv2

MS009944 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS009944
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Homeobox-leucine zipper protein family
Genome location: scaffold943_1:214375..216353
RNA-Seq Expression: MS009944
Synteny: MS009944
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
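
The genome location above is a `scaffold:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated in the record), it can be split into its parts and the gene span computed as follows. The function name `parse_location` is hypothetical.

```python
import re


def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, span_length).

    Assumes 1-based, inclusive coordinates (not confirmed by the record).
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    # Inclusive coordinates: both endpoints count toward the span.
    return seqid, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("scaffold943_1:214375..216353"))
```

Under that assumption the gene spans 1979 bp on scaffold943_1.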


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004143373.2 homeobox-leucine zipper protein ATHB-22 isoform X1 [Cucumis sativus]; e-value 3.3e-54; identity 75.43%
Query:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII
        +AG+G D  KK+RLS DQLE LERSF EEVKLDPDRK+KLSK+LGLQPRQIAVWFQNRRARWK KQLEHLYD LKQ+FDTISKEK+NLQQEVMKLRSM+ 
Subjt:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII

Query:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP
        EQ  RNQGS A  DVSGEE TVE TSV   S CNNY+YNV EDFNQI        SASA P +WGA A HLPSYP
Subjt:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP

XP_022143896.1 putative homeobox-leucine zipper protein ATHB-51 [Momordica charantia]; e-value 6.8e-84; identity 98.22%
Query:  VAGSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQ
        VAG G DKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQ
Subjt:  VAGSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQ

Query:  AARNQGSGADVSGEETVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGAAHHLPSYP
        AARNQGSGADVSGEETVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA HHLPSYP
Subjt:  AARNQGSGADVSGEETVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGAAHHLPSYP

XP_031742621.1 homeobox-leucine zipper protein ATHB-22 isoform X2 [Cucumis sativus]; e-value 3.3e-54; identity 75.43%
Query:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII
        +AG+G D  KK+RLS DQLE LERSF EEVKLDPDRK+KLSK+LGLQPRQIAVWFQNRRARWK KQLEHLYD LKQ+FDTISKEK+NLQQEVMKLRSM+ 
Subjt:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII

Query:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP
        EQ  RNQGS A  DVSGEE TVE TSV   S CNNY+YNV EDFNQI        SASA P +WGA A HLPSYP
Subjt:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP

XP_031742622.1 putative homeobox-leucine zipper protein ATHB-51 isoform X3 [Cucumis sativus]; e-value 3.3e-54; identity 75.43%
Query:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII
        +AG+G D  KK+RLS DQLE LERSF EEVKLDPDRK+KLSK+LGLQPRQIAVWFQNRRARWK KQLEHLYD LKQ+FDTISKEK+NLQQEVMKLRSM+ 
Subjt:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII

Query:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP
        EQ  RNQGS A  DVSGEE TVE TSV   S CNNY+YNV EDFNQI        SASA P +WGA A HLPSYP
Subjt:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP

XP_031742623.1 putative homeobox-leucine zipper protein ATHB-51 isoform X4 [Cucumis sativus]; e-value 3.3e-54; identity 75.43%
Query:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII
        +AG+G D  KK+RLS DQLE LERSF EEVKLDPDRK+KLSK+LGLQPRQIAVWFQNRRARWK KQLEHLYD LKQ+FDTISKEK+NLQQEVMKLRSM+ 
Subjt:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII

Query:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP
        EQ  RNQGS A  DVSGEE TVE TSV   S CNNY+YNV EDFNQI        SASA P +WGA A HLPSYP
Subjt:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KET9 Homeobox domain-containing protein; e-value 1.6e-54; identity 75.43%
Query:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII
        +AG+G D  KK+RLS DQLE LERSF EEVKLDPDRK+KLSK+LGLQPRQIAVWFQNRRARWK KQLEHLYD LKQ+FDTISKEK+NLQQEVMKLRSM+ 
Subjt:  VAGSGAD--KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMII

Query:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP
        EQ  RNQGS A  DVSGEE TVE TSV   S CNNY+YNV EDFNQI        SASA P +WGA A HLPSYP
Subjt:  EQAARNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP

A0A1S3CIQ6 homeobox-leucine zipper protein ATHB-22-like; e-value 2.2e-51; identity 73.1%
Query:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAA
        G    KK+RLS DQLE LERSF EEVKLDPDRK+KLS +LGLQPRQIAVWFQNRRARWK KQLEHLYD LKQ+FDTISKEK+NLQQEVMKLR+M+ EQ  
Subjt:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAA

Query:  RNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP
        RN GS A  D+SGEE TVE TSV   S CNNY+YNV EDFNQI        SASA P +WGA A HLPSYP
Subjt:  RNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP

A0A5A7SJ21 Homeobox-leucine zipper protein ATHB-22-like; e-value 2.2e-51; identity 73.1%
Query:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAA
        G    KK+RLS DQLE LERSF EEVKLDPDRK+KLS +LGLQPRQIAVWFQNRRARWK KQLEHLYD LKQ+FDTISKEK+NLQQEVMKLR+M+ EQ  
Subjt:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAA

Query:  RNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP
        RN GS A  D+SGEE TVE TSV   S CNNY+YNV EDFNQI        SASA P +WGA A HLPSYP
Subjt:  RNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP

A0A5D3CAN1 Homeobox-leucine zipper protein ATHB-22-like; e-value 8.2e-51; identity 72.51%
Query:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAA
        G    KK+RLS DQLE LERSF EEVKLDPDRK+KLS +LGLQPRQIAVWFQNRRARWK KQLEHLYD LKQ+FDTISKEK+NLQQEVMKLR+M+ EQ  
Subjt:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAA

Query:  RNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP
        RN GS A  D+SGEE TVE TSV   S CNNY+YNV EDF+QI        SASA P +WGA A HLPSYP
Subjt:  RNQGSGA--DVSGEE-TVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA-AHHLPSYP

A0A6J1CRP2 putative homeobox-leucine zipper protein ATHB-51; e-value 3.3e-84; identity 98.22%
Query:  VAGSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQ
        VAG G DKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQ
Subjt:  VAGSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQ

Query:  AARNQGSGADVSGEETVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGAAHHLPSYP
        AARNQGSGADVSGEETVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGA HHLPSYP
Subjt:  AARNQGSGADVSGEETVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGAAHHLPSYP

SwissProt top hits (e-value, % identity, alignment)
Q4PSR7 Homeobox-leucine zipper protein ATHB-22; e-value 1.1e-28; identity 61.17%
Query:  KKRRLSNDQLEVLERSFGEEV--------KLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIE
        KK++++++QL+ LERSF EE+        KL+PDRK+KLSK+LGLQPRQIAVWFQNR+ARWK KQLEHLY++L+QEFD +S+EK  LQ+E+++L+SMI E
Subjt:  KKRRLSNDQLEVLERSFGEEV--------KLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIE

Query:  QAA
         ++
Subjt:  QAA

Q6K498 Homeobox-leucine zipper protein HOX4; e-value 2.0e-22; identity 50.47%
Query:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLE-------HLYDALKQEFDTISKEKNNLQQEVMKLRS
        G G +KKRRLS +Q+  LERSF  E KL+P+RK +L++ LGLQPRQ+AVWFQNRRARWK KQLE       H YD+L+ + D + ++K+ L  E+ +L++
Subjt:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLE-------HLYDALKQEFDTISKEKNNLQQEVMKLRS

Query:  MIIEQAA
         + ++ A
Subjt:  MIIEQAA

Q8LC03 Homeobox-leucine zipper protein ATHB-13; e-value 1.0e-21; identity 50%
Query:  DKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQG
        +KKRRL+ +Q++ LE++F    KL+P+RK++L++ LGLQPRQIA+WFQNRRARWK KQLE  YD LK++FDT+  E + LQ    KL++ I+    R Q 
Subjt:  DKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQG

Query:  SGADVSGE
           +++ E
Subjt:  SGADVSGE

Q9LZR0 Putative homeobox-leucine zipper protein ATHB-51; e-value 4.1e-31; identity 63.03%
Query:  KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQGS
        KK+RL++ QL  LERSF EE+KLD DRK+KLS++LGLQPRQIAVWFQNRRARWKAKQLE LYD+L+QE+D +S+EK  L  EV KLR+++ +Q    +  
Subjt:  KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQGS

Query:  GA---DVSGEE-TVESTSV
         A    VSGEE TVE +SV
Subjt:  GA---DVSGEE-TVESTSV

Q9XH37 Homeobox-leucine zipper protein HOX4; e-value 2.0e-22; identity 50.47%
Query:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLE-------HLYDALKQEFDTISKEKNNLQQEVMKLRS
        G G +KKRRLS +Q+  LERSF  E KL+P+RK +L++ LGLQPRQ+AVWFQNRRARWK KQLE       H YD+L+ + D + ++K+ L  E+ +L++
Subjt:  GSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLE-------HLYDALKQEFDTISKEKNNLQQEVMKLRS

Query:  MIIEQAA
         + ++ A
Subjt:  MIIEQAA

Arabidopsis top hits (e-value, % identity, alignment)
AT1G69780.1 Homeobox-leucine zipper protein family; e-value 7.1e-23; identity 50%
Query:  DKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQG
        +KKRRL+ +Q++ LE++F    KL+P+RK++L++ LGLQPRQIA+WFQNRRARWK KQLE  YD LK++FDT+  E + LQ    KL++ I+    R Q 
Subjt:  DKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQG

Query:  SGADVSGE
           +++ E
Subjt:  SGADVSGE

AT2G22430.1 homeobox protein 6; e-value 7.1e-23; identity 45.53%
Query:  ADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEH-------LYDALKQEFDTISKEKNNLQQEVMKLRSMII
        ++KKRRLS +Q++ LE++F  E KL+P+RK+KL+++LGLQPRQ+AVWFQNRRARWK KQLE         YD+L+  FD++ ++  +L QE+ KL++ + 
Subjt:  ADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEH-------LYDALKQEFDTISKEKNNLQQEVMKLRSMII

Query:  EQAARNQGSGADVSGEETVESTS
             N G G +   E     T+
Subjt:  EQAARNQGSGADVSGEETVESTS

AT2G36610.1 homeobox protein 22; e-value 7.9e-30; identity 61.17%
Query:  KKRRLSNDQLEVLERSFGEEV--------KLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIE
        KK++++++QL+ LERSF EE+        KL+PDRK+KLSK+LGLQPRQIAVWFQNR+ARWK KQLEHLY++L+QEFD +S+EK  LQ+E+++L+SMI E
Subjt:  KKRRLSNDQLEVLERSFGEEV--------KLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIE

Query:  QAA
         ++
Subjt:  QAA

AT5G03790.1 homeobox 51; e-value 2.9e-32; identity 63.03%
Query:  KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQGS
        KK+RL++ QL  LERSF EE+KLD DRK+KLS++LGLQPRQIAVWFQNRRARWKAKQLE LYD+L+QE+D +S+EK  L  EV KLR+++ +Q    +  
Subjt:  KKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQGS

Query:  GA---DVSGEE-TVESTSV
         A    VSGEE TVE +SV
Subjt:  GA---DVSGEE-TVESTSV

AT5G65310.1 homeobox protein 5; e-value 7.1e-23; identity 49.02%
Query:  AGSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQA
        + + A+KKRRL  +Q++ LE++F  + KL+P+RK+KL+++LGLQPRQ+A+WFQNRRARWK KQLE  Y  LK  FD + + +++LQ++   L   I E  
Subjt:  AGSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQA

Query:  AR
        A+
Subjt:  AR


Sequences
CDS sequence
GTGGCGGGCAGTGGAGCGGATAAGAAGAGGAGGTTAAGTAACGATCAGTTGGAGGTATTGGAGAGAAGCTTTGGAGAGGAGGTGAAGCTTGATCCTGATAGGAAAATAAA
GCTGTCCAAGCAACTTGGGCTTCAACCAAGGCAAATTGCTGTGTGGTTTCAGAATAGAAGAGCTCGCTGGAAGGCCAAGCAACTTGAGCATCTCTATGATGCCCTCAAAC
AAGAGTTTGATACCATTTCTAAGGAAAAAAACAACCTTCAACAAGAGGTGATGAAACTGAGAAGCATGATAATAGAACAAGCGGCGAGGAATCAAGGGTCGGGGGCGGAT
GTGTCGGGCGAAGAGACGGTGGAGAGCACGTCGGTAGGGGGTCGGAGCTGCTGCAACAATTACATATACAATGTGGCCGAAGATTTCAACCAAATATCAGCATCAGCATC
AGCATCAGCTTCAGCTTCAGCTTCTCCATTATTCTGGGGCGCTGCTCACCACCTGCCTTCTTATCCT
mRNA sequence
GTGGCGGGCAGTGGAGCGGATAAGAAGAGGAGGTTAAGTAACGATCAGTTGGAGGTATTGGAGAGAAGCTTTGGAGAGGAGGTGAAGCTTGATCCTGATAGGAAAATAAA
GCTGTCCAAGCAACTTGGGCTTCAACCAAGGCAAATTGCTGTGTGGTTTCAGAATAGAAGAGCTCGCTGGAAGGCCAAGCAACTTGAGCATCTCTATGATGCCCTCAAAC
AAGAGTTTGATACCATTTCTAAGGAAAAAAACAACCTTCAACAAGAGGTGATGAAACTGAGAAGCATGATAATAGAACAAGCGGCGAGGAATCAAGGGTCGGGGGCGGAT
GTGTCGGGCGAAGAGACGGTGGAGAGCACGTCGGTAGGGGGTCGGAGCTGCTGCAACAATTACATATACAATGTGGCCGAAGATTTCAACCAAATATCAGCATCAGCATC
AGCATCAGCTTCAGCTTCAGCTTCTCCATTATTCTGGGGCGCTGCTCACCACCTGCCTTCTTATCCT
Protein sequence
VAGSGADKKRRLSNDQLEVLERSFGEEVKLDPDRKIKLSKQLGLQPRQIAVWFQNRRARWKAKQLEHLYDALKQEFDTISKEKNNLQQEVMKLRSMIIEQAARNQGSGAD
VSGEETVESTSVGGRSCCNNYIYNVAEDFNQISASASASASASASPLFWGAAHHLPSYP
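
The CDS listed above is 507 nt and the protein is 169 residues, which matches a codon-by-codon translation in frame 1. A minimal sketch of that check, assuming the standard genetic code (the record does not name a translation table). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the record was translated with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


if __name__ == "__main__":
    # First 30 nt of the CDS from this record.
    print(translate("GTGGCGGGCAGTGGAGCGGATAAGAAGAGG"))
```

Passing the full CDS to `translate` and comparing the result with the protein sequence gives a quick consistency check. As listed, the CDS begins with GTG (valine) instead of ATG and ends with CCT, so it contains no start methionine and no terminal stop codon.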