; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0509 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0509
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionnucleoid-associated protein At4g30620, chloroplastic-like
Genome locationMC06:4187406..4192940
RNA-Seq ExpressionMC06g0509
SyntenyMC06g0509
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR004401 - Nucleoid-associated protein YbaB/EbfC family
IPR036894 - Nucleoid-associated protein YbaB-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587818.1 Nucleoid-associated protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.38e-11186.7Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTIS+SAQ+PN+RGISDWK+RSNLNSMS I+G RI+ +GPWKV KN RS CV GLFG KKENEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RM+DLAQSLGMPQGL+EGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

KAG7023441.1 Nucleoid-associated protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]1.38e-11188.3Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISLSAQ+PN+RGISD K+RSNLNSMSNIVG RI+ +GPWKV KN+RS CV GLFG KKEN EKSDDAPSKAGIFGNMQNLYETV+ AQ VVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RM+DLAQSLGMPQGL EGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

XP_022135106.1 nucleoid-associated protein At4g30620, chloroplastic-like [Momordica charantia]4.92e-130100Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

XP_022932159.1 nucleoid-associated protein At4g30620, chloroplastic-like isoform X1 [Cucurbita moschata]1.68e-11287.23Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTIS+SAQ+PN+RGISDWK+RSNLNSMS I+G RI+ +GPWKV KNNRS CV GLFG KKENEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RM+DLAQSLGMPQGL+EGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

XP_038879413.1 nucleoid-associated protein At4g30620, chloroplastic isoform X2 [Benincasa hispida]3.39e-11287.23Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISLSAQLPN+RG+SD+K+RSNLNS+SNI+GVRI+ +GPWKV K+NRS CVYGLFG KK+NEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAM+LGPEKLSLLVTEAY+DAHQKSVQAMK RM+DLAQSLGMPQGL+EGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

TrEMBL top hitse value%identityAlignment
A0A0A0LWF8 Uncharacterized protein3.06e-11087.23Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISLSAQLPN+RGISD+K+RSNLNSMSNIVG R++ +GPWKV KNNRS CVYGLFG KK+ EEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAM+LGPEKLSLLVTEAY+DAHQKSV AMK RM+DLAQSLGMPQGL+EGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

A0A6J1BZP3 nucleoid-associated protein At4g30620, chloroplastic-like2.38e-130100Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

A0A6J1EW88 nucleoid-associated protein At4g30620, chloroplastic-like isoform X18.13e-11387.23Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTIS+SAQ+PN+RGISDWK+RSNLNSMS I+G RI+ +GPWKV KNNRS CV GLFG KKENEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RM+DLAQSLGMPQGL+EGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

A0A6J1HN29 nucleoid-associated protein At4g30620, chloroplastic-like isoform X18.13e-11387.23Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTIS+SAQ+PN+RGISDWK+RSNLNSMS I+G RI+ +GPWKV KNNRS CV GLFG KKENEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RM+DLAQSLGMPQGL+EGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

A0A6J1JDG0 nucleoid-associated protein At2g24020, chloroplastic-like2.23e-11087.23Show/hide
Query:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISLSAQ+PN+RGIS  K+R+NLNSMSNIVG RI+ +GPWKV KN+RS CV GLFG KKEN EKSDDAPSKAGIFGNMQNLYETV+ AQ VVQVEAV
Subjt:  MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RM+DLAQSLGMPQGL EGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

SwissProt top hitse value%identityAlignment
B7K422 Nucleoid-associated protein PCC8801_25548.6e-1441.18Show/hide
Query:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLG
        G ++ L E   KAQ  VQ  A ++Q+EL   E +G+ EG+L+KV +SGNQ+P    I   A++ G ++LS LVT+A KDA+ +S + M+ +M +L   L 
Subjt:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLG

Query:  MP
        +P
Subjt:  MP

O82230 Nucleoid-associated protein At2g24020, chloroplastic7.2e-5369.09Show/hide
Query:  SNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVT
        S+++  +++   R T    +K     RS  V GLFG   + +  S+D  SKAGIFGNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYC GEL+KVT
Subjt:  SNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVT

Query:  LSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        LSGNQQP+RT+ITEAAM+LG EKLS LVTEAYKDAH KSV AMK RM+DLAQSLGMP GL+EG+K
Subjt:  LSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

Q5N376 Nucleoid-associated protein syc1054_d7.7e-1548.04Show/hide
Query:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLG
        G M+ L +  KKAQ  VQ  A +VQ++L   E +G  +G L+KV +SGNQ+P+R EI   A+  G E LS LV  A KDA+QKS  AMK +M  L   LG
Subjt:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q8GMT0 Nucleoid-associated protein Synpcc7942_04647.7e-1548.04Show/hide
Query:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLG
        G M+ L +  KKAQ  VQ  A +VQ++L   E +G  +G L+KV +SGNQ+P+R EI   A+  G E LS LV  A KDA+QKS  AMK +M  L   LG
Subjt:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q9M098 Nucleoid-associated protein At4g30620, chloroplastic1.6e-5273.01Show/hide
Query:  NSMSNIVGVRITSHGPWKVHKNNRSFCVYGLF-GKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLS
        +S  NIV +     G      NNRS  V GLF G KK+N+E   D  SKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELA AEFDGYC+GEL+KVTLS
Subjt:  NSMSNIVGVRITSHGPWKVHKNNRSFCVYGLF-GKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLS

Query:  GNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        GNQQP+RT+IT+AAM+LG EKLSLLVTEAYKDAH KSV AMK RM+DLAQSLGMP GL +GLK
Subjt:  GNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

Arabidopsis top hitse value%identityAlignment
AT2G24020.1 Uncharacterised BCR, YbaB family COG07185.1e-5469.09Show/hide
Query:  SNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVT
        S+++  +++   R T    +K     RS  V GLFG   + +  S+D  SKAGIFGNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYC GEL+KVT
Subjt:  SNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVT

Query:  LSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        LSGNQQP+RT+ITEAAM+LG EKLS LVTEAYKDAH KSV AMK RM+DLAQSLGMP GL+EG+K
Subjt:  LSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

AT2G24020.2 Uncharacterised BCR, YbaB family COG07181.1e-5376.03Show/hide
Query:  WKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKL
        +K     RS  V GLFG   + +  S+D  SKAGIFGNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYC GEL+KVTLSGNQQP+RT+ITEAAM+L
Subjt:  WKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKL

Query:  GPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        G EKLS LVTEAYKDAH KSV AMK RM+DLAQSLGMP GL+EG+K
Subjt:  GPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK

AT4G30620.1 Uncharacterised BCR, YbaB family COG07181.1e-5373.01Show/hide
Query:  NSMSNIVGVRITSHGPWKVHKNNRSFCVYGLF-GKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLS
        +S  NIV +     G      NNRS  V GLF G KK+N+E   D  SKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELA AEFDGYC+GEL+KVTLS
Subjt:  NSMSNIVGVRITSHGPWKVHKNNRSFCVYGLF-GKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLS

Query:  GNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK
        GNQQP+RT+IT+AAM+LG EKLSLLVTEAYKDAH KSV AMK RM+DLAQSLGMP GL +GLK
Subjt:  GNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGACAATATCTCTGAGTGCTCAACTACCAAATATGCGAGGAATTTCTGATTGGAAACAACGAAGTAACCTAAATTCAATGTCAAATATAGTTGGTGTGCGGAT
CACATCTCATGGTCCTTGGAAAGTTCATAAAAATAATAGATCCTTTTGTGTTTATGGTCTATTTGGAAAAAAGAAGGAGAATGAGGAGAAGAGCGATGATGCACCTTCAA
AGGCAGGAATATTCGGAAACATGCAGAACTTATACGAGACTGTGAAGAAGGCGCAAATGGTTGTCCAAGTAGAGGCAGTGCGTGTACAAAAAGAACTTGCAGCGGCAGAG
TTTGATGGCTACTGCGAAGGAGAGCTAATTAAGGTGACGTTATCTGGGAATCAACAACCTGTTCGCACGGAGATCACTGAGGCTGCAATGAAATTAGGACCAGAAAAACT
GTCCCTTCTAGTCACCGAAGCATACAAGGACGCACACCAGAAGAGTGTTCAGGCCATGAAGCTAAGAATGAATGATCTTGCCCAGAGCTTAGGTATGCCCCAGGGCCTCA
ATGAGGGATTGAAGTAG
mRNA sequenceShow/hide mRNA sequence
CAAAATTTTCGTAATAAGAAACGTGGCCTTTGTGTATTGGCCTGTCTTTAAGAACTATAACCGACGGGTGATGGTTGGGTGGGCGCGGTCTCTCGGAAGCGAAACTCCAC
TTTATAGTTTATCCTCTCGACTTTCTCCGCGGCACATTCACAGCTTTCAGAAATGGCGTCGACAATATCTCTGAGTGCTCAACTACCAAATATGCGAGGAATTTCTGATT
GGAAACAACGAAGTAACCTAAATTCAATGTCAAATATAGTTGGTGTGCGGATCACATCTCATGGTCCTTGGAAAGTTCATAAAAATAATAGATCCTTTTGTGTTTATGGT
CTATTTGGAAAAAAGAAGGAGAATGAGGAGAAGAGCGATGATGCACCTTCAAAGGCAGGAATATTCGGAAACATGCAGAACTTATACGAGACTGTGAAGAAGGCGCAAAT
GGTTGTCCAAGTAGAGGCAGTGCGTGTACAAAAAGAACTTGCAGCGGCAGAGTTTGATGGCTACTGCGAAGGAGAGCTAATTAAGGTGACGTTATCTGGGAATCAACAAC
CTGTTCGCACGGAGATCACTGAGGCTGCAATGAAATTAGGACCAGAAAAACTGTCCCTTCTAGTCACCGAAGCATACAAGGACGCACACCAGAAGAGTGTTCAGGCCATG
AAGCTAAGAATGAATGATCTTGCCCAGAGCTTAGGTATGCCCCAGGGCCTCAATGAGGGATTGAAGTAGACATTTTTATGTCTGTCGAGCTTCAGTTTTGTAAATGCAGG
AGCGGAACGACTTAATTGTAATCTGACCTTGGCAGAAAATTCCATTTTAGTTTTTACTGGTGAACTAAATGTTTGGAAACCTTGTTCTCCATCTCTAAATGTAAAACAGA
CATCTGTGATATTTATTGTGGAATCAGAACGAAGAATGAAAAGTACAAGACGAGTATTCAAATTGGAATCTTCTATACAAGCGATATCAGCACTGGCAATTGAGTTCGCT
GAAGGGGGAACATATTAAGATATTTGGCAATGTATGGACATTTTATTTCTTGGAAAGAAAGAGCTTTAAACTTCTACTTTGCAGAAGATAATCATTGGCAAATGTTAGTA
ATCTTAGTTCAACCTTTTCATCTGCGGACTTGAAATGGGTAATGACATTGACATGGCGTAGCCAATATAATTTAGAATGAATACAATTCTAGTTCCCTACACCCCAGCAT
TGCTGTACTGGAATTGTTGCGAAATAATGACCAATATCTGGAAACGAGACAAATGACCAATTACTTGTTCTGTGTTCATTGCTCTGTTTCTCTGCAGATAAATCAAGATC
TTGGAAGATGTTTTCCTGATAGAAACGATCTGAAAAGTGTCACAGCTCTCTGTGGCTGGATCAGTGGAACCTCATGGCCAGCTCCTCTCACTGTGGCAAAGGGTTTTCAG
AAGTAAACCTGTCCTCTGGAGTACCATGGATACCAGCGAGTTTTGATGGAGAGGTTAAGATGGCTGAGGGCGAATCTGGTGGCTGTCACTGGAACCACTGAATCGGTATC
CCCACTGCATTTTTCCACATATAAAAGTCACATTAAAGGGAAACTTTTACCTCTCTTTACTTGGGAACTTTAGTAGAATTATGTAAAACCTTCTCTACACTTCAATTTGT
ATGCATATGGAAGGAAAAACAGAGAATGAGATGCTGGGAAAGTGAATAAGAGGATAAAAGAAGGATTTATGTGGAAAGATCAGAGCTTTTCAAACCAAACCTAAAGACCC
ATATCCTGAGACCAGTTGCAATCAGCTCCTTGTACGTGGGCAACATGGATTCTTGGGTATCCTTCCAGTTTTTGAGCAGAATATCACTGCCCCAAAAAGGAAAAACAAAA
CTTTATTGGTGAAGGAGATAAGTTTTGGCCAAAGACACAGAAACTAGCAAAACCCACCTGCAAGCAGTCCATTTGTAGGGAATTCCAGTAACATTGGCATGCATTGCCAA
CTGAACCTCCTTAAGATTATAATACTTCTCTGCATAATTCTCAGTACATGGATCATACCCTGACACCCTCCGCCGGAGAAGAGAGCTTTTGAATCTGAAGGTTGAGGTTG
TTGCCACCGTGTTGTTGGGAATGACGGAGGGGCATCTGGGAGTGTAGATACTGTACTGATCAACATTGCCGAACTCGTGGTTCATGGCGTAGCTAACAACATGGTCGCAT
TGCTCGGAAGTCTTGTCGGAGGTGAAGTTGCAGTGCTTAAGGATGGAGTTGTAGGTTGTGTCTGATATCATGGCATGGCTCCACCAATACGTCACCGTTCCAAGGGCGTC
GAGGTTCATATCCGTTACTGCATTTCCCACCTGCATTTTCAAAACTTAACTACAAATTCCACCGGTTCCTATGATACTCCTACTAATCCAATGTAGTAATTTTCCGTTCT
TACAATGAAGCCTTTGAGGTTGATGAAGGAGTGAGAGTGTGCTTTGTTGTAGTCGATAATCTTCTTTGCCAGTTGTGGAACGTAGTGTCCTATACGTCGCAAAGATTAGA
TTCTTTGCAATGTTCAAGAATATATTGAAAAGATTAAATTTTACCCGAGTGGATTTACCTGCATAACTCTCCCCAGAAATGAAGAATTCTCTGTATTTGTATTGCGGGAA
TCTAGACATCCATCGGATAAGGAAAACAAGAGCATCTTGAGCTGTTCGGTTGTCACCGGAATCTTCGAGGTCGGAGGTGGTATTGGTGTAAGAGAATCCAACTCCGGCAG
GTGATTCCAGAAACAGCAGATTTGCATCTGCAGAGAAATGGTGTTTCTGTAAAGAGATTTTGATCAAAAGGGAGAGGAATTAGAGAATGTAATAGACCTTTGTTCCATGA
GTATTTGTTTAGATAAAGAGAAGAAGCAGTTTTGTTGATTCTGAATGGCCCAATTTCCTCCGATGCTCCATATGCTATTGATGAACAACCTGGTCCTGTGTATATCACCA
AACCCTCAAAAAGTGCTTTTACAAACCTTTATTCTTTTTCCTAAAATTATATTACTTTATCAAAACTGCTTCTAAAAATATGCA
Protein sequenceShow/hide protein sequence
MASTISLSAQLPNMRGISDWKQRSNLNSMSNIVGVRITSHGPWKVHKNNRSFCVYGLFGKKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAE
FDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMNDLAQSLGMPQGLNEGLK