; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr002816 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr002816
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNucleoid-associated protein
Genome locationtig00001784:12735..20167
RNA-Seq ExpressionSgr002816
SyntenySgr002816
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR004401 - Nucleoid-associated protein YbaB/EbfC family
IPR036894 - Nucleoid-associated protein YbaB-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7023441.1 Nucleoid-associated protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]5.2e-8389.3Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISL+AQ+PNLRGISD KKRSNLNSMSNIVG RI  +GPWKVEKN+RS  V GLFG KKEN EKSDDAPSKAGIFGNMQNLYETV+ AQ VVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RMSDLAQSLGMPQGL EGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

XP_022135106.1 nucleoid-associated protein At4g30620, chloroplastic-like [Momordica charantia]2.6e-9094.65Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISL+AQ+PN+RGISD K+RSNLNSMSNIVGVRI SHGPWKV KNNRSF VYGLFG+KKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRM+DLAQSLGMPQGLNEGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

XP_022932159.1 nucleoid-associated protein At4g30620, chloroplastic-like isoform X1 [Cucurbita moschata]4.4e-8287.7Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTIS++AQ+PNLRGISD KKRSNLNSMS I+G RI  +GPWKVEKNNRS  V GLFG KKENEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELA AEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RMSDLAQSLGMPQGL+EGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

XP_023516534.1 nucleoid-associated protein At4g30620, chloroplastic-like [Cucurbita pepo subsp. pepo]2.6e-8288.77Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISL+AQ+PNLRGISD KKRSNLNS SNIVG RI  +GPWKVEKN+RS  V GLFG KKEN EKSDDAPSKAGIFGNMQNLYETV+ AQ VVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RMSDLAQSLGMPQGL EGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

XP_038879413.1 nucleoid-associated protein At4g30620, chloroplastic isoform X2 [Benincasa hispida]1.5e-8287.7Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISL+AQ+PNLRG+SD KKRSNLNS+SNI+GVRI  +GPWKVEK+NRS  VYGLFG KK+NEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAM+LGPEKLSLLVTEAY+DAHQKSVQAMK RMSDLAQSLGMPQGL+EGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

TrEMBL top hitse value%identityAlignment
A0A5A7SUZ8 Nucleoid-associated protein2.4e-8187.7Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISL+AQ+PNLRGISD KKRSNLN MSNIVG R+  +GPWKVEKNNRS  VYGLFG KK+ EEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAM+LGPEKLSLLVTEAY+DAHQKSVQAMK RMSDLAQSLGMPQGL+EGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

A0A6J1BZP3 nucleoid-associated protein At4g30620, chloroplastic-like1.3e-9094.65Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISL+AQ+PN+RGISD K+RSNLNSMSNIVGVRI SHGPWKV KNNRSF VYGLFG+KKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRM+DLAQSLGMPQGLNEGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

A0A6J1EW88 nucleoid-associated protein At4g30620, chloroplastic-like isoform X12.2e-8287.7Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTIS++AQ+PNLRGISD KKRSNLNSMS I+G RI  +GPWKVEKNNRS  V GLFG KKENEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELA AEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RMSDLAQSLGMPQGL+EGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

A0A6J1HN29 nucleoid-associated protein At4g30620, chloroplastic-like isoform X12.2e-8287.7Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTIS++AQ+PNLRGISD KKRSNLNSMS I+G RI  +GPWKVEKNNRS  V GLFG KKENEEKSDDAPSKAGIFGNMQ LYETV+ AQ VVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELA AEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RMSDLAQSLGMPQGL+EGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

A0A6J1JDG0 nucleoid-associated protein At2g24020, chloroplastic-like3.7e-8288.24Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV
        MASTISL+AQ+PNLRGIS  KKR+NLNSMSNIVG RI  +GPWKVEKN+RS  V GLFG KKEN EKSDDAPSKAGIFGNMQNLYETV+ AQ VVQVEAV
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMK RMSDLAQSLGMPQGL EGL
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

SwissProt top hitse value%identityAlignment
B7K422 Nucleoid-associated protein PCC8801_25542.0e-1341.18Show/hide
Query:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLG
        G ++ L E   KAQ  VQ  A ++Q+EL   E +G+ EG+L+KV +SGNQ+P    I   A++ G ++LS LVT+A KDA+ +S + M+ +M +L   L 
Subjt:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

O82230 Nucleoid-associated protein At2g24020, chloroplastic5.9e-5377.24Show/hide
Query:  WKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKL
        +K +   RS RV GLFG   + +  S+D  SKAGIFGNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYC GEL+KVTLSGNQQP+RT+ITEAAM+L
Subjt:  WKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKL

Query:  GPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        G EKLS LVTEAYKDAH KSV AMK RMSDLAQSLGMP GL+EG+
Subjt:  GPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

Q5N376 Nucleoid-associated protein syc1054_d1.8e-1448.04Show/hide
Query:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLG
        G M+ L +  KKAQ  VQ  A +VQ++L   E +G  +G L+KV +SGNQ+P+R EI   A+  G E LS LV  A KDA+QKS  AMK +M  L   LG
Subjt:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q8GMT0 Nucleoid-associated protein Synpcc7942_04641.8e-1448.04Show/hide
Query:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLG
        G M+ L +  KKAQ  VQ  A +VQ++L   E +G  +G L+KV +SGNQ+P+R EI   A+  G E LS LV  A KDA+QKS  AMK +M  L   LG
Subjt:  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q9M098 Nucleoid-associated protein At4g30620, chloroplastic2.6e-5367.03Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLF-GRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEA
        MAST + T     L         +  +S  NIV +     G      NNRS RV GLF G KK+N+E   D  SKAGI GNMQNLYETVKKAQMVVQVEA
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLF-GRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEA

Query:  VRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLN
        VRVQKELA AEFDGYC+GEL+KVTLSGNQQP+RT+IT+AAM+LG EKLSLLVTEAYKDAH KSV AMK RMSDLAQSLGMP GL+
Subjt:  VRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLN

Arabidopsis top hitse value%identityAlignment
AT2G24020.1 Uncharacterised BCR, YbaB family COG07184.2e-5477.24Show/hide
Query:  WKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKL
        +K +   RS RV GLFG   + +  S+D  SKAGIFGNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYC GEL+KVTLSGNQQP+RT+ITEAAM+L
Subjt:  WKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKL

Query:  GPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        G EKLS LVTEAYKDAH KSV AMK RMSDLAQSLGMP GL+EG+
Subjt:  GPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

AT2G24020.2 Uncharacterised BCR, YbaB family COG07184.2e-5477.24Show/hide
Query:  WKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKL
        +K +   RS RV GLFG   + +  S+D  SKAGIFGNMQN+YETVKKAQMVVQVEAVRVQKELAAAEFDGYC GEL+KVTLSGNQQP+RT+ITEAAM+L
Subjt:  WKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKL

Query:  GPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL
        G EKLS LVTEAYKDAH KSV AMK RMSDLAQSLGMP GL+EG+
Subjt:  GPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGL

AT4G30620.1 Uncharacterised BCR, YbaB family COG07181.9e-5467.03Show/hide
Query:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLF-GRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEA
        MAST + T     L         +  +S  NIV +     G      NNRS RV GLF G KK+N+E   D  SKAGI GNMQNLYETVKKAQMVVQVEA
Subjt:  MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLF-GRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEA

Query:  VRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLN
        VRVQKELA AEFDGYC+GEL+KVTLSGNQQP+RT+IT+AAM+LG EKLSLLVTEAYKDAH KSV AMK RMSDLAQSLGMP GL+
Subjt:  VRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGACAATTTCTTTGACTGCTCAAATACCAAATTTGCGAGGAATTTCTGATCGGAAAAAACGCAGTAACCTAAATTCAATGTCAAATATAGTTGGTGTGCGGAT
CTTATCTCATGGTCCTTGGAAAGTTGAGAAAAACAATAGATCTTTTCGTGTTTATGGTCTATTTGGAAGAAAAAAGGAGAATGAGGAGAAGAGTGATGATGCGCCTTCAA
AGGCAGGAATCTTTGGAAACATGCAGAACTTATATGAGACTGTGAAGAAGGCGCAAATGGTTGTCCAAGTAGAGGCAGTGCGTGTACAAAAAGAACTTGCGGCGGCGGAG
TTTGATGGCTACTGCGAAGGAGAGCTAATTAAGGTGACATTATCCGGGAATCAGCAACCTGTTCGCACAGAGATCACTGAGGCTGCAATGAAATTAGGACCGGAAAAACT
GTCTCTGCTAGTCACTGAAGCATACAAGGACGCGCATCAGAAGAGCGTTCAGGCCATGAAGCTAAGAATGAGTGATCTTGCCCAGAGCTTAGGTATGCCCCAGGGCCTCA
ATGAGGGATTGAACATAATTCTCAGTGCATGGATCGTACCCAGACACCCTCCGCCGGAGAAGAGAACTTTTGAATCTGAAAGTGGGCGTCGCCGCCGTGTTGTTGGGAAA
CACGGCGCGGCACTTGGGAGTGTAGATACTGTACTGATCAACATTACCAAACTCGTGGTTCATGGCGTAGCTTACAACGTCGTCGCATTGGTTGGAAGTCTTGTCGGAGG
TGAAATTGCAGTGCTTGAGAATGGAGTTGTAGGTGGTGTCCGATATCATTGCATGGCTCCACCAATACGTCACTGTTCCAAGGGCGTCGTGGTTCGTATCCTTTGGGAAT
CTGGACATCCATCGGACGAGGAAAATAAGAGCATCTTGAGCTGTTCGGTTGTCGCCGGAATCTTCAAGGTCGGAGGAGGTGTTTGTCCTGAGTACTGCGAGAATGCCACC
GGTGGCTGTCCCGGGAGCGACGTGATTCTGTCGCGGAGCTGCTTTTCTGGCACGGCGGCGGCAATGGCGGAGGTGGTTGAATGGGCCAGGAGTGATATTAAGGCAATTGA
GAGAAGGGAGACCATTGCACTCCTCAAACTACTCATGCATGAAGGGTTTAGAGAGAGAGTTGTTGATGAGTCAATAATGGGATGGTGTTTTCTCTCTCTCTCTCTCTCTC
TCCCCCCAAATTCTGTGGAGAGAGAGAAGGAGATTGAAGGGGTAAATGAGATGAGATGTGGGTTGGTTAAGAAGAAGAAGGAGGGAGAAGAAAATCATAAATGCACCACC
ACTCTGATGATGAGAGTGTGGTCCTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCGACAATTTCTTTGACTGCTCAAATACCAAATTTGCGAGGAATTTCTGATCGGAAAAAACGCAGTAACCTAAATTCAATGTCAAATATAGTTGGTGTGCGGAT
CTTATCTCATGGTCCTTGGAAAGTTGAGAAAAACAATAGATCTTTTCGTGTTTATGGTCTATTTGGAAGAAAAAAGGAGAATGAGGAGAAGAGTGATGATGCGCCTTCAA
AGGCAGGAATCTTTGGAAACATGCAGAACTTATATGAGACTGTGAAGAAGGCGCAAATGGTTGTCCAAGTAGAGGCAGTGCGTGTACAAAAAGAACTTGCGGCGGCGGAG
TTTGATGGCTACTGCGAAGGAGAGCTAATTAAGGTGACATTATCCGGGAATCAGCAACCTGTTCGCACAGAGATCACTGAGGCTGCAATGAAATTAGGACCGGAAAAACT
GTCTCTGCTAGTCACTGAAGCATACAAGGACGCGCATCAGAAGAGCGTTCAGGCCATGAAGCTAAGAATGAGTGATCTTGCCCAGAGCTTAGGTATGCCCCAGGGCCTCA
ATGAGGGATTGAACATAATTCTCAGTGCATGGATCGTACCCAGACACCCTCCGCCGGAGAAGAGAACTTTTGAATCTGAAAGTGGGCGTCGCCGCCGTGTTGTTGGGAAA
CACGGCGCGGCACTTGGGAGTGTAGATACTGTACTGATCAACATTACCAAACTCGTGGTTCATGGCGTAGCTTACAACGTCGTCGCATTGGTTGGAAGTCTTGTCGGAGG
TGAAATTGCAGTGCTTGAGAATGGAGTTGTAGGTGGTGTCCGATATCATTGCATGGCTCCACCAATACGTCACTGTTCCAAGGGCGTCGTGGTTCGTATCCTTTGGGAAT
CTGGACATCCATCGGACGAGGAAAATAAGAGCATCTTGAGCTGTTCGGTTGTCGCCGGAATCTTCAAGGTCGGAGGAGGTGTTTGTCCTGAGTACTGCGAGAATGCCACC
GGTGGCTGTCCCGGGAGCGACGTGATTCTGTCGCGGAGCTGCTTTTCTGGCACGGCGGCGGCAATGGCGGAGGTGGTTGAATGGGCCAGGAGTGATATTAAGGCAATTGA
GAGAAGGGAGACCATTGCACTCCTCAAACTACTCATGCATGAAGGGTTTAGAGAGAGAGTTGTTGATGAGTCAATAATGGGATGGTGTTTTCTCTCTCTCTCTCTCTCTC
TCCCCCCAAATTCTGTGGAGAGAGAGAAGGAGATTGAAGGGGTAAATGAGATGAGATGTGGGTTGGTTAAGAAGAAGAAGGAGGGAGAAGAAAATCATAAATGCACCACC
ACTCTGATGATGAGAGTGTGGTCCTTATAA
Protein sequenceShow/hide protein sequence
MASTISLTAQIPNLRGISDRKKRSNLNSMSNIVGVRILSHGPWKVEKNNRSFRVYGLFGRKKENEEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELAAAE
FDGYCEGELIKVTLSGNQQPVRTEITEAAMKLGPEKLSLLVTEAYKDAHQKSVQAMKLRMSDLAQSLGMPQGLNEGLNIILSAWIVPRHPPPEKRTFESESGRRRRVVGK
HGAALGSVDTVLINITKLVVHGVAYNVVALVGSLVGGEIAVLENGVVGGVRYHCMAPPIRHCSKGVVVRILWESGHPSDEENKSILSCSVVAGIFKVGGGVCPEYCENAT
GGCPGSDVILSRSCFSGTAAAMAEVVEWARSDIKAIERRETIALLKLLMHEGFRERVVDESIMGWCFLSLSLSLPPNSVEREKEIEGVNEMRCGLVKKKKEGEENHKCTT
TLMMRVWSL