; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg14082 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg14082
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionNucleoid-associated protein
Genome locationCarg_Chr02:6183209..6187049
RNA-Seq ExpressionCarg14082
SyntenyCarg14082
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR004401 - Nucleoid-associated protein YbaB/EbfC family
IPR036894 - Nucleoid-associated protein YbaB-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587818.1 Nucleoid-associated protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.7e-9397.87Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR    GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

KAG7023441.1 Nucleoid-associated protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]2.1e-8692.02Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTIS+SAQVPNLRGISD KKRSNLNSMS I+G R    GPWKVEKN RSLCVCGLFGGKKEN EKSDDAPSKAGIFGNMQ LYETVRTAQQVVQVEAV
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMKQRMSDLAQSLGMPQGL EGLK
Subjt:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

KAG7035685.1 Nucleoid-associated protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]1.6e-94100Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTRGPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQK
        MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTRGPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQK
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTRGPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQK

Query:  ELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        ELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  ELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

XP_022932159.1 nucleoid-associated protein At4g30620, chloroplastic-like isoform X1 [Cucurbita moschata]3.3e-9297.34Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR    GPWKVEKN RSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

XP_023531577.1 nucleoid-associated protein At4g30620, chloroplastic-like [Cucurbita pepo subsp. pepo]6.2e-9196.79Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR---GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVR
        MASTI+ISAQVPNLRGISDWKKRSNLNSMSKIIGTR    PWKVEKN RSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVR
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR---GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVR

Query:  VQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        VQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  VQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

TrEMBL top hitse value%identityAlignment
A0A0A0LWF8 Uncharacterized protein6.1e-8490.43Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTIS+SAQ+PNLRGISD+KKRSNLNSMS I+GTR    GPWKVEKN RSLCV GLFGGKK+ EEKSDDAPSKAGIFGNMQKLYETVRTAQ+VVQVEAV
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQP+RTEITEAAM+LGPEKLSLLVTEAYQDAHQKSV AMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

A0A6J1E6N9 nucleoid-associated protein At2g24020, chloroplastic-like1.6e-8490.96Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTIS+SAQ+PNLRGISD KKRSNLNSMS I+G R    GPWKVEKN RSLCVCGLFGGKKEN EK DDAPSKAGIFGNMQ LYETVRTAQQVVQVEAV
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMKQRMSDLAQSLGMPQGL EGLK
Subjt:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

A0A6J1EW88 nucleoid-associated protein At4g30620, chloroplastic-like isoform X11.6e-9297.34Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR    GPWKVEKN RSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

A0A6J1HN29 nucleoid-associated protein At4g30620, chloroplastic-like isoform X11.6e-9297.34Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR    GPWKVEKN RSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

A0A6J1JDG0 nucleoid-associated protein At2g24020, chloroplastic-like1.5e-8590.96Show/hide
Query:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTIS+SAQVPNLRGIS  KKR+NLNSMS I+G R    GPWKVEKN RSLCVCGLFGGKKEN EKSDDAPSKAGIFGNMQ LYETVRTAQQVVQVEAV
Subjt:  MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTR----GPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAY+DAHQKSVQAMKQRMSDLAQSLGMPQGL EGLK
Subjt:  RVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

SwissProt top hitse value%identityAlignment
O82230 Nucleoid-associated protein At2g24020, chloroplastic2.3e-5174.66Show/hide
Query:  WKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKL
        +K +   RSL V GLFGG  + +  S+D  SKAGIFGNMQ +YETV+ AQ VVQVEAVRVQKELA AEFDGYC GEL+KVTLSGNQQP+RT+ITEAAM+L
Subjt:  WKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKL

Query:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        G EKLS LVTEAY+DAH KSV AMK+RMSDLAQSLGMP GLSEG+K
Subjt:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

Q5N376 Nucleoid-associated protein syc1054_d1.7e-1447.06Show/hide
Query:  GNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG
        G M++L +  + AQQ VQ  A +VQ++L   E +G  +G L+KV +SGNQ+PLR EI   A+  G E LS LV  A +DA+QKS  AMK++M  L   LG
Subjt:  GNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q8GMT0 Nucleoid-associated protein Synpcc7942_04641.7e-1447.06Show/hide
Query:  GNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG
        G M++L +  + AQQ VQ  A +VQ++L   E +G  +G L+KV +SGNQ+PLR EI   A+  G E LS LV  A +DA+QKS  AMK++M  L   LG
Subjt:  GNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q8YM73 Nucleoid-associated protein alr50674.2e-1346.08Show/hide
Query:  GNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG
        G M++L E  + AQQ VQ  A R+Q+EL   E  G   G L+KV +SGNQ+P R EI+  A+  G + LS LVT A +DA+ KS   M++RM DL   L 
Subjt:  GNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q9M098 Nucleoid-associated protein At4g30620, chloroplastic6.6e-5178.87Show/hide
Query:  NKRSLCVCGLF-GGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEK
        N RSL V GLF GGKK+N+E   D  SKAGI GNMQ LYETV+ AQ VVQVEAVRVQKELAVAEFDGYC+GEL+KVTLSGNQQP+RT+IT+AAM+LG EK
Subjt:  NKRSLCVCGLF-GGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEK

Query:  LSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        LSLLVTEAY+DAH KSV AMK+RMSDLAQSLGMP GL +GLK
Subjt:  LSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

Arabidopsis top hitse value%identityAlignment
AT2G24020.1 Uncharacterised BCR, YbaB family COG07181.6e-5274.66Show/hide
Query:  WKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKL
        +K +   RSL V GLFGG  + +  S+D  SKAGIFGNMQ +YETV+ AQ VVQVEAVRVQKELA AEFDGYC GEL+KVTLSGNQQP+RT+ITEAAM+L
Subjt:  WKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKL

Query:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        G EKLS LVTEAY+DAH KSV AMK+RMSDLAQSLGMP GLSEG+K
Subjt:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

AT2G24020.2 Uncharacterised BCR, YbaB family COG07181.6e-5274.66Show/hide
Query:  WKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKL
        +K +   RSL V GLFGG  + +  S+D  SKAGIFGNMQ +YETV+ AQ VVQVEAVRVQKELA AEFDGYC GEL+KVTLSGNQQP+RT+ITEAAM+L
Subjt:  WKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKL

Query:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        G EKLS LVTEAY+DAH KSV AMK+RMSDLAQSLGMP GLSEG+K
Subjt:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

AT4G30620.1 Uncharacterised BCR, YbaB family COG07184.7e-5278.87Show/hide
Query:  NKRSLCVCGLF-GGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEK
        N RSL V GLF GGKK+N+E   D  SKAGI GNMQ LYETV+ AQ VVQVEAVRVQKELAVAEFDGYC+GEL+KVTLSGNQQP+RT+IT+AAM+LG EK
Subjt:  NKRSLCVCGLF-GGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMKLGPEK

Query:  LSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        LSLLVTEAY+DAH KSV AMK+RMSDLAQSLGMP GL +GLK
Subjt:  LSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCGACAATTTCAATCAGTGCTCAAGTACCAAATCTGCGAGGAATTTCTGATTGGAAAAAACGAAGTAACCTTAATTCAATGTCAAAAATAATTGGCACTCGAGG
TCCTTGGAAAGTTGAGAAAAATAAAAGATCTCTTTGTGTTTGTGGCCTATTTGGAGGGAAAAAGGAGAATGAGGAGAAGAGCGATGATGCACCTTCAAAGGCAGGAATCT
TTGGAAACATGCAGAAGTTATATGAGACTGTGAGGACAGCGCAACAGGTTGTCCAAGTAGAGGCAGTGCGCGTACAAAAGGAACTTGCGGTGGCAGAGTTTGATGGCTAC
TGCGAAGGAGAGCTAATAAAGGTGACATTATCCGGAAATCAGCAACCTCTTCGGACAGAGATCACTGAGGCTGCAATGAAATTAGGACCAGAAAAATTGTCCCTTCTAGT
TACTGAAGCATATCAGGACGCACATCAGAAGAGCGTTCAGGCCATGAAGCAAAGAATGAGTGATCTTGCCCAGAGTTTGGGTATGCCACAAGGCCTCAGTGAGGGATTGA
AGTAA
mRNA sequenceShow/hide mRNA sequence
GGCGAAAATCTACTTTATAGTTTATCCTCTCGAGTTTCTCCTCGGCACATTCACAGCTTTGGGCAATGGCATCGACAATTTCAATCAGTGCTCAAGTACCAAATCTGCGA
GGAATTTCTGATTGGAAAAAACGAAGTAACCTTAATTCAATGTCAAAAATAATTGGCACTCGAGGTCCTTGGAAAGTTGAGAAAAATAAAAGATCTCTTTGTGTTTGTGG
CCTATTTGGAGGGAAAAAGGAGAATGAGGAGAAGAGCGATGATGCACCTTCAAAGGCAGGAATCTTTGGAAACATGCAGAAGTTATATGAGACTGTGAGGACAGCGCAAC
AGGTTGTCCAAGTAGAGGCAGTGCGCGTACAAAAGGAACTTGCGGTGGCAGAGTTTGATGGCTACTGCGAAGGAGAGCTAATAAAGGTGACATTATCCGGAAATCAGCAA
CCTCTTCGGACAGAGATCACTGAGGCTGCAATGAAATTAGGACCAGAAAAATTGTCCCTTCTAGTTACTGAAGCATATCAGGACGCACATCAGAAGAGCGTTCAGGCCAT
GAAGCAAAGAATGAGTGATCTTGCCCAGAGTTTGGGTATGCCACAAGGCCTCAGTGAGGGATTGAAGTAAAACCTTTTATGTCTGTCGAGCTTCAGTTTTGTAAATGCAG
CGGCAGTGGGACTACGTAATTGTAATCTGACCTTGACAGGAAATTCCATATCAGTTTTTACTGGTGAACTAAATGTTTGGGAACCTTGTTCTCCATCTTTAGATGTAAAA
AAAAAAAAAAAACAGCTATGATATTTATTGTGGAATCAGAACAAAGAATCAAAAGAATGAAATGAAAGTGAGATTTCAAATTGGAATCTACTTTAGAAGCGATATCGACT
ATCATTTCGGTTTGTTGAAAGGAAAAGATATTTGACAATATGTGGACATCTGATTTCTTGAAAAGAAAAAGCTTTAAACT
Protein sequenceShow/hide protein sequence
MASTISISAQVPNLRGISDWKKRSNLNSMSKIIGTRGPWKVEKNKRSLCVCGLFGGKKENEEKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAVAEFDGY
CEGELIKVTLSGNQQPLRTEITEAAMKLGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK