CuGenDBv2

Tan0008882 (gene) of Snake gourd v1 genome

Gene ID: Tan0008882
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Nucleoid-associated protein
Genome location: LG09:67557576..67560993
RNA-Seq Expression: Tan0008882
Synteny: Tan0008882
Gene Ontology terms: GO:0003677 - DNA binding (molecular function)
InterPro domains: IPR004401 - Nucleoid-associated protein YbaB/EbfC family
                  IPR036894 - Nucleoid-associated protein YbaB-like domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6587818.1 Nucleoid-associated protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 9.8e-92; identity: 94.15%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTIS+SAQVPNLRG+SDWKKRSNLNSMS I+G R SPYGPWKVEKN RSLCV GLFGGKKENE+KSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQPLRTEITEAAM+LGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

XP_004146679.1 nucleoid-associated protein At2g24020, chloroplastic [Cucumis sativus] (e value: 4.1e-90; identity: 94.15%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISLSAQ+PNLRG+SD+KKRSNLNSMSNIVG R SPYGPWKVEKNNRSLCVYGLFGGKK+ E+KSDDAPSKAGIFGNMQKLYETVRTAQ+VVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMELGPEKLSLLVTEAYQDAHQKSV AMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

XP_022932159.1 nucleoid-associated protein At4g30620, chloroplastic-like isoform X1 [Cucurbita moschata] (e value: 2.0e-92; identity: 94.68%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTIS+SAQVPNLRG+SDWKKRSNLNSMS I+G R SPYGPWKVEKNNRSLCV GLFGGKKENE+KSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQPLRTEITEAAM+LGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

XP_038879412.1 nucleoid-associated protein At4g30620, chloroplastic isoform X1 [Benincasa hispida] (e value: 3.2e-90; identity: 91.75%)
Query:  MASTISLSAQVPNLRGVSDWKKRS------NLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQV
        MASTISLSAQ+PNLRG+SD+KKRS      NLNS+SNI+GVR SPYGPWKVEK+NRSLCVYGLFGGKK+NE+KSDDAPSKAGIFGNMQKLYETVRTAQ+V
Subjt:  MASTISLSAQVPNLRGVSDWKKRS------NLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQV

Query:  VQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        VQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  VQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

XP_038879413.1 nucleoid-associated protein At4g30620, chloroplastic isoform X2 [Benincasa hispida] (e value: 3.4e-92; identity: 94.68%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISLSAQ+PNLRG+SD+KKRSNLNS+SNI+GVR SPYGPWKVEK+NRSLCVYGLFGGKK+NE+KSDDAPSKAGIFGNMQKLYETVRTAQ+VVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LWF8 Uncharacterized protein (e value: 2.0e-90; identity: 94.15%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISLSAQ+PNLRG+SD+KKRSNLNSMSNIVG R SPYGPWKVEKNNRSLCVYGLFGGKK+ E+KSDDAPSKAGIFGNMQKLYETVRTAQ+VVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMELGPEKLSLLVTEAYQDAHQKSV AMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

A0A1S3B9T2 nucleoid-associated protein At2g24020, chloroplastic-like (e value: 3.4e-90; identity: 94.15%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISLSAQ+PNLRG+SD KKRSNLN MSNIVG R SPYGPWKVEKNNRSLCVYGLFGGKK+ E+KSDDAPSKAGIFGNMQKLYETVRTAQ+VVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

A0A5A7SUZ8 Nucleoid-associated protein (e value: 3.4e-90; identity: 94.15%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTISLSAQ+PNLRG+SD KKRSNLN MSNIVG R SPYGPWKVEKNNRSLCVYGLFGGKK+ E+KSDDAPSKAGIFGNMQKLYETVRTAQ+VVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELAAAEFDGYCEGELIKVTLSGNQQP+RTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

A0A6J1EW88 nucleoid-associated protein At4g30620, chloroplastic-like isoform X1 (e value: 9.6e-93; identity: 94.68%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTIS+SAQVPNLRG+SDWKKRSNLNSMS I+G R SPYGPWKVEKNNRSLCV GLFGGKKENE+KSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQPLRTEITEAAM+LGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

A0A6J1HN29 nucleoid-associated protein At4g30620, chloroplastic-like isoform X1 (e value: 9.6e-93; identity: 94.68%)
Query:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
        MASTIS+SAQVPNLRG+SDWKKRSNLNSMS I+G R SPYGPWKVEKNNRSLCV GLFGGKKENE+KSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV
Subjt:  MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAV

Query:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        RVQKELA AEFDGYCEGELIKVTLSGNQQPLRTEITEAAM+LGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
Subjt:  RVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

SwissProt top hits (e value, % identity, alignment)
B7K422 Nucleoid-associated protein PCC8801_2554 (e value: 1.5e-13; identity: 41.18%)
Query:  GNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG
        G +++L E    AQQ VQ  A ++Q+EL   E +G+ EG+L+KV +SGNQ+P    I   A+E G ++LS LVT+A +DA+ +S + M+ +M +L   L 
Subjt:  GNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

O82230 Nucleoid-associated protein At2g24020, chloroplastic (e value: 7.2e-53; identity: 76.03%)
Query:  WKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMEL
        +K +   RSL V GLFGG  + ++ S+D  SKAGIFGNMQ +YETV+ AQ VVQVEAVRVQKELAAAEFDGYC GEL+KVTLSGNQQP+RT+ITEAAMEL
Subjt:  WKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMEL

Query:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        G EKLS LVTEAY+DAH KSV AMK+RMSDLAQSLGMP GLSEG+K
Subjt:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

Q5N376 Nucleoid-associated protein syc1054_d (e value: 1.3e-14; identity: 47.06%)
Query:  GNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG
        G M++L +  + AQQ VQ  A +VQ++L   E +G  +G L+KV +SGNQ+PLR EI   A+  G E LS LV  A +DA+QKS  AMK++M  L   LG
Subjt:  GNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q8GMT0 Nucleoid-associated protein Synpcc7942_0464 (e value: 1.3e-14; identity: 47.06%)
Query:  GNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG
        G M++L +  + AQQ VQ  A +VQ++L   E +G  +G L+KV +SGNQ+PLR EI   A+  G E LS LV  A +DA+QKS  AMK++M  L   LG
Subjt:  GNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLG

Query:  MP
        +P
Subjt:  MP

Q9M098 Nucleoid-associated protein At4g30620, chloroplastic (e value: 2.1e-52; identity: 71.6%)
Query:  NSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSG
        +S  NIV +     G      NNRSL V GLFGG K  +D  +D  SKAGI GNMQ LYETV+ AQ VVQVEAVRVQKELA AEFDGYC+GEL+KVTLSG
Subjt:  NSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSG

Query:  NQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        NQQP+RT+IT+AAMELG EKLSLLVTEAY+DAH KSV AMK+RMSDLAQSLGMP GL +GLK
Subjt:  NQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

Arabidopsis top hits (e value, % identity, alignment)
AT2G24020.1 Uncharacterised BCR, YbaB family COG0718 (e value: 5.1e-54; identity: 76.03%)
Query:  WKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMEL
        +K +   RSL V GLFGG  + ++ S+D  SKAGIFGNMQ +YETV+ AQ VVQVEAVRVQKELAAAEFDGYC GEL+KVTLSGNQQP+RT+ITEAAMEL
Subjt:  WKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMEL

Query:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        G EKLS LVTEAY+DAH KSV AMK+RMSDLAQSLGMP GLSEG+K
Subjt:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

AT2G24020.2 Uncharacterised BCR, YbaB family COG0718 (e value: 5.1e-54; identity: 76.03%)
Query:  WKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMEL
        +K +   RSL V GLFGG  + ++ S+D  SKAGIFGNMQ +YETV+ AQ VVQVEAVRVQKELAAAEFDGYC GEL+KVTLSGNQQP+RT+ITEAAMEL
Subjt:  WKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPLRTEITEAAMEL

Query:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        G EKLS LVTEAY+DAH KSV AMK+RMSDLAQSLGMP GLSEG+K
Subjt:  GPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK

AT4G30620.1 Uncharacterised BCR, YbaB family COG0718 (e value: 1.5e-53; identity: 71.6%)
Query:  NSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSG
        +S  NIV +     G      NNRSL V GLFGG K  +D  +D  SKAGI GNMQ LYETV+ AQ VVQVEAVRVQKELA AEFDGYC+GEL+KVTLSG
Subjt:  NSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSG

Query:  NQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
        NQQP+RT+IT+AAMELG EKLSLLVTEAY+DAH KSV AMK+RMSDLAQSLGMP GL +GLK
Subjt:  NQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
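
The % identity values can be related to the alignment blocks listed with each hit. In a BLAST-style alignment, the unlabeled middle line repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank for a mismatch or gap. The following is a minimal Python sketch under that assumption, taking identity as identical columns divided by alignment length. The function name `percent_identity` is hypothetical and not part of CuGenDB, and the inputs are a toy example rather than a hit from this page.

```python
def percent_identity(query_blocks, midline_blocks):
    """Estimate % identity from BLAST-style alignment blocks.

    query_blocks   -- the "Query:" strings of each block (label removed)
    midline_blocks -- the matching middle-line strings (leading indent removed)
    """
    identical = 0
    aligned = 0
    for query, midline in zip(query_blocks, midline_blocks):
        # Trailing spaces on the middle line are often lost in copied text,
        # so pad it back to the width of the query block.
        midline = midline.ljust(len(query))
        # A letter on the middle line marks an identical column;
        # "+" and " " mark non-identical columns.
        identical += sum(1 for ch in midline if ch.isalpha())
        aligned += len(query)
    return round(100.0 * identical / aligned, 2)


# Toy example (not taken from a specific hit): 18 identical columns out of 20.
toy_query = ["MASTISLSAQ", "RVQKELAAAE"]
toy_midline = ["MASTIS+SAQ", "RVQKELA AE"]
print(percent_identity(toy_query, toy_midline))
```

Gap columns ("-" in the query) count toward the alignment length here, which is how the sketch treats gapped alignments; other tools may define the denominator differently.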


Sequences
CDS sequence
ATGGCGTCGACAATATCTCTGAGTGCTCAAGTACCAAATCTGCGAGGAGTTTCTGATTGGAAAAAACGAAGTAACCTAAATTCAATGTCAAATATAGTTGGTGTACGGAA
CTCACCTTATGGTCCTTGGAAAGTTGAGAAAAACAATAGATCTCTCTGTGTTTATGGTCTATTTGGAGGAAAAAAGGAGAATGAGGATAAGAGTGATGATGCACCTTCAA
AGGCAGGAATCTTTGGAAACATGCAGAAGTTATATGAGACTGTGAGGACAGCGCAACAGGTTGTCCAAGTAGAGGCAGTGCGTGTACAGAAAGAACTTGCGGCGGCAGAG
TTTGATGGCTACTGCGAAGGAGAGCTAATAAAGGTGACATTATCCGGGAATCAGCAACCTCTTCGCACTGAGATCACCGAGGCTGCAATGGAATTAGGACCAGAAAAACT
GTCCCTTCTAGTCACTGAAGCATACCAGGATGCGCACCAGAAGAGCGTTCAGGCCATGAAGCAAAGAATGAGCGATCTTGCCCAGAGCTTAGGAATGCCCCAGGGCCTCA
GTGAGGGATTGAAGTAG
mRNA sequence
ATGGCGTCGACAATATCTCTGAGTGCTCAAGTACCAAATCTGCGAGGAGTTTCTGATTGGAAAAAACGAAGTAACCTAAATTCAATGTCAAATATAGTTGGTGTACGGAA
CTCACCTTATGGTCCTTGGAAAGTTGAGAAAAACAATAGATCTCTCTGTGTTTATGGTCTATTTGGAGGAAAAAAGGAGAATGAGGATAAGAGTGATGATGCACCTTCAA
AGGCAGGAATCTTTGGAAACATGCAGAAGTTATATGAGACTGTGAGGACAGCGCAACAGGTTGTCCAAGTAGAGGCAGTGCGTGTACAGAAAGAACTTGCGGCGGCAGAG
TTTGATGGCTACTGCGAAGGAGAGCTAATAAAGGTGACATTATCCGGGAATCAGCAACCTCTTCGCACTGAGATCACCGAGGCTGCAATGGAATTAGGACCAGAAAAACT
GTCCCTTCTAGTCACTGAAGCATACCAGGATGCGCACCAGAAGAGCGTTCAGGCCATGAAGCAAAGAATGAGCGATCTTGCCCAGAGCTTAGGAATGCCCCAGGGCCTCA
GTGAGGGATTGAAGTAG
Protein sequence
MASTISLSAQVPNLRGVSDWKKRSNLNSMSNIVGVRNSPYGPWKVEKNNRSLCVYGLFGGKKENEDKSDDAPSKAGIFGNMQKLYETVRTAQQVVQVEAVRVQKELAAAE
FDGYCEGELIKVTLSGNQQPLRTEITEAAMELGPEKLSLLVTEAYQDAHQKSVQAMKQRMSDLAQSLGMPQGLSEGLK
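
The protein sequence corresponds to a translation of the CDS sequence. As a rough consistency check, the sketch below translates a nucleotide string with the standard genetic code (NCBI translation table 1), which is assumed here because the record does not state which table was used. The names `CODON_TABLE` and `translate` are illustrative, and only the first 51 bases of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 51 bases of the CDS sequence listed in this record.
cds_start = "ATGGCGTCGACAATATCTCTGAGTGCTCAAGTACCAAATCTGCGAGGAGTT"
print(translate(cds_start))  # MASTISLSAQVPNLRGV
```

Passing the full CDS (the wrapped lines joined together) should yield a string that can be compared directly with the protein sequence in this record.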