CuGenDBv2

MC11g1179 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC11g1179
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Glutaredoxin-dependent peroxiredoxin
Genome location: MC11:10805154..10807526
RNA-Seq Expression: MC11g1179
Synteny: MC11g1179
Gene Ontology terms:
  GO:0098869 - cellular oxidant detoxification (biological process)
  GO:0016209 - antioxidant activity (molecular function)
  GO:0016491 - oxidoreductase activity (molecular function)
InterPro domains:
  IPR013740 - Redoxin
  IPR013766 - Thioredoxin domain
  IPR036249 - Thioredoxin-like superfamily
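
The genome location field above can be split into its sequence identifier and coordinates programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which the page itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span in base pairs.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return abs(end - start) + 1


seqid, start, end = parse_location("MC11:10805154..10807526")
print(seqid, start, end, span_length(start, end))
```

Under the stated coordinate assumption, the span for MC11g1179 works out to 2373 bp.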


Homology
GenBank top hits (e value, % identity, alignment)
XP_008461947.1 PREDICTED: uncharacterized protein LOC103500426 [Cucumis melo] (e value: 3.25e-152; % identity: 85.83)
Query:  MAATAAT--VSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLV
        MAATA++  VSLI +PFLP PFR + + + I P+P  SK+RFQ  P  LP RR+L+LRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLV
Subjt:  MAATAAT--VSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLV

Query:  MFICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHG
        MFICNHCPFVIHLKKDIVKLSNFYMKKGLAV AISSNSV THPQDGPEFMAEDAKAFSYPFPYLYD SQ+VARDF AVCTPEFFLFKKDGRRP+ELVYHG
Subjt:  MFICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHG

Query:  QFDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        QFDDSRPSNN P+TGRDLSLALDCVLSGQPVSS QKPSVGCSIKWHP
Subjt:  QFDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

XP_011659125.1 uncharacterized protein LOC101213663 isoform X1 [Cucumis sativus] (e value: 8.59e-150; % identity: 84.36)
Query:  ATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMFIC
        A++++ SLI  PFLP PFR + + +QI P+P   K+RFQ  P  LP RR+L+LRCAR ESKG+SLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMFIC
Subjt:  ATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMFIC

Query:  NHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQFDD
        NHCPFVIHLKKDIVKLSNFYMKKGLAV AISSNSV THPQDGPEFMAEDAKAFSYPFPYLYD SQ+VARDF AVCTPEFFLFKKDGRRP+ELVYHGQFDD
Subjt:  NHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQFDD

Query:  SRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        SRPSNN P+TGRDLSLALDCVLSGQPVSS QKPSVGCSIKWHP
Subjt:  SRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

XP_022143688.1 uncharacterized protein LOC111013532 isoform X1 [Momordica charantia] (e value: 5.63e-180; % identity: 100)
Query:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF
        MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF
Subjt:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF

Query:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
        ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
Subjt:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF

Query:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
Subjt:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

XP_022143689.1 uncharacterized protein LOC111013532 isoform X2 [Momordica charantia] (e value: 4.81e-152; % identity: 88.98)
Query:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF
        MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALL   
Subjt:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF

Query:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
                                KGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
Subjt:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF

Query:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
Subjt:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

XP_038897890.1 uncharacterized protein LOC120085778 [Benincasa hispida] (e value: 1.24e-153; % identity: 86.12)
Query:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF
        MAATA+++S+I +PFL  PFR +   +QI  +P  SK+RFQ  PL LP RRTLVLRCARTESKGVSLGFRAPNFELPEPLTGK+WKLEDFEPYPALLVMF
Subjt:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF

Query:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
        +CNHCPFVIHLKKDIVKLSNFYMKKGLAV AISSNSV THPQDGPEFMAEDAKAFSYPFPYLYD SQ+VARDFGAVCTPEFFLFKKDGRRP+ELVYHGQF
Subjt:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF

Query:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        DDSRPSNN P+TGRDLSLALDCVLSGQPVSS QKPSVGCSIKWHP
Subjt:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K9N9 Glutaredoxin-dependent peroxiredoxin (e value: 4.16e-150; % identity: 84.36)
Query:  ATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMFIC
        A++++ SLI  PFLP PFR + + +QI P+P   K+RFQ  P  LP RR+L+LRCAR ESKG+SLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMFIC
Subjt:  ATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMFIC

Query:  NHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQFDD
        NHCPFVIHLKKDIVKLSNFYMKKGLAV AISSNSV THPQDGPEFMAEDAKAFSYPFPYLYD SQ+VARDF AVCTPEFFLFKKDGRRP+ELVYHGQFDD
Subjt:  NHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQFDD

Query:  SRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        SRPSNN P+TGRDLSLALDCVLSGQPVSS QKPSVGCSIKWHP
Subjt:  SRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

A0A1S3CFR0 Glutaredoxin-dependent peroxiredoxin (e value: 1.57e-152; % identity: 85.83)
Query:  MAATAAT--VSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLV
        MAATA++  VSLI +PFLP PFR + + + I P+P  SK+RFQ  P  LP RR+L+LRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLV
Subjt:  MAATAAT--VSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLV

Query:  MFICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHG
        MFICNHCPFVIHLKKDIVKLSNFYMKKGLAV AISSNSV THPQDGPEFMAEDAKAFSYPFPYLYD SQ+VARDF AVCTPEFFLFKKDGRRP+ELVYHG
Subjt:  MFICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHG

Query:  QFDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        QFDDSRPSNN P+TGRDLSLALDCVLSGQPVSS QKPSVGCSIKWHP
Subjt:  QFDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

A0A6J1CQ28 uncharacterized protein LOC111013532 isoform X2 (e value: 2.33e-152; % identity: 88.98)
Query:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF
        MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALL   
Subjt:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF

Query:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
                                KGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
Subjt:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF

Query:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
Subjt:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

A0A6J1CRB6 Glutaredoxin-dependent peroxiredoxin (e value: 2.73e-180; % identity: 100)
Query:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF
        MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF
Subjt:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF

Query:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
        ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
Subjt:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF

Query:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
Subjt:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

A0A6J1IWZ6 Glutaredoxin-dependent peroxiredoxin (e value: 5.26e-148; % identity: 84.08)
Query:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF
        MAAT  ++S I +PFLP  FRS A  +QI P+P  SK+RFQ  P RL TRR+ V+RCARTESK V+LG RAP+FELPEPLTGKVWKLEDFEPYPALLVMF
Subjt:  MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMF

Query:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF
        ICNHCPFVIHLKKDIVKLSNFYMKKGLAV AISSNSVATHPQDGPEFMAE+AKAF YPFPYLYD SQ+VARDFGAVCTPEFFLFKK GRRP+ELVYHGQF
Subjt:  ICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQF

Query:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        DDSRPSN+ PVTGRDLSLALDCVLSGQPVSS QKPSVGCSIKWHP
Subjt:  DDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G21350.1 Thioredoxin superfamily protein (e value: 6.8e-75; % identity: 87.07)
Query:  MFICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHG
        MFICNHCPFVIHLKKDIVKL NFYMKKGLAV AISSNSV THPQDGPEFMAEDAK F YPFPYLYD SQ+VAR+FGAVCTPEFFL+KKDGRRP+ELVYHG
Subjt:  MFICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHG

Query:  QFDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        QFDDSRPS+N PVTGRDLSLA+D  LS QP+ S QKPSVGCSIKWHP
Subjt:  QFDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

AT1G21350.2 Thioredoxin superfamily protein (e value: 1.2e-76; % identity: 61.38)
Query:  AATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQ--LQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVM
        A+T  T    GA  + P   + A + +   +   S+L F    + +  P+ R LV+R ARTES GV LG RAPNFELPEPLTG +WKLEDFE YP+LL  
Subjt:  AATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQ--LQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVM

Query:  FICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQ
                                 KGLAV AISSNSV THPQDGPEFMAEDAK F YPFPYLYD SQ+VAR+FGAVCTPEFFL+KKDGRRP+ELVYHGQ
Subjt:  FICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQ

Query:  FDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        FDDSRPS+N PVTGRDLSLA+D  LS QP+ S QKPSVGCSIKWHP
Subjt:  FDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP

AT1G21350.3 Thioredoxin superfamily protein (e value: 9.8e-98; % identity: 71.95)
Query:  AATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQ--LQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVM
        A+T  T    GA  + P   + A + +   +   S+L F    + +  P+ R LV+R ARTES GV LG RAPNFELPEPLTG +WKLEDFE YP+LLVM
Subjt:  AATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQ--LQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVM

Query:  FICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQ
        FICNHCPFVIHLKKDIVKL NFYMKKGLAV AISSNSV THPQDGPEFMAEDAK F YPFPYLYD SQ+VAR+FGAVCTPEFFL+KKDGRRP+ELVYHGQ
Subjt:  FICNHCPFVIHLKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQ

Query:  FDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP
        FDDSRPS+N PVTGRDLSLA+D  LS QP+ S QKPSVGCSIKWHP
Subjt:  FDDSRPSNNVPVTGRDLSLALDCVLSGQPVSSEQKPSVGCSIKWHP


Sequences
CDS sequence
ATGGCCGCTACTGCTGCCACTGTTTCGTTAATCGGCGCTCCATTTCTTCCCCCGCCATTCAGGTCCGCCGCCGCAAACCTTCAAATTGCCCCAACGCCATGGCTCTCCAA
ACTCAGGTTTCAGCTCCAGCCTCTGCGTCTTCCAACGCGGAGAACCCTCGTTCTTCGATGCGCTCGAACCGAATCCAAAGGGGTCTCCTTAGGCTTCCGAGCCCCAAATT
TTGAGCTTCCTGAACCTCTTACAGGCAAAGTATGGAAGTTGGAGGATTTTGAACCCTATCCTGCTTTACTGGTCATGTTTATTTGCAATCATTGCCCATTTGTTATACAC
TTGAAAAAAGATATTGTGAAACTTTCAAATTTCTATATGAAGAAAGGACTGGCTGTTGCAGCCATATCTTCAAACTCTGTAGCTACGCATCCACAGGATGGACCAGAGTT
CATGGCAGAAGATGCAAAAGCATTCAGTTATCCCTTTCCATATTTATATGATGCATCACAGAAAGTTGCAAGGGATTTCGGTGCAGTCTGCACACCAGAGTTTTTTCTGT
TCAAAAAGGATGGGCGAAGGCCATATGAGTTGGTTTATCATGGTCAGTTTGACGATTCTCGACCGAGTAATAACGTGCCTGTCACTGGGAGGGATTTAAGTTTGGCATTA
GATTGTGTTCTCAGTGGCCAACCAGTATCTTCGGAGCAGAAACCGAGTGTTGGATGTAGCATAAAGTGGCATCCTTAG
mRNA sequence
GGAAAGTAAGTAGTGAGTTATGAAGCAACGGCCACGTACTGCCCTGTTCGAACCTTATCAAACATGGCCGCTACTGCTGCCACTGTTTCGTTAATCGGCGCTCCATTTCT
TCCCCCGCCATTCAGGTCCGCCGCCGCAAACCTTCAAATTGCCCCAACGCCATGGCTCTCCAAACTCAGGTTTCAGCTCCAGCCTCTGCGTCTTCCAACGCGGAGAACCC
TCGTTCTTCGATGCGCTCGAACCGAATCCAAAGGGGTCTCCTTAGGCTTCCGAGCCCCAAATTTTGAGCTTCCTGAACCTCTTACAGGCAAAGTATGGAAGTTGGAGGAT
TTTGAACCCTATCCTGCTTTACTGGTCATGTTTATTTGCAATCATTGCCCATTTGTTATACACTTGAAAAAAGATATTGTGAAACTTTCAAATTTCTATATGAAGAAAGG
ACTGGCTGTTGCAGCCATATCTTCAAACTCTGTAGCTACGCATCCACAGGATGGACCAGAGTTCATGGCAGAAGATGCAAAAGCATTCAGTTATCCCTTTCCATATTTAT
ATGATGCATCACAGAAAGTTGCAAGGGATTTCGGTGCAGTCTGCACACCAGAGTTTTTTCTGTTCAAAAAGGATGGGCGAAGGCCATATGAGTTGGTTTATCATGGTCAG
TTTGACGATTCTCGACCGAGTAATAACGTGCCTGTCACTGGGAGGGATTTAAGTTTGGCATTAGATTGTGTTCTCAGTGGCCAACCAGTATCTTCGGAGCAGAAACCGAG
TGTTGGATGTAGCATAAAGTGGCATCCTTAGATGAAGCTATAATCACTAATCTTATTCTCTGCTGCCTGAAGATCAAATCTCCGAGCAAAGGCGTTAAATGCAGCACACA
AACTTCCCAACCGAACCAATAAGCATTTTGATTTTCTGAATCATTCGATCCATCCTACTCCTGTGATATTTTCTTAACTTCTCTTTTTTCCTTTTATAGCGATAATTCTT
TTTGTATTTATGTCAAATAAAGAGGAAAAAAAGGCTATTGTTTTATTATTTCTCATCCAAGTATAATTTAATTTTAATCGAGTGTAACGTGGTATTTTTCATATGAATAT
GTAAGAGTAAATTTTAGTCTAGCTGCTTTCATTGCTTTATAGTGGGGTCCAAATCATTTTGTGTAGAAAGCAATCTTTTGTACTAGATGTTGATTCATGACATAGAATTT
TTAATTTTTTTATAATTGGGAGTCAGAGCTTTGCTCCCCTATACTCAAG
Protein sequence
MAATAATVSLIGAPFLPPPFRSAAANLQIAPTPWLSKLRFQLQPLRLPTRRTLVLRCARTESKGVSLGFRAPNFELPEPLTGKVWKLEDFEPYPALLVMFICNHCPFVIH
LKKDIVKLSNFYMKKGLAVAAISSNSVATHPQDGPEFMAEDAKAFSYPFPYLYDASQKVARDFGAVCTPEFFLFKKDGRRPYELVYHGQFDDSRPSNNVPVTGRDLSLAL
DCVLSGQPVSSEQKPSVGCSIKWHP
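
The relationship between the CDS and protein sequences listed above can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not taken from CuGenDB. To keep the example short, only the first 21 and last 15 nucleotides of the CDS are used, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in
# T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the CDS sequence in this record.
CDS_START = "ATGGCCGCTACTGCTGCCACT"  # first 21 nucleotides
CDS_END = "AAGTGGCATCCTTAG"          # last 15 nucleotides, including the stop

print(translate(CDS_START))  # MAATAAT
print(translate(CDS_END))    # KWHP
```

Both fragments agree with the start (MAATAAT) and end (KWHP) of the protein sequence above. Passing the full CDS to `translate` would allow the same comparison over the whole protein.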