; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc10G00500 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc10G00500
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPeptide methionine sulfoxide reductase MrsB
Genome locationClcChr10:536171..543681
RNA-Seq ExpressionClc10G00500
SyntenyClc10G00500
Gene Ontology termsNA
InterPro domainsIPR011057 - Mss4-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146165.1 uncharacterized protein At4g08330, chloroplastic [Cucumis sativus]1.0e-6898.41Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGY+YDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIKALRITSES
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

XP_008448515.1 PREDICTED: uncharacterized protein LOC103490667 [Cucumis melo]7.3e-6796.03Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIYSC+ECGTNLNLNS+HLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKC SCGHLVGYIYDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIKALRITSE+
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

XP_022965337.1 uncharacterized protein LOC111465232 isoform X2 [Cucurbita maxima]1.3e-6391.27Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIYSC ECG NLNLNS+HLFPPDFYFEAGNKGTLSFS IDSTKFRLEKEDKLRPFFET+NYWGIQRKRTK+KCNSC  LVGY+YDDGPPLTDSPGQYH
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIK LRI+SES
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

XP_023551931.1 uncharacterized protein LOC111809759 isoform X2 [Cucurbita pepo subsp. pepo]1.4e-6289.68Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIY C ECG NLNLNS+HLFPPDFYFEAGNKGTLSFS IDSTKFRLEKEDKLRPFFET+NYWGIQRKRTK+KCNSC  LVGY+YDDGPPLT+SPGQYH
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIK LRI+SES
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

XP_038903300.1 uncharacterized protein LOC120089928 [Benincasa hispida]1.5e-6796.83Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIYSC ECGTNLNLNS+HLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFET+NYWGIQRKRTKLKCNSCGHLVGY+YDDGPPLTDSPGQYH
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIKALRITSES
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

TrEMBL top hitse value%identityAlignment
A0A0A0L4W0 Uncharacterized protein5.0e-6998.41Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGY+YDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIKALRITSES
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

A0A1S3BJW2 uncharacterized protein LOC1034906673.6e-6796.03Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIYSC+ECGTNLNLNS+HLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKC SCGHLVGYIYDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIKALRITSE+
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

A0A5B7CDG7 Uncharacterized protein3.4e-6287.3Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIYSCTECGTNLNL++ HLFPPDFYFEAGNKGTLSFS ID TKF+LEKEDK+RPFFETLNYWGIQRKRTK+ CNSCG +VGY+YDDGPPLTDSPGQ+H
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFK KALRI+SE+
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

A0A6J1E6R2 uncharacterized protein LOC1114312099.1e-6389.68Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIY C ECG NLNLNS+HLFPPDFYFEAGNKGTLSFS IDSTKFRLEKEDKLRPFFET+NYWGIQR RTK+KCNSC  LVGY+YDDGPPLTDSPGQYH
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIK LRI+SES
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

A0A6J1HK24 uncharacterized protein LOC111465232 isoform X26.3e-6491.27Show/hide
Query:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH
        MASIYSC ECG NLNLNS+HLFPPDFYFEAGNKGTLSFS IDSTKFRLEKEDKLRPFFET+NYWGIQRKRTK+KCNSC  LVGY+YDDGPPLTDSPGQYH
Subjt:  MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYH

Query:  FGPSQVIPRAPRYRFKIKALRITSES
        FGPSQVIPRAPRYRFKIK LRI+SES
Subjt:  FGPSQVIPRAPRYRFKIKALRITSES

SwissProt top hitse value%identityAlignment
Q9STN5 Uncharacterized protein At4g08330, chloroplastic1.4e-0730.89Show/hide
Query:  YSCTECGTNLNLNSTHLFPPDF---YFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYHF
        YSC  CG  LNL+ST+         Y ++   G +SF  ID  +F    E +  P F   + WG+ R RTKL C  C + +G    +  P      Q   
Subjt:  YSCTECGTNLNLNSTHLFPPDF---YFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYHF

Query:  GPSQVIPRAPRYRFKIKALRITS
          S  I    +Y  +I++L+ +S
Subjt:  GPSQVIPRAPRYRFKIKALRITS

Arabidopsis top hitse value%identityAlignment
AT2G17705.1 unknown protein6.2e-5674.4Show/hide
Query:  ASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYHF
        ++IY+C ECG++LNLN   LFPPDFYFEAGNKGTLSF+ +D+ KFR EKEDK+ PFFETLNYWGIQRKRTK+KC SC HL+GYIYDDGPPLT   GQY F
Subjt:  ASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYHF

Query:  GPSQVIPRAPRYRFKIKALRITSES
        GPSQVIPRAPRYRFK KA++++S++
Subjt:  GPSQVIPRAPRYRFKIKALRITSES

AT4G08330.1 unknown protein9.7e-0930.89Show/hide
Query:  YSCTECGTNLNLNSTHLFPPDF---YFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYHF
        YSC  CG  LNL+ST+         Y ++   G +SF  ID  +F    E +  P F   + WG+ R RTKL C  C + +G    +  P      Q   
Subjt:  YSCTECGTNLNLNSTHLFPPDF---YFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYHF

Query:  GPSQVIPRAPRYRFKIKALRITS
          S  I    +Y  +I++L+ +S
Subjt:  GPSQVIPRAPRYRFKIKALRITS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCATCTATTCCTGTACAGAATGCGGAACAAATCTGAATCTGAATTCCACTCATCTCTTCCCGCCGGATTTCTATTTCGAGGCCGGAAACAAGGGCACCCTTTC
CTTTTCCTTCATCGACTCCACCAAGTTCAGGCTCGAGAAAGAGGACAAGCTCCGACCATTCTTCGAGACCCTCAACTATTGGGGAATCCAGCGCAAACGGACCAAGCTCA
AGTGCAATTCCTGTGGCCATCTCGTCGGTTATATTTACGACGATGGGCCTCCTCTTACCGACAGTCCAGGTCAGTACCACTTCGGACCTAGCCAGGTTATTCCCCGAGCT
CCCAGGTACAGATTCAAGATCAAGGCTCTACGGATTACCTCAGAGAGTTGA
mRNA sequenceShow/hide mRNA sequence
GGCAAATAACAAGAACAAGAGCTGTAATTTGGAGCAGTACAAAAGTCCACTTCCCAATTCGTATCGATGACGACGACTTTGTATCAATTAACCTTCCTTCAGTCTTCGTT
TCTTCAAACTTGGGGAATACGGTTGAACAATCCCACTTCTTATTCTTAGCGATTCAAACCCAGTTACAAAATCTCTCTCCCTTTCTCTCCTCGAAGCTTCTGAATTTCTC
ATCAACAATGGCTTCCATCTATTCCTGTACAGAATGCGGAACAAATCTGAATCTGAATTCCACTCATCTCTTCCCGCCGGATTTCTATTTCGAGGCCGGAAACAAGGGCA
CCCTTTCCTTTTCCTTCATCGACTCCACCAAGTTCAGGCTCGAGAAAGAGGACAAGCTCCGACCATTCTTCGAGACCCTCAACTATTGGGGAATCCAGCGCAAACGGACC
AAGCTCAAGTGCAATTCCTGTGGCCATCTCGTCGGTTATATTTACGACGATGGGCCTCCTCTTACCGACAGTCCAGGTCAGTACCACTTCGGACCTAGCCAGGTTATTCC
CCGAGCTCCCAGGTACAGATTCAAGATCAAGGCTCTACGGATTACCTCAGAGAGTTGAAACATTTTGATTATCGCACACTAGGGTTTGAAAGTTCTAAGAATTGTTTCTG
TTTTTCAACTTTTGCATAGTTGATGGGTGTCTGTTTTTGCCATGGGGTTTGTAAATATTGTCTATATCGTCAGTACACACTACTGTACAAATACGTTTAACTTCAATAGA
ATGCTTGGGAAATATTCTGTTCATTTTATTGCTTTTGTTTGATATAGGAATAAAAAGCCCTTTTCTGGCTTTGCTATTGCCACCATCTGTTCATTCTTTATCACTATGTA
ATGTTCTTTGTCACTTGTATTCTTCCCCATTTTATCAGCCCTTTGAATCCTTTAACATTCACACGAGCTGGAAAACAAAAAGGAAATTTTCTTCATATCCTGGATGGATG
CTTGTAACTTTTGTGTAGAAACGTGGAGAACCTGATTCTTGTAATTTACATTAAGAAGAGACATGCCCTGACCATTCAAGCTGGCCTTGTTAATGACTCAGTCTTGTGTG
CTGATCAAATTTCTTATCACATAATGTAGAATTCCTCCATGATCAAAATAAGTGAGCTCAACCTGCATCCATTTAAACCAATGTCTGAATTTTCCTAATCTTAACAAGGC
CAAGATTTTGATGATAATGATATAAGCAAGTTGATCTGAAATACCTCTGTGTCAAATCTTAGCGTGCATTGAAAGGACTTCCTATTTTCAGCTTCTTCTGTTCTCACAGT
TATATCCTGAAATGGTTTGAGATCCTTGATGCTGTTTGGAATTTCAATATTATAGCGCTCTCGACCAGTCAGTCCAAGAGATTCGGCATCTTCTCCTTGCTTAAAGCAGA
GAGGAATTACTCCCATTCCCACCAAATTACTGCGATGAGTATGCTCAAAGCTTTTAGCAATCACTGCTTTTACTCCCTGTCACCGAATAAAATACTTTAGCTATCACTAG
AAAAAGCAAATACAAATGCTCAATTGAATACAGAAGCTGTTGGAAACTTACGAGAAGCATTGGTCCTTTAGCAGCCCAATCCCGGGAACTCCCACTTCCGTAATCAGCTC
CCGCAAGAACAACGGTATCATGACCCTCACTCTTGTATCTCTGTTAATTCAGTATTGTGTCAAGAGTAAAATATACCCATCAACTTAATTGACTGAAAAGAGGATAAAAG
GAAACAAACAACAGACTTCTTGCTGAAGATGAAGGAAAGATATCTGTACCATGGCAACATCAAAGATGGAAAGCTTCGCTCCAGTGGGGAAGTGTATGGTTTTAGGTCCA
ACCTCTCCATTTAATAGTTTATTGACAAGTCTTATATTGGCAAAAGTACCACGGATCATCACCTCATGGTTTCCCCGACGACTGCCGTAAGAGTTGAAGTCTTTGCGATT
AACCCCGTGGTCCATTAGGTATCTGGCGGCTGGGCTGTCTTTGTGTATGCTGCCAGATGGTGAGATATGGTCTGTCGTAATGCTGTCTTCTAAGCTGAGCACACAGTAAG
CATGCTTCACTCCATGAGGTCCAGGAGGAGTCATAGTCATGCCTTTGAAGAAAGGTGGTTTTAGTATATAAGTTGATGATGGGTCCCAATTGTACAAAGTTCCAGGAGGC
ACAACTAATTGATTCCACATCGAGTTTCCTTGGTTGATTGCTTTGTATACTTCTTGAAACATATTGGACAAGACATTTGATTCTACAACCTATGGCGAAAAACAAAAAGG
AAAATTTTTGCAAGAATTCAAACCTGAGTTATGAACAGTTCCAAGTGAACCTAATCCCGCTTAACATTCTTCAGCATTGTGGGTTTGCAAACTTACATGTGCTGTTTCTT
CACTTGACGGCCAAATATCCCTGAGAAATATCTTCTTTCCTTCCTTTCCAACTCCAATAGGTTCCTTCTCAAAATCAAAGTTCACCTATTGCAGCAGTAGTTCAATATCA
ACTAGTTAAATACAATAACAAGTCATCTAGTTTTTTAGGACGAGAATGCTCCTCTTTTCAATATCTTTCGGGGTCATTTTTAATGGTGTAGTCAACCTTAAAAAAAACAC
AGAGATCAGAAATTGAGTTGGATGTTATTTGATTTGCAATTTGAAATATCATATTTTTTTCTATGAAATGACTTTAGACCATCTAAACCTTACCGTGCCAGCAAGAGCAT
AGGCAACCACGAGGGGAGGAGAGGCAAGATAATTGGCCCTTGTCAAAGGGTGTACACGACCCTCAAAGTTCCTGTTTCCAGACAGAACAGCTGCTGCTACCATATCTGCA
AATATACAACATTAGTGATTCAATTGCTTACCTGAGCTAGAGAGGCTAAAAGGGTACAAGTGGGAACAACTAAACTTTACCATTTTCAGAAATTGCAGAGGCCACTTCTT
CATCTAGATCGCCTGAATTACCAATGCATGTTGTGCATCCATACCCAACAGTGTTGAACCCCAGCTGATCTAAATACTTCTGCAATCCACTGCAGAAATGCAATTCTTGA
CTTGAGAAATATAACCTCCATACAACAATAACAACAAGAAAGGACAGAAAAACTCTCAATCAAAGTATAATTGAATTATAATGGTTCATTCTATGTTGAATTTGTTGATT
CAATTTTTATTTCTCATTTAACAACGGAAGCATTTGGAGAGTTTGATAGTGGCACCTCTTTTCCAAGTATTTGGTTACCACTCTTGAACCTGGAGCAAGACTGGTCTTTA
TCCAAGGCTTAACCTGTGCAGAAGGTCAAATATCAATGGTGTGTTTGGGAATATATTTAATAAAATGTTTCATTTTGTTTGTCATTGTTCTCCTGAAATGAATTATTGAT
GCACGGGCAACGCAAGGAGGTTGAAACAAGGGAATAACTTGCACACCTCTAAACCAAGTTCACAAGCTTTCTTGGCAACCAAAGCAGCTCCAAGCATTATACTAGGATTG
GAGGTATTGGTGCAGCTAGTAATAGCTGCTATTACAACATCACCATGCCTAAGCTCAGCTGTTGTTCCATGAAAACTGAACTCCACAACCTTAGTTTGTGATTCCTTTGG
TATAGCAAAGCCCTGCACAACCAATTATTTCAGAAGCATGTACAGCTCAAGACAGCACATTAGAAAGGAAAAACTTCCTATCTGAAAGATAAATGAAAGTTGTTAAACCT
TAAATCCAACCCTATTGTTTAGACATGACTTCCAATCTGCTTTCATTTCTTGCAAAGGGATCCGATCATGTGGCCTGTGAAGCATTACATCCCCAGAGGAAGATCATGAA
TTATACCGAATAGATCATACTGTCAGACAGACAGTTGAAGCAGAAAAGGACAATATTATGGGATACCTTTCCAACTGCTGGGGTTGAAGTCTATTTTCCTTCAGTATCTT
ATAGAGCATCATAATTTTCAGAATTTAAAGTTATAGATCATAACGTGAAATAAAAGAGATTTCATTTTGATATTGATGGTATACTCAACTGGAAATGAGATCTGTTAGAG
TCTAATACTTCCATTTTTGGGTTTGTCATGAAAATGAAGTCTACATTGGGAAGGACCTGAAACTGTACCTTTTAGGGCCAGAAACACATGGTTCGACGTCCTGAAGATTG
AGCTCGAGATGAGAGGAGTAAACTCTCTCTTCCAGTTGCTGCTTCAATCAGTAAACTAGTATCAGTTTAAATAAAACAATCTATTAGATTGTGTAAAGTTAGAAATTAAG
TTCGACTTACCTCATTATAGTCCACAAACATTTTATTAGCTCTCAAGTAAGACTCTACCATACAAATCTAAATAAAAACTTTACTTCAGTCATAATTGACATATTAAGTG
CAATGAACACCCCGCCCCTCCAAGAGTAAATAGTTGACTAACAGTTTCATCAGTTCTGCCAGTGAGTTTCAGATACTGCAAAGTGACATGATCAACAGGGAAGAAACCCA
TTGTTGCACCATAGTCAGGAGACATATTACCAATGGTGGCTCTGTCAGCTAATGAAAGTTCACTCATGCCTTCTCCTGCAAAAGAAAATAACAGATGAATATATGGGGCA
GGGGATTTGAACTTCCGACCTAGACTCGAAGGTGTAAGCGTTAACCAATTGGGTTATACGTAGGTTTGTTGCTCACCAATGGATTAACCTTGTTGGAGTCGCCACCAATG
TTGTTGACTGCGTCTCTCATGCAAGCAAGATCAATAACAGCAGGCACTCCGGACTCTAGCCGGCTTAAATGGGATTTCAACCTGTCTGGGACAAGTCTTTTCCCAATCAA
GAATCTTCTCAACATCTTCAGATTTCACCTGAAATTCATCGCAATTGCGTATTGCCGATTCCAGAAGCACTCTAATTGAAAATGGCAGCCTCTCTGCAATGCAACACAGC
ATACCCAGATTTAGACTTGCTTAATTTGCCTATATTGTAAGCCTCTATTTGAAACGACTTTTCGAACCCTTAAAAATTCATTCCAAATCGACTTGACAAATTCACAAATC
ACACAGTTAGCTCATGCGGATTATAGTTCTTCTTTTCTGAAATAGAGGTGGATTTGATAGAAATGGAAAAAGGAAGGGACAAAGGCTGACCAATTCGAGGATCTTGTAGA
AGAGGCAAAGAGAAATATGAGGCCAAGTGACCACCTTCTGGCTTCTCAAGCTTGTTTATTAGTCTGTTGAATGGATTTTCCAGTCCTTTCATGACTTGAAGATTGAGAGA
AAAAAACAAAAACAGAGACAAAGATCTCAGAGAAAGAGGCTGCTTCTGTTGGCGGTGAAGCCAAGCCTCCTAGTCTTACCCAATTCACTTAGCGGTTGTTTTGTCTATAC
ATTCTAAGTTCCCACATGTCCATCTTTTAACACCACTACTGAAGTATTAAACGAATAGTCCTCAACTTGGACATTTGCATTCTTTTAAAAATTGCAATATTATATCGTTC
TTGACGTT
Protein sequenceShow/hide protein sequence
MASIYSCTECGTNLNLNSTHLFPPDFYFEAGNKGTLSFSFIDSTKFRLEKEDKLRPFFETLNYWGIQRKRTKLKCNSCGHLVGYIYDDGPPLTDSPGQYHFGPSQVIPRA
PRYRFKIKALRITSES