; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG04G005490 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG04G005490
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionCysteine dioxygenase
Genome locationCG_Chr04:19197093..19201329
RNA-Seq ExpressionClCG04G005490
SyntenyClCG04G005490
Gene Ontology termsGO:0017172 - cysteine dioxygenase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR011051 - RmlC-like cupin domain superfamily
IPR012864 - Cysteine oxygenase/2-aminoethanethiol dioxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041769.1 plant cysteine oxidase 2-like [Cucumis melo var. makuwa]9.3e-5889.6Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCK-----RRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEH
        MTVFSKLLLGKMHIKSYDWVDPT+SDD AQPC+     RRLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEH
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCK-----RRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEH

Query:  PYASFPNGDMGLGEEEGEGYGWRGE
        PYASFPNGDMGLGEEEGEGY W  E
Subjt:  PYASFPNGDMGLGEEEGEGYGWRGE

TYK27060.1 plant cysteine oxidase 2-like [Cucumis melo var. makuwa]1.3e-5993.33Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLLLGKMHIKSYDWVDPT+SDD AQPC+RRLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEHPYASF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEGYGWRGE
        PNGDMGLGEEEGEGY W  E
Subjt:  PNGDMGLGEEEGEGYGWRGE

XP_004149110.1 plant cysteine oxidase 1 [Cucumis sativus]1.9e-5891.74Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLLLGKMHIKSYDWVDPT+SDD AQPC++RLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEHPYASF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEE-EGEGYGWRGE
        PNGDMGLGEE +GEGYGW  E
Subjt:  PNGDMGLGEE-EGEGYGWRGE

XP_008442017.1 PREDICTED: plant cysteine oxidase 2-like [Cucumis melo]1.7e-5993.33Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLLLGKMHIKSYDWVDPT+SDD AQPC+RRLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEHPYASF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEGYGWRGE
        PN DMGLGEEEGEGYGW  E
Subjt:  PNGDMGLGEEEGEGYGWRGE

XP_038883864.1 plant cysteine oxidase 2-like [Benincasa hispida]7.1e-5891.67Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLLLGKMHIKSYDWVD T+SDDPAQPC++RLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEGYGWRGE
         NG+M LGEEEGEGYGW  E
Subjt:  PNGDMGLGEEEGEGYGWRGE

TrEMBL top hitse value%identityAlignment
A0A0A0KWK6 Cysteine dioxygenase9.0e-5991.74Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLLLGKMHIKSYDWVDPT+SDD AQPC++RLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEHPYASF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEE-EGEGYGWRGE
        PNGDMGLGEE +GEGYGW  E
Subjt:  PNGDMGLGEE-EGEGYGWRGE

A0A1S3B5D6 Cysteine dioxygenase8.2e-6093.33Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLLLGKMHIKSYDWVDPT+SDD AQPC+RRLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEHPYASF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEGYGWRGE
        PN DMGLGEEEGEGYGW  E
Subjt:  PNGDMGLGEEEGEGYGWRGE

A0A5A7TK65 Cysteine dioxygenase4.5e-5889.6Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCK-----RRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEH
        MTVFSKLLLGKMHIKSYDWVDPT+SDD AQPC+     RRLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEH
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCK-----RRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEH

Query:  PYASFPNGDMGLGEEEGEGYGWRGE
        PYASFPNGDMGLGEEEGEGY W  E
Subjt:  PYASFPNGDMGLGEEEGEGYGWRGE

A0A5D3DUS9 Cysteine dioxygenase6.2e-6093.33Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLLLGKMHIKSYDWVDPT+SDD AQPC+RRLAKLKAD VFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDV+GPPYSMEDGRDCSYYKEHPYASF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEGYGWRGE
        PNGDMGLGEEEGEGY W  E
Subjt:  PNGDMGLGEEEGEGYGWRGE

A0A6J1HS36 Cysteine dioxygenase1.7e-5789.17Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLLLGKMHIKSYDWVDPT++DDPAQP ++RLAKLKADTVFTSPCSTSVLYPT+GGNIHSFTA+TPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEGYGWRGE
        PNG++ L EEEGEGYGW  E
Subjt:  PNGDMGLGEEEGEGYGWRGE

SwissProt top hitse value%identityAlignment
Q1G3U6 Plant cysteine oxidase 34.8e-2545.53Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDP----TDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHP
        M VFSK+L G +H+K+YDWV+P    T           RLAKL +D V T       LYP +GGN+H FTA+TPCAVLD++ PPY    GR CSYY ++P
Subjt:  MTVFSKLLLGKMHIKSYDWVDP----TDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHP

Query:  YASF--PNGDMGLGEEEGEGYGW
        +++F   NG   + E + + Y W
Subjt:  YASF--PNGDMGLGEEEGEGYGW

Q8LGJ5 Plant cysteine oxidase 21.7e-3359.17Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLL G MHIKSYDWV     D P      RLAK+K D+ FT+PC TS+LYP  GGN+H FTA T CAVLDVIGPPYS   GR C+YY ++P++SF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEGYGWRGE
            + + EEE EGY W  E
Subjt:  PNGDMGLGEEEGEGYGWRGE

Q9LXG9 Plant cysteine oxidase 14.1e-3258.2Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPY--A
        MTVFSKLL G MHIKSYDWV     D P +  K RLAKLK D+ FT+PC+ S+LYP  GGN+H FTAIT CAVLDV+GPPY   +GR C+Y+ E P    
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPY--A

Query:  SFPNGDMGLGEEEGEGYGWRGE
        S  + D+   EEE EGY W  E
Subjt:  SFPNGDMGLGEEEGEGYGWRGE

Q9LXT4 Plant cysteine oxidase 55.3e-2442.96Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDS--DDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYA
        MTV SKL+ G MH+KSYDW +P  S  DDP Q    R AKL  D   TSP   + LYPT+GGNIH F AIT CA+ D++ PPYS   GR C+Y+++ P  
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDS--DDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYA

Query:  SFPNGDMGLGEEEGEGYGWRGEERGGSKKRVVWFSSLESPFV
          P     +  E      W  EE       V+W      P +
Subjt:  SFPNGDMGLGEEEGEGYGWRGEERGGSKKRVVWFSSLESPFV

Q9SJI9 Plant cysteine oxidase 41.7e-2247.37Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTV SKL+ G MH+KSYDW++P    +P  P + R AKL  DT  T+    + LYP SGGNIH F AIT CA+LD++ PPYS E  R C+Y+++      
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEG
        P      GE E +G
Subjt:  PNGDMGLGEEEGEG

Arabidopsis top hitse value%identityAlignment
AT1G18490.1 Protein of unknown function (DUF1637)3.4e-2645.53Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDP----TDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHP
        M VFSK+L G +H+K+YDWV+P    T           RLAKL +D V T       LYP +GGN+H FTA+TPCAVLD++ PPY    GR CSYY ++P
Subjt:  MTVFSKLLLGKMHIKSYDWVDP----TDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHP

Query:  YASF--PNGDMGLGEEEGEGYGW
        +++F   NG   + E + + Y W
Subjt:  YASF--PNGDMGLGEEEGEGYGW

AT3G58670.1 Protein of unknown function (DUF1637)3.8e-2542.96Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDS--DDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYA
        MTV SKL+ G MH+KSYDW +P  S  DDP Q    R AKL  D   TSP   + LYPT+GGNIH F AIT CA+ D++ PPYS   GR C+Y+++ P  
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDS--DDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYA

Query:  SFPNGDMGLGEEEGEGYGWRGEERGGSKKRVVWFSSLESPFV
          P     +  E      W  EE       V+W      P +
Subjt:  SFPNGDMGLGEEEGEGYGWRGEERGGSKKRVVWFSSLESPFV

AT3G58670.2 Protein of unknown function (DUF1637)3.8e-2542.96Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDS--DDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYA
        MTV SKL+ G MH+KSYDW +P  S  DDP Q    R AKL  D   TSP   + LYPT+GGNIH F AIT CA+ D++ PPYS   GR C+Y+++ P  
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDS--DDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYA

Query:  SFPNGDMGLGEEEGEGYGWRGEERGGSKKRVVWFSSLESPFV
          P     +  E      W  EE       V+W      P +
Subjt:  SFPNGDMGLGEEEGEGYGWRGEERGGSKKRVVWFSSLESPFV

AT5G15120.1 Protein of unknown function (DUF1637)2.9e-3358.2Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPY--A
        MTVFSKLL G MHIKSYDWV     D P +  K RLAKLK D+ FT+PC+ S+LYP  GGN+H FTAIT CAVLDV+GPPY   +GR C+Y+ E P    
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPY--A

Query:  SFPNGDMGLGEEEGEGYGWRGE
        S  + D+   EEE EGY W  E
Subjt:  SFPNGDMGLGEEEGEGYGWRGE

AT5G39890.1 Protein of unknown function (DUF1637)1.2e-3459.17Show/hide
Query:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF
        MTVFSKLL G MHIKSYDWV     D P      RLAK+K D+ FT+PC TS+LYP  GGN+H FTA T CAVLDVIGPPYS   GR C+YY ++P++SF
Subjt:  MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASF

Query:  PNGDMGLGEEEGEGYGWRGE
            + + EEE EGY W  E
Subjt:  PNGDMGLGEEEGEGYGWRGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGTTTTCAGTAAGCTTCTGTTGGGGAAAATGCACATCAAATCGTACGATTGGGTTGATCCAACCGACAGTGATGATCCTGCTCAACCTTGTAAAAGGAGATTGGC
AAAGCTGAAAGCTGATACTGTCTTCACTTCACCCTGCAGTACCTCTGTTTTGTACCCAACATCAGGAGGCAACATCCACTCATTCACTGCTATAACGCCATGTGCAGTGC
TTGATGTGATTGGACCTCCTTATTCCATGGAGGACGGTCGAGATTGTTCTTATTATAAGGAACATCCCTATGCCTCTTTTCCAAACGGTGACATGGGACTTGGAGAAGAA
GAGGGTGAGGGTTATGGATGGAGAGGAGAAGAAAGGGGAGGCAGCAAGAAAAGGGTAGTCTGGTTTTCCTCTCTGGAGTCACCCTTCGTCGGCCATATTCTCGCATTTCC
CCGTCACTCAGTTGCAGCGCTCCAGCCTTCACGGTGCGGCCTTAATCGCAGTTCACAGCATTGTGGCATCGTCATAGTTGTCGCGCCGCAGCATCTTGGTCGTCGCGCCC
CAGCATCCCGGTCATCGTCTCGTCGCTTTGCCTCAGGTGCTTGGCGACGCATCATCAACCCTCGACGCATCATTAGCCCGCGACACATCATCAGTACGCGTTACAGCATC
ACCATAGTCCGCGATGCAACGTCATCCTGCATCGTCGCGTCGCCTGTCCTCTCCGCGACGCACTGTCACAGCCTTCGTAGCGCGTTGCCGCATTGTAAGGAAGTCGCTGT
CTCGCTGACACGCTTAGTATCGCTCGGTAACACTCGTATCGCTCAGTATCGCTCGGTAACGCTCGTATCACTCAGTGTCGCTCCAGTTTCGCTCTCACAGACAATGCGTA
CTTTTGGGGCGCTTCCTATGCTTATGCTTATGTCGGGTAAGGAGTATGGTGGCCCGGCGCATGGCCGAGGAAATCCATGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAGTGAAATTATTGGAGTATGCCTATAGTGAGCAAGCTTGTGAACTATCTATAGGGGGTAAGTTACAAATCCTGAAAAGCTAAATCTGTACAGTCAAAGAGGGCA
GAGTTATAAGTTGAAATCATACCATATATTATGGGTTTGCCTAATGCCTAGTGATGAAACTACAAATCTAGCCGAGCTGTATGTATATTCTCTGTAGGATATGTAACACC
TATTATTTCAATTCAAATTCCAAAATATGTAACAATATTTCTTGCTTGGTTCCAAATTCCTGATGGCATTTTTTCCCCTCTTTGTTACAGTTGTGCATCTTCTTCCTCCC
AGCAACTGGTGTAATTCCTCTACACAACCATCCAGGAATGACTGTTTTCAGTAAGCTTCTGTTGGGGAAAATGCACATCAAATCGTACGATTGGGTTGATCCAACCGACA
GTGATGATCCTGCTCAACCTTGTAAAAGGAGATTGGCAAAGCTGAAAGCTGATACTGTCTTCACTTCACCCTGCAGTACCTCTGTTTTGTACCCAACATCAGGAGGCAAC
ATCCACTCATTCACTGCTATAACGCCATGTGCAGTGCTTGATGTGATTGGACCTCCTTATTCCATGGAGGACGGTCGAGATTGTTCTTATTATAAGGAACATCCCTATGC
CTCTTTTCCAAACGGTGACATGGGACTTGGAGAAGAAGAGGGTGAGGGTTATGGATGGAGAGGAGAAGAAAGGGGAGGCAGCAAGAAAAGGGTAGTCTGGTTTTCCTCTC
TGGAGTCACCCTTCGTCGGCCATATTCTCGCATTTCCCCGTCACTCAGTTGCAGCGCTCCAGCCTTCACGGTGCGGCCTTAATCGCAGTTCACAGCATTGTGGCATCGTC
ATAGTTGTCGCGCCGCAGCATCTTGGTCGTCGCGCCCCAGCATCCCGGTCATCGTCTCGTCGCTTTGCCTCAGGTGCTTGGCGACGCATCATCAACCCTCGACGCATCAT
TAGCCCGCGACACATCATCAGTACGCGTTACAGCATCACCATAGTCCGCGATGCAACGTCATCCTGCATCGTCGCGTCGCCTGTCCTCTCCGCGACGCACTGTCACAGCC
TTCGTAGCGCGTTGCCGCATTGTAAGGAAGTCGCTGTCTCGCTGACACGCTTAGTATCGCTCGGTAACACTCGTATCGCTCAGTATCGCTCGGTAACGCTCGTATCACTC
AGTGTCGCTCCAGTTTCGCTCTCACAGACAATGCGTACTTTTGGGGCGCTTCCTATGCTTATGCTTATGTCGGGTAAGGAGTATGGTGGCCCGGCGCATGGCCGAGGAAA
TCCATGA
Protein sequenceShow/hide protein sequence
MTVFSKLLLGKMHIKSYDWVDPTDSDDPAQPCKRRLAKLKADTVFTSPCSTSVLYPTSGGNIHSFTAITPCAVLDVIGPPYSMEDGRDCSYYKEHPYASFPNGDMGLGEE
EGEGYGWRGEERGGSKKRVVWFSSLESPFVGHILAFPRHSVAALQPSRCGLNRSSQHCGIVIVVAPQHLGRRAPASRSSSRRFASGAWRRIINPRRIISPRHIISTRYSI
TIVRDATSSCIVASPVLSATHCHSLRSALPHCKEVAVSLTRLVSLGNTRIAQYRSVTLVSLSVAPVSLSQTMRTFGALPMLMLMSGKEYGGPAHGRGNP