; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G036220 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G036220
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionAcid phosphatase/vanadium-dependent haloperoxidase-related
Genome locationCla97Chr02:16356749..16375535
RNA-Seq ExpressionCla97C02G036220
SyntenyCla97C02G036220
Gene Ontology termsGO:0098869 - cellular oxidant detoxification (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004601 - peroxidase activity (molecular function)
InterPro domainsIPR003832 - Protein of unknown function DUF212


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151900.1 uncharacterized protein LOC101219445 isoform X2 [Cucumis sativus]1.1e-4195.05Show/hide
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT P P LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

XP_008455920.1 PREDICTED: uncharacterized membrane protein YuiD isoform X1 [Cucumis melo]3.7e-4296.04Show/hide
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

XP_008455921.1 PREDICTED: uncharacterized membrane protein YuiD isoform X2 [Cucumis melo]3.7e-4296.04Show/hide
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

XP_031737146.1 uncharacterized protein LOC101219445 isoform X1 [Cucumis sativus]1.1e-4195.05Show/hide
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT P P LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

XP_031737147.1 uncharacterized protein LOC101219445 isoform X3 [Cucumis sativus]1.1e-4195.05Show/hide
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT P P LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

TrEMBL top hitse value%identityAlignment
A0A1S3C1L4 uncharacterized membrane protein YuiD isoform X11.8e-4296.04Show/hide
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

A0A1S3C253 uncharacterized membrane protein YuiD isoform X21.8e-4296.04Show/hide
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

A0A5A7SVV8 Putative membrane protein YuiD isoform X21.8e-4296.04Show/hide
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

A0A6J1CFR7 uncharacterized protein LOC1110110233.7e-4087.74Show/hide
Query:  MDEVMTVGDAASSSIKT-----LPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIAL
        MDEVMTVGDA SSSIKT      PAP+LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRM+DSGGMPSSHSATVTALA+AIALQ+GSGGPAFA+A+
Subjt:  MDEVMTVGDAASSSIKT-----LPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIAL

Query:  VFACVV
        VFACVV
Subjt:  VFACVV

A0A6J1JGD2 uncharacterized protein LOC1114841416.5e-3783.02Show/hide
Query:  MDEVMTVGDAASSSIK-----TLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIAL
        MDEVMTVGDA SSS+K       P PLL SNLPL+SAFLAGAIAQFLK+FTTWYKERKWESKRM  SGGMPSSHSATVTALA+AIALQEGSGGPAFA+A+
Subjt:  MDEVMTVGDAASSSIK-----TLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIAL

Query:  VFACVV
        VFACVV
Subjt:  VFACVV

SwissProt top hitse value%identityAlignment
O32107 Uncharacterized membrane protein YuiD8.9e-0741.03Show/hide
Query:  LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        L +N PLLS+  A   AQ +K+   +   RK +   +  +GGMPSSHSA VTAL+  +AL+ G     FA++ +FA +
Subjt:  LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Arabidopsis top hitse value%identityAlignment
AT1G24350.1 Acid phosphatase/vanadium-dependent haloperoxidase-related protein7.7e-2267.53Show/hide
Query:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV
        +N PL+SA  +  IAQF+KLFT+WY+ER+W+ K+++ SGGMPSSHSATVTALAVAI LQEG GG  FAIAL+ A VV
Subjt:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV

AT1G24350.2 Acid phosphatase/vanadium-dependent haloperoxidase-related protein7.7e-2267.53Show/hide
Query:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV
        +N PL+SA  +  IAQF+KLFT+WY+ER+W+ K+++ SGGMPSSHSATVTALAVAI LQEG GG  FAIAL+ A VV
Subjt:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV

AT1G24350.3 Acid phosphatase/vanadium-dependent haloperoxidase-related protein7.7e-2267.53Show/hide
Query:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV
        +N PL+SA  +  IAQF+KLFT+WY+ER+W+ K+++ SGGMPSSHSATVTALAVAI LQEG GG  FAIAL+ A VV
Subjt:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV

AT1G67600.1 Acid phosphatase/vanadium-dependent haloperoxidase-related protein4.5e-2267.53Show/hide
Query:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV
        +N PL+SA LA  IAQF+K FT+WYKER+W+ KR++ SGGMPSSHSATVTALA+A+ LQEG GG  FAIALV   +V
Subjt:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV

AT3G21610.1 Acid phosphatase/vanadium-dependent haloperoxidase-related protein5.5e-2862.96Show/hide
Query:  MDEVMTVGDAAS-----SSIKTLPAP--LLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAI
        MDEVMT  D  S      ++   P    L   NLP+ SAFLA A+AQFLK+FT WYKE++W+SKRM+ SGGMPSSHSATVTALAVAI  +EG+G PAFAI
Subjt:  MDEVMTVGDAAS-----SSIKTLPAP--LLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAI

Query:  ALVFACVV
        A+V ACVV
Subjt:  ALVFACVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGTCTTCAACGCAGCGACTTCAATGCATGGTTGTCCTCAACGCAGAGTGACTTTAGCGCATGGTTCAAGCACATTTCATCCCCCGCCAATAATCACTTAT
CCTTTGACGCGAGCAGCTACTTCAGCTCACAACCCTTTTTTTAAGAACGGGCTAGAAAAGAAAGAAGAAAGCACTGACCATCGGGGGGTCGCTAGGACAGAAGTA
TCGCTAAAAAGTTGGAGGGTGATCAGCGAGACAAGTGCCGCTGGTCAAGAAGAGCTAGGGAAAGCGTATAACCCGCTCGAAGCCCGGAATAAGCCCCCTAGTATT
TTTATGAAGCTCTCACCAAAGGATATAGAGAAGGTGATTGAGTATTTCCCTGGGACCCTAGTACTGAAGGACATGAGAAGGGAAGTCGCGACATCGCACAGGCAG
TCTCGCTGTCTTGCTCAGTCTCGCCGCAGTTTTGCTCTTGCAGGTTGCCCTATTGTTGTGCCCCGTACCCCTAATGTACTATGCTACGAAGGGTGTCCTTCGAGA
TCACCACCTATGTCCATGTTACGAATGCTCAGTAGTGGGTCTCTTACTAAGTATGATTACTCACCATTTTTCTATGTTTTTCTCCCAGGGGGAAGGAACGTACTT
GACTTACCATGCGTCGGTAAGGTTGCCATCCGTTATGGTCCCTATGCGTCGAAGACAGAGTGGTCCCTATGCGTCGAAGATTGGATGATTGAGTTAACGTTGTTT
GTGGACGCACGGGCTAGACAGTGTCTGATGTTTGGAGTAGAACGAATCGAGGAGATCGAGTCTAGGGCTGAAATTTTAAGGGGCTTTCCGTCTCACAAAACTCGC
TGGGTATCGCTAGGTATCGCTCCTCAGGTAAATCACGCTAGTCTCGCTTATCCTAGTGTCGCTAGTATCGTGTCCTGTTGGAGAAGGCTCAGTGCTGAAGGACAT
GAGAAGGCAAGTGAGCTTCCCCGGGACCTAGTACTGAAGGACATAGAGAAGGTACTTGGGCAACCCGCCATGTTCCTATGTGCACATAGAGGAGGAGATGTAGTA
GATGGGACAATTGTATCAGATGACTATTTACCCTGTTTCCGGCGAGCAACCAGAGGTACAGTTTTGTGGTGTGTGTTAATGGACGAGGTGATGACGGTTGGGGAT
GCAGCCTCATCTTCCATTAAAACGTTGCCGGCTCCTTTGCTCGCTTCCAATCTCCCCCTCCTCTCCGCCTTCCTTGCTGGCGCCATCGCCCAGTTTCTCAAGCTC
TTTACCACTTGGTACAAGGAAAGAAAATGGGAATCTAAGCGGATGCTTGATTCTGGCGGGATGCCATCGTCTCACTCTGCAACTGTGACTGCTTTGGCCGTTGCT
ATTGCCCTCCAAGAAGGATCCGGAGGACCTGCTTTTGCCATCGCCCTGGTCTTTGCATGTGTTGTATGTGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTGTCTTCAACGCAGCGACTTCAATGCATGGTTGTCCTCAACGCAGAGTGACTTTAGCGCATGGTTCAAGCACATTTCATCCCCCGCCAATAATCACTTAT
CCTTTGACGCGAGCAGCTACTTCAGCTCACAACCCTTTTTTTAAGAACGGGCTAGAAAAGAAAGAAGAAAGCACTGACCATCGGGGGGTCGCTAGGACAGAAGTA
TCGCTAAAAAGTTGGAGGGTGATCAGCGAGACAAGTGCCGCTGGTCAAGAAGAGCTAGGGAAAGCGTATAACCCGCTCGAAGCCCGGAATAAGCCCCCTAGTATT
TTTATGAAGCTCTCACCAAAGGATATAGAGAAGGTGATTGAGTATTTCCCTGGGACCCTAGTACTGAAGGACATGAGAAGGGAAGTCGCGACATCGCACAGGCAG
TCTCGCTGTCTTGCTCAGTCTCGCCGCAGTTTTGCTCTTGCAGGTTGCCCTATTGTTGTGCCCCGTACCCCTAATGTACTATGCTACGAAGGGTGTCCTTCGAGA
TCACCACCTATGTCCATGTTACGAATGCTCAGTAGTGGGTCTCTTACTAAGTATGATTACTCACCATTTTTCTATGTTTTTCTCCCAGGGGGAAGGAACGTACTT
GACTTACCATGCGTCGGTAAGGTTGCCATCCGTTATGGTCCCTATGCGTCGAAGACAGAGTGGTCCCTATGCGTCGAAGATTGGATGATTGAGTTAACGTTGTTT
GTGGACGCACGGGCTAGACAGTGTCTGATGTTTGGAGTAGAACGAATCGAGGAGATCGAGTCTAGGGCTGAAATTTTAAGGGGCTTTCCGTCTCACAAAACTCGC
TGGGTATCGCTAGGTATCGCTCCTCAGGTAAATCACGCTAGTCTCGCTTATCCTAGTGTCGCTAGTATCGTGTCCTGTTGGAGAAGGCTCAGTGCTGAAGGACAT
GAGAAGGCAAGTGAGCTTCCCCGGGACCTAGTACTGAAGGACATAGAGAAGGTACTTGGGCAACCCGCCATGTTCCTATGTGCACATAGAGGAGGAGATGTAGTA
GATGGGACAATTGTATCAGATGACTATTTACCCTGTTTCCGGCGAGCAACCAGAGGTACAGTTTTGTGGTGTGTGTTAATGGACGAGGTGATGACGGTTGGGGAT
GCAGCCTCATCTTCCATTAAAACGTTGCCGGCTCCTTTGCTCGCTTCCAATCTCCCCCTCCTCTCCGCCTTCCTTGCTGGCGCCATCGCCCAGTTTCTCAAGCTC
TTTACCACTTGGTACAAGGAAAGAAAATGGGAATCTAAGCGGATGCTTGATTCTGGCGGGATGCCATCGTCTCACTCTGCAACTGTGACTGCTTTGGCCGTTGCT
ATTGCCCTCCAAGAAGGATCCGGAGGACCTGCTTTTGCCATCGCCCTGGTCTTTGCATGTGTTGTATGTGCTTGATTCTTTCTTTCCTTCTTGTTTACATGCTTG
TTTTCTTTCCTCTACTTTCTTCTTCTGATGCTTTGATTACTGAAAGAATTTGGGAAGTTTTGGGCTTCTGCTTCATCAGCTGTACATTCTTCCATAGTTCAATGC
TTGTTTGGGATTTTGATTTACAATTTAGGTTAAATTATAATTTGCAAGTATAGTTATTGAGTAGCTTAAAAAGTTTCTAATACGTTCTGGTTGTATCTATTTAGT
TCCTCCAATTTATAAAATTTCTAATTCTGTATCTAATTAGTTTTTATTTTTTACTTAATTAGTGAAACATTTGCGAAGCATAAGTCTGGTATATATAAGGATCGC
GTTTTACGTCCAAATTCATCTTGTCATGTAATTTTTAAATTACACATGGTTAATTGGTTATGTAAATGTTAAATGAAATTCATTATAGATATAAAACTTAACAAG
TGTAAAATGGGAAATGAAATGTGTATTCATTTCCATAATGAAAATATTTGTTCATAAAGAATGGTTGAAACCACTCAACAGAATAACATGTGTACAAATAACAAC
TTAGCATCTTTACTAAACTATCGTGATACTCCCCCTCAAGATGGAGTGAACATATTTATCACAAAACAATAGTGTACGCAAAGAAGAAGTAATCTGTAAATCAGA
TAACAAAGAGGACAACCAAACAAGTTCACATGCAGTCAAAGCCAAAGCCTGATACTCTACTTTAGCTTAGGAATGGGAAACAATGTCTGTTTCTTAGACTTCTAT
GACACCAAAGAATCACCTAGAAAAACACTGGACCCCGTTGTTGAACACTGTATGACTAGGCAAGATGCCCAATCAGAATCGGCTAAAGTACGAACTTGAAAACTG
GTGGTTGGCTGAAGTAAATTACTTTGTCTAGGCATTGAAGTCATGTATTTCAGCAAGTGACGTGCACATGGTTTAAAAACAAATTTACTTAGTTTGTTGGAAATT
GCCAGATACAACAATCTACTGGTAAGTCTTCGGTATGAAGATAGATCAACTAACAATTCGCCACCATCTTACCTTAGCTTGAGATGAGGATTCATAGGCACCGCC
TCAGGCTTAGAACCAAGAAGACCAACATCTTCAAGGAGTTGCAGAGTATAATGTCGCTGTGACAAATAAATTCCTGTGGAAGAACGAGCCAGCTCAAGTCCAAGA
AAGTATTTTAGATCTCCCAAGTTTTTAAGGTTAAAATTCTTATTGAGAAGAAGTTTCAGTTTAGTTGTGGCAGAAGGATTGGCTCCAGTGATAATCATGTCATTT
ACATGCACTAGAAGGGACAAAAAATCCGAACCAAAACCCCTCATAAAAAGGGAATAATCAGATTTTGATTGTTGAAAACCAATAGATAGCAGAGTAGTGGAGGAC
TTTGCAACCCATTGTCGAGAGGCCTGTTTTAGACCATAGTTAGACTTACGTAATCTACATAACAAGAGGCTCCCTCTTACTTGAGACATGTGCCTGTGGTGTACA
ACCTAAGGTTAGGTCCATGTATACTTCTTCAAATAATTTGCTATGTAGAAAAGCGTTGTTGACGTCCAGTTGTACAAGTGGCCAATTAAATGAAACAACCACTTG
TGAGTACACACACTTTCACAAGCTTTGCTATTGGGGAGAAAGTCTCTATAAAATCCAAACCCTCTTGTTGAGTTTAGCCTTTAGCAACCAAATGAACTTTGTACA
TCTCAGTGGAATCATTGACATTATATTTGACTTTATAGTTCCATTTACACTTGATAGAATGCTTATCAGGAGGTAAAGAAACGACACTCCATGTGTTATTTGCCT
CCATAGCCTCAAGTTCAGCTTGCATTTCTATTTTCCAATTATCAGAACGGCTTGATGATAAAATTGTGGCTCATGTCTCATAGTGAGCAGAGACATTAAGAGAAA
AACCCTTAAAGGAAGGAGAAAGTTTGTCATATGAGACATGGTGTTGTAAGGGATATTTGGTAGTGGTTGTAGAGAAAGGAAAATTGGATAAAAAGCCACAATGAT
AATTCTGGACACAAGAAGGAGGTTTGGTTGGTCTTGTTGATTTTCTAATAATAAGGGAGGAAGAAGACATAACTGTAGGTTAATTAATTGGAGTGGATATAGAAG
GTTGACCAGGGCTGGTAGGCTCAAGTAGTTGACTTGGATCATTGGTAGGACCTTGAACAATCGGTTAAGAAATATGTGATGCATATGTGATGTAGAAGGTATTCA
TTAGATGTAATATCCACAGCTCGAGGAAGAACTATATTAGATAAAAACTTGGGTTTCTCATGGAAATCGGTAACCTTAATGAAAAGGGAAAATGTTCCTTGAATA
TTACTTCACGGGAGATTAAAAATTTCTAACTTTCAATATCAAATAACCTGTATGCCCTACCAGGGGATATCATACAAAAACTGCAGGAATGGCTCAGGGGGCAAA
CTTATGTGTTTGATGTTAGAGAGTGGATGCAAAATAGAGACAACCGAACACTCATAACCTGTAATAATCTGGCAAAGTGCCATTAAACCAAGCAAAAGGAGTCTG
CTAGTCTAAAACATTTGATGGAGTTTTGTTTATGAAATGCACAACAGTCAATACACACTCACCCCAAAAGGAAAGAAGTACACATGATTGAAAATACAAGACTTG
TGCAACATTTAAATTCTGCTGGAGTTTTCTATCACAAAATTTTGCTCAGGCCTAGTAGCACGCGAAAACTGATGTACAACTCCCTTTTCTTTAAATAAATCTTCA
AAACTTAACTCACGCGTATTGCCAGAAAAGTCTTGATACTTTTTTCATACTGAGTTTGAATCAATTGAAAGAACCGAGGAGCTATTGTCAACACATCAAAATTTC
TTTTTAACATGTAAAGCCAAATGTACCGTGTACAATGATCCACTATGGTAAGAAAGTAAGTATAACCAACATGAGGAGTGCAAAAGGTCCCCGCACTTCCACATG
TATCAAATCAAAAGCATTCGGTGATAGATGATTCTTGGAAGTAATTGACAGATGCCTTTGCTTAGCGAAAGGACAAATAACACAAGGAGTATTATAATCTGTTCT
AGGAGAATCAAAATCCAAAACATTCATCAACACATTCAAATGAGAAAAGGAAAGATGGCCAAGCGTGGAATGTTATAAGACAGCAGAAAAACGTGGAAAAGAAGT
TGCACAAATAGAAGAATCAATGTCAACACTATCAGCAAACTCATCAAGACACTCTTTCATTTTACCCTTACTAATCATCCGCAAAGTGAACTTGTCCTGAAGTGT
ACAATAGTTAGTGGAGAAATTAACCTAAACAAAAATAACTACAAAAAGTCTTCGAAATCAAAGCCCACAAGGAAACATGAAAGTGAACGAGGGACCATAACTCTT
AGGATCCATCTCCACACCTCTACTATTCTGTTCACCCGACAAAACCCATAAGACTGCACACACCCCATCAAGCCAAAGAAAATGACGTTTCTTCCCAAAAGGTGG
ATTGAGGAGGAACTCCTTGATCATGGCACTGACATCTCTATGAAGAATATACATCATACCAAATGTCTGGAAAAAAGACTCCCAGACACACTTATAATTGCCAAA
GAATATGATCCAAGTCTTTCTCTGCCTTCCGACAAAGAATACAACAAAAAGGCGCAACAAACAAAGGCAACTTCCTCACGAGCCTATCCAATGTGTTAGCACGAT
TGTGAAGAACCTGCCAAGTAAGGAACCTCACCTTTCTCGGAATTTGATCCTCCAGAGCACCGAAAAGACCAACACACCTAAGGGGGAAGATCAACCAAACATTGA
AAGAAATACTTGTATGAGAACCCTTTCAAAGGATTGGGGCTCCAAAATCTGACATTCCTTCTCTCAATTCTAAAGGGATGACCCTCGAGTAAAGAAAGGAGAGTA
GCCACATCCATTTCTTCTCTATTAGAAAGAGAACAACGGAATTGGAAGGAAAAGGAACAAGAGTTCCTAGACCACACTAGAAAGTTGATAACCAAATGATTTTTG
AGAGATGAAAAATGATACAACCAAGGAAACAAAACACAAAGAGATATCTCTCCCCTCCCCCACCACACATAACGAACCATGTGAAGAAAAGAAGGGAGCTCTTTC
GAAACATCTTTCCACAAATTCTACTGAGTACCTTTAACCCCTTTAGACAACCACTTGAAAGGATGGGACCATGTTTACTAGCAAGGGTCCTATGCCATAGAGAAT
TGGGCCTAAAGGAAAACGCCACAGCCATTTGGCTAAAAGGGCTTTGTTACGCATCCTAAGATTCCCAATTTCCAGACCCCCTAAGGAAACTAGTCTCTCAATAAC
TCCCACCCAATCAAGTGTGATCCTTTTCCTTTTCCTTCATCACTCCCTTCCTAGAAGAAGTCACACATCAATCTCTGATCTCTCCAAACACTTACACACCTCACA
TGGGGCCCTGAAGAGGGAAAAGAAATAAACAGGAATGCCACCTAGACAGACTTGATCTGGGTAAGGTAGCCAGCTTTAGAAAAAACTTTTCTTCCAAGAGTCAAA
CCTGTTTCGAACTTTATCCACCAAGGGATCCCACAAAGAAACGGACTTAGGGTTACTTCCAAGAGGAAGATTTGCTTCGAACTTTATGATAGTTGTGTTAATAGG
TGCCAAATATGTAAAAGTTGCTTAGTTAAAACATATGTCAGGAGAGGGGCCAGAGTCGATTATCCAATAGGCAATAGAGAATTTTTTGGGCAAATACTTGCTAGA
TAAGAAGTGGTAGCCTCTGACTCTGTTTTGACAGCAACAAGTTTTGTCTCTAGCATAGCCAACATATCATGACATTGTGCAAGAGTGGCAGAAGGACCAACCCGT
GGATCAGAAGATGAGACAACATTGGTAGGTTTATTAATGGCATTACTCGTGACAACAAGTGGAGTAGTAGGTTAAGGTTGTTGACGTTGTCCTCCTCTTTGGTTA
TTAACTAGGAGGATATCCATGTACCTTATGACAACGATCTACTTTATGGCCCTGTATTCCACAATGAGTGCAGATAGGGCAATCTTTATGCTATCGATTATTGGA
ACTTTGAGGAGTTTTAGAAACGACAGCGGCTTGATTAACAAAGATTTCAATTGTAGGAGTTGAAGAAAAGATGGGAGCAGATCGCTACTATTATTCTTGAGCAAT
CAATGAAACAACCTTATTAATTGTAGCAACAGGTTCCATGAGGAGAGTTTAAGAAGGAGAATGGCCATATATGACCCATTGAGTCCCATTACAAAATTCACAAGG
TATTCATGCTGAAGAATTCATCAAGATCTTTAATGCCTCCACAATTGCATTTTTTGTAGGTGCATCCTGGTTGATAAGTGACATCTTCATCTCAAATGCTCTTCA
ATTGCGGAAAATACATTGTAACAGACTGCTGATCTTGAGTAAGAGTTGCATGTTTATGCTTTAGATGAAAGATGCAAGGGCCATTCTTATGTTCAAGTCGCTTCT
TATGTTCAAGTCGCTTCTTAAGATCAAGCCAAATTGCTCTTGCCGAATCAGAAAACAAAATGGTGGAAGAAATTCCTTTCAAAATGGAATTTAAAATCCAAGCCC
TAACGTTGTTCTAGATCTAGACAGGAAGCAAAAAACGAGTCGGTTCAGTTAGGGTTCCATCAGCAAAACCTAACTTATTATGGATGGAGAGTGCCAGAATCATCG
AACTACTCCAGGTAATGTAATTGTCATTAGTGTGCAGATCAGAAATCAAAACAAGGTTTGAAGTGTCATTGTGATGAGGAAAGTAGGGTTTCTGAAAATTCTTTG
ACGAGGTTTGAACTTGAATTGAAGAAGACGAGACATTTTTAGGCGCTTGAACCTCAGTATGAGAACCTTCATGAGTCATGACGAAGAAGCCGAAAAAAAGTACTT
GATAGCAATAAAAGAGATAATTTTTCTTTGATACCATAACAAGTGTAAAATGGGAAATGAAATGTGTATCTTATTCATTTCCATAATGAAAATATTCATTCATAA
AGAATGGTTGAAACCATTCAACAGAATAACACTTGTATAAATAACAGCATAACAACTTTACTTAACTATCTTTGTGAAATTGAAAGTTCACAACTTGTTGAAATT
GTTTAAAGAATGTATAGACACTAACTATAAAAGTTTAGATAACCTGTGCGAACCTGAAATTTACTAGTTCAAGTTTTTGCTAGTCTAAATTTGTATTTGATGGCC
TTATCTTACCCCTTTTATTTGATTTCACTGGTATTCGCTATATGTACTTTGTAAATTTGTAATCATTTATTTTTTGTTAATCTTGTTTTTTGATTACATGTCTCT
CTTGCTTTTGTATTCAGAAATGGTAACTTGGCTGTTTTTGTATAGACCCTGTTTTCTTGGCGTTGTCTATCATGTTTTTTCATTACACTTCTGTCGTGTTTCAAT
TCTGTAATGTAACTCGGCTGTTTTTGTAAAGACCTTGACTTCTTGACGTCATCTTTTCATGTATTTCTGTTTGCGTTTCTTTTGCACACCAGATGAGTACTATCA
ATTTCTTTCTATCAATTTCAAATCAGCTAAAAATGTCGTAAATATGTTGATTTGAAATTGTCGTGTGGTAATATATCAATTTCAAATCAGCTAAAAATGTCGTAA
ATAGGAAACTAGTCTCTCAATAACTCCCACCCAATCAAGTGTGTCACAGGGCCTGCAAGTTGGTTTAAACTGCTTTTTGAATGTTTATGTTCCTAATCTAATATA
CTTGTTTTATTATTTATCTAAATGAGACCGCAGTTTAAACTTTGGGTTTTTGAACTCTTCTTAATGTTTTAAGGTAATTGGCCTTCATATATTTTCAATTTCTTG
AAGGTAATGTATGATGCTACTGGTGTCAGACTTCATGCTGGTCGTCAAGCCGAGGTAAGTTTGTCAGAAAGAAATATAGATTCCCCGTTTTATCTTTCATTTATT
ATTTCCACTTCCATCTGTATCACCTTGAGTCTGATTATTATGTCCATTTTACAAACAAAATTCCCATGTTCATCTTCTCTTCTTTAAGTCTTGTCATTTGATGAT
CATTGCAGTTGCTGAACCAAATTGTTTGCGAGTTTCCTCCTGAACATCCTTTGTCCAGTATTAGACCATTGCGAGATTCACTTGGCCACACTCCACTTCAGGTTA
TTGCAGGTGCTATGTTGGGATGTCTAGTGGCTTATTTGATAAGAAATCAAAATTAAAGAATGGAATTTGTACATCAGGATAGAATACGACATGTCGTTGGTTGAA
AGCACTCGAAGCTTAAAAGTGTAAAGTAAACATGGGTGCTGCAGCTAAGGTAGTTGGACTGGGTGGTTATGGAAGATGCAGAAATGATATATTCTTTGGTTTTAG
AACTCTCTATTGCTAAGTCTTCTTCCCCTGATCCCATCCACATTGAATTCCTTGTTTAGACTTGTCATATGTATACACACACTAGTGTTTATTTGAGTGAAATGT
TGTTTAATATAACTGTTTCTTA
Protein sequenceShow/hide protein sequence
MFVFNAATSMHGCPQRRVTLAHGSSTFHPPPIITYPLTRAATSAHNPFFKNGLEKKEESTDHRGVARTEVSLKSWRVISETSAAGQEELGKAYNPLEARNKPPSI
FMKLSPKDIEKVIEYFPGTLVLKDMRREVATSHRQSRCLAQSRRSFALAGCPIVVPRTPNVLCYEGCPSRSPPMSMLRMLSSGSLTKYDYSPFFYVFLPGGRNVL
DLPCVGKVAIRYGPYASKTEWSLCVEDWMIELTLFVDARARQCLMFGVERIEEIESRAEILRGFPSHKTRWVSLGIAPQVNHASLAYPSVASIVSCWRRLSAEGH
EKASELPRDLVLKDIEKVLGQPAMFLCAHRGGDVVDGTIVSDDYLPCFRRATRGTVLWCVLMDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKL
FTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVVCA