CuGenDBv2

Cla97C02G036220 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C02G036220
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Acid phosphatase/vanadium-dependent haloperoxidase-related
Genome location: Cla97Chr02:16356749..16375535
RNA-Seq Expression: Cla97C02G036220
Synteny: Cla97C02G036220
Gene Ontology terms:
    GO:0098869 - cellular oxidant detoxification (biological process)
    GO:0016021 - integral component of membrane (cellular component)
    GO:0004601 - peroxidase activity (molecular function)
InterPro domains: IPR003832 - Protein of unknown function DUF212
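
The genome location above uses a `seqid:start..end` string. As a minimal sketch, such a string could be split into its parts as shown below. The names `Locus` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Split a 'seqid:start..end' string into its three components."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end regardless of input order.
        start, end = end, start
    return Locus(seqid, start, end)


locus = parse_location("Cla97Chr02:16356749..16375535")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the span on Cla97Chr02 covers 18787 positions.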


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004151900.1 uncharacterized protein LOC101219445 isoform X2 [Cucumis sativus] (e value: 1.1e-41; identity: 95.05%)
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT P P LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

XP_008455920.1 PREDICTED: uncharacterized membrane protein YuiD isoform X1 [Cucumis melo] (e value: 3.7e-42; identity: 96.04%)
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

XP_008455921.1 PREDICTED: uncharacterized membrane protein YuiD isoform X2 [Cucumis melo] (e value: 3.7e-42; identity: 96.04%)
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

XP_031737146.1 uncharacterized protein LOC101219445 isoform X1 [Cucumis sativus] (e value: 1.1e-41; identity: 95.05%)
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT P P LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

XP_031737147.1 uncharacterized protein LOC101219445 isoform X3 [Cucumis sativus] (e value: 1.1e-41; identity: 95.05%)
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT P P LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3C1L4 uncharacterized membrane protein YuiD isoform X1 (e value: 1.8e-42; identity: 96.04%)
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

A0A1S3C253 uncharacterized membrane protein YuiD isoform X2 (e value: 1.8e-42; identity: 96.04%)
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

A0A5A7SVV8 Putative membrane protein YuiD isoform X2 (e value: 1.8e-42; identity: 96.04%)
Query:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        MDEVMTVGDAASSSIKT PAP LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATV+ALAVAIA QEGSGGPAFAIALVFACV
Subjt:  MDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Query:  V
        V
Subjt:  V

A0A6J1CFR7 uncharacterized protein LOC111011023 (e value: 3.7e-40; identity: 87.74%)
Query:  MDEVMTVGDAASSSIKT-----LPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIAL
        MDEVMTVGDA SSSIKT      PAP+LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRM+DSGGMPSSHSATVTALA+AIALQ+GSGGPAFA+A+
Subjt:  MDEVMTVGDAASSSIKT-----LPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIAL

Query:  VFACVV
        VFACVV
Subjt:  VFACVV

A0A6J1JGD2 uncharacterized protein LOC111484141 (e value: 6.5e-37; identity: 83.02%)
Query:  MDEVMTVGDAASSSIK-----TLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIAL
        MDEVMTVGDA SSS+K       P PLL SNLPL+SAFLAGAIAQFLK+FTTWYKERKWESKRM  SGGMPSSHSATVTALA+AIALQEGSGGPAFA+A+
Subjt:  MDEVMTVGDAASSSIK-----TLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIAL

Query:  VFACVV
        VFACVV
Subjt:  VFACVV

SwissProt top hits (columns: e value, % identity, alignment)
O32107 Uncharacterized membrane protein YuiD (e value: 8.9e-07; identity: 41.03%)
Query:  LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV
        L +N PLLS+  A   AQ +K+   +   RK +   +  +GGMPSSHSA VTAL+  +AL+ G     FA++ +FA +
Subjt:  LASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACV

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G24350.1 Acid phosphatase/vanadium-dependent haloperoxidase-related protein (e value: 7.7e-22; identity: 67.53%)
Query:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV
        +N PL+SA  +  IAQF+KLFT+WY+ER+W+ K+++ SGGMPSSHSATVTALAVAI LQEG GG  FAIAL+ A VV
Subjt:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV

AT1G24350.2 Acid phosphatase/vanadium-dependent haloperoxidase-related protein (e value: 7.7e-22; identity: 67.53%)
Query:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV
        +N PL+SA  +  IAQF+KLFT+WY+ER+W+ K+++ SGGMPSSHSATVTALAVAI LQEG GG  FAIAL+ A VV
Subjt:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV

AT1G24350.3 Acid phosphatase/vanadium-dependent haloperoxidase-related protein (e value: 7.7e-22; identity: 67.53%)
Query:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV
        +N PL+SA  +  IAQF+KLFT+WY+ER+W+ K+++ SGGMPSSHSATVTALAVAI LQEG GG  FAIAL+ A VV
Subjt:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV

AT1G67600.1 Acid phosphatase/vanadium-dependent haloperoxidase-related protein (e value: 4.5e-22; identity: 67.53%)
Query:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV
        +N PL+SA LA  IAQF+K FT+WYKER+W+ KR++ SGGMPSSHSATVTALA+A+ LQEG GG  FAIALV   +V
Subjt:  SNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVV

AT3G21610.1 Acid phosphatase/vanadium-dependent haloperoxidase-related protein (e value: 5.5e-28; identity: 62.96%)
Query:  MDEVMTVGDAAS-----SSIKTLPAP--LLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAI
        MDEVMT  D  S      ++   P    L   NLP+ SAFLA A+AQFLK+FT WYKE++W+SKRM+ SGGMPSSHSATVTALAVAI  +EG+G PAFAI
Subjt:  MDEVMTVGDAAS-----SSIKTLPAP--LLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGGMPSSHSATVTALAVAIALQEGSGGPAFAI

Query:  ALVFACVV
        A+V ACVV
Subjt:  ALVFACVV


Sequences
CDS sequence
ATGTTTGTCTTCAACGCAGCGACTTCAATGCATGGTTGTCCTCAACGCAGAGTGACTTTAGCGCATGGTTCAAGCACATTTCATCCCCCGCCAATAATCACTTATCCTTT
GACGCGAGCAGCTACTTCAGCTCACAACCCTTTTTTTAAGAACGGGCTAGAAAAGAAAGAAGAAAGCACTGACCATCGGGGGGTCGCTAGGACAGAAGTATCGCTAAAAA
GTTGGAGGGTGATCAGCGAGACAAGTGCCGCTGGTCAAGAAGAGCTAGGGAAAGCGTATAACCCGCTCGAAGCCCGGAATAAGCCCCCTAGTATTTTTATGAAGCTCTCA
CCAAAGGATATAGAGAAGGTGATTGAGTATTTCCCTGGGACCCTAGTACTGAAGGACATGAGAAGGGAAGTCGCGACATCGCACAGGCAGTCTCGCTGTCTTGCTCAGTC
TCGCCGCAGTTTTGCTCTTGCAGGTTGCCCTATTGTTGTGCCCCGTACCCCTAATGTACTATGCTACGAAGGGTGTCCTTCGAGATCACCACCTATGTCCATGTTACGAA
TGCTCAGTAGTGGGTCTCTTACTAAGTATGATTACTCACCATTTTTCTATGTTTTTCTCCCAGGGGGAAGGAACGTACTTGACTTACCATGCGTCGGTAAGGTTGCCATC
CGTTATGGTCCCTATGCGTCGAAGACAGAGTGGTCCCTATGCGTCGAAGATTGGATGATTGAGTTAACGTTGTTTGTGGACGCACGGGCTAGACAGTGTCTGATGTTTGG
AGTAGAACGAATCGAGGAGATCGAGTCTAGGGCTGAAATTTTAAGGGGCTTTCCGTCTCACAAAACTCGCTGGGTATCGCTAGGTATCGCTCCTCAGGTAAATCACGCTA
GTCTCGCTTATCCTAGTGTCGCTAGTATCGTGTCCTGTTGGAGAAGGCTCAGTGCTGAAGGACATGAGAAGGCAAGTGAGCTTCCCCGGGACCTAGTACTGAAGGACATA
GAGAAGGTACTTGGGCAACCCGCCATGTTCCTATGTGCACATAGAGGAGGAGATGTAGTAGATGGGACAATTGTATCAGATGACTATTTACCCTGTTTCCGGCGAGCAAC
CAGAGGTACAGTTTTGTGGTGTGTGTTAATGGACGAGGTGATGACGGTTGGGGATGCAGCCTCATCTTCCATTAAAACGTTGCCGGCTCCTTTGCTCGCTTCCAATCTCC
CCCTCCTCTCCGCCTTCCTTGCTGGCGCCATCGCCCAGTTTCTCAAGCTCTTTACCACTTGGTACAAGGAAAGAAAATGGGAATCTAAGCGGATGCTTGATTCTGGCGGG
ATGCCATCGTCTCACTCTGCAACTGTGACTGCTTTGGCCGTTGCTATTGCCCTCCAAGAAGGATCCGGAGGACCTGCTTTTGCCATCGCCCTGGTCTTTGCATGTGTTGT
ATGTGCTTGA
mRNA sequence
ATGTTTGTCTTCAACGCAGCGACTTCAATGCATGGTTGTCCTCAACGCAGAGTGACTTTAGCGCATGGTTCAAGCACATTTCATCCCCCGCCAATAATCACTTATCCTTT
GACGCGAGCAGCTACTTCAGCTCACAACCCTTTTTTTAAGAACGGGCTAGAAAAGAAAGAAGAAAGCACTGACCATCGGGGGGTCGCTAGGACAGAAGTATCGCTAAAAA
GTTGGAGGGTGATCAGCGAGACAAGTGCCGCTGGTCAAGAAGAGCTAGGGAAAGCGTATAACCCGCTCGAAGCCCGGAATAAGCCCCCTAGTATTTTTATGAAGCTCTCA
CCAAAGGATATAGAGAAGGTGATTGAGTATTTCCCTGGGACCCTAGTACTGAAGGACATGAGAAGGGAAGTCGCGACATCGCACAGGCAGTCTCGCTGTCTTGCTCAGTC
TCGCCGCAGTTTTGCTCTTGCAGGTTGCCCTATTGTTGTGCCCCGTACCCCTAATGTACTATGCTACGAAGGGTGTCCTTCGAGATCACCACCTATGTCCATGTTACGAA
TGCTCAGTAGTGGGTCTCTTACTAAGTATGATTACTCACCATTTTTCTATGTTTTTCTCCCAGGGGGAAGGAACGTACTTGACTTACCATGCGTCGGTAAGGTTGCCATC
CGTTATGGTCCCTATGCGTCGAAGACAGAGTGGTCCCTATGCGTCGAAGATTGGATGATTGAGTTAACGTTGTTTGTGGACGCACGGGCTAGACAGTGTCTGATGTTTGG
AGTAGAACGAATCGAGGAGATCGAGTCTAGGGCTGAAATTTTAAGGGGCTTTCCGTCTCACAAAACTCGCTGGGTATCGCTAGGTATCGCTCCTCAGGTAAATCACGCTA
GTCTCGCTTATCCTAGTGTCGCTAGTATCGTGTCCTGTTGGAGAAGGCTCAGTGCTGAAGGACATGAGAAGGCAAGTGAGCTTCCCCGGGACCTAGTACTGAAGGACATA
GAGAAGGTACTTGGGCAACCCGCCATGTTCCTATGTGCACATAGAGGAGGAGATGTAGTAGATGGGACAATTGTATCAGATGACTATTTACCCTGTTTCCGGCGAGCAAC
CAGAGGTACAGTTTTGTGGTGTGTGTTAATGGACGAGGTGATGACGGTTGGGGATGCAGCCTCATCTTCCATTAAAACGTTGCCGGCTCCTTTGCTCGCTTCCAATCTCC
CCCTCCTCTCCGCCTTCCTTGCTGGCGCCATCGCCCAGTTTCTCAAGCTCTTTACCACTTGGTACAAGGAAAGAAAATGGGAATCTAAGCGGATGCTTGATTCTGGCGGG
ATGCCATCGTCTCACTCTGCAACTGTGACTGCTTTGGCCGTTGCTATTGCCCTCCAAGAAGGATCCGGAGGACCTGCTTTTGCCATCGCCCTGGTCTTTGCATGTGTTGT
ATGTGCTTGATTCTTTCTTTCCTTCTTGTTTACATGCTTGTTTTCTTTCCTCTACTTTCTTCTTCTGATGCTTTGATTACTGAAAGAATTTGGGAAGTTTTGGGCTTCTG
CTTCATCAGCTGTACATTCTTCCATAGTTCAATGCTTGTTTGGGATTTTGATTTACAATTTAGGTTAAATTATAATTTGCAAGTATAGTTATTGAGTAGCTTAAAAAGTT
TCTAATACGTTCTGGTTGTATCTATTTAGTTCCTCCAATTTATAAAATTTCTAATTCTGTATCTAATTAGTTTTTATTTTTTACTTAATTAGTGAAACATTTGCGAAGCA
TAAGTCTGGTATATATAAGGATCGCGTTTTACGTCCAAATTCATCTTGTCATGTAATTTTTAAATTACACATGGTTAATTGGTTATGTAAATGTTAAATGAAATTCATTA
TAGATATAAAACTTAACAAGTGTAAAATGGGAAATGAAATGTGTATTCATTTCCATAATGAAAATATTTGTTCATAAAGAATGGTTGAAACCACTCAACAGAATAACATG
TGTACAAATAACAACTTAGCATCTTTACTAAACTATCGTGATACTCCCCCTCAAGATGGAGTGAACATATTTATCACAAAACAATAGTGTACGCAAAGAAGAAGTAATCT
GTAAATCAGATAACAAAGAGGACAACCAAACAAGTTCACATGCAGTCAAAGCCAAAGCCTGATACTCTACTTTAGCTTAGGAATGGGAAACAATGTCTGTTTCTTAGACT
TCTATGACACCAAAGAATCACCTAGAAAAACACTGGACCCCGTTGTTGAACACTGTATGACTAGGCAAGATGCCCAATCAGAATCGGCTAAAGTACGAACTTGAAAACTG
GTGGTTGGCTGAAGTAAATTACTTTGTCTAGGCATTGAAGTCATGTATTTCAGCAAGTGACGTGCACATGGTTTAAAAACAAATTTACTTAGTTTGTTGGAAATTGCCAG
ATACAACAATCTACTGGTAAGTCTTCGGTATGAAGATAGATCAACTAACAATTCGCCACCATCTTACCTTAGCTTGAGATGAGGATTCATAGGCACCGCCTCAGGCTTAG
AACCAAGAAGACCAACATCTTCAAGGAGTTGCAGAGTATAATGTCGCTGTGACAAATAAATTCCTGTGGAAGAACGAGCCAGCTCAAGTCCAAGAAAGTATTTTAGATCT
CCCAAGTTTTTAAGGTTAAAATTCTTATTGAGAAGAAGTTTCAGTTTAGTTGTGGCAGAAGGATTGGCTCCAGTGATAATCATGTCATTTACATGCACTAGAAGGGACAA
AAAATCCGAACCAAAACCCCTCATAAAAAGGGAATAATCAGATTTTGATTGTTGAAAACCAATAGATAGCAGAGTAGTGGAGGACTTTGCAACCCATTGTCGAGAGGCCT
GTTTTAGACCATAGTTAGACTTACGTAATCTACATAACAAGAGGCTCCCTCTTACTTGAGACATGTGCCTGTGGTGTACAACCTAAGGTTAGGTCCATGTATACTTCTTC
AAATAATTTGCTATGTAGAAAAGCGTTGTTGACGTCCAGTTGTACAAGTGGCCAATTAAATGAAACAACCACTTGTGAGTACACACACTTTCACAAGCTTTGCTATTGGG
GAGAAAGTCTCTATAAAATCCAAACCCTCTTGTTGAGTTTAGCCTTTAGCAACCAAATGAACTTTGTACATCTCAGTGGAATCATTGACATTATATTTGACTTTATAGTT
CCATTTACACTTGATAGAATGCTTATCAGGAGGTAAAGAAACGACACTCCATGTGTTATTTGCCTCCATAGCCTCAAGTTCAGCTTGCATTTCTATTTTCCAATTATCAG
AACGGCTTGATGATAAAATTGTGGCTCATGTCTCATAGTGAGCAGAGACATTAAGAGAAAAACCCTTAAAGGAAGGAGAAAGTTTGTCATATGAGACATGGTGTTGTAAG
GGATATTTGGTAGTGGTTGTAGAGAAAGGAAAATTGGATAAAAAGCCACAATGATAATTCTGGACACAAGAAGGAGGTTTGGTTGGTCTTGTTGATTTTCTAATAATAAG
GGAGGAAGAAGACATAACTGTAGGTTAATTAATTGGAGTGGATATAGAAGGTTGACCAGGGCTGGTAGGCTCAAGTAGTTGACTTGGATCATTGGTAGGACCTTGAACAA
TCGGTTAAGAAATATGTGATGCATATGTGATGTAGAAGGTATTCATTAGATGTAATATCCACAGCTCGAGGAAGAACTATATTAGATAAAAACTTGGGTTTCTCATGGAA
ATCGGTAACCTTAATGAAAAGGGAAAATGTTCCTTGAATATTACTTCACGGGAGATTAAAAATTTCTAACTTTCAATATCAAATAACCTGTATGCCCTACCAGGGGATAT
CATACAAAAACTGCAGGAATGGCTCAGGGGGCAAACTTATGTGTTTGATGTTAGAGAGTGGATGCAAAATAGAGACAACCGAACACTCATAACCTGTAATAATCTGGCAA
AGTGCCATTAAACCAAGCAAAAGGAGTCTGCTAGTCTAAAACATTTGATGGAGTTTTGTTTATGAAATGCACAACAGTCAATACACACTCACCCCAAAAGGAAAGAAGTA
CACATGATTGAAAATACAAGACTTGTGCAACATTTAAATTCTGCTGGAGTTTTCTATCACAAAATTTTGCTCAGGCCTAGTAGCACGCGAAAACTGATGTACAACTCCCT
TTTCTTTAAATAAATCTTCAAAACTTAACTCACGCGTATTGCCAGAAAAGTCTTGATACTTTTTTCATACTGAGTTTGAATCAATTGAAAGAACCGAGGAGCTATTGTCA
ACACATCAAAATTTCTTTTTAACATGTAAAGCCAAATGTACCGTGTACAATGATCCACTATGGTAAGAAAGTAAGTATAACCAACATGAGGAGTGCAAAAGGTCCCCGCA
CTTCCACATGTATCAAATCAAAAGCATTCGGTGATAGATGATTCTTGGAAGTAATTGACAGATGCCTTTGCTTAGCGAAAGGACAAATAACACAAGGAGTATTATAATCT
GTTCTAGGAGAATCAAAATCCAAAACATTCATCAACACATTCAAATGAGAAAAGGAAAGATGGCCAAGCGTGGAATGTTATAAGACAGCAGAAAAACGTGGAAAAGAAGT
TGCACAAATAGAAGAATCAATGTCAACACTATCAGCAAACTCATCAAGACACTCTTTCATTTTACCCTTACTAATCATCCGCAAAGTGAACTTGTCCTGAAGTGTACAAT
AGTTAGTGGAGAAATTAACCTAAACAAAAATAACTACAAAAAGTCTTCGAAATCAAAGCCCACAAGGAAACATGAAAGTGAACGAGGGACCATAACTCTTAGGATCCATC
TCCACACCTCTACTATTCTGTTCACCCGACAAAACCCATAAGACTGCACACACCCCATCAAGCCAAAGAAAATGACGTTTCTTCCCAAAAGGTGGATTGAGGAGGAACTC
CTTGATCATGGCACTGACATCTCTATGAAGAATATACATCATACCAAATGTCTGGAAAAAAGACTCCCAGACACACTTATAATTGCCAAAGAATATGATCCAAGTCTTTC
TCTGCCTTCCGACAAAGAATACAACAAAAAGGCGCAACAAACAAAGGCAACTTCCTCACGAGCCTATCCAATGTGTTAGCACGATTGTGAAGAACCTGCCAAGTAAGGAA
CCTCACCTTTCTCGGAATTTGATCCTCCAGAGCACCGAAAAGACCAACACACCTAAGGGGGAAGATCAACCAAACATTGAAAGAAATACTTGTATGAGAACCCTTTCAAA
GGATTGGGGCTCCAAAATCTGACATTCCTTCTCTCAATTCTAAAGGGATGACCCTCGAGTAAAGAAAGGAGAGTAGCCACATCCATTTCTTCTCTATTAGAAAGAGAACA
ACGGAATTGGAAGGAAAAGGAACAAGAGTTCCTAGACCACACTAGAAAGTTGATAACCAAATGATTTTTGAGAGATGAAAAATGATACAACCAAGGAAACAAAACACAAA
GAGATATCTCTCCCCTCCCCCACCACACATAACGAACCATGTGAAGAAAAGAAGGGAGCTCTTTCGAAACATCTTTCCACAAATTCTACTGAGTACCTTTAACCCCTTTA
GACAACCACTTGAAAGGATGGGACCATGTTTACTAGCAAGGGTCCTATGCCATAGAGAATTGGGCCTAAAGGAAAACGCCACAGCCATTTGGCTAAAAGGGCTTTGTTAC
GCATCCTAAGATTCCCAATTTCCAGACCCCCTAAGGAAACTAGTCTCTCAATAACTCCCACCCAATCAAGTGTGATCCTTTTCCTTTTCCTTCATCACTCCCTTCCTAGA
AGAAGTCACACATCAATCTCTGATCTCTCCAAACACTTACACACCTCACATGGGGCCCTGAAGAGGGAAAAGAAATAAACAGGAATGCCACCTAGACAGACTTGATCTGG
GTAAGGTAGCCAGCTTTAGAAAAAACTTTTCTTCCAAGAGTCAAACCTGTTTCGAACTTTATCCACCAAGGGATCCCACAAAGAAACGGACTTAGGGTTACTTCCAAGAG
GAAGATTTGCTTCGAACTTTATGATAGTTGTGTTAATAGGTGCCAAATATGTAAAAGTTGCTTAGTTAAAACATATGTCAGGAGAGGGGCCAGAGTCGATTATCCAATAG
GCAATAGAGAATTTTTTGGGCAAATACTTGCTAGATAAGAAGTGGTAGCCTCTGACTCTGTTTTGACAGCAACAAGTTTTGTCTCTAGCATAGCCAACATATCATGACAT
TGTGCAAGAGTGGCAGAAGGACCAACCCGTGGATCAGAAGATGAGACAACATTGGTAGGTTTATTAATGGCATTACTCGTGACAACAAGTGGAGTAGTAGGTTAAGGTTG
TTGACGTTGTCCTCCTCTTTGGTTATTAACTAGGAGGATATCCATGTACCTTATGACAACGATCTACTTTATGGCCCTGTATTCCACAATGAGTGCAGATAGGGCAATCT
TTATGCTATCGATTATTGGAACTTTGAGGAGTTTTAGAAACGACAGCGGCTTGATTAACAAAGATTTCAATTGTAGGAGTTGAAGAAAAGATGGGAGCAGATCGCTACTA
TTATTCTTGAGCAATCAATGAAACAACCTTATTAATTGTAGCAACAGGTTCCATGAGGAGAGTTTAAGAAGGAGAATGGCCATATATGACCCATTGAGTCCCATTACAAA
ATTCACAAGGTATTCATGCTGAAGAATTCATCAAGATCTTTAATGCCTCCACAATTGCATTTTTTGTAGGTGCATCCTGGTTGATAAGTGACATCTTCATCTCAAATGCT
CTTCAATTGCGGAAAATACATTGTAACAGACTGCTGATCTTGAGTAAGAGTTGCATGTTTATGCTTTAGATGAAAGATGCAAGGGCCATTCTTATGTTCAAGTCGCTTCT
TATGTTCAAGTCGCTTCTTAAGATCAAGCCAAATTGCTCTTGCCGAATCAGAAAACAAAATGGTGGAAGAAATTCCTTTCAAAATGGAATTTAAAATCCAAGCCCTAACG
TTGTTCTAGATCTAGACAGGAAGCAAAAAACGAGTCGGTTCAGTTAGGGTTCCATCAGCAAAACCTAACTTATTATGGATGGAGAGTGCCAGAATCATCGAACTACTCCA
GGTAATGTAATTGTCATTAGTGTGCAGATCAGAAATCAAAACAAGGTTTGAAGTGTCATTGTGATGAGGAAAGTAGGGTTTCTGAAAATTCTTTGACGAGGTTTGAACTT
GAATTGAAGAAGACGAGACATTTTTAGGCGCTTGAACCTCAGTATGAGAACCTTCATGAGTCATGACGAAGAAGCCGAAAAAAAGTACTTGATAGCAATAAAAGAGATAA
TTTTTCTTTGATACCATAACAAGTGTAAAATGGGAAATGAAATGTGTATCTTATTCATTTCCATAATGAAAATATTCATTCATAAAGAATGGTTGAAACCATTCAACAGA
ATAACACTTGTATAAATAACAGCATAACAACTTTACTTAACTATCTTTGTGAAATTGAAAGTTCACAACTTGTTGAAATTGTTTAAAGAATGTATAGACACTAACTATAA
AAGTTTAGATAACCTGTGCGAACCTGAAATTTACTAGTTCAAGTTTTTGCTAGTCTAAATTTGTATTTGATGGCCTTATCTTACCCCTTTTATTTGATTTCACTGGTATT
CGCTATATGTACTTTGTAAATTTGTAATCATTTATTTTTTGTTAATCTTGTTTTTTGATTACATGTCTCTCTTGCTTTTGTATTCAGAAATGGTAACTTGGCTGTTTTTG
TATAGACCCTGTTTTCTTGGCGTTGTCTATCATGTTTTTTCATTACACTTCTGTCGTGTTTCAATTCTGTAATGTAACTCGGCTGTTTTTGTAAAGACCTTGACTTCTTG
ACGTCATCTTTTCATGTATTTCTGTTTGCGTTTCTTTTGCACACCAGATGAGTACTATCAATTTCTTTCTATCAATTTCAAATCAGCTAAAAATGTCGTAAATATGTTGA
TTTGAAATTGTCGTGTGGTAATATATCAATTTCAAATCAGCTAAAAATGTCGTAAATAGGAAACTAGTCTCTCAATAACTCCCACCCAATCAAGTGTGTCACAGGGCCTG
CAAGTTGGTTTAAACTGCTTTTTGAATGTTTATGTTCCTAATCTAATATACTTGTTTTATTATTTATCTAAATGAGACCGCAGTTTAAACTTTGGGTTTTTGAACTCTTC
TTAATGTTTTAAGGTAATTGGCCTTCATATATTTTCAATTTCTTGAAGGTAATGTATGATGCTACTGGTGTCAGACTTCATGCTGGTCGTCAAGCCGAGGTAAGTTTGTC
AGAAAGAAATATAGATTCCCCGTTTTATCTTTCATTTATTATTTCCACTTCCATCTGTATCACCTTGAGTCTGATTATTATGTCCATTTTACAAACAAAATTCCCATGTT
CATCTTCTCTTCTTTAAGTCTTGTCATTTGATGATCATTGCAGTTGCTGAACCAAATTGTTTGCGAGTTTCCTCCTGAACATCCTTTGTCCAGTATTAGACCATTGCGAG
ATTCACTTGGCCACACTCCACTTCAGGTTATTGCAGGTGCTATGTTGGGATGTCTAGTGGCTTATTTGATAAGAAATCAAAATTAAAGAATGGAATTTGTACATCAGGAT
AGAATACGACATGTCGTTGGTTGAAAGCACTCGAAGCTTAAAAGTGTAAAGTAAACATGGGTGCTGCAGCTAAGGTAGTTGGACTGGGTGGTTATGGAAGATGCAGAAAT
GATATATTCTTTGGTTTTAGAACTCTCTATTGCTAAGTCTTCTTCCCCTGATCCCATCCACATTGAATTCCTTGTTTAGACTTGTCATATGTATACACACACTAGTGTTT
ATTTGAGTGAAATGTTGTTTAATATAACTGTTTCTTA
Protein sequence
MFVFNAATSMHGCPQRRVTLAHGSSTFHPPPIITYPLTRAATSAHNPFFKNGLEKKEESTDHRGVARTEVSLKSWRVISETSAAGQEELGKAYNPLEARNKPPSIFMKLS
PKDIEKVIEYFPGTLVLKDMRREVATSHRQSRCLAQSRRSFALAGCPIVVPRTPNVLCYEGCPSRSPPMSMLRMLSSGSLTKYDYSPFFYVFLPGGRNVLDLPCVGKVAI
RYGPYASKTEWSLCVEDWMIELTLFVDARARQCLMFGVERIEEIESRAEILRGFPSHKTRWVSLGIAPQVNHASLAYPSVASIVSCWRRLSAEGHEKASELPRDLVLKDI
EKVLGQPAMFLCAHRGGDVVDGTIVSDDYLPCFRRATRGTVLWCVLMDEVMTVGDAASSSIKTLPAPLLASNLPLLSAFLAGAIAQFLKLFTTWYKERKWESKRMLDSGG
MPSSHSATVTALAVAIALQEGSGGPAFAIALVFACVVCA
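
The protein sequence above is consistent with a standard-code translation of the CDS sequence: 1440 nucleotides give 479 residues plus a stop codon. As a minimal sketch, the relationship could be checked as follows. The function `translate` and the table construction are illustrative, not CuGenDB tooling, and the use of the standard genetic code (NCBI translation table 1) is an assumption. Only the first 30 and the last 12 nucleotides of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides and last 12 nucleotides of the CDS sequence above.
cds_start = "ATGTTTGTCTTCAACGCAGCGACTTCAATG"
cds_end = "GTATGTGCTTGA"

print(translate(cds_start))  # start of the protein sequence
print(translate(cds_end))    # end of the protein sequence, before the stop
```

Passing the full CDS, with its line breaks left in, to `translate` should work the same way, since whitespace is stripped before the codons are read.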