CuGenDBv2

ClCG05G009350 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G009350
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Transmembrane protein
Genome location: CG_Chr05:10194419..10204559
RNA-Seq Expression: ClCG05G009350
Synteny: ClCG05G009350
Gene Ontology terms:
GO:0005758 - mitochondrial intermembrane space (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0005507 - copper ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0016531 - copper chaperone activity (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
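
The genome location string in this record can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for records like this, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length is end - start + 1.
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("CG_Chr05:10194419..10204559")
print(chrom, start, end, length)
```

Under that assumption the gene region spans 10,141 bp on CG_Chr05.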


Homology
GenBank top hits | e value | % identity | Alignment
KAG6576996.1 hypothetical protein SDJN03_24570, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.0e-54 | % identity: 97.32
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLI+TSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA+DWPRQ V
Subjt:  EDLARDWPRQGV

XP_008456033.1 PREDICTED: uncharacterized protein LOC103496084 [Cucumis melo] | e value: 3.1e-55 | % identity: 99.11
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLARDWPRQ V
Subjt:  EDLARDWPRQGV

XP_022984157.1 uncharacterized protein LOC111482570 isoform X2 [Cucurbita maxima] | e value: 2.0e-54 | % identity: 97.32
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLI+TSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA+DWPRQ V
Subjt:  EDLARDWPRQGV

XP_023552104.1 uncharacterized protein LOC111809873 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 5.8e-54 | % identity: 96.43
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEV+VFGHGPETAIKLQGSSPHDQLLI+TSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA+DWPRQ V
Subjt:  EDLARDWPRQGV

XP_038875201.1 uncharacterized protein LOC120067716 [Benincasa hispida] | e value: 2.0e-54 | % identity: 98.21
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA DWPRQ V
Subjt:  EDLARDWPRQGV

TrEMBL top hits | e value | % identity | Alignment
A0A1S3C293 uncharacterized protein LOC103496084 | e value: 1.5e-55 | % identity: 99.11
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLARDWPRQ V
Subjt:  EDLARDWPRQGV

A0A6J1E545 uncharacterized protein LOC111430823 isoform X2 | e value: 8.2e-54 | % identity: 96.43
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSS HDQLLI+TSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA+DWPRQ V
Subjt:  EDLARDWPRQGV

A0A6J1FMS5 uncharacterized protein LOC111446881 | e value: 6.3e-54 | % identity: 96.43
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSP DQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDR+FQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA+DWPRQ V
Subjt:  EDLARDWPRQGV

A0A6J1J176 uncharacterized protein LOC111481757 | e value: 1.4e-53 | % identity: 95.54
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSP DQLLIKTSDSFSGLLLFTVGFLLFMV+FVKDR+FQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA+DWPRQ V
Subjt:  EDLARDWPRQGV

A0A6J1J9R2 uncharacterized protein LOC111482570 isoform X2 | e value: 9.7e-55 | % identity: 97.32
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLI+TSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA+DWPRQ V
Subjt:  EDLARDWPRQGV

SwissProt top hits | e value | % identity | Alignment
B3DNN5 Anaphase-promoting complex subunit 6 | e value: 2.0e-09 | % identity: 58.93
Query:  ATSFICKVTTLDGTFTPALIGYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLI
        A  +  K T +DG+F+PA IGYGN+  AQEEG QAMSAYRT A+LF G +L  L I
Subjt:  ATSFICKVTTLDGTFTPALIGYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLI

Q13042 Cell division cycle protein 16 homolog | e value: 1.8e-05 | % identity: 38.46
Query:  ATSFICKVTTLDGTFTPALIGYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLIAQCFNWRKMEFSLDGNDNLSIR
        A  ++ K TTL+ T+ PA I YG++   + E  QAM+AY T AQL  G +L +L I        +E+ L  N  L+ R
Subjt:  ATSFICKVTTLDGTFTPALIGYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLIAQCFNWRKMEFSLDGNDNLSIR

Q8R349 Cell division cycle protein 16 homolog | e value: 1.8e-05 | % identity: 38.46
Query:  ATSFICKVTTLDGTFTPALIGYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLIAQCFNWRKMEFSLDGNDNLSIR
        A  ++ K TTL+ T+ PA I YG++   + E  QAM+AY T AQL  G +L +L I        +E+ L  N  L+ R
Subjt:  ATSFICKVTTLDGTFTPALIGYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLIAQCFNWRKMEFSLDGNDNLSIR

Arabidopsis top hits | e value | % identity | Alignment
AT1G53035.1 unknown protein | e value: 3.4e-36 | % identity: 62.5
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        +LHS+IALT G LMMFY+ +  +FG G E A KL+GS+PHD+LLI+ S SFSGLLLF +G +LFMV+FVKD+EF SFFA G V+L++ MA+WRV FE K+
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA +WP+Q +
Subjt:  EDLARDWPRQGV

AT1G78770.1 anaphase promoting complex 6 | e value: 1.5e-10 | % identity: 58.93
Query:  ATSFICKVTTLDGTFTPALIGYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLI
        A  +  K T +DG+F+PA IGYGN+  AQEEG QAMSAYRT A+LF G +L  L I
Subjt:  ATSFICKVTTLDGTFTPALIGYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLI

AT3G15358.1 unknown protein | e value: 2.2e-35 | % identity: 61.61
Query:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL
        +LHS+IALT G LMMFY+ +  +FGHG + A KL+GS+PHD+ LI+ S SFSGLLLF +G +LFMV+FVKDREF SFFA G V+L++ MA+WRV FE K+
Subjt:  MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL

Query:  EDLARDWPRQGV
        EDLA + P+Q +
Subjt:  EDLARDWPRQGV
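
The % identity values listed for each hit can be recovered from the alignment midlines (the unlabeled line between Query and Subjt). The sketch below is illustrative only: it assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that each midline is as long as its alignment block. The function name `percent_identity` is hypothetical. The two midlines are copied from the first GenBank hit, KAG6576996.1.

```python
def percent_identity(midlines):
    """Percent identity from BLAST-style midline strings.

    A letter in the midline marks an identical residue; '+' marks a
    conservative substitution and a space marks a mismatch or gap.
    Assumption: each midline has exactly one character per alignment column.
    """
    columns = sum(len(m) for m in midlines)
    identical = sum(ch.isalpha() for m in midlines for ch in m)
    return 100.0 * identical / columns

# Midlines of the KAG6576996.1 alignment, one string per alignment block.
midlines = [
    "MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLI+TSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKL",
    "EDLA+DWPRQ V",
]
print(round(percent_identity(midlines), 2))
```

For this hit, 109 of the 112 aligned columns are identical, which agrees with the listed 97.32.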


Sequences
CDS sequence
ATGCTTCATTCTCTGATTGCTTTAACTTGTGGAGCTTTAATGATGTTTTACTCCCACGAGGTCTATGTGTTTGGCCATGGCCCTGAAACAGCAATCAAGCTGCAGGGATC
TAGTCCCCACGATCAGCTTCTGATAAAGACATCTGATTCTTTCTCTGGTTTGCTTCTCTTTACTGTTGGGTTTCTTTTGTTCATGGTGGCTTTTGTCAAAGATAGAGAAT
TCCAAAGCTTCTTTGCCAAAGGGTGCGTGTTGCTTCACCTAGCTATGGCCATATGGAGAGTTTATTTCGAGAGGAAGCTTGAGGATCTCGCACGAGATTGGCCTCGACAG
GGGGTTTGGGGACTGGGAAGTGAGATAGAAATTGAGGATCGATTTGATTGGTTTCCGTTAACCAGATTGAGCTTCAATTTGGTTTCTGTTCTGTGTCTGGATAGACAACA
CATTCATTTGCTCATGCTGGATATCTTCTGCACGATTATGCATTTTGCCTTTCTTTATGGTCAGAATCAAAGTTCAACTCCATGCCAAAATGGCATTTTGGACCATGAAT
GCGACACCGGGGCAAGAAGCAGCTCATGGGTCCCTCCTACCATTCAGGAAGAAAATAAAGAAAATGAGAGTGAGACCCTAAGGCTGCGACGCCACTCCCTCGTTTGCGAA
CGGCGGCGGGTATTCTCGACGGCAACCAGAACCGCGAATGCGGCTGATTCCGTCGGCGACGAACCTCTTCCTCCTTTCCAGCGTTCTCCGGCGATAGCCGGGAGAATCTC
AGGCTACGTCAGGCCACTACAATGCTGCCGCTCCTCAACGTTCGCCAAGTGCCTTTTTCCCCCTCCTTCTTTCTTTCCTGTTAATACCCGAGGATGTTTTCTCTCTTCCC
GATGGAATAACTCATTCTTTCCTCAATCTATGAGTTCAAATGTATCTTTTGCAACAAGTTTTATTTGCAAAGTTACAACTCTGGATGGAACATTTACACCTGCTTTGATA
GGTTATGGTAATGCTTGTGGAGCTCAAGAAGAGGGTGTTCAAGCCATGTCTGCCTATCGGACTGGTGCTCAATTGTTTCTAGGGTTGAATCTTGATATTCTCTTAATTGC
TCAATGCTTCAACTGGAGAAAGATGGAATTTAGTTTGGATGGCAATGACAATTTGTCCATCAGATCCTCTTGTTTATAA
mRNA sequence
ATGCTTCATTCTCTGATTGCTTTAACTTGTGGAGCTTTAATGATGTTTTACTCCCACGAGGTCTATGTGTTTGGCCATGGCCCTGAAACAGCAATCAAGCTGCAGGGATC
TAGTCCCCACGATCAGCTTCTGATAAAGACATCTGATTCTTTCTCTGGTTTGCTTCTCTTTACTGTTGGGTTTCTTTTGTTCATGGTGGCTTTTGTCAAAGATAGAGAAT
TCCAAAGCTTCTTTGCCAAAGGGTGCGTGTTGCTTCACCTAGCTATGGCCATATGGAGAGTTTATTTCGAGAGGAAGCTTGAGGATCTCGCACGAGATTGGCCTCGACAG
GGGGTTTGGGGACTGGGAAGTGAGATAGAAATTGAGGATCGATTTGATTGGTTTCCGTTAACCAGATTGAGCTTCAATTTGGTTTCTGTTCTGTGTCTGGATAGACAACA
CATTCATTTGCTCATGCTGGATATCTTCTGCACGATTATGCATTTTGCCTTTCTTTATGGTCAGAATCAAAGTTCAACTCCATGCCAAAATGGCATTTTGGACCATGAAT
GCGACACCGGGGCAAGAAGCAGCTCATGGGTCCCTCCTACCATTCAGGAAGAAAATAAAGAAAATGAGAGTGAGACCCTAAGGCTGCGACGCCACTCCCTCGTTTGCGAA
CGGCGGCGGGTATTCTCGACGGCAACCAGAACCGCGAATGCGGCTGATTCCGTCGGCGACGAACCTCTTCCTCCTTTCCAGCGTTCTCCGGCGATAGCCGGGAGAATCTC
AGGCTACGTCAGGCCACTACAATGCTGCCGCTCCTCAACGTTCGCCAAGTGCCTTTTTCCCCCTCCTTCTTTCTTTCCTGTTAATACCCGAGGATGTTTTCTCTCTTCCC
GATGGAATAACTCATTCTTTCCTCAATCTATGAGTTCAAATGTATCTTTTGCAACAAGTTTTATTTGCAAAGTTACAACTCTGGATGGAACATTTACACCTGCTTTGATA
GGTTATGGTAATGCTTGTGGAGCTCAAGAAGAGGGTGTTCAAGCCATGTCTGCCTATCGGACTGGTGCTCAATTGTTTCTAGGGTTGAATCTTGATATTCTCTTAATTGC
TCAATGCTTCAACTGGAGAAAGATGGAATTTAGTTTGGATGGCAATGACAATTTGTCCATCAGATCCTCTTGTTTATAATGAGCATGGAGTTGTTGCTTATGACATGAAG
GAGTATCTATATTTTTTTTCTTTATTCTTTATATTTCTGCATTTTTCACACAATAATTTTGTTTGAAGTTTTGCTTCCTGGCTCAGATAGTGGTTTGTTTTTATGTATAT
TGTGATTTCAGTACAAAATAGTGCAGATCATGAGAAACCAAGTAACTTCTATATATTTTACTTGTATATGATCATGAAGTTTTACGTATTCAAGTGTCAGTAATTTTCTC
TAGTCCTTCAAAAAGTTTGCATAAAAATAAAACATACCCTTTGAGATACGTTACTTACCATGGTTTTTGTATTCTGATTTTTAAGTTGTAAACAGTGTATTTAATCCATG
GTCCCACAATCTTTGTACTCCATCAATAGATGGTTGATTCTTTAATCAGGAGTGAGAAGGATTAAAGAACATTGACTTTATAACATGTCTTCTCCTAGACCTAGGCACGC
CGCACATATAAGTTAAGTGTATAGTTAAGACCATTGAATTTGGTCAAGTTTCTTAGAAATGATAATGGCAATTGCTTTTTGAAATTTCAGTCTTTGCAGACGAGAGAGGA
AACTTCTAAAAATAGGAAATCATTTGGAAGGTTACAACTAGGCACAAAGCCGTTGGGACTTTTTGAATTAGGAATACGTTGAATGAGCGATTTGGAACACCTTAAACAAG
GAGGAATTGAAAATGAGAAAAGATATTCTGCTGAAAGTGGTTTCTATTGAATACTTCTACCCTTCTACCTTTTTCTTTTTATTCTCAATTTTTAAGTTTATCTTTGTTGA
AGTTCTAATAATTAAATGCCAATCCTTAATTTAGGACAGTCCCTGCTTGCAGTTTTGTTTCATGTAAATCATCGGAAACGTTGTGCTTGGTACTGGAAATGATTCGATGA
AACTCTTAAGCAAGAAATTAACTTGGTAAGTCATTTTTATTTTGAGAAGAAGTTGCAAAGATGAAAAGCTTTAGGTTAGTTTTATGGGTTTCATTCTTTAGGTCCCAAAA
TTTTTTTTGGGTTTCTTCCTATGGTTCAAGAAATTTTCTTGATCTTGTGTGAATTCTTATTTGGGGTTGGCTCAAGTTAAAGAAGTTAAAAGAAGTAGAGATCTAAATTT
TTTTAGGCCGATTTCTATGGATTTATCTTGAAACAAAAGTAAAGAAATTAGATGAGGCTTAGAGAGAGAAAATATTAGAAATATTAGTTTGCTTGTGCTATGTTATTTCT
TGACGTTTATGTGAAAAAGATTATAAAGAAGAGAAGTGTTGGATTGGAATAGAGGAAACAAAAAAATGTTGGTTCTATGTCTTTGGTTTAATGAGATGGAGATTGGAGTT
TTTTCTATGAAGATTCGGATAGTTGTTGGAATGGAAGAAGAGAAAAGGAAACGAAGGAAATGAGCAAAACCTTGTAGATAGAAAAAGGAAGAAGAAGATGAATGGTAAAA
TTGGGGAGTTAGTGAAATTAATTGCATGATGGATGTTAGCTTTGGATCTTGGAATTAATGAGCATGATGTTTTAATTGAAAACAAAGAAAGTAATCATGCTAAGTTCAAA
TTAATTATCCGAGGAACGAAGATTGGGATGGCTAATCACAACTTGGAATCCTTGCTTCTATAATGTTGTGAGTGGTTTATTTTGACCAAATGTTTTTACTTTATTTAGCT
ATGTTTTATGTTGTTTCAAGAAAACTACCTATGTAGATTATGTATGATTTAGAAAAGTTACAAGAGTTTCTAAAGCAGCAATAGATTTTTTAGAAAGTTCATGATTTTAG
TACAAGTTGTGGTTTTAAAATTATTTTAGATATTTTATGACTATTTGAGAATTTTGAAATATATTACTTTGTTGAGGAATGAAAATGATATTTTAAGAGTTTGAAAGATC
AATCATTGAGCTTTGATTGAAATGTGTTCAAAGAGAAAAAGTATGAAATAGAATGTTTGAGAATGTAGAGTTTTATGTTGGAAGTGGTTTTCAAACCACTAGATCTTTGT
ACCCATTGTGATTACCTGTATGCTACGTGAATGTGGTTCTGTAGTATACTAGAGAGAGAGAGAGATTGGTCTTTGGGACCCCACTTCGACACATATATAGGCTTTGGATC
GTGTGTCGGGGTGCTTTGCCCTTTCGGGGCCGCTCTGCGTGTACTATCACAAAAATTTGCTCTGTTATGCACCTTGAGAGTACTTGATATATAGTATGATGTAGATTGCT
TTGCTATATACCTAGAGATTTCTAAGTACATAGTATGATAAAGATTTACTCTGCTGTATGTCTGGAGATTACTCGATGTTGATGCATAATATGACAAACATTGCTCTGTT
GTGCGTATTGAGATTACTCTACGTTAGCATGACAAAGATTGCTCTGCTGTGCGTATTGAGATTACTCTACGCTAGTATGACAAAGATTTGGGTACATGTTTGGTGATGGT
TTTTACCTACTTGTTGCTATGTTTTCAAAACTACTTTTACGAAATTTATGTTTTTTTAATTATCTTGAGTTTTGAAGCTGCTCATGATTTCTGTTGATTTACAAAGTTTT
AAATTTGCAATGAGAATGTGAAATCTTTCATTTAAAATTATTTTCTTATGAAATCATACCACTCGCTAGGTTTTTTAGCCCATTATTTTCCTTAAATTTTTCTACCAGGT
AGCACGAGAGCATACAATGGGCCTACTGAGGTTGAGAACTACCACCAATCTTTTGCATGTTATGTTTCATTTTACATCGTAAATATTTTAAAATTTGTTTATCTTTGAAA
CTAAAGGGGCTAGCATTACTTTTGATGATGTTTACCATTCTTTCAATTTTATGATGGTCGTAGATTATAGAACTTTAATGCTTTTCCATTGTTGGTTTATGATAAGGGTT
GCCTTCATAAGAAGAATTTGGGAAGCTTTGAAGTTGTGTGCTTGTGGGAATTTTTCCTGAAAGGTTGTTGTACGACAAATTCAATGCTGCCAGCCTTGATATTTCAGAAA
GAGTAGTTGGAATTGGACCGTATAGCCGATTTCTTGATAGATCCAGAGACTCCAATGATTGAAGTTGGCCCATGTTACCAGGAATTGGACCTCTTAATTCATTGCTGGAC
AGATTCAAAGTAATCAAATCAACAAGCTCGGTGATTTCCTCAGGAATTTTCCCAGTCAAATGATTGCAGGATAGATCAATACTTCTTTCAAGTCTGAATACTTCTCCTGT
GATGAGTCTCTCTTGACCTTTCAACACCATTATCAAATCTTCCCTAATAATGATTTTGGACAGGATTCCCACAGCAGGATAGTTGTAGTCAAAATCATCAATTGTGTTAA
ATGTTTGAAGGCTTGGAAATTATGAATGCAGGTTAGTAGGCTTCCAGAAACATTATTGGATGAAATGTCCAATACTCTAATCTTTTTGAGGTTGCAAATACTTGCAGGCA
AATTTCCATGAAAGTTGTTTGACTTCAATTCAGGTAACTCAAATTAGGCATCTTTGATCCAATCCACAATGGTATTGCTCCAAATAAATTGTTATTCATTGCATCAATGA
CTTTCAAAGATGTCAAGTTGGACAAAGAATCCAACCCTCCAGAGAATTGATTGTTGCGTAAAACTAAGGTTTCCAAACGACTGAGATTAGTCATAGGGTGTGGAAGATGT
CCGGAAAAGTAATTATTGGAGAAGTTTAATCGGAGTAAATTGACCATGGAGTCCCAACAACTAGGAAGTTGTCCTGATAATTGGTTGTTAGAGAGATCCAAATCCAGTAA
AGGAGAGGTGACAATTTGACAGAAGCAAGTTAAGTCGGAAAAATTATTATTTGAAAGTCTGAGGAAATATGATTGAAAAAGAAAGGCGGGAATTTGCCCCACAAATTTGT
TTGAGTCTAAATTCATATATTTCATTCTTTGGAACTTTACTGACAAGTCAGGAGTTTCACCCATGATTTTATTATCGGAAAGATCCAGAAATAGAAGGTTTGGAAACAAG
GTGGTCCAAAACCAATGAGGAATGATGTCTGAAATTTCACCAAATGAAATATCAATCACAGAAGATATATTTTGAGTTTGAAGCCACCCGGG
Protein sequence
MLHSLIALTCGALMMFYSHEVYVFGHGPETAIKLQGSSPHDQLLIKTSDSFSGLLLFTVGFLLFMVAFVKDREFQSFFAKGCVLLHLAMAIWRVYFERKLEDLARDWPRQ
GVWGLGSEIEIEDRFDWFPLTRLSFNLVSVLCLDRQHIHLLMLDIFCTIMHFAFLYGQNQSSTPCQNGILDHECDTGARSSSWVPPTIQEENKENESETLRLRRHSLVCE
RRRVFSTATRTANAADSVGDEPLPPFQRSPAIAGRISGYVRPLQCCRSSTFAKCLFPPPSFFPVNTRGCFLSSRWNNSFFPQSMSSNVSFATSFICKVTTLDGTFTPALI
GYGNACGAQEEGVQAMSAYRTGAQLFLGLNLDILLIAQCFNWRKMEFSLDGNDNLSIRSSCL
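
The relationship between the CDS and protein sequences can be checked by translating codons. This is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the function name `translate` and the compact table construction are illustrative choices, not part of the database. Only the first 36 nucleotides and the last 15 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGCTTCATTCTCTGATTGCTTTAACTTGTGGAGCT"))  # start of the CDS
print(translate("TCCTCTTGTTTATAA"))                       # end of the CDS
```

The first call yields MLHSLIALTCGA and the second yields SSCL, matching the start and end of the protein sequence shown in this record.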