CuGenDBv2

Cmc06g0175151 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc06g0175151
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Heavy metal transport/detoxification superfamily protein
Genome location: CMiso1.1chr06:32026155..32028244
RNA-Seq Expression: Cmc06g0175151
Synteny: Cmc06g0175151
Gene Ontology terms:
    GO:0016020 - membrane (cellular component)
    GO:0046872 - metal ion binding (molecular function)
InterPro domains:
    IPR006121 - Heavy metal-associated domain, HMA
    IPR036163 - Heavy metal-associated domain superfamily


Homology
GenBank top hits | e-value | % identity | Alignment
XP_004151526.2 heavy metal-associated isoprenylated plant protein 34 [Cucumis sativus] | e-value: 5.3e-92 | % identity: 93.68
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS
        MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFG+D +NHS
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS

Query:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
        SYSNPY SYNEQSHWFDR YPNLQRPQPYPWQLMLPQPQPQPV WPM+WPGWP PDNQ+LDGNQ+N QRCCTVM
Subjt:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM

XP_008449023.1 PREDICTED: uncharacterized protein LOC103491020 isoform X1 [Cucumis melo] | e-value: 8.8e-95 | % identity: 86.57
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIY---------------------------GIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAE
        MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIY                           GIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAE
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIY---------------------------GIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAE

Query:  VRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTV
        VRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTV
Subjt:  VRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTV

Query:  M
        M
Subjt:  M

XP_008449024.1 PREDICTED: uncharacterized protein LOC103491020 isoform X2 [Cucumis melo] | e-value: 3.5e-99 | % identity: 100
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS
        MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS

Query:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
        SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
Subjt:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM

XP_038905213.1 uncharacterized protein LOC120091309 isoform X1 [Benincasa hispida] | e-value: 5.0e-74 | % identity: 72
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNI------------------------YGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRS
        M+SMDYSVA+MSCVL+ASVQCEACKAK+QEILQNI                         G+YTITMDS+DGSVRICGRVNPRTFLKVIE SGKHAEV+S
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNI------------------------YGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRS

Query:  IRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLML--PQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
        IRFDGEAGDRRYYP +G+D +N+ SY N Y SY EQ HWFDR+YPNL RPQPYPWQLML  PQPQPQPVP P+IWPGWPHPDNQ L+ N++NNQRCCTVM
Subjt:  IRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLML--PQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM

XP_038905214.1 heavy metal-associated isoprenylated plant protein 42 isoform X2 [Benincasa hispida] | e-value: 5.7e-78 | % identity: 81.82
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS
        M+SMDYSVA+MSCVL+ASVQCEACKAK+QEILQNI G+YTITMDS+DGSVRICGRVNPRTFLKVIE SGKHAEV+SIRFDGEAGDRRYYP +G+D +N+ 
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS

Query:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLML--PQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
        SY N Y SY EQ HWFDR+YPNL RPQPYPWQLML  PQPQPQPVP P+IWPGWPHPDNQ L+ N++NNQRCCTVM
Subjt:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLML--PQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM

TrEMBL top hits | e-value | % identity | Alignment
A0A0A0L264 Uncharacterized protein | e-value: 9.9e-68 | % identity: 91.67
Query:  MDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQP
        MDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFG+D +NHSSYSNPY SYNEQSHWFDR YPNLQRPQPYPWQLMLPQPQPQP
Subjt:  MDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQP

Query:  VPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
        V WPM+WPGWP PDNQ+LDGNQ+N QRCCTVM
Subjt:  VPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM

A0A1S3BL42 uncharacterized protein LOC103491020 isoform X1 | e-value: 4.3e-95 | % identity: 86.57
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIY---------------------------GIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAE
        MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIY                           GIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAE
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIY---------------------------GIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAE

Query:  VRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTV
        VRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTV
Subjt:  VRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTV

Query:  M
        M
Subjt:  M

A0A1S3BLR2 uncharacterized protein LOC103491020 isoform X2 | e-value: 1.7e-99 | % identity: 100
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS
        MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS

Query:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
        SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
Subjt:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM

A0A5A7VH99 Chitin-binding lectin 1-like | e-value: 1.7e-99 | % identity: 100
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS
        MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS

Query:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
        SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
Subjt:  SYSNPYHSYNEQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM

A0A6J1EW45 uncharacterized protein LOC111438728 | e-value: 4.3e-63 | % identity: 72.57
Query:  MDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHSSYS
        MD+S+AEMSCVL+AS+QCEAC AK+QEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIE+SGKHAEV+SIRFDGEAGDRRYYP +G+D  +H    
Subjt:  MDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHSSYS

Query:  NPYHSYNEQSHWFDRSYPNLQ--RPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNN--QRCCTVM
         PY +  EQS WFD +YP      PQPYPWQ MLPQPQPQPVPWPMI PG P  +    + +QDNN  QRCCT+M
Subjt:  NPYHSYNEQSHWFDRSYPNLQ--RPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNN--QRCCTVM

SwissProt top hits | e-value | % identity | Alignment
B3H6D0 Heavy metal-associated isoprenylated plant protein 45 | e-value: 1.1e-04 | % identity: 26.73
Query:  LNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS-----SYSNPYHSY
        L   + C+ C+ K++  +  + G+ T+ +D D   V + G V+    LK+++++G+ AE     ++G  GD   YPS   + ++       SYS  Y  Y
Subjt:  LNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHS-----SYSNPYHSY

Query:  N
        +
Subjt:  N

F4JZL7 Heavy metal-associated isoprenylated plant protein 33 | e-value: 6.3e-11 | % identity: 42.86
Query:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV
        +CVL  ++ C+ CK K+++ILQ I G++T  +D++ G V + G V+P   +K + KSGKHAE+
Subjt:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV

Q0WV37 Heavy metal-associated isoprenylated plant protein 34 | e-value: 2.8e-11 | % identity: 34.69
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTN
        MN  D    + +CVL  +V CE CK K+++ LQ I G+Y++  D + G V + G ++P   +K + KSGKHAE+      G  G  + +P+      N
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTN

Q9CAV5 Heavy metal-associated isoprenylated plant protein 42 | e-value: 3.9e-05 | % identity: 24.24
Query:  MNSMDYSVAEMSCVLNASVQ-CEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTN
        M  +D+ +    C+L  ++Q CE   ++++++L+ + G+Y IT+D   G + +CG   P   +K + K G+  ++ +   D      R+        TN
Subjt:  MNSMDYSVAEMSCVLNASVQ-CEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTN

Q9M8K5 Heavy metal-associated isoprenylated plant protein 32 | e-value: 7.4e-12 | % identity: 44.44
Query:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV
        +CVL  ++ C+ CK K+++ILQ I G++T  +DS+ G V + G V+P   +K + KSGKHAE+
Subjt:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV

Arabidopsis top hits | e-value | % identity | Alignment
AT3G05220.1 Heavy metal transport/detoxification superfamily protein | e-value: 2.0e-12 | % identity: 34.69
Query:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTN
        MN  D    + +CVL  +V CE CK K+++ LQ I G+Y++  D + G V + G ++P   +K + KSGKHAE+      G  G  + +P+      N
Subjt:  MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTN

AT3G06130.1 Heavy metal transport/detoxification superfamily protein | e-value: 5.3e-13 | % identity: 44.44
Query:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV
        +CVL  ++ C+ CK K+++ILQ I G++T  +DS+ G V + G V+P   +K + KSGKHAE+
Subjt:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV

AT3G06130.2 Heavy metal transport/detoxification superfamily protein | e-value: 5.3e-13 | % identity: 44.44
Query:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV
        +CVL  ++ C+ CK K+++ILQ I G++T  +DS+ G V + G V+P   +K + KSGKHAE+
Subjt:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV

AT3G13140.1 hydroxyproline-rich glycoprotein family protein | e-value: 1.6e-14 | % identity: 31.58
Query:  MDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEA----GDRRYYPSFGE--DTT
        MDY+V +MSCV+  +  C+ C+ K+ E++  +YG+Y++    DD S+++  RVNP   L V E+ G+H ++ ++RFDGE     G   YY   G    TT
Subjt:  MDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEA----GDRRYYPSFGE--DTT

Query:  NHSSYSNP-YHSYNEQSHWFDRSY----PNLQRPQPYPWQLMLPQPQPQ----PVPWPMIWPGWPHPDNQY
        +  +Y+ P  + Y    H +  +Y     N  R    P Q   P   P     P P P     + + + QY
Subjt:  NHSSYSNP-YHSYNEQSHWFDRSY----PNLQRPQPYPWQLMLPQPQPQ----PVPWPMIWPGWPHPDNQY

AT5G19090.1 Heavy metal transport/detoxification superfamily protein | e-value: 4.5e-12 | % identity: 42.86
Query:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV
        +CVL  ++ C+ CK K+++ILQ I G++T  +D++ G V + G V+P   +K + KSGKHAE+
Subjt:  SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV
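
The % identity values in the hit tables can be cross-checked against the alignment match lines (the unlabeled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention: a letter on the match line marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is illustrative and not part of CuGenDB; the example strings are copied from the F4JZL7 alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity for a single alignment block.

    query   -- aligned query residues (gap columns written as '-')
    midline -- match line: letter = identical residue,
               '+' = conservative substitution, space = mismatch or gap
    Assumption: the alignment length equals the length of the query line.
    """
    if not query:
        raise ValueError("empty alignment")
    # Only letters on the match line count as identities.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Query and match line copied from the F4JZL7 (SwissProt) hit.
query = "SCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEV"
midline = "+CVL  ++ C+ CK K+++ILQ I G++T  +D++ G V + G V+P   +K + KSGKHAE+"
print(percent_identity(query, midline))
```

For alignments that wrap over several Query/Subjt blocks, the identical counts and block lengths would need to be summed across blocks before dividing.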


Sequences
CDS sequence
ATGAACTCAATGGATTACTCTGTCGCAGAAATGAGTTGTGTTCTTAATGCAAGTGTTCAATGTGAAGCATGCAAAGCAAAAATCCAAGAAATTCTACAAAACATCTATGG
TATTTACACAATCACAATGGATTCAGATGATGGATCAGTCAGAATCTGTGGAAGAGTGAATCCAAGAACATTCTTGAAAGTAATTGAAAAATCAGGCAAACATGCAGAGG
TGAGAAGTATAAGATTTGATGGTGAAGCTGGAGATAGAAGATATTACCCTTCTTTTGGAGAAGATACTACCAATCATTCTTCATACTCAAACCCTTATCATAGCTATAAT
GAACAATCTCATTGGTTTGACAGATCTTACCCCAACCTACAGCGGCCGCAGCCATATCCTTGGCAATTAATGCTACCACAGCCGCAGCCCCAACCTGTGCCATGGCCAAT
GATATGGCCGGGATGGCCGCATCCTGATAATCAATACCTTGACGGAAACCAAGATAACAATCAGAGATGTTGTACGGTTATGTAA
mRNA sequence
TGCTGAGAATGAAAATTGAAACTCCCACAAATGGTTAAATTTGATTTTCACAAAAAATAGAAAAGAAAAAGAAGAAGAAGAGAGATTTCAATTGATCTGAAATCTCTAAT
CATAGAATCTTTCTATTTAGATTTCAACATTTGAAAGAGAGAAATATTAGGAGAAATTGAAGTTGGTGACACTATCGGTTGAAAAAATAGTCAACTTGAGAAAAGATGAC
ACCACCATCTCTCCTCTTTATAATCCCCTTTGTTCTCTTCTCCATCAAACTGCTGCAAAAACAGAACTTAGAAGAAACCCCTCCCTTGAACCTTTTCAACGATCTCCATA
AATGAACTCAATGGATTACTCTGTCGCAGAAATGAGTTGTGTTCTTAATGCAAGTGTTCAATGTGAAGCATGCAAAGCAAAAATCCAAGAAATTCTACAAAACATCTATG
GTATTTACACAATCACAATGGATTCAGATGATGGATCAGTCAGAATCTGTGGAAGAGTGAATCCAAGAACATTCTTGAAAGTAATTGAAAAATCAGGCAAACATGCAGAG
GTGAGAAGTATAAGATTTGATGGTGAAGCTGGAGATAGAAGATATTACCCTTCTTTTGGAGAAGATACTACCAATCATTCTTCATACTCAAACCCTTATCATAGCTATAA
TGAACAATCTCATTGGTTTGACAGATCTTACCCCAACCTACAGCGGCCGCAGCCATATCCTTGGCAATTAATGCTACCACAGCCGCAGCCCCAACCTGTGCCATGGCCAA
TGATATGGCCGGGATGGCCGCATCCTGATAATCAATACCTTGACGGAAACCAAGATAACAATCAGAGATGTTGTACGGTTATGTAAAAGTTTTGAATAAGATTAAATGTA
GAAACTATTAATGATCACTCAATATATGAATGCAATAGTTTCTTCCTTCGTTTCTTGAAGACCAATACCAAATTCTTCTTGCTCCTTACACTAGATTTTTACCAATTTTT
TTTTTTGTGAAGCAAAGTTCTTTAAAGCAAGAAATAAAGTAAAAGGGTATTTCATTAGAAAGGGGTATTTTTAATATACCATTATATGTGACAAGGTTTTTACAAGATTC
ATTAGTATCCCATCAAGATCAGCATATACAGCCAAACAAAAGCTTTGACCAAAAAAGAAAAAAATATATATAAAAAAGACACAGATACTAATATTTCTGCATCTTAAACT
AACATCACATTCATTGACCTAGCTATACAAACACCTTAAAAACCGGGTCATTTTTTCGTTGAGGAATTTCATCAAACGCATGGTGGGTAAATTCAAAGTTGGTCAGGCCA
TAAGAACGTCCCAATAGCAAATGCCCTTTTCTCCTCAAGAGTACCATCAAAACAGAGTCCATATTGTTTCAGAAGCAAATCCAACTTACCTTCCTCCATTTCCTCATAAT
CCCTCTTCGTGTACCTAGGATAATGAAGCGGCATTTGAAACCCACTTTCAAAATTCAATCCTCCACCATAAAAACAATCTCGCCCCTCCTTCACTTGTTCCACGTTCTTG
ATTCCTTCTCCTCCTGCCTCCTTCTGATTCTGTTGTTGCTTCTCGGTGAGACCCAGAACTTGAGTAAAGGCAGAATTGAGAATCCACTTCAACCCCATTTACAAGAAAAA
GGGAAAAAGTTGAAGAAGAGGACAAAGAGAGCGATGTAAAGCGTTTTTTTTTCTCTTCTGACTTGGGTATGAAGAAGAAGGGTAGTTGAATCGCTTATATATATATATAA
AGGCGTTAGATTGTTGGAATTTTCTTGAGGAGAAAGGCAAAAATGAGGAACAGTTGTTAAGAATTGAGTGGCGGTAATTGGCATTTGTTGCTAGAACGTTCTTCGAGGAG
GTTATGACGAGGCCACGTTAGCCCATTGTTTTGTTGGAGTTGTT
Protein sequence
MNSMDYSVAEMSCVLNASVQCEACKAKIQEILQNIYGIYTITMDSDDGSVRICGRVNPRTFLKVIEKSGKHAEVRSIRFDGEAGDRRYYPSFGEDTTNHSSYSNPYHSYN
EQSHWFDRSYPNLQRPQPYPWQLMLPQPQPQPVPWPMIWPGWPHPDNQYLDGNQDNNQRCCTVM
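
The CDS and protein sequences listed for this gene should be related by the standard genetic code (525 nt, i.e. 174 codons plus a stop, against 174 residues). A minimal sketch of that check follows; it assumes the standard nuclear code table, and the names `CODON_TABLE` and `translate` are illustrative, not part of any CuGenDB tooling. The example input is the first 39 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from a wrapped sequence.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons of the CDS listed above.
print(translate("ATGAACTCAATGGATTACTCTGTCGCAGAAATGAGTTGT"))  # MNSMDYSVAEMSC
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result to the protein sequence is one way to confirm that the two records agree.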