; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007963 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007963
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionHeavy metal transport/detoxification superfamily protein
Genome locationChr10:17966155..17974330
RNA-Seq ExpressionHG10007963
SyntenyHG10007963
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR006121 - Heavy metal-associated domain, HMA
IPR007493 - Protein of unknown function DUF538
IPR036163 - Heavy metal-associated domain superfamily
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PQQ13102.1 uncharacterized protein Pyn_11052 [Prunus yedoensis var. nudiflora]1.5e-7147.01Show/hide
Query:  MGKLKKMAKVFDCFSPSSCTSTSCFCINSMEVQDEEDEFFDKQPLIPTNNKNNQLLTLKDV-VNGNQTLAFQLKPKMVTLRVSMHCKGCAKKVEKHISKM
        MGKL  + +V D F  SS  S+SCFC+NSME +DE    F++ PL+    + +Q L LKDV   G QTLA QLKPKMV LRVSMHC GCA+KVEKHISK+
Subjt:  MGKLKKMAKVFDCFSPSSCTSTSCFCINSMEVQDEEDEFFDKQPLIPTNNKNNQLLTLKDV-VNGNQTLAFQLKPKMVTLRVSMHCKGCAKKVEKHISKM

Query:  EGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRDLSTVSLPPGFVRRRSEPRRRRQFLRRKSNEAVTASSFYADLPRFLPSSPVITVAQTRALVSLSTQ
        EGV+SY +DLE+KMV+++GD+LP EV+ESVSK                                                                    
Subjt:  EGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRDLSTVSLPPGFVRRRSEPRRRRQFLRRKSNEAVTASSFYADLPRFLPSSPVITVAQTRALVSLSTQ

Query:  SIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRS
                         L +YGFP GLLPS V  Y+++ TSGDF +DLG +CK TLPPDNY+A++S+ +TGKI +G+I  ++GI VRA FQWWSITGIRS
Subjt:  SIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRS

Query:  TDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        + ++LVFEVG++TAKYPSK+F+ESP CEGR S+S
Subjt:  TDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

XP_004148715.1 uncharacterized protein LOC101218800 [Cucumis sativus]3.2e-6992.96Show/hide
Query:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
        AL+SLSTQSIDSK+L IPPSSAHARLADYGFPFGLLPS+V++Y+IN+TSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
Subjt:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW

Query:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        WSITGIRST EDLVFEVGLITAKYPSKSFNESPVCEGRRS+S
Subjt:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

XP_008463372.1 PREDICTED: uncharacterized protein LOC103501542 [Cucumis melo]1.9e-6993.66Show/hide
Query:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
        AL+ LSTQSIDSK+L IPPSSAHARLADYGFPFGLLPS+V+TY+INETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
Subjt:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW

Query:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        WSITGIRST EDLVFEVGLITAKYPSKSFNESPVCEGRRS+S
Subjt:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

XP_022961236.1 uncharacterized protein LOC111461806 [Cucurbita moschata]2.1e-6892.25Show/hide
Query:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
        ALVSLSTQS DS DLK PPSSAHARLA+YGFPFGLLPS+V++Y+INETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
Subjt:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW

Query:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        WSITGIRST EDLVFEVG+ITAKYPSKSFNESPVCEGRRS+S
Subjt:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

XP_038878918.1 uncharacterized protein LOC120071008 [Benincasa hispida]1.7e-7095.77Show/hide
Query:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
        ALVSLSTQSIDSKDLKIPPS+AHARLADYGFPFGLLPSSV+TY+INETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
Subjt:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW

Query:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        WSIT IRST EDLVFEVGLITAKYPSKSFNESPVCEGRRS S
Subjt:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

TrEMBL top hitse value%identityAlignment
A0A0A0LTW9 Uncharacterized protein1.5e-6992.96Show/hide
Query:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
        AL+SLSTQSIDSK+L IPPSSAHARLADYGFPFGLLPS+V++Y+IN+TSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
Subjt:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW

Query:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        WSITGIRST EDLVFEVGLITAKYPSKSFNESPVCEGRRS+S
Subjt:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

A0A1S3CJ18 uncharacterized protein LOC1035015429.1e-7093.66Show/hide
Query:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
        AL+ LSTQSIDSK+L IPPSSAHARLADYGFPFGLLPS+V+TY+INETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
Subjt:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW

Query:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        WSITGIRST EDLVFEVGLITAKYPSKSFNESPVCEGRRS+S
Subjt:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

A0A314YWI6 HMA domain-containing protein7.4e-7247.01Show/hide
Query:  MGKLKKMAKVFDCFSPSSCTSTSCFCINSMEVQDEEDEFFDKQPLIPTNNKNNQLLTLKDV-VNGNQTLAFQLKPKMVTLRVSMHCKGCAKKVEKHISKM
        MGKL  + +V D F  SS  S+SCFC+NSME +DE    F++ PL+    + +Q L LKDV   G QTLA QLKPKMV LRVSMHC GCA+KVEKHISK+
Subjt:  MGKLKKMAKVFDCFSPSSCTSTSCFCINSMEVQDEEDEFFDKQPLIPTNNKNNQLLTLKDV-VNGNQTLAFQLKPKMVTLRVSMHCKGCAKKVEKHISKM

Query:  EGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRDLSTVSLPPGFVRRRSEPRRRRQFLRRKSNEAVTASSFYADLPRFLPSSPVITVAQTRALVSLSTQ
        EGV+SY +DLE+KMV+++GD+LP EV+ESVSK                                                                    
Subjt:  EGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRDLSTVSLPPGFVRRRSEPRRRRQFLRRKSNEAVTASSFYADLPRFLPSSPVITVAQTRALVSLSTQ

Query:  SIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRS
                         L +YGFP GLLPS V  Y+++ TSGDF +DLG +CK TLPPDNY+A++S+ +TGKI +G+I  ++GI VRA FQWWSITGIRS
Subjt:  SIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRS

Query:  TDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        + ++LVFEVG++TAKYPSK+F+ESP CEGR S+S
Subjt:  TDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

A0A5D3BM00 Uncharacterized protein9.1e-7093.66Show/hide
Query:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
        AL+ LSTQSIDSK+L IPPSSAHARLADYGFPFGLLPS+V+TY+INETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
Subjt:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW

Query:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        WSITGIRST EDLVFEVGLITAKYPSKSFNESPVCEGRRS+S
Subjt:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

A0A6J1JBL6 uncharacterized protein LOC1114852981.0e-6892.25Show/hide
Query:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
        ALVSLSTQS DS DLK PPSSAHARLA+YGFPFGLLPS+V++Y+INETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW
Subjt:  ALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQW

Query:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS
        WSITGIRST EDLVFEVG+ITAKYPSKSFNESPVCEGRRS+S
Subjt:  WSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRRSAS

SwissProt top hitse value%identityAlignment
Q58FZ0 Protein SODIUM POTASSIUM ROOT DEFECTIVE 24.1e-1146.67Show/hide
Query:  KMVTLRVSMHCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRD
        ++V L+VS+HC+GC  KV KH+++M+GV+S+ ID   K V + GDI P E+++S+SK ++
Subjt:  KMVTLRVSMHCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRD

Q84J88 Heavy metal-associated isoprenylated plant protein 363.4e-0537.74Show/hide
Query:  LRVSMHCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSK
        LRVS+HC+GC +K++K +SK++GV +  ID++ + V ++G++ P  +++ + K
Subjt:  LRVSMHCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSK

Q8LDS4 Protein SODIUM POTASSIUM ROOT DEFECTIVE 13.7e-1253.33Show/hide
Query:  KMVTLRVSMHCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRD
        ++V LRVS+HCKGCA KV+KH+SK++GV+SY ID   K V + GD+ P  V+ S+SK ++
Subjt:  KMVTLRVSMHCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRD

Q8RXH8 Protein SODIUM POTASSIUM ROOT DEFECTIVE 31.2e-1045.33Show/hide
Query:  KMVTLRVSM--HCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRDLSTVSLPPGFVRR
        ++V LRVS+  HC+GC  KV+KH+SKM+GV+S+ ID  +K V + GDI P EV+  +SK ++    + PP  + R
Subjt:  KMVTLRVSM--HCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRDLSTVSLPPGFVRR

Q94BT9 Copper transport protein ATX13.4e-0540Show/hide
Query:  VTLRVSMHCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSK
        V LRV+M C+GC   V++ + KMEGV S+ +D++ + V + G++ P  V+++V+K
Subjt:  VTLRVSMHCKGCAKKVEKHISKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSK

Arabidopsis top hitse value%identityAlignment
AT1G02816.1 Protein of unknown function, DUF5383.3e-1632.17Show/hide
Query:  SAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRSTDEDLVFEVGLI
        +A+  L  Y FP G+LP  V +Y +++++G F      SC F L   +Y   +   ++G I++ +I  L G++V+ LF W +I  +    ++L F VG+ 
Subjt:  SAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRSTDEDLVFEVGLI

Query:  TAKYPSKSFNESPVC
        +A +    F ESP C
Subjt:  TAKYPSKSFNESPVC

AT3G24450.1 Heavy metal transport/detoxification superfamily protein5.7e-3250.34Show/hide
Query:  MGKLKKMAKVFDC-FSPSSCTSTSCFCINSMEVQDEEDEFFDKQPLIPTN-NKNNQLLTLKDVV--NGNQTLAFQLKPKMVTLRVSMHCKGCAKKVEKHI
        MGKL+K+ +V+DC F P++    SCFC+N++    +E+E F+K+PLI ++  K+ +++ LKDVV  +  QTLAF LKPK+V L+VSMHC GCAKKVEKHI
Subjt:  MGKLKKMAKVFDC-FSPSSCTSTSCFCINSMEVQDEEDEFFDKQPLIPTN-NKNNQLLTLKDVV--NGNQTLAFQLKPKMVTLRVSMHCKGCAKKVEKHI

Query:  SKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRDLSTVSLP
        SK++GV+ Y ++LE+K V++ G+ILP +V+ES+ K ++    S P
Subjt:  SKMEGVSSYTIDLETKMVIIVGDILPFEVVESVSKCRDLSTVSLP

AT4G02360.1 Protein of unknown function, DUF5381.4e-1735.51Show/hide
Query:  YGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRSTDEDLVFEVGLITAKYPSKS
        Y  P G+LP  V  Y +N  +G+F +   D+C+FT+   +Y   +   ++G I+ G + NL G+ V+ LF W +I  +     DL F VG+ +A +P+ +
Subjt:  YGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRSTDEDLVFEVGLITAKYPSKS

Query:  FNESPVC
        F ESP C
Subjt:  FNESPVC

AT4G02370.1 Protein of unknown function, DUF5381.8e-1730.28Show/hide
Query:  ITVAQTRALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIR
        I +A    L SL+   + + +   P  +A++ L  Y FP G+LP  V  Y ++ T+G F     DSC F L   +Y  ++   ++G I++ ++  L G++
Subjt:  ITVAQTRALVSLSTQSIDSKDLKIPPSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIR

Query:  VRALFQWWSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVC
        V+ LF W +I  +    +++ F VG+ +A +  + F ESP C
Subjt:  VRALFQWWSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVC

AT5G19590.1 Protein of unknown function, DUF5381.8e-4163.71Show/hide
Query:  PSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRSTDEDLVFEVG
        P+ AHA L ++GFP GLLP SV  Y +N+TSGDFSL L  +CK TLPPDNY+A++S  VTG+I++G+I  L GIRVRA F+ WSITGIRS+ ++LVFEV 
Subjt:  PSSAHARLADYGFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRSTDEDLVFEVG

Query:  LITAKYPSKSFNESPVCEGRRSAS
         ITAKYPSK+F+ES  CEG+RS+S
Subjt:  LITAKYPSKSFNESPVCEGRRSAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAGCTGAAGAAAATGGCAAAGGTTTTTGATTGTTTCAGTCCTTCATCTTGTACTTCCACTTCTTGTTTTTGCATAAACTCCATGGAAGTTCAAGATGAAGAAGA
TGAGTTTTTTGACAAACAACCACTCATTCCCACCAACAACAAGAACAACCAATTGCTCACATTGAAGGATGTTGTCAATGGAAATCAGACCTTAGCCTTTCAACTCAAGC
CCAAGATGGTGACACTGAGGGTGTCCATGCACTGTAAAGGCTGTGCAAAAAAAGTGGAGAAACACATTTCAAAGATGGAAGGAGTGAGCTCGTACACAATAGACTTGGAG
ACTAAGATGGTGATTATTGTTGGAGACATTCTGCCGTTTGAAGTGGTAGAGAGCGTTTCTAAATGTCGTGATCTCTCAACAGTTTCGCTGCCGCCAGGGTTCGTTCGACG
CAGATCCGAACCACGCCGCCGCCGCCAGTTCCTTCGACGCAAATCCAACGAAGCTGTCACCGCCAGTTCCTTCTACGCAGATCTGCCGCGCTTCCTGCCGTCGTCCCCCG
TGATTACAGTCGCTCAGACGCGAGCTCTCGTTTCTCTTTCCACCCAATCCATTGACTCCAAAGACCTAAAAATTCCACCATCCTCTGCTCACGCTCGCCTCGCCGATTAC
GGATTTCCCTTCGGTCTTCTCCCCTCCTCCGTCGCGACCTACTCCATCAACGAGACCTCTGGTGACTTCTCTTTGGACCTTGGTGATTCCTGCAAGTTCACTCTTCCACC
AGACAACTATGTCGCTTCCTTTTCTAGGGTTGTCACCGGTAAGATTGCAAAGGGTCGGATCCACAATCTTGACGGGATTCGTGTTCGTGCTTTGTTCCAGTGGTGGTCGA
TTACTGGTATTAGGTCCACCGATGAGGATTTGGTTTTCGAGGTTGGGTTGATCACCGCTAAGTACCCGTCTAAGAGTTTCAACGAGAGCCCAGTGTGTGAAGGCCGTCGG
TCTGCTTCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAGCTGAAGAAAATGGCAAAGGTTTTTGATTGTTTCAGTCCTTCATCTTGTACTTCCACTTCTTGTTTTTGCATAAACTCCATGGAAGTTCAAGATGAAGAAGA
TGAGTTTTTTGACAAACAACCACTCATTCCCACCAACAACAAGAACAACCAATTGCTCACATTGAAGGATGTTGTCAATGGAAATCAGACCTTAGCCTTTCAACTCAAGC
CCAAGATGGTGACACTGAGGGTGTCCATGCACTGTAAAGGCTGTGCAAAAAAAGTGGAGAAACACATTTCAAAGATGGAAGGAGTGAGCTCGTACACAATAGACTTGGAG
ACTAAGATGGTGATTATTGTTGGAGACATTCTGCCGTTTGAAGTGGTAGAGAGCGTTTCTAAATGTCGTGATCTCTCAACAGTTTCGCTGCCGCCAGGGTTCGTTCGACG
CAGATCCGAACCACGCCGCCGCCGCCAGTTCCTTCGACGCAAATCCAACGAAGCTGTCACCGCCAGTTCCTTCTACGCAGATCTGCCGCGCTTCCTGCCGTCGTCCCCCG
TGATTACAGTCGCTCAGACGCGAGCTCTCGTTTCTCTTTCCACCCAATCCATTGACTCCAAAGACCTAAAAATTCCACCATCCTCTGCTCACGCTCGCCTCGCCGATTAC
GGATTTCCCTTCGGTCTTCTCCCCTCCTCCGTCGCGACCTACTCCATCAACGAGACCTCTGGTGACTTCTCTTTGGACCTTGGTGATTCCTGCAAGTTCACTCTTCCACC
AGACAACTATGTCGCTTCCTTTTCTAGGGTTGTCACCGGTAAGATTGCAAAGGGTCGGATCCACAATCTTGACGGGATTCGTGTTCGTGCTTTGTTCCAGTGGTGGTCGA
TTACTGGTATTAGGTCCACCGATGAGGATTTGGTTTTCGAGGTTGGGTTGATCACCGCTAAGTACCCGTCTAAGAGTTTCAACGAGAGCCCAGTGTGTGAAGGCCGTCGG
TCTGCTTCGTAA
Protein sequenceShow/hide protein sequence
MGKLKKMAKVFDCFSPSSCTSTSCFCINSMEVQDEEDEFFDKQPLIPTNNKNNQLLTLKDVVNGNQTLAFQLKPKMVTLRVSMHCKGCAKKVEKHISKMEGVSSYTIDLE
TKMVIIVGDILPFEVVESVSKCRDLSTVSLPPGFVRRRSEPRRRRQFLRRKSNEAVTASSFYADLPRFLPSSPVITVAQTRALVSLSTQSIDSKDLKIPPSSAHARLADY
GFPFGLLPSSVATYSINETSGDFSLDLGDSCKFTLPPDNYVASFSRVVTGKIAKGRIHNLDGIRVRALFQWWSITGIRSTDEDLVFEVGLITAKYPSKSFNESPVCEGRR
SAS