; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008255 (gene) of Snake gourd v1 genome

Gene IDTan0008255
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHeavy metal-associated domain containing protein
Genome locationLG10:2204079..2205381
RNA-Seq ExpressionTan0008255
SyntenyTan0008255
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR006121 - Heavy metal-associated domain, HMA
IPR036163 - Heavy metal-associated domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004133846.1 heavy metal-associated isoprenylated plant protein 39 [Cucumis sativus]4.1e-5379.14Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE
        ++KVVLKLDLHDDKAKKKALKLVS L GIDSIAMDMKE+KLTVIGAVDPVTIVSKLRKFWPA+I+SVGPAVEPKKEE KKE EGKKEEE KKEEG   E 
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE

Query:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        KKEE  K+++++        N  PNPND VLELV+AYRAYNPHLTTYYYVQSMEENPN+CAIC
Subjt:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

XP_008437990.1 PREDICTED: uncharacterized protein LOC103483245 [Cucumis melo]2.9e-5480.37Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE
        ++KVVLKLDLHDDKAKKKALKLVS L GIDSIAMDMKE+KLTVIGAVDPVTIVSKLRKFWPA+I+SVGPAVEPKKEE KKE EGKKEEE KKEEG  E +
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE

Query:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        K+EE KK++++         N NPNPNDAVLELV+AYRAYNPHLTTYYYVQSMEENPN+CAIC
Subjt:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

XP_022935536.1 heavy metal-associated isoprenylated plant protein 39-like [Cucurbita moschata]3.3e-5884.57Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEK
        ++KVVLKLDL DDKAKKKALKLVS L GIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADI+SVGPAVEPKK+E KKEEGKKEEEGKKEEGKKEEEK
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEK

Query:  KEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        K E KKEEEKK          NPNPNDAVLELV+AYRAYNP+LTT+YY QS+EENPNACAIC
Subjt:  KEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

XP_022974611.1 heavy metal-associated isoprenylated plant protein 39-like [Cucurbita maxima]5.4e-5362.4Show/hide
Query:  NSFTKFTLFSLSSFPSLFSLSSSSSSSSLQLIFVLSSAISAQSPRTITHSITSTTNEGSFPNPQFSPLSLLSLDISSVSLLQKVVLKLDLHDDKAKKKAL
        NSFTKFTL+SLSS P L                                             P F P +   L  SS   ++KVVLKLDL DDKAKKKAL
Subjt:  NSFTKFTLFSLSSFPSLFSLSSSSSSSSLQLIFVLSSAISAQSPRTITHSITSTTNEGSFPNPQFSPLSLLSLDISSVSLLQKVVLKLDLHDDKAKKKAL

Query:  KLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKEEGKKEEEKKNPNPNPNPN
        KLVS L GIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPAD++SVGPAVEPKK+E KKEEGKK EEGKKEE KK E KKEE KKE E+K         
Subjt:  KLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKEEGKKEEEKKNPNPNPNPN

Query:  PNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
         NPNPNDAVLELV+AYRAYNP+LTT+YY QS+EENPNACAIC
Subjt:  PNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

XP_038900253.1 heavy metal-associated isoprenylated plant protein 39 [Benincasa hispida]1.9e-5884.24Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEK
        ++KVVLKLDLHDDKAKKKALKLVS L GIDSIAMDMKE+KLTVIGAVDPVTIVSKLRKFWPADI+SVGPAVEPKKEE KKEE KKE EGKKEE KK   K
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEK

Query:  KEEGKKEEEKK---NPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        +EEGKKEEEKK         N N NPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPN+CAIC
Subjt:  KEEGKKEEEKK---NPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

TrEMBL top hitse value%identityAlignment
A0A0A0L641 HMA domain-containing protein2.0e-5379.14Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE
        ++KVVLKLDLHDDKAKKKALKLVS L GIDSIAMDMKE+KLTVIGAVDPVTIVSKLRKFWPA+I+SVGPAVEPKKEE KKE EGKKEEE KKEEG   E 
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE

Query:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        KKEE  K+++++        N  PNPND VLELV+AYRAYNPHLTTYYYVQSMEENPN+CAIC
Subjt:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

A0A1S3AVX0 uncharacterized protein LOC1034832451.4e-5480.37Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE
        ++KVVLKLDLHDDKAKKKALKLVS L GIDSIAMDMKE+KLTVIGAVDPVTIVSKLRKFWPA+I+SVGPAVEPKKEE KKE EGKKEEE KKEEG  E +
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE

Query:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        K+EE KK++++         N NPNPNDAVLELV+AYRAYNPHLTTYYYVQSMEENPN+CAIC
Subjt:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

A0A5D3D2S2 HMA domain-containing protein1.4e-5480.37Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE
        ++KVVLKLDLHDDKAKKKALKLVS L GIDSIAMDMKE+KLTVIGAVDPVTIVSKLRKFWPA+I+SVGPAVEPKKEE KKE EGKKEEE KKEEG  E +
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKE-EGKKEEEGKKEEGKKEEE

Query:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        K+EE KK++++         N NPNPNDAVLELV+AYRAYNPHLTTYYYVQSMEENPN+CAIC
Subjt:  KKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

A0A6J1F4Y1 heavy metal-associated isoprenylated plant protein 39-like1.6e-5884.57Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEK
        ++KVVLKLDL DDKAKKKALKLVS L GIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADI+SVGPAVEPKK+E KKEEGKKEEEGKKEEGKKEEEK
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEK

Query:  KEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        K E KKEEEKK          NPNPNDAVLELV+AYRAYNP+LTT+YY QS+EENPNACAIC
Subjt:  KEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

A0A6J1IBU7 heavy metal-associated isoprenylated plant protein 39-like2.6e-5362.4Show/hide
Query:  NSFTKFTLFSLSSFPSLFSLSSSSSSSSLQLIFVLSSAISAQSPRTITHSITSTTNEGSFPNPQFSPLSLLSLDISSVSLLQKVVLKLDLHDDKAKKKAL
        NSFTKFTL+SLSS P L                                             P F P +   L  SS   ++KVVLKLDL DDKAKKKAL
Subjt:  NSFTKFTLFSLSSFPSLFSLSSSSSSSSLQLIFVLSSAISAQSPRTITHSITSTTNEGSFPNPQFSPLSLLSLDISSVSLLQKVVLKLDLHDDKAKKKAL

Query:  KLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKEEGKKEEEKKNPNPNPNPN
        KLVS L GIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPAD++SVGPAVEPKK+E KKEEGKK EEGKKEE KK E KKEE KKE E+K         
Subjt:  KLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKEEGKKEEEKKNPNPNPNPN

Query:  PNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
         NPNPNDAVLELV+AYRAYNP+LTT+YY QS+EENPNACAIC
Subjt:  PNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

SwissProt top hitse value%identityAlignment
O03982 Heavy metal-associated isoprenylated plant protein 393.5e-4768.93Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWP-ADIVSVGPAVEP---KKEEAKKEEG----KKEEEGKKE
        ++K+VLKLDLHDD+AK+KALK VS LPGIDSIAMDMKEKKLTVIG VDPV +VSKLRK+WP  DIV VGPA EP   KKEE KKE G    KKE E  KE
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWP-ADIVSVGPAVEP---KKEEAKKEEG----KKEEEGKKE

Query:  EG-------KKEEEKKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        EG       KKEEEKKE G K+E +K   P   P P   P D VLELVKAY+AYNPHLTTYYY QS+EENPNAC IC
Subjt:  EG-------KKEEEKKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

Q9LTE2 Heavy metal-associated isoprenylated plant protein 132.2e-0434.9Show/hide
Query:  KVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKE
        K VL+L +H+++ +KKA   VS  PG+ SI MD K  K+TV+G VD   IV KLRK    ++VSV                             E  K  
Subjt:  KVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKE

Query:  EGKKEEEKKNPNPNPNPNPNPNPNDAV---LELVKAYRAYNPHLTTYYY
        E K E EK  P P P P P   P + V   +++   Y+ YNP     YY
Subjt:  EGKKEEEKKNPNPNPNPNPNPNPNDAV---LELVKAYRAYNPHLTTYYY

Q9LTE3 Heavy metal-associated isoprenylated plant protein 129.4e-0843.16Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGK
        +Q VVLKLD+H +K K+KA+  V  L G++S+  ++K+ KLTV G +D   IV KL+K    + +SVGP  EP+K        KK ++ KK E K
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGK

Arabidopsis top hitse value%identityAlignment
AT1G01490.1 Heavy metal transport/detoxification superfamily protein2.5e-4868.93Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWP-ADIVSVGPAVEP---KKEEAKKEEG----KKEEEGKKE
        ++K+VLKLDLHDD+AK+KALK VS LPGIDSIAMDMKEKKLTVIG VDPV +VSKLRK+WP  DIV VGPA EP   KKEE KKE G    KKE E  KE
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWP-ADIVSVGPAVEP---KKEEAKKEEG----KKEEEGKKE

Query:  EG-------KKEEEKKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        EG       KKEEEKKE G K+E +K   P   P P   P D VLELVKAY+AYNPHLTTYYY QS+EENPNAC IC
Subjt:  EG-------KKEEEKKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

AT1G01490.2 Heavy metal transport/detoxification superfamily protein2.5e-4868.93Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWP-ADIVSVGPAVEP---KKEEAKKEEG----KKEEEGKKE
        ++K+VLKLDLHDD+AK+KALK VS LPGIDSIAMDMKEKKLTVIG VDPV +VSKLRK+WP  DIV VGPA EP   KKEE KKE G    KKE E  KE
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWP-ADIVSVGPAVEP---KKEEAKKEEG----KKEEEGKKE

Query:  EG-------KKEEEKKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC
        EG       KKEEEKKE G K+E +K   P   P P   P D VLELVKAY+AYNPHLTTYYY QS+EENPNAC IC
Subjt:  EG-------KKEEEKKEEGKKEEEKKNPNPNPNPNPNPNPNDAVLELVKAYRAYNPHLTTYYYVQSMEENPNACAIC

AT5G23760.1 Copper transport protein family1.9e-1654.37Show/hide
Query:  LLQKVVLK-LDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEE
        + QKVVLK L + DDK K+KA++  + + G+DSIA DMK++KLTVIG +D V +V KL+K    D++SVGPA E KKEE K+E+ ++++E KKEE K+EE
Subjt:  LLQKVVLK-LDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEE

Query:  EKK
         KK
Subjt:  EKK

AT5G52740.1 Copper transport protein family6.7e-0943.16Show/hide
Query:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGK
        +Q VVLKLD+H +K K+KA+  V  L G++S+  ++K+ KLTV G +D   IV KL+K    + +SVGP  EP+K        KK ++ KK E K
Subjt:  LQKVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGK

AT5G52750.1 Heavy metal transport/detoxification superfamily protein1.5e-0534.9Show/hide
Query:  KVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKE
        K VL+L +H+++ +KKA   VS  PG+ SI MD K  K+TV+G VD   IV KLRK    ++VSV                             E  K  
Subjt:  KVVLKLDLHDDKAKKKALKLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKE

Query:  EGKKEEEKKNPNPNPNPNPNPNPNDAV---LELVKAYRAYNPHLTTYYY
        E K E EK  P P P P P   P + V   +++   Y+ YNP     YY
Subjt:  EGKKEEEKKNPNPNPNPNPNPNPNDAV---LELVKAYRAYNPHLTTYYY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTATATAGCCGTACCTGATCCCTCCAATTCCTTCACTAAATTTACTCTCTTTTCTCTCTCCTCTTTCCCTTCACTTTTTTCTCTCTCTTCTTCTTCTTCTTCTTC
CTCTCTGCAACTTATTTTTGTTCTTTCTTCCGCCATTTCTGCCCAAAGCCCTAGAACAATAACACACAGCATTACCAGTACCACCAATGAAGGTTCGTTTCCCAATCCTC
AATTTAGTCCTCTTTCGCTTCTTTCTCTTGATATTTCCTCTGTTTCTCTCTTGCAGAAGGTCGTTCTGAAACTGGATTTGCACGACGACAAAGCCAAGAAGAAGGCCTTG
AAATTGGTCTCCGCTCTCCCAGGAATCGATTCCATAGCGATGGATATGAAGGAAAAGAAGCTGACGGTGATCGGAGCCGTGGATCCGGTGACGATCGTGAGCAAACTGAG
GAAATTCTGGCCGGCGGACATAGTTTCGGTCGGACCGGCGGTGGAGCCGAAGAAGGAGGAGGCGAAAAAGGAAGAGGGGAAGAAGGAGGAAGAAGGGAAAAAGGAAGAGG
GGAAGAAGGAAGAAGAGAAGAAAGAAGAGGGGAAGAAGGAAGAAGAGAAGAAGAATCCGAATCCGAACCCGAACCCGAATCCAAACCCGAATCCAAACGACGCCGTTTTG
GAACTGGTGAAGGCTTACAGAGCTTACAATCCTCATCTTACAACTTACTATTACGTTCAGAGCATGGAAGAGAATCCAAATGCGTGTGCCATTTGCTAA
mRNA sequenceShow/hide mRNA sequence
AAAACCGAATGGATGGACTATATAGCCGTACCTGATCCCTCCAATTCCTTCACTAAATTTACTCTCTTTTCTCTCTCCTCTTTCCCTTCACTTTTTTCTCTCTCTTCTTC
TTCTTCTTCTTCCTCTCTGCAACTTATTTTTGTTCTTTCTTCCGCCATTTCTGCCCAAAGCCCTAGAACAATAACACACAGCATTACCAGTACCACCAATGAAGGTTCGT
TTCCCAATCCTCAATTTAGTCCTCTTTCGCTTCTTTCTCTTGATATTTCCTCTGTTTCTCTCTTGCAGAAGGTCGTTCTGAAACTGGATTTGCACGACGACAAAGCCAAG
AAGAAGGCCTTGAAATTGGTCTCCGCTCTCCCAGGAATCGATTCCATAGCGATGGATATGAAGGAAAAGAAGCTGACGGTGATCGGAGCCGTGGATCCGGTGACGATCGT
GAGCAAACTGAGGAAATTCTGGCCGGCGGACATAGTTTCGGTCGGACCGGCGGTGGAGCCGAAGAAGGAGGAGGCGAAAAAGGAAGAGGGGAAGAAGGAGGAAGAAGGGA
AAAAGGAAGAGGGGAAGAAGGAAGAAGAGAAGAAAGAAGAGGGGAAGAAGGAAGAAGAGAAGAAGAATCCGAATCCGAACCCGAACCCGAATCCAAACCCGAATCCAAAC
GACGCCGTTTTGGAACTGGTGAAGGCTTACAGAGCTTACAATCCTCATCTTACAACTTACTATTACGTTCAGAGCATGGAAGAGAATCCAAATGCGTGTGCCATTTGCTA
ATGAAATGAAACTATGATCTCCATTTTCAGAAGCTTGGGGGACTGTAAAAATGGCGGCTACCAGTTCGAATTTACGATCTGGGTTTGGATTAGAAGGTGTAAAAAAAGAA
GAAGAACAGATGTAATATAGGGATTATGAAGGTAGTGGGCTTAGTGTTTTTTCTTTTTTTTTTGGTTCTTTGTGTTAACTGTTAAGTAATTAATTATGTGTTTATTGTGT
TTGTGTTGAAAAATGGAAAGTGATGGATTAAATTTTGATTGGATTTTGTGGGTGTGGTTGATTGGGAGAACAGAGGAAAATTCTTGGCCATTAGAGGAATTCTTGGTCGG
TGATAGCCATTAATTATAGTGTTTTTTTTTTCTTCAATCTCTCTCTTTCTATTTTTTTTTTTTTTTAATTAAACAACCCTGAAATTAAAGATTTGATGTATCGTCTTAT
Protein sequenceShow/hide protein sequence
MDYIAVPDPSNSFTKFTLFSLSSFPSLFSLSSSSSSSSLQLIFVLSSAISAQSPRTITHSITSTTNEGSFPNPQFSPLSLLSLDISSVSLLQKVVLKLDLHDDKAKKKAL
KLVSALPGIDSIAMDMKEKKLTVIGAVDPVTIVSKLRKFWPADIVSVGPAVEPKKEEAKKEEGKKEEEGKKEEGKKEEEKKEEGKKEEEKKNPNPNPNPNPNPNPNDAVL
ELVKAYRAYNPHLTTYYYVQSMEENPNACAIC