CuGenDBv2

Moc05g04110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc05g04110
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: HMA domain-containing protein
Genome location: chr5:2853144..2855410
RNA-Seq Expression: Moc05g04110
Synteny: Moc05g04110
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR006121 - Heavy metal-associated domain, HMA
                  IPR036163 - Heavy metal-associated domain superfamily
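
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the helper names parse_location and span_length are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length of the interval in bp.

    Assumption: coordinates are 1-based and inclusive, so both
    endpoints are counted.
    """
    return end - start + 1


chrom, start, end = parse_location("chr5:2853144..2855410")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2267 bp on chr5.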


Homology
GenBank top hits | e value | % identity
XP_008437990.1 PREDICTED: uncharacterized protein LOC103483245 [Cucumis melo] | 5.7e-51 | 79.25
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAE-EPKKEEE
        KVVLKLDLHDDKAKKKALKLVSTL GIDSIAMDMKE+KLTVIGAVDPVT+VSKLRKFWPA+I SVGPAVEPKKEEE K+ E KKEEEKK  E E KKEEE
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAE-EPKKEEE

Query:  KKE---------GEKKKE-INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
         K+         GEKK +  NPNDA LELV+AYRAYNPHLTTYYY  SMEENPN+CAIC
Subjt:  KKE---------GEKKKE-INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

XP_022147443.1 heavy metal-associated isoprenylated plant protein 39 [Momordica charantia] | 1.6e-69 | 100
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEEK
        KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEEK
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEEK

Query:  KEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
        KEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
Subjt:  KEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

XP_022935536.1 heavy metal-associated isoprenylated plant protein 39-like [Cucurbita moschata] | 3.2e-54 | 82.28
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEP----------KKEEEGKKEEPKKEEEKKA
        KVVLKLDL DDKAKKKALKLVSTL GIDSIAMDMKEKKLTVIGAVDPVT+VSKLRKFWPADI SVGPAVEP          KKEEEGKKEE KKEEEKK 
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEP----------KKEEEGKKEEPKKEEEKKA

Query:  AEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
          E KKEEEKKEGE+KK  NPNDA LELV+AYRAYNP+LTT+YYA S+EENPNACAIC
Subjt:  AEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

XP_022974611.1 heavy metal-associated isoprenylated plant protein 39-like [Cucurbita maxima] | 1.0e-52 | 74.71
Query:  IAPPEPPQPCVYATRHRPKVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKE----EEGK
        + PP  P+          KVVLKLDL DDKAKKKALKLVSTL GIDSIAMDMKEKKLTVIGAVDPVT+VSKLRKFWPAD+ SVGPAVEPKK+    EEGK
Subjt:  IAPPEPPQPCVYATRHRPKVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKE----EEGK

Query:  KEEPKKEEEKKA----AEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
        KEE KKEEEKK      EE K+ EEKKEGE+KK  NPNDA LELV+AYRAYNP+LTT+YYA S+EENPNACAIC
Subjt:  KEEPKKEEEKKA----AEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

XP_038900253.1 heavy metal-associated isoprenylated plant protein 39 [Benincasa hispida] | 2.7e-53 | 81.99
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEP------KKEEEKKAAEEP
        KVVLKLDLHDDKAKKKALKLVSTL GIDSIAMDMKE+KLTVIGAVDPVT+VSKLRKFWPADI SVGPAVEPKK EEGKKEE       KKEEEKK  EE 
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEP------KKEEEKKAAEEP

Query:  KKEEEKK----EGEKKKE---INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
        KKEEEKK    EGEKK +    NPNDA LELVKAYRAYNPHLTTYYY  SMEENPN+CAIC
Subjt:  KKEEEKK----EGEKKKE---INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

TrEMBL top hits | e value | % identity
A0A1S3AVX0 uncharacterized protein LOC103483245 | 2.8e-51 | 79.25
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAE-EPKKEEE
        KVVLKLDLHDDKAKKKALKLVSTL GIDSIAMDMKE+KLTVIGAVDPVT+VSKLRKFWPA+I SVGPAVEPKKEEE K+ E KKEEEKK  E E KKEEE
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAE-EPKKEEE

Query:  KKE---------GEKKKE-INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
         K+         GEKK +  NPNDA LELV+AYRAYNPHLTTYYY  SMEENPN+CAIC
Subjt:  KKE---------GEKKKE-INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

A0A5D3D2S2 HMA domain-containing protein | 2.8e-51 | 79.25
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAE-EPKKEEE
        KVVLKLDLHDDKAKKKALKLVSTL GIDSIAMDMKE+KLTVIGAVDPVT+VSKLRKFWPA+I SVGPAVEPKKEEE K+ E KKEEEKK  E E KKEEE
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAE-EPKKEEE

Query:  KKE---------GEKKKE-INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
         K+         GEKK +  NPNDA LELV+AYRAYNPHLTTYYY  SMEENPN+CAIC
Subjt:  KKE---------GEKKKE-INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

A0A6J1D1B9 heavy metal-associated isoprenylated plant protein 39 | 7.7e-70 | 100
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEEK
        KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEEK
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEEK

Query:  KEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
        KEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
Subjt:  KEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

A0A6J1F4Y1 heavy metal-associated isoprenylated plant protein 39-like | 1.6e-54 | 82.28
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEP----------KKEEEGKKEEPKKEEEKKA
        KVVLKLDL DDKAKKKALKLVSTL GIDSIAMDMKEKKLTVIGAVDPVT+VSKLRKFWPADI SVGPAVEP          KKEEEGKKEE KKEEEKK 
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEP----------KKEEEGKKEEPKKEEEKKA

Query:  AEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
          E KKEEEKKEGE+KK  NPNDA LELV+AYRAYNP+LTT+YYA S+EENPNACAIC
Subjt:  AEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

A0A6J1IBU7 heavy metal-associated isoprenylated plant protein 39-like | 5.0e-53 | 74.71
Query:  IAPPEPPQPCVYATRHRPKVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKE----EEGK
        + PP  P+          KVVLKLDL DDKAKKKALKLVSTL GIDSIAMDMKEKKLTVIGAVDPVT+VSKLRKFWPAD+ SVGPAVEPKK+    EEGK
Subjt:  IAPPEPPQPCVYATRHRPKVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKE----EEGK

Query:  KEEPKKEEEKKA----AEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
        KEE KKEEEKK      EE K+ EEKKEGE+KK  NPNDA LELV+AYRAYNP+LTT+YYA S+EENPNACAIC
Subjt:  KEEPKKEEEKKA----AEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

SwissProt top hits | e value | % identity
O03982 Heavy metal-associated isoprenylated plant protein 39 | 3.4e-46 | 67.43
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWP-ADIFSVGPAVEPKKE--EEGKKE----------EPKKEEE
        K+VLKLDLHDD+AK+KALK VSTLPGIDSIAMDMKEKKLTVIG VDPV VVSKLRK+WP  DI  VGPA EP+KE  EE KKE          E  KEE 
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWP-ADIFSVGPAVEPKKE--EEGKKE----------EPKKEEE

Query:  KKAAEEPKKEEEKKEGEKKKE--------------INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
        KK  E PKKEEEKKEG  KKE              + P D  LELVKAY+AYNPHLTTYYYA S+EENPNAC IC
Subjt:  KKAAEEPKKEEEKKEGEKKKE--------------INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

Q9LTE2 Heavy metal-associated isoprenylated plant protein 13 | 2.0e-06 | 41.41
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEE
        K VL+L +H+++ +KKA   VS  PG+ SI MD K  K+TV+G VD   +V KLRK    ++ SV     P+     KK EP+K    K A  P K  E
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEE

Q9LTE3 Heavy metal-associated isoprenylated plant protein 12 | 9.6e-09 | 44.83
Query:  VVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEK
        VVLKLD+H +K K+KA+  V  L G++S+  ++K+ KLTV G +D   +V KL+K    +  SVGP  EP+K+   K ++PKK E K
Subjt:  VVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEK

Q9SHQ8 Heavy metal-associated isoprenylated plant protein 4 | 1.4e-04 | 33.92
Query:  VLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRK--FWPADIFSVGPAVEPKKEEEGKK-----EEPKKEEEKKAAEEPK
        VLK+ +H  +  K     +     I  +  D K + LTV G ++   +++ ++K     A+I S     E KKEEE KK     ++ KKE+EKK  EE K
Subjt:  VLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRK--FWPADIFSVGPAVEPKKEEEGKK-----EEPKKEEEKKAAEEPK

Query:  KEEE--KKEGEKKKEINPNDAALELVKAYRAYN---------------PHLTTYYYAHSM--EENPNACAI
        KEEE  KKEGEKKKE    +   + +     Y                P+   Y YA  +  +ENPNAC I
Subjt:  KEEE--KKEGEKKKEINPNDAALELVKAYRAYN---------------PHLTTYYYAHSM--EENPNACAI

Arabidopsis top hits | e value | % identity
AT1G01490.1 Heavy metal transport/detoxification superfamily protein | 2.4e-47 | 67.43
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWP-ADIFSVGPAVEPKKE--EEGKKE----------EPKKEEE
        K+VLKLDLHDD+AK+KALK VSTLPGIDSIAMDMKEKKLTVIG VDPV VVSKLRK+WP  DI  VGPA EP+KE  EE KKE          E  KEE 
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWP-ADIFSVGPAVEPKKE--EEGKKE----------EPKKEEE

Query:  KKAAEEPKKEEEKKEGEKKKE--------------INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
        KK  E PKKEEEKKEG  KKE              + P D  LELVKAY+AYNPHLTTYYYA S+EENPNAC IC
Subjt:  KKAAEEPKKEEEKKEGEKKKE--------------INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

AT1G01490.2 Heavy metal transport/detoxification superfamily protein | 2.4e-47 | 67.43
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWP-ADIFSVGPAVEPKKE--EEGKKE----------EPKKEEE
        K+VLKLDLHDD+AK+KALK VSTLPGIDSIAMDMKEKKLTVIG VDPV VVSKLRK+WP  DI  VGPA EP+KE  EE KKE          E  KEE 
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWP-ADIFSVGPAVEPKKE--EEGKKE----------EPKKEEE

Query:  KKAAEEPKKEEEKKEGEKKKE--------------INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
        KK  E PKKEEEKKEG  KKE              + P D  LELVKAY+AYNPHLTTYYYA S+EENPNAC IC
Subjt:  KKAAEEPKKEEEKKEGEKKKE--------------INPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC

AT5G23760.1 Copper transport protein family | 6.4e-16 | 55.88
Query:  KVVLK-LDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEE
        KVVLK L + DDK K+KA++  + + G+DSIA DMK++KLTVIG +D V VV KL+K    D+ SVGPA E KKEE  KKEE K+E++++  EE K+EE 
Subjt:  KVVLK-LDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEE

Query:  KK
        KK
Subjt:  KK

AT5G52740.1 Copper transport protein family | 6.8e-10 | 44.83
Query:  VVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEK
        VVLKLD+H +K K+KA+  V  L G++S+  ++K+ KLTV G +D   +V KL+K    +  SVGP  EP+K+   K ++PKK E K
Subjt:  VVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEK

AT5G52750.1 Heavy metal transport/detoxification superfamily protein | 1.4e-07 | 41.41
Query:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEE
        K VL+L +H+++ +KKA   VS  PG+ SI MD K  K+TV+G VD   +V KLRK    ++ SV     P+     KK EP+K    K A  P K  E
Subjt:  KVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTVIGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEE
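
The % identity values in the tables above can in principle be re-derived from the alignment blocks. The sketch below assumes the usual BLAST-style midline convention, in which a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap; the function name percent_identity is hypothetical. Because the page shows each alignment in wrapped blocks, the Query and midline strings would need to be concatenated across blocks before calling it.

```python
def percent_identity(query, midline):
    """Percent identity over the aligned columns.

    Assumptions: `query` is the aligned query string (gaps included) and
    `midline` is the line printed beneath it, where a letter means the two
    residues are identical. The midline may have lost trailing spaces, so
    the alignment length is taken from `query`.
    """
    if not query:
        raise ValueError("empty alignment")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query), 2)


# Toy example: four of five columns are identical.
print(percent_identity("KVVLK", "K+VLK"))
```

Figures computed this way may differ slightly from those reported if the page truncates or trims the displayed alignment.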


Sequences
CDS sequence
ATGGAAACGACTGCCACGTGGCAGATGAAAATTATGTGTTTGGCGATAGCCGTAGAACCGATTGCAGCGGATTCTTTATTCCTTTTGGACGGTCTGATTGTACTCGTTGC
CGGTGCGCCGTGGGGTCCGCATTTCCCCGTTGGCCAAAAATTGATTGCTCCACCCGAGCCGCCTCAGCCGTGCGTATATGCCACGCGCCACCGGCCCAAGGTTGTTCTGA
AGCTGGATTTGCACGATGATAAAGCCAAGAAGAAGGCCCTGAAGTTGGTCTCCACTCTCCCAGGAATCGACTCCATCGCGATGGACATGAAGGAGAAGAAGCTGACGGTG
ATCGGAGCCGTGGATCCGGTGACCGTGGTGAGCAAACTGCGAAAGTTCTGGCCGGCGGACATATTCTCCGTCGGGCCAGCGGTGGAGCCGAAGAAGGAAGAGGAAGGGAA
AAAGGAGGAGCCGAAGAAGGAGGAGGAGAAGAAGGCGGCGGAGGAGCCGAAGAAGGAAGAGGAAAAGAAAGAAGGAGAGAAGAAGAAAGAGATCAATCCAAACGACGCCG
CTTTGGAGCTGGTGAAGGCTTACAGAGCCTACAACCCCCATCTCACCACCTACTATTACGCCCATAGCATGGAGGAAAATCCAAATGCCTGCGCCATTTGCTAG
mRNA sequence
ATGGAAACGACTGCCACGTGGCAGATGAAAATTATGTGTTTGGCGATAGCCGTAGAACCGATTGCAGCGGATTCTTTATTCCTTTTGGACGGTCTGATTGTACTCGTTGC
CGGTGCGCCGTGGGGTCCGCATTTCCCCGTTGGCCAAAAATTGATTGCTCCACCCGAGCCGCCTCAGCCGTGCGTATATGCCACGCGCCACCGGCCCAAGGTTGTTCTGA
AGCTGGATTTGCACGATGATAAAGCCAAGAAGAAGGCCCTGAAGTTGGTCTCCACTCTCCCAGGAATCGACTCCATCGCGATGGACATGAAGGAGAAGAAGCTGACGGTG
ATCGGAGCCGTGGATCCGGTGACCGTGGTGAGCAAACTGCGAAAGTTCTGGCCGGCGGACATATTCTCCGTCGGGCCAGCGGTGGAGCCGAAGAAGGAAGAGGAAGGGAA
AAAGGAGGAGCCGAAGAAGGAGGAGGAGAAGAAGGCGGCGGAGGAGCCGAAGAAGGAAGAGGAAAAGAAAGAAGGAGAGAAGAAGAAAGAGATCAATCCAAACGACGCCG
CTTTGGAGCTGGTGAAGGCTTACAGAGCCTACAACCCCCATCTCACCACCTACTATTACGCCCATAGCATGGAGGAAAATCCAAATGCCTGCGCCATTTGCTAG
Protein sequence
METTATWQMKIMCLAIAVEPIAADSLFLLDGLIVLVAGAPWGPHFPVGQKLIAPPEPPQPCVYATRHRPKVVLKLDLHDDKAKKKALKLVSTLPGIDSIAMDMKEKKLTV
IGAVDPVTVVSKLRKFWPADIFSVGPAVEPKKEEEGKKEEPKKEEEKKAAEEPKKEEEKKEGEKKKEINPNDAALELVKAYRAYNPHLTTYYYAHSMEENPNACAIC
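
The relationship between the CDS and protein sequences above can be checked by translating the CDS with the standard genetic code. This is a minimal sketch using only the Python standard library; the translate helper is hypothetical, it assumes the sequence is in frame from its first base, and it stops at the first stop codon. To use it on the record above, join the CDS lines into one string first.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First four and last four codons of the CDS shown above.
print(translate("ATGGAAACGACT"))
print(translate("GCCATTTGCTAG"))
```

The two calls print METT and AIC, matching the start and end of the protein sequence listed above.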