CuGenDBv2

Lag0002436 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0002436
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Heavy metal transport/detoxification superfamily protein
Genome location: chr4:42875672..42876449
RNA-Seq Expression: Lag0002436
Synteny: Lag0002436
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR006121 - Heavy metal-associated domain, HMA
                  IPR036163 - Heavy metal-associated domain superfamily
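
As a small illustration, the genome location string in this record can be parsed to get the span of the gene on the chromosome. This is a minimal sketch: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumes 1-based, inclusive coordinates (an assumption; the source
    record does not say which convention it uses).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


# Location string copied from the record.
print(parse_location("chr4:42875672..42876449"))
```

Under the inclusive-coordinate assumption, the last element of the returned tuple is the length of the gene region in base pairs.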


Homology
GenBank top hits | e-value | % identity
KAA0057597.1 heavy metal-associated isoprenylated plant protein 42 [Cucumis melo var. makuwa] | 5.6e-62 | 63.98
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDD-RHF
        M+INCC+KCPLKLERKLLKS GVESV I+  KGLVTV GDIDP+VLLQK++ MGKE KLWFF++EPD +D    CS  ASKIE    ES S +EDD + F
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDD-RHF

Query:  DWHSQHKCETDTMGGKE------HGWGF-QSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPS-RA
         WHSQHKC T+TM  KE      H   + QSLP MSSNVH+  Y  SL  A SSN HAYSYSRSLPR GY+   PYQQSVPGH  TPH Y+LQPHP    
Subjt:  DWHSQHKCETDTMGGKE------HGWGF-QSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPS-RA

Query:  YRYFRSQSPPR
        Y YF+++SPPR
Subjt:  YRYFRSQSPPR

KAG6575888.1 Heavy metal-associated isoprenylated plant protein 42, partial [Cucurbita argyrosperma subsp. sororia] | 2.3e-68 | 68.47
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD
        M+INCC+KCPLKLERKLLKSNGVESVTID DKGLVTV+GDIDPV LLQK+K MGKEAKLWFF+QE +C++K    S   S I+ GG  S S  E++  FD
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD

Query:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP
        WH          G KEH  GFQSLP MSS+V+  SYP SL  AMSSNVHAYSY +SL RLGYRP  PYQQSVP HRLTPHGYYLQPHP  AYR+F+ +SP
Subjt:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP

Query:  PRA
        PRA
Subjt:  PRA

XP_022992551.1 heavy metal-associated isoprenylated plant protein 42-like [Cucurbita maxima] | 1.1e-70 | 69.95
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD
        M+INCC+KCPLKLERKLLKSNGV+SVTID DKGLVT++GDIDPVVLLQK+K MGKEAKLWFF+QE DCN+K  E     S IE GGT S S  E++ HFD
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD

Query:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP
        WH          G KEH  GFQSLP MSS+V+  SYP SL  AMSSNVHAYSY +SLPRLG RP  PY+QSVP HRLTPHGYYLQPHP  AYR+F+  SP
Subjt:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP

Query:  PRA
        PRA
Subjt:  PRA

XP_023548357.1 heavy metal-associated isoprenylated plant protein 42-like isoform X1 [Cucurbita pepo subsp. pepo] | 5.7e-67 | 67.82
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD
        M+INCC+KCPLKLERKLLKSNGVESVTI+ D+GLVTV+GDIDPV LLQK+K MGKEAKLWFF+QE +C++K    S   S IE  G  S S  E++  FD
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD

Query:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP
        WH          G KEH  GFQSLP MSS+V+  SYP SL  A+SSNVHAYSY RSLPRLGY P  PYQQSVPGHRLTPHGYYLQPHP  AYR+F+ +SP
Subjt:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP

Query:  PR
        PR
Subjt:  PR

XP_023548358.1 heavy metal-associated isoprenylated plant protein 42-like isoform X2 [Cucurbita pepo subsp. pepo] | 5.7e-67 | 67.82
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD
        M+INCC+KCPLKLERKLLKSNGVESVTI+ D+GLVTV+GDIDPV LLQK+K MGKEAKLWFF+QE +C++K    S   S IE  G  S S  E++  FD
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD

Query:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP
        WH          G KEH  GFQSLP MSS+V+  SYP SL  A+SSNVHAYSY RSLPRLGY P  PYQQSVPGHRLTPHGYYLQPHP  AYR+F+ +SP
Subjt:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP

Query:  PR
        PR
Subjt:  PR

TrEMBL top hits | e-value | % identity
A0A0A0K7J5 HMA domain-containing protein | 3.9e-61 | 64.15
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDD-RHF
        MNINCC+KCPLKLERKLLKSNGVESV I+  KGLVTV GDIDP+VLLQK++ MGKEAKLWFF++E D +D    CS  A KIE    ES SNNEDD + F
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDD-RHF

Query:  DWHSQHKCETDTMGGKE------HGWGF-QSL-PAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPH-PSR
        DWHSQHKC T+T+  KE      H   + QSL P + SNVH+  +  SL  AMSSNVH YSYSRSLP+ GY+   PYQQSVPGH  TPH  YLQPH P  
Subjt:  DWHSQHKCETDTMGGKE------HGWGF-QSL-PAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPH-PSR

Query:  AYRYFRSQSPPR
        AY YF+ +SPPR
Subjt:  AYRYFRSQSPPR

A0A1S4DZF3 uncharacterized protein LOC103492695 | 9.3e-39 | 60.38
Query:  MGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDD-RHFDWHSQHKCETDTMGGKE------HGWGF-QSLPAMSSNVHSRSYPPSLPSAM
        MGKE KLWFF++EPD +D    CS  ASKIE    ES S +EDD + F WHSQHKC T+TM  KE      H   + QSLP MSSNVH+  Y  SL  A 
Subjt:  MGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDD-RHFDWHSQHKCETDTMGGKE------HGWGF-QSLPAMSSNVHSRSYPPSLPSAM

Query:  SSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPS-RAYRYFRSQSPPR
        SSNVHAYSYSRSLPR GY+   PYQQSVPGH  TPH Y+LQPHP    Y YF+++SPPR
Subjt:  SSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPS-RAYRYFRSQSPPR

A0A5D3DBI1 Heavy metal-associated isoprenylated plant protein 42 | 2.7e-62 | 63.98
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDD-RHF
        M+INCC+KCPLKLERKLLKS GVESV I+  KGLVTV GDIDP+VLLQK++ MGKE KLWFF++EPD +D    CS  ASKIE    ES S +EDD + F
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDD-RHF

Query:  DWHSQHKCETDTMGGKE------HGWGF-QSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPS-RA
         WHSQHKC T+TM  KE      H   + QSLP MSSNVH+  Y  SL  A SSN HAYSYSRSLPR GY+   PYQQSVPGH  TPH Y+LQPHP    
Subjt:  DWHSQHKCETDTMGGKE------HGWGF-QSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPS-RA

Query:  YRYFRSQSPPR
        Y YF+++SPPR
Subjt:  YRYFRSQSPPR

A0A6J1D7U6 heavy metal-associated isoprenylated plant protein 42-like | 4.3e-52 | 57.69
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNED-DRHF
        +NI CC +CP K+ R+LL+ +GV+SV ID DKGLVTVSGDIDPV+LLQK+K+MGKEAKLWFF+Q P       +CS   SK      ES  NNED  + F
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNED-DRHF

Query:  DWHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRL-TPHGYYLQ---PHPSRAYRYF
        DWH QHKC  +TM  KE GWG +SLP                 AM SNVH YSYS+S+PRLGYRPT PYQQ VP HRL  PHGYY Q   P P  AY YF
Subjt:  DWHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRL-TPHGYYLQ---PHPSRAYRYF

Query:  RSQSPPRA
        + QSPP+A
Subjt:  RSQSPPRA

A0A6J1JTW0 heavy metal-associated isoprenylated plant protein 42-like | 5.4e-71 | 69.95
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD
        M+INCC+KCPLKLERKLLKSNGV+SVTID DKGLVT++GDIDPVVLLQK+K MGKEAKLWFF+QE DCN+K  E     S IE GGT S S  E++ HFD
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFD

Query:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP
        WH          G KEH  GFQSLP MSS+V+  SYP SL  AMSSNVHAYSY +SLPRLG RP  PY+QSVP HRLTPHGYYLQPHP  AYR+F+  SP
Subjt:  WHSQHKCETDTMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSP

Query:  PRA
        PRA
Subjt:  PRA

SwissProt top hits | e-value | % identity
F4JZL7 Heavy metal-associated isoprenylated plant protein 33 | 1.3e-05 | 31.3
Query:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFDWHSQH
        C+ C  K+++ L K  GV +  ID++ G VTVSG++DP VL++K+ + GK A++W                        G  + GSNN  ++  +  +Q 
Subjt:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFDWHSQH

Query:  KCETDTMGGKEHGWG
        K      GGK  G G
Subjt:  KCETDTMGGKEHGWG

Q0WV37 Heavy metal-associated isoprenylated plant protein 34 | 3.4e-06 | 44.44
Query:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKL
        CE C  K++++L K  GV SV  D ++G VTV+G+IDP +L++K+ + GK A++
Subjt:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKL

Q84J88 Heavy metal-associated isoprenylated plant protein 36 | 7.7e-06 | 32.98
Query:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQE----PDCN--DKPVECSDGASKIECGGTESGSNN
        CE C  K+++ L K +GV +  ID  +  VTV G+++P +L++K+ + G+ A+LW    E     DCN   KP + ++  S  E     + +NN
Subjt:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQE----PDCN--DKPVECSDGASKIECGGTESGSNN

Q9CAV5 Heavy metal-associated isoprenylated plant protein 42 | 4.8e-08 | 37.88
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEP
        MN+ CCE  P ++++ L +  GV ++TID  KGL+ V G  +P VL++ V ++G+  +L+ +E++P
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEP

Q9M8K5 Heavy metal-associated isoprenylated plant protein 32 | 4.1e-07 | 35
Query:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKI--------ECGGTESGSNNEDDR
        C+ C  K+++ L K  GV +  IDS++G VTVSG +DP VL++K+ + GK A++W     P  N+ P + S  A++         + GG   G+NN + +
Subjt:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKI--------ECGGTESGSNNEDDR

Arabidopsis top hits | e-value | % identity
AT3G04900.1 Heavy metal transport/detoxification superfamily protein | 3.4e-09 | 37.88
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEP
        MN+ CCE  P ++++ L +  GV ++TID  KGL+ V G  +P VL++ V ++G+  +L+ +E++P
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEP

AT3G06130.1 Heavy metal transport/detoxification superfamily protein | 2.9e-08 | 35
Query:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKI--------ECGGTESGSNNEDDR
        C+ C  K+++ L K  GV +  IDS++G VTVSG +DP VL++K+ + GK A++W     P  N+ P + S  A++         + GG   G+NN + +
Subjt:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKI--------ECGGTESGSNNEDDR

AT3G06130.2 Heavy metal transport/detoxification superfamily protein | 2.9e-08 | 35
Query:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKI--------ECGGTESGSNNEDDR
        C+ C  K+++ L K  GV +  IDS++G VTVSG +DP VL++K+ + GK A++W     P  N+ P + S  A++         + GG   G+NN + +
Subjt:  CEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKI--------ECGGTESGSNNEDDR

AT4G23882.1 Heavy metal transport/detoxification superfamily protein | 2.0e-09 | 43.55
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFF
        + I CC+ C  K +RKLL  +GV +V  ++++GL+TV+GD +P  LL K+ + GK+A+L  F
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFF

AT5G37860.1 Heavy metal transport/detoxification superfamily protein | 3.8e-08 | 44.26
Query:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWF
        +NIN C+ C +K+++ L K  GV SV ID+D+  V V G++DP +L++K+ + GK A+L F
Subjt:  MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWF


Sequences
CDS sequence
ATGAACATCAACTGCTGTGAAAAGTGCCCTCTAAAACTAGAGAGGAAGCTCCTCAAATCAAATGGAGTGGAATCTGTCACCATAGACTCAGATAAAGGGCTGGTAACAGT
ATCAGGTGACATTGACCCTGTTGTGCTCTTGCAAAAAGTAAAAAGAATGGGCAAAGAAGCCAAGCTCTGGTTCTTTGAACAAGAACCCGACTGCAACGACAAACCCGTTG
AGTGTTCGGATGGTGCTTCGAAGATCGAATGTGGCGGTACTGAATCTGGTAGTAACAATGAAGATGACAGACACTTTGATTGGCATTCACAACATAAGTGTGAAACAGAT
ACAATGGGAGGGAAGGAACATGGTTGGGGGTTTCAGAGCTTGCCTGCTATGTCTTCTAATGTTCATTCGCGCTCGTATCCCCCGAGCTTGCCTTCTGCGATGTCTTCTAA
TGTTCATGCGTATTCATATTCTCGGAGCTTGCCTCGACTTGGATACCGACCCACTCAGCCATATCAGCAATCAGTTCCTGGCCATAGGCTCACACCTCATGGTTACTACT
TGCAGCCACATCCATCACGTGCTTATCGTTATTTTCGGTCGCAGTCTCCCCCTCGAGCTATGCTATGGTGCATTATACTGATTATGCAGACAATTATAGCTTATAGTTAA
mRNA sequence
ATGAACATCAACTGCTGTGAAAAGTGCCCTCTAAAACTAGAGAGGAAGCTCCTCAAATCAAATGGAGTGGAATCTGTCACCATAGACTCAGATAAAGGGCTGGTAACAGT
ATCAGGTGACATTGACCCTGTTGTGCTCTTGCAAAAAGTAAAAAGAATGGGCAAAGAAGCCAAGCTCTGGTTCTTTGAACAAGAACCCGACTGCAACGACAAACCCGTTG
AGTGTTCGGATGGTGCTTCGAAGATCGAATGTGGCGGTACTGAATCTGGTAGTAACAATGAAGATGACAGACACTTTGATTGGCATTCACAACATAAGTGTGAAACAGAT
ACAATGGGAGGGAAGGAACATGGTTGGGGGTTTCAGAGCTTGCCTGCTATGTCTTCTAATGTTCATTCGCGCTCGTATCCCCCGAGCTTGCCTTCTGCGATGTCTTCTAA
TGTTCATGCGTATTCATATTCTCGGAGCTTGCCTCGACTTGGATACCGACCCACTCAGCCATATCAGCAATCAGTTCCTGGCCATAGGCTCACACCTCATGGTTACTACT
TGCAGCCACATCCATCACGTGCTTATCGTTATTTTCGGTCGCAGTCTCCCCCTCGAGCTATGCTATGGTGCATTATACTGATTATGCAGACAATTATAGCTTATAGTTAA
Protein sequence
MNINCCEKCPLKLERKLLKSNGVESVTIDSDKGLVTVSGDIDPVVLLQKVKRMGKEAKLWFFEQEPDCNDKPVECSDGASKIECGGTESGSNNEDDRHFDWHSQHKCETD
TMGGKEHGWGFQSLPAMSSNVHSRSYPPSLPSAMSSNVHAYSYSRSLPRLGYRPTQPYQQSVPGHRLTPHGYYLQPHPSRAYRYFRSQSPPRAMLWCIILIMQTIIAYS
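
The relationship between the CDS and the protein sequence listed in this record can be checked with a short translation routine. The sketch below is illustrative only: the function name `translate` is hypothetical, it assumes the standard genetic code (the page does not say which translation table was used), and it stops at the first stop codon.

```python
# Standard genetic code, with codons ordered by base T, C, A, G
# at each of the three positions (assumed; not stated by the source).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, then normalize case.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed in this record.
print(translate("ATGAACATCAACTGCTGTGAAAAGTGCCCT"))  # prints MNINCCEKCP
```

Passing the full CDS block (line breaks included) to `translate` and comparing the result with the protein sequence block is a quick consistency check on a record like this one.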