; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031651 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031651
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationscaffold11:39405143..39405673
RNA-Seq ExpressionSpg031651
SyntenySpg031651
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138097.1 uncharacterized protein LOC111009349 [Momordica charantia]1.5e-7884.75Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEK AR CSWEPY DKSDA+KFI+D+VLPHPWYR ICVDGRPVGAI +T N+AA DRC  ELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK-TDELQC
        VKLVAERIF ERPELER+EALV VENLASQRV+EKAGF REGVLRK+GVLKGKTRDFVMFSLLSTDL+SK  DEL C
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK-TDELQC

XP_022138172.1 uncharacterized protein LOC111009408 [Momordica charantia]4.3e-8187.28Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEKVARFCSWEPY DKSDA+KFI+D+VLPHPWYR ICVDGRPVGAI+++ N+AA DRCR ELGYVLGSKFWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKTDE
        VKLVAERIF ERP LERLEALVDVENLASQRV+EKAGF REGVLRK+GVLKGKTRDFVMFSLLST L SKTDE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKTDE

XP_022955368.1 uncharacterized protein LOC111457417 [Cucurbita moschata]3.8e-7786.31Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEK AR CSWEPY+DKSDAIK+I D+VL HPW+R ICVDGRPVGAI +T N AA DRCR ELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELERLEALV VENLASQRVLEKAGFQREGVLRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

XP_022980780.1 uncharacterized protein LOC111480066 [Cucurbita maxima]9.9e-7886.31Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEK AR CSWEPY+DK DAIK+I D+VL HPWYR ICVDGRPVGAI +T N AA DRCR ELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELERLEALVDVENLASQRV+EKAGFQREGVLRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

XP_038898770.1 uncharacterized N-acetyltransferase p20 [Benincasa hispida]1.7e-7785.8Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTD+DDFM WA+DEK ARFCSWEPY+DKS+AIKFI D+VL HPWYR ICVDGRPVGAI +  NTA  DRCR ELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLES
        VKLVAERIFVE PELERLEALVDVENLASQRVLEKAGFQREGVLRK+GV KGKTRDFVMFS LSTDL+S
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLES

TrEMBL top hitse value%identityAlignment
A0A5D3D131 Putative N-acetyltransferase p20-like5.5e-7482.25Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WA+DEK AR+CSWEPY+DKS+AIKFI D+VL HP+YR ICVDGRPVGAI +  NTAA D+CR ELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLES
        VKLVAERIFVE PELERLEALVDVEN ASQRVLEKAGFQREGVLRK+GVLKG  RD+VMFS L TD  S
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLES

A0A6J1CA30 uncharacterized protein LOC1110093497.4e-7984.75Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEK AR CSWEPY DKSDA+KFI+D+VLPHPWYR ICVDGRPVGAI +T N+AA DRC  ELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK-TDELQC
        VKLVAERIF ERPELER+EALV VENLASQRV+EKAGF REGVLRK+GVLKGKTRDFVMFSLLSTDL+SK  DEL C
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK-TDELQC

A0A6J1CAC0 uncharacterized protein LOC1110094082.1e-8187.28Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEKVARFCSWEPY DKSDA+KFI+D+VLPHPWYR ICVDGRPVGAI+++ N+AA DRCR ELGYVLGSKFWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKTDE
        VKLVAERIF ERP LERLEALVDVENLASQRV+EKAGF REGVLRK+GVLKGKTRDFVMFSLLST L SKTDE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKTDE

A0A6J1GTR6 uncharacterized protein LOC1114574171.8e-7786.31Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEK AR CSWEPY+DKSDAIK+I D+VL HPW+R ICVDGRPVGAI +T N AA DRCR ELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELERLEALV VENLASQRVLEKAGFQREGVLRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

A0A6J1IXH7 uncharacterized protein LOC1114800664.8e-7886.31Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEK AR CSWEPY+DK DAIK+I D+VL HPWYR ICVDGRPVGAI +T N AA DRCR ELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELERLEALVDVENLASQRV+EKAGFQREGVLRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

SwissProt top hitse value%identityAlignment
O31633 Putative [ribosomal protein S5]-alanine N-acetyltransferase4.1e-1031.25Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEK-------VARFCSWEPYKDKSDAIKFIEDRVLPHPWYR-GI--CVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFW
        + +RPL++TD ++ +   S+ +       + R   +   + +   I   ++R+     Y  GI    D R +G + L +      +    +GY L     
Subjt:  VTLRPLDLTDIDDFMEWASDEK-------VARFCSWEPYKDKSDAIKFIEDRVLPHPWYR-GI--CVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFW

Query:  GKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        GKGI T AV+LV +  F E  +L R+EA V   NL S RVLEKAGF +EG+ RK   + G   D  + ++L+ D E
Subjt:  GKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

O34569 Uncharacterized N-acetyltransferase YoaA3.3e-0727.61Show/hide
Query:  LRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGI--CVDGRPVGAILLT--ENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        LR +   D +      S+++V R+   E  +    AI  I+     +   RGI   ++ R    ++ T   +  A    R E+GY +  + W  G  +  
Subjt:  LRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGI--CVDGRPVGAILLT--ENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLL
        +  V    F     L R+ A+V  +N AS R+L K GFQ+EGVLR++    G   D  ++S++
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLL

P05332 Uncharacterized N-acetyltransferase p207.2e-1532.2Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKS---DAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRC----------RVELGYVLGS
        +TLR ++L D D   ++ SD +V ++ +  P+ D S   D I+ I D  L     R          +I++ E       C          R E+GY LG 
Subjt:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKS---DAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRC----------RVELGYVLGS

Query:  KFWGKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD
          WGKG  + AV+ + +  F     L R+EA V+ EN  S ++L    FQ+EG+LR +   KG+  D  MFSLL  +
Subjt:  KFWGKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD

P96579 Putative ribosomal N-acetyltransferase YdaF2.5e-0728.65Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYR----------GICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFW
        +T+R L+  D +   E     +  R   W  + +   +     + ++P  W R          G+  DG   G I L  N    +R + E+GY +  +F 
Subjt:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYR----------GICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFW

Query:  GKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK
        GKGI TAA + +    F E  EL R+     V N  S+ V E+ GF  EG  R    + G   D V +SLL  + E +
Subjt:  GKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK

Arabidopsis top hitse value%identityAlignment
AT2G32020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.8e-5460.61Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICV-DGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAAV
        ++LRP+ L+D+DD+M WA+D KVARFC+WEP   + +AIK+I DRVL HPW R IC+ D RP+G IL+     A D  R E+GYVL  K+WGKG  T AV
Subjt:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICV-DGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAAV

Query:  KLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD
        +LV   +F E PE+ERLEALVDV+N+ SQRVLEK GF REGV+RKF  +KG  RD VMFS LSTD
Subjt:  KLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD

AT2G32030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.4e-5358.79Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDG-RPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAAV
        + LRP+ L+D+DDFM WA+D  V RFC+WEPY  +  AI ++ D +LPHPW R IC+D  RP+G+I +T      D  R E+GYVLGSK+WGKGI T AV
Subjt:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDG-RPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAAV

Query:  KLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD
        +LVA  IF E+PE++RLEALVDV+N+ SQ+VLEK GF +EGV+RKF  LKG  RD VMFS L +D
Subjt:  KLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD

AT3G22560.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.5e-3641.57Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICV--DGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA
        + LRP +L+D +D  +WA D+ V R+  W+      +A + I ++ +PHPW R I +  DG  +G + +  ++    RCR +L Y +  +FWG+GI TAA
Subjt:  VTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICV--DGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD
        V++  E+   + PE+ RL+A+V+VEN ASQRVLEKAGF++EG+L K+G  KG  RD  ++S +  D
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTAACTCTCCGGCCACTTGATCTCACTGACATCGACGATTTCATGGAGTGGGCATCGGACGAGAAAGTCGCTCGATTCTGCTCGTGGGAGCCCTACAAGGACAA
ATCGGACGCCATCAAGTTCATCGAGGACCGAGTCCTACCGCACCCCTGGTACCGAGGGATCTGCGTCGACGGCCGCCCGGTGGGGGCGATTTTGCTGACGGAGAATACGG
CGGCGTTCGACCGATGCAGGGTCGAACTAGGGTACGTTTTGGGGTCGAAATTTTGGGGGAAAGGGATCGGGACGGCGGCGGTGAAATTGGTGGCGGAGAGGATTTTTGTG
GAGCGGCCGGAGTTGGAGAGGCTGGAGGCGTTGGTGGATGTGGAGAATTTGGCGTCTCAAAGAGTGCTGGAGAAGGCTGGATTTCAGAGAGAAGGAGTTCTGAGGAAATT
TGGAGTGTTGAAAGGGAAAACTAGGGATTTTGTCATGTTCAGTCTCCTTTCTACTGATCTTGAATCGAAAACTGATGAACTGCAATGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGTAACTCTCCGGCCACTTGATCTCACTGACATCGACGATTTCATGGAGTGGGCATCGGACGAGAAAGTCGCTCGATTCTGCTCGTGGGAGCCCTACAAGGACAA
ATCGGACGCCATCAAGTTCATCGAGGACCGAGTCCTACCGCACCCCTGGTACCGAGGGATCTGCGTCGACGGCCGCCCGGTGGGGGCGATTTTGCTGACGGAGAATACGG
CGGCGTTCGACCGATGCAGGGTCGAACTAGGGTACGTTTTGGGGTCGAAATTTTGGGGGAAAGGGATCGGGACGGCGGCGGTGAAATTGGTGGCGGAGAGGATTTTTGTG
GAGCGGCCGGAGTTGGAGAGGCTGGAGGCGTTGGTGGATGTGGAGAATTTGGCGTCTCAAAGAGTGCTGGAGAAGGCTGGATTTCAGAGAGAAGGAGTTCTGAGGAAATT
TGGAGTGTTGAAAGGGAAAACTAGGGATTTTGTCATGTTCAGTCTCCTTTCTACTGATCTTGAATCGAAAACTGATGAACTGCAATGCTGA
Protein sequenceShow/hide protein sequence
MAVTLRPLDLTDIDDFMEWASDEKVARFCSWEPYKDKSDAIKFIEDRVLPHPWYRGICVDGRPVGAILLTENTAAFDRCRVELGYVLGSKFWGKGIGTAAVKLVAERIFV
ERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKTDELQC