; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031433 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031433
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationscaffold11:39397116..39397652
RNA-Seq ExpressionSpg031433
SyntenySpg031433
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138097.1 uncharacterized protein LOC111009349 [Momordica charantia]3.3e-8188.57Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEKAAR CSWEPY DKSDA+KFI+D+VLPHPW+RAICVDGRPVGAISVT N+AARDRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK-IDEL
        VKLVAERIF ERPELER+EALV VENLASQRV+EKAGF REGVLRK+GVLKGKTRDFVMFSLLSTDL+SK IDEL
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK-IDEL

XP_022138172.1 uncharacterized protein LOC111009408 [Momordica charantia]1.6e-8087.28Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEK ARFCSWEPY DKSDA+KFI+D+VLPHPW+RAICVDGRPVGAI V+ N+AARDRCR ELGYVLGSKFWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKIDE
        VKLVAERIF ERP LERLEALVDVENLASQRV+EKAGF REGVLRK+GVLKGKTRDFVMFSLLST L SK DE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKIDE

XP_022955368.1 uncharacterized protein LOC111457417 [Cucurbita moschata]2.8e-8089.88Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY+DKSDAIK+I D+VL HPW RAICVDGRPVGAISVT N AARDRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELERLEALV VENLASQRVLEKAGFQREGVLRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

XP_022980780.1 uncharacterized protein LOC111480066 [Cucurbita maxima]9.7e-8189.29Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY+DK DAIK+I D+VL HPW+RAICVDGRPVGAISVT N AARDRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELERLEALVDVENLASQRV+EKAGFQREGVLRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

XP_038898770.1 uncharacterized N-acetyltransferase p20 [Benincasa hispida]1.3e-8088.76Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTD+DDFM WA+DEKAARFCSWEPY+DKS+AIKFI D+VL HPW+RAICVDGRPVGAISV  NTA RDRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLES
        VKLVAERIFVE PELERLEALVDVENLASQRVLEKAGFQREGVLRK+GV KGKTRDFVMFS LSTDL+S
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLES

TrEMBL top hitse value%identityAlignment
A0A5D3D131 Putative N-acetyltransferase p20-like4.1e-7785.21Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WA+DEKAAR+CSWEPY+DKS+AIKFI D+VL HP++RAICVDGRPVGAISV  NTAARD+CRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLES
        VKLVAERIFVE PELERLEALVDVEN ASQRVLEKAGFQREGVLRK+GVLKG  RD+VMFS L TD  S
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLES

A0A6J1CA30 uncharacterized protein LOC1110093491.6e-8188.57Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEKAAR CSWEPY DKSDA+KFI+D+VLPHPW+RAICVDGRPVGAISVT N+AARDRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK-IDEL
        VKLVAERIF ERPELER+EALV VENLASQRV+EKAGF REGVLRK+GVLKGKTRDFVMFSLLSTDL+SK IDEL
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK-IDEL

A0A6J1CAC0 uncharacterized protein LOC1110094088.0e-8187.28Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEK ARFCSWEPY DKSDA+KFI+D+VLPHPW+RAICVDGRPVGAI V+ N+AARDRCR ELGYVLGSKFWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKIDE
        VKLVAERIF ERP LERLEALVDVENLASQRV+EKAGF REGVLRK+GVLKGKTRDFVMFSLLST L SK DE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKIDE

A0A6J1GTR6 uncharacterized protein LOC1114574171.4e-8089.88Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY+DKSDAIK+I D+VL HPW RAICVDGRPVGAISVT N AARDRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELERLEALV VENLASQRVLEKAGFQREGVLRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

A0A6J1IXH7 uncharacterized protein LOC1114800664.7e-8189.29Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY+DK DAIK+I D+VL HPW+RAICVDGRPVGAISVT N AARDRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELERLEALVDVENLASQRV+EKAGFQREGVLRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

SwissProt top hitse value%identityAlignment
O31633 Putative [ribosomal protein S5]-alanine N-acetyltransferase7.1e-1030.68Show/hide
Query:  VTLRPLDLTDID----------DFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFW
        + +RPL++TD +          DF E  S  +A  + + E  + +    +   ++   + +      D R +G +S+ +      +    +GY L     
Subjt:  VTLRPLDLTDID----------DFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFW

Query:  GKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE
        GKGI T AV+LV +  F E  +L R+EA V   NL S RVLEKAGF +EG+ RK   + G   D  + ++L+ D E
Subjt:  GKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLE

O34569 Uncharacterized N-acetyltransferase YoaA8.1e-0625.77Show/hide
Query:  LRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAI--CVDGRPVGAI--SVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        LR +   D +      S+++  R+   E  +    AI  I+     +   R I   ++ R    +  ++  +  A+   R E+GY +  + W  G  +  
Subjt:  LRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAI--CVDGRPVGAI--SVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLL
        +  V    F     L R+ A+V  +N AS R+L K GFQ+EGVLR++    G   D  ++S++
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLL

P05332 Uncharacterized N-acetyltransferase p201.3e-1433.33Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKS---DAIKFIEDRVLPHPWFR-AICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGT
        +TLR ++L D D   ++ SD +  ++ +  P+ D S   D I+ I D  L     R +I V        +   N   ++  R E+GY LG   WGKG  +
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKS---DAIKFIEDRVLPHPWFR-AICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGT

Query:  AAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD
         AV+ + +  F     L R+EA V+ EN  S ++L    FQ+EG+LR +   KG+  D  MFSLL  +
Subjt:  AAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD

P96579 Putative ribosomal N-acetyltransferase YdaF2.8e-0628.09Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFR----------AICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFW
        +T+R L+  D +   E    +   R   W  + +   +     + ++P  W R           +  DG   G IS+  N    +R + E+GY +  +F 
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFR----------AICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFW

Query:  GKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK
        GKGI TAA + +    F E  EL R+     V N  S+ V E+ GF  EG  R    + G   D V +SLL  + E +
Subjt:  GKGIGTAAVKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESK

Arabidopsis top hitse value%identityAlignment
AT2G32020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein5.3e-5360Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICV-DGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAAV
        ++LRP+ L+D+DD+M WA+D K ARFC+WEP   + +AIK+I DRVL HPW RAIC+ D RP+G I +     A D  R E+GYVL  K+WGKG  T AV
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICV-DGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAAV

Query:  KLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD
        +LV   +F E PE+ERLEALVDV+N+ SQRVLEK GF REGV+RKF  +KG  RD VMFS LSTD
Subjt:  KLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD

AT2G32030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.3e-5560.61Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDG-RPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAAV
        + LRP+ L+D+DDFM WA+D    RFC+WEPY  +  AI ++ D +LPHPW RAIC+D  RP+G+ISVT      D  RGE+GYVLGSK+WGKGI T AV
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDG-RPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAAV

Query:  KLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD
        +LVA  IF E+PE++RLEALVDV+N+ SQ+VLEK GF +EGV+RKF  LKG  RD VMFS L +D
Subjt:  KLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD

AT3G22560.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.2e-3642.17Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICV--DGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA
        + LRP +L+D +D  +WA D+   R+  W+      +A + I ++ +PHPW R+I +  DG  +G +SV  + +   RCR +L Y +  +FWG+GI TAA
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICV--DGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD
        V++  E+   + PE+ RL+A+V+VEN ASQRVLEKAGF++EG+L K+G  KG  RD  ++S +  D
Subjt:  VKLVAERIFVERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTAACTCTCCGCCCACTGGATCTCACCGACATCGACGATTTCATGGAGTGGGCATCGGACGAGAAAGCCGCTCGATTCTGCTCGTGGGAGCCCTACAAGGACAA
ATCGGACGCCATCAAGTTCATCGAGGATCGAGTCCTACCGCACCCCTGGTTCCGAGCGATCTGCGTCGACGGCCGCCCGGTGGGGGCGATTTCGGTGACGGAGAATACGG
CGGCGAGGGACCGGTGCAGGGGCGAGCTAGGGTACGTTTTGGGGTCGAAATTTTGGGGGAAAGGGATCGGGACGGCGGCGGTGAAATTGGTGGCGGAGAGGATTTTCGTG
GAGCGGCCGGAGTTGGAGAGGCTGGAGGCGTTGGTGGATGTGGAGAATTTGGCGTCTCAGAGAGTGCTGGAGAAGGCTGGATTTCAGAGAGAAGGAGTTCTGAGGAAATT
TGGAGTGTTGAAAGGGAAAACTAGGGATTTTGTTATGTTTAGTCTTCTTTCTACTGATCTTGAGTCGAAAATTGATGAACTGACGCTGATGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGTAACTCTCCGCCCACTGGATCTCACCGACATCGACGATTTCATGGAGTGGGCATCGGACGAGAAAGCCGCTCGATTCTGCTCGTGGGAGCCCTACAAGGACAA
ATCGGACGCCATCAAGTTCATCGAGGATCGAGTCCTACCGCACCCCTGGTTCCGAGCGATCTGCGTCGACGGCCGCCCGGTGGGGGCGATTTCGGTGACGGAGAATACGG
CGGCGAGGGACCGGTGCAGGGGCGAGCTAGGGTACGTTTTGGGGTCGAAATTTTGGGGGAAAGGGATCGGGACGGCGGCGGTGAAATTGGTGGCGGAGAGGATTTTCGTG
GAGCGGCCGGAGTTGGAGAGGCTGGAGGCGTTGGTGGATGTGGAGAATTTGGCGTCTCAGAGAGTGCTGGAGAAGGCTGGATTTCAGAGAGAAGGAGTTCTGAGGAAATT
TGGAGTGTTGAAAGGGAAAACTAGGGATTTTGTTATGTTTAGTCTTCTTTCTACTGATCTTGAGTCGAAAATTGATGAACTGACGCTGATGAATTGA
Protein sequenceShow/hide protein sequence
MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWFRAICVDGRPVGAISVTENTAARDRCRGELGYVLGSKFWGKGIGTAAVKLVAERIFV
ERPELERLEALVDVENLASQRVLEKAGFQREGVLRKFGVLKGKTRDFVMFSLLSTDLESKIDELTLMN