; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021250 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021250
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationchr7:5887038..5887568
RNA-Seq ExpressionLag0021250
SyntenyLag0021250
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138097.1 uncharacterized protein LOC111009349 [Momordica charantia]9.6e-8186.44Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEKAAR CSWEPY DKSDA+KFI+D+VLPHPWYRAICVDGRPVGAISVT N+AA DRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESK-TDELQC
        VKLVAERIF ERPELERVE LV V NLASQRV+EKAGF REG+LRK+GVLKGKTRDFVMFSLLSTDL+SK  DEL C
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESK-TDELQC

XP_022138172.1 uncharacterized protein LOC111009408 [Momordica charantia]1.4e-7985.55Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEK ARFCSWEPY DKSDA+KFI+D+VLPHPWYRAICVDGRPVGAI V+ N+AA DRCR ELGYVLGSKFWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESKTDE
        VKLVAERIF ERP LER+E LVDV NLASQRV+EKAGF REG+LRK+GVLKGKTRDFVMFSLLST L SKTDE
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESKTDE

XP_022955368.1 uncharacterized protein LOC111457417 [Cucurbita moschata]8.9e-7986.9Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY+DKSDAIK+I D+VL HPW+RAICVDGRPVGAISVT N AA DRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELER+E LV V NLASQRVLEKAGFQREG+LRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE

XP_022980780.1 uncharacterized protein LOC111480066 [Cucurbita maxima]2.4e-7986.9Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY+DK DAIK+I D+VL HPWYRAICVDGRPVGAISVT N AA DRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELER+E LVDV NLASQRV+EKAGFQREG+LRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE

XP_038898770.1 uncharacterized N-acetyltransferase p20 [Benincasa hispida]4.0e-7986.39Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTD+DDFM WA+DEKAARFCSWEPY+DKS+AIKFI D+VL HPWYRAICVDGRPVGAISV  NTA  DRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLES
        VKLVAERIFVE PELER+E LVDV NLASQRVLEKAGFQREG+LRK+GV KGKTRDFVMFS LSTDL+S
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLES

TrEMBL top hitse value%identityAlignment
A0A5D3D131 Putative N-acetyltransferase p20-like1.3e-7582.84Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WA+DEKAAR+CSWEPY+DKS+AIKFI D+VL HP+YRAICVDGRPVGAISV  NTAA D+CRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLES
        VKLVAERIFVE PELER+E LVDV N ASQRVLEKAGFQREG+LRK+GVLKG  RD+VMFS L TD  S
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLES

A0A6J1CA30 uncharacterized protein LOC1110093494.6e-8186.44Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEKAAR CSWEPY DKSDA+KFI+D+VLPHPWYRAICVDGRPVGAISVT N+AA DRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESK-TDELQC
        VKLVAERIF ERPELERVE LV V NLASQRV+EKAGF REG+LRK+GVLKGKTRDFVMFSLLSTDL+SK  DEL C
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESK-TDELQC

A0A6J1CAC0 uncharacterized protein LOC1110094086.7e-8085.55Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        MAVTLRPLDLTDIDDFM WASDEK ARFCSWEPY DKSDA+KFI+D+VLPHPWYRAICVDGRPVGAI V+ N+AA DRCR ELGYVLGSKFWGKGI T A
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESKTDE
        VKLVAERIF ERP LER+E LVDV NLASQRV+EKAGF REG+LRK+GVLKGKTRDFVMFSLLST L SKTDE
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESKTDE

A0A6J1GTR6 uncharacterized protein LOC1114574174.3e-7986.9Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY+DKSDAIK+I D+VL HPW+RAICVDGRPVGAISVT N AA DRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELER+E LV V NLASQRVLEKAGFQREG+LRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE

A0A6J1IXH7 uncharacterized protein LOC1114800661.1e-7986.9Show/hide
Query:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        M +TLRPLDLTDIDDFM WASDEKAAR CSWEPY+DK DAIK+I D+VL HPWYRAICVDGRPVGAISVT N AA DRCRGELGYVLGSKFWGKGIGTAA
Subjt:  MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE
        VKLVAERIFVERPELER+E LVDV NLASQRV+EKAGFQREG+LRK+GV+KG+TRD+VMFSLLSTDLE
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE

SwissProt top hitse value%identityAlignment
O31633 Putative [ribosomal protein S5]-alanine N-acetyltransferase7.0e-1030.68Show/hide
Query:  VTLRPLDLTDID----------DFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFW
        + +RPL++TD +          DF E  S  +A  + + E  + +    +   ++   + +      D R +G +S+ +      +    +GY L     
Subjt:  VTLRPLDLTDID----------DFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFW

Query:  GKGIGTAAVKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE
        GKGI T AV+LV +  F E  +L R+E  V   NL S RVLEKAGF +EGI RK   + G   D  + ++L+ D E
Subjt:  GKGIGTAAVKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLE

O34569 Uncharacterized N-acetyltransferase YoaA1.8e-0524.54Show/hide
Query:  LRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAI--CVDGRPVGAI--SVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        LR +   D +      S+++  R+   E  +    AI  I+     +   R I   ++ R    +  ++  +  A    R E+GY +  + W  G  +  
Subjt:  LRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAI--CVDGRPVGAI--SVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLL
        +  V    F     L R+  +V   N AS R+L K GFQ+EG+LR++    G   D  ++S++
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLL

P05332 Uncharacterized N-acetyltransferase p204.7e-1432.14Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKS---DAIKFIEDRVLPHPWYR-AICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGT
        +TLR ++L D D   ++ SD +  ++ +  P+ D S   D I+ I D  L     R +I V        +   N    +  R E+GY LG   WGKG  +
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKS---DAIKFIEDRVLPHPWYR-AICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGT

Query:  AAVKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTD
         AV+ + +  F     L R+E  V+  N  S ++L    FQ+EG+LR +   KG+  D  MFSLL  +
Subjt:  AAVKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTD

P96579 Putative ribosomal N-acetyltransferase YdaF3.0e-0829.21Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYR----------AICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFW
        +T+R L+  D +   E    +   R   W  + +   +     + ++P  W R           +  DG   G IS+  N   ++R + E+GY +  +F 
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYR----------AICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFW

Query:  GKGIGTAAVKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESK
        GKGI TAA + +    F E  EL RV     VGN  S+ V E+ GF  EG  R    + G   D V +SLL  + E +
Subjt:  GKGIGTAAVKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESK

Arabidopsis top hitse value%identityAlignment
AT2G32020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.5e-5258.18Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICV-DGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAAV
        ++LRP+ L+D+DD+M WA+D K ARFC+WEP   + +AIK+I DRVL HPW RAIC+ D RP+G I +     A+D  R E+GYVL  K+WGKG  T AV
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICV-DGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAAV

Query:  KLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTD
        +LV   +F E PE+ER+E LVDV N+ SQRVLEK GF REG++RKF  +KG  RD VMFS LSTD
Subjt:  KLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTD

AT2G32030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein9.6e-5558.79Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDG-RPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAAV
        + LRP+ L+D+DDFM WA+D    RFC+WEPY  +  AI ++ D +LPHPW RAIC+D  RP+G+ISVT     +D  RGE+GYVLGSK+WGKGI T AV
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDG-RPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAAV

Query:  KLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTD
        +LVA  IF E+PE++R+E LVDV N+ SQ+VLEK GF +EG++RKF  LKG  RD VMFS L +D
Subjt:  KLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTD

AT3G22560.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.0e-3540.36Show/hide
Query:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICV--DGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA
        + LRP +L+D +D  +WA D+   R+  W+      +A + I ++ +PHPW R+I +  DG  +G +SV  ++    RCR +L Y +  +FWG+GI TAA
Subjt:  VTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICV--DGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAA

Query:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTD
        V++  E+   + PE+ R++ +V+V N ASQRVLEKAGF++EG+L K+G  KG  RD  ++S +  D
Subjt:  VKLVAERIFVERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTAACTCTCCGGCCACTGGATCTCACCGACATCGACGACTTCATGGAGTGGGCATCGGACGAGAAAGCCGCTCGATTCTGCTCCTGGGAGCCCTACAAGGACAA
ATCGGACGCCATCAAGTTCATCGAGGACCGAGTCCTGCCGCACCCCTGGTACCGAGCGATCTGCGTCGACGGCCGCCCGGTGGGGGCGATTTCGGTGACGGAGAATACGG
CGGCGATGGACCGGTGCAGGGGCGAGCTAGGGTACGTTTTGGGGTCGAAATTTTGGGGGAAAGGGATCGGGACGGCGGCGGTGAAATTGGTGGCGGAGAGGATTTTCGTG
GAGCGGCCGGAGTTGGAGAGGGTGGAGGGGTTGGTGGATGTGGGGAATTTGGCGTCTCAGAGAGTGCTGGAGAAGGCCGGATTTCAGAGAGAAGGAATTCTGAGGAAATT
TGGAGTGTTGAAAGGGAAAACTAGGGATTTTGTTATGTTCAGTCTTCTTTCTACTGATCTTGAATCGAAAACTGATGAACTGCAATGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGTAACTCTCCGGCCACTGGATCTCACCGACATCGACGACTTCATGGAGTGGGCATCGGACGAGAAAGCCGCTCGATTCTGCTCCTGGGAGCCCTACAAGGACAA
ATCGGACGCCATCAAGTTCATCGAGGACCGAGTCCTGCCGCACCCCTGGTACCGAGCGATCTGCGTCGACGGCCGCCCGGTGGGGGCGATTTCGGTGACGGAGAATACGG
CGGCGATGGACCGGTGCAGGGGCGAGCTAGGGTACGTTTTGGGGTCGAAATTTTGGGGGAAAGGGATCGGGACGGCGGCGGTGAAATTGGTGGCGGAGAGGATTTTCGTG
GAGCGGCCGGAGTTGGAGAGGGTGGAGGGGTTGGTGGATGTGGGGAATTTGGCGTCTCAGAGAGTGCTGGAGAAGGCCGGATTTCAGAGAGAAGGAATTCTGAGGAAATT
TGGAGTGTTGAAAGGGAAAACTAGGGATTTTGTTATGTTCAGTCTTCTTTCTACTGATCTTGAATCGAAAACTGATGAACTGCAATGCTGA
Protein sequenceShow/hide protein sequence
MAVTLRPLDLTDIDDFMEWASDEKAARFCSWEPYKDKSDAIKFIEDRVLPHPWYRAICVDGRPVGAISVTENTAAMDRCRGELGYVLGSKFWGKGIGTAAVKLVAERIFV
ERPELERVEGLVDVGNLASQRVLEKAGFQREGILRKFGVLKGKTRDFVMFSLLSTDLESKTDELQC