; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021249 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021249
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationchr7:5884320..5884832
RNA-Seq ExpressionLag0021249
SyntenyLag0021249
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049351.1 putative N-acetyltransferase YoaA [Cucumis melo var. makuwa]2.4e-8186.98Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SSRIS+RPFNLSDA+DFLRWA D+RVTRYLRWNTI SK EALTY+EK+AIPH WRRSICLDGRSVGYVS KPESEEKCRAHISYAVAAEHWG+GIAT 
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALRAAIPAAL+EFPEVVRVQA+VEVENEGSQ+VLEK+GFCREG LRKYG CKGEIRDL++FSFLRT+QL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

KAE8650353.1 hypothetical protein Csa_010832 [Cucumis sativus]9.2e-8186.98Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SSRIS+RPFNLSDA+DFLRWA D+RVTRYLRWNTI SK EALTYLEKVAIPH WRRSICLDGRSVGYVS KPESEEKCRAHISYAVAAEHWG+GIAT 
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALRAAIPAAL++FPE+VRVQA+VEVENEGSQ+VLEK+GFCREG LRKYG CKGEIRDL++FS LRTDQL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

XP_004134439.1 uncharacterized protein LOC101219695 [Cucumis sativus]9.2e-8186.98Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SSRIS+RPFNLSDA+DFLRWA D+RVTRYLRWNTI SK EALTYLEKVAIPH WRRSICLDGRSVGYVS KPESEEKCRAHISYAVAAEHWG+GIAT 
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALRAAIPAAL++FPE+VRVQA+VEVENEGSQ+VLEK+GFCREG LRKYG CKGEIRDL++FS LRTDQL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

XP_022955367.1 uncharacterized protein LOC111457416 [Cucurbita moschata]3.2e-8188.76Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SS ISVRPFNLSDA+DFLRWAGDDRVTRYLRWNTI SK EALTYLEKVAIPHPWRRSICL+G SVGYVS+KPESEEKCR +ISYAVAAEHWGRGIATA
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALRAAIPAAL +FPEVVR+QALVEVENEGSQRVLEK+GFCREG LRKYG CKGEIR+ I+FSFLRTDQL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

XP_023527193.1 uncharacterized protein LOC111790505 [Cucurbita pepo subsp. pepo]8.4e-8289.35Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SS ISVRPFNLSDA+DFLRWAGDDRVTRYLRWNTI SK EALTYLEKVAIPHPWRRSICL+GRSVGYVS+KPESEEKCRA+ISYAVAAEHWGRGIATA
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALR AIPAAL +FPEVVR+QALVEVENEGSQRVLEK+GFCREG LRKYG CKGEIR+ I+FSFLRTDQL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

TrEMBL top hitse value%identityAlignment
A0A0A0LA76 N-acetyltransferase domain-containing protein4.5e-8186.98Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SSRIS+RPFNLSDA+DFLRWA D+RVTRYLRWNTI SK EALTYLEKVAIPH WRRSICLDGRSVGYVS KPESEEKCRAHISYAVAAEHWG+GIAT 
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALRAAIPAAL++FPE+VRVQA+VEVENEGSQ+VLEK+GFCREG LRKYG CKGEIRDL++FS LRTDQL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

A0A5A7U710 Putative N-acetyltransferase YoaA1.2e-8186.98Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SSRIS+RPFNLSDA+DFLRWA D+RVTRYLRWNTI SK EALTY+EK+AIPH WRRSICLDGRSVGYVS KPESEEKCRAHISYAVAAEHWG+GIAT 
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALRAAIPAAL+EFPEVVRVQA+VEVENEGSQ+VLEK+GFCREG LRKYG CKGEIRDL++FSFLRT+QL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

A0A6J1CCF0 uncharacterized protein LOC1110094374.5e-8187.06Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SSRISVR F LSDANDFLRWAGDDRVTRYLRW+ I SK EA+TYLEKVAIPHPWRRSICLDGRSVGYVS++PE+EE+CRAHISYAVAAEHWGRGIATA
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQLK
        ALRAA+  ALKEFPEVVRVQALVEVEN GSQRVLEKVGFCREG LRKYG CKGEIRD  IFSFLRTDQ++
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQLK

A0A6J1GTD3 uncharacterized protein LOC1114574161.5e-8188.76Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+SS ISVRPFNLSDA+DFLRWAGDDRVTRYLRWNTI SK EALTYLEKVAIPHPWRRSICL+G SVGYVS+KPESEEKCR +ISYAVAAEHWGRGIATA
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALRAAIPAAL +FPEVVR+QALVEVENEGSQRVLEK+GFCREG LRKYG CKGEIR+ I+FSFLRTDQL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

A0A6J1J069 uncharacterized protein LOC111480065 isoform X21.1e-7986.98Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA
        M+ S ISVRPFNLSDA+DFLRWAGDDRVTRYLRWNTI SK EAL YLEKVAIPHPWRRSICL+GRSVGYVS+KPESEEKCRA+ISYAVAAEHWGRGIAT 
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATA

Query:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
        ALR AIPAAL +FPEVVR+QALVEVENEGSQRVLEK+GFCREG LRKYG CKGEIR+ I+FS LRTDQL
Subjt:  ALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

SwissProt top hitse value%identityAlignment
O31633 Putative [ribosomal protein S5]-alanine N-acetyltransferase2.4e-0730.06Show/hide
Query:  ISVRPFNLSDANDFLRWAGDDR-------VTRYLRWNTIASKNEALT-YLEKVAIPHPWRRSI--CLDGRSVGYVSIKPESEEKCR-AHISYAVAAEHWG
        I VRP  ++DA + L    ++R       + R   + T+  + + +T Y E++     +   I    D R +G VS+        + A I Y +   H G
Subjt:  ISVRPFNLSDANDFLRWAGDDR-------VTRYLRWNTIASKNEALT-YLEKVAIPHPWRRSI--CLDGRSVGYVSIKPESEEKCR-AHISYAVAAEHWG

Query:  RGIATAALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTD
        +GI T A+R  +  A  E  ++ R++A V   N GS RVLEK GF +EG  RK     G   D  + + L  D
Subjt:  RGIATAALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTD

O34569 Uncharacterized N-acetyltransferase YoaA1.0e-1025.73Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSI--CLDGRS----VGYVSIKPESEEKCRAHISYAVAAEHWG
        +++ R+ +R     DA        +D VTRY     + S  +A++ ++  A  +  +R I   ++ R     +G +     +++  RA I Y +  EHW 
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSI--CLDGRS----VGYVSIKPESEEKCRAHISYAVAAEHWG

Query:  RGIATAALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLR
         G A+  +   +         + R+ A+V  +NE S R+L K+GF +EG LR+Y    G   D  ++S ++
Subjt:  RGIATAALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLR

P05332 Uncharacterized N-acetyltransferase p209.5e-1225.28Show/hide
Query:  SSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRS-------------VGYVSIKPESEEKCRAHISYAVA
        + R+++R   L DA+   ++  D  VT+Y+        ++A   ++ +         + L+G++             +G        +E  RA I Y + 
Subjt:  SSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRS-------------VGYVSIKPESEEKCRAHISYAVA

Query:  AEHWGRGIATAALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTD
          HWG+G A+ A++  I         + R++A VE EN  S ++L  + F +EG LR Y   KG + D+ +FS L+ +
Subjt:  AEHWGRGIATAALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTD

P49855 Uncharacterized protein YkkB1.5e-0427.86Show/hide
Query:  FNLSDANDFLRWAGDDRVTR-YLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPES-EEKCRAHISYAVAAEHWGRGIATAALRAAIPA
        FN  D   +     D R TR ++ WN    K   ++          W       G  +G   I P+  E +    I Y  A  HWG G A  A RA +  
Subjt:  FNLSDANDFLRWAGDDRVTR-YLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPES-EEKCRAHISYAVAAEHWGRGIATAALRAAIPA

Query:  ALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKY
           E  +  ++ AL++  N+ S RV EK+G      +RK+
Subjt:  ALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKY

Arabidopsis top hitse value%identityAlignment
AT2G32020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.0e-4247.9Show/hide
Query:  RISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICL-DGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATAALR
        RIS+RP  LSD +D++ WA D +V R+  W    S++EA+ Y+    + HPW R+ICL D R +GY+ I   + +  R  I Y +A ++WG+G AT A+R
Subjt:  RISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICL-DGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATAALR

Query:  AAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQLK
               +EFPE+ R++ALV+V+N GSQRVLEKVGF REG +RK+   KG +RD ++FSFL TD LK
Subjt:  AAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQLK

AT2G32030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.5e-4146.99Show/hide
Query:  RISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDG-RSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATAALR
        +I +RP  LSD +DF+ WA D  VTR+  W    S+  A+ YL    +PHPW R+ICLD  R +G +S+ P  E   R  I Y + +++WG+GIAT A+R
Subjt:  RISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDG-RSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATAALR

Query:  AAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL
               KE PE+ R++ALV+V+N GSQ+VLEKVGF +EG +RK+   KG +RD+++FSFL +D L
Subjt:  AAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQL

AT3G22560.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein9.9e-5761.76Show/hide
Query:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICL--DGRSVGYVSIKPES-EEKCRAHISYAVAAEHWGRGI
        M+S RI +RPFNLSDA D  +WAGDD VTRYLRW+++ S  EA  ++   AIPHPWRRSI L  DG S+GYVS+KP+S + +CRA ++YAVA E WGRGI
Subjt:  MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICL--DGRSVGYVSIKPES-EEKCRAHISYAVAAEHWGRGI

Query:  ATAALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTD
        ATAA+R A+  AL++FPEVVR+QA+VEVEN+ SQRVLEK GF +EG L KYG  KG IRD+ ++S+++ D
Subjt:  ATAALRAAIPAALKEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTCATCAAGAATTTCCGTCCGGCCGTTCAATCTCTCCGACGCCAACGACTTTCTCCGGTGGGCCGGCGACGACAGAGTAACCCGCTATCTCCGATGGAATACAAT
TGCCTCCAAAAATGAAGCATTAACTTACCTAGAGAAGGTCGCGATTCCCCATCCATGGCGGCGGTCCATCTGCTTGGACGGCCGTTCCGTCGGATACGTTTCGATCAAGC
CGGAATCGGAGGAGAAATGCAGAGCGCATATCAGTTATGCTGTGGCAGCGGAGCATTGGGGCCGAGGAATAGCCACGGCGGCGCTGAGGGCGGCGATTCCAGCGGCGTTG
AAAGAGTTTCCGGAGGTGGTTAGGGTTCAGGCGCTGGTGGAGGTCGAGAATGAGGGATCGCAGAGGGTTCTGGAGAAGGTGGGGTTTTGCAGAGAGGGGCAGTTGAGGAA
ATATGGAATTTGTAAGGGCGAAATTCGGGATTTGATAATTTTCAGCTTTTTAAGAACCGATCAACTCAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTCATCAAGAATTTCCGTCCGGCCGTTCAATCTCTCCGACGCCAACGACTTTCTCCGGTGGGCCGGCGACGACAGAGTAACCCGCTATCTCCGATGGAATACAAT
TGCCTCCAAAAATGAAGCATTAACTTACCTAGAGAAGGTCGCGATTCCCCATCCATGGCGGCGGTCCATCTGCTTGGACGGCCGTTCCGTCGGATACGTTTCGATCAAGC
CGGAATCGGAGGAGAAATGCAGAGCGCATATCAGTTATGCTGTGGCAGCGGAGCATTGGGGCCGAGGAATAGCCACGGCGGCGCTGAGGGCGGCGATTCCAGCGGCGTTG
AAAGAGTTTCCGGAGGTGGTTAGGGTTCAGGCGCTGGTGGAGGTCGAGAATGAGGGATCGCAGAGGGTTCTGGAGAAGGTGGGGTTTTGCAGAGAGGGGCAGTTGAGGAA
ATATGGAATTTGTAAGGGCGAAATTCGGGATTTGATAATTTTCAGCTTTTTAAGAACCGATCAACTCAAGTGA
Protein sequenceShow/hide protein sequence
MDSSRISVRPFNLSDANDFLRWAGDDRVTRYLRWNTIASKNEALTYLEKVAIPHPWRRSICLDGRSVGYVSIKPESEEKCRAHISYAVAAEHWGRGIATAALRAAIPAAL
KEFPEVVRVQALVEVENEGSQRVLEKVGFCREGQLRKYGICKGEIRDLIIFSFLRTDQLK