; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023042 (gene) of Chayote v1 genome

Gene IDSed0023042
OrganismSechium edule (Chayote v1)
Description4HBT domain-containing protein
Genome locationLG12:4331023..4333210
RNA-Seq ExpressionSed0023042
SyntenySed0023042
Gene Ontology termsGO:0042372 - phylloquinone biosynthetic process (biological process)
GO:0051289 - protein homotetramerization (biological process)
GO:0005777 - peroxisome (cellular component)
GO:0061522 - 1,4-dihydroxy-2-naphthoyl-CoA thioesterase activity (molecular function)
InterPro domainsIPR003736 - Phenylacetic acid degradation-related domain
IPR006683 - Thioesterase domain
IPR029069 - HotDog domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582134.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1, partial [Cucurbita argyrosperma subsp. sororia]2.7e-5875.16Show/hide
Query:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA
        MS+ + PP S P VLD  L A GFE +HVS   V GRLLVS ICCQPFKVLHGGVSALIAESLAS+ AH ASGY+RVAGI L+++HLKSAALGDLV AEA
Subjt:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA

Query:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        APV++GRTIQVWDVKLWKD+KE KV+VS+ARVTLLCN+ VPKHAE+A+NALK F+KL
Subjt:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

XP_004147638.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1 [Cucumis sativus]1.2e-5575.84Show/hide
Query:  SSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRT
        S+ P VLDA L++LGFE  HVS   V+GRLLVSPICCQPFKVLHGGVSALIAESLASM AH ASGY+RVAGI L+++HLKSA+LG+LV AEA PV++GRT
Subjt:  SSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRT

Query:  IQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        IQVWDV+LWKD+KE+KV+VS+ARVTLL N+PVPKH EDA++ALK FSKL
Subjt:  IQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

XP_022955680.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like [Cucurbita moschata]3.2e-5975.8Show/hide
Query:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA
        MS+ + PP S P VLD  L A GFE +HVS   V GRLLVS ICCQPFKVLHGGVSALIAESLAS+ AH ASGY+RVAGI L+++HLKSAALGDLV AEA
Subjt:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA

Query:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        APV++GRTIQVWDVKLWKD+KE KV+VS+ARVTLLCN+PVPKHAE+A+NALK F+KL
Subjt:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

XP_022979957.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like [Cucurbita maxima]9.2e-5975.8Show/hide
Query:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA
        MS    PP S P VLD  L A GFE +HVS   V GRLLVS ICCQPFKVLHGGVSALIAESLAS+ AH ASGY+RVAGI L+++HLKSAALGDLVFAEA
Subjt:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA

Query:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        APV++GRTIQVWDVKLWKD+KE KV+VS+ARVTLLCN+PVPKHA +A+NALK F+KL
Subjt:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

XP_038903276.1 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like isoform X1 [Benincasa hispida]2.9e-6078.21Show/hide
Query:  STKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAA
        ST N+PP SKP VLDA L+A GFE + VSS  V+GRLLVSPICCQPFKVLHGGVSALIAESLASM AH ASGY+RVAGI L+++HLKSAALGDLV AEA 
Subjt:  STKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAA

Query:  PVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        PV++GRTIQVWDV+LWKD+KE+KV+VS+ARVTLLCN+PVPKHAE+A++ALK+FSKL
Subjt:  PVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

TrEMBL top hitse value%identityAlignment
A0A0A0L876 4HBT domain-containing protein6.0e-5675.84Show/hide
Query:  SSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRT
        S+ P VLDA L++LGFE  HVS   V+GRLLVSPICCQPFKVLHGGVSALIAESLASM AH ASGY+RVAGI L+++HLKSA+LG+LV AEA PV++GRT
Subjt:  SSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRT

Query:  IQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        IQVWDV+LWKD+KE+KV+VS+ARVTLL N+PVPKH EDA++ALK FSKL
Subjt:  IQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

A0A1S3AYE8 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like7.3e-5470.7Show/hide
Query:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA
        MS+ ++P  + P +LDA L++ GFE   VS   V GRLLVS ICCQPFKVLHGGVSALIAESLASM AH ASGY+RVAGI L+++HLKSAALG+LV AEA
Subjt:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA

Query:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
         PV++GRTIQVWDV+LWKD+KE+KV+VS+ARVTLLCN+PVPKH ++A++ALK FSKL
Subjt:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

A0A5D3C1G9 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like9.6e-5470.7Show/hide
Query:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA
        MS+ ++P  + P +LDA L++ GFE   VS   V GRLLVS ICCQPFKVLHGGVSALIAESLASM AH ASGY+RVAGI L+++HLKSAALG+LV AEA
Subjt:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA

Query:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
         PV++GRTIQVWDV+LWKD+KE+KV+VS+ARVTLLCN+PVPKH ++A++ALK FSKL
Subjt:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

A0A6J1GVR9 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like1.5e-5975.8Show/hide
Query:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA
        MS+ + PP S P VLD  L A GFE +HVS   V GRLLVS ICCQPFKVLHGGVSALIAESLAS+ AH ASGY+RVAGI L+++HLKSAALGDLV AEA
Subjt:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA

Query:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        APV++GRTIQVWDVKLWKD+KE KV+VS+ARVTLLCN+PVPKHAE+A+NALK F+KL
Subjt:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

A0A6J1IQ50 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 1-like4.4e-5975.8Show/hide
Query:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA
        MS    PP S P VLD  L A GFE +HVS   V GRLLVS ICCQPFKVLHGGVSALIAESLAS+ AH ASGY+RVAGI L+++HLKSAALGDLVFAEA
Subjt:  MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEA

Query:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        APV++GRTIQVWDVKLWKD+KE KV+VS+ARVTLLCN+PVPKHA +A+NALK F+KL
Subjt:  APVSLGRTIQVWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

SwissProt top hitse value%identityAlignment
P45083 Putative esterase HI_11616.9e-0933.88Show/hide
Query:  DAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVA-SGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRTIQVWDV
        ++A+  LG E        +   + V     QPF VLHGGVS  +AE++ S+A  +     + V G+ +  +HL+    G  V A A P++LGR IQVW +
Subjt:  DAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVA-SGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRTIQVWDV

Query:  KLWKDVK-EKKVLVSSARVTL
            D++ E+  L   +R+TL
Subjt:  KLWKDVK-EKKVLVSSARVTL

P77781 1,4-dihydroxy-2-naphthoyl-CoA hydrolase5.3e-0933.63Show/hide
Query:  LGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVAS-GYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRTIQVWDVKLWKDV
        L   FEH+    +   + V     QPF +LHGG S ++AES+ S+A ++ + G ++V G+++  +H++SA  G  V     P+ LG   QVW ++++   
Subjt:  LGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVAS-GYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRTIQVWDVKLWKDV

Query:  KEKKVLVSSARVT
         EK  L  S+R+T
Subjt:  KEKKVLVSSARVT

Q9FI76 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 21.8e-4153.55Show/hide
Query:  PSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGR
        P S   ++D  L+ LGF F+ +S+  V+G L ++  CCQPFKVLHGGVSALIAE+LAS+ A +ASG++RVAGI L++ HL+ AALG++VFAE+ PVS+G+
Subjt:  PSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGR

Query:  TIQVWDVKLWK----DVKEKKVLVSSARVTLLCNIPVPKHAEDASNAL-KTFSKL
         IQVW+V+LWK    +  + K++VS++RVTL C +P+P H +DA + L K  SKL
Subjt:  TIQVWDVKLWK----DVKEKKVLVSSARVTLLCNIPVPKHAEDASNAL-KTFSKL

Q9I3A4 Putative esterase PA16185.3e-0937.25Show/hide
Query:  EALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAH--VASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRTIQVWDVKLW
        + LG  FE      +T  + V     QPF +LHGG S ++AESL SMA++  V +      G+++  +HL+    G  V A A  + LGRT  VWD++L 
Subjt:  EALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAH--VASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRTIQVWDVKLW

Query:  KD
         D
Subjt:  KD

Q9SX65 1,4-dihydroxy-2-naphthoyl-CoA thioesterase 16.6e-5266.45Show/hide
Query:  SSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRT
        SS    +D  L  LGFEF+ +S   +TGRL VSP+CCQPFKVLHGGVSALIAESLASM AH+ASG++RVAGIQL+++HLKSA LGDLVFAEA PVS G+T
Subjt:  SSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRT

Query:  IQVWDVKLWKDV---KEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        IQVW+VKLWK     K  K+L+SS+RVTL+CN+P+P +A+DA+N LK  +KL
Subjt:  IQVWDVKLWKDV---KEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

Arabidopsis top hitse value%identityAlignment
AT1G48320.1 Thioesterase superfamily protein4.7e-5366.45Show/hide
Query:  SSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRT
        SS    +D  L  LGFEF+ +S   +TGRL VSP+CCQPFKVLHGGVSALIAESLASM AH+ASG++RVAGIQL+++HLKSA LGDLVFAEA PVS G+T
Subjt:  SSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRT

Query:  IQVWDVKLWKDV---KEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL
        IQVW+VKLWK     K  K+L+SS+RVTL+CN+P+P +A+DA+N LK  +KL
Subjt:  IQVWDVKLWKDV---KEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL

AT5G48950.1 Thioesterase superfamily protein1.3e-4253.55Show/hide
Query:  PSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGR
        P S   ++D  L+ LGF F+ +S+  V+G L ++  CCQPFKVLHGGVSALIAE+LAS+ A +ASG++RVAGI L++ HL+ AALG++VFAE+ PVS+G+
Subjt:  PSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGR

Query:  TIQVWDVKLWK----DVKEKKVLVSSARVTLLCNIPVPKHAEDASNAL-KTFSKL
         IQVW+V+LWK    +  + K++VS++RVTL C +P+P H +DA + L K  SKL
Subjt:  TIQVWDVKLWK----DVKEKKVLVSSARVTLLCNIPVPKHAEDASNAL-KTFSKL

AT5G48950.2 Thioesterase superfamily protein1.2e-2955.26Show/hide
Query:  PSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGR
        P S   ++D  L+ LGF F+ +S+  V+G L ++  CCQPFKVLHGGVSALIAE+LAS+ A +ASG++RVAGI L++ HL+ AALG++VFAE+ PVS+G+
Subjt:  PSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGR

Query:  TIQVWDVKLWKDVK
         IQ  D+K   +VK
Subjt:  TIQVWDVKLWKDVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACAAAAAATTCTCCGCCGTCGTCGAAGCCGGCGGTTCTGGACGCTGCGCTGGAGGCGTTGGGATTCGAGTTCGAACACGTTTCTTCAGTGGGCGTGACCGGCCG
TCTTCTAGTTTCCCCAATTTGCTGCCAGCCCTTCAAAGTGTTGCACGGAGGAGTGTCGGCGTTGATAGCTGAGTCGCTTGCCAGCATGGCGGCTCACGTGGCTTCCGGCT
ACCGGAGGGTCGCCGGAATCCAACTCACTGTCAGCCATTTGAAGAGCGCCGCCCTCGGCGACCTCGTTTTCGCCGAAGCCGCTCCCGTCTCCCTTGGTAGAACCATTCAG
GTATGGGATGTAAAATTATGGAAGGATGTAAAAGAGAAGAAAGTGTTGGTGTCCAGTGCAAGGGTAACTCTTTTATGCAACATCCCTGTTCCAAAACATGCTGAAGATGC
TTCAAATGCCCTCAAAACCTTTTCAAAATTGTAA
mRNA sequenceShow/hide mRNA sequence
TTAGATTTTATGTAGTTGTCAAATACCCACAAATATTAAATCAACAATCCAATCCAAGATTTCACCCTCTCGAATTCTCCGATCAGAAAATCCCCAATTATGTCTACAAA
AAATTCTCCGCCGTCGTCGAAGCCGGCGGTTCTGGACGCTGCGCTGGAGGCGTTGGGATTCGAGTTCGAACACGTTTCTTCAGTGGGCGTGACCGGCCGTCTTCTAGTTT
CCCCAATTTGCTGCCAGCCCTTCAAAGTGTTGCACGGAGGAGTGTCGGCGTTGATAGCTGAGTCGCTTGCCAGCATGGCGGCTCACGTGGCTTCCGGCTACCGGAGGGTC
GCCGGAATCCAACTCACTGTCAGCCATTTGAAGAGCGCCGCCCTCGGCGACCTCGTTTTCGCCGAAGCCGCTCCCGTCTCCCTTGGTAGAACCATTCAGGTATGGGATGT
AAAATTATGGAAGGATGTAAAAGAGAAGAAAGTGTTGGTGTCCAGTGCAAGGGTAACTCTTTTATGCAACATCCCTGTTCCAAAACATGCTGAAGATGCTTCAAATGCCC
TCAAAACCTTTTCAAAATTGTAACTTTTTTTTTTGTTACAACATATTGGAAAAAGGGATATTTGTTCGAACCTATAATCTCATGGTTACTAAGTTGTATAAGATGTCGGT
AGAGCTCTGCTCTCGTTGACTTTCAAAATTGTAACTTGATTGTACTTGTATAGTGTTAATTAGAATAATTGCCATAGGATCAATGGCATTAGATTAGTATAGATAATTAA
TATAGTATTAATATTTATTATTTTGTACTATTGTAATATTGTGTAAGACGTATTAAATTCATTAGGACTTTCTTGGATGATTATACTTTTTAGGTTTA
Protein sequenceShow/hide protein sequence
MSTKNSPPSSKPAVLDAALEALGFEFEHVSSVGVTGRLLVSPICCQPFKVLHGGVSALIAESLASMAAHVASGYRRVAGIQLTVSHLKSAALGDLVFAEAAPVSLGRTIQ
VWDVKLWKDVKEKKVLVSSARVTLLCNIPVPKHAEDASNALKTFSKL