; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0228441 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0228441
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr08:22695653..22696147
RNA-Seq ExpressionCmc08g0228441
SyntenyCmc08g0228441
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031683.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.9e-7184.15Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        MKSEFEMSLVGELSCFLGLQIKQRSEG+FISQEKYAKN++KKFGL QSQYK+TP ATH KITKD+IG AVDHKLY+SMI SLLYL  S  DI YAVGIC 
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        RYQSDPR S+LNAVKRIIKYVHGTTDF ILYSY+TSS+LVGYCDA W GSADDRK+T GGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

KAA0034930.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]3.2e-86100Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

KAA0054435.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.8e-7385.98Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        MKSEFEMSLVGELSCFLGLQIKQRSEG+FISQEKYAKN++KKFGLDQSQ+KRTP  THAKITKD +G  VDHKLY+SMI SLLYLTTSR DIAY VGICA
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        RYQSD R S+LNA+KRIIKYVHGTTDFGILYSY+TSSELVGY +AD AGSADDRK TSGGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

KAA0054623.1 putative mitochondrial protein [Cucumis melo var. makuwa]4.7e-7486.59Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        M SEFEMSLVGELSCFLGLQIKQRSEG+F+SQEKYAKN++KKFGLDQSQYKRTP ATHAKITKD++G AVDHKLY+SMI S LYL  SR DIAYAV ICA
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        RYQSDP  S+LNAVKRIIKYVHGTTDFGILYSY+T S+LVGYCDADWAGSADDRKSTSGGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

TYK05471.1 Copia protein [Cucumis melo var. makuwa]2.1e-8296.34Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNI+KKFGLDQSQYKRT TATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRS+IAYAVGICA
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        +YQSDPRIS+LNAVKRIIKYVHGTTDF ILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

TrEMBL top hitse value%identityAlignment
A0A5A7SVD7 Putative gag-pol polyprotein1.5e-86100Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

A0A5A7UF87 Putative mitochondrial protein2.3e-7486.59Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        M SEFEMSLVGELSCFLGLQIKQRSEG+F+SQEKYAKN++KKFGLDQSQYKRTP ATHAKITKD++G AVDHKLY+SMI S LYL  SR DIAYAV ICA
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        RYQSDP  S+LNAVKRIIKYVHGTTDFGILYSY+T S+LVGYCDADWAGSADDRKSTSGGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

A0A5D3BY43 Gag-pol polyprotein1.4e-7184.15Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        MKSEFEMSLVGELSCFLGLQIKQRSEG+FISQEKYAKN++KKFGL QSQYK+TP ATH KITKD+IG AVDHKLY+SMI SLLYL  S  DI YAVGIC 
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        RYQSDPR S+LNAVKRIIKYVHGTTDF ILYSY+TSS+LVGYCDA W GSADDRK+T GGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

A0A5D3C0M9 Copia protein1.0e-8296.34Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNI+KKFGLDQSQYKRT TATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRS+IAYAVGICA
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        +YQSDPRIS+LNAVKRIIKYVHGTTDF ILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

A0A5D3CS19 Gag-pol polyprotein3.3e-7385.98Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        MKSEFEMSLVGELSCFLGLQIKQRSEG+FISQEKYAKN++KKFGLDQSQ+KRTP  THAKITKD +G  VDHKLY+SMI SLLYLTTSR DIAY VGICA
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        RYQSD R S+LNA+KRIIKYVHGTTDFGILYSY+TSSELVGY +AD AGSADDRK TSGGCFFL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.3e-1833.13Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVD-HKLYKSMIVSLLY-LTTSRSDIAYAVGI
        +  +F M+ + E+  F+G++I+ + + I++SQ  Y K IL KF ++      TP    +KI  + +    D +   +S+I  L+Y +  +R D+  AV I
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVD-HKLYKSMIVSLLY-LTTSRSDIAYAVGI

Query:  CARYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSE--LVGYCDADWAGSADDRKSTSGGCF
         +RY S         +KR+++Y+ GT D  +++  N + E  ++GY D+DWAGS  DRKST+G  F
Subjt:  CARYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSE--LVGYCDADWAGSADDRKSTSGGCF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-1833.92Show/hide
Query:  MKSEFEMSLVGELSCFLGLQI--KQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHK------LYKSMIVSLLY-LTTSRSD
        +   F+M  +G     LG++I  ++ S  +++SQEKY + +L++F +  ++   TP A H K++K      V+ K       Y S + SL+Y +  +R D
Subjt:  MKSEFEMSLVGELSCFLGLQI--KQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHK------LYKSMIVSLLY-LTTSRSD

Query:  IAYAVGICARYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCF
        IA+AVG+ +R+  +P   +  AVK I++Y+ GTT   + +   +   L GY DAD AG  D+RKS++G  F
Subjt:  IAYAVGICARYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCF

P92519 Uncharacterized mitochondrial protein AtMg008101.1e-2235.37Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        + S F M  +G +  FLG+QIK    G+F+SQ KYA+ IL   G+   +   TP       +  S     D   ++S++ +L YLT +R DI+YAV I  
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        +   +P ++  + +KR+++YV GT   G+    N+   +  +CD+DWAG    R+ST+G C FL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-1834.64Show/hide
Query:  ELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICARYQSDPRISYL
        EL  FLG++ K+   G+ +SQ +Y  ++L +  +  ++   TP A   K++  S     D   Y+ ++ SL YL  +R DI+YAV   +++   P   +L
Subjt:  ELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICARYQSDPRISYL

Query:  NAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
         A+KRI++Y+ GT + GI      +  L  Y DADWAG  DD  ST+G   +L
Subjt:  NAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.4e-2033.54Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        +   F +    +L  FLG++ K+  +G+ +SQ +Y  ++L +  +  ++   TP AT  K+T  S     D   Y+ ++ SL YL  +R D++YAV   +
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        +Y   P   + NA+KR+++Y+ GT D GI      +  L  Y DADWAG  DD  ST+G   +L
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.4e-2235.37Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        +KS F++  +G L  FLGL+I + + GI I Q KYA ++L + GL   +    P       +  S G  VD K Y+ +I  L+YL  +R DI++AV   +
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        ++   PR+++  AV +I+ Y+ GT   G+ YS     +L  + DA +    D R+ST+G C FL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.0e-1035.44Show/hide
Query:  LYLTTSRSDIAYAVGICARYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGC
        +YLT +R D+ +AV   +++ S  R + + AV +++ YV GT   G+ YS  +  +L  + D+DWA   D R+S +G C
Subjt:  LYLTTSRSDIAYAVGICARYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGC

ATMG00810.1 DNA/RNA polymerases superfamily protein8.2e-2435.37Show/hide
Query:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA
        + S F M  +G +  FLG+QIK    G+F+SQ KYA+ IL   G+   +   TP       +  S     D   ++S++ +L YLT +R DI+YAV I  
Subjt:  MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICA

Query:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL
        +   +P ++  + +KR+++YV GT   G+    N+   +  +CD+DWAG    R+ST+G C FL
Subjt:  RYQSDPRISYLNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCAGAATTCGAAATGAGCTTAGTAGGTGAATTGTCTTGCTTTTTGGGGTTGCAGATCAAACAGAGAAGTGAAGGTATATTTATATCGCAAGAAAAGTATGCCAA
GAACATACTCAAGAAGTTTGGTCTGGATCAGTCACAATACAAAAGGACTCCAACTGCGACACATGCTAAAATTACCAAGGATAGTATTGGTATTGCAGTAGATCATAAAT
TGTACAAGAGCATGATTGTGAGCCTCTTATATTTAACGACAAGCAGATCGGATATTGCCTATGCTGTTGGAATATGTGCTCGATATCAGTCAGATCCTCGTATCTCTTAC
TTGAATGCAGTTAAACGAATAATCAAATATGTTCACGGAACAACCGATTTCGGAATTCTGTATTCCTACAATACATCTTCTGAACTGGTGGGATATTGTGATGCTGACTG
GGCTGGTTCTGCTGATGATAGGAAAAGCACCTCTGGTGGATGTTTCTTTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATCAGAATTCGAAATGAGCTTAGTAGGTGAATTGTCTTGCTTTTTGGGGTTGCAGATCAAACAGAGAAGTGAAGGTATATTTATATCGCAAGAAAAGTATGCCAA
GAACATACTCAAGAAGTTTGGTCTGGATCAGTCACAATACAAAAGGACTCCAACTGCGACACATGCTAAAATTACCAAGGATAGTATTGGTATTGCAGTAGATCATAAAT
TGTACAAGAGCATGATTGTGAGCCTCTTATATTTAACGACAAGCAGATCGGATATTGCCTATGCTGTTGGAATATGTGCTCGATATCAGTCAGATCCTCGTATCTCTTAC
TTGAATGCAGTTAAACGAATAATCAAATATGTTCACGGAACAACCGATTTCGGAATTCTGTATTCCTACAATACATCTTCTGAACTGGTGGGATATTGTGATGCTGACTG
GGCTGGTTCTGCTGATGATAGGAAAAGCACCTCTGGTGGATGTTTCTTTCTTTGA
Protein sequenceShow/hide protein sequence
MKSEFEMSLVGELSCFLGLQIKQRSEGIFISQEKYAKNILKKFGLDQSQYKRTPTATHAKITKDSIGIAVDHKLYKSMIVSLLYLTTSRSDIAYAVGICARYQSDPRISY
LNAVKRIIKYVHGTTDFGILYSYNTSSELVGYCDADWAGSADDRKSTSGGCFFL