CuGenDBv2

Cmc12g0326071 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc12g0326071
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Transposable element protein
Genome location: CMiso1.1chr12:16442602..16443129
RNA-Seq Expression: Cmc12g0326071
Synteny: Cmc12g0326071
Gene Ontology terms: NA
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
                  IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
ABO36622.1 copia LTR rider [Solanum lycopersicum] | e value: 5.6e-57 | identity: 66.47%
Query:  VKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWY
        VK+KAR+VA+GF Q++GVDYNEIFSPVV+H SIR+LLAIVA +NLELEQLDV  AFLHG LEEEI+M QP GF+V GK+  VCKLK+SLYGLKQSPR WY
Subjt:  VKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWY

Query:  KRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
        KRFD Y+  +G+ RS YD CVYY +L +  +IYL+LYVDDML+A K    ++++K  L  EF+MKDL
Subjt:  KRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

KAA0047818.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 3.7e-69 | identity: 77.59%
Query:  IGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLK
        +G+ N+KVKFK RLVAKGFKQK+G DY EIFSPVV   SIR+LL IV CE+L+LEQ+DVTIAFLHGSLEE+++MEQPKGFE KGK +LVCKLK+SLYGLK
Subjt:  IGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLK

Query:  QSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
        QSPRCW KRFDD+I+ IGF RSLYDPCVYYKKLT G  IYLLLYVDDMLLAGK  TKL EIK QL  EF+MKDL
Subjt:  QSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

KAA0054988.1 hypothetical protein E6C27_scaffold43052G001360 [Cucumis melo var. makuwa] | e value: 3.6e-64 | identity: 77.14%
Query:  MIGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGL
        MI DLNDKVK KARLVAKGFKQK+GVDYNEIFSPVVKH  IRILL IVACENL+LEQLDVT AFLHGSLEEEIFMEQPKGFEVKGKKELVCKL       
Subjt:  MIGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGL

Query:  KQSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
                            NRSLYDPCVYYKKLTEGDYIY+LLYVDDMLLAG+ PTKLKEIKAQL  EFDMKDL
Subjt:  KQSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

MUG03575.1 hypothetical protein [Bacillus tequilensis] | e value: 5.6e-57 | identity: 66.47%
Query:  VKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWY
        VK+KAR+VA+GF Q++GVDYNEIFSPVV+H SIR+LLAIVA +NLELEQLDV  AFLHG LEEEI+M QP GF+V GK+  VCKLK+SLYGLKQSPR WY
Subjt:  VKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWY

Query:  KRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
        KRFD Y+  +G+ RS YD CVYY +L +  +IYL+LYVDDML+A K    ++++K  L  EF+MKDL
Subjt:  KRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

TYK02789.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 3.2e-68 | identity: 75.86%
Query:  IGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLK
        +G+ N+KVKFK RLVAKGFKQK+GV Y + FSPVVKH SIR+LL+IVACE+LELEQ+DVT  FLHGSLEE+++MEQ KGFE KGK +LVCKLK+SLYGLK
Subjt:  IGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLK

Query:  QSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
        QSPRCWYKRFD +I+ IGF RSLYDP VYYKKLT+G  IYLLLYVD+MLLAGK  TKL EIK QL  EF+MKDL
Subjt:  QSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

TrEMBL top hits (e value, % identity, alignment)
A0A445B365 Uncharacterized protein | e value: 3.5e-57 | identity: 67.06%
Query:  NDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPR
        +D  +FKARLVAKGF QK+GVDYNEIFSPVVKH SIR+LL++VA  NLELEQLDV  AFLHG LEEEI+M QP+GF+V+GK+  VC+L++SLYGLKQSPR
Subjt:  NDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPR

Query:  CWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
         WYKRFD ++    F+RS YD CVY +KL  GDYIYLLLYVDDML+A K   ++  +K QL  EF+ KDL
Subjt:  CWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

A0A5A7TXZ9 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.8e-69 | identity: 77.59%
Query:  IGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLK
        +G+ N+KVKFK RLVAKGFKQK+G DY EIFSPVV   SIR+LL IV CE+L+LEQ+DVTIAFLHGSLEE+++MEQPKGFE KGK +LVCKLK+SLYGLK
Subjt:  IGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLK

Query:  QSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
        QSPRCW KRFDD+I+ IGF RSLYDPCVYYKKLT G  IYLLLYVDDMLLAGK  TKL EIK QL  EF+MKDL
Subjt:  QSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

A0A5A7UJ23 Integrase catalytic domain-containing protein | e value: 1.8e-64 | identity: 77.14%
Query:  MIGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGL
        MI DLNDKVK KARLVAKGFKQK+GVDYNEIFSPVVKH  IRILL IVACENL+LEQLDVT AFLHGSLEEEIFMEQPKGFEVKGKKELVCKL       
Subjt:  MIGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGL

Query:  KQSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
                            NRSLYDPCVYYKKLTEGDYIY+LLYVDDMLLAG+ PTKLKEIKAQL  EFDMKDL
Subjt:  KQSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

A0A5D3BXB7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 1.5e-68 | identity: 75.86%
Query:  IGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLK
        +G+ N+KVKFK RLVAKGFKQK+GV Y + FSPVVKH SIR+LL+IVACE+LELEQ+DVT  FLHGSLEE+++MEQ KGFE KGK +LVCKLK+SLYGLK
Subjt:  IGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLK

Query:  QSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
        QSPRCWYKRFD +I+ IGF RSLYDP VYYKKLT+G  IYLLLYVD+MLLAGK  TKL EIK QL  EF+MKDL
Subjt:  QSPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

B1N668 Copia LTR rider | e value: 2.7e-57 | identity: 66.47%
Query:  VKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWY
        VK+KAR+VA+GF Q++GVDYNEIFSPVV+H SIR+LLAIVA +NLELEQLDV  AFLHG LEEEI+M QP GF+V GK+  VCKLK+SLYGLKQSPR WY
Subjt:  VKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWY

Query:  KRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
        KRFD Y+  +G+ RS YD CVYY +L +  +IYL+LYVDDML+A K    ++++K  L  EF+MKDL
Subjt:  KRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 1.1e-31 | identity: 41.62%
Query:  DLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQS
        +L + +++KARLVA+GF QK  +DY E F+PV +  S R +L++V   NL++ Q+DV  AFL+G+L+EEI+M  P+G  +    + VCKL +++YGLKQ+
Subjt:  DLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQS

Query:  PRCWYKRFDDYISLIGFNRSLYDPCVY-YKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
         RCW++ F+  +    F  S  D C+Y   K    + IY+LLYVDD+++A    T++   K  L  +F M DL
Subjt:  PRCWYKRFDDYISLIGFNRSLYDPCVY-YKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

P0C2J7 Transposon Ty4-H Gag-Pol polyprotein | e value: 1.0e-16 | identity: 33.74%
Query:  FKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWYKR
        +KAR+V +G  Q     Y+ I +  + H  I+I L I    N+ ++ LD+  AFL+  LEEEI++  P        +  V KL ++LYGLKQSP+ W   
Subjt:  FKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWYKR

Query:  FDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMK
           Y++ IG   + Y P +Y    TE   + + +YVDD ++A     +L E   +L   F++K
Subjt:  FDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 6.7e-53 | identity: 59.28%
Query:  VKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWY
        V++KARLV KGF+QKKG+D++EIFSPVVK  SIR +L++ A  +LE+EQLDV  AFLHG LEEEI+MEQP+GFEV GKK +VCKL +SLYGLKQ+PR WY
Subjt:  VKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWY

Query:  KRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
         +FD ++    + ++  DPCVY+K+ +E ++I LLLYVDDML+ GK    + ++K  L   FDMKDL
Subjt:  KRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.1e-34 | identity: 44.19%
Query:  GDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQ
        G LN   ++KARLVAKG+ Q+ G+DY E FSPV+K  SIRI+L +    +  + QLDV  AFL G+L ++++M QP GF  K +   VCKL+++LYGLKQ
Subjt:  GDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQ

Query:  SPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKD
        +PR WY    +Y+  IGF  S+ D  ++  +  +   +Y+L+YVDD+L+ G  PT L      L   F +KD
Subjt:  SPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 2.6e-33 | identity: 43.35%
Query:  GDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQ
        G LN   ++KARLVAKG+ Q+ G+DY E FSPV+K  SIRI+L +    +  + QLDV  AFL G+L +E++M QP GF  K + + VC+L++++YGLKQ
Subjt:  GDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQ

Query:  SPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEG-DYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKD
        +PR WY     Y+  +GF  S+ D  ++   L  G   IY+L+YVDD+L+ G     LK     L   F +K+
Subjt:  SPRCWYKRFDDYISLIGFNRSLYDPCVYYKKLTEG-DYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKD

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | e value: 2.5e-31 | identity: 41.52%
Query:  KFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKEL----VCKLKRSLYGLKQSPR
        ++KARLVAKG+ Q++G+D+ E FSPV K  S++++LAI A  N  L QLD++ AFL+G L+EEI+M+ P G+  +    L    VC LK+S+YGLKQ+ R
Subjt:  KFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKEL----VCKLKRSLYGLKQSPR

Query:  CWYKRFDDYISLIGFN-RSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
         W+ +F   ++LIGF     +    Y+ K+T   ++ +L+YVDD+++       + E+K+QL   F ++DL
Subjt:  CWYKRFDDYISLIGFN-RSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | e value: 4.5e-04 | identity: 45.83%
Query:  KFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQ
        + KARLVAKGF Q++G+ + E +SPVV+  +IR +L +   + LE+ Q
Subjt:  KFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQ


Sequences
CDS sequence
ATGATTGGTGATTTGAATGACAAAGTAAAATTTAAGGCAAGACTAGTAGCTAAAGGCTTCAAGCAAAAGAAGGGTGTAGACTATAATGAGATTTTTTCCCCTGTTGTAAA
GCACATTTCCATAAGAATCCTACTGGCTATTGTAGCATGTGAAAATCTTGAGCTGGAACAGCTAGATGTTACTATTGCTTTCCTACATGGAAGCTTAGAGGAAGAAATCT
TCATGGAGCAGCCTAAGGGTTTTGAAGTAAAAGGCAAAAAGGAGCTGGTGTGTAAACTAAAGAGGTCTTTATATGGCCTAAAACAGTCTCCAAGGTGTTGGTACAAACGT
TTTGATGACTATATCAGTCTGATAGGATTTAATAGAAGTCTATATGACCCATGTGTCTACTACAAGAAGCTCACTGAAGGTGATTACATCTATCTTTTACTCTATGTTGA
TGATATGTTACTAGCAGGGAAATATCCAACCAAGTTAAAAGAAATCAAAGCTCAACTCGATGTTGAATTTGATATGAAGGACCTATGA
mRNA sequence
ATGATTGGTGATTTGAATGACAAAGTAAAATTTAAGGCAAGACTAGTAGCTAAAGGCTTCAAGCAAAAGAAGGGTGTAGACTATAATGAGATTTTTTCCCCTGTTGTAAA
GCACATTTCCATAAGAATCCTACTGGCTATTGTAGCATGTGAAAATCTTGAGCTGGAACAGCTAGATGTTACTATTGCTTTCCTACATGGAAGCTTAGAGGAAGAAATCT
TCATGGAGCAGCCTAAGGGTTTTGAAGTAAAAGGCAAAAAGGAGCTGGTGTGTAAACTAAAGAGGTCTTTATATGGCCTAAAACAGTCTCCAAGGTGTTGGTACAAACGT
TTTGATGACTATATCAGTCTGATAGGATTTAATAGAAGTCTATATGACCCATGTGTCTACTACAAGAAGCTCACTGAAGGTGATTACATCTATCTTTTACTCTATGTTGA
TGATATGTTACTAGCAGGGAAATATCCAACCAAGTTAAAAGAAATCAAAGCTCAACTCGATGTTGAATTTGATATGAAGGACCTATGA
Protein sequence
MIGDLNDKVKFKARLVAKGFKQKKGVDYNEIFSPVVKHISIRILLAIVACENLELEQLDVTIAFLHGSLEEEIFMEQPKGFEVKGKKELVCKLKRSLYGLKQSPRCWYKR
FDDYISLIGFNRSLYDPCVYYKKLTEGDYIYLLLYVDDMLLAGKYPTKLKEIKAQLDVEFDMKDL
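
Consistency check (illustrative sketch)
The record above can be sanity-checked in two ways: the genome location span should equal the CDS length, and the CDS should translate to the listed protein. The Python sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1) and inclusive 1-based coordinates. The helper names `span_length` and `translate` are hypothetical and are not part of CuGenDB. Only the first 30 and last 12 nucleotides of the CDS are used here; the full sequence can be passed in the same way.

```python
# Illustrative check only; helper names are hypothetical, not CuGenDB tooling.
# Assumes the standard genetic code (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Genome location CMiso1.1chr12:16442602..16443129
print(span_length(16442602, 16443129))               # 528
# First 30 nt and last 12 nt of the CDS listed above
print(translate("ATGATTGGTGATTTGAATGACAAAGTAAAA"))   # MIGDLNDKVK
print(translate("AAGGACCTATGA"))                     # KDL
```

A span of 528 nt corresponds to 176 codons, that is, 175 amino acids plus the stop codon, which is consistent with a single-exon gene whose CDS and mRNA sequences are identical.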