; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc05g0132971 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc05g0132971
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr05:13342275..13342769
RNA-Seq ExpressionCmc05g0132971
SyntenyCmc05g0132971
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0050832 - defense response to fungus (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055915.1 copia protein [Cucumis melo var. makuwa]5.4e-7088.54Show/hide
Query:  TTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSL
        TTPMNVNEKLQQNDGAEMA+AQRFRS+VGGLIYLTHT PDISYSI VIS FM+ PSRDHFG  KRVMRYIAGTIEYG+WYSKVSD KLC F DSDWASSL
Subjt:  TTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSL

Query:  DDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        DDRRSVSANVFTLG GVITWSSKKQA VALSSSEAEYAA TSA CQ IWLRRMLTEL
Subjt:  DDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

KAA0056051.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]3.9e-7690.24Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        MINCKP T P+NVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHT PDISYSIGVISRFM+RPSRDHF  AKRVMRYIAGTIEYGIW SKVS+FKLCGFTD
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        SDWASSL DRRSVSANVFTLGLGVITWS KKQA  ALSSSEAEYAA TSA CQ IWLRRMLTEL
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

KAA0059765.1 integrase [Cucumis melo var. makuwa]3.3e-7587.8Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        MINCKP  TPMNVNEKLQQNDGAEMANAQ FRSLVGG IYLTHT P+ISYSIGVI  FM+RPS+DHFG AKRVMRYIAGTI+YGIWYSKVSDFKLCGFTD
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        +DWASSLDDRRSVSANVFTLG GVI WSSKKQA VALSSSEAEY A TSATCQ IWLRRMLT+L
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

TYK27736.1 putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]1.1e-7591.72Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        MINCKPTT PMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHT PDISYSIGVI +FM+RPSRDHFG AKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWL
        SDWASSLDDRRSVSANVFTLGLGVITWSSKKQ   AL+SSEAEYAA TSA CQ IWL
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWL

TYK28116.1 Zinc finger, CCHC-type [Cucumis melo var. makuwa]4.9e-7189.81Show/hide
Query:  TTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSL
        TTPMNVNEKLQQNDGAEMA+AQRFRS VGGLIYLTHT PDISYSI VIS FM+ PSRDHFG  KRVMRYIAGTIEYGIWYSKVSD KLCGF DSDWASSL
Subjt:  TTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSL

Query:  DDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        DDRRSVSANVFTLG GVITWSSKKQA VALSSSEAEYAA TSA CQ IWLRRMLTEL
Subjt:  DDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

TrEMBL top hitse value%identityAlignment
A0A5A7UQM0 Copia protein2.6e-7088.54Show/hide
Query:  TTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSL
        TTPMNVNEKLQQNDGAEMA+AQRFRS+VGGLIYLTHT PDISYSI VIS FM+ PSRDHFG  KRVMRYIAGTIEYG+WYSKVSD KLC F DSDWASSL
Subjt:  TTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSL

Query:  DDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        DDRRSVSANVFTLG GVITWSSKKQA VALSSSEAEYAA TSA CQ IWLRRMLTEL
Subjt:  DDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

A0A5A7URA0 Putative gag-pol polyprotein, identical1.9e-7690.24Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        MINCKP T P+NVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHT PDISYSIGVISRFM+RPSRDHF  AKRVMRYIAGTIEYGIW SKVS+FKLCGFTD
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        SDWASSL DRRSVSANVFTLGLGVITWS KKQA  ALSSSEAEYAA TSA CQ IWLRRMLTEL
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

A0A5A7V1M6 Integrase1.6e-7587.8Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        MINCKP  TPMNVNEKLQQNDGAEMANAQ FRSLVGG IYLTHT P+ISYSIGVI  FM+RPS+DHFG AKRVMRYIAGTI+YGIWYSKVSDFKLCGFTD
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        +DWASSLDDRRSVSANVFTLG GVI WSSKKQA VALSSSEAEY A TSATCQ IWLRRMLT+L
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

A0A5D3DVG4 Putative gag-pol polyprotein, identical5.5e-7691.72Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        MINCKPTT PMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHT PDISYSIGVI +FM+RPSRDHFG AKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWL
        SDWASSLDDRRSVSANVFTLGLGVITWSSKKQ   AL+SSEAEYAA TSA CQ IWL
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWL

A0A5D3DWI5 Zinc finger, CCHC-type2.4e-7189.81Show/hide
Query:  TTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSL
        TTPMNVNEKLQQNDGAEMA+AQRFRS VGGLIYLTHT PDISYSI VIS FM+ PSRDHFG  KRVMRYIAGTIEYGIWYSKVSD KLCGF DSDWASSL
Subjt:  TTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSL

Query:  DDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        DDRRSVSANVFTLG GVITWSSKKQA VALSSSEAEYAA TSA CQ IWLRRMLTEL
Subjt:  DDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

SwissProt top hitse value%identityAlignment
P0CV72 Secreted RxLR effector protein 1611.4e-2342.19Show/hide
Query:  FRSLVGGLIYL-THTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSLDDRRSVSANVFTLGLGVITWSS
        + S VG ++YL   T PD++ ++GV+S+F   P   H+   KRV+RY+  T  YG+ +++    KL G++D+DWA  ++ RRS S  +F L  G ++W S
Subjt:  FRSLVGGLIYL-THTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSLDDRRSVSANVFTLGLGVITWSS

Query:  KKQAIVALSSSEAEYAATTSATCQTIWL
        KKQ  VALSS+E EY A + AT + +WL
Subjt:  KKQAIVALSSSEAEYAATTSATCQTIWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-2438.6Show/hide
Query:  MINCKPTTTP----MNVNEKLQQNDGAEMANAQR--FRSLVGGLIY-LTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDF
        M N KP +TP    + +++K+      E  N  +  + S VG L+Y +  T PDI++++GV+SRF+  P ++H+   K ++RY+ GT    + +   SD 
Subjt:  MINCKPTTTP----MNVNEKLQQNDGAEMANAQR--FRSLVGGLIY-LTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDF

Query:  KLCGFTDSDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
         L G+TD+D A  +D+R+S +  +FT   G I+W SK Q  VALS++EAEY A T    + IWL+R L EL
Subjt:  KLCGFTDSDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

P92519 Uncharacterized mitochondrial protein AtMg008108.5e-2639.49Show/hide
Query:  MINCKPTTTPMNVNEKLQQN-DGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFT
        M++CKP +TP+ +  KL  +   A+  +   FRS+VG L YLT T PDISY++ ++ + M  P+   F + KRV+RY+ GTI +G++  K S   +  F 
Subjt:  MINCKPTTTPMNVNEKLQQN-DGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFT

Query:  DSDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIW
        DSDWA     RRS +     LG  +I+WS+K+Q  V+ SS+E EY A      +  W
Subjt:  DSDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-3040.85Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        MI  KP TTPM  + KL    G ++ +   +R +VG L YL  T PDISY++  +S+FM  P+ +H    KR++RY+AGT  +GI+  K +   L  ++D
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        +DWA   DD  S +  +  LG   I+WSSKKQ  V  SS+EAEY +  + + +  W+  +LTEL
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-3039.63Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        M+  KP  TPM  + KL  + G ++ +   +R +VG L YL  T PD+SY++  +S++M  P+ DH+   KRV+RY+AGT ++GI+  K +   L  ++D
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        +DWA   DD  S +  +  LG   I+WSSKKQ  V  SS+EAEY +  + + +  W+  +LTEL
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-2735.37Show/hide
Query:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD
        ++ CKP++ PM+ +     + G +  +A+ +R L+G L+YL  T  DIS+++  +S+F   P   H     +++ YI GT+  G++YS  ++ +L  F+D
Subjt:  MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTD

Query:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL
        + + S  D RRS +     LG  +I+W SKKQ +V+ SS+EAEY A + AT + +WL +   EL
Subjt:  SDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.6e-1139.47Show/hide
Query:  IYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSLDDRRSVS
        +YLT T PD+++++  +S+F             +V+ Y+ GT+  G++YS  SD +L  F DSDWAS  D RRSV+
Subjt:  IYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSLDDRRSVS

ATMG00810.1 DNA/RNA polymerases superfamily protein6.0e-2739.49Show/hide
Query:  MINCKPTTTPMNVNEKLQQN-DGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFT
        M++CKP +TP+ +  KL  +   A+  +   FRS+VG L YLT T PDISY++ ++ + M  P+   F + KRV+RY+ GTI +G++  K S   +  F 
Subjt:  MINCKPTTTPMNVNEKLQQN-DGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFT

Query:  DSDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIW
        DSDWA     RRS +     LG  +I+WS+K+Q  V+ SS+E EY A      +  W
Subjt:  DSDWASSLDDRRSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAATTGCAAGCCAACAACTACACCAATGAATGTGAATGAGAAGCTGCAACAAAATGATGGTGCAGAGATGGCCAATGCGCAGCGGTTTAGAAGCCTTGTTGGAGG
CTTGATTTATCTAACTCATACCCATCCCGATATTTCGTATTCTATTGGTGTGATTTCTAGATTTATGCGACGTCCTTCAAGGGATCATTTCGGAATAGCAAAGCGAGTTA
TGCGATACATTGCTGGAACTATAGAATATGGTATTTGGTACTCTAAAGTTTCTGATTTCAAATTATGCGGGTTCACAGACAGTGATTGGGCGAGCTCTTTAGATGATAGG
CGAAGTGTTTCAGCGAATGTTTTCACTTTAGGGTTAGGAGTTATTACTTGGAGCTCGAAGAAACAAGCAATTGTTGCCTTATCATCTTCAGAAGCAGAATATGCTGCAAC
AACTTCAGCAACATGTCAGACAATTTGGTTGCGAAGAATGCTAACAGAACTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCAATTGCAAGCCAACAACTACACCAATGAATGTGAATGAGAAGCTGCAACAAAATGATGGTGCAGAGATGGCCAATGCGCAGCGGTTTAGAAGCCTTGTTGGAGG
CTTGATTTATCTAACTCATACCCATCCCGATATTTCGTATTCTATTGGTGTGATTTCTAGATTTATGCGACGTCCTTCAAGGGATCATTTCGGAATAGCAAAGCGAGTTA
TGCGATACATTGCTGGAACTATAGAATATGGTATTTGGTACTCTAAAGTTTCTGATTTCAAATTATGCGGGTTCACAGACAGTGATTGGGCGAGCTCTTTAGATGATAGG
CGAAGTGTTTCAGCGAATGTTTTCACTTTAGGGTTAGGAGTTATTACTTGGAGCTCGAAGAAACAAGCAATTGTTGCCTTATCATCTTCAGAAGCAGAATATGCTGCAAC
AACTTCAGCAACATGTCAGACAATTTGGTTGCGAAGAATGCTAACAGAACTCTAA
Protein sequenceShow/hide protein sequence
MINCKPTTTPMNVNEKLQQNDGAEMANAQRFRSLVGGLIYLTHTHPDISYSIGVISRFMRRPSRDHFGIAKRVMRYIAGTIEYGIWYSKVSDFKLCGFTDSDWASSLDDR
RSVSANVFTLGLGVITWSSKKQAIVALSSSEAEYAATTSATCQTIWLRRMLTEL