; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0193751 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0193751
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCMiso1.1chr07:15725381..15725920
RNA-Seq ExpressionCmc07g0193751
SyntenyCmc07g0193751
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0010158 - abaxial cell fate specification (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055376.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.4e-7982.12Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPS +PYSSPVLLVKK+ G WRFCV YRKLNQ TIS+KF IP+IEELLDEL+ A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEFLVM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFFYDILVYS DITEHEKHLGMVFAVLR+N+L+AN KKCV AHSKIQYLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

KAA0056659.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.0e-7879.33Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPSH+P+SSPVLLVKK++G WRFCV YRKLN++TI++KF IP+IEELLDELH A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEFLVM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFF DILVYS+DITEHEKHLGMVFA LR+N+L+AN+KKCV AHS+I YLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

KAA0059481.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.5e-7878.77Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPSH+P+SSPVLLVKK++G WRFCV YRKLN++TI++KF IP+IEELLDELH A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEF+VM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFF DILVYS+DITEHEKHLGMVFA LR+N+L+AN+KKCV AHS+I YLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]1.7e-8082.68Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQA ++RPSH+PYSSPVLLVKK++G WRFCV YRKLNQVTIS+KF IP+IEELLDELH A VF KLD+KS YHQIRM+E+D+EKTAF+THEG+YEFLVM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFF DILVYS DI+EHEKHLGMVFAVLR+N LFANKKKCVIAHSKIQYLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

TYJ99303.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.4e-7982.12Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPS +PYSSPVLLVKK+ G WRFCV YRKLNQ TIS+KF IP+IEELLDEL+ A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEFLVM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFFYDILVYS DITEHEKHLGMVFAVLR+N+L+AN KKCV AHSKIQYLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

TrEMBL top hitse value%identityAlignment
A0A5A7UM77 Ty3/gypsy retrotransposon protein1.2e-7982.12Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPS +PYSSPVLLVKK+ G WRFCV YRKLNQ TIS+KF IP+IEELLDEL+ A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEFLVM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFFYDILVYS DITEHEKHLGMVFAVLR+N+L+AN KKCV AHSKIQYLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

A0A5A7UT21 Ty3/gypsy retrotransposon protein9.8e-7979.33Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPSH+P+SSPVLLVKK++G WRFCV YRKLN++TI++KF IP+IEELLDELH A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEFLVM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFF DILVYS+DITEHEKHLGMVFA LR+N+L+AN+KKCV AHS+I YLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

A0A5A7V194 Ty3/gypsy retrotransposon protein1.7e-7878.77Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPSH+P+SSPVLLVKK++G WRFCV YRKLN++TI++KF IP+IEELLDELH A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEF+VM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFF DILVYS+DITEHEKHLGMVFA LR+N+L+AN+KKCV AHS+I YLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

A0A5D3BJ50 Ty3/gypsy retrotransposon protein1.2e-7982.12Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPS +PYSSPVLLVKK+ G WRFCV YRKLNQ TIS+KF IP+IEELLDEL+ A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEFLVM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNAP TFQS  NQVFKPFLRRC LVFFYDILVYS DITEHEKHLGMVFAVLR+N+L+AN KKCV AHSKIQYLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

A0A5D3CW02 Uncharacterized protein2.2e-7879.89Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        MLQ  ++RPSH+PYSSPVLLVK+++G WRFCV YRKLNQ T+S+KF IP+IEELLDELH A VF KLDLKSGYHQIRMKE+DIEKTAF+THEG+YEFLVM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF LTNA  TFQS  NQVFKPFLRRC LVF YDILVYS DI+EHEKHLGMVFA+LR+N+L+AN+KKCV AHSKIQYLGH
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.9e-3539.13Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEG-----EWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYY
        ML   ++R S++PY+SP+ +V K++      ++R  + YRKLN++T+ ++  IP ++E+L +L R   F  +DL  G+HQI M  + + KTAF T  G+Y
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEG-----EWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYY

Query:  EFLVMPFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        E+L MPF L NAP TFQ   N + +P L +  LV+  DI+V+S  + EH + LG+VF  L +  L     KC     +  +LGH
Subjt:  EFLVMPFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

P20825 Retrovirus-related Pol polyprotein from transposon 2971.7e-3539.13Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKE-----EGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYY
        ML   ++R S++PY+SP  +V K+       ++R  + YRKLN++TI +++ IP ++E+L +L + + F  +DL  G+HQI M E+ I KTAF T  G+Y
Subjt:  MLQARMVRPSHNPYSSPVLLVKKE-----EGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYY

Query:  EFLVMPFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        E+L MPF L NAP TFQ   N + +P L +  LV+  DI+++S  +TEH   + +VF  L +  L     KC     +  +LGH
Subjt:  EFLVMPFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.0e-3745.25Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        +L  + + PS +P SSPV+LV K++G +R CV YR LN+ TIS+ F +P I+ LL  +  A++F  LDL SGYHQI M+ KD  KTAF T  G YE+ VM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF L NAP+TF       F+    R   V+  DIL++S    EH KHL  V   L+   L   KKKC  A  + ++LG+
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus6.0e-3340.76Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKE-----EGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYY
        +LQ  ++RPS++PY+SP+ +V K+     E ++R  V +++LN VTI + + IP I   L  L  AK F  LDL SG+HQI MKE DI KTAF T  G Y
Subjt:  MLQARMVRPSHNPYSSPVLLVKKE-----EGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYY

Query:  EFLVMPFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        EFL +PF L NAP  FQ   + + +  + +   V+  DI+V+S D   H K+L +V A L +  L  N +K     +++++LG+
Subjt:  EFLVMPFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

Q99315 Transposon Ty3-G Gag-Pol polyprotein9.0e-3745.25Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM
        +L  + + PS +P SSPV+LV K++G +R CV YR LN+ TIS+ F +P I+ LL  +  A++F  LDL SGYHQI M+ KD  KTAF T  G YE+ VM
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVM

Query:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH
        PF L NAP+TF       F+    R   V+  DIL++S    EH KHL  V   L+   L   KKKC  A  + ++LG+
Subjt:  PFSLTNAPTTFQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein3.2e-0553.85Show/hide
Query:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQ
        ML+AR+++PS +PYSSPVLLV+K++G W    G   L Q
Subjt:  MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCAAGCAAGAATGGTTAGACCTAGCCACAACCCCTATTCCAGTCCAGTTTTACTAGTGAAGAAAGAGGAGGGAGAATGGAGGTTCTGTGTGGGTTACAGGAAACT
CAATCAAGTAACCATTTCTAATAAATTTTTCATTCCTATAATTGAAGAATTATTGGATGAGCTACATAGAGCCAAGGTTTTTCCGAAGCTGGACTTAAAATCTGGGTACC
ACCAAATTAGAATGAAGGAAAAAGACATAGAGAAGACAGCTTTCAAGACTCATGAAGGCTATTATGAGTTCCTTGTCATGCCTTTTAGCCTCACTAATGCACCAACTACC
TTCCAATCCCCAAAGAACCAGGTATTTAAACCTTTCTTAAGGCGCTGTGCATTAGTCTTTTTTTATGATATACTTGTTTATAGTGCTGATATTACCGAGCATGAGAAACA
TTTGGGAATGGTGTTTGCAGTTTTGAGAGAGAACAGGTTGTTTGCCAACAAAAAGAAGTGTGTGATAGCACACTCGAAGATTCAATATTTGGGGCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCCAAGCAAGAATGGTTAGACCTAGCCACAACCCCTATTCCAGTCCAGTTTTACTAGTGAAGAAAGAGGAGGGAGAATGGAGGTTCTGTGTGGGTTACAGGAAACT
CAATCAAGTAACCATTTCTAATAAATTTTTCATTCCTATAATTGAAGAATTATTGGATGAGCTACATAGAGCCAAGGTTTTTCCGAAGCTGGACTTAAAATCTGGGTACC
ACCAAATTAGAATGAAGGAAAAAGACATAGAGAAGACAGCTTTCAAGACTCATGAAGGCTATTATGAGTTCCTTGTCATGCCTTTTAGCCTCACTAATGCACCAACTACC
TTCCAATCCCCAAAGAACCAGGTATTTAAACCTTTCTTAAGGCGCTGTGCATTAGTCTTTTTTTATGATATACTTGTTTATAGTGCTGATATTACCGAGCATGAGAAACA
TTTGGGAATGGTGTTTGCAGTTTTGAGAGAGAACAGGTTGTTTGCCAACAAAAAGAAGTGTGTGATAGCACACTCGAAGATTCAATATTTGGGGCATTAA
Protein sequenceShow/hide protein sequence
MLQARMVRPSHNPYSSPVLLVKKEEGEWRFCVGYRKLNQVTISNKFFIPIIEELLDELHRAKVFPKLDLKSGYHQIRMKEKDIEKTAFKTHEGYYEFLVMPFSLTNAPTT
FQSPKNQVFKPFLRRCALVFFYDILVYSADITEHEKHLGMVFAVLRENRLFANKKKCVIAHSKIQYLGH