; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0044971 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0044971
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationCMiso1.1chr02:8330058..8330639
RNA-Seq ExpressionCmc02g0044971
SyntenyCmc02g0044971
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004601 - peroxidase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061073.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-8784.74Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        M++FLL QY DIF TPKGLPPKR+IDHRILTL +Q+PI VRPYKYGH+QK E EKLVA+MLQ  VIRPS SPYSSP+LLVKKKDGGWRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF
        I DKFPIPVI+ELLDEL+GA VFSKLDLKS  HQIRM+EED+EKTAFRTHE HYEFL+MPFGLTNAPATFQSLMNQVFKPFLRR VLVFF
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF

TYK07870.1 retrotransposon protein, putative, unclassified [Cucumis melo var. makuwa]1.5e-9599.43Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDG WRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQ
        IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQ
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQ

TYK14439.1 uncharacterized protein E5676_scaffold186G00980 [Cucumis melo var. makuwa]1.4e-8885.71Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        M++FLL QYADIF TPKGLPPKREIDHRILT+ +QRPI VRPYKYGHVQKEE E LVA+MLQ  +IRPSHSPYSSP+LLVK+KDGGWRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF
        + DKFPIPVI+ELLDELHGA VFSKLDLKS  HQIRM+EEDIEKTAFRTHE HYEFL+MPFGLTNA ATFQSLMNQVFKPFLRR VLVF
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF

XP_016901651.1 PREDICTED: uncharacterized protein LOC103495179 [Cucumis melo]6.1e-8987.89Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        M++FLL QYADIFETPKGLPPKREID+RILTL EQRPI  RPYKYGHVQKEE EKLVA+MLQT VIRPS SPYSSP+LLVKKKDGGWRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF
        I DKFPIPVI+ELLDELHGA+VFSKLDLKS  HQIRM+EEDIEKTAFRTHE HYEFLIM FGLTNAP TFQSLMNQVFKP LRR VLVFF
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF

XP_031744430.1 uncharacterized protein LOC116405043 [Cucumis sativus]2.0e-8785.79Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        M+K LL QYADIFE PK LPPKREIDHRIL L  QRPI VRPYKYG+VQKEE EKLV +MLQ  VIRPSHSPYSSP+LLVKKKDGGWRFCVDYRKLNQ T
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF
        I DKFPIPVI+ELLDELHGATVFSKLDLKS  HQIRM +ED+EKTAFRTHE HYEFL+MPFGLTNAPATFQSLMNQVFKPFLRR VLVFF
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF

TrEMBL top hitse value%identityAlignment
A0A1S4E096 uncharacterized protein LOC1034951793.0e-8987.89Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        M++FLL QYADIFETPKGLPPKREID+RILTL EQRPI  RPYKYGHVQKEE EKLVA+MLQT VIRPS SPYSSP+LLVKKKDGGWRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF
        I DKFPIPVI+ELLDELHGA+VFSKLDLKS  HQIRM+EEDIEKTAFRTHE HYEFLIM FGLTNAP TFQSLMNQVFKP LRR VLVFF
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF

A0A5A7UM77 Ty3/gypsy retrotransposon protein1.2e-8784.74Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        M++FLL QY DIF TPKGLPPKR+IDHRILTL +Q+PI VRPYKYGH+QK E EKLVA+MLQ  VIRPS SPYSSP+LLVKKK+GGWRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF
        I DKFPIPVI+ELLDEL+GA VFSKLDLKS  HQIRM+EEDIEKTAFRTHE HYEFL+MPFGLTNAPATFQSLMNQVFKPFLRR VLVFF
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF

A0A5A7UYM1 Ty3/gypsy retrotransposon protein5.6e-8884.74Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        M++FLL QY DIF TPKGLPPKR+IDHRILTL +Q+PI VRPYKYGH+QK E EKLVA+MLQ  VIRPS SPYSSP+LLVKKKDGGWRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF
        I DKFPIPVI+ELLDEL+GA VFSKLDLKS  HQIRM+EED+EKTAFRTHE HYEFL+MPFGLTNAPATFQSLMNQVFKPFLRR VLVFF
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFF

A0A5D3C962 Retrotransposon protein, putative, unclassified7.3e-9699.43Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDG WRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQ
        IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQ
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQ

A0A5D3CW02 Uncharacterized protein6.6e-8985.71Show/hide
Query:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT
        M++FLL QYADIF TPKGLPPKREIDHRILT+ +QRPI VRPYKYGHVQKEE E LVA+MLQ  +IRPSHSPYSSP+LLVK+KDGGWRFCVDYRKLNQAT
Subjt:  MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQAT

Query:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF
        + DKFPIPVI+ELLDELHGA VFSKLDLKS  HQIRM+EEDIEKTAFRTHE HYEFL+MPFGLTNA ATFQSLMNQVFKPFLRR VLVF
Subjt:  IFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.67.2e-3241.36Show/hide
Query:  LLAQYADI-FETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGG-----WRFCVDYRKLNQ
        LL +Y DI +     L    +  H I T     P+Y + Y Y    ++E E  +  ML   +IR S+SPY+SPI +V KK        +R  +DYRKLN+
Subjt:  LLAQYADI-FETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGG-----WRFCVDYRKLNQ

Query:  ATIFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF
         T+ D+ PIP + E+L +L     F+ +DL    HQI M  E + KTAF T   HYE+L MPFGL NAPATFQ  MN + +P L +  LV+
Subjt:  ATIFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF

P20825 Retrovirus-related Pol polyprotein from transposon 2971.4e-3244.58Show/hide
Query:  ILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGG-----WRFCVDYRKLNQATIFDKFPIPVIKELLDELHGATVF
        +L      PIY + Y      + E E  V +ML   +IR S+SPY+SP  +V KK        +R  +DYRKLN+ TI D++PIP + E+L +L     F
Subjt:  ILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGG-----WRFCVDYRKLNQATIFDKFPIPVIKELLDELHGATVF

Query:  SKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF
        + +DL    HQI M EE I KTAF T   HYE+L MPFGL NAPATFQ  MN + +P L +  LV+
Subjt:  SKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein9.3e-3249.29Show/hide
Query:  VRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQATIFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMRE
        ++PY      ++E  K+V K+L  + I PS SP SSP++LV KKDG +R CVDYR LN+ATI D FP+P I  LL  +  A +F+ LDL S  HQI M  
Subjt:  VRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQATIFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMRE

Query:  EDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFK
        +D  KTAF T    YE+ +MPFGL NAP+TF   M   F+
Subjt:  EDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.6e-3141.36Show/hide
Query:  LLAQYADIFETP-KGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKK-----DGGWRFCVDYRKLNQ
        LL ++  IFE P  G+  +  +   I T   Q PIY + Y Y    + E E+ + ++LQ  +IRPS+SPY+SPI +V KK     +  +R  VD+++LN 
Subjt:  LLAQYADIFETP-KGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKK-----DGGWRFCVDYRKLNQ

Query:  ATIFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF
         TI D +PIP I   L  L  A  F+ LDL S  HQI M+E DI KTAF T    YEFL +PFGL NAPA FQ +++ + +  + +   V+
Subjt:  ATIFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVF

Q99315 Transposon Ty3-G Gag-Pol polyprotein9.3e-3249.29Show/hide
Query:  VRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQATIFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMRE
        ++PY      ++E  K+V K+L  + I PS SP SSP++LV KKDG +R CVDYR LN+ATI D FP+P I  LL  +  A +F+ LDL S  HQI M  
Subjt:  VRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQATIFDKFPIPVIKELLDELHGATVFSKLDLKSVDHQIRMRE

Query:  EDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFK
        +D  KTAF T    YE+ +MPFGL NAP+TF   M   F+
Subjt:  EDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFK

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein4.8e-0752.5Show/hide
Query:  VQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGW
        +++   +  + +ML+ R+I+PS SPYSSP+LLV+KKDGGW
Subjt:  VQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAAGTTCTTGTTAGCACAGTATGCTGATATTTTTGAGACACCAAAAGGGTTACCTCCCAAGAGAGAAATAGATCACCGTATCCTGACTCTACTAGAGCAAAGACC
AATCTATGTGCGGCCATATAAATACGGCCATGTGCAAAAGGAAGAAACTGAAAAACTGGTGGCAAAAATGCTTCAGACAAGGGTAATTCGACCCAGTCACAGTCCCTATT
CGAGCCCTATTCTGTTGGTGAAGAAAAAGGATGGGGGATGGAGATTTTGCGTGGATTACAGGAAGTTGAACCAAGCCACAATTTTTGATAAATTTCCTATTCCAGTCATT
AAAGAACTATTGGATGAACTTCATGGAGCAACTGTATTTTCGAAACTAGACTTAAAATCTGTTGATCACCAAATCAGAATGAGGGAGGAGGACATCGAAAAGACCGCATT
TCGTACCCACGAAGACCATTACGAATTTTTGATAATGCCCTTTGGCTTGACTAATGCACCAGCCACATTCCAATCTCTCATGAACCAAGTATTTAAACCCTTCTTAAGAC
GCAGTGTGTTAGTTTTTTTTATGATATCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCAAGTTCTTGTTAGCACAGTATGCTGATATTTTTGAGACACCAAAAGGGTTACCTCCCAAGAGAGAAATAGATCACCGTATCCTGACTCTACTAGAGCAAAGACC
AATCTATGTGCGGCCATATAAATACGGCCATGTGCAAAAGGAAGAAACTGAAAAACTGGTGGCAAAAATGCTTCAGACAAGGGTAATTCGACCCAGTCACAGTCCCTATT
CGAGCCCTATTCTGTTGGTGAAGAAAAAGGATGGGGGATGGAGATTTTGCGTGGATTACAGGAAGTTGAACCAAGCCACAATTTTTGATAAATTTCCTATTCCAGTCATT
AAAGAACTATTGGATGAACTTCATGGAGCAACTGTATTTTCGAAACTAGACTTAAAATCTGTTGATCACCAAATCAGAATGAGGGAGGAGGACATCGAAAAGACCGCATT
TCGTACCCACGAAGACCATTACGAATTTTTGATAATGCCCTTTGGCTTGACTAATGCACCAGCCACATTCCAATCTCTCATGAACCAAGTATTTAAACCCTTCTTAAGAC
GCAGTGTGTTAGTTTTTTTTATGATATCCTAG
Protein sequenceShow/hide protein sequence
MVKFLLAQYADIFETPKGLPPKREIDHRILTLLEQRPIYVRPYKYGHVQKEETEKLVAKMLQTRVIRPSHSPYSSPILLVKKKDGGWRFCVDYRKLNQATIFDKFPIPVI
KELLDELHGATVFSKLDLKSVDHQIRMREEDIEKTAFRTHEDHYEFLIMPFGLTNAPATFQSLMNQVFKPFLRRSVLVFFMIS