CuGenDBv2

PI0015488 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0015488
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Retrovirus-related Pol polyprotein from transposon 297 family
Genome location: chr10:291996..293037
RNA-Seq Expression: PI0015488
Synteny: PI0015488
Gene Ontology terms: NA
InterPro domains: IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0042324.1 Retrovirus-related Pol polyprotein from transposon 297 family [Cucumis melo var. makuwa] | 1.1e-103 | 67.79
Query:  NEFARTSPNILLG--RPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL
        N + R  PN+  G   P      +  +  +GLLSVEFDG+VVS +IFDDVKSSN HVSLCALDTLESL+ LE+HDKFNELIDQ TLE+VENEFAK KPS 
Subjt:  NEFARTSPNILLG--RPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL

Query:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKEN----ESCEVYLLK----------
        DDLS FLDFV S+NDEY LVDN+ T E H   N+F  SNEH+ GR+IK D E ILK I I N H+ S HIKN EKEN    + CEV L K          
Subjt:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKEN----ESCEVYLLK----------

Query:  ----NENRIELKVLPSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSP
             EN I+LKVLPSHLKYVFLGEKN Y VIIS ELTKEQEARLLETLK H+QAIGAKNKIQ QRR+NP LKE VKKEV KLKDVGIIYPVP+STWVS 
Subjt:  ----NENRIELKVLPSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSP

Query:  IHVVPKKTGMTIVENSKGELVPTRVS
        I+VVPKKTGMTIVENS+G+ V TRVS
Subjt:  IHVVPKKTGMTIVENSKGELVPTRVS

KAA0054444.1 Retrovirus-related Pol polyprotein from transposon 297 family [Cucumis melo var. makuwa] | 9.0e-58 | 54.04
Query:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL
        M+NEFART P ILLGRPFLKTAKAIINVDKGLLSVEF+G+VVSF    DVKSSNDHVSLCALDT ESL+TLE+HDKFNELIDQ TLEYV+NEFAKEKPS 
Subjt:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL

Query:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHL
        DDLS FLDF+ SINDE+AL                                                                    EN I+LKVLPSHL
Subjt:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHL

Query:  KYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRR
        +YVFL EKN Y VI   EL +EQEARLL+TLKRH+QAI                       GAKNK+Q QRR
Subjt:  KYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRR

KAA0065429.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa] | 5.6e-76 | 56.68
Query:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL
        M+ EFARTSP ILLGRPFLKTAKAIINVDKGLLSVEFDG+V+SFNIFD                                       +V+NEFAKEKP+ 
Subjt:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL

Query:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKE----NESCEVYLLKNENRIELKVL
        DDLS FL+FV SINDE+ALVDNVTT E HA  NNF  SNEH+LGRDIK DDEQILK I I N H+I+THIKN EKE    N+SCE      EN IELKVL
Subjt:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKE----NESCEVYLLKNENRIELKVL

Query:  PSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIVE
        P HLKYVFLGEKN Y V+  +          +  L++     G   +   +   NPTLKEVVKKEVLKLKD+GIIY V ++TW++PIHVV KK  MTIV+
Subjt:  PSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIVE

Query:  NSKGELV
        NS+ +++
Subjt:  NSKGELV

KAA0066992.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa] | 8.1e-83 | 61.59
Query:  LKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSLDDLSTFLDFVPSINDEYA
        L    AI ++ + LLSVEFD DVVSFNIF DVKSSNDHVSLCALDTLESLE LE+HD FNELIDQ TLEYVENEFAK KPS DDLS FL FV S+NDE+ 
Subjt:  LKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSLDDLSTFLDFVPSINDEYA

Query:  LVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHLKYVFLGEKNIYPVIISNE
         VDN+ T E HA  N+F  SNEH+LGR+IK D+  I+K                                   +LKVLPSHLKY FL EKN YPVII  E
Subjt:  LVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHLKYVFLGEKNIYPVIISNE

Query:  LTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIV
        LTKEQEARL ETLKR++QA                        GAKNKIQ QRRLNPTLKEVVKKEVLKLKDVGIIY VP+STWVS IHVVPKK  +TIV
Subjt:  LTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIV

Query:  EN
        EN
Subjt:  EN

KAE8647060.1 hypothetical protein Csa_022980 [Cucumis sativus] | 8.3e-104 | 67.69
Query:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL
        M++EFARTSP ILLGRP LKTAKA+INVDKGLLSVEFDGDVVSFNIFDD+KSSNDHVSLCALDTLESLETLE+ DK NELIDQ TLEYV+NEF+KEKPS 
Subjt:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL

Query:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEK-----ENESCEVYLLK---------
        DDLS FL+FV SINDE+ALVDNVTT + HA  NNFA S E +LGRDIK +DEQ LK I I N  + S HI+ L+K     EN+ CEVYL K         
Subjt:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEK-----ENESCEVYLLK---------

Query:  ---------NENRIELKVLPSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRRLNPTL
                  +N IE+KV PSHLKYVFLGEKN YP IIS ELT+EQEARLLETLKRHKQAI                       GAKNK+Q QR LNPTL
Subjt:  ---------NENRIELKVLPSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRRLNPTL

Query:  KEVVKKEVLKLKDVGIIYPVPYSTW
        KEVVKKEVLKLK   IIYPVP+STW
Subjt:  KEVVKKEVLKLKDVGIIYPVPYSTW

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TE46 Uncharacterized protein | 8.2e-57 | 73.46
Query:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL
        M++EF+RTSP ILLG+ FLK AKA INVDKGLLSVEFDGD+ SFN FDDV SSNDHVSLCALDTLESL+T E+HDKFNELIDQ TLEYVENEFAK K S 
Subjt:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL

Query:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGN
        DD S FLDF  S+NDE+ALVDN+ T  HHA  N+FA SNEH+LGR+I  D+E  LK+I I N
Subjt:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGN

A0A5A7VIZ2 Retrovirus-related Pol polyprotein from transposon 17.6 | 2.7e-76 | 56.68
Query:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL
        M+ EFARTSP ILLGRPFLKTAKAIINVDKGLLSVEFDG+V+SFNIFD                                       +V+NEFAKEKP+ 
Subjt:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL

Query:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKE----NESCEVYLLKNENRIELKVL
        DDLS FL+FV SINDE+ALVDNVTT E HA  NNF  SNEH+LGRDIK DDEQILK I I N H+I+THIKN EKE    N+SCE      EN IELKVL
Subjt:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKE----NESCEVYLLKNENRIELKVL

Query:  PSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIVE
        P HLKYVFLGEKN Y V+  +          +  L++     G   +   +   NPTLKEVVKKEVLKLKD+GIIY V ++TW++PIHVV KK  MTIV+
Subjt:  PSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIVE

Query:  NSKGELV
        NS+ +++
Subjt:  NSKGELV

A0A5D3BDS0 Transposon Ty3-I Gag-Pol polyprotein | 3.9e-83 | 61.59
Query:  LKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSLDDLSTFLDFVPSINDEYA
        L    AI ++ + LLSVEFD DVVSFNIF DVKSSNDHVSLCALDTLESLE LE+HD FNELIDQ TLEYVENEFAK KPS DDLS FL FV S+NDE+ 
Subjt:  LKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSLDDLSTFLDFVPSINDEYA

Query:  LVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHLKYVFLGEKNIYPVIISNE
         VDN+ T E HA  N+F  SNEH+LGR+IK D+  I+K                                   +LKVLPSHLKY FL EKN YPVII  E
Subjt:  LVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHLKYVFLGEKNIYPVIISNE

Query:  LTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIV
        LTKEQEARL ETLKR++QA                        GAKNKIQ QRRLNPTLKEVVKKEVLKLKDVGIIY VP+STWVS IHVVPKK  +TIV
Subjt:  LTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIV

Query:  EN
        EN
Subjt:  EN

A0A5D3CSW0 Retrovirus-related Pol polyprotein from transposon 297 family | 4.4e-58 | 54.04
Query:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL
        M+NEFART P ILLGRPFLKTAKAIINVDKGLLSVEF+G+VVSF    DVKSSNDHVSLCALDT ESL+TLE+HDKFNELIDQ TLEYV+NEFAKEKPS 
Subjt:  MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL

Query:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHL
        DDLS FLDF+ SINDE+AL                                                                    EN I+LKVLPSHL
Subjt:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHL

Query:  KYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRR
        +YVFL EKN Y VI   EL +EQEARLL+TLKRH+QAI                       GAKNK+Q QRR
Subjt:  KYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAI-----------------------GAKNKIQAQRR

A0A5D3CW21 Retrovirus-related Pol polyprotein from transposon 297 family | 5.2e-104 | 67.79
Query:  NEFARTSPNILLG--RPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL
        N + R  PN+  G   P      +  +  +GLLSVEFDG+VVS +IFDDVKSSN HVSLCALDTLESL+ LE+HDKFNELIDQ TLE+VENEFAK KPS 
Subjt:  NEFARTSPNILLG--RPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSL

Query:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKEN----ESCEVYLLK----------
        DDLS FLDFV S+NDEY LVDN+ T E H   N+F  SNEH+ GR+IK D E ILK I I N H+ S HIKN EKEN    + CEV L K          
Subjt:  DDLSTFLDFVPSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKEN----ESCEVYLLK----------

Query:  ----NENRIELKVLPSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSP
             EN I+LKVLPSHLKYVFLGEKN Y VIIS ELTKEQEARLLETLK H+QAIGAKNKIQ QRR+NP LKE VKKEV KLKDVGIIYPVP+STWVS 
Subjt:  ----NENRIELKVLPSHLKYVFLGEKNIYPVIISNELTKEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSP

Query:  IHVVPKKTGMTIVENSKGELVPTRVS
        I+VVPKKTGMTIVENS+G+ V TRVS
Subjt:  IHVVPKKTGMTIVENSKGELVPTRVS

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGCAATGAATTTGCACGTACATCCCCTAACATACTTTTAGGAAGACCATTCTTGAAAACTGCTAAAGCCATAATAAATGTTGATAAAGGTCTTCTTAGTGTTGAGTT
TGACGGAGATGTTGTGTCTTTCAATATTTTTGATGATGTAAAGTCCTCTAATGATCATGTTTCATTGTGTGCACTTGATACTTTAGAATCATTAGAAACTTTAGAAAAGC
ATGATAAATTCAACGAGCTTATAGATCAAGTAACTCTAGAATATGTTGAAAATGAATTTGCTAAGGAGAAACCTAGTCTTGATGATCTTTCTACCTTCCTTGACTTTGTT
CCTAGTATAAATGATGAGTATGCTTTGGTTGATAATGTTACAACACTTGAACATCATGCTCGAGCTAATAATTTTGCAATTAGCAATGAACATAATCTAGGAAGAGATAT
TAAGTATGATGATGAGCAAATTTTGAAAATCATAGGCATTGGCAATATTCATAACATAAGCACCCACATAAAAAACCTTGAAAAAGAGAATGAATCTTGTGAAGTTTACT
TGCTAAAGAATGAAAACAGGATAGAGCTTAAAGTTCTTCCTTCTCATCTCAAATATGTGTTCTTGGGAGAAAAGAACATATATCCTGTCATAATTTCGAACGAACTTACC
AAAGAACAAGAAGCAAGGTTGCTTGAAACCTTGAAGAGGCATAAACAAGCCATTGGGGCAAAGAACAAGATTCAAGCACAAAGGCGCCTTAATCCTACCTTGAAGGAAGT
TGTCAAGAAAGAGGTCTTGAAGCTTAAAGACGTCGGTATCATCTATCCAGTACCGTATAGCACTTGGGTAAGCCCTATTCATGTAGTACCCAAGAAGACCGGTATGACTA
TTGTGGAAAATAGCAAAGGTGAGTTAGTTCCGACTCGTGTTTCAAACAGTTGA
mRNA sequence
ATGAGCAATGAATTTGCACGTACATCCCCTAACATACTTTTAGGAAGACCATTCTTGAAAACTGCTAAAGCCATAATAAATGTTGATAAAGGTCTTCTTAGTGTTGAGTT
TGACGGAGATGTTGTGTCTTTCAATATTTTTGATGATGTAAAGTCCTCTAATGATCATGTTTCATTGTGTGCACTTGATACTTTAGAATCATTAGAAACTTTAGAAAAGC
ATGATAAATTCAACGAGCTTATAGATCAAGTAACTCTAGAATATGTTGAAAATGAATTTGCTAAGGAGAAACCTAGTCTTGATGATCTTTCTACCTTCCTTGACTTTGTT
CCTAGTATAAATGATGAGTATGCTTTGGTTGATAATGTTACAACACTTGAACATCATGCTCGAGCTAATAATTTTGCAATTAGCAATGAACATAATCTAGGAAGAGATAT
TAAGTATGATGATGAGCAAATTTTGAAAATCATAGGCATTGGCAATATTCATAACATAAGCACCCACATAAAAAACCTTGAAAAAGAGAATGAATCTTGTGAAGTTTACT
TGCTAAAGAATGAAAACAGGATAGAGCTTAAAGTTCTTCCTTCTCATCTCAAATATGTGTTCTTGGGAGAAAAGAACATATATCCTGTCATAATTTCGAACGAACTTACC
AAAGAACAAGAAGCAAGGTTGCTTGAAACCTTGAAGAGGCATAAACAAGCCATTGGGGCAAAGAACAAGATTCAAGCACAAAGGCGCCTTAATCCTACCTTGAAGGAAGT
TGTCAAGAAAGAGGTCTTGAAGCTTAAAGACGTCGGTATCATCTATCCAGTACCGTATAGCACTTGGGTAAGCCCTATTCATGTAGTACCCAAGAAGACCGGTATGACTA
TTGTGGAAAATAGCAAAGGTGAGTTAGTTCCGACTCGTGTTTCAAACAGTTGA
Protein sequence
MSNEFARTSPNILLGRPFLKTAKAIINVDKGLLSVEFDGDVVSFNIFDDVKSSNDHVSLCALDTLESLETLEKHDKFNELIDQVTLEYVENEFAKEKPSLDDLSTFLDFV
PSINDEYALVDNVTTLEHHARANNFAISNEHNLGRDIKYDDEQILKIIGIGNIHNISTHIKNLEKENESCEVYLLKNENRIELKVLPSHLKYVFLGEKNIYPVIISNELT
KEQEARLLETLKRHKQAIGAKNKIQAQRRLNPTLKEVVKKEVLKLKDVGIIYPVPYSTWVSPIHVVPKKTGMTIVENSKGELVPTRVSNS