CuGenDBv2

Chy4G082040 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy4G082040
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: HMA domain-containing protein
Genome location: chrH04:21521824..21530055
RNA-Seq Expression: Chy4G082040
Synteny: Chy4G082040
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR006121 - Heavy metal-associated domain, HMA
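
The genome location above is given as `chrH04:21521824..21530055`. As a minimal sketch, the string can be split into chromosome, start, and end like this. The helper names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'chrom:start..end' genome location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Chromosome name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a location string such as 'chrH04:21521824..21530055'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["chrom"], start, end)


loc = parse_location("chrH04:21521824..21530055")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 8232 bp on chrH04.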


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581006.1 hypothetical protein SDJN03_21008, partial [Cucurbita argyrosperma subsp. sororia] | e value: 6.43e-79 | identity: 77.6%
Query:  MASISTFPALHL----PFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVL
        MASIS FPALH       +  SPI   SF+LPSS++FT+SAASFRHS ++GRS  R LKQVR VEEDAS+PE GVE EA SPSPS+ PAVTVPVS SD+L
Subjt:  MASISTFPALHL----PFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVL

Query:  TMFFQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        TMFFQAEGTLNES++PSVT ALEQT+GI+GLKVQ+VEGIASV LTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFED+EEV
Subjt:  TMFFQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV

XP_004147445.1 uncharacterized protein LOC101219347 [Cucumis sativus] | e value: 8.96e-108 | identity: 98.32%
Query:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF
        MASISTFPALH PFSKSSPIPSHSFSLPS TNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF
Subjt:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF

Query:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        QAEGTLNES+IPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
Subjt:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV

XP_008443474.1 PREDICTED: uncharacterized protein LOC103487059 isoform X1 [Cucumis melo] | e value: 1.73e-100 | identity: 92.74%
Query:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF
        MASISTFPALHLPF  SSP+P  SFSLPSSTNFTLSAASFRHSP+QGRSLPR LKQVRAVEEDASVPELGVE+EASSPSPSDPPAVTVPVSPSDVLTMFF
Subjt:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF

Query:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        QAEGTLNES+IPSVT ALEQTEGI+GLKVQV EGIASV LTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
Subjt:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV

XP_023526965.1 uncharacterized protein LOC111790332 [Cucurbita pepo subsp. pepo] | e value: 2.59e-78 | identity: 76.5%
Query:  MASISTFPALHL----PFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVL
        MASIS FPALH       +  SPI    F+LPSS++FT+SAASFRHS ++GRS PR LKQ R VEEDAS+PE GVE E  SPSPS+ PAVTVPVS SD+L
Subjt:  MASISTFPALHL----PFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVL

Query:  TMFFQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        TMFFQAEGTLNES++PSVT ALEQT+GI+GLKVQ+VEGIASV LTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFED+EEV
Subjt:  TMFFQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV

XP_038903053.1 uncharacterized protein LOC120089747 [Benincasa hispida] | e value: 8.59e-82 | identity: 82.07%
Query:  MASIS-TFPALH----LPFSKSSPIPSHSFSLPSSTNFTLSAAS-FRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSD
        MASIS TFPALH    L F   SPIP  SFSLPSSTNFTL+AAS F HSP+ GRS  R LKQ +AVEEDAS+PE GVE E  SPSPSD PAVTVPVSPSD
Subjt:  MASIS-TFPALH----LPFSKSSPIPSHSFSLPSSTNFTLSAAS-FRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSD

Query:  VLTMFFQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEE
        +LTMFFQAEGTLNES+IP+VT ALEQTEGI+GLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFED+EE
Subjt:  VLTMFFQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LC88 HMA domain-containing protein | e value: 5.2e-84 | identity: 98.32%
Query:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF
        MASISTFPALH PFSKSSPIPSHSFSLPS TNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF
Subjt:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF

Query:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        QAEGTLNES+IPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
Subjt:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV

A0A0A0LFE6 Uncharacterized protein | e value: 1.8e-60 | identity: 96.69%
Query:  VSANANANGLSQEKDAVYPHVPVPVQAPSTPVYKAPPVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTPPPVKAPYTPAPPVKLPPPPY
        VSANANANGLSQ+KDAVYPHVPVPVQAPSTPVYKAPPVKPPTIPVLTPPPA  VKPPTIPVLTPPPA PVKPPTIPVLTPPPVKAPYTPAPPVKLPPPPY
Subjt:  VSANANANGLSQEKDAVYPHVPVPVQAPSTPVYKAPPVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTPPPVKAPYTPAPPVKLPPPPY

Query:  THSPPVKPPSSPPPAKAPYTPSPPVKPPSTPVPPVKPPSPAAPRPPPVLGK
        T SPPVKPPSSPPPAKAPYTPSPPVKPPSTPVPPVKPPSPAAPRPPPVLGK
Subjt:  THSPPVKPPSSPPPAKAPYTPSPPVKPPSTPVPPVKPPSPAAPRPPPVLGK

A0A1S3B868 uncharacterized protein LOC103487059 isoform X1 | e value: 1.9e-78 | identity: 92.74%
Query:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF
        MASISTFPALHLPF  SSP+P  SFSLPSSTNFTLSAASFRHSP+QGRSLPR LKQVRAVEEDASVPELGVE+EASSPSPSDPPAVTVPVSPSDVLTMFF
Subjt:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF

Query:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        QAEGTLNES+IPSVT ALEQTEGI+GLKVQV EGIASV LTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
Subjt:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV

A0A5D3D1F9 Heavy-metal-associated domain-containing protein | e value: 1.9e-78 | identity: 92.74%
Query:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF
        MASISTFPALHLPF  SSP+P  SFSLPSSTNFTLSAASFRHSP+QGRSLPR LKQVRAVEEDASVPELGVE+EASSPSPSDPPAVTVPVSPSDVLTMFF
Subjt:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFF

Query:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        QAEGTLNES+IPSVT ALEQTEGI+GLKVQV EGIASV LTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
Subjt:  QAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV

A0A6J1F3X5 uncharacterized protein LOC111441946 | e value: 3.7e-61 | identity: 77.05%
Query:  MASISTFPALHL----PFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVL
        MASIS FPALH       +  SPI   SF+LPSS++FT+SAASF HS ++GRS  R LKQVR VEEDAS+PE GVE EA SPSPS+ PAVTVPVS SD+L
Subjt:  MASISTFPALHL----PFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVL

Query:  TMFFQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        TMFFQAEGTLNES++PSVT ALEQT+GI+GLKVQ+VEGIASV LTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFED+EEV
Subjt:  TMFFQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV

SwissProt top hits (e value, % identity, alignment)
Q9FZA2 Non-classical arabinogalactan protein 31 | e value: 6.8e-04 | identity: 47.65%
Query:  KDAVYPHVPVPVQAPSTPVYKAP-------PVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTP---PPVKAPYTP--APPVKLPPPPYT
        K  V P V  PV  P+ P  K P       PVKPPT P + PP +P  KPP  P + PP   PVKPPT P + P   PP KAP  P   PPVK P  P T
Subjt:  KDAVYPHVPVPVQAPSTPVYKAP-------PVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTP---PPVKAPYTP--APPVKLPPPPYT

Query:  HSPPVKPPSSPPPAKAPYTP--SPPVKPPSTP--VPPVKPPSPAAPRPP
         + PVKPP+  PP K P +P   PPVKPP  P    PVKPP     +PP
Subjt:  HSPPVKPPSSPPPAKAPYTP--SPPVKPPSTP--VPPVKPPSPAAPRPP

Arabidopsis top hits (e value, % identity, alignment)
AT1G28290.1 arabinogalactan protein 31 | e value: 4.8e-05 | identity: 47.65%
Query:  KDAVYPHVPVPVQAPSTPVYKAP-------PVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTP---PPVKAPYTP--APPVKLPPPPYT
        K  V P V  PV  P+ P  K P       PVKPPT P + PP +P  KPP  P + PP   PVKPPT P + P   PP KAP  P   PPVK P  P T
Subjt:  KDAVYPHVPVPVQAPSTPVYKAP-------PVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTP---PPVKAPYTP--APPVKLPPPPYT

Query:  HSPPVKPPSSPPPAKAPYTP--SPPVKPPSTP--VPPVKPPSPAAPRPP
         + PVKPP+  PP K P +P   PPVKPP  P    PVKPP     +PP
Subjt:  HSPPVKPPSSPPPAKAPYTP--SPPVKPPSTP--VPPVKPPSPAAPRPP

AT4G13340.1 Leucine-rich repeat (LRR) family protein | e value: 4.1e-04 | identity: 45%
Query:  PVQAPSTPVYKAPPVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTPPPVKAP------YTPAPPVKLPPPPYTHSPPVKPP-SSPPPAK
        P  +P  PVY  PP  PP  PV +PPP P   PP  PV +PPP  P  PP  PV +PPP   P      Y+P PP   PPPP  +SPP  P  SSPPP  
Subjt:  PVQAPSTPVYKAPPVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTPPPVKAP------YTPAPPVKLPPPPYTHSPPVKPP-SSPPPAK

Query:  AP-----YTPSPPVKPPSTPVPPVKPPSPAAP----RPPP
        +P     Y   PP  PP +P PP   P P  P     PPP
Subjt:  AP-----YTPSPPVKPPSTPVPPVKPPSPAAP----RPPP

AT4G15160.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein | e value: 7.4e-06 | identity: 43.11%
Query:  PHVPVPVQAPSTPVYKAPP-----------VKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTPPPVKAPYTPAPPVKLPPPPYTHSPPVK
        P V  P   P  P  K PP            KPPT+    PPP P VKPP  P + PPP   VKPP  P + PPP   PYTP PP    PPP T  PP  
Subjt:  PHVPVPVQAPSTPVYKAPP-----------VKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTPPPVKAPYTPAPPVKLPPPPYTHSPPVK

Query:  P-PSSPPPAKAPYTPSPPVKPPSTPVPPVKPPSPAAPRPPPVLGKGASV---FHQVHMGTGKSVESA
        P  + PPP   P  P PP  PP TP PP  PP P    P   L  GA V      +H+G GKS   A
Subjt:  P-PSSPPPAKAPYTPSPPVKPPSTPVPPVKPPSPAAPRPPPVLGKGASV---FHQVHMGTGKSVESA

AT5G14910.1 Heavy metal transport/detoxification superfamily protein | e value: 9.3e-33 | identity: 50.56%
Query:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVR-AVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMF
        MASI+     H      SP  S +  LP S N      +  +S  +  S    +KQ R  + + +S+ E G     +   P +   V+VPVSPSD+LTMF
Subjt:  MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVR-AVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMF

Query:  FQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV
        FQA+GTLNE++IP+VT AL+  +G+S LKVQV EG+A V L KQTT+Q+TGVAS+L+ETIQGAGFKLQTLNLSFED++EV
Subjt:  FQAEGTLNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEV


Sequences
CDS sequence
ATGGCTTCAATCTCCACCTTCCCCGCCCTTCACCTTCCATTTTCAAAATCATCACCAATTCCCAGCCATTCCTTCTCTCTTCCTTCCTCCACTAACTTCACTCTC
TCTGCCGCCTCCTTCAGACATTCTCCGCTTCAGGGCCGGAGCCTGCCACGCGTCCTCAAACAAGTCAGGGCAGTTGAAGAGGATGCCTCTGTTCCCGAGCTAGGA
GTTGAAAGCGAAGCGTCGTCGCCTTCTCCATCTGACCCGCCTGCCGTCACCGTCCCCGTCTCCCCCTCTGATGTTCTCACCATGTTCTTTCAGGCAGAGGGGACG
CTAAATGAATCATCTATTCCTTCTGTAACTGGGGCTTTGGAGCAAACGGAGGGTATTTCCGGCTTGAAAGTCCAAGTCGTTGAGGGCATTGCATCAGTTGCGCTT
ACAAAACAAACAACAATACAATCTACAGGAGTGGCCTCGAGTTTGATCGAGACCATTCAAGGTGCAGGTTTTAAGTTACAAACGTTGAATTTGAGTTTTGAGGAT
GAGGAAGAAGTTTCAGCCAATGCCAATGCCAATGGACTGTCCCAAGAGAAGGATGCTGTTTACCCCCATGTTCCAGTTCCAGTTCAAGCTCCTTCTACTCCAGTT
TACAAAGCGCCGCCAGTGAAGCCACCAACAATCCCTGTTTTGACCCCACCGCCAGCTCCGCTTGTGAAGCCACCAACAATCCCTGTTTTGACGCCACCGCCAGCT
CTGCCTGTTAAGCCACCCACAATCCCTGTTTTGACACCACCCCCAGTTAAAGCACCATATACCCCTGCTCCACCAGTTAAGCTTCCACCACCTCCATATACTCAT
TCTCCACCGGTGAAGCCACCGTCTTCTCCCCCGCCTGCTAAAGCTCCTTACACTCCATCTCCTCCTGTGAAGCCACCTTCCACCCCTGTTCCACCAGTGAAGCCA
CCGTCTCCGGCAGCTCCTAGGCCACCTCCAGTCTTAGGCAAGGGTGCAAGTGTGTTCCACCAGGTACATATGGGAACAGGGAAGTCTGTGGAAAGTGCTACACTG
ACATGA
mRNA sequence
ATGGCTTCAATCTCCACCTTCCCCGCCCTTCACCTTCCATTTTCAAAATCATCACCAATTCCCAGCCATTCCTTCTCTCTTCCTTCCTCCACTAACTTCACTCTC
TCTGCCGCCTCCTTCAGACATTCTCCGCTTCAGGGCCGGAGCCTGCCACGCGTCCTCAAACAAGTCAGGGCAGTTGAAGAGGATGCCTCTGTTCCCGAGCTAGGA
GTTGAAAGCGAAGCGTCGTCGCCTTCTCCATCTGACCCGCCTGCCGTCACCGTCCCCGTCTCCCCCTCTGATGTTCTCACCATGTTCTTTCAGGCAGAGGGGACG
CTAAATGAATCATCTATTCCTTCTGTAACTGGGGCTTTGGAGCAAACGGAGGGTATTTCCGGCTTGAAAGTCCAAGTCGTTGAGGGCATTGCATCAGTTGCGCTT
ACAAAACAAACAACAATACAATCTACAGGAGTGGCCTCGAGTTTGATCGAGACCATTCAAGGTGCAGGTTTTAAGTTACAAACGTTGAATTTGAGTTTTGAGGAT
GAGGAAGAAGTTTCAGCCAATGCCAATGCCAATGGACTGTCCCAAGAGAAGGATGCTGTTTACCCCCATGTTCCAGTTCCAGTTCAAGCTCCTTCTACTCCAGTT
TACAAAGCGCCGCCAGTGAAGCCACCAACAATCCCTGTTTTGACCCCACCGCCAGCTCCGCTTGTGAAGCCACCAACAATCCCTGTTTTGACGCCACCGCCAGCT
CTGCCTGTTAAGCCACCCACAATCCCTGTTTTGACACCACCCCCAGTTAAAGCACCATATACCCCTGCTCCACCAGTTAAGCTTCCACCACCTCCATATACTCAT
TCTCCACCGGTGAAGCCACCGTCTTCTCCCCCGCCTGCTAAAGCTCCTTACACTCCATCTCCTCCTGTGAAGCCACCTTCCACCCCTGTTCCACCAGTGAAGCCA
CCGTCTCCGGCAGCTCCTAGGCCACCTCCAGTCTTAGGCAAGGGTGCAAGTGTGTTCCACCAGGTACATATGGGAACAGGGAAGTCTGTGGAAAGTGCTACACTG
ACATGA
Protein sequence
MASISTFPALHLPFSKSSPIPSHSFSLPSSTNFTLSAASFRHSPLQGRSLPRVLKQVRAVEEDASVPELGVESEASSPSPSDPPAVTVPVSPSDVLTMFFQAEGT
LNESSIPSVTGALEQTEGISGLKVQVVEGIASVALTKQTTIQSTGVASSLIETIQGAGFKLQTLNLSFEDEEEVSANANANGLSQEKDAVYPHVPVPVQAPSTPV
YKAPPVKPPTIPVLTPPPAPLVKPPTIPVLTPPPALPVKPPTIPVLTPPPVKAPYTPAPPVKLPPPPYTHSPPVKPPSSPPPAKAPYTPSPPVKPPSTPVPPVKP
PSPAAPRPPPVLGKGASVFHQVHMGTGKSVESATLT
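
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1); the record does not say which tool or table the database used, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped CDS lines can be
    pasted in directly. A codon containing anything other than A, C, G, T
    raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First five codons and last four codons of the CDS shown above.
print(translate("ATGGCTTCAATCTCC"))  # MASIS
print(translate("ACACTGACATGA"))     # TLT
```

The two fragments match the start (`MASIS...`) and end (`...TLT`) of the listed protein sequence; the final `TGA` is the stop codon and produces no residue.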