CuGenDBv2

Spg006649 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg006649
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: CAAX amino terminal protease
Genome location: scaffold7:47519919..47526652
RNA-Seq Expression: Spg006649
Synteny: Spg006649
Gene Ontology terms:
GO:0071586 - CAAX-box protein processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits: e value, %identity, alignment
XP_022933054.1 uncharacterized protein LOC111439764 isoform X1 [Cucurbita moschata] (e value: 1.0e-89; %identity: 87.44)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIAL ILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

XP_022997131.1 uncharacterized protein LOC111492131 isoform X1 [Cucurbita maxima] (e value: 2.8e-90; %identity: 87.94)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIALTILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

XP_022997159.1 uncharacterized protein LOC111492131 isoform X3 [Cucurbita maxima] (e value: 2.8e-90; %identity: 87.94)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIALTILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

XP_023530074.1 uncharacterized protein LOC111792735 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 2.8e-90; %identity: 87.94)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIALTILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

XP_023530096.1 uncharacterized protein LOC111792735 isoform X3 [Cucurbita pepo subsp. pepo] (e value: 2.8e-90; %identity: 87.94)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIALTILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

TrEMBL top hits: e value, %identity, alignment
A0A5A7UGF1 Uncharacterized protein (e value: 6.9e-87; %identity: 85.43)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        ++LCE   TTESS K  E  +LQKEPER  EQII +K+HNIV+SLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGG+LK++GRMALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRIL SVGLVLVLWSPITLPLLPKLVDSWTS+TPSK+ NLAC FGLYIALTILVM+WGKRIRGYENPAKEYGLDLTSWSK
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

A0A6J1EY34 uncharacterized protein LOC111439764 isoform X3 (e value: 5.1e-90; %identity: 87.44)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIAL ILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

A0A6J1EYN6 uncharacterized protein LOC111439764 isoform X1 (e value: 5.1e-90; %identity: 87.44)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIAL ILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

A0A6J1K6M7 uncharacterized protein LOC111492131 isoform X1 (e value: 1.3e-90; %identity: 87.94)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIALTILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

A0A6J1KAN7 uncharacterized protein LOC111492131 isoform X3 (e value: 1.3e-90; %identity: 87.94)
Query:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA
        +NLC  ++TTESSFKS EKASLQKEPER +EQIIPEK+HNIVT+LAEKAMSVASPVVPKKEDGEVDEERLV+MLAELGEKGGILK+VGR+ALLWGGIRTA
Subjt:  KNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTA

Query:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK
        +S++EKLISILRIAERPLFQRILGSVGLVL+LWSPITLPLLPKLVDSWTS TPSK+ANLAC FGLYIALTILVM+WGKRIRGYE+PAKEYGLDL SW K
Subjt:  VSLSEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSK

SwissProt top hits: e value, %identity, alignment
No hits found
Arabidopsis top hits: e value, %identity, alignment
AT2G03140.1 alpha/beta-Hydrolases superfamily protein (e value: 8.4e-53; %identity: 54.79)
Query:  KSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTAVSLSEKLISILRIA
        KS++K S QKE  + ++        ++V S AEKAMS+A P VP KE GEVD++R+V+MLA+LG++GGIL +VG++ALLWGG+R A+SL+++LI  L + 
Subjt:  KSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTAVSLSEKLISILRIA

Query:  ERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSKPKI
        E PL +R +G +G+VLVLWSP+ +PLLP L+ +W++  PS++A LA V GLY+A+ ILVMLWGKR+R YENP K+YGLDL + +K KI
Subjt:  ERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSKPKI

AT2G03140.2 alpha/beta-Hydrolases superfamily protein (e value: 8.4e-53; %identity: 54.79)
Query:  KSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTAVSLSEKLISILRIA
        KS++K S QKE  + ++        ++V S AEKAMS+A P VP KE GEVD++R+V+MLA+LG++GGIL +VG++ALLWGG+R A+SL+++LI  L + 
Subjt:  KSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTAVSLSEKLISILRIA

Query:  ERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSKPKI
        E PL +R +G +G+VLVLWSP+ +PLLP L+ +W++  PS++A LA V GLY+A+ ILVMLWGKR+R YENP K+YGLDL + +K KI
Subjt:  ERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSKPKI
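
The %identity values reported for the hits above are conventionally the number of identical alignment columns divided by the alignment length. The following is a minimal sketch of that arithmetic, assuming the BLAST-style convention that a letter on the middle line marks an identical residue while `+` and a space mark a conservative substitution and a mismatch or gap. The function name `percent_identity` and its input layout are hypothetical and not part of CuGenDB. Column counts are taken from the query segments because trailing spaces on the middle line may be lost in extracted text.

```python
from typing import Iterable, Tuple


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Percent identity over one or more alignment blocks.

    Each block is a (query_segment, midline_segment) pair. A letter on the
    midline is counted as an identical column; '+' and ' ' are not.
    """
    columns = 0
    identical = 0
    for query_segment, midline_segment in blocks:
        columns += len(query_segment)
        identical += sum(1 for ch in midline_segment if ch.isalpha())
    if columns == 0:
        raise ValueError("alignment has no columns")
    return 100.0 * identical / columns


if __name__ == "__main__":
    # Toy alignment: 6 identical columns out of 9.
    toy = [("ACDEF", "AC+ F"), ("GHIK", "GH K")]
    print(f"{percent_identity(toy):.2f}")
```

As a plausibility check on the figures above, 175 identical columns over a 199-column alignment rounds to 87.94, and 174 over 199 rounds to 87.44.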


Sequences
CDS sequence
ATGAATGAGATTTTGCCCTTTAAGAATTTATGTGAAGCTGAAGAGACTACTGAAAGTTCATTCAAGTCCAATGAGAAAGCTAGTCTTCAGAAAGAGCCAGAAAGGCAAAA
CGAGCAGATAATACCTGAAAAAGATCATAACATTGTCACTTCTCTTGCTGAGAAGGCAATGTCAGTTGCTAGTCCGGTGGTGCCGAAGAAAGAAGATGGTGAAGTGGATG
AAGAAAGGCTAGTTTCTATGTTAGCTGAGTTAGGAGAGAAGGGTGGCATACTGAAGGTAGTAGGCAGAATGGCTTTACTCTGGGGTGGTATACGTACTGCAGTGAGTTTG
AGTGAAAAACTTATCTCGATTCTTCGAATAGCAGAACGCCCTTTATTTCAAAGGATTCTTGGGTCTGTCGGCTTGGTGCTTGTTTTATGGTCTCCAATTACTCTTCCACT
GCTTCCAAAACTTGTTGATAGCTGGACTTCTCGTACTCCCTCAAAACTTGCTAATCTTGCTTGTGTTTTTGGTCTTTATATTGCTCTCACAATTCTGGTTATGTTGTGGG
GAAAAAGAATACGCGGGTATGAAAATCCAGCAAAAGAATATGGACTCGATTTGACATCTTGGTCGAAGCCCAAGATTTTACTTCAAGTGGATGAATTACACAGTAAGGAA
GGGGAAAAGTCTTATGTGGAAGTGGTTAAGTTGCATTCTATGGTGAACTTCAGCTCCAAGGTTCCTCAGTTACAGAAAGTTGATTCAGATAAGTCTTCTCCTATTCAATC
TTATTGGGTTCGTAAGGAACATGAGGCGTTGCGTTTAGATCTTGAAAATTTATGGATAGTATCTAGATTATTTGCCCATAATGAGTGGAAGGAGATTAAAGCTACTTTCG
AAGATCATTTTCAGTCAAAAGTTTTGATTAATCCACTCTTTGATGATAAAACCTTGATTAAAATTGGTAAAGATATTTGTGATATTCCTAATGTTTCAATTGAAAAGTGG
AAATCCTTTGTCAAAGATAAAACTAGAGGGAATATTTTTCTTCACTTTGGAGATATTGAAGCTTTGGATCCTCCAAATATTATCAATAGAGAGTTTCATGTTAGTGATTT
TCAGAATCCAATGGACCTTTTTCGGCTTAATAAGGTTATGGAGGATGAAGATTTAAATCCTATGGATAAACCATTCAAAGAAATCGCTGATAATGAGTTTTGTAATAATG
CTACGTGTAACAAGGTGGAATTTCAAGCTTCTCCTTCTAATCTGGATTTGGTGATTGGTGTGAAATCTAATGGAGTTTCAAAGAAAGTTATAGATAAGGTAGTCTTAAAT
TCGAATTGTGAATCTCTGACAAATCTCTTTGTTCATGCTGTCATTTCCGCACTTTTCTACTCATGCGCCTCTCTCATTCATGCTTTTGCTAGCATTCCAGTCCCCTCCGA
G
mRNA sequence
ATGAATGAGATTTTGCCCTTTAAGAATTTATGTGAAGCTGAAGAGACTACTGAAAGTTCATTCAAGTCCAATGAGAAAGCTAGTCTTCAGAAAGAGCCAGAAAGGCAAAA
CGAGCAGATAATACCTGAAAAAGATCATAACATTGTCACTTCTCTTGCTGAGAAGGCAATGTCAGTTGCTAGTCCGGTGGTGCCGAAGAAAGAAGATGGTGAAGTGGATG
AAGAAAGGCTAGTTTCTATGTTAGCTGAGTTAGGAGAGAAGGGTGGCATACTGAAGGTAGTAGGCAGAATGGCTTTACTCTGGGGTGGTATACGTACTGCAGTGAGTTTG
AGTGAAAAACTTATCTCGATTCTTCGAATAGCAGAACGCCCTTTATTTCAAAGGATTCTTGGGTCTGTCGGCTTGGTGCTTGTTTTATGGTCTCCAATTACTCTTCCACT
GCTTCCAAAACTTGTTGATAGCTGGACTTCTCGTACTCCCTCAAAACTTGCTAATCTTGCTTGTGTTTTTGGTCTTTATATTGCTCTCACAATTCTGGTTATGTTGTGGG
GAAAAAGAATACGCGGGTATGAAAATCCAGCAAAAGAATATGGACTCGATTTGACATCTTGGTCGAAGCCCAAGATTTTACTTCAAGTGGATGAATTACACAGTAAGGAA
GGGGAAAAGTCTTATGTGGAAGTGGTTAAGTTGCATTCTATGGTGAACTTCAGCTCCAAGGTTCCTCAGTTACAGAAAGTTGATTCAGATAAGTCTTCTCCTATTCAATC
TTATTGGGTTCGTAAGGAACATGAGGCGTTGCGTTTAGATCTTGAAAATTTATGGATAGTATCTAGATTATTTGCCCATAATGAGTGGAAGGAGATTAAAGCTACTTTCG
AAGATCATTTTCAGTCAAAAGTTTTGATTAATCCACTCTTTGATGATAAAACCTTGATTAAAATTGGTAAAGATATTTGTGATATTCCTAATGTTTCAATTGAAAAGTGG
AAATCCTTTGTCAAAGATAAAACTAGAGGGAATATTTTTCTTCACTTTGGAGATATTGAAGCTTTGGATCCTCCAAATATTATCAATAGAGAGTTTCATGTTAGTGATTT
TCAGAATCCAATGGACCTTTTTCGGCTTAATAAGGTTATGGAGGATGAAGATTTAAATCCTATGGATAAACCATTCAAAGAAATCGCTGATAATGAGTTTTGTAATAATG
CTACGTGTAACAAGGTGGAATTTCAAGCTTCTCCTTCTAATCTGGATTTGGTGATTGGTGTGAAATCTAATGGAGTTTCAAAGAAAGTTATAGATAAGGTAGTCTTAAAT
TCGAATTGTGAATCTCTGACAAATCTCTTTGTTCATGCTGTCATTTCCGCACTTTTCTACTCATGCGCCTCTCTCATTCATGCTTTTGCTAGCATTCCAGTCCCCTCCGA
G
Protein sequence
MNEILPFKNLCEAEETTESSFKSNEKASLQKEPERQNEQIIPEKDHNIVTSLAEKAMSVASPVVPKKEDGEVDEERLVSMLAELGEKGGILKVVGRMALLWGGIRTAVSL
SEKLISILRIAERPLFQRILGSVGLVLVLWSPITLPLLPKLVDSWTSRTPSKLANLACVFGLYIALTILVMLWGKRIRGYENPAKEYGLDLTSWSKPKILLQVDELHSKE
GEKSYVEVVKLHSMVNFSSKVPQLQKVDSDKSSPIQSYWVRKEHEALRLDLENLWIVSRLFAHNEWKEIKATFEDHFQSKVLINPLFDDKTLIKIGKDICDIPNVSIEKW
KSFVKDKTRGNIFLHFGDIEALDPPNIINREFHVSDFQNPMDLFRLNKVMEDEDLNPMDKPFKEIADNEFCNNATCNKVEFQASPSNLDLVIGVKSNGVSKKVIDKVVLN
SNCESLTNLFVHAVISALFYSCASLIHAFASIPVPSE
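
The protein sequence above is expected to correspond to the CDS sequence read three bases at a time. The sketch below shows one way to check this, assuming the standard nuclear genetic code and a CDS that starts in frame at its first base. The helper name `translate` is hypothetical, and the line breaks of the displayed sequence are removed before translation.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; unknown codons become 'X'."""
    # Remove whitespace and line breaks, and normalise case.
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


if __name__ == "__main__":
    # First 24 bases of the CDS sequence listed above.
    print(translate("ATGAATGAGATTTTGCCCTTTAAG"))
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is a quick consistency check between the two records.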