; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg017180 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg017180
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionstromal processing peptidase, chloroplastic-like
Genome locationscaffold4:39076280..39086721
RNA-Seq ExpressionSpg017180
SyntenySpg017180
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0009793 - embryo development ending in seed dormancy (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR007863 - Peptidase M16, C-terminal
IPR011249 - Metalloenzyme, LuxS/M16 peptidase-like
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152885.1 stromal processing peptidase, chloroplastic isoform X1 [Cucumis sativus]1.3e-9196.7Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES+V FEEEG DQDFQGV+P+GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

XP_008441914.1 PREDICTED: stromal processing peptidase, chloroplastic isoform X1 [Cucumis melo]1.3e-9196.7Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES+V FEEEG DQDFQGV+P+GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

XP_008441915.1 PREDICTED: stromal processing peptidase, chloroplastic isoform X2 [Cucumis melo]1.3e-9196.7Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES+V FEEEG DQDFQGV+P+GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

XP_011648983.1 stromal processing peptidase, chloroplastic isoform X2 [Cucumis sativus]1.3e-9196.7Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES+V FEEEG DQDFQGV+P+GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

XP_023537547.1 stromal processing peptidase, chloroplastic-like [Cucurbita pepo subsp. pepo]1.7e-9195.6Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFT+VRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACK+VLRGLH NKI+QRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIG+AGAQAGEES+VPFEEEG DQDFQGVVPTGRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

TrEMBL top hitse value%identityAlignment
A0A0A0LH02 Uncharacterized protein6.3e-9296.7Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES+V FEEEG DQDFQGV+P+GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

A0A1S3B556 stromal processing peptidase, chloroplastic isoform X16.3e-9296.7Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES+V FEEEG DQDFQGV+P+GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

A0A1S3B595 stromal processing peptidase, chloroplastic isoform X26.3e-9296.7Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES+V FEEEG DQDFQGV+P+GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

A0A5A7U266 Stromal processing peptidase3.1e-9196.69Show/hide
Query:  LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPR
        LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPR
Subjt:  LFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPR

Query:  KDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        KDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEES+V FEEEG DQDFQGV+P+GRGLSTMTRPT+
Subjt:  KDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

A0A6J1HNZ8 stromal processing peptidase, chloroplastic-like isoform X17.0e-9195.05Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFT+VRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACK+VLRGLH NKI+QRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIG+AGAQAGEES+V FEEEG DQDFQGVVPTGRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

SwissProt top hitse value%identityAlignment
B8B0E2 Stromal processing peptidase, chloroplastic8.5e-7072.68Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFT+VRDS+GLTYDVSFEL+LFD+L LGWYVI+VTSTP+KV+KAVDACK VLRGLHSNKI +RELDRAKRTLLM+HEAE K+NAYWLGLLAHLQ+SSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPT-GRGLSTMTRPTS
        RK++SCIK+LT LYE+ATI+D+Y+AY+ LKVD  SL+ CIGIAGA++GEE+     ++  D    G+ P  GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPT-GRGLSTMTRPTS

Q40983 Stromal processing peptidase, chloroplastic1.3e-7880.22Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFT+VRDSLGLTYDVSFEL+LFDRLKLGWYV+SVTSTP+KV+KAVDACK+VLRGLHSN I  RELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQ+SSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RKDLSCIKDLTSLYEAATI+D  +AY+QLKVD DSLY+CIG++GAQA ++   P EEE   + + GV+P GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

Q69TY5 Stromal processing peptidase, chloroplastic8.5e-7072.68Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFT+VRDS+GLTYDVSFEL+LFD+L LGWYVI+VTSTP+KV+KAVDACK VLRGLHSNKI +RELDRAKRTLLM+HEAE K+NAYWLGLLAHLQ+SSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPT-GRGLSTMTRPTS
        RK++SCIK+LT LYE+ATI+D+Y+AY+ LKVD  SL+ CIGIAGA++GEE+     ++  D    G+ P  GRGLSTMTRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPT-GRGLSTMTRPTS

Q9FIH8 Stromal processing peptidase, chloroplastic5.9e-7981.87Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFT+VRDSLGLTYDVSFEL+LFDRL LGWYVISVTSTP KVYKAVDACKSVLRGLHSN+IA RELDRAKRTLLMRHEAE+KSNAYWL LLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RK+LSCIK+L SLYEAA+I+D+Y+AY+QL+VD DSLY+CIGIAGAQAGEE  V  EEE P+  F GVVP GRG S  TRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS

Arabidopsis top hitse value%identityAlignment
AT5G42390.1 Insulinase (Peptidase family M16) family protein4.2e-8081.87Show/hide
Query:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP
        RLFT+VRDSLGLTYDVSFEL+LFDRL LGWYVISVTSTP KVYKAVDACKSVLRGLHSN+IA RELDRAKRTLLMRHEAE+KSNAYWL LLAHLQASSVP
Subjt:  RLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKSVLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVP

Query:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS
        RK+LSCIK+L SLYEAA+I+D+Y+AY+QL+VD DSLY+CIGIAGAQAGEE  V  EEE P+  F GVVP GRG S  TRPT+
Subjt:  RKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPDQDFQGVVPTGRGLSTMTRPTS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAG
CCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGT
GCTCCTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCAACTGTTGAATTCA
GGTGATCAACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATTTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATC
ATATAAAGATGTCTTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGATTATTGTTGTTCAACGAT
TCCATCAAAAGGATGATTGGCCTTCCATTCGGAACACCATTCTTGCCGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATAT
GATCAACATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCGATTGACTACCGACTCATTTTATCAAGACACTAT
GACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACCGAAATAT
CCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTTCCTTCATCCCTTGCCGGC
GGCGACGTTACAGTGGAAATCAAAGGGTTGACGGCCAGCTTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAG
TGAGAAATTGGATGGAAAAAATTTGAAACTACCAAAGGAAATCAAATCGCCTCCTAAAAATCAGGAATTCATTCCTAGTTTGGCCGAAACTGCTTCTCCAAAGGATTTAT
CTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGT
AAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATACAAATTATCTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAA
CAAGATAATAGCTTCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAA
TTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACACTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGAT
GTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCATCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCAT
CATGGCTCTTCCAACTGGCTCAATACCCAAACCGCCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGTTTATTTCATGGAATGTTAGAG
GTTTGGGCTCTTGGAAAAAAAGAGCATTAATTAAGAAGACCATCCAACAACAAAATCCGAGCTTCGTGCTACTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTT
ATTAAATCTATATGGAGTTCTTCTTGCATTGGTTGGGCTTCCCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGA
AGTTATTCAAGGTCACTTTTCAATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATT
TTTGGCAAGAACTCCATGATTTGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTT
ACTAGGAGCATGCGCACTTTCAATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATA
TCTCTCACTTCTGGATAGATTTTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTC
TTTCTTTTGGAGACATAGCTTGGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCT
CTCCAAGGCTGGCCAGGGCATGGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCATGAATCGTAATGATGTTTCCCAACTACCATCTCT
TATTTCTCAATTGAAGAATCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTTGCTAAGTTGTGGAGTCTTGTCTG
AGGCTTTCCCTCGTCTTTATAGATTATCTAATCGCTCGGAAGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAAT
GATTCGGAGACAAATGAGTGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCGTCTAATGCATTCAC
AACCTTGACTTTGCCTGATAGCATTCATGATTTTCTTTACATCCATACTTGTGGGACATCCCTTCCAAGGCTTTTCACAAGTGTCCGGGATTCTCTTGGATTGACTTATG
ACGTATCCTTTGAGTTGAGCCTGTTCGATAGGCTTAAGCTCGGATGGTATGTTATATCAGTAACATCAACTCCAGCAAAGGTATACAAAGCTGTTGATGCATGCAAAAGC
GTTCTGAGAGGTTTACATAGCAACAAAATTGCCCAAAGAGAGTTGGACAGGGCAAAACGTACTCTTCTTATGAGACATGAAGCTGAAATAAAGTCCAATGCTTATTGGCT
TGGCCTATTGGCTCATCTGCAGGCGTCTTCTGTTCCACGGAAGGACCTATCGTGCATCAAAGATCTTACGTCATTGTATGAAGCTGCCACCATTGATGACGTATACATTG
CTTATGATCAGTTGAAAGTGGACGCAGATTCTTTGTATACGTGCATTGGGATAGCTGGAGCTCAAGCTGGCGAAGAAAGTGTTGTTCCTTTTGAAGAGGAAGGACCAGAT
CAAGATTTTCAAGGTGTTGTTCCCACTGGACGCGGCTTATCTACCATGACCAGGCCCACATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAG
CCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGT
GCTCCTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCAACTGTTGAATTCA
GGTGATCAACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATTTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATC
ATATAAAGATGTCTTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGATTATTGTTGTTCAACGAT
TCCATCAAAAGGATGATTGGCCTTCCATTCGGAACACCATTCTTGCCGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATAT
GATCAACATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCGATTGACTACCGACTCATTTTATCAAGACACTAT
GACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACCGAAATAT
CCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTTCCTTCATCCCTTGCCGGC
GGCGACGTTACAGTGGAAATCAAAGGGTTGACGGCCAGCTTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAG
TGAGAAATTGGATGGAAAAAATTTGAAACTACCAAAGGAAATCAAATCGCCTCCTAAAAATCAGGAATTCATTCCTAGTTTGGCCGAAACTGCTTCTCCAAAGGATTTAT
CTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGT
AAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATACAAATTATCTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAA
CAAGATAATAGCTTCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAA
TTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACACTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGAT
GTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCATCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCAT
CATGGCTCTTCCAACTGGCTCAATACCCAAACCGCCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGTTTATTTCATGGAATGTTAGAG
GTTTGGGCTCTTGGAAAAAAAGAGCATTAATTAAGAAGACCATCCAACAACAAAATCCGAGCTTCGTGCTACTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTT
ATTAAATCTATATGGAGTTCTTCTTGCATTGGTTGGGCTTCCCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGA
AGTTATTCAAGGTCACTTTTCAATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATT
TTTGGCAAGAACTCCATGATTTGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTT
ACTAGGAGCATGCGCACTTTCAATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATA
TCTCTCACTTCTGGATAGATTTTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTC
TTTCTTTTGGAGACATAGCTTGGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCT
CTCCAAGGCTGGCCAGGGCATGGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCATGAATCGTAATGATGTTTCCCAACTACCATCTCT
TATTTCTCAATTGAAGAATCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTTGCTAAGTTGTGGAGTCTTGTCTG
AGGCTTTCCCTCGTCTTTATAGATTATCTAATCGCTCGGAAGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAAT
GATTCGGAGACAAATGAGTGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCGTCTAATGCATTCAC
AACCTTGACTTTGCCTGATAGCATTCATGATTTTCTTTACATCCATACTTGTGGGACATCCCTTCCAAGGCTTTTCACAAGTGTCCGGGATTCTCTTGGATTGACTTATG
ACGTATCCTTTGAGTTGAGCCTGTTCGATAGGCTTAAGCTCGGATGGTATGTTATATCAGTAACATCAACTCCAGCAAAGGTATACAAAGCTGTTGATGCATGCAAAAGC
GTTCTGAGAGGTTTACATAGCAACAAAATTGCCCAAAGAGAGTTGGACAGGGCAAAACGTACTCTTCTTATGAGACATGAAGCTGAAATAAAGTCCAATGCTTATTGGCT
TGGCCTATTGGCTCATCTGCAGGCGTCTTCTGTTCCACGGAAGGACCTATCGTGCATCAAAGATCTTACGTCATTGTATGAAGCTGCCACCATTGATGACGTATACATTG
CTTATGATCAGTTGAAAGTGGACGCAGATTCTTTGTATACGTGCATTGGGATAGCTGGAGCTCAAGCTGGCGAAGAAAGTGTTGTTCCTTTTGAAGAGGAAGGACCAGAT
CAAGATTTTCAAGGTGTTGTTCCCACTGGACGCGGCTTATCTACCATGACCAGGCCCACATCATGA
Protein sequenceShow/hide protein sequence
MTTAISATAQPWNHSTRSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINQLLNS
GDQRRLLIPSEDNKQGWFSFFSLISDYPGEAHRSTKSYKDVFQQKESHVVTTHPSSSVPSPQPLDSEIIVVQRFHQKDDWPSIRNTILAGISHRCSINPFQDNKALLHVY
DQHIVSKLCNNKDWSSIGKYRLKFYRLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIFRYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG
GDVTVEIKGLTASFFKSARFEESPSFSEQNNLEIKRSEKLDGKNLKLPKEIKSPPKNQEFIPSLAETASPKDLSPCLVSPQTISQPRSILQAADHSNISPKKKTAHSNKG
KSPLHVASPIEAKNHTNYLLPVGPTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTSHTPSSPHIIPSPTKD
VTPLQQPSSSPPEPSSLSLPTYLCHLAPMLSKHGLCIMALPTGSIPKPPTKKTKATTGKKSKLKREFISWNVRGLGSWKKRALIKKTIQQQNPSFVLLQETKKTSVDGKF
IKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNV
TRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNP
LQGWPGHGFMMKLKGLKMELRKWNIMNRNDVSQLPSLISQLKNLVASRIQRRLGNGCSTLFWHDSLLSCGVLSEAFPRLYRLSNRSEGTVADFWVSLNSAWDLSLRRNLN
DSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFTTLTLPDSIHDFLYIHTCGTSLPRLFTSVRDSLGLTYDVSFELSLFDRLKLGWYVISVTSTPAKVYKAVDACKS
VLRGLHSNKIAQRELDRAKRTLLMRHEAEIKSNAYWLGLLAHLQASSVPRKDLSCIKDLTSLYEAATIDDVYIAYDQLKVDADSLYTCIGIAGAQAGEESVVPFEEEGPD
QDFQGVVPTGRGLSTMTRPTS