CuGenDBv2

Sed0024933 (gene) of Chayote v1 genome

Gene ID: Sed0024933
Organism: Sechium edule (Chayote v1)
Description: Unknown protein
Genome location: LG12:32588598..32589619
RNA-Seq Expression: Sed0024933
Synteny: Sed0024933
Gene Ontology terms: NA
InterPro domains: NA
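
The genome location field follows a `sequence:start..end` layout. As a minimal sketch, the Python below parses that string and reports the span length. It assumes 1-based inclusive coordinates, which the page does not state, and the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(location):
    """Split a string such as 'LG12:32588598..32589619' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seq_id, start, end = parse_location("LG12:32588598..32589619")
print(seq_id, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this record comes to 1022 bp.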


Homology
GenBank top hits (e value, % identity, alignment)
KAG7023388.1 hypothetical protein SDJN02_14413 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 7.1e-85; % identity: 66.67)
Query:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF
        MF+DD +E  PLTS+H QDH         E++E+L+FSDLP     SD   P    KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND 
Subjt:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF

Query:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR
        S       RT++  FW ++SRK + FRKRS+SL GLQ+SVS     KINLKR+SRSLDYRK++RQ+NSI    +EIDR+ S K+GLK D ++KKA SKPR
Subjt:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR

Query:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        WYLLMFGMVKFPAEM+L DIKSRQVRRSSSALFP NE+KGK+ C  RSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA

XP_008453606.1 PREDICTED: uncharacterized protein LOC103494265 [Cucumis melo] (e value: 3.2e-85; % identity: 69.92)
Query:  IDDANEPNPLTSKHNQ-----DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPR-T
        +DD +E NPLTSK NQ     D ESDESL+FSDLP    NSD     +   KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND S R T
Subjt:  IDDANEPNPLTSKHNQ-----DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPR-T

Query:  SDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMV
        +D  FW E+ SRK + FRKRS+SL GLQ+SV    S K NLKR+SRSLDYR+++RQ+NSI    +EIDR+ S K+GLK D ++KK  SKPRWYLLMFGMV
Subjt:  SDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMV

Query:  KFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        KFPAEMEL DIKSRQVRRSSS LFP NE K KF CGRSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  KFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA

XP_022921809.1 uncharacterized protein LOC111429953 [Cucurbita moschata] (e value: 7.1e-85; % identity: 66.67)
Query:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF
        MF+DD +E  PLTS+H QDH         E++E+L+FSDLP     SD   P    KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND 
Subjt:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF

Query:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR
        S       RT++  FW ++SRK + FRKRS+SL GLQ+SVS     KINLKR+SRSLDYRK++RQ+NSI    +EIDR+ S K+GLK D ++KKA SKPR
Subjt:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR

Query:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        WYLLMFGMVKFPAEM+L DIKSRQVRRSSSALFP NE+KGK+ C  RSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA

XP_022987522.1 uncharacterized protein LOC111485063 [Cucurbita maxima] (e value: 7.1e-85; % identity: 66.67)
Query:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF
        MF+DD +E  PLTS+H QDH         E++E+L+FSDLP     SD   P    KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND 
Subjt:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF

Query:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR
        S       RT+D  FW ++SRK + FRKRS+SL GLQ+SVS     KINLKR+SRSLDYRK++RQ+NSI    +EIDR+ S K+GLK D ++KKA SKPR
Subjt:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR

Query:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        WYLLMFGMVKFPAEM+L DIKSRQVRRSSSALFP +E+KGK+ C  RSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA

XP_038879756.1 uncharacterized protein LOC120071507 [Benincasa hispida] (e value: 1.3e-86; % identity: 69.66)
Query:  MFIDDANEPNPLTSKHNQ--DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSP---R
        MF+DD+ E NPL S H+Q  D ES+ESL+FSDLP     SD     E   KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND SP   R
Subjt:  MFIDDANEPNPLTSKHNQ--DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSP---R

Query:  TSDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGM
        T++  FW E+ +RK + FRKRS+SL GLQ+SV    S K NLKR+SRSLDYRK++RQ NSI    +EIDR+ +  +GLK D ++KKA SKPRWYLLMFGM
Subjt:  TSDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGM

Query:  VKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        VKFPAEM+LRDIKSRQVRRSSS LFP NENKGKF C RSSGEAAWRILRALSCKN  SVDVTA LTA
Subjt:  VKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BW36 uncharacterized protein LOC103494265 (e value: 1.5e-85; % identity: 69.92)
Query:  IDDANEPNPLTSKHNQ-----DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPR-T
        +DD +E NPLTSK NQ     D ESDESL+FSDLP    NSD     +   KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND S R T
Subjt:  IDDANEPNPLTSKHNQ-----DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPR-T

Query:  SDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMV
        +D  FW E+ SRK + FRKRS+SL GLQ+SV    S K NLKR+SRSLDYR+++RQ+NSI    +EIDR+ S K+GLK D ++KK  SKPRWYLLMFGMV
Subjt:  SDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMV

Query:  KFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        KFPAEMEL DIKSRQVRRSSS LFP NE K KF CGRSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  KFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA

A0A5A7UVS6 Uncharacterized protein (e value: 1.5e-85; % identity: 69.92)
Query:  IDDANEPNPLTSKHNQ-----DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPR-T
        +DD +E NPLTSK NQ     D ESDESL+FSDLP    NSD     +   KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND S R T
Subjt:  IDDANEPNPLTSKHNQ-----DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPR-T

Query:  SDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMV
        +D  FW E+ SRK + FRKRS+SL GLQ+SV    S K NLKR+SRSLDYR+++RQ+NSI    +EIDR+ S K+GLK D ++KK  SKPRWYLLMFGMV
Subjt:  SDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMV

Query:  KFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        KFPAEMEL DIKSRQVRRSSS LFP NE K KF CGRSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  KFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA

A0A5D3CL04 Uncharacterized protein (e value: 2.2e-84; % identity: 69.92)
Query:  IDDANEPNPLTSKHNQ-----DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPR-T
        +DD +E NPLTSK NQ     D ESDESL+FSDLP    NSD     +   KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND S R T
Subjt:  IDDANEPNPLTSKHNQ-----DHESDESLTFSDLPFHDPNSD-----EICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPR-T

Query:  SDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMV
        +D  FW E+ SRK + FRKRS+SL GLQ+SV    S K NLKR+SRSLDYR+++RQ+NSI    +EIDR+ S K+GLK D ++KK  SKPRWYLLMFGMV
Subjt:  SDNDFWNEQ-SRKSSGFRKRSDSLPGLQTSV----STKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMV

Query:  KFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        KFPAEMEL DIKSRQVRRSSS LFP NE K KF CGRSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  KFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA

A0A6J1E1L2 uncharacterized protein LOC111429953 (e value: 3.4e-85; % identity: 66.67)
Query:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF
        MF+DD +E  PLTS+H QDH         E++E+L+FSDLP     SD   P    KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND 
Subjt:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF

Query:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR
        S       RT++  FW ++SRK + FRKRS+SL GLQ+SVS     KINLKR+SRSLDYRK++RQ+NSI    +EIDR+ S K+GLK D ++KKA SKPR
Subjt:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR

Query:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        WYLLMFGMVKFPAEM+L DIKSRQVRRSSSALFP NE+KGK+ C  RSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA

A0A6J1JAL3 uncharacterized protein LOC111485063 (e value: 3.4e-85; % identity: 66.67)
Query:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF
        MF+DD +E  PLTS+H QDH         E++E+L+FSDLP     SD   P    KNPRRSSS+PLDLFEFF+AGFI SEISPAEDLIF GRLLPLND 
Subjt:  MFIDDANEPNPLTSKHNQDH---------ESDESLTFSDLPFHDPNSDEICP----KNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDF

Query:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR
        S       RT+D  FW ++SRK + FRKRS+SL GLQ+SVS     KINLKR+SRSLDYRK++RQ+NSI    +EIDR+ S K+GLK D ++KKA SKPR
Subjt:  SP------RTSDNDFWNEQSRKSSGFRKRSDSLPGLQTSVS----TKINLKRSSRSLDYRKIHRQSNSI----SEIDRDFSSKSGLKADLVSKKALSKPR

Query:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA
        WYLLMFGMVKFPAEM+L DIKSRQVRRSSSALFP +E+KGK+ C  RSSGEA WRILRALSCKN  SVDVTA LTA
Subjt:  WYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGKFQC-GRSSGEAAWRILRALSCKNDVSVDVTAPLTA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G30230.1 unknown protein (e value: 5.3e-30; % identity: 36)
Query:  PLTSKHNQDHESDESLTFSDLPFHDPNSDEICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPRTSDNDFWNEQSRKSSGFRKR
        P   +H+ + E +++L+  DLP    N +    ++ +  S+   +LFEF T+   + +++PAE++IF G+L+PLN        N F++     S   R R
Subjt:  PLTSKHNQDHESDESLTFSDLPFHDPNSDEICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPRTSDNDFWNEQSRKSSGFRKR

Query:  SDSLPGLQ---------TSVSTKINL--KRSSRSLDYRKIHRQSNSI-SEIDRDFSSKSGLKADLVSKKALS--KPRWYLLMFGMVKFPAEMELRDIKSR
        S+SL  +Q          +V+ + N    R+SRSLDYRK+ R   ++ S  +   S+K+  K +  S  ++   +PRWY++MFGMVKFP E+EL+DIKSR
Subjt:  SDSLPGLQ---------TSVSTKINL--KRSSRSLDYRKIHRQSNSI-SEIDRDFSSKSGLKADLVSKKALS--KPRWYLLMFGMVKFPAEMELRDIKSR

Query:  QVRRS-SSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAP
        Q+RR+    +FP   N+        S   +WR L ALSCK   SV  TAP
Subjt:  QVRRS-SSALFPVNENKGKFQCGRSSGEAAWRILRALSCKNDVSVDVTAP


Sequences
CDS sequence
ATGTCTTCTTTCCACACAATGTTCATCGACGACGCAAACGAACCAAATCCTCTAACCTCAAAACACAATCAAGATCACGAATCAGATGAATCTCTCACCTTCTCCGATCT
TCCATTCCACGATCCAAATTCCGACGAAATCTGCCCCAAGAATCCTCGCCGATCTTCCTCCCAGCCTCTGGATCTCTTCGAGTTCTTCACCGCCGGATTCATCGCCTCCG
AGATTTCTCCGGCCGAGGATCTGATCTTCCGCGGCAGGTTGCTTCCTCTCAACGATTTCTCTCCGCGTACCTCGGACAACGATTTCTGGAACGAACAGAGCCGAAAATCG
TCTGGATTCAGAAAACGATCCGATTCCTTGCCTGGATTGCAAACCTCTGTTTCCACGAAGATCAATCTCAAGCGAAGCAGCCGATCGCTCGATTACCGTAAGATTCATCG
TCAATCGAATTCGATTTCTGAAATTGATCGTGATTTTTCGAGTAAGAGCGGATTGAAGGCGGATCTGGTGAGTAAAAAGGCGTTATCGAAGCCGCGGTGGTACTTGCTGA
TGTTTGGAATGGTGAAGTTTCCGGCGGAGATGGAGCTCAGGGACATTAAGAGCAGACAGGTCCGGCGGAGTTCGTCGGCGCTTTTCCCTGTGAATGAGAACAAAGGTAAA
TTCCAGTGTGGCCGGAGCTCCGGCGAGGCGGCTTGGAGGATTCTTCGAGCGCTTAGCTGCAAGAACGACGTTAGTGTAGATGTAACGGCGCCGTTAACTGCCTGA
mRNA sequence
GTACAGTTCCAAACACACACACACCCATTTTCCATGTCTTCTTTCCACACAATGTTCATCGACGACGCAAACGAACCAAATCCTCTAACCTCAAAACACAATCAAGATCA
CGAATCAGATGAATCTCTCACCTTCTCCGATCTTCCATTCCACGATCCAAATTCCGACGAAATCTGCCCCAAGAATCCTCGCCGATCTTCCTCCCAGCCTCTGGATCTCT
TCGAGTTCTTCACCGCCGGATTCATCGCCTCCGAGATTTCTCCGGCCGAGGATCTGATCTTCCGCGGCAGGTTGCTTCCTCTCAACGATTTCTCTCCGCGTACCTCGGAC
AACGATTTCTGGAACGAACAGAGCCGAAAATCGTCTGGATTCAGAAAACGATCCGATTCCTTGCCTGGATTGCAAACCTCTGTTTCCACGAAGATCAATCTCAAGCGAAG
CAGCCGATCGCTCGATTACCGTAAGATTCATCGTCAATCGAATTCGATTTCTGAAATTGATCGTGATTTTTCGAGTAAGAGCGGATTGAAGGCGGATCTGGTGAGTAAAA
AGGCGTTATCGAAGCCGCGGTGGTACTTGCTGATGTTTGGAATGGTGAAGTTTCCGGCGGAGATGGAGCTCAGGGACATTAAGAGCAGACAGGTCCGGCGGAGTTCGTCG
GCGCTTTTCCCTGTGAATGAGAACAAAGGTAAATTCCAGTGTGGCCGGAGCTCCGGCGAGGCGGCTTGGAGGATTCTTCGAGCGCTTAGCTGCAAGAACGACGTTAGTGT
AGATGTAACGGCGCCGTTAACTGCCTGAATTGTGTGTCACGGGCGATGCACGTGACTGGCGGTTGCATTTGGCAATTCCGCGGGAAGGGTAGTTTTTGGGAATTTGAAAG
TTGAAATGAAAATTTAGGTAAATTATAAGTTTGTTTCTAGGCTTTAAACTTCAATCAAGTTCTAGAAGAGTTTTAAAAAAAAAATTAATTCATATAATTTTAGAAATTGA
AACTTAGTCTATATAATTTGATAACATTTAAC
Protein sequence
MSSFHTMFIDDANEPNPLTSKHNQDHESDESLTFSDLPFHDPNSDEICPKNPRRSSSQPLDLFEFFTAGFIASEISPAEDLIFRGRLLPLNDFSPRTSDNDFWNEQSRKS
SGFRKRSDSLPGLQTSVSTKINLKRSSRSLDYRKIHRQSNSISEIDRDFSSKSGLKADLVSKKALSKPRWYLLMFGMVKFPAEMELRDIKSRQVRRSSSALFPVNENKGK
FQCGRSSGEAAWRILRALSCKNDVSVDVTAPLTA
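
As a quick consistency check between the CDS and protein records above, the sketch below translates a nucleotide string codon by codon. It assumes the standard genetic code (the page does not name a translation table), and `translate` is a hypothetical helper, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS record, then its last six bases.
print(translate("ATGTCTTCTTTCCACACAATGTTCATCGAC"))
print(translate("GCCTGA"))
```

The two calls return `MSSFHTMFID` and `A`, which agree with the start and end of the protein record. Passing the full CDS with its line breaks removed would extend the same check to the whole sequence.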