; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002033 (gene) of Chayote v1 genome

Gene IDSed0002033
OrganismSechium edule (Chayote v1)
DescriptionSurvival motor neuron
Genome locationLG04:7763302..7770886
RNA-Seq ExpressionSed0002033
SyntenySed0002033
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR040424 - Survival motor neuron-like protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577733.1 hypothetical protein SDJN03_25307, partial [Cucurbita argyrosperma subsp. sororia]1.7e-10372.57Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        M LD +YWDDSM VKAM+EA++KYK MHG++VR  SA+GGG F GCG  KSDEP RSVDE+S I AN+V FEVNET+NTSEA ENI+VEPCPISC DFS 
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        AL++KETEQ  +E  +L+LKG++ YN+LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV AGS  G QWGTS+AY E+PVSASLPS N  I S  PSSYP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK
        I A  GPQSS   DGDIIKTAMDSAARAISS+ TVNK   EKESE H GIMPQ GAS ETDL  VLNAWY+AGF TG+Y+ EQ+ AKK
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK

XP_022923599.1 uncharacterized protein LOC111431235 [Cucurbita moschata]1.1e-10271.88Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        M LD ++WDDSM V+AM+EA++KYK MHG++VR  SA+GGG F GCG  KSDEP RSVDE+S I AN+V FEVNET+NTSEA ENI+VEPCPISC DFS 
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        AL++KETEQ  +E  +L+LKG++ YN+LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV AGS  G QWGTS+AY E+PVSAS PS N  IPS  PSSYP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK
        I A  GPQSS   DGDIIKTAMDSAARAISS+ TVNK   EKESE H GIMPQ GAS ETDL  VLNAWY+AGF TG+Y+ EQ+ AKK
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK

XP_022965361.1 uncharacterized protein LOC111465241 isoform X1 [Cucurbita maxima]2.5e-10273.76Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        M LD MYWDDSM VKAM+EA++KYK MHG+D+R  SA+GGG FNGCG  KSDEP RSVDE+S I AN+V FEVNET NTSEA ENI+VEPCPISC DFS 
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        AL++KETEQ  IE  +L+L+G++ YN+LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV AGS  G QWGTS+AY E+PVSAS PS N AIPS  PSSYP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAE
        I A  GPQSS L DGDIIKTAMDSAARAISS+ TVNK   EKESE H GIMPQ GAS ETDL  VLNAWY+AGF TG+ V +
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAE

XP_022965362.1 uncharacterized protein LOC111465241 isoform X2 [Cucurbita maxima]1.1e-10573.96Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        M LD MYWDDSM VKAM+EA++KYK MHG+D+R  SA+GGG FNGCG  KSDEP RSVDE+S I AN+V FEVNET NTSEA ENI+VEPCPISC DFS 
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        AL++KETEQ  IE  +L+L+G++ YN+LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV AGS  G QWGTS+AY E+PVSAS PS N AIPS  PSSYP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK
        I A  GPQSS L DGDIIKTAMDSAARAISS+ TVNK   EKESE H GIMPQ GAS ETDL  VLNAWY+AGF TG+Y+ EQ+ AKK
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK

XP_023551935.1 uncharacterized protein LOC111809760 isoform X2 [Cucurbita pepo subsp. pepo]1.2e-10472.92Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        M LD MYWDDSM VKAM+EA++KYK MHG++VR  SA+GGG FNGCG  KSDEP RSVDE+S I AN+V FEVNE +NTSEA ENI+VEPCPISC DFS 
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        AL++KETEQ  ++  +L+LKG++ YN+LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV AGS  G QWGTS+AY E+PVSAS PS N AIPS  PSSYP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK
        + A  GPQSS   DGDIIKTAMDSAARAISS+ TVNK   EKESE H GIMPQSGAS ETDL  VLNAWY+AGF TG+Y+ EQ+ AKK
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK

TrEMBL top hitse value%identityAlignment
A0A1S3BL18 uncharacterized protein LOC103490751 isoform X36.6e-9367.83Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        MGLD MYWD+SM VKAM+EA++KYK+MHGH+V C SA+GGG  N CG  KSDE  RSVDE+S    NNV+FEV ET++T EA ENI VE   I+C DFSD
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        ALH++ET++ P+E  DL     E+YN LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV+AGS  G QWGTS+A  E+PVSAS PSH   IPS  P+ YP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAI-SSVNTVNKEKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK
        ILA  GPQSS L D DIIKTAMDSA RAI SS+ TVNK KES++H  IMPQSG S ETDL  VLNAWY+AGF TG+Y+ EQ++AKK
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAI-SSVNTVNKEKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK

A0A6J1CU98 uncharacterized protein LOC111014833 isoform X31.6e-9167.25Show/hide
Query:  MYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSDALHLK
        M  DDS  V AM EA++KYK+MHGH++   S +GG  FNG G  +SDEP R  DE SNIEANNV+FEV+E +NTS  NENI+VEPCPISC DFSDALH+K
Subjt:  MYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSDALHLK

Query:  ETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYPILAAT
        ET+QGPIE  +L+LKG E YN+LL+QYYELEEKRQKVL+QLY     GWNY DVSAGS+ G QWGTSSAY E+PV AS  SHNHAI + +PSSYPI    
Subjt:  ETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYPILAAT

Query:  GPQSSYLPDGDIIKTAMDSAARAISSVNT-------VNKEKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK
        GPQSS L DGDIIKTAMD+AARAISS+ T       VNKEK SE+  GIMPQS AS ETDL  V NAWY+AGF TG+Y+ EQ+YAKK
Subjt:  GPQSSYLPDGDIIKTAMDSAARAISSVNT-------VNKEKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK

A0A6J1E6W0 uncharacterized protein LOC1114312355.4e-10371.88Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        M LD ++WDDSM V+AM+EA++KYK MHG++VR  SA+GGG F GCG  KSDEP RSVDE+S I AN+V FEVNET+NTSEA ENI+VEPCPISC DFS 
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        AL++KETEQ  +E  +L+LKG++ YN+LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV AGS  G QWGTS+AY E+PVSAS PS N  IPS  PSSYP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK
        I A  GPQSS   DGDIIKTAMDSAARAISS+ TVNK   EKESE H GIMPQ GAS ETDL  VLNAWY+AGF TG+Y+ EQ+ AKK
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK

A0A6J1HK48 uncharacterized protein LOC111465241 isoform X25.2e-10673.96Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        M LD MYWDDSM VKAM+EA++KYK MHG+D+R  SA+GGG FNGCG  KSDEP RSVDE+S I AN+V FEVNET NTSEA ENI+VEPCPISC DFS 
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        AL++KETEQ  IE  +L+L+G++ YN+LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV AGS  G QWGTS+AY E+PVSAS PS N AIPS  PSSYP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK
        I A  GPQSS L DGDIIKTAMDSAARAISS+ TVNK   EKESE H GIMPQ GAS ETDL  VLNAWY+AGF TG+Y+ EQ+ AKK
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKK

A0A6J1HLH3 uncharacterized protein LOC111465241 isoform X11.2e-10273.76Show/hide
Query:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD
        M LD MYWDDSM VKAM+EA++KYK MHG+D+R  SA+GGG FNGCG  KSDEP RSVDE+S I AN+V FEVNET NTSEA ENI+VEPCPISC DFS 
Subjt:  MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSD

Query:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP
        AL++KETEQ  IE  +L+L+G++ YN+LLKQYYELEEKRQKVLEQLYQCGA GWNYQDV AGS  G QWGTS+AY E+PVSAS PS N AIPS  PSSYP
Subjt:  ALHLKETEQGPIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYP

Query:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAE
        I A  GPQSS L DGDIIKTAMDSAARAISS+ TVNK   EKESE H GIMPQ GAS ETDL  VLNAWY+AGF TG+ V +
Subjt:  ILAATGPQSSYLPDGDIIKTAMDSAARAISSVNTVNK---EKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCTAGACAGTATGTACTGGGACGATTCCATGTTCGTCAAAGCCATGAACGAAGCTATAATGAAGTATAAGGTAATGCATGGACATGATGTCCGTTGTGGTTCAGC
TGATGGAGGAGGTGATTTTAACGGTTGTGGTTGTCATAAGAGTGACGAGCCGACGAGGAGTGTAGATGAAGATAGCAATATTGAAGCAAATAATGTTGACTTTGAAGTCA
ATGAGACTTCAAATACCTCAGAAGCTAATGAAAATATCACCGTAGAGCCATGTCCTATATCTTGTGAGGATTTTTCAGATGCTCTACATTTGAAAGAGACGGAGCAGGGG
CCCATTGAGCACCCCGATTTACATCTAAAAGGAAAAGAGGACTATAACCAGCTACTCAAACAATATTATGAGCTTGAGGAGAAGAGGCAGAAGGTTCTAGAACAGCTGTA
TCAATGTGGTGCTGATGGTTGGAACTACCAGGATGTCAGTGCAGGGTCTGCCAATGGATATCAATGGGGAACATCATCTGCTTATCTAGAAAACCCAGTCTCTGCAAGCC
TACCTTCTCATAATCATGCAATACCCTCCAACTATCCCTCCAGTTATCCAATTTTAGCTGCTACAGGTCCTCAAAGTTCATACCTTCCTGATGGTGACATTATCAAAACT
GCAATGGATTCTGCAGCAAGAGCTATATCCTCTGTGAACACTGTAAATAAAGAAAAAGAAAGCGAGAAACACGGTGGGATAATGCCTCAAAGTGGTGCTAGCTTTGAAAC
AGACCTTTTTCCTGTTTTAAATGCTTGGTATAATGCAGGCTTCTGCACTGGCCAATATGTTGCTGAGCAAAATTATGCCAAGAAACAGTGA
mRNA sequenceShow/hide mRNA sequence
CAGAAATACGGGTCGGGTCAATCCACAAAACCCGAAAGTAGATTCGGGTCGGTATCGCTAATGCTTGACACCGTAGAAAGGTCGCGGCAAGCTGAGGCAAGAACACATTT
CCGTGAAGTTATTTTCTCTGTTTCTTTGTTTTCGCTCTCAGATTTCAATCGAGAAGATGGGGCTAGACAGTATGTACTGGGACGATTCCATGTTCGTCAAAGCCATGAAC
GAAGCTATAATGAAGTATAAGGTAATGCATGGACATGATGTCCGTTGTGGTTCAGCTGATGGAGGAGGTGATTTTAACGGTTGTGGTTGTCATAAGAGTGACGAGCCGAC
GAGGAGTGTAGATGAAGATAGCAATATTGAAGCAAATAATGTTGACTTTGAAGTCAATGAGACTTCAAATACCTCAGAAGCTAATGAAAATATCACCGTAGAGCCATGTC
CTATATCTTGTGAGGATTTTTCAGATGCTCTACATTTGAAAGAGACGGAGCAGGGGCCCATTGAGCACCCCGATTTACATCTAAAAGGAAAAGAGGACTATAACCAGCTA
CTCAAACAATATTATGAGCTTGAGGAGAAGAGGCAGAAGGTTCTAGAACAGCTGTATCAATGTGGTGCTGATGGTTGGAACTACCAGGATGTCAGTGCAGGGTCTGCCAA
TGGATATCAATGGGGAACATCATCTGCTTATCTAGAAAACCCAGTCTCTGCAAGCCTACCTTCTCATAATCATGCAATACCCTCCAACTATCCCTCCAGTTATCCAATTT
TAGCTGCTACAGGTCCTCAAAGTTCATACCTTCCTGATGGTGACATTATCAAAACTGCAATGGATTCTGCAGCAAGAGCTATATCCTCTGTGAACACTGTAAATAAAGAA
AAAGAAAGCGAGAAACACGGTGGGATAATGCCTCAAAGTGGTGCTAGCTTTGAAACAGACCTTTTTCCTGTTTTAAATGCTTGGTATAATGCAGGCTTCTGCACTGGCCA
ATATGTTGCTGAGCAAAATTATGCCAAGAAACAGTGAAAAGTTAAAAACCAAGCTTCTTCATCTAGCCTCATGTAGCTAATTTTGCCCCTTTTCACACTCATGGCTTATC
TGCTTTTATGCCTGCACCTCTATAAATCTTCTTTTCACTGAACTTTTGCTTGTATAGAACAAACTCAACTCTGATCTAAATGAGTTTGAGTTTTATTGTATTCGGTCGAA
ATTTTGAGAAAGATCTAAATCTTTTTATTTTTATTAACTTTCCACAAATATATATAATATATGGAATATTT
Protein sequenceShow/hide protein sequence
MGLDSMYWDDSMFVKAMNEAIMKYKVMHGHDVRCGSADGGGDFNGCGCHKSDEPTRSVDEDSNIEANNVDFEVNETSNTSEANENITVEPCPISCEDFSDALHLKETEQG
PIEHPDLHLKGKEDYNQLLKQYYELEEKRQKVLEQLYQCGADGWNYQDVSAGSANGYQWGTSSAYLENPVSASLPSHNHAIPSNYPSSYPILAATGPQSSYLPDGDIIKT
AMDSAARAISSVNTVNKEKESEKHGGIMPQSGASFETDLFPVLNAWYNAGFCTGQYVAEQNYAKKQ