; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021173 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021173
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionSurvival motor neuron
Genome locationscaffold6:47168399..47175276
RNA-Seq ExpressionSpg021173
SyntenySpg021173
Gene Ontology termsGO:0005634 - nucleus (cellular component)
InterPro domainsIPR040424 - Survival motor neuron-like protein 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022923599.1 uncharacterized protein LOC111431235 [Cucurbita moschata]1.2e-11472.05Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        M LDR++WDDSMIV+AMDEA+LKYK MHG+EV  VSAEGG VF GCGKSDEP+                       RSVDEES I AN+  FEVNET NT
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
        SEA ENI+VEPCPISCT+FS ALYVKET+QE +E SNLNLKGE+GYN+LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV A SDIG QWGTS+AYQEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG
        VSASQPS N  IPSY PSSYPI AGPQSSS ADGDIIKTAMDSAARAISS+KTV K   EKESE HDGIMPQ  ASSET+L  VLNAWYSAGFYTGK   
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG

Query:  GQGKDFRERLLRYLVEQYHAKK
                    YLVEQ  AKK
Subjt:  GQGKDFRERLLRYLVEQYHAKK

XP_022965361.1 uncharacterized protein LOC111465241 isoform X1 [Cucurbita maxima]1.4e-11576.77Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        M LDRMYWDDSMIVKAMDEA+LKYK MHG+++  VSAEGG VFNGCGKSDEP+                       RSVDEES I AN+  FEVNET NT
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
        SEA ENI+VEPCPISCT+FS ALYVKET+QE IE SNLNL+GE+GYN+LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV A SDIG QWGTS+AYQEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGK
        VSASQPS N AIPSY PSSYPI AGPQSSSLADGDIIKTAMDSAARAISS+KTV K   EKESE HDGIMPQ  ASSET+L  VLNAWYSAGFYTGK
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGK

XP_022965362.1 uncharacterized protein LOC111465241 isoform X2 [Cucurbita maxima]7.4e-11773.29Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        M LDRMYWDDSMIVKAMDEA+LKYK MHG+++  VSAEGG VFNGCGKSDEP+                       RSVDEES I AN+  FEVNET NT
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
        SEA ENI+VEPCPISCT+FS ALYVKET+QE IE SNLNL+GE+GYN+LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV A SDIG QWGTS+AYQEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG
        VSASQPS N AIPSY PSSYPI AGPQSSSLADGDIIKTAMDSAARAISS+KTV K   EKESE HDGIMPQ  ASSET+L  VLNAWYSAGFYTGK   
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG

Query:  GQGKDFRERLLRYLVEQYHAKK
                    YLVEQ  AKK
Subjt:  GQGKDFRERLLRYLVEQYHAKK

XP_023551933.1 uncharacterized protein LOC111809760 isoform X1 [Cucurbita pepo subsp. pepo]2.6e-11476.09Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        M LDRMYWDDSMIVKAMDEA+LKYK MHG+EV  VSAEGG VFNGCGKSDEP+                       RSVDEES I AN+  FEVNE  NT
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
        SEA ENI+VEPCPISCT+FS ALYVKET+QE ++ SNLNLKGE+GYN+LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV A SDIG QWGTS+AYQEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGK
        VSASQPS N AIPSY PSSYP+ AGPQSSS ADGDIIKTAMDSAARAISS+KTV K   EKESE H GIMPQS ASSET+L  VLNAWYSAGFYTGK
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGK

XP_023551935.1 uncharacterized protein LOC111809760 isoform X2 [Cucurbita pepo subsp. pepo]1.4e-11572.67Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        M LDRMYWDDSMIVKAMDEA+LKYK MHG+EV  VSAEGG VFNGCGKSDEP+                       RSVDEES I AN+  FEVNE  NT
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
        SEA ENI+VEPCPISCT+FS ALYVKET+QE ++ SNLNLKGE+GYN+LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV A SDIG QWGTS+AYQEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG
        VSASQPS N AIPSY PSSYP+ AGPQSSS ADGDIIKTAMDSAARAISS+KTV K   EKESE H GIMPQS ASSET+L  VLNAWYSAGFYTGK   
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG

Query:  GQGKDFRERLLRYLVEQYHAKK
                    YLVEQ  AKK
Subjt:  GQGKDFRERLLRYLVEQYHAKK

TrEMBL top hitse value%identityAlignment
A0A1S3BL18 uncharacterized protein LOC103490751 isoform X35.0e-10368.12Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        MGLD+MYWD+SMIVKAMDEA+LKYKIMHGHEV CVSAEGG V N CGKSDE K                       RSVDEES    NN EFEV ET +T
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
         EA ENI VE   I+CT+FSDAL+V+ETQ+EP+E S+L     E YN LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV+A SD+GTQWGTS+A QEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAI-SSVKTVTKEKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGGGQ
        VSASQPSH   IPSY P+ YPILAGPQSSSL D DIIKTAMDSA RAI SS+KTV K KES+RHD IMPQS  SSET+LA VLNAWYSAGFYTGK     
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAI-SSVKTVTKEKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGGGQ

Query:  GKDFRERLLRYLVEQYHAKK
                  YL+EQ HAKK
Subjt:  GKDFRERLLRYLVEQYHAKK

A0A6J1CU98 uncharacterized protein LOC111014833 isoform X33.3e-10266.98Show/hide
Query:  MYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANTSEANE
        M  DDS +V AM+EA+LKYKIMHGHE+  +S EGGE FNG G+SDEPK                       R  DE+SNIEANN  FEV+E  NTS  NE
Subjt:  MYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANTSEANE

Query:  NITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHPVSASQ
        NI+VEPCPISC +FSDAL+VKETQQ PIE SNLNLKG EGYN+LL+QYYELEEKRQKVL+QLY    GGWNY DVSA S +GTQWGTSSAYQEHPV ASQ
Subjt:  NITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHPVSASQ

Query:  PSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKT-------VTKEKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGGG
         SHNHAI + WPSSYPI  GPQSSSLADGDIIKTAMD+AARAISS+ T       V KEK SER DGIMPQS ASSET+LAAV NAWYSAGFYTGK    
Subjt:  PSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKT-------VTKEKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGGG

Query:  QGKDFRERLLRYLVEQYHAKK
                   YLVEQ +AKK
Subjt:  QGKDFRERLLRYLVEQYHAKK

A0A6J1E6W0 uncharacterized protein LOC1114312355.7e-11572.05Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        M LDR++WDDSMIV+AMDEA+LKYK MHG+EV  VSAEGG VF GCGKSDEP+                       RSVDEES I AN+  FEVNET NT
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
        SEA ENI+VEPCPISCT+FS ALYVKET+QE +E SNLNLKGE+GYN+LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV A SDIG QWGTS+AYQEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG
        VSASQPS N  IPSY PSSYPI AGPQSSS ADGDIIKTAMDSAARAISS+KTV K   EKESE HDGIMPQ  ASSET+L  VLNAWYSAGFYTGK   
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG

Query:  GQGKDFRERLLRYLVEQYHAKK
                    YLVEQ  AKK
Subjt:  GQGKDFRERLLRYLVEQYHAKK

A0A6J1HK48 uncharacterized protein LOC111465241 isoform X23.6e-11773.29Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        M LDRMYWDDSMIVKAMDEA+LKYK MHG+++  VSAEGG VFNGCGKSDEP+                       RSVDEES I AN+  FEVNET NT
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
        SEA ENI+VEPCPISCT+FS ALYVKET+QE IE SNLNL+GE+GYN+LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV A SDIG QWGTS+AYQEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG
        VSASQPS N AIPSY PSSYPI AGPQSSSLADGDIIKTAMDSAARAISS+KTV K   EKESE HDGIMPQ  ASSET+L  VLNAWYSAGFYTGK   
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGG

Query:  GQGKDFRERLLRYLVEQYHAKK
                    YLVEQ  AKK
Subjt:  GQGKDFRERLLRYLVEQYHAKK

A0A6J1HLH3 uncharacterized protein LOC111465241 isoform X16.7e-11676.77Show/hide
Query:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT
        M LDRMYWDDSMIVKAMDEA+LKYK MHG+++  VSAEGG VFNGCGKSDEP+                       RSVDEES I AN+  FEVNET NT
Subjt:  MGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVDEESNIEANNFEFEVNETANT

Query:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP
        SEA ENI+VEPCPISCT+FS ALYVKET+QE IE SNLNL+GE+GYN+LLKQYYELEEKRQKVLEQLYQC AGGWNYQDV A SDIG QWGTS+AYQEHP
Subjt:  SEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQWGTSSAYQEHP

Query:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGK
        VSASQPS N AIPSY PSSYPI AGPQSSSLADGDIIKTAMDSAARAISS+KTV K   EKESE HDGIMPQ  ASSET+L  VLNAWYSAGFYTGK
Subjt:  VSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTK---EKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCGCGGCAAGCTGAGGCGGGAACACATATTTCCTTCAGGTTATTTCTTTCCCTTCTGCAATCGCATCACTCTCAGAATTCAATCGAGAGGATGGGGCTGGACAGAATGTA
CTGGGATGATTCCATGATTGTCAAAGCCATGGATGAAGCTATCTTGAAGTATAAGATAATGCATGGACATGAAGTTCTCTGTGTTTCAGCCGAGGGAGGAGAGGTCTTTA
ACGGCTGCGGTAAGAGTGACGAGCCAAAAAGTTTTGTGATCACGGTAACTGCGGAGGCAGTTAGAGGTGCTTGGTTTAGGTTGGATATTCTGGAGAACAGGAGTGTAGAT
GAAGAGAGCAATATTGAAGCAAATAATTTTGAGTTCGAAGTCAATGAGACTGCAAATACCTCAGAAGCTAATGAAAATATCACTGTAGAGCCATGTCCTATATCTTGTAC
GGAATTTTCAGATGCTCTATATGTGAAAGAGACCCAACAGGAGCCCATTGAGCACTCCAATTTAAATCTAAAAGGTGAAGAGGGCTATAACCAGCTACTCAAGCAGTATT
ATGAGCTTGAGGAGAAGAGGCAAAAGGTTCTAGAACAGCTGTATCAATGTAGTGCTGGTGGTTGGAACTACCAGGATGTCAGCGCAGAGTCTGACATTGGAACTCAATGG
GGAACATCTTCTGCTTATCAAGAACACCCAGTCTCTGCAAGTCAACCTTCTCATAACCATGCAATTCCCTCCTATTGGCCCTCCAGTTATCCAATTTTAGCTGGTCCTCA
AAGTTCGTCCCTTGCTGATGGTGACATTATCAAAACTGCAATGGATTCTGCAGCAAGAGCTATATCCTCCGTGAAAACTGTAACTAAAGAGAAAGAGAGCGAGAGACATG
ATGGGATAATGCCTCAAAGTGATGCTAGCTCTGAAACAGAACTTGCTGCTGTTTTAAATGCTTGGTATTCTGCAGGCTTCTACACTGGCAAACTTGGAGGTGGCCAAGGC
AAAGATTTTAGGGAGCGCCTCCTAAGATACCTTGTGGAGCAATATCATGCCAAGAAACAGTGA
mRNA sequenceShow/hide mRNA sequence
TCGCGGCAAGCTGAGGCGGGAACACATATTTCCTTCAGGTTATTTCTTTCCCTTCTGCAATCGCATCACTCTCAGAATTCAATCGAGAGGATGGGGCTGGACAGAATGTA
CTGGGATGATTCCATGATTGTCAAAGCCATGGATGAAGCTATCTTGAAGTATAAGATAATGCATGGACATGAAGTTCTCTGTGTTTCAGCCGAGGGAGGAGAGGTCTTTA
ACGGCTGCGGTAAGAGTGACGAGCCAAAAAGTTTTGTGATCACGGTAACTGCGGAGGCAGTTAGAGGTGCTTGGTTTAGGTTGGATATTCTGGAGAACAGGAGTGTAGAT
GAAGAGAGCAATATTGAAGCAAATAATTTTGAGTTCGAAGTCAATGAGACTGCAAATACCTCAGAAGCTAATGAAAATATCACTGTAGAGCCATGTCCTATATCTTGTAC
GGAATTTTCAGATGCTCTATATGTGAAAGAGACCCAACAGGAGCCCATTGAGCACTCCAATTTAAATCTAAAAGGTGAAGAGGGCTATAACCAGCTACTCAAGCAGTATT
ATGAGCTTGAGGAGAAGAGGCAAAAGGTTCTAGAACAGCTGTATCAATGTAGTGCTGGTGGTTGGAACTACCAGGATGTCAGCGCAGAGTCTGACATTGGAACTCAATGG
GGAACATCTTCTGCTTATCAAGAACACCCAGTCTCTGCAAGTCAACCTTCTCATAACCATGCAATTCCCTCCTATTGGCCCTCCAGTTATCCAATTTTAGCTGGTCCTCA
AAGTTCGTCCCTTGCTGATGGTGACATTATCAAAACTGCAATGGATTCTGCAGCAAGAGCTATATCCTCCGTGAAAACTGTAACTAAAGAGAAAGAGAGCGAGAGACATG
ATGGGATAATGCCTCAAAGTGATGCTAGCTCTGAAACAGAACTTGCTGCTGTTTTAAATGCTTGGTATTCTGCAGGCTTCTACACTGGCAAACTTGGAGGTGGCCAAGGC
AAAGATTTTAGGGAGCGCCTCCTAAGATACCTTGTGGAGCAATATCATGCCAAGAAACAGTGA
Protein sequenceShow/hide protein sequence
SRQAEAGTHISFRLFLSLLQSHHSQNSIERMGLDRMYWDDSMIVKAMDEAILKYKIMHGHEVLCVSAEGGEVFNGCGKSDEPKSFVITVTAEAVRGAWFRLDILENRSVD
EESNIEANNFEFEVNETANTSEANENITVEPCPISCTEFSDALYVKETQQEPIEHSNLNLKGEEGYNQLLKQYYELEEKRQKVLEQLYQCSAGGWNYQDVSAESDIGTQW
GTSSAYQEHPVSASQPSHNHAIPSYWPSSYPILAGPQSSSLADGDIIKTAMDSAARAISSVKTVTKEKESERHDGIMPQSDASSETELAAVLNAWYSAGFYTGKLGGGQG
KDFRERLLRYLVEQYHAKKQ