; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0023472 (gene) of Chayote v1 genome

Gene IDSed0023472
OrganismSechium edule (Chayote v1)
DescriptionMyosin heavy chain
Genome locationLG07:6281223..6283541
RNA-Seq ExpressionSed0023472
SyntenySed0023472
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050514.1 myosin heavy chain [Cucumis melo var. makuwa]6.8e-7069.09Show/hide
Query:  ERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD-
        E    S+TPP KLSLFSL R  PEPPGMVTPPLHASISVPFQWEEAPGKPRP GI++  NSKPKS RSLDLPPRLFADAKVAHF SP  A      GRD 
Subjt:  ERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD-

Query:  ---LSFRFPDSWAE------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIY
           LSFRFPD+WAE      TA    E K GK+VG RRWMSFRKNK+IP SGSEI+V+ GG    G+ DG TRVKITRFRSRRSFF K NS SH IA+IY
Subjt:  ---LSFRFPDSWAE------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIY

Query:  GSLKQVFPWRRKQDETTKLS
        GSLKQ   WRRK DE   +S
Subjt:  GSLKQVFPWRRKQDETTKLS

TYK29189.1 myosin heavy chain [Cucumis melo var. makuwa]1.2e-6968.47Show/hide
Query:  ERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD-
        E    S+TPP KLSLFSL R  PEPPGMVTPPLHASISVPFQWEEAPGKPRP GI++  NSKPKS RSLDLPPRLFADAKVAHF SP  A      GRD 
Subjt:  ERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD-

Query:  ---LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIAS
           LSFRFPD+WAE        TA    E K GK+VG RRWMSFRKNK+IP SGSEI+V+ GG    G+ DG TRVKITRFRSRRSFF K NS SH IA+
Subjt:  ---LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIAS

Query:  IYGSLKQVFPWRRKQDETTKLS
        IYGSLKQ   WRRK DE   +S
Subjt:  IYGSLKQVFPWRRKQDETTKLS

XP_004146225.1 uncharacterized protein At4g00950 [Cucumis sativus]1.4e-6768.33Show/hide
Query:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPA------IAGRD
        T  N SST  P KLSLFSL R  PEPPGMVTPPLHASISVPFQWEEAPGKPRP GI++  NSKP+S RSLDLPPRLFADAKVAHF SP       I G D
Subjt:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPA------IAGRD

Query:  ----LSFRFPDSWAE----TAEMAEEDKGGKFVGYRRWMSFRKNKK--IPTSGSEISVSAG----GGADDGGTRVKITRFRSRRSFFGKSNSNSHLIASI
            LSFRFPD+WAE    TA   +E+K GK VG RRWMSFRKNKK  IP SG EI+V+ G    GG+ DG TRVKITRFRS+RSFF K NS SH IA+I
Subjt:  ----LSFRFPDSWAE----TAEMAEEDKGGKFVGYRRWMSFRKNKK--IPTSGSEISVSAG----GGADDGGTRVKITRFRSRRSFFGKSNSNSHLIASI

Query:  YGSLKQVFPWRRKQDETTKLS
        YGSLKQV  WRRK DE   +S
Subjt:  YGSLKQVFPWRRKQDETTKLS

XP_008466837.1 PREDICTED: uncharacterized protein LOC103504144 [Cucumis melo]1.2e-6968.61Show/hide
Query:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD
        T  N SST  P KLSLFSL R  PEPPGMVTPPLHASISVPFQWEEAPGKPRP GI++  NSKPKS RSLDLPPRLFADAKVAHF SP  A      GRD
Subjt:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD

Query:  ----LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIA
            LSFRFPD+WAE        TA    E K GK+VG RRWMSFRKNK+IP SGSEI+V+ GG    G+ DG TRVKITRFRSRRSFF K NS SH IA
Subjt:  ----LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIA

Query:  SIYGSLKQVFPWRRKQDETTKLS
        +IYGSLKQ   WRRK DE   +S
Subjt:  SIYGSLKQVFPWRRKQDETTKLS

XP_022984515.1 uncharacterized protein At4g00950-like [Cucurbita maxima]1.1e-6767.27Show/hide
Query:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIAGRD------
        T  N SST  P KLSLFSL R  PEPPG+VTPPLHASISVPFQWEEAPGKPRP GI++  NSKP+S RSLDLPPRLF D KVAHF SP  A  D      
Subjt:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIAGRD------

Query:  ----LSFRFPDSWAETAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG--------GADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIYG
            LSFRFPD+WAETA    E K GK+VG RRWMSFRKNK++P   SEI  SAGG        G+ +G TRVKITRFRSRRSF  KS+S S LIASIYG
Subjt:  ----LSFRFPDSWAETAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG--------GADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIYG

Query:  SLKQVFPWRRKQDETTKLSQ
        SLKQV PWRRK DET  +SQ
Subjt:  SLKQVFPWRRKQDETTKLSQ

TrEMBL top hitse value%identityAlignment
A0A1S3CS64 uncharacterized protein LOC1035041445.6e-7068.61Show/hide
Query:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD
        T  N SST  P KLSLFSL R  PEPPGMVTPPLHASISVPFQWEEAPGKPRP GI++  NSKPKS RSLDLPPRLFADAKVAHF SP  A      GRD
Subjt:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD

Query:  ----LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIA
            LSFRFPD+WAE        TA    E K GK+VG RRWMSFRKNK+IP SGSEI+V+ GG    G+ DG TRVKITRFRSRRSFF K NS SH IA
Subjt:  ----LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIA

Query:  SIYGSLKQVFPWRRKQDETTKLS
        +IYGSLKQ   WRRK DE   +S
Subjt:  SIYGSLKQVFPWRRKQDETTKLS

A0A5A7U5K0 Myosin heavy chain3.3e-7069.09Show/hide
Query:  ERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD-
        E    S+TPP KLSLFSL R  PEPPGMVTPPLHASISVPFQWEEAPGKPRP GI++  NSKPKS RSLDLPPRLFADAKVAHF SP  A      GRD 
Subjt:  ERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD-

Query:  ---LSFRFPDSWAE------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIY
           LSFRFPD+WAE      TA    E K GK+VG RRWMSFRKNK+IP SGSEI+V+ GG    G+ DG TRVKITRFRSRRSFF K NS SH IA+IY
Subjt:  ---LSFRFPDSWAE------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIY

Query:  GSLKQVFPWRRKQDETTKLS
        GSLKQ   WRRK DE   +S
Subjt:  GSLKQVFPWRRKQDETTKLS

A0A5D3E0H1 Myosin heavy chain5.6e-7068.47Show/hide
Query:  ERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD-
        E    S+TPP KLSLFSL R  PEPPGMVTPPLHASISVPFQWEEAPGKPRP GI++  NSKPKS RSLDLPPRLFADAKVAHF SP  A      GRD 
Subjt:  ERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD-

Query:  ---LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIAS
           LSFRFPD+WAE        TA    E K GK+VG RRWMSFRKNK+IP SGSEI+V+ GG    G+ DG TRVKITRFRSRRSFF K NS SH IA+
Subjt:  ---LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIAS

Query:  IYGSLKQVFPWRRKQDETTKLS
        IYGSLKQ   WRRK DE   +S
Subjt:  IYGSLKQVFPWRRKQDETTKLS

A0A6J1J8U1 uncharacterized protein At4g00950-like5.3e-6867.27Show/hide
Query:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIAGRD------
        T  N SST  P KLSLFSL R  PEPPG+VTPPLHASISVPFQWEEAPGKPRP GI++  NSKP+S RSLDLPPRLF D KVAHF SP  A  D      
Subjt:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIAGRD------

Query:  ----LSFRFPDSWAETAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG--------GADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIYG
            LSFRFPD+WAETA    E K GK+VG RRWMSFRKNK++P   SEI  SAGG        G+ +G TRVKITRFRSRRSF  KS+S S LIASIYG
Subjt:  ----LSFRFPDSWAETAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG--------GADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIYG

Query:  SLKQVFPWRRKQDETTKLSQ
        SLKQV PWRRK DET  +SQ
Subjt:  SLKQVFPWRRKQDETTKLSQ

E5GBA5 Uncharacterized protein5.6e-7068.61Show/hide
Query:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD
        T  N SST  P KLSLFSL R  PEPPGMVTPPLHASISVPFQWEEAPGKPRP GI++  NSKPKS RSLDLPPRLFADAKVAHF SP  A      GRD
Subjt:  TERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIA------GRD

Query:  ----LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIA
            LSFRFPD+WAE        TA    E K GK+VG RRWMSFRKNK+IP SGSEI+V+ GG    G+ DG TRVKITRFRSRRSFF K NS SH IA
Subjt:  ----LSFRFPDSWAE--------TAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGG----GADDGGTRVKITRFRSRRSFFGKSNSNSHLIA

Query:  SIYGSLKQVFPWRRKQDETTKLS
        +IYGSLKQ   WRRK DE   +S
Subjt:  SIYGSLKQVFPWRRKQDETTKLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G46535.1 unknown protein5.7e-0628Show/hide
Query:  PEPPGMVTPPLHASISVPFQWEEAPGKPR-PVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIAGRDLSFRFPDSWAETAEMAEEDKGGKFVGYR
        P  P +   P+H   SVPF WE+ PGKP+ P+    R  S PK    LDLPPRL    +      P                       E K G      
Subjt:  PEPPGMVTPPLHASISVPFQWEEAPGKPR-PVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIAGRDLSFRFPDSWAETAEMAEEDKGGKFVGYR

Query:  RWMSFRKNKKIPTSGSEISVSAGGGADD--GGTRVKITRFRSRRSFF-GKSNSNSHLIASIYGSLKQVFPWRRKQ
        R++  +    +   G+ + +S    A D      +KI +F    S+  G S   SH   S+   LK   PW+ K+
Subjt:  RWMSFRKNKKIPTSGSEISVSAGGGADD--GGTRVKITRFRSRRSFF-GKSNSNSHLIASIYGSLKQVFPWRRKQ

AT4G00950.1 Protein of unknown function (DUF688)5.3e-0440Show/hide
Query:  VTPPLHASI--SVPFQWEEAPGKPRPVGIVDRSNSKPKSV-----------RSLDLPPRL
        ++ P+H+SI  SVPF WEE PGKP+       S+S    +           +SL+LPPRL
Subjt:  VTPPLHASI--SVPFQWEEAPGKPRPVGIVDRSNSKPKSV-----------RSLDLPPRL

AT4G27810.1 unknown protein8.4e-1833.84Show/hide
Query:  KLSLFSLT-RPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVG----IVDRSNSKPKS----VRSLDLPPRLFADAKVAHFESPAIAGRDLSFRFPDSW
        KL LFS+      + PG+ TPP++ + SVPF WEEAPGKPR       +  + N +       VR L+LPPRLF  A      +  + G    +  P   
Subjt:  KLSLFSLT-RPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVG----IVDRSNSKPKS----VRSLDLPPRLFADAKVAHFESPAIAGRDLSFRFPDSW

Query:  AETAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGGGADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIYGSLKQVFPWRRKQDETTKLS
              +E    G+F                 S S  S    GG   GGT VKI+R R + S    S+S S  +A +Y   KQV PWRR+Q+   ++S
Subjt:  AETAEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGGGADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIYGSLKQVFPWRRKQDETTKLS

AT5G53030.1 unknown protein3.5e-1633.66Show/hide
Query:  PGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPA----IAG----RDLSFRFPDSWAETAEM------AEE
        PG+ TPP++ + SVPF WEEAPGKPR V    R N K   VRSL+LPPRL    +      P+    + G    R  S   P S A   ++      A E
Subjt:  PGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPA----IAG----RDLSFRFPDSWAETAEM------AEE

Query:  DKGGKFVGYRRWMSFRKNKKIPTSGSEIS------------VSAGGGADD--GGTRVKITRFRSRRSFFGKSNSNS-----HLIASIYGSLKQVFPWRRK
         +     G  RW SF   K++     + S             + GGG  +  G  +VK+ R   + SFF  S++        + A +Y   KQV PW+RK
Subjt:  DKGGKFVGYRRWMSFRKNKKIPTSGSEIS------------VSAGGGADD--GGTRVKITRFRSRRSFFGKSNSNS-----HLIASIYGSLKQVFPWRRK

Query:  QD
        Q+
Subjt:  QD

AT5G53030.2 unknown protein7.6e-1133.33Show/hide
Query:  PGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPA----IAG----RDLSFRFPDSWAETAEM------AEE
        PG+ TPP++ + SVPF WEEAPGKPR V    R N K   VRSL+LPPRL    +      P+    + G    R  S   P S A   ++      A E
Subjt:  PGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPA----IAG----RDLSFRFPDSWAETAEM------AEE

Query:  DKGGKFVGYRRWMSFRKNKKIPTSGSEIS------------VSAGGGADD--GGTRVKITRFRSRRSFFGKSNS
         +     G  RW SF   K++     + S             + GGG  +  G  +VK+ R   + SFF  S++
Subjt:  DKGGKFVGYRRWMSFRKNKKIPTSGSEIS------------VSAGGGADD--GGTRVKITRFRSRRSFFGKSNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTCTGGAACAGAGCGAAACCAGAGCTCCACCACGCCTCCGGCGAAGCTGTCTCTGTTTTCCCTTACACGACCGCTGCCGGAGCCGCCGGGGATGGTGACGCCGCC
GCTGCACGCTTCGATTTCGGTGCCGTTTCAGTGGGAGGAGGCGCCGGGGAAGCCGAGACCGGTCGGAATTGTTGATCGATCGAATTCGAAGCCTAAAAGTGTACGGTCTT
TGGATCTGCCGCCGAGGTTGTTTGCCGATGCGAAAGTGGCGCATTTTGAGTCTCCGGCGATCGCCGGCCGAGATTTGTCGTTTAGATTCCCGGACAGCTGGGCGGAGACG
GCGGAGATGGCGGAGGAGGATAAGGGTGGTAAATTTGTTGGGTATCGGCGGTGGATGAGCTTTAGGAAGAATAAGAAGATTCCGACGAGTGGGTCTGAAATTTCGGTTTC
GGCCGGCGGCGGTGCTGACGACGGCGGTACGAGGGTGAAGATTACGAGGTTTAGGAGTAGAAGAAGCTTTTTTGGGAAATCTAATTCCAATTCGCACTTGATTGCAAGCA
TTTATGGGAGTTTGAAGCAAGTGTTTCCCTGGAGGCGAAAGCAGGATGAAACGACAAAACTTTCACAGTGA
mRNA sequenceShow/hide mRNA sequence
AAGACGAAGCGCCAGGAGCGTTTCGGCAGTTACCCCTCACAATTCGCATCCCTTTAAACTAACTCTTAATTCCACTTCTTTCTCTCTCTAAAAACCTCCCCTGATTTCTC
TCACTATAAAAACTTCCAATTTCTACTGTAACTAATCATCCATTGCTTCAACTGGGTGTTCTTGATTTTGCTCAAAATCAAATGGGTCTGTGAGAATTTGGCGAAATGAT
GTCTGGAACAGAGCGAAACCAGAGCTCCACCACGCCTCCGGCGAAGCTGTCTCTGTTTTCCCTTACACGACCGCTGCCGGAGCCGCCGGGGATGGTGACGCCGCCGCTGC
ACGCTTCGATTTCGGTGCCGTTTCAGTGGGAGGAGGCGCCGGGGAAGCCGAGACCGGTCGGAATTGTTGATCGATCGAATTCGAAGCCTAAAAGTGTACGGTCTTTGGAT
CTGCCGCCGAGGTTGTTTGCCGATGCGAAAGTGGCGCATTTTGAGTCTCCGGCGATCGCCGGCCGAGATTTGTCGTTTAGATTCCCGGACAGCTGGGCGGAGACGGCGGA
GATGGCGGAGGAGGATAAGGGTGGTAAATTTGTTGGGTATCGGCGGTGGATGAGCTTTAGGAAGAATAAGAAGATTCCGACGAGTGGGTCTGAAATTTCGGTTTCGGCCG
GCGGCGGTGCTGACGACGGCGGTACGAGGGTGAAGATTACGAGGTTTAGGAGTAGAAGAAGCTTTTTTGGGAAATCTAATTCCAATTCGCACTTGATTGCAAGCATTTAT
GGGAGTTTGAAGCAAGTGTTTCCCTGGAGGCGAAAGCAGGATGAAACGACAAAACTTTCACAGTGATCCTATAGAATTAAAATAGAAATATAAATATGTTTTTTTTTTTT
AAAGAATTTTACAAATATGTCGTTCTACTTCTTGTTTCTCCTCTGTTTACAGCTCAAATTTCCACTTGTAAATATCACTATATAATATTTAAT
Protein sequenceShow/hide protein sequence
MMSGTERNQSSTTPPAKLSLFSLTRPLPEPPGMVTPPLHASISVPFQWEEAPGKPRPVGIVDRSNSKPKSVRSLDLPPRLFADAKVAHFESPAIAGRDLSFRFPDSWAET
AEMAEEDKGGKFVGYRRWMSFRKNKKIPTSGSEISVSAGGGADDGGTRVKITRFRSRRSFFGKSNSNSHLIASIYGSLKQVFPWRRKQDETTKLSQ