; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0025902 (gene) of Chayote v1 genome

Gene IDSed0025902
OrganismSechium edule (Chayote v1)
DescriptionBEST Arabidopsis thaliana protein match is: embryo defective 2170 .
Genome locationLG07:43431261..43433274
RNA-Seq ExpressionSed0025902
SyntenySed0025902
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592161.1 hypothetical protein SDJN03_14507, partial [Cucurbita argyrosperma subsp. sororia]3.6e-6157.78Show/hide
Query:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL
        MD GS A+TRSRS    ALNHDIFRSWNGKQIHL+DD+  AVEYGFR S+PQ SP+ +RS+Y+SLSPP+KALA+ATGQKELME+VNNMPE  YELSLRDL
Subjt:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL

Query:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKM--IDSGLNMSAKVSPKPSQ----
        VEQPMVLG +E T  ++       GGDR     ENR+S+KE+       S  +ENGGLYLKMG P SIG +T +KK    DSGLN SAKVSPKPS     
Subjt:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKM--IDSGLNMSAKVSPKPSQ----

Query:  ---------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD
                 SSE G       +N+G++KS             KNRTK++ RH   GCWSCIYPK NERD+
Subjt:  ---------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD

XP_004146783.1 uncharacterized protein LOC101215856 [Cucumis sativus]4.7e-6158.53Show/hide
Query:  MDSGSAATRSRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRST-PQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPM
        M+  S    SR ALNHDIFRSWNGKQIHLRDD     EYGFR T PQ SP+ +RS+Y +LSPP+KALA+ATGQKELMEIVNNMPE  YELSLRDLVEQPM
Subjt:  MDSGSAATRSRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRST-PQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPM

Query:  VLGAREQTLTDDVRVFDLGGG----DRENRRSRKESRP--PSAFVENGGLYLKMGLPKSIGAKT-TRKKMIDSGLNMSAKVSPKPSQ-------------
        VLG RE T  D+   + LGG      RENR+SRKE+R       +EN GLYLKMG PKSIG  T  +KK  DS LNMSAKVSPKP Q             
Subjt:  VLGAREQTLTDDVRVFDLGGG----DRENRRSRKESRP--PSAFVENGGLYLKMGLPKSIGAKT-TRKKMIDSGLNMSAKVSPKPSQ-------------

Query:  SSE-------NGINDGNMKS----------KNRTKATHRHESGGCWSCIYPKYNERDD
        SSE       + +N+G++KS          KNRTK+T R  +GGCWS IYPKY+ERD+
Subjt:  SSE-------NGINDGNMKS----------KNRTKATHRHESGGCWSCIYPKYNERDD

XP_022937271.1 uncharacterized protein LOC111443607 [Cucurbita moschata]4.7e-6157.56Show/hide
Query:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL
        MD GS A+TRSRS    ALNHDIFRSWNGKQIHL+DD+  AVEYGFR S+PQ SP+ +RS+Y+SLSPP+KALA+ATGQKELME+VNNMPE  YELSLRDL
Subjt:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL

Query:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKM---IDSGLNMSAKVSPKPSQ---
        VEQPMVLG +E T  ++       GGDR     ENR+S+KE+       S  +ENGGLYLKMG P SIG +T +KK     DSGLN SAKVSPKPS    
Subjt:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKM---IDSGLNMSAKVSPKPSQ---

Query:  ----------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD
                  SSE G       +N+G++KS             KNRTK++ RH   GCWSCIYPK NERD+
Subjt:  ----------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD

XP_022975014.1 uncharacterized protein LOC111473911 [Cucurbita maxima]1.6e-6157.41Show/hide
Query:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL
        MD GS A+TRSRS    ALNHDIFRSWNGKQIHL+DD+   VEYGFR S+PQ SP+ +RS+Y+SLSPP+K+LA+ATGQKELME+VNNMPE  YELSLRDL
Subjt:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL

Query:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESR----PPSAFVENGGLYLKMGLPKSIGAKTTRKKM--IDSGLNMSAKVSPKPSQ----
        VEQPMVLG +E T  ++       GGDR     ENR+S+KE+       S  +ENGGLYLKMG P SIG +T +KK    DSGLN SAKVSPKPS     
Subjt:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESR----PPSAFVENGGLYLKMGLPKSIGAKTTRKKM--IDSGLNMSAKVSPKPSQ----

Query:  ---------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD
                 SSE G       +N+G++KS             KNRTK++ RH +GGCWSCIYPK NERD+
Subjt:  ---------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD

XP_023536601.1 uncharacterized protein LOC111797724 [Cucurbita pepo subsp. pepo]3.0e-6358.3Show/hide
Query:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL
        MD GS A+TRSRS    ALNHDIFRSWNGKQIHL+DD+   VEYGFR S+PQ SP+ +RS+Y+SLSPP+KALA+ATGQKELME+VNNMPE  YELSLRDL
Subjt:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL

Query:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKM---IDSGLNMSAKVSPKPSQ---
        VEQPMVLG +E T  ++       GGDR     ENR+SRKE+       S  +ENGGLYLKMG P SIG +T +KK     DSGLN SAKVSPKPS    
Subjt:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKM---IDSGLNMSAKVSPKPSQ---

Query:  ----------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD
                  SSE G       +N+G++KS             KNRTK++ RH +GGCWSCIYPKYNERD+
Subjt:  ----------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD

TrEMBL top hitse value%identityAlignment
A0A0A0KH64 Uncharacterized protein3.3e-5257.14Show/hide
Query:  MDSGSAATRSRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRST-PQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPM
        M+  S    SR ALNHDIFRSWNGKQIHLRDD     EYGFR T PQ SP+ +RS+Y +LSPP+KALA+ATGQKELMEIVNNMPE  YELSLRDLVEQPM
Subjt:  MDSGSAATRSRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRST-PQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPM

Query:  VLGAREQTLTDDVRVFDLGGG----DRENRRSRKESRP--PSAFVENGGLYLKMGLPKSIGAKT-TRKKMIDSGLNMSAKVSPKPSQ-------------
        VLG RE T  D+   + LGG      RENR+SRKE+R       +EN GLYLKMG PKSIG  T  +KK  DS LNMSAKVSPKP Q             
Subjt:  VLGAREQTLTDDVRVFDLGGG----DRENRRSRKESRP--PSAFVENGGLYLKMGLPKSIGAKT-TRKKMIDSGLNMSAKVSPKPSQ-------------

Query:  SSE-------NGINDGNMKS----------KNRTKATHRHESGGC
        SSE       + +N+G++KS          KNRTK+T R  S  C
Subjt:  SSE-------NGINDGNMKS----------KNRTKATHRHESGGC

A0A1S4E6L8 uncharacterized protein LOC1035040413.3e-6058.59Show/hide
Query:  SGSAATRSRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRST-PQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVL
        S S    SR ALNHDIFRSWNGKQIHLRDD     EYGFR T PQ SP+ +RS+Y +LSPP+KALA+ATGQKELMEIVNNMPE  YELSLRDLVEQPMV+
Subjt:  SGSAATRSRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRST-PQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVL

Query:  GAREQTLTDDVRVFDLGGG----DRENRRSRKESRP--PSAFVENGGLYLKMGLPKSIGAKT-TRKKMIDSGLNMSAKVSPKPSQ-------------SS
        G RE T  D+ R  +LGG      RENR+SRKE+R     + +EN GLYLKMG PKSIG  T  +KK  DS LNMSAKVSPKP Q             SS
Subjt:  GAREQTLTDDVRVFDLGGG----DRENRRSRKESRP--PSAFVENGGLYLKMGLPKSIGAKT-TRKKMIDSGLNMSAKVSPKPSQ-------------SS

Query:  E-------NGINDGNMKS----------KNRTKATHRHESGGCWSCIYPKYNERDD
        E       + +N+G++KS          K+RTK+T R   GGCWS IYPKY+ERD+
Subjt:  E-------NGINDGNMKS----------KNRTKATHRHESGGCWSCIYPKYNERDD

A0A6J1DJS6 uncharacterized protein LOC1110211322.7e-5455.65Show/hide
Query:  SRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRSTPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLT
        SR ALNH+IFRSWNG+QIHLR D+A  +E GFR +PQ SP+ +RS+Y+SLSPP+KA A+ATGQKELME+V++MPE  YELSLRDLVEQP VLG  E+T+ 
Subjt:  SRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRSTPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLT

Query:  DDVRVFDLGGGDR-----ENRRSRKE-SRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKMIDSGLNMSAKVSPKP-------------SQSSE----
        D+ R F+L GGDR     ENR+S+K  SRP     +  +ENGGLYLKMG PKSIG    +KK  DS LN SAKVSPKP             S SSE    
Subjt:  DDVRVFDLGGGDR-----ENRRSRKE-SRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKMIDSGLNMSAKVSPKP-------------SQSSE----

Query:  ---NGINDGNMKS-------------KNRTKATHRHESGGCWSCIYPK
           + IN+G++KS             KNRTK+  R+ SGGCWS IY K
Subjt:  ---NGINDGNMKS-------------KNRTKATHRHESGGCWSCIYPK

A0A6J1F9W6 uncharacterized protein LOC1114436072.3e-6157.56Show/hide
Query:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL
        MD GS A+TRSRS    ALNHDIFRSWNGKQIHL+DD+  AVEYGFR S+PQ SP+ +RS+Y+SLSPP+KALA+ATGQKELME+VNNMPE  YELSLRDL
Subjt:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL

Query:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKM---IDSGLNMSAKVSPKPSQ---
        VEQPMVLG +E T  ++       GGDR     ENR+S+KE+       S  +ENGGLYLKMG P SIG +T +KK     DSGLN SAKVSPKPS    
Subjt:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESRP----PSAFVENGGLYLKMGLPKSIGAKTTRKKM---IDSGLNMSAKVSPKPSQ---

Query:  ----------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD
                  SSE G       +N+G++KS             KNRTK++ RH   GCWSCIYPK NERD+
Subjt:  ----------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD

A0A6J1IJ72 uncharacterized protein LOC1114739117.9e-6257.41Show/hide
Query:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL
        MD GS A+TRSRS    ALNHDIFRSWNGKQIHL+DD+   VEYGFR S+PQ SP+ +RS+Y+SLSPP+K+LA+ATGQKELME+VNNMPE  YELSLRDL
Subjt:  MDSGS-AATRSRS----ALNHDIFRSWNGKQIHLRDDDAAAVEYGFR-STPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDL

Query:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESR----PPSAFVENGGLYLKMGLPKSIGAKTTRKKM--IDSGLNMSAKVSPKPSQ----
        VEQPMVLG +E T  ++       GGDR     ENR+S+KE+       S  +ENGGLYLKMG P SIG +T +KK    DSGLN SAKVSPKPS     
Subjt:  VEQPMVLGAREQTLTDDVRVFDLGGGDR-----ENRRSRKESR----PPSAFVENGGLYLKMGLPKSIGAKTTRKKM--IDSGLNMSAKVSPKPSQ----

Query:  ---------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD
                 SSE G       +N+G++KS             KNRTK++ RH +GGCWSCIYPK NERD+
Subjt:  ---------SSENG-------INDGNMKS-------------KNRTKATHRHESGGCWSCIYPKYNERDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21390.1 embryo defective 21701.8e-1334.95Show/hide
Query:  FRSTPQLSPRSFR--SSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLTDDV--RVFDLGGGDRENRRSRKESRPPSA
        +R++P  SP  F     Y SLSP +KA A+A GQ+ELME+V+ MPE  YELSL+DLVE   V    E+ + D++  R        R+ +  ++     S 
Subjt:  FRSTPQLSPRSFR--SSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLTDDV--RVFDLGGGDRENRRSRKESRPPSA

Query:  FVENGGLYLKMGLPKSIGA-KTTRKKMIDSGLNMSAKVSPKPSQSSEN-GINDGNMKSKNRTKATHRHESGGCWSCIYPKYNERDD
           N G  LK+    S+GA K T KK      + + KVSP+PS S E   + D    ++    +T R  S    + I  + + RD+
Subjt:  FVENGGLYLKMGLPKSIGA-KTTRKKMIDSGLNMSAKVSPKPSQSSEN-GINDGNMKSKNRTKATHRHESGGCWSCIYPKYNERDD

AT1G76980.1 BEST Arabidopsis thaliana protein match is: embryo defective 2170 (TAIR:AT1G21390.1)8.7e-1332.93Show/hide
Query:  FRSTPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLTDDVRVFD-LGGGDRENRRSRKESRPP-----
        +R++P  SP    ++Y++LSP  KA  +A GQ+ELM++V+ MPE  YELSL+DLVE            T++ +VFD +   +++ R+  ++++       
Subjt:  FRSTPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLTDDVRVFD-LGGGDRENRRSRKESRPP-----

Query:  --SAFVENGGLYLKMGLPKSIGAKTTRKKMIDSGLNMSAKVSPKPSQSSEN-GINDGNMKSKNR
          +  V N G  LK+  P S+GAK    K  D+  + S+  S +   SS    I+D +MK +++
Subjt:  --SAFVENGGLYLKMGLPKSIGAKTTRKKMIDSGLNMSAKVSPKPSQSSEN-GINDGNMKSKNR

AT1G76980.2 FUNCTIONS IN: molecular_function unknown8.7e-1332.93Show/hide
Query:  FRSTPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLTDDVRVFD-LGGGDRENRRSRKESRPP-----
        +R++P  SP    ++Y++LSP  KA  +A GQ+ELM++V+ MPE  YELSL+DLVE            T++ +VFD +   +++ R+  ++++       
Subjt:  FRSTPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLTDDVRVFD-LGGGDRENRRSRKESRPP-----

Query:  --SAFVENGGLYLKMGLPKSIGAKTTRKKMIDSGLNMSAKVSPKPSQSSEN-GINDGNMKSKNR
          +  V N G  LK+  P S+GAK    K  D+  + S+  S +   SS    I+D +MK +++
Subjt:  --SAFVENGGLYLKMGLPKSIGAKTTRKKMIDSGLNMSAKVSPKPSQSSEN-GINDGNMKSKNR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCGGTTCTGCAGCCACCAGGTCGCGGTCTGCTTTGAATCACGACATCTTCCGGAGTTGGAACGGGAAGCAGATTCATCTCAGGGATGACGACGCCGCGGCGGT
GGAGTATGGATTTCGATCCACCCCTCAGTTGAGCCCTAGATCTTTCCGATCGAGTTACCGGAGCCTCTCGCCGCCGGCCAAGGCTCTCGCCGTGGCTACCGGGCAGAAGG
AGCTCATGGAAATCGTGAACAATATGCCGGAGTGTTCTTACGAGTTGTCGTTGAGAGATCTGGTGGAGCAGCCCATGGTTTTGGGCGCCCGTGAGCAAACCCTAACTGAT
GATGTTAGGGTTTTCGATTTGGGCGGCGGCGATCGGGAGAATCGGAGATCGAGGAAGGAAAGTAGGCCGCCGTCGGCGTTCGTGGAGAATGGAGGTTTGTATCTGAAGAT
GGGGTTACCGAAATCGATTGGAGCGAAGACGACGAGGAAGAAGATGATTGATTCTGGTTTGAATATGAGTGCTAAAGTTTCGCCGAAACCTTCTCAGTCGAGTGAAAATG
GTATAAACGATGGAAATATGAAGAGCAAGAACAGAACAAAAGCCACCCATAGGCATGAGAGTGGAGGTTGCTGGTCATGTATTTATCCCAAATACAACGAACGAGATGAT
TAA
mRNA sequenceShow/hide mRNA sequence
CAAAATCTTCTTTTTGTAATTTTATATATTTTCCCGAGAAAGTGACTGAGAAAATCGAACCGCCATGATAATGACTCTCCTTCTTCTACAACAACAATCTCCATTTTCAT
TCCTTCAATTTCCATTTCGATTTCGAATCTCCCGATTTTAGGGTTTCCAACGCCATGGATTCCGGTTCTGCAGCCACCAGGTCGCGGTCTGCTTTGAATCACGACATCTT
CCGGAGTTGGAACGGGAAGCAGATTCATCTCAGGGATGACGACGCCGCGGCGGTGGAGTATGGATTTCGATCCACCCCTCAGTTGAGCCCTAGATCTTTCCGATCGAGTT
ACCGGAGCCTCTCGCCGCCGGCCAAGGCTCTCGCCGTGGCTACCGGGCAGAAGGAGCTCATGGAAATCGTGAACAATATGCCGGAGTGTTCTTACGAGTTGTCGTTGAGA
GATCTGGTGGAGCAGCCCATGGTTTTGGGCGCCCGTGAGCAAACCCTAACTGATGATGTTAGGGTTTTCGATTTGGGCGGCGGCGATCGGGAGAATCGGAGATCGAGGAA
GGAAAGTAGGCCGCCGTCGGCGTTCGTGGAGAATGGAGGTTTGTATCTGAAGATGGGGTTACCGAAATCGATTGGAGCGAAGACGACGAGGAAGAAGATGATTGATTCTG
GTTTGAATATGAGTGCTAAAGTTTCGCCGAAACCTTCTCAGTCGAGTGAAAATGGTATAAACGATGGAAATATGAAGAGCAAGAACAGAACAAAAGCCACCCATAGGCAT
GAGAGTGGAGGTTGCTGGTCATGTATTTATCCCAAATACAACGAACGAGATGATTAAAGAGGCTGCTATGCAATCCAATCAGACAAAGCAAAGCACAAAGCAGAAGCCAT
AGGCATATCTATATGAAAATGGAGGCATTTATCCTACACCATCTGTATATTCATTCCTTTCTAGTTTTTCAACTAAAATGCTTCCTTTTTTGTTTGTCTTTTTTTTTTGT
TCTTTGATATATATACATATAAACATATATATATATATCCTTGTGTCTTCTTCTTGAAAAGCTAATCCATGGAATGAATCTCTTATTTCAATGAAAA
Protein sequenceShow/hide protein sequence
MDSGSAATRSRSALNHDIFRSWNGKQIHLRDDDAAAVEYGFRSTPQLSPRSFRSSYRSLSPPAKALAVATGQKELMEIVNNMPECSYELSLRDLVEQPMVLGAREQTLTD
DVRVFDLGGGDRENRRSRKESRPPSAFVENGGLYLKMGLPKSIGAKTTRKKMIDSGLNMSAKVSPKPSQSSENGINDGNMKSKNRTKATHRHESGGCWSCIYPKYNERDD