CuGenDBv2

Lsi06G010680 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G010680
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Sterile alpha motif (SAM) domain-containing protein
Genome location: chr06:20917442..20918236
RNA-Seq Expression: Lsi06G010680
Synteny: Lsi06G010680
Gene Ontology terms: NA
InterPro domains: IPR013761 - Sterile alpha motif/pointed domain superfamily
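The genome location field can be split into chromosome, start and end programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("chr06:20917442..20918236")
print(chrom, start, end, length)
```

Under that assumption the gene spans 795 bp on chr06.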


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6589808.1 hypothetical protein SDJN03_15231, partial [Cucurbita argyrosperma subsp. sororia]; e value 3.7e-121; % identity 82.26
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L P+DIPRFNH FLQ IG+SIAKHRLEILKLAK +  + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS
        G+FREDAAV+SPEP T  E+  ++++VKEVLKPP+RRSKHVSLSGPLDGRTHEK + +SKSLKLSGPLDRKERP  MFPRSPR+SGPLDGR+SDWA S+S
Subjt:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS

Query:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSP ICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

KAG7023481.1 hypothetical protein SDJN02_14506, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 4.9e-121; % identity 81.89
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L P+DIPRFNH FLQ IG+SIAKHRLEILKLAK +  + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS
        G+FREDAAV+SPEP T  E+  ++++VKEVLKPP+RRSKHVSLSGPLDGRTHEK + +SKSLKLSGPLDRKERP  MFPRSPR+SGPLDGR+SDWA S+S
Subjt:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS

Query:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSP +CCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_022960905.1 uncharacterized protein LOC111461567 [Cucurbita moschata]; e value 6.4e-121; % identity 82.26
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L P+DIPRFNH FLQ IG+SIAKHRLEILKLAK    + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS
        G+FREDAAV+SPEP T  E+  ++++VKEVLKPP+RRSKHVSLSGPLDGRTHEK + +SKSLKLSGPLDRKERP  MFPRSPR+SGPLDGR+SDWA S+S
Subjt:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS

Query:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSP ICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_023516525.1 uncharacterized protein LOC111780375 [Cucurbita pepo subsp. pepo]; e value 4.9e-121; % identity 82.64
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L P+DIPRFNH FLQ IG+SIAKHRLEILKLAK    + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS
        G+FREDAAVISPEP T  E+  ++++VKEVLKPP+RRSKHVSLSGPLDGRTHEK + +SKSLKLSGPLDRKERP  MFPRSPR+SGPLDGR+SDWA S+S
Subjt:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS

Query:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSP ICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_038879652.1 uncharacterized protein LOC120071438 [Benincasa hispida]; e value 1.4e-131; % identity 90.23
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQ-THNKSLLSAFTKTKNCLRNCLRKLILPTARPE
        MDWFSWLSRTGLDPLHTYEYGLLFARNAL PDDIPRFNHHFLQ IGISIAKHRLEILKLAKSH+HQQ T N  LLSAFTKTK CLRNCLRKLILPTARP+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQ-THNKSLLSAFTKTKNCLRNCLRKLILPTARPE

Query:  KGLFREDAAVISPEPPTVEEEEQEVKVKEVLK-PPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSK
        KG+FRED AVISPEPPT++    EVKVKEV+K PP+RRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPP MFPRSPRTSGPLDGR+SDWALSK
Subjt:  KGLFREDAAVISPEPPTVEEEEQEVKVKEVLK-PPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSK

Query:  SPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        SPK+NGPPQGRMM+LIPPSRSPRVSGPLDGRDGSP ICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  SPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A1S3B951 uncharacterized protein LOC103487398; e value 2.3e-116; % identity 80.36
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSH-SHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE
        MDWFSWLSRTGLDPLHTYEYGLLFARNAL P+DIPRFNHHFLQ IGISIAKHRLEILKLAKSH +HQ   N  L+SAF KTK CLRNCLR+LI P+  P+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSH-SHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE

Query:  KGLFREDAAVISPEPPTVEEEEQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERP---------PLMFPRSPRTSGPLDGR
        K        +ISPEPP    +E +VKVKEVLKPPRRRSKHVSLSGPLD RTHEKFVMSSKSLKLSGPLDRKERP         P MFPRSPRTSGPLDGR
Subjt:  KGLFREDAAVISPEPPTVEEEEQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERP---------PLMFPRSPRTSGPLDGR

Query:  MSDWALS-KSPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        +SDW LS KSPK+NGPPQGRMMRLIPPSRSPRVSGPLDGRDGSP ICCRCNRER+E++DDYHSLWVSLFYDMKPT
Subjt:  MSDWALS-KSPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A5A7U992 Sterile alpha motif, type 2; e value 2.3e-116; % identity 80.36
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSH-SHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE
        MDWFSWLSRTGLDPLHTYEYGLLFARNAL P+DIPRFNHHFLQ IGISIAKHRLEILKLAKSH +HQ   N  L+SAF KTK CLRNCLR+LI P+  P+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSH-SHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE

Query:  KGLFREDAAVISPEPPTVEEEEQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERP---------PLMFPRSPRTSGPLDGR
        K        +ISPEPP    +E +VKVKEVLKPPRRRSKHVSLSGPLD RTHEKFVMSSKSLKLSGPLDRKERP         P MFPRSPRTSGPLDGR
Subjt:  KGLFREDAAVISPEPPTVEEEEQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERP---------PLMFPRSPRTSGPLDGR

Query:  MSDWALS-KSPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        +SDW LS KSPK+NGPPQGRMMRLIPPSRSPRVSGPLDGRDGSP ICCRCNRER+E++DDYHSLWVSLFYDMKPT
Subjt:  MSDWALS-KSPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A5D3CF00 Sterile alpha motif, type 2; e value 3.0e-116; % identity 80.36
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSH-SHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE
        MDWFSWLSRTGLDPLHTYEYGLLFARNAL P+DIPRFNHHFLQ IGISIAKHRLEILKLAKSH +HQ   N  L+SAF KTK CLRNCLR+LI P+  P+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSH-SHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE

Query:  KGLFREDAAVISPEPPTVEEEEQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERP---------PLMFPRSPRTSGPLDGR
        K        +ISPEPP    +E +VKVKEVLKPPRRRSKHVSLSGPLD RTHEKFVMSSKSLKLSGPLDRKERP         P MFPRSPRTSGPLDGR
Subjt:  KGLFREDAAVISPEPPTVEEEEQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERP---------PLMFPRSPRTSGPLDGR

Query:  MSDWALS-KSPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        +SDW LS KSPK+NGPPQGRMMRLIPPSRSPRVSGPLDGRDGSP ICCRCNRER+E++DDYHSLWVSLFYDMKPT
Subjt:  MSDWALS-KSPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1HAF4 uncharacterized protein LOC111461567; e value 3.1e-121; % identity 82.26
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L P+DIPRFNH FLQ IG+SIAKHRLEILKLAK    + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS
        G+FREDAAV+SPEP T  E+  ++++VKEVLKPP+RRSKHVSLSGPLDGRTHEK + +SKSLKLSGPLDRKERP  MFPRSPR+SGPLDGR+SDWA S+S
Subjt:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS

Query:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSP ICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1JJY8 uncharacterized protein LOC111485262; e value 4.2e-118; % identity 81.13
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L P+DIPRFNH FLQ IG+SIAKHRLEILKLAK    + T  K LLSA  KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS
        G+FREDAAVISPEP T  E+  ++ +VKEVLKPP+RRSKHVSLSGPLDGRTHEK + +SKSLKLSGPLDRKERP  M PRSPR SGPLDGR+SDWA S+S
Subjt:  GLFREDAAVISPEPPTVEEE-EQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKS

Query:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMMRLIPPSRSPRVS PLDGRDGSP ICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G15760.1 Sterile alpha motif (SAM) domain-containing protein; e value 2.4e-17; % identity 49.47
Query:  WFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKS-----LLSAFTKTKNCLRNCLRKLI
        WFSWLSRT L+P   +EYGL F++N L  +DI  F+H FLQ++GISIAKHRLEILKLA+          S     +++A  KT+ CL + +R  I
Subjt:  WFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKS-----LLSAFTKTKNCLRNCLRKLI

AT1G80520.1 Sterile alpha motif (SAM) domain-containing protein; e value 4.8e-18; % identity 68.85
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAK
        MDWFSWLSRT L+    YEYGL F+ N L  +DI  FNH FLQ++GISIAKHRLEILKLA+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAK

AT2G12462.1 BEST Arabidopsis thaliana protein match is: Sterile alpha motif (SAM) domain-containing protein (TAIR:AT1G15760.1); e value 7.2e-30; % identity 34.71
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAK-------SHSHQQTHNKSLLSAFTKTKNCLRNCLRKLIL
        MDWFSWLS+T LDP  +YEYGL+FA+  L  +DI  FNH+FL+ +G+++ KHR+EILKL+K       S++H+    K L+S   K    + N       
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAK-------SHSHQQTHNKSLLSAFTKTKNCLRNCLRKLIL

Query:  PTARPEKGLFREDAAVISP-----EPPTV-----------EEEEQEVKVKEVLKPPRRRSKHVSLSGPLD---GRTHEKFVMSSKSLKLSGPLDRKERPP
           R  K L     AV+ P      PP              +   E  V +V + P  + K +  SGPLD   G   +  ++S++S+ LSGPLDR  +  
Subjt:  PTARPEKGLFREDAAVISP-----EPPTV-----------EEEEQEVKVKEVLKPPRRRSKHVSLSGPLD---GRTHEKFVMSSKSLKLSGPLDRKERPP

Query:  LMFP-RSPRTSGPLDGRMSDWALSKSPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
        L+   RSP  SG LDG +++                  RL       R+SGPL GR  SP++    N+     DDD  + W ++F+++KPT
Subjt:  LMFP-RSPRTSGPLDGRMSDWALSKSPKLNGPPQGRMMRLIPPSRSPRVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
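The % identity values listed for the hits can be approximated from the middle line of each alignment block. The sketch below assumes the usual BLAST-style convention (a letter on the middle line marks an identical pair, `+` a conservative substitution, a space a mismatch or gap) and that identity is counted over all alignment columns; `percent_identity` is a hypothetical helper, and the middle line must be copied with its spacing intact.

```python
def percent_identity(midline):
    """Percentage of alignment columns whose midline shows an identical residue."""
    # A letter marks an identical pair; '+' and spaces are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)

# Middle line of the AT1G80520.1 alignment (61 columns, no gaps).
midline = "MDWFSWLSRT L+    YEYGL F+ N L  +DI  FNH FLQ++GISIAKHRLEILKLA+"
print(round(percent_identity(midline), 2))
```

For the AT1G80520.1 hit this gives 42 identical columns out of 61, in line with the 68.85 reported for that hit.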


Sequences
CDS sequence
ATGGACTGGTTCTCTTGGCTCTCAAGAACCGGCCTCGACCCACTTCACACCTACGAATACGGCCTTCTTTTTGCCCGAAATGCCCTCAATCCCGACGACATTCCTCGTTT
CAACCACCATTTTCTTCAAAATATCGGAATCTCCATCGCCAAACACCGCCTCGAGATTCTCAAACTCGCCAAATCCCACTCCCACCAACAAACCCATAACAAATCCCTTC
TTTCCGCCTTCACCAAGACCAAAAATTGCCTCAGAAACTGCCTCCGAAAGCTAATTTTACCCACCGCCAGGCCCGAGAAGGGCCTTTTCCGGGAAGACGCGGCGGTGATT
TCCCCTGAGCCGCCGACCGTGGAAGAGGAGGAGCAGGAGGTCAAGGTTAAGGAAGTCCTCAAGCCGCCGAGACGACGGAGTAAACACGTGTCGTTGTCGGGGCCTTTGGA
CGGGAGAACGCACGAGAAGTTCGTGATGAGCAGTAAGAGCTTGAAGTTATCTGGGCCGTTGGATAGGAAAGAGAGGCCCCCGCTTATGTTTCCGAGAAGCCCAAGAACAT
CTGGGCCTCTAGATGGGAGAATGTCGGATTGGGCCTTGAGTAAAAGCCCAAAATTGAATGGGCCGCCGCAAGGGAGAATGATGAGGCTGATTCCGCCAAGCCGGAGCCCA
AGAGTATCTGGGCCGTTAGATGGACGAGATGGAAGCCCAAATATTTGCTGCCGGTGTAATAGGGAAAGAATAGAAACCGATGATGATTATCATTCCTTGTGGGTTTCATT
GTTTTATGACATGAAGCCAACTTAA
mRNA sequence
ATGGACTGGTTCTCTTGGCTCTCAAGAACCGGCCTCGACCCACTTCACACCTACGAATACGGCCTTCTTTTTGCCCGAAATGCCCTCAATCCCGACGACATTCCTCGTTT
CAACCACCATTTTCTTCAAAATATCGGAATCTCCATCGCCAAACACCGCCTCGAGATTCTCAAACTCGCCAAATCCCACTCCCACCAACAAACCCATAACAAATCCCTTC
TTTCCGCCTTCACCAAGACCAAAAATTGCCTCAGAAACTGCCTCCGAAAGCTAATTTTACCCACCGCCAGGCCCGAGAAGGGCCTTTTCCGGGAAGACGCGGCGGTGATT
TCCCCTGAGCCGCCGACCGTGGAAGAGGAGGAGCAGGAGGTCAAGGTTAAGGAAGTCCTCAAGCCGCCGAGACGACGGAGTAAACACGTGTCGTTGTCGGGGCCTTTGGA
CGGGAGAACGCACGAGAAGTTCGTGATGAGCAGTAAGAGCTTGAAGTTATCTGGGCCGTTGGATAGGAAAGAGAGGCCCCCGCTTATGTTTCCGAGAAGCCCAAGAACAT
CTGGGCCTCTAGATGGGAGAATGTCGGATTGGGCCTTGAGTAAAAGCCCAAAATTGAATGGGCCGCCGCAAGGGAGAATGATGAGGCTGATTCCGCCAAGCCGGAGCCCA
AGAGTATCTGGGCCGTTAGATGGACGAGATGGAAGCCCAAATATTTGCTGCCGGTGTAATAGGGAAAGAATAGAAACCGATGATGATTATCATTCCTTGTGGGTTTCATT
GTTTTATGACATGAAGCCAACTTAA
Protein sequence
MDWFSWLSRTGLDPLHTYEYGLLFARNALNPDDIPRFNHHFLQNIGISIAKHRLEILKLAKSHSHQQTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEKGLFREDAAVI
SPEPPTVEEEEQEVKVKEVLKPPRRRSKHVSLSGPLDGRTHEKFVMSSKSLKLSGPLDRKERPPLMFPRSPRTSGPLDGRMSDWALSKSPKLNGPPQGRMMRLIPPSRSP
RVSGPLDGRDGSPNICCRCNRERIETDDDYHSLWVSLFYDMKPT
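The CDS and protein sequences listed above can be cross-checked by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code applies; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool. Only the first 30 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGACTGGTTCTCTTGGCTCTCAAGAACC"))  # start of the CDS
print(translate("AAGCCAACTTAA"))                    # end of the CDS
```

The two fragments translate to MDWFSWLSRT and KPT, matching the start and end of the listed protein sequence. Passing the full CDS, with its line breaks, to the same function would allow the whole protein to be compared.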