; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy03g014200 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy03g014200
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionSterile alpha motif (SAM) domain-containing protein
Genome locationChr03:48290720..48291517
RNA-Seq ExpressionLcy03g014200
SyntenyLcy03g014200
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001660 - Sterile alpha motif domain
IPR013761 - Sterile alpha motif/pointed domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589808.1 hypothetical protein SDJN03_15231, partial [Cucurbita argyrosperma subsp. sororia]1.3e-12684.15Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK
        MDWFSWLSRTGLDPLHTYEYGL+FARNGL+P+DIPRF HDFL KIG+SIAKHRLEILKLAK +RE+AT KKLLLSAF KTKNCLRNCLR+L   N+K +K
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK

Query:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS
         +FR D A +SPEP ++ EDL R  EVK  EVLKPP+RRSKHVS SGPLDGRTHEK+M NSKSLKLSGPLDRKERPMFPRSPRSSGPL+GR+SDWASS+S
Subjt:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS

Query:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMM+LIPPSRSPRVSGPLDGRDGSPRICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

KAG7023481.1 hypothetical protein SDJN02_14506, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-12683.77Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK
        MDWFSWLSRTGLDPLHTYEYGL+FARNGL+P+DIPRF HDFL KIG+SIAKHRLEILKLAK +RE+AT KKLLLSAF KTKNCLRNCLR+L   N+K +K
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK

Query:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS
         +FR D A +SPEP ++ EDL R  EVK  EVLKPP+RRSKHVS SGPLDGRTHEK+M NSKSLKLSGPLDRKERPMFPRSPRSSGPL+GR+SDWASS+S
Subjt:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS

Query:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMM+LIPPSRSPRVSGPLDGRDGSPR+CCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_022960905.1 uncharacterized protein LOC111461567 [Cucurbita moschata]3.5e-12784.53Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK
        MDWFSWLSRTGLDPLHTYEYGL+FARNGL+P+DIPRF HDFL KIG+SIAKHRLEILKLAK DRE+AT KKLLLSAF KTKNCLRNCLR+L   N+K +K
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK

Query:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS
         +FR D A +SPEP ++ EDL R  EVK  EVLKPP+RRSKHVS SGPLDGRTHEK+M NSKSLKLSGPLDRKERPMFPRSPRSSGPL+GR+SDWASS+S
Subjt:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS

Query:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMM+LIPPSRSPRVSGPLDGRDGSPRICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_022987839.1 uncharacterized protein LOC111485262 [Cucurbita maxima]9.5e-12583.4Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK
        MDWFSWLSRTGLDPLHTYEYGL+FARNGLKP+DIPRF HDFL +IG+SIAKHRLEILKLAK DRE+AT KKLLLSA  KTKNCLRNCLR+L   N+K +K
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK

Query:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS
         +FR D A ISPEP ++ EDL R QEVK  EVLKPP+RRSKHVS SGPLDGRTHEK+M NSKSLKLSGPLDRKERPM PRSPR SGPL+GR+SDWASS+S
Subjt:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS

Query:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMM+LIPPSRSPRVS PLDGRDGSPRICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_023516525.1 uncharacterized protein LOC111780375 [Cucurbita pepo subsp. pepo]6.0e-12784.53Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK
        MDWFSWLSRTGLDPLHTYEYGL+FARNGL+P+DIPRF HDFL KIG+SIAKHRLEILKLAK DRE+AT KKLLLSAF KTKNCLRNCLR+L   N+K +K
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK

Query:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS
         +FR D A ISPEP ++ EDL R  EVK  EVLKPP+RRSKHVS SGPLDGRTHEK+M NSKSLKLSGPLDRKERPMFPRSPRSSGPL+GR+SDWASS+S
Subjt:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS

Query:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMM+LIPPSRSPRVSGPLDGRDGSP+ICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

TrEMBL top hitse value%identityAlignment
A0A5A7U992 Sterile alpha motif, type 25.1e-10875.54Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAK-FDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPD
        MDWFSWLSRTGLDPLHTYEYGLLFARN LKP+DIPRF H FL KIGISIAKHRLEILKLAK     Q      L+SAF KTK CLRNCLRRL SP+  PD
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAK-FDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPD

Query:  KPVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKER-----------PMFPRSPRSSGPL
        KP+       ISPEPP    D     +VKV+EVLKPPRRRSKHVS SGPLD RTHEK +M+SKSLKLSGPLDRKER           PMFPRSPR+SGPL
Subjt:  KPVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKER-----------PMFPRSPRSSGPL

Query:  EGRISDWA-SSKSPKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        +GRISDW  S+KSPK+NGPPQGRMM+LIPPSRSPRVSGPLDGRDGSPRICCRCNRER+E++DDYHSLWVSLFYDMKPT
Subjt:  EGRISDWA-SSKSPKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1C2N7 uncharacterized protein LOC1110068852.4e-11375.37Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKL-----LLSAFTKTKNCLRNCLRRLTSPN
        MDWFSWLSRTGLDPLHTYEYGLLFARNG++P+DIPRF HDFLHKIG+S+AKHRLEILKLAK + E+   KK      L+SAF KTK CLRNC+R+L   N
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKL-----LLSAFTKTKNCLRNCLRRLTSPN

Query:  SKPDKPVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDW
         KP++ VFR  TA ISPE  S+ E+LGR QEV+     KP  RR K+VS SGPLDGR HEK M N KSLKLSGPLDRKERPMF RSPR+SGPL+GR+SDW
Subjt:  SKPDKPVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDW

Query:  --ASSKSPKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
          AS+KSPK NGPPQG+MM+LIP SRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYD+KPT
Subjt:  --ASSKSPKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1HAF4 uncharacterized protein LOC1114615671.7e-12784.53Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK
        MDWFSWLSRTGLDPLHTYEYGL+FARNGL+P+DIPRF HDFL KIG+SIAKHRLEILKLAK DRE+AT KKLLLSAF KTKNCLRNCLR+L   N+K +K
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK

Query:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS
         +FR D A +SPEP ++ EDL R  EVK  EVLKPP+RRSKHVS SGPLDGRTHEK+M NSKSLKLSGPLDRKERPMFPRSPRSSGPL+GR+SDWASS+S
Subjt:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS

Query:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PKLNGPPQGRMM+LIPPSRSPRVSGPLDGRDGSPRICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1JJY8 uncharacterized protein LOC1114852624.6e-12583.4Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK
        MDWFSWLSRTGLDPLHTYEYGL+FARNGLKP+DIPRF HDFL +IG+SIAKHRLEILKLAK DRE+AT KKLLLSA  KTKNCLRNCLR+L   N+K +K
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDK

Query:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS
         +FR D A ISPEP ++ EDL R QEVK  EVLKPP+RRSKHVS SGPLDGRTHEK+M NSKSLKLSGPLDRKERPM PRSPR SGPL+GR+SDWASS+S
Subjt:  PVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKS

Query:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMM+LIPPSRSPRVS PLDGRDGSPRICCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  PKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1L2N4 uncharacterized protein LOC1114992835.1e-10875.19Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKF-DREQATHKKLLLSAFTKTKNCLRNCLRRLT-SPNSKP
        MDWFSWLSRTGLDPLHTYEYGLLF +NGLKP+DI  F H FLHKIGIS+A  RLEI+KLAKF  ++Q THKK LLSAFTKTKNCLRNCLR LT + NSKP
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKF-DREQATHKKLLLSAFTKTKNCLRNCLRRLT-SPNSKP

Query:  DKPVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASS
        +K +FR + AEI+P  P+H ED G  QEVKV EVLKP +RRSK+VSHSGPLDG+THEK++MN KSLKLSGPL+RKERPM PRSP S GPLEGR S+WASS
Subjt:  DKPVFRHDTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASS

Query:  KSPKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKP
        KSPK N  P+GRMM+LIPPSRSPR SGP+   +GSP ICCRC  ERIETDDDYHSLW SLF+D+KP
Subjt:  KSPKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15760.1 Sterile alpha motif (SAM) domain-containing protein1.3e-1851.09Show/hide
Query:  WFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQA---THKKL--LLSAFTKTKNCLRNCLR
        WFSWLSRT L+P   +EYGL F++N L+ +DI  F H+FL  +GISIAKHRLEILKLA+ DR+ +   T + +  +++A  KT+ CL + +R
Subjt:  WFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQA---THKKL--LLSAFTKTKNCLRNCLR

AT1G80520.1 Sterile alpha motif (SAM) domain-containing protein2.2e-1852.69Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQA--THKKL--LLSAFTKTKNCLRNCLR
        MDWFSWLSRT L+    YEYGL F+ N L+ +DI  F H+FL  +GISIAKHRLEILKLA+ DR+ +  T + +  +L A  KT  C    +R
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQA--THKKL--LLSAFTKTKNCLRNCLR

AT2G12462.1 BEST Arabidopsis thaliana protein match is: Sterile alpha motif (SAM) domain-containing protein (TAIR:AT1G15760.1)5.2e-2833.33Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQAT---HKKL---LLSAFTKTKNCLRNCLRR-LTS
        MDWFSWLS+T LDP  +YEYGL+FA+  L+ +DI  F H+FL ++G+++ KHR+EILKL+K + +  +   H+ +   L+S   K    + N L + L+ 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQAT---HKKL---LLSAFTKTKNCLRNCLRR-LTS

Query:  PNSKPDKPVFRHDTAEISPEPPSHCED---LGRNQEVK----VREVLKPPRRRSKHVSHSGPLD---GRTHEKVMMNSKSLKLSGPLDR--KERPMFP-R
          +   +P+      E    PP++       G N +V     V +V + P  + K +  SGPLD   G   +  +++++S+ LSGPLDR  +ER +   R
Subjt:  PNSKPDKPVFRHDTAEISPEPPSHCED---LGRNQEVK----VREVLKPPRRRSKHVSHSGPLD---GRTHEKVMMNSKSLKLSGPLDR--KERPMFP-R

Query:  SPRSSGPLEGRISDWASSKSPKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT
        SP  SG L+G +++                           R+SGPL GR  SP +    N+     DDD  + W ++F+++KPT
Subjt:  SPRSSGPLEGRISDWASSKSPKLNGPPQGRMMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTGGTTCTCTTGGCTCTCAAGAACCGGCCTCGACCCTCTTCACACCTACGAATACGGCCTTCTTTTCGCCCGTAATGGCCTCAAACCCGACGACATTCCT
CGTTTCACCCACGATTTTCTCCACAAAATCGGAATCTCCATCGCCAAACACAGGCTCGAGATTCTCAAACTCGCCAAATTCGACCGAGAACAAGCCACCCACAAG
AAACTCCTCCTTTCCGCCTTCACCAAAACCAAGAACTGCCTCAGAAACTGCCTCCGCCGCCTCACTTCGCCCAACTCCAAGCCGGACAAGCCCGTTTTCCGCCAT
GACACGGCTGAGATTTCGCCGGAGCCGCCGTCCCACTGCGAGGATCTCGGCCGGAACCAGGAGGTTAAGGTCAGGGAAGTTCTCAAGCCGCCGAGACGACGGAGT
AAACACGTGTCGCATTCCGGGCCGTTGGATGGGAGAACGCACGAGAAGGTGATGATGAACAGTAAGAGCCTGAAGTTGTCTGGGCCGTTGGATAGAAAAGAGAGG
CCCATGTTTCCGAGAAGCCCAAGATCATCTGGGCCTCTGGAAGGGAGAATTTCCGATTGGGCCTCGAGTAAAAGCCCAAAGTTGAATGGGCCGCCGCAGGGGAGA
ATGATGAAGCTGATACCGCCGAGCCGAAGCCCAAGAGTATCTGGGCCGCTGGATGGACGAGATGGAAGCCCAAGAATTTGCTGTCGTTGTAATAGGGAGAGGATA
GAAACTGACGATGATTATCATTCGTTGTGGGTTTCATTGTTCTATGACATGAAGCCCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTGGTTCTCTTGGCTCTCAAGAACCGGCCTCGACCCTCTTCACACCTACGAATACGGCCTTCTTTTCGCCCGTAATGGCCTCAAACCCGACGACATTCCT
CGTTTCACCCACGATTTTCTCCACAAAATCGGAATCTCCATCGCCAAACACAGGCTCGAGATTCTCAAACTCGCCAAATTCGACCGAGAACAAGCCACCCACAAG
AAACTCCTCCTTTCCGCCTTCACCAAAACCAAGAACTGCCTCAGAAACTGCCTCCGCCGCCTCACTTCGCCCAACTCCAAGCCGGACAAGCCCGTTTTCCGCCAT
GACACGGCTGAGATTTCGCCGGAGCCGCCGTCCCACTGCGAGGATCTCGGCCGGAACCAGGAGGTTAAGGTCAGGGAAGTTCTCAAGCCGCCGAGACGACGGAGT
AAACACGTGTCGCATTCCGGGCCGTTGGATGGGAGAACGCACGAGAAGGTGATGATGAACAGTAAGAGCCTGAAGTTGTCTGGGCCGTTGGATAGAAAAGAGAGG
CCCATGTTTCCGAGAAGCCCAAGATCATCTGGGCCTCTGGAAGGGAGAATTTCCGATTGGGCCTCGAGTAAAAGCCCAAAGTTGAATGGGCCGCCGCAGGGGAGA
ATGATGAAGCTGATACCGCCGAGCCGAAGCCCAAGAGTATCTGGGCCGCTGGATGGACGAGATGGAAGCCCAAGAATTTGCTGTCGTTGTAATAGGGAGAGGATA
GAAACTGACGATGATTATCATTCGTTGTGGGTTTCATTGTTCTATGACATGAAGCCCACTTGA
Protein sequenceShow/hide protein sequence
MDWFSWLSRTGLDPLHTYEYGLLFARNGLKPDDIPRFTHDFLHKIGISIAKHRLEILKLAKFDREQATHKKLLLSAFTKTKNCLRNCLRRLTSPNSKPDKPVFRH
DTAEISPEPPSHCEDLGRNQEVKVREVLKPPRRRSKHVSHSGPLDGRTHEKVMMNSKSLKLSGPLDRKERPMFPRSPRSSGPLEGRISDWASSKSPKLNGPPQGR
MMKLIPPSRSPRVSGPLDGRDGSPRICCRCNRERIETDDDYHSLWVSLFYDMKPT