; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G011890 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G011890
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionSAM domain-containing protein
Genome locationCG_Chr06:24996360..24997151
RNA-Seq ExpressionClCG06G011890
SyntenyClCG06G011890
Gene Ontology termsNA
InterPro domainsIPR013761 - Sterile alpha motif/pointed domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589808.1 hypothetical protein SDJN03_15231, partial [Cucurbita argyrosperma subsp. sororia]7.3e-11780.75Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L+P+DIPRFNH FLQKIG+SIAKHRLEILKLAK +  + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS
         + REDAAV+SPEP T  E+  +  +VKEV KPP+RRSKHVSLSGPLDGR HEK + +SKSLKLSGPLDRK R  PMFPRSPR+SGPLDGR+SDWA S+S
Subjt:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS

Query:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMMRLIPPSRSPRVSGPLDG DGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
Subjt:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

KAG7023481.1 hypothetical protein SDJN02_14506, partial [Cucurbita argyrosperma subsp. argyrosperma]9.5e-11780.38Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L+P+DIPRFNH FLQKIG+SIAKHRLEILKLAK +  + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS
         + REDAAV+SPEP T  E+  +  +VKEV KPP+RRSKHVSLSGPLDGR HEK + +SKSLKLSGPLDRK R  PMFPRSPR+SGPLDGR+SDWA S+S
Subjt:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS

Query:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMMRLIPPSRSPRVSGPLDG DGSPR+CCRCNRERMETDDDYHSLWVSLFYDMKPT
Subjt:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

XP_022960905.1 uncharacterized protein LOC111461567 [Cucurbita moschata]1.2e-11680.75Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L+P+DIPRFNH FLQKIG+SIAKHRLEILKLAK    + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS
         + REDAAV+SPEP T  E+  +  +VKEV KPP+RRSKHVSLSGPLDGR HEK + +SKSLKLSGPLDRK R  PMFPRSPR+SGPLDGR+SDWA S+S
Subjt:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS

Query:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMMRLIPPSRSPRVSGPLDG DGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
Subjt:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

XP_023516525.1 uncharacterized protein LOC111780375 [Cucurbita pepo subsp. pepo]2.1e-11680.75Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L+P+DIPRFNH FLQKIG+SIAKHRLEILKLAK    + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS
         + REDAAVISPEP T  E+  +  +VKEV KPP+RRSKHVSLSGPLDGR HEK + +SKSLKLSGPLDRK R  PMFPRSPR+SGPLDGR+SDWA S+S
Subjt:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS

Query:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMMRLIPPSRSPRVSGPLDG DGSP+ICCRCNRERMETDDDYHSLWVSLFYDMKPT
Subjt:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

XP_038879652.1 uncharacterized protein LOC120071438 [Benincasa hispida]3.7e-12989.39Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSH-QPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE
        MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSH+H QPT N  LLSAFTKTK CLRNCLRKLILPTARP+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSH-QPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE

Query:  KILLREDAAVISPEPPTVEEEELKQVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKSP
        K + RED AVISPEPPT++E ++K+V  V  PP+RRSKHVSLSGPLDGR HEKFVMSSKSLKLSGPLDRK R PPMFPRSPRTSGPLDGR+SDWALSKSP
Subjt:  KILLREDAAVISPEPPTVEEEELKQVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKSP

Query:  KVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        KVNGPPQGRMM+LIPPSRSPRVSGPLDG DGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
Subjt:  KVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

TrEMBL top hitse value%identityAlignment
A0A1S3B951 uncharacterized protein LOC1034873986.6e-11681.09Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSH-SHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE
        MDWFSWLSRTGLDPLHTYEYGLLFARNALKP+DIPRFNHHFLQKIGISIAKHRLEILKLAKSH +HQP  N  L+SAF KTK CLRNCLR+LI P+  P+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSH-SHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE

Query:  KILLREDAAVISPEPPTVEEEELK-QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRP---------PPMFPRSPRTSGPLDGR
        K        +ISPEPP    +E K +VKEV KPPRRRSKHVSLSGPLD R HEKFVMSSKSLKLSGPLDRK RP         PPMFPRSPRTSGPLDGR
Subjt:  KILLREDAAVISPEPPTVEEEELK-QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRP---------PPMFPRSPRTSGPLDGR

Query:  LSDWALS-KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        +SDW LS KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDG DGSPRICCRCNRERME++DDYHSLWVSLFYDMKPT
Subjt:  LSDWALS-KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

A0A5A7U992 Sterile alpha motif, type 26.6e-11681.09Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSH-SHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE
        MDWFSWLSRTGLDPLHTYEYGLLFARNALKP+DIPRFNHHFLQKIGISIAKHRLEILKLAKSH +HQP  N  L+SAF KTK CLRNCLR+LI P+  P+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSH-SHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE

Query:  KILLREDAAVISPEPPTVEEEELK-QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRP---------PPMFPRSPRTSGPLDGR
        K        +ISPEPP    +E K +VKEV KPPRRRSKHVSLSGPLD R HEKFVMSSKSLKLSGPLDRK RP         PPMFPRSPRTSGPLDGR
Subjt:  KILLREDAAVISPEPPTVEEEELK-QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRP---------PPMFPRSPRTSGPLDGR

Query:  LSDWALS-KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        +SDW LS KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDG DGSPRICCRCNRERME++DDYHSLWVSLFYDMKPT
Subjt:  LSDWALS-KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

A0A5D3CF00 Sterile alpha motif, type 28.6e-11681.09Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSH-SHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE
        MDWFSWLSRTGLDPLHTYEYGLLFARNALKP+DIPRFNHHFLQKIGISIAKHRLEILKLAKSH +HQP  N  L+SAF KTK CLRNCLR+LI P+  P+
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSH-SHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPE

Query:  KILLREDAAVISPEPPTVEEEELK-QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRP---------PPMFPRSPRTSGPLDGR
        K        +ISPEPP    +E K +VKEV KPPRRRSKHVSLSGPLD R HEKFVMSSKSLKLSGPLDRK RP         PPMFPRSPRTSGPLDGR
Subjt:  KILLREDAAVISPEPPTVEEEELK-QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRP---------PPMFPRSPRTSGPLDGR

Query:  LSDWALS-KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        +SDW LS KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDG DGSPRICCRCNRERME++DDYHSLWVSLFYDMKPT
Subjt:  LSDWALS-KSPKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

A0A6J1HAF4 uncharacterized protein LOC1114615676.0e-11780.75Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN L+P+DIPRFNH FLQKIG+SIAKHRLEILKLAK    + T  K LLSAF KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS
         + REDAAV+SPEP T  E+  +  +VKEV KPP+RRSKHVSLSGPLDGR HEK + +SKSLKLSGPLDRK R  PMFPRSPR+SGPLDGR+SDWA S+S
Subjt:  ILLREDAAVISPEPPTVEEEELK--QVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS

Query:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMMRLIPPSRSPRVSGPLDG DGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
Subjt:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

A0A6J1JJY8 uncharacterized protein LOC1114852621.2e-11480Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK
        MDWFSWLSRTGLDPLHTYEYGL+FARN LKP+DIPRFNH FLQ+IG+SIAKHRLEILKLAK    + T  K LLSA  KTKNCLRNCLRKLI   A+ EK
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEK

Query:  ILLREDAAVISPEPPTVEEE--ELKQVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS
         + REDAAVISPEP T  E+    ++VKEV KPP+RRSKHVSLSGPLDGR HEK + +SKSLKLSGPLDRK R  PM PRSPR SGPLDGR+SDWA S+S
Subjt:  ILLREDAAVISPEPPTVEEE--ELKQVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKS

Query:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
        PK+NGPPQGRMMRLIPPSRSPRVS PLDG DGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
Subjt:  PKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15760.1 Sterile alpha motif (SAM) domain-containing protein4.8e-1850.53Show/hide
Query:  WFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAK--SHSHQPTHNKSL---LSAFTKTKNCLRNCLRKLI
        WFSWLSRT L+P   +EYGL F++N L+ +DI  F+H FLQ +GISIAKHRLEILKLA+    +  P  ++S+   ++A  KT+ CL + +R  I
Subjt:  WFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAK--SHSHQPTHNKSL---LSAFTKTKNCLRNCLRKLI

AT1G80520.1 Sterile alpha motif (SAM) domain-containing protein2.8e-1853.12Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLA-KSHSHQPTHNKSL---LSAFTKTKNCLRNCLRKLI
        MDWFSWLSRT L+    YEYGL F+ N L+ +DI  FNH FLQ +GISIAKHRLEILKLA +     P   +S+   L A  KT  C    +R  I
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLA-KSHSHQPTHNKSL---LSAFTKTKNCLRNCLRKLI

AT2G12462.1 BEST Arabidopsis thaliana protein match is: Sterile alpha motif (SAM) domain-containing protein (TAIR:AT1G15760.1)7.2e-3035.69Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAK-------SHSHQPTHNKSLLSAFTKTKNCLRNCLRKLIL
        MDWFSWLS+T LDP  +YEYGL+FA+  L+ +DI  FNH+FL+++G+++ KHR+EILKL+K       S++H+P   K L+S   K    + N L K + 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAK-------SHSHQPTHNKSLLSAFTKTKNCLRNCLRKLIL

Query:  --PTARPEKILLRED-------AAVISPEPPTVEEEELKQVKEVPKPPRRRSKHVSLSGPLD---GRMHEKFVMSSKSLKLSGPLDRKGRPPPMFP-RSP
           TA  E +  ++         A ++     V  E  K V +V + P  + K +  SGPLD   G   +  ++S++S+ LSGPLDR  +   +   RSP
Subjt:  --PTARPEKILLRED-------AAVISPEPPTVEEEELKQVKEVPKPPRRRSKHVSLSGPLD---GRMHEKFVMSSKSLKLSGPLDRKGRPPPMFP-RSP

Query:  RTSGPLDGRLSDWALSKSPKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT
          SG LDG L++                  RL       R+SGPL G   SP +    N+     DDD  + W ++F+++KPT
Subjt:  RTSGPLDGRLSDWALSKSPKVNGPPQGRMMRLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTGGTTCTCTTGGCTCTCAAGAACCGGCCTCGACCCACTTCACACCTACGAATACGGCCTCCTTTTTGCCCGAAATGCCCTCAAACCCGACGACATTCCT
CGTTTCAACCATCATTTCCTTCAAAAAATCGGAATCTCCATCGCCAAACACAGACTCGAGATTCTCAAACTCGCCAAATCCCACTCCCACCAACCCACCCACAAC
AAATCCCTCCTTTCCGCCTTCACCAAGACCAAAAATTGCCTCCGAAACTGCCTCCGAAAGCTAATTTTACCCACCGCCAGGCCGGAGAAGATCCTTTTACGGGAA
GACGCGGCGGTGATTTCGCCTGAGCCGCCGACCGTGGAGGAGGAGGAGCTTAAGCAGGTTAAGGAGGTCCCCAAGCCGCCGAGACGACGGAGTAAACACGTGTCG
CTGTCGGGGCCGTTGGACGGGAGAATGCACGAGAAGTTCGTGATGAGCAGTAAAAGCTTGAAGTTATCTGGGCCGTTGGATAGGAAAGGGAGGCCGCCGCCTATG
TTTCCGAGAAGCCCAAGAACATCTGGGCCTTTGGATGGGAGATTATCGGATTGGGCCTTGAGTAAAAGCCCAAAGGTGAATGGGCCGCCGCAGGGGAGAATGATG
AGGCTGATTCCGCCGAGCCGGAGCCCAAGAGTATCTGGGCCGTTAGATGGACCAGATGGAAGCCCAAGAATTTGTTGTCGTTGTAATAGGGAAAGAATGGAAACG
GATGATGATTATCATTCCTTGTGGGTTTCATTGTTTTATGACATGAAGCCAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTGGTTCTCTTGGCTCTCAAGAACCGGCCTCGACCCACTTCACACCTACGAATACGGCCTCCTTTTTGCCCGAAATGCCCTCAAACCCGACGACATTCCT
CGTTTCAACCATCATTTCCTTCAAAAAATCGGAATCTCCATCGCCAAACACAGACTCGAGATTCTCAAACTCGCCAAATCCCACTCCCACCAACCCACCCACAAC
AAATCCCTCCTTTCCGCCTTCACCAAGACCAAAAATTGCCTCCGAAACTGCCTCCGAAAGCTAATTTTACCCACCGCCAGGCCGGAGAAGATCCTTTTACGGGAA
GACGCGGCGGTGATTTCGCCTGAGCCGCCGACCGTGGAGGAGGAGGAGCTTAAGCAGGTTAAGGAGGTCCCCAAGCCGCCGAGACGACGGAGTAAACACGTGTCG
CTGTCGGGGCCGTTGGACGGGAGAATGCACGAGAAGTTCGTGATGAGCAGTAAAAGCTTGAAGTTATCTGGGCCGTTGGATAGGAAAGGGAGGCCGCCGCCTATG
TTTCCGAGAAGCCCAAGAACATCTGGGCCTTTGGATGGGAGATTATCGGATTGGGCCTTGAGTAAAAGCCCAAAGGTGAATGGGCCGCCGCAGGGGAGAATGATG
AGGCTGATTCCGCCGAGCCGGAGCCCAAGAGTATCTGGGCCGTTAGATGGACCAGATGGAAGCCCAAGAATTTGTTGTCGTTGTAATAGGGAAAGAATGGAAACG
GATGATGATTATCATTCCTTGTGGGTTTCATTGTTTTATGACATGAAGCCAACTTGA
Protein sequenceShow/hide protein sequence
MDWFSWLSRTGLDPLHTYEYGLLFARNALKPDDIPRFNHHFLQKIGISIAKHRLEILKLAKSHSHQPTHNKSLLSAFTKTKNCLRNCLRKLILPTARPEKILLRE
DAAVISPEPPTVEEEELKQVKEVPKPPRRRSKHVSLSGPLDGRMHEKFVMSSKSLKLSGPLDRKGRPPPMFPRSPRTSGPLDGRLSDWALSKSPKVNGPPQGRMM
RLIPPSRSPRVSGPLDGPDGSPRICCRCNRERMETDDDYHSLWVSLFYDMKPT