; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027083 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027083
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSAM domain-containing protein
Genome locationtig00153048:899116..899916
RNA-Seq ExpressionSgr027083
SyntenySgr027083
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001660 - Sterile alpha motif domain
IPR013761 - Sterile alpha motif/pointed domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589808.1 hypothetical protein SDJN03_15231, partial [Cucurbita argyrosperma subsp. sororia]5.1e-11878.52Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGL+FARN LRPEDIPRFNHDFLQK+G+S+AKHRLEILKLAKC +EE TQKK      L+SAF KTK CLRNC+RKL+ + 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT
         KSEK +FREDA   +PE  +Y EDL RK E +EV KPP+RRSK+VSLSGPLDGR HEKLM NSKSLKLSGPLDRKERPMF RSP++SGPLDGR+SDW  
Subjt:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT

Query:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        ASS+SPK NGP    MMRLIPP+RSPRVSGPLDGR+GSPR+CCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

KAG7023481.1 hypothetical protein SDJN02_14506, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-11878.89Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGL+FARN LRPEDIPRFNHDFLQK+G+S+AKHRLEILKLAKC +EE TQKK      L+SAF KTK CLRNC+RKL+ + 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT
         KSEK +FREDA   +PE  +Y EDL RK E +EV KPP+RRSK+VSLSGPLDGR HEKLM NSKSLKLSGPLDRKERPMF RSP++SGPLDGR+SDW  
Subjt:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT

Query:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        ASS+SPK NGP    MMRLIPP+RSPRVSGPLDGR+GSPRVCCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_022134673.1 uncharacterized protein LOC111006885 [Momordica charantia]3.8e-12180.88Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGLLFARN +RPEDIPRFNHDFL K+G+SVAKHRLEILKLAK E EEP  KKFPAA  LVSAF KTKKCLRNCIRKLV S 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVF--KPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDW
        GK E+ VFRE     +PE  SY E+LGRKQE  EV+  KP  RR KNVSLSGPLDGR HEK M N KSLKLSGPLDRKERPMFSRSP+TSGPLDGR+SDW
Subjt:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVF--KPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDW

Query:  VTASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        V AS+KSPKFNGP    MMRLIP +RSPRVSGPLDGR+GSPR+CCRCNRERIETDDDYHSLWVSLFYD+KPT
Subjt:  VTASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_022960905.1 uncharacterized protein LOC111461567 [Cucurbita moschata]8.7e-11878.15Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGL+FARN LRPEDIPRFNHDFLQK+G+S+AKHRLEILKLAKC++EE T+KK      L+SAF KTK CLRNC+RKL+ + 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT
         KSEK +FREDA   +PE  +Y EDL RK E +EV KPP+RRSK+VSLSGPLDGR HEKLM NSKSLKLSGPLDRKERPMF RSP++SGPLDGR+SDW  
Subjt:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT

Query:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        ASS+SPK NGP    MMRLIPP+RSPRVSGPLDGR+GSPR+CCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

XP_023516525.1 uncharacterized protein LOC111780375 [Cucurbita pepo subsp. pepo]6.6e-11878.15Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGL+FARN LRPEDIPRFNHDFLQK+G+S+AKHRLEILKLAKC++EE TQKK      L+SAF KTK CLRNC+RKL+ + 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT
         KSEK +FREDA   +PE  +Y EDL RK E +EV KPP+RRSK+VSLSGPLDGR HEKLM NSKSLKLSGPLDRKERPMF RSP++SGPLDGR+SDW  
Subjt:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT

Query:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        ASS+SPK NGP    MMRLIPP+RSPRVSGPLDGR+GSP++CCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

TrEMBL top hitse value%identityAlignment
A0A0A0LTT1 Uncharacterized protein5.3e-9768.35Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPL+TYEYGLLFARNAL+PEDIPRFNH FLQK+GIS+AKHRLEILKLAK    +P          L+SAF KTK CLRNC+R+L+  +
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDATPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKER-----------PMFSRSPKTSGPLD
           EK +   +  P  +S  E    K + +EV KPPRRRSK+VSLSGPLD R HEK +M+SKSLKLSGPLDRKER           PMF+RSP+TSGPLD
Subjt:  GKSEKVVFREDATPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKER-----------PMFSRSPKTSGPLD

Query:  GRISDWVTASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        GRISDW + S+KSPK NGP    MMRLIPP+RSPRVSGPLDGR+GSPR+CCRCNRER+E++DDYHSLWVSLFYDMKPT
Subjt:  GRISDWVTASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A5A7U992 Sterile alpha motif, type 25.9e-9668.35Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGLLFARNAL+PEDIPRFNH FLQK+GIS+AKHRLEILKLAK           P    L+SAF KTK CLRNC+R+L+  +
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDATPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKER-----------PMFSRSPKTSGPLD
           +K +   +  P  +S  E    K + +EV KPPRRRSK+VSLSGPLD R HEK +M+SKSLKLSGPLDRKER           PMF RSP+TSGPLD
Subjt:  GKSEKVVFREDATPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKER-----------PMFSRSPKTSGPLD

Query:  GRISDWVTASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        GRISDW   S+KSPK NGP    MMRLIPP+RSPRVSGPLDGR+GSPR+CCRCNRER+E++DDYHSLWVSLFYDMKPT
Subjt:  GRISDWVTASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1C2N7 uncharacterized protein LOC1110068851.8e-12180.88Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGLLFARN +RPEDIPRFNHDFL K+G+SVAKHRLEILKLAK E EEP  KKFPAA  LVSAF KTKKCLRNCIRKLV S 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVF--KPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDW
        GK E+ VFRE     +PE  SY E+LGRKQE  EV+  KP  RR KNVSLSGPLDGR HEK M N KSLKLSGPLDRKERPMFSRSP+TSGPLDGR+SDW
Subjt:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVF--KPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDW

Query:  VTASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        V AS+KSPKFNGP    MMRLIP +RSPRVSGPLDGR+GSPR+CCRCNRERIETDDDYHSLWVSLFYD+KPT
Subjt:  VTASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1HAF4 uncharacterized protein LOC1114615674.2e-11878.15Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGL+FARN LRPEDIPRFNHDFLQK+G+S+AKHRLEILKLAKC++EE T+KK      L+SAF KTK CLRNC+RKL+ + 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT
         KSEK +FREDA   +PE  +Y EDL RK E +EV KPP+RRSK+VSLSGPLDGR HEKLM NSKSLKLSGPLDRKERPMF RSP++SGPLDGR+SDW  
Subjt:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT

Query:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        ASS+SPK NGP    MMRLIPP+RSPRVSGPLDGR+GSPR+CCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

A0A6J1JJY8 uncharacterized protein LOC1114852625.1e-11677.04Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRTGLDPLHTYEYGL+FARN L+PEDIPRFNHDFLQ++G+S+AKHRLEILKLAKC++EE TQKK      L+SA  KTK CLRNC+RKL+ + 
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT
         KSEK +FREDA   +PE  +Y EDL RKQE +EV KPP+RRSK+VSLSGPLDGR HEKLM NSKSLKLSGPLDRKERPM  RSP+ SGPLDGR+SDW  
Subjt:  GKSEKVVFREDA---TPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVT

Query:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        ASS+SPK NGP    MMRLIPP+RSPRVS PLDGR+GSPR+CCRCNRER+ETDDDYHSLWVSLFYDMKPT
Subjt:  ASSKSPKFNGP---SMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15760.1 Sterile alpha motif (SAM) domain-containing protein1.6e-2151.04Show/hide
Query:  MEMDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIR
        ME  WFSWLSRT L+P   +EYGL F++N L  EDI  F+H+FLQ +GIS+AKHRLEILKLA+ +++        +  ++V+A  KT+KCL + +R
Subjt:  MEMDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIR

AT1G80520.1 Sterile alpha motif (SAM) domain-containing protein2.3e-2047.66Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST
        MDWFSWLSRT L+    YEYGL F+ N L  EDI  FNH+FLQ +GIS+AKHRLEILKLA+ ++ +P+     +  +++ A  KT KC    +R  +   
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLST

Query:  GKSEKVV
          S  +V
Subjt:  GKSEKVV

AT2G12462.1 BEST Arabidopsis thaliana protein match is: Sterile alpha motif (SAM) domain-containing protein (TAIR:AT1G15760.1)4.9e-3435.48Show/hide
Query:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEE-PTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLS
        MDWFSWLS+T LDP  +YEYGL+FA+  L+ EDI  FNH+FL+++G++V KHR+EILKL+K E +   +    P + +L+S   K  K + N + K +  
Subjt:  MDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEE-PTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLS

Query:  TGKSEKVVFREDATPEQ--------TSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLD---GRAHEKLMMNSKSLKLSGPLDR--KERPMFS-RSPKTS
         G +     +E  +P          T     +  ++   +V + P  + K +  SGPLD   G   +  +++++S+ LSGPLDR  +ER + + RSP  S
Subjt:  TGKSEKVVFREDATPEQ--------TSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLD---GRAHEKLMMNSKSLKLSGPLDR--KERPMFS-RSPKTS

Query:  GPLDGRISDWVTASSKSPKFNGPSMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT
        G LDG +++                 RL       R+SGPL GR  SP V    N+     DDD  + W ++F+++KPT
Subjt:  GPLDGRISDWVTASSKSPKFNGPSMMRLIPPTRSPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATGGACTGGTTCTCTTGGCTCTCCAGAACCGGCCTCGACCCGCTTCACACCTACGAATATGGCCTCCTTTTCGCCCGCAATGCCCTCCGACCCGAAGATATTCC
CCGCTTCAACCATGATTTTCTCCAGAAAGTCGGAATCTCGGTCGCCAAACACAGGCTAGAGATTCTCAAGCTTGCAAAATGCGAGAAAGAAGAACCGACCCAGAAGAAAT
TTCCGGCCGCCGGGCAGCTCGTTTCCGCATTCACCAAAACCAAGAAGTGTCTCAGAAACTGTATCAGAAAGCTGGTTTTGAGTACCGGCAAGTCGGAGAAGGTGGTTTTC
CGGGAAGACGCGACGCCGGAGCAGACGAGCTACGGAGAGGATCTCGGGCGGAAGCAGGAGAGCAGGGAGGTCTTCAAGCCGCCGAGACGACGGAGTAAGAACGTGTCATT
GTCGGGGCCGTTGGATGGGAGAGCGCACGAGAAGTTGATGATGAACAGTAAGAGCCTGAAATTATCTGGGCCGTTGGATAGAAAAGAGAGGCCCATGTTTTCGAGAAGCC
CAAAGACATCTGGGCCTCTAGATGGGAGAATATCCGATTGGGTTACGGCCTCAAGCAAAAGCCCAAAGTTCAATGGGCCGTCAATGATGAGGCTGATACCGCCGACTCGG
AGCCCAAGAGTATCTGGACCTCTGGATGGACGAGAAGGAAGCCCAAGAGTTTGCTGTCGCTGCAATAGGGAGAGGATTGAAACGGACGATGATTATCACTCATTATGGGT
TTCCTTGTTTTATGACATGAAGCCCACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATGGACTGGTTCTCTTGGCTCTCCAGAACCGGCCTCGACCCGCTTCACACCTACGAATATGGCCTCCTTTTCGCCCGCAATGCCCTCCGACCCGAAGATATTCC
CCGCTTCAACCATGATTTTCTCCAGAAAGTCGGAATCTCGGTCGCCAAACACAGGCTAGAGATTCTCAAGCTTGCAAAATGCGAGAAAGAAGAACCGACCCAGAAGAAAT
TTCCGGCCGCCGGGCAGCTCGTTTCCGCATTCACCAAAACCAAGAAGTGTCTCAGAAACTGTATCAGAAAGCTGGTTTTGAGTACCGGCAAGTCGGAGAAGGTGGTTTTC
CGGGAAGACGCGACGCCGGAGCAGACGAGCTACGGAGAGGATCTCGGGCGGAAGCAGGAGAGCAGGGAGGTCTTCAAGCCGCCGAGACGACGGAGTAAGAACGTGTCATT
GTCGGGGCCGTTGGATGGGAGAGCGCACGAGAAGTTGATGATGAACAGTAAGAGCCTGAAATTATCTGGGCCGTTGGATAGAAAAGAGAGGCCCATGTTTTCGAGAAGCC
CAAAGACATCTGGGCCTCTAGATGGGAGAATATCCGATTGGGTTACGGCCTCAAGCAAAAGCCCAAAGTTCAATGGGCCGTCAATGATGAGGCTGATACCGCCGACTCGG
AGCCCAAGAGTATCTGGACCTCTGGATGGACGAGAAGGAAGCCCAAGAGTTTGCTGTCGCTGCAATAGGGAGAGGATTGAAACGGACGATGATTATCACTCATTATGGGT
TTCCTTGTTTTATGACATGAAGCCCACCTGA
Protein sequenceShow/hide protein sequence
MEMDWFSWLSRTGLDPLHTYEYGLLFARNALRPEDIPRFNHDFLQKVGISVAKHRLEILKLAKCEKEEPTQKKFPAAGQLVSAFTKTKKCLRNCIRKLVLSTGKSEKVVF
REDATPEQTSYGEDLGRKQESREVFKPPRRRSKNVSLSGPLDGRAHEKLMMNSKSLKLSGPLDRKERPMFSRSPKTSGPLDGRISDWVTASSKSPKFNGPSMMRLIPPTR
SPRVSGPLDGREGSPRVCCRCNRERIETDDDYHSLWVSLFYDMKPT