; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020873 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020873
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionBEST Arabidopsis thaliana protein match is: embryo defective 2170 .
Genome locationscaffold382:219707..220486
RNA-Seq ExpressionMS020873
SyntenyMS020873
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592161.1 hypothetical protein SDJN03_14507, partial [Cucurbita argyrosperma subsp. sororia]5.3e-9675.37Show/hide
Query:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ
        M+GS R+LL SP ESPD RA++AM+F           SRPALNH+IFRSWNG+QIHL+D+   +E GFRL SPQRSPQFYRSNYQSLSPPSKA AIATGQ
Subjt:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ

Query:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGT-TKKKKKKK
        KELME+V++MPESCYELSLRDLVEQP VLG++E T A+ERD   GGDRE+FAMENRKSKKE S  LV +S+ NMENGGLYLKMGFP SIGT TKKKKKK 
Subjt:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGT-TKKKKKKK

Query:  NDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
        NDS LNTSAKVSPKP  P +KDWWKRR SVSSE+ SV Y SS+NNGSIKSSSSSSSNGS+GSNKNRTKSS R
Subjt:  NDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

XP_022153674.1 uncharacterized protein LOC111021132 [Momordica charantia]2.1e-13298.08Show/hide
Query:  MIGSGRRLLESPLESPDLRAWNAMEFDSRPALNHEIFRSWNGRQIHLRDEAVELESGFRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMP
        MIGSGRRLLESPLESPDLRAWNAMEFDSRPALNHEIFRSWNGRQIHLRDEAVELESGFRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMP
Subjt:  MIGSGRRLLESPLESPDLRAWNAMEFDSRPALNHEIFRSWNGRQIHLRDEAVELESGFRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMP

Query:  ESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDSALNTSAKVS
        ESCYELSLRDLVEQPRVLG EEETVADERDFNLGGDREIFAMENRKSKK ISRPLVSKSTGNMENGGLYLKMGFPKSIGTT KKKKKKNDSALNTSAKVS
Subjt:  ESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDSALNTSAKVS

Query:  PKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
        PKP PPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKS+RR
Subjt:  PKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

XP_022937271.1 uncharacterized protein LOC111443607 [Cucurbita moschata]3.5e-9574.73Show/hide
Query:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ
        M+GS R+LL SP ESPD RA++AM+F           SRPALNH+IFRSWNG+QIHL+D+   +E GFRL SPQRSPQFYRSNYQSLSPPSKA AIATGQ
Subjt:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ

Query:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIG--TTKKKKKK
        KELME+V++MPESCYELSLRDLVEQP VLG++E T A+ERD   GGDRE+FAMENRKSKKE S  LV +S+ NMENGGLYLKMGFP SIG  T KKKKKK
Subjt:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIG--TTKKKKKK

Query:  KNDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
         NDS LNTSAKVSPKP  P +KDWWKRR SVSSE+ SV Y SS+NNGSIKSSSSSSSNGS+GSNKNRTKSS R
Subjt:  KNDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

XP_022975014.1 uncharacterized protein LOC111473911 [Cucurbita maxima]3.8e-9474.26Show/hide
Query:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ
        M+GS R+LL SP ESPD RA++AM+F           SRPALNH+IFRSWNG+QIHL+D+   +E GFRL SPQRSPQFYRSNYQSLSPPSK+ AIATGQ
Subjt:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ

Query:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGT-TKKKKKKK
        KELME+V++MPESCYELSLRDLVEQP VLG++E T A+ERD   GGDRE+FA ENRKSKKE S  LV +S+ NMENGGLYLKMGFP SIGT TKKKKKK 
Subjt:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGT-TKKKKKKK

Query:  NDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
        NDS LNTSAKVSPKP  P +KDWWKRR SVSSE+ SV Y S +NNGSIKSSSSSSSNGS+GSNKNRTKSS R
Subjt:  NDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

XP_023536601.1 uncharacterized protein LOC111797724 [Cucurbita pepo subsp. pepo]3.8e-9473.99Show/hide
Query:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ
        M+GS R+LL SP ESPD RA++AM+F           SRPALNH+IFRSWNG+QIHL+D+   +E GFRL SPQRSPQFYRSNYQSLSPPSKA AIATGQ
Subjt:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ

Query:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKK--
        KELME+V++MPESCYELSLRDLVEQP VLG++E T A+ERD   GGDRE+FAMENRKS+KE S  LV +S+ NMENGGLYLKMGFP SIGT  KKKKK  
Subjt:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKK--

Query:  KNDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
         NDS LNTSAKVSPKP  P +KDWWKRR SVSSE+ SV Y SS+NNGSIKSSSSSSSNGS+GSNKNRTKSS R
Subjt:  KNDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

TrEMBL top hitse value%identityAlignment
A0A1S4E6L8 uncharacterized protein LOC1035040418.1e-9072.86Show/hide
Query:  MIGSGRRLLESPLESPDLRAWN-AMEF-------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQKE
        MIG+ R+LL SP ESPD RA N  MEF        SRPALNH+IFRSWNG+QIHLRD+    E GFRL SPQRSPQFYRSNY +LSPPSKA AIATGQKE
Subjt:  MIGSGRRLLESPLESPDLRAWN-AMEF-------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQKE

Query:  LMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDS
        LME+V++MPESCYELSLRDLVEQP V+G+ E+T  DERD NLGG RE+F+ ENRKS+KE +R LV +S  +MEN GLYLKMGFPKSIGTT +KKKKKNDS
Subjt:  LMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDS

Query:  ALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
        +LN SAKVSPKP    +KDWWKRR SVSSESESV YGS++NNGSIKSSSSSSSN   GSNK+RTKS+ R
Subjt:  ALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

A0A5A7V1Z3 Putative serine/threonine-protein kinase ndrD8.1e-9072.86Show/hide
Query:  MIGSGRRLLESPLESPDLRAWN-AMEF-------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQKE
        MIG+ R+LL SP ESPD RA N  MEF        SRPALNH+IFRSWNG+QIHLRD+    E GFRL SPQRSPQFYRSNY +LSPPSKA AIATGQKE
Subjt:  MIGSGRRLLESPLESPDLRAWN-AMEF-------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQKE

Query:  LMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDS
        LME+V++MPESCYELSLRDLVEQP V+G+ E+T  DERD NLGG RE+F+ ENRKS+KE +R LV +S  +MEN GLYLKMGFPKSIGTT +KKKKKNDS
Subjt:  LMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDS

Query:  ALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
        +LN SAKVSPKP    +KDWWKRR SVSSESESV YGS++NNGSIKSSSSSSSN   GSNK+RTKS+ R
Subjt:  ALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

A0A6J1DJS6 uncharacterized protein LOC1110211321.0e-13298.08Show/hide
Query:  MIGSGRRLLESPLESPDLRAWNAMEFDSRPALNHEIFRSWNGRQIHLRDEAVELESGFRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMP
        MIGSGRRLLESPLESPDLRAWNAMEFDSRPALNHEIFRSWNGRQIHLRDEAVELESGFRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMP
Subjt:  MIGSGRRLLESPLESPDLRAWNAMEFDSRPALNHEIFRSWNGRQIHLRDEAVELESGFRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMP

Query:  ESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDSALNTSAKVS
        ESCYELSLRDLVEQPRVLG EEETVADERDFNLGGDREIFAMENRKSKK ISRPLVSKSTGNMENGGLYLKMGFPKSIGTT KKKKKKNDSALNTSAKVS
Subjt:  ESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDSALNTSAKVS

Query:  PKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
        PKP PPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKS+RR
Subjt:  PKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

A0A6J1F9W6 uncharacterized protein LOC1114436071.7e-9574.73Show/hide
Query:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ
        M+GS R+LL SP ESPD RA++AM+F           SRPALNH+IFRSWNG+QIHL+D+   +E GFRL SPQRSPQFYRSNYQSLSPPSKA AIATGQ
Subjt:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ

Query:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIG--TTKKKKKK
        KELME+V++MPESCYELSLRDLVEQP VLG++E T A+ERD   GGDRE+FAMENRKSKKE S  LV +S+ NMENGGLYLKMGFP SIG  T KKKKKK
Subjt:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIG--TTKKKKKK

Query:  KNDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
         NDS LNTSAKVSPKP  P +KDWWKRR SVSSE+ SV Y SS+NNGSIKSSSSSSSNGS+GSNKNRTKSS R
Subjt:  KNDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

A0A6J1IJ72 uncharacterized protein LOC1114739111.9e-9474.26Show/hide
Query:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ
        M+GS R+LL SP ESPD RA++AM+F           SRPALNH+IFRSWNG+QIHL+D+   +E GFRL SPQRSPQFYRSNYQSLSPPSK+ AIATGQ
Subjt:  MIGSGRRLLESPLESPDLRAWNAMEF----------DSRPALNHEIFRSWNGRQIHLRDEAVELESGFRL-SPQRSPQFYRSNYQSLSPPSKARAIATGQ

Query:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGT-TKKKKKKK
        KELME+V++MPESCYELSLRDLVEQP VLG++E T A+ERD   GGDRE+FA ENRKSKKE S  LV +S+ NMENGGLYLKMGFP SIGT TKKKKKK 
Subjt:  KELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGT-TKKKKKKK

Query:  NDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR
        NDS LNTSAKVSPKP  P +KDWWKRR SVSSE+ SV Y S +NNGSIKSSSSSSSNGS+GSNKNRTKSS R
Subjt:  NDSALNTSAKVSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21390.1 embryo defective 21702.6e-2445.83Show/hide
Query:  FRLSPQRS-PQFYR-SNYQSLSPPSKARAIATGQKELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPL
        +R SP +S P F+R  +Y SLSP SKA+AIA GQ+ELMEMVS MPESCYELSL+DLVE  +V    E  V DE       +R+   +   KS K +    
Subjt:  FRLSPQRS-PQFYR-SNYQSLSPPSKARAIATGQKELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPL

Query:  VSKSTGNMENGGLYLKMGFPKSIG----TTKKKKKKKNDSALNTSAK--VSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSS
            +G   N G  LK+ F  S+G    TTKKKKKKK D A   S +  +S + +   DK+WW R     SES +   GSS +N SI+S SS
Subjt:  VSKSTGNMENGGLYLKMGFPKSIG----TTKKKKKKKNDSALNTSAK--VSPKPLPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSS

AT1G76980.1 BEST Arabidopsis thaliana protein match is: embryo defective 2170 (TAIR:AT1G21390.1)4.0e-2543.72Show/hide
Query:  FRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVS
        +R SP +SP    +NYQ+LSP +KA+ IA GQ+ELM+MVS MPESCYELSL+DLVE    +  EEE V DE        R++  +   KS K +  P+  
Subjt:  FRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVS

Query:  KSTGNMENGGLYLKMGFPKSIGTTKKKKKKK----NDSALNTSAK--VSPKP------LPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSN
           G + N G  LK+ FP S+G  KK  KKK    +DS++ +      SP+P      +   DKDWWK   S S  S+SV   S IN+GS KSS  SSS 
Subjt:  KSTGNMENGGLYLKMGFPKSIGTTKKKKKKK----NDSALNTSAK--VSPKP------LPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSN

Query:  GSHGSNKNRTKSSRR
            SN +R+++S R
Subjt:  GSHGSNKNRTKSSRR

AT1G76980.2 FUNCTIONS IN: molecular_function unknown4.0e-2543.72Show/hide
Query:  FRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVS
        +R SP +SP    +NYQ+LSP +KA+ IA GQ+ELM+MVS MPESCYELSL+DLVE    +  EEE V DE        R++  +   KS K +  P+  
Subjt:  FRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMPESCYELSLRDLVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVS

Query:  KSTGNMENGGLYLKMGFPKSIGTTKKKKKKK----NDSALNTSAK--VSPKP------LPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSN
           G + N G  LK+ FP S+G  KK  KKK    +DS++ +      SP+P      +   DKDWWK   S S  S+SV   S IN+GS KSS  SSS 
Subjt:  KSTGNMENGGLYLKMGFPKSIGTTKKKKKKK----NDSALNTSAK--VSPKP------LPPADKDWWKRRFSVSSESESVGYGSSINNGSIKSSSSSSSN

Query:  GSHGSNKNRTKSSRR
            SN +R+++S R
Subjt:  GSHGSNKNRTKSSRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGGTTCCGGGCGAAGACTGTTGGAGAGCCCTCTCGAATCCCCGGATCTTAGGGCTTGGAACGCCATGGAATTTGATTCCAGACCGGCTCTGAACCACGAGATCTT
CAGGAGCTGGAATGGCAGGCAGATTCATCTGAGGGACGAGGCGGTGGAATTGGAATCTGGATTTCGTTTGAGCCCTCAACGGAGCCCTCAGTTCTACCGATCGAATTACC
AGAGCCTGTCGCCGCCGTCGAAGGCTCGCGCCATAGCTACTGGACAGAAGGAGCTCATGGAAATGGTGAGCCATATGCCGGAGTCGTGCTACGAGTTGTCGTTGAGAGAC
CTGGTGGAGCAGCCTAGGGTTTTGGGGCGAGAAGAAGAAACTGTAGCTGATGAGAGGGATTTCAATTTGGGCGGGGATCGAGAGATTTTCGCGATGGAGAATCGGAAATC
GAAGAAGGAAATTAGTAGGCCATTGGTTAGTAAGAGTACAGGGAACATGGAGAACGGAGGTTTGTATCTGAAAATGGGATTTCCGAAATCTATTGGAACGACTAAGAAGA
AGAAGAAGAAGAAGAACGATTCTGCGTTGAATACGAGTGCTAAAGTTTCGCCTAAACCTCTTCCTCCTGCGGACAAGGATTGGTGGAAGAGAAGATTCTCAGTTTCATCT
GAGAGTGAAAGCGTTGGTTATGGTTCGAGTATCAACAATGGAAGCATCAAGAGCAGTAGCAGCAGCAGCAGCAATGGCAGCCATGGCAGCAACAAGAACAGAACAAAATC
CTCCCGCAGG
mRNA sequenceShow/hide mRNA sequence
ATGATCGGTTCCGGGCGAAGACTGTTGGAGAGCCCTCTCGAATCCCCGGATCTTAGGGCTTGGAACGCCATGGAATTTGATTCCAGACCGGCTCTGAACCACGAGATCTT
CAGGAGCTGGAATGGCAGGCAGATTCATCTGAGGGACGAGGCGGTGGAATTGGAATCTGGATTTCGTTTGAGCCCTCAACGGAGCCCTCAGTTCTACCGATCGAATTACC
AGAGCCTGTCGCCGCCGTCGAAGGCTCGCGCCATAGCTACTGGACAGAAGGAGCTCATGGAAATGGTGAGCCATATGCCGGAGTCGTGCTACGAGTTGTCGTTGAGAGAC
CTGGTGGAGCAGCCTAGGGTTTTGGGGCGAGAAGAAGAAACTGTAGCTGATGAGAGGGATTTCAATTTGGGCGGGGATCGAGAGATTTTCGCGATGGAGAATCGGAAATC
GAAGAAGGAAATTAGTAGGCCATTGGTTAGTAAGAGTACAGGGAACATGGAGAACGGAGGTTTGTATCTGAAAATGGGATTTCCGAAATCTATTGGAACGACTAAGAAGA
AGAAGAAGAAGAAGAACGATTCTGCGTTGAATACGAGTGCTAAAGTTTCGCCTAAACCTCTTCCTCCTGCGGACAAGGATTGGTGGAAGAGAAGATTCTCAGTTTCATCT
GAGAGTGAAAGCGTTGGTTATGGTTCGAGTATCAACAATGGAAGCATCAAGAGCAGTAGCAGCAGCAGCAGCAATGGCAGCCATGGCAGCAACAAGAACAGAACAAAATC
CTCCCGCAGG
Protein sequenceShow/hide protein sequence
MIGSGRRLLESPLESPDLRAWNAMEFDSRPALNHEIFRSWNGRQIHLRDEAVELESGFRLSPQRSPQFYRSNYQSLSPPSKARAIATGQKELMEMVSHMPESCYELSLRD
LVEQPRVLGREEETVADERDFNLGGDREIFAMENRKSKKEISRPLVSKSTGNMENGGLYLKMGFPKSIGTTKKKKKKKNDSALNTSAKVSPKPLPPADKDWWKRRFSVSS
ESESVGYGSSINNGSIKSSSSSSSNGSHGSNKNRTKSSRR