; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS028341 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS028341
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1685)
Genome locationscaffold47:3614869..3615591
RNA-Seq ExpressionMS028341
SyntenyMS028341
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147984.1 uncharacterized protein LOC111016778 [Momordica charantia]3.2e-12798.75Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPS  PEN SESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD
        KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD

Query:  EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
Subjt:  EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

XP_022952129.1 uncharacterized protein LOC111454895 [Cucurbita moschata]6.5e-8071.78Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL LFD FWFQ  VF GKPLL T+      E PENR +SP+MQV++ RSQSEY L S+F  PETAYYS      N+KLQ ILSG+VTEFSG   G
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG+KT           EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVASTVR
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

XP_022969253.1 uncharacterized protein LOC111468311 [Cucurbita maxima]2.6e-8172.61Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL+LFD FWFQ  VF G PLL T+      E PENR +SP+MQV++ RSQSEY L S+F  PETAYYS     TN+KLQ ILSG+VTEFSG  GG
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG +T       EE  EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVASTVR
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

XP_023554329.1 uncharacterized protein LOC111811624 [Cucurbita pepo subsp. pepo]5.0e-8071.37Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL LFD FWFQ  VF GKPLL T+      E PENR +SP+MQV++ RSQSEY L S+F  PET YYS      N+KLQ ILSG+VTEF+GE  G
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG+KT           EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVASTVR
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

XP_038887878.1 uncharacterized protein LOC120077867 [Benincasa hispida]3.1e-8273.28Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGS-IPTNRKLQTILSGEVTEFSGER
        MA+EEIL LFD FWFQ  +F GKPLL+T      S  PE R +SP+ QV++ RSQSEYLL S  FP PETA YSTGS I T++KLQTILSG+V EF+G  
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGS-IPTNRKLQTILSGEVTEFSGER

Query:  GGK----PAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLS
         GK    PAKKKL GNE K RK+RG+GLSKSLSDLEFEELKGFMDLGFVF EEDKN S+LASIIPGLQRLG+KTGENEEEK         E GVSRPYLS
Subjt:  GGK----PAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLS

Query:  EAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        EAWEA +EENEKRILMKWRVP LG ATEMDMKDHLKFWAHTVASTVR
Subjt:  EAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

TrEMBL top hitse value%identityAlignment
A0A0A0K3T7 Uncharacterized protein3.1e-6463.01Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERG
        MA+EEIL LFD FWFQ  +F+ K  L+T              +SP+ QV++ RSQSEYLL S DFP PETA      + +N+KL+TILSG+VTEF G   
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERG

Query:  G---KPAKKKLGGNEEKL-RKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSE
        G   K  KKKL GNE+K+ RK++G+GLSKSLSDLEFEELKGFMDLGFVFSEEDKN S+L SIIPGL RLG K      + EEKR E G    + RPYLSE
Subjt:  G---KPAKKKLGGNEEKL-RKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSE

Query:  AWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        AW+A +EENEK ILMKWRVP LG ATEMD+K HLKFWAHTVASTVR
Subjt:  AWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

A0A1S3C1U3 uncharacterized protein LOC1034954959.5e-6161.38Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERG
        MA+EEIL LFD FWFQ  +F  KPLL+T               SP+M++   RSQSEYLL S DFP P T      ++ +N+KL+T+LSG+VTEF G   
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERG

Query:  GKPAK---KKLGGNEEKL-RKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSE
        GK  K   KKL GNE K+ RK++ +GLSKSLSDLEFEELKGFMDLGFVFSEEDKN S+L SIIPGL RLG       +  EEKR E G    + RPYLSE
Subjt:  GKPAK---KKLGGNEEKL-RKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSE

Query:  AWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        AWEA +EENEK +LMKWRVP LG ATEMD+K HLKFWAHTVASTVR
Subjt:  AWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

A0A6J1D2N3 uncharacterized protein LOC1110167781.5e-12798.75Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPS  PEN SESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD
        KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD

Query:  EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
Subjt:  EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

A0A6J1GKQ8 uncharacterized protein LOC1114548953.1e-8071.78Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL LFD FWFQ  VF GKPLL T+      E PENR +SP+MQV++ RSQSEY L S+F  PETAYYS      N+KLQ ILSG+VTEFSG   G
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG+KT           EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVASTVR
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

A0A6J1I0F9 uncharacterized protein LOC1114683111.3e-8172.61Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL+LFD FWFQ  VF G PLL T+      E PENR +SP+MQV++ RSQSEY L S+F  PETAYYS     TN+KLQ ILSG+VTEFSG  GG
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG +T       EE  EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVASTVR
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)7.3e-0530.51Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEM
        SKSL+D + E+L+G +DLGF FS  D+   L + +P L+     + +  ++K+ K  E  +      P L  A            +  W++   G   + 
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEM

Query:  DMKDHLKFWAHTVASTVR
        D+K  LK+WA  VA TV+
Subjt:  DMKDHLKFWAHTVASTVR

AT1G05870.2 Protein of unknown function (DUF1685)7.3e-0530.51Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEM
        SKSL+D + E+L+G +DLGF FS  D+   L + +P L+     + +  ++K+ K  E  +      P L  A            +  W++   G   + 
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEM

Query:  DMKDHLKFWAHTVASTVR
        D+K  LK+WA  VA TV+
Subjt:  DMKDHLKFWAHTVASTVR

AT2G31560.1 Protein of unknown function (DUF1685)3.9e-0628.93Show/hide
Query:  GERGGKPAKKKLGGNEEKLRKRRGRGL--------------SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEK
        G+ GG+      G + +KL K++ + L              +KSL+D + EELKG +DLGF FS  D+   L + +P L+     + +  ++K++   + 
Subjt:  GERGGKPAKKKLGGNEEKLRKRRGRGL--------------SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEK

Query:  GNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
          E   S P  + A            +  W++   G   + D+K  LK+WA TVA TVR
Subjt:  GNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

AT2G31560.2 Protein of unknown function (DUF1685)3.9e-0628.93Show/hide
Query:  GERGGKPAKKKLGGNEEKLRKRRGRGL--------------SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEK
        G+ GG+      G + +KL K++ + L              +KSL+D + EELKG +DLGF FS  D+   L + +P L+     + +  ++K++   + 
Subjt:  GERGGKPAKKKLGGNEEKLRKRRGRGL--------------SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEK

Query:  GNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
          E   S P  + A            +  W++   G   + D+K  LK+WA TVA TVR
Subjt:  GNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVASTVR

AT2G42760.1 unknown protein3.0e-3539.86Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFP-----------------APETAYYSTGS-------
        MA EE+L LF+Q W +      +P+ +  +     +  E R E    ++++ R + E L   +FP                 + +T+ +S+ S       
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFP-----------------APETAYYSTGS-------

Query:  ------IPTNRKLQTILSG-EVTEFS-GER----GGKPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRL
               PT  KLQTILSG EV  F+  ER      K  ++K    +  +R R+G    KS+SDLE+EELKGFMDLGFVFSE+D K+S L SI+PGLQRL
Subjt:  ------IPTNRKLQTILSG-EVTEFS-GER----GGKPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRL

Query:  GRK-TGENEEEKEEKREEKGNEIGVSRPYLSEAWE-ADDEENEKRIL--MKWRVPDLGAATEMDMKDHLKFWAHTVASTVR
         +K  G  +EE+EE+ E+K      +RPYLSEAW+     + +K+I   +KWRVP   AA+E+D+KD+L+ WAH VAST+R
Subjt:  GRK-TGENEEEKEEKREEKGNEIGVSRPYLSEAWE-ADDEENEKRIL--MKWRVPDLGAATEMDMKDHLKFWAHTVASTVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCCGAAGAAATCCTCACCCTTTTCGACCAATTCTGGTTTCAGCACGTAGTTTTCGCCGGAAAACCGCTTTTACGGACCCAACGTGCAACAAAACCCTCCGAGGG
GCCGGAAAACCGCTCTGAGAGTCCAATAATGCAAGTGATTCAGGGGAGATCTCAGAGCGAGTATCTTCTGGGCTCGGACTTCCCAGCTCCCGAAACCGCTTATTACTCCA
CCGGCTCGATTCCGACCAATAGAAAACTGCAAACCATTCTTTCCGGCGAAGTAACAGAATTTTCCGGCGAAAGAGGGGGGAAGCCGGCGAAGAAGAAATTGGGAGGGAAT
GAAGAGAAATTAAGGAAGAGAAGAGGAAGGGGGTTGAGTAAGAGCTTGTCGGACCTTGAATTTGAGGAATTGAAGGGGTTTATGGATTTGGGATTTGTGTTCTCCGAGGA
AGATAAGAATTCAAGCTTGGCTTCCATAATTCCTGGGCTGCAGAGATTGGGGAGAAAAACAGGGGAAAATGAAGAGGAAAAAGAGGAAAAAAGAGAGGAAAAAGGGAATG
AAATTGGGGTTTCGAGGCCATATTTGTCTGAAGCATGGGAGGCTGATGATGAAGAAAATGAAAAGAGAATTCTGATGAAATGGAGAGTTCCAGATTTGGGTGCTGCCACT
GAAATGGACATGAAAGATCATCTCAAGTTTTGGGCTCATACAGTTGCTTCAACTGTGAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCCGAAGAAATCCTCACCCTTTTCGACCAATTCTGGTTTCAGCACGTAGTTTTCGCCGGAAAACCGCTTTTACGGACCCAACGTGCAACAAAACCCTCCGAGGG
GCCGGAAAACCGCTCTGAGAGTCCAATAATGCAAGTGATTCAGGGGAGATCTCAGAGCGAGTATCTTCTGGGCTCGGACTTCCCAGCTCCCGAAACCGCTTATTACTCCA
CCGGCTCGATTCCGACCAATAGAAAACTGCAAACCATTCTTTCCGGCGAAGTAACAGAATTTTCCGGCGAAAGAGGGGGGAAGCCGGCGAAGAAGAAATTGGGAGGGAAT
GAAGAGAAATTAAGGAAGAGAAGAGGAAGGGGGTTGAGTAAGAGCTTGTCGGACCTTGAATTTGAGGAATTGAAGGGGTTTATGGATTTGGGATTTGTGTTCTCCGAGGA
AGATAAGAATTCAAGCTTGGCTTCCATAATTCCTGGGCTGCAGAGATTGGGGAGAAAAACAGGGGAAAATGAAGAGGAAAAAGAGGAAAAAAGAGAGGAAAAAGGGAATG
AAATTGGGGTTTCGAGGCCATATTTGTCTGAAGCATGGGAGGCTGATGATGAAGAAAATGAAAAGAGAATTCTGATGAAATGGAGAGTTCCAGATTTGGGTGCTGCCACT
GAAATGGACATGAAAGATCATCTCAAGTTTTGGGCTCATACAGTTGCTTCAACTGTGAGATAA
Protein sequenceShow/hide protein sequence
MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSEGPENRSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGGKPAKKKLGGN
EEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAAT
EMDMKDHLKFWAHTVASTVR