; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g01330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g01330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF1685)
Genome locationchr4:874978..876813
RNA-Seq ExpressionMoc04g01330
SyntenyMoc04g01330
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147984.1 uncharacterized protein LOC111016778 [Momordica charantia]1.5e-127100Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD
        KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD

Query:  EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
Subjt:  EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

XP_022952129.1 uncharacterized protein LOC111454895 [Cucurbita moschata]1.9e-7770.71Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL LFD FWFQ  VF GKPLL T+       +PEN  +SP+MQV++ RSQSEY L S+F  PETAYYS      N+KLQ ILSG+VTEFSG   G
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG+KT           EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVAST
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

XP_022969253.1 uncharacterized protein LOC111468311 [Cucurbita maxima]7.7e-7971.55Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL+LFD FWFQ  VF G PLL T+       +PEN  +SP+MQV++ RSQSEY L S+F  PETAYYS     TN+KLQ ILSG+VTEFSG  GG
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG +T       EE  EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVAST
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

XP_023554329.1 uncharacterized protein LOC111811624 [Cucurbita pepo subsp. pepo]1.4e-7770.29Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL LFD FWFQ  VF GKPLL T+       +PEN  +SP+MQV++ RSQSEY L S+F  PET YYS      N+KLQ ILSG+VTEF+GE  G
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG+KT           EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVAST
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

XP_038887878.1 uncharacterized protein LOC120077867 [Benincasa hispida]8.2e-8173.06Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGS-IPTNRKLQTILSGEVTEFSGER
        MA+EEIL LFD FWFQ  +F GKPLL+T      S+APE   +SP+ QV++ RSQSEYLL S  FP PETA YSTGS I T++KLQTILSG+V EF+G  
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGS-IPTNRKLQTILSGEVTEFSGER

Query:  GGK----PAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLS
         GK    PAKKKL GNE K RK+RG+GLSKSLSDLEFEELKGFMDLGFVF EEDKN S+LASIIPGLQRLG+KTGENEEEK         E GVSRPYLS
Subjt:  GGK----PAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLS

Query:  EAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        EAWEA +EENEKRILMKWRVP LG ATEMDMKDHLKFWAHTVAST
Subjt:  EAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

TrEMBL top hitse value%identityAlignment
A0A0A0K3T7 Uncharacterized protein3.7e-6362.7Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERG
        MA+EEIL LFD FWFQ  +F+ K  L+T              +SP+ QV++ RSQSEYLL S DFP PETA      + +N+KL+TILSG+VTEF G   
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERG

Query:  G---KPAKKKLGGNEEKL-RKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSE
        G   K  KKKL GNE+K+ RK++G+GLSKSLSDLEFEELKGFMDLGFVFSEEDKN S+L SIIPGL RLG K      + EEKR E G    + RPYLSE
Subjt:  G---KPAKKKLGGNEEKL-RKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSE

Query:  AWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        AW+A +EENEK ILMKWRVP LG ATEMD+K HLKFWAHTVAST
Subjt:  AWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

A0A1S3C1U3 uncharacterized protein LOC1034954951.9e-5961.07Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERG
        MA+EEIL LFD FWFQ  +F  KPLL+T               SP+M++   RSQSEYLL S DFP P T      ++ +N+KL+T+LSG+VTEF G   
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGS-DFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERG

Query:  GKPAK---KKLGGNEEKL-RKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSE
        GK  K   KKL GNE K+ RK++ +GLSKSLSDLEFEELKGFMDLGFVFSEEDKN S+L SIIPGL RLG       +  EEKR E G    + RPYLSE
Subjt:  GKPAK---KKLGGNEEKL-RKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKN-SSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSE

Query:  AWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        AWEA +EENEK +LMKWRVP LG ATEMD+K HLKFWAHTVAST
Subjt:  AWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

A0A6J1D2N3 uncharacterized protein LOC1110167787.4e-128100Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD
        KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADD

Query:  EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
Subjt:  EENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

A0A6J1GKQ8 uncharacterized protein LOC1114548959.1e-7870.71Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL LFD FWFQ  VF GKPLL T+       +PEN  +SP+MQV++ RSQSEY L S+F  PETAYYS      N+KLQ ILSG+VTEFSG   G
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG+KT           EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVAST
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

A0A6J1I0F9 uncharacterized protein LOC1114683113.7e-7971.55Show/hide
Query:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG
        MA+EEIL+LFD FWFQ  VF G PLL T+       +PEN  +SP+MQV++ RSQSEY L S+F  PETAYYS     TN+KLQ ILSG+VTEFSG  GG
Subjt:  MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGG

Query:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD
        KPAKKK  G+E++ R++RGRGLSKSLSDLEFEELKGFMDLGFVFSEED KNSSLASIIPGLQRLG +T       EE  EE+G E GVSRPYLSEAW+A 
Subjt:  KPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEAD

Query:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
        +EE EKR LMKWRVPDLG ATEMDMKDHLKFWAHTVAST
Subjt:  DEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)3.9e-0430.17Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEM
        SKSL+D + E+L+G +DLGF FS  D+   L + +P L+     + +  ++K+ K  E  +      P L  A            +  W++   G   + 
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEM

Query:  DMKDHLKFWAHTVAST
        D+K  LK+WA  VA T
Subjt:  DMKDHLKFWAHTVAST

AT1G05870.2 Protein of unknown function (DUF1685)3.9e-0430.17Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEM
        SKSL+D + E+L+G +DLGF FS  D+   L + +P L+     + +  ++K+ K  E  +      P L  A            +  W++   G   + 
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEM

Query:  DMKDHLKFWAHTVAST
        D+K  LK+WA  VA T
Subjt:  DMKDHLKFWAHTVAST

AT2G31560.1 Protein of unknown function (DUF1685)6.0e-0528.03Show/hide
Query:  GERGGKPAKKKLGGNEEKLRKRRGRGL--------------SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEK
        G+ GG+      G + +KL K++ + L              +KSL+D + EELKG +DLGF FS  D+   L + +P L+     + +  ++K++   + 
Subjt:  GERGGKPAKKKLGGNEEKLRKRRGRGL--------------SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEK

Query:  GNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
          E   S P  + A            +  W++   G   + D+K  LK+WA TVA T
Subjt:  GNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

AT2G31560.2 Protein of unknown function (DUF1685)6.0e-0528.03Show/hide
Query:  GERGGKPAKKKLGGNEEKLRKRRGRGL--------------SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEK
        G+ GG+      G + +KL K++ + L              +KSL+D + EELKG +DLGF FS  D+   L + +P L+     + +  ++K++   + 
Subjt:  GERGGKPAKKKLGGNEEKLRKRRGRGL--------------SKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEK

Query:  GNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST
          E   S P  + A            +  W++   G   + D+K  LK+WA TVA T
Subjt:  GNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAATEMDMKDHLKFWAHTVAST

AT2G42760.1 unknown protein6.1e-3440.66Show/hide
Query:  MASEEILTLFDQFWFQHVVFA-------GKPLLRTQRATKPSNAPENSSES----PIMQVIQGRSQSEYLLGSDFPAPETAYYSTGS-------------
        MA EE+L LF+Q W +  +F        GK   R +R  K         E+    P+  +++     E ++ +   + +T+ +S+ S             
Subjt:  MASEEILTLFDQFWFQHVVFA-------GKPLLRTQRATKPSNAPENSSES----PIMQVIQGRSQSEYLLGSDFPAPETAYYSTGS-------------

Query:  IPTNRKLQTILSG-EVTEFS-GER----GGKPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRK-TG
         PT  KLQTILSG EV  F+  ER      K  ++K    +  +R R+G    KS+SDLE+EELKGFMDLGFVFSE+D K+S L SI+PGLQRL +K  G
Subjt:  IPTNRKLQTILSG-EVTEFS-GER----GGKPAKKKLGGNEEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEED-KNSSLASIIPGLQRLGRK-TG

Query:  ENEEEKEEKREEKGNEIGVSRPYLSEAWE-ADDEENEKRIL--MKWRVPDLGAATEMDMKDHLKFWAHTVAST
          +EE+EE+ E+K      +RPYLSEAW+     + +K+I   +KWRVP   AA+E+D+KD+L+ WAH VAST
Subjt:  ENEEEKEEKREEKGNEIGVSRPYLSEAWE-ADDEENEKRIL--MKWRVPDLGAATEMDMKDHLKFWAHTVAST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCCGAAGAAATCCTCACCCTTTTCGACCAATTCTGGTTTCAGCACGTAGTTTTCGCCGGAAAACCGCTTTTACGGACCCAACGTGCAACAAAACCCTCCAATGC
GCCGGAAAACAGCTCTGAGAGTCCAATAATGCAAGTGATTCAGGGGAGATCTCAGAGCGAGTATCTTCTGGGCTCGGACTTCCCAGCTCCCGAAACCGCTTATTACTCCA
CCGGCTCGATTCCGACCAATAGAAAACTGCAAACCATTCTTTCCGGCGAAGTAACAGAATTTTCCGGCGAAAGAGGGGGGAAGCCGGCGAAGAAGAAATTGGGAGGGAAT
GAAGAGAAATTAAGGAAGAGAAGAGGAAGGGGGTTGAGTAAGAGCTTGTCGGACCTTGAATTTGAGGAATTGAAGGGGTTTATGGATTTGGGATTTGTGTTCTCCGAGGA
AGATAAGAATTCAAGCTTGGCTTCCATAATTCCTGGGCTGCAGAGATTGGGGAGAAAAACAGGGGAAAATGAAGAGGAAAAAGAGGAAAAAAGAGAGGAAAAAGGGAATG
AAATTGGGGTTTCGAGGCCATATTTGTCTGAAGCATGGGAGGCTGATGATGAAGAAAATGAAAAGAGAATTCTGATGAAATGGAGAGTTCCAGATTTGGGTGCTGCCACT
GAAATGGACATGAAAGATCATCTCAAGTTTTGGGCTCATACAGTTGCTTCAACTCAAGTGGAAATACAATATGAACTGCACTTCATAATAGTGGATTCTAAAGCAGGGCA
CTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCCGAAGAAATCCTCACCCTTTTCGACCAATTCTGGTTTCAGCACGTAGTTTTCGCCGGAAAACCGCTTTTACGGACCCAACGTGCAACAAAACCCTCCAATGC
GCCGGAAAACAGCTCTGAGAGTCCAATAATGCAAGTGATTCAGGGGAGATCTCAGAGCGAGTATCTTCTGGGCTCGGACTTCCCAGCTCCCGAAACCGCTTATTACTCCA
CCGGCTCGATTCCGACCAATAGAAAACTGCAAACCATTCTTTCCGGCGAAGTAACAGAATTTTCCGGCGAAAGAGGGGGGAAGCCGGCGAAGAAGAAATTGGGAGGGAAT
GAAGAGAAATTAAGGAAGAGAAGAGGAAGGGGGTTGAGTAAGAGCTTGTCGGACCTTGAATTTGAGGAATTGAAGGGGTTTATGGATTTGGGATTTGTGTTCTCCGAGGA
AGATAAGAATTCAAGCTTGGCTTCCATAATTCCTGGGCTGCAGAGATTGGGGAGAAAAACAGGGGAAAATGAAGAGGAAAAAGAGGAAAAAAGAGAGGAAAAAGGGAATG
AAATTGGGGTTTCGAGGCCATATTTGTCTGAAGCATGGGAGGCTGATGATGAAGAAAATGAAAAGAGAATTCTGATGAAATGGAGAGTTCCAGATTTGGGTGCTGCCACT
GAAATGGACATGAAAGATCATCTCAAGTTTTGGGCTCATACAGTTGCTTCAACTCAAGTGGAAATACAATATGAACTGCACTTCATAATAGTGGATTCTAAAGCAGGGCA
CTGA
Protein sequenceShow/hide protein sequence
MASEEILTLFDQFWFQHVVFAGKPLLRTQRATKPSNAPENSSESPIMQVIQGRSQSEYLLGSDFPAPETAYYSTGSIPTNRKLQTILSGEVTEFSGERGGKPAKKKLGGN
EEKLRKRRGRGLSKSLSDLEFEELKGFMDLGFVFSEEDKNSSLASIIPGLQRLGRKTGENEEEKEEKREEKGNEIGVSRPYLSEAWEADDEENEKRILMKWRVPDLGAAT
EMDMKDHLKFWAHTVASTQVEIQYELHFIIVDSKAGH