; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g40370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g40370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF1645)
Genome locationchr9:30851742..30852500
RNA-Seq ExpressionMoc09g40370
SyntenyMoc09g40370
Gene Ontology termsNA
InterPro domainsIPR012442 - Protein of unknown function DUF1645, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33835.1 hypothetical protein [Cucumis melo subsp. melo]4.4e-4757.21Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+                    AAP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+ +            ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

XP_011656255.1 uncharacterized protein LOC105435701 [Cucumis sativus]9.6e-5058.6Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+  T  AP              AP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+ K            ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

XP_022151187.1 uncharacterized protein LOC111019169 [Momordica charantia]1.5e-135100Show/hide
Query:  MRDPVEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSD
        MRDPVEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSD
Subjt:  MRDPVEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSD

Query:  TQDLTRVAPEIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYH
        TQDLTRVAPEIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYH
Subjt:  TQDLTRVAPEIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYH

Query:  ERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF
        ERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF
Subjt:  ERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF

XP_023542069.1 uncharacterized protein LOC111802044 [Cucurbita pepo subsp. pepo]3.2e-4558.45Show/hide
Query:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE
        ++ DDFSF+  NPD SPISAEDAF+NGQIR VFP      L  + K +E      P  VRP   LK LFM  EEL S T++ + V+P+   EWS      
Subjt:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE

Query:  AAP-ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLP
        AAP  LGKKS+STGFSKLWRFG++IRRSSSDGKEAFVFLRS SS  GGE   +   G       G+RTKG  ETASCYHER Y RNRAE E+NKRKS+LP
Subjt:  AAP-ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLP

Query:  YRSNLVGFFTSVNNNNNNN
        YRSNL+GFF   N  +N N
Subjt:  YRSNLVGFFTSVNNNNNNN

XP_038891720.1 uncharacterized protein LOC120081118 [Benincasa hispida]4.0e-4859.72Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLF-SQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAA
        DDFSFV  NPDGSPISAEDAF+NGQIRPVFP+FDQR+L     + +ETTS      VRP   LKKLFMED  + S T+  TR +              +A
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLF-SQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAA

Query:  PELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYR
        P LGKKSYSTGFSKLWRFGD+I RSSS+GK EAF+FLRS SS          +GG G      ++T  + ETAS YHERHYARNRAE+E+NKRKSYLPYR
Subjt:  PELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYR

Query:  SNLVGFFTSVNNNNNN
        SNL+GFFT+ N  N N
Subjt:  SNLVGFFTSVNNNNNN

TrEMBL top hitse value%identityAlignment
A0A0A0KQ75 Uncharacterized protein4.6e-5058.6Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+  T  AP              AP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+ K            ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

A0A5A7SYN7 Uncharacterized protein2.2e-4757.21Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+                    AAP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+ +            ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

A0A6J1DCT2 uncharacterized protein LOC1110191697.2e-136100Show/hide
Query:  MRDPVEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSD
        MRDPVEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSD
Subjt:  MRDPVEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSD

Query:  TQDLTRVAPEIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYH
        TQDLTRVAPEIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYH
Subjt:  TQDLTRVAPEIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYH

Query:  ERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF
        ERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF
Subjt:  ERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF

A0A6J1HSU3 uncharacterized protein LOC1114663161.5e-4557.6Show/hide
Query:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE
        ++ DDFSF+  NPD SPISAEDAF+NGQIR VFP      +  + K +E      P  VRP   LK LFM  EEL S T++ + V+P+   EWS      
Subjt:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE

Query:  AAP-ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLP
        A P  LGKKS+STGFSKLWRFG++IRRSSSDGKEAFVFLRS SSG GGE   +   G       G RTKG  ETASCYHER Y RNRAE E+NKRKS+LP
Subjt:  AAP-ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLP

Query:  YRSNLVGFFTSVNNNNN
        YRSN++GFF+  N  +N
Subjt:  YRSNLVGFFTSVNNNNN

E5GBJ4 Uncharacterized protein2.2e-4757.21Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+                    AAP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+ +            ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G23710.1 Protein of unknown function (DUF1645)4.2e-3540.67Show/hide
Query:  VEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDL
        +EE V    D  EE       ++FSF C N +GSPI+A++AF +GQIRPVFP+F++ LLF + + E+  +  + V    + +L+KLF+ED     D ++ 
Subjt:  VEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDL

Query:  TRVAPE---IYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFL---------RSPSSGGGGEAK--------LKANGGSGSGSG
             E    YC W+   V EA+PE  +KS STGFSKLWRF D + RS+SDG++AFVFL         RS SS     A+         K  G   + + 
Subjt:  TRVAPE---IYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFL---------RSPSSGGGGEAK--------LKANGGSGSGSG

Query:  SGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF
        S   TK +  T    HE+ Y RNRA  E  K +SYLPY+   VGFFT+V         NGL+RN H F
Subjt:  SGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF

AT1G70420.1 Protein of unknown function (DUF1645)1.1e-3542.49Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        +DFSF   N D SPI+A++AF +GQIRPV+P+F++ + F   + E+T   P          LKKLF+E      + ++   V P  YC W+ R V +A+P
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKAN-GGSGSGSGSGRR-----TKGERETASCYHERHYARNRAENEMNKRKSY
        E  +KS STGFSKLWRF D + RS+SDGK+AFVFL + SS     +   A   G    S  G+       K ++      HE+ Y RNRA  E  KR+SY
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKAN-GGSGSGSGSGRR-----TKGERETASCYHERHYARNRAENEMNKRKSY

Query:  LPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF
        LPY+   VGFFT+V         NGLTRN H +
Subjt:  LPYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF

AT3G27880.1 Protein of unknown function (DUF1645)1.8e-1432.29Show/hide
Query:  VFPVFDQRLLFSQTKAEETTSL-PLPVRVR-PQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSP-RKVGEAAPELG-KKSYSTGFS-------KLWRF
        VFPVF++ L+      E+   L  L +R R  Q   ++ +    +   +  +   +  EIYC W+P R   + +P  G +KS STG S       K WR 
Subjt:  VFPVFDQRLLFSQTKAEETTSL-PLPVRVR-PQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSP-RKVGEAAPELG-KKSYSTGFS-------KLWRF

Query:  GDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVN
         D ++RS SDGK++  FL               N      S   ++      T S  HE+ Y RN+A  E +KRKSYLPY+ +LVG F++++
Subjt:  GDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVN

AT5G14730.1 unknown protein5.0e-0432.89Show/hide
Query:  SDTQDLTRVAP-EIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFV----------FLRSPSSGGGGEAKLKANGGSGSGSGSGR
        SD+ D   ++P + YC WSP +    +P       S G S+  R  + +RRS SDG  + V           LR   S GGGE      G S S SG+G 
Subjt:  SDTQDLTRVAP-EIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFV----------FLRSPSSGGGGEAKLKANGGSGSGSGSGR

Query:  R-TKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNN
           KG  +TA+             +   +RKSYLPYR +L+G F  +    N
Subjt:  R-TKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNN

AT5G62770.1 Protein of unknown function (DUF1645)4.3e-1633.91Show/hide
Query:  DDYDDFSFVC-SNPDGSPI-SAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDT-----QDLTRVAPEIYCEW
        D+  DF+F C SN    P+ +A++ F NGQIRP+ P      + SQ  ++ TT LP P R RP   L+KL  ED +  S++     +DLT V PE YC W
Subjt:  DDYDDFSFVC-SNPDGSPI-SAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDT-----QDLTRVAPEIYCEW

Query:  SPRKVGEAAPELGK----------KSYSTGFSKLWRFGDRIR-RSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHY
         P++      +L +          KS+S GFSK W+  + +  RSSS+G +  VF           A +K N  + S        + E E  S       
Subjt:  SPRKVGEAAPELGK----------KSYSTGFSKLWRFGDRIR-RSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHY

Query:  ARNRAENEMNKRKSYLPYRSNLVGFFTSVN
         R R E    KR++Y+PYR +++G   +VN
Subjt:  ARNRAENEMNKRKSYLPYRSNLVGFFTSVN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCGACCCGGTTGAGGAAGTTGTGAGCCTCGAAATGGATCCACCGGAGGAAATTTATGGCGCCGATGACTATGATGATTTTTCTTTTGTTTGTTCAAATCCGGACGG
CTCTCCGATCAGTGCAGAAGATGCCTTCGTTAACGGCCAGATTCGCCCCGTCTTCCCCGTGTTCGACCAACGCCTTCTGTTTTCCCAGACGAAGGCGGAGGAAACGACGT
CGCTTCCGCTTCCGGTTCGAGTTCGACCCCAGGCCCAGTTGAAGAAGCTATTCATGGAAGACGAAGAACTTCGTTCGGATACTCAGGATTTGACACGTGTTGCGCCGGAA
ATATACTGTGAGTGGTCCCCGCGGAAGGTGGGGGAGGCGGCGCCGGAGCTGGGGAAGAAGAGCTACTCCACGGGGTTCTCGAAGCTGTGGAGGTTCGGGGACAGGATTCG
CCGGAGCAGCAGCGACGGCAAGGAGGCGTTCGTGTTCTTGAGAAGTCCGTCGTCGGGCGGCGGCGGCGAGGCGAAGTTAAAGGCGAATGGAGGGTCGGGGTCGGGGTCAG
GGTCAGGGAGGCGGACGAAAGGGGAAAGGGAAACGGCGTCGTGTTATCACGAGCGGCATTACGCGAGAAATAGAGCGGAGAACGAGATGAATAAACGGAAATCGTATCTG
CCGTATAGGAGTAATCTCGTGGGCTTCTTCACCAGCGTTAATAATAATAATAATAATAATAATAATAATGGTCTCACAAGAAATCCTCACTCCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGCGACCCGGTTGAGGAAGTTGTGAGCCTCGAAATGGATCCACCGGAGGAAATTTATGGCGCCGATGACTATGATGATTTTTCTTTTGTTTGTTCAAATCCGGACGG
CTCTCCGATCAGTGCAGAAGATGCCTTCGTTAACGGCCAGATTCGCCCCGTCTTCCCCGTGTTCGACCAACGCCTTCTGTTTTCCCAGACGAAGGCGGAGGAAACGACGT
CGCTTCCGCTTCCGGTTCGAGTTCGACCCCAGGCCCAGTTGAAGAAGCTATTCATGGAAGACGAAGAACTTCGTTCGGATACTCAGGATTTGACACGTGTTGCGCCGGAA
ATATACTGTGAGTGGTCCCCGCGGAAGGTGGGGGAGGCGGCGCCGGAGCTGGGGAAGAAGAGCTACTCCACGGGGTTCTCGAAGCTGTGGAGGTTCGGGGACAGGATTCG
CCGGAGCAGCAGCGACGGCAAGGAGGCGTTCGTGTTCTTGAGAAGTCCGTCGTCGGGCGGCGGCGGCGAGGCGAAGTTAAAGGCGAATGGAGGGTCGGGGTCGGGGTCAG
GGTCAGGGAGGCGGACGAAAGGGGAAAGGGAAACGGCGTCGTGTTATCACGAGCGGCATTACGCGAGAAATAGAGCGGAGAACGAGATGAATAAACGGAAATCGTATCTG
CCGTATAGGAGTAATCTCGTGGGCTTCTTCACCAGCGTTAATAATAATAATAATAATAATAATAATAATGGTCTCACAAGAAATCCTCACTCCTTCTGA
Protein sequenceShow/hide protein sequence
MRDPVEEVVSLEMDPPEEIYGADDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPE
IYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGGEAKLKANGGSGSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYL
PYRSNLVGFFTSVNNNNNNNNNNGLTRNPHSF