; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015066 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015066
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function (DUF1645)
Genome locationscaffold2:922352..923032
RNA-Seq ExpressionMS015066
SyntenyMS015066
Gene Ontology termsNA
InterPro domainsIPR012442 - Protein of unknown function DUF1645, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN33835.1 hypothetical protein [Cucumis melo subsp. melo]1.8e-4757.21Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+                    AAP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+              ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

XP_011656255.1 uncharacterized protein LOC105435701 [Cucumis sativus]1.7e-5058.6Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+  T  AP              AP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+ K            ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

XP_022151187.1 uncharacterized protein LOC111019169 [Momordica charantia]6.7e-11997.83Show/hide
Query:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE
        DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE
Subjt:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE

Query:  AAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPY
        AAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGG EAKLKANGGS SGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPY
Subjt:  AAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPY

Query:  RSNLVGFFTSV---NNNNNNNGLTRNPHSF
        RSNLVGFFTSV   NNNNNNNGLTRNPHSF
Subjt:  RSNLVGFFTSV---NNNNNNNGLTRNPHSF

XP_023542069.1 uncharacterized protein LOC111802044 [Cucurbita pepo subsp. pepo]4.9e-4557.99Show/hide
Query:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE
        ++ DDFSF+  NPD SPISAEDAF+NGQIR VFP      L  + K +E      P  VRP   LK LFM  EEL S T++ + V+P+   EWS      
Subjt:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE

Query:  AAP-ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLP
        AAP  LGKKS+STGFSKLWRFG++IRRSSSDGKEAFVFLRS SS  G E   +   G       G+RTKG  ETASCYHER Y RNRAE E+NKRKS+LP
Subjt:  AAP-ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLP

Query:  YRSNLVGFFTSVNNNNNNN
        YRSNL+GFF   N  +N N
Subjt:  YRSNLVGFFTSVNNNNNNN

XP_038891720.1 uncharacterized protein LOC120081118 [Benincasa hispida]1.4e-4760.65Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLF-SQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAA
        DDFSFV  NPDGSPISAEDAF+NGQIRPVFP+FDQR+L     + +ETTS      VRP   LKKLFMED  + S T+  TR +              +A
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLF-SQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAA

Query:  PELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYR
        P LGKKSYSTGFSKLWRFGD+I RSSS+GK EAF+FLRS SS  G        GG          TKG  ETAS YHERHYARNRAE+E+NKRKSYLPYR
Subjt:  PELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYR

Query:  SNLVGFFTSVNNNNNN
        SNL+GFFT+ N  N N
Subjt:  SNLVGFFTSVNNNNNN

TrEMBL top hitse value%identityAlignment
A0A0A0KQ75 Uncharacterized protein8.4e-5158.6Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+  T  AP              AP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+ K            ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

A0A5A7SYN7 Uncharacterized protein8.7e-4857.21Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+                    AAP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+              ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

A0A6J1DCT2 uncharacterized protein LOC1110191693.2e-11997.83Show/hide
Query:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE
        DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE
Subjt:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE

Query:  AAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPY
        AAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGG EAKLKANGGS SGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPY
Subjt:  AAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPY

Query:  RSNLVGFFTSV---NNNNNNNGLTRNPHSF
        RSNLVGFFTSV   NNNNNNNGLTRNPHSF
Subjt:  RSNLVGFFTSV---NNNNNNNGLTRNPHSF

A0A6J1HSU3 uncharacterized protein LOC1114663163.1e-4557.14Show/hide
Query:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE
        ++ DDFSF+  NPD SPISAEDAF+NGQIR VFP      +  + K +E      P  VRP   LK LFM  EEL S T++ + V+P+   EWS      
Subjt:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGE

Query:  AAP-ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLP
        A P  LGKKS+STGFSKLWRFG++IRRSSSDGKEAFVFLRS SSG G E   +   G       G RTKG  ETASCYHER Y RNRAE E+NKRKS+LP
Subjt:  AAP-ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLP

Query:  YRSNLVGFFTSVNNNNN
        YRSN++GFF+  N  +N
Subjt:  YRSNLVGFFTSVNNNNN

E5GBJ4 Uncharacterized protein8.7e-4857.21Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        DDFSFVC NPDGSPI AEDAF+NGQIRPVFP+FDQR+L        TTS           QLK LF+E+      T+                    AAP
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS
         LGKKSYSTGFSKLWRFGD+IRRSSS+GK EAF+FLRS SSG G +A+              ++ K + ETAS YHERHYARNRAENE+NKRKSYLPYRS
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGK-EAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRS

Query:  NLVGFFTSVNNNNNN
        NL+GFFT+ N  N N
Subjt:  NLVGFFTSVNNNNNN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G23710.1 Protein of unknown function (DUF1645)1.2e-3641.7Show/hide
Query:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPE---IYCEWSPRK
        ++ ++FSF C N +GSPI+A++AF +GQIRPVFP+F++ LLF + + E+  +  + V    + +L+KLF+ED     D ++      E    YC W+   
Subjt:  DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPE---IYCEWSPRK

Query:  VGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFL---------RSPSSGGGVEAK--------LKANGGSRSGSGSGRRTKGERETASCYHER
        V EA+PE  +KS STGFSKLWRF D + RS+SDG++AFVFL         RS SS     A+         K  G  ++ + S   TK +  T    HE+
Subjt:  VGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFL---------RSPSSGGGVEAK--------LKANGGSRSGSGSGRRTKGERETASCYHER

Query:  HYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNGLTRNPHSF
         Y RNRA  E  K +SYLPY+   VGFFT+V      NGL+RN H F
Subjt:  HYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNGLTRNPHSF

AT1G70420.1 Protein of unknown function (DUF1645)3.1e-3743.04Show/hide
Query:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP
        +DFSF   N D SPI+A++AF +GQIRPV+P+F++ + F   + E+T   P          LKKLF+E      + ++   V P  YC W+ R V +A+P
Subjt:  DDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAP

Query:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKA--NGGSRSGSGSGRRTKGERE----TASCYHERHYARNRAENEMNKRKSY
        E  +KS STGFSKLWRF D + RS+SDGK+AFVFL + SS     +   A  +G  +S      +TK +++         HE+ Y RNRA  E  KR+SY
Subjt:  ELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKA--NGGSRSGSGSGRRTKGERE----TASCYHERHYARNRAENEMNKRKSY

Query:  LPYRSNLVGFFTSVNNNNNNNGLTRNPHSF
        LPY+   VGFFT+V      NGLTRN H +
Subjt:  LPYRSNLVGFFTSVNNNNNNNGLTRNPHSF

AT3G27880.1 Protein of unknown function (DUF1645)7.3e-1532.29Show/hide
Query:  VFPVFDQRLLFSQTKAEETTSL-PLPVRVR-PQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSP-RKVGEAAPELG-KKSYSTGFS-------KLWRF
        VFPVF++ L+      E+   L  L +R R  Q   ++ +    +   +  +   +  EIYC W+P R   + +P  G +KS STG S       K WR 
Subjt:  VFPVFDQRLLFSQTKAEETTSL-PLPVRVR-PQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSP-RKVGEAAPELG-KKSYSTGFS-------KLWRF

Query:  GDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVN
         D ++RS SDGK++  FL               N      S   ++      T S  HE+ Y RN+A  E +KRKSYLPY+ +LVG F++++
Subjt:  GDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVN

AT5G14730.1 unknown protein2.0e-0432.03Show/hide
Query:  SDTQDLTRVAP-EIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFV-----------FLRSPSSGGGVEAKLKANGGSRSGSGSG
        SD+ D   ++P + YC WSP +    +P       S G S+  R  + +RRS SDG  + V             RS S GGG        G S S SG+G
Subjt:  SDTQDLTRVAP-EIYCEWSPRKVGEAAPELGKKSYSTGFSKLWRFGDRIRRSSSDGKEAFV-----------FLRSPSSGGGVEAKLKANGGSRSGSGSG

Query:  RR-TKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNN
            KG  +TA+             +   +RKSYLPYR +L+G F  +    N
Subjt:  RR-TKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNN

AT5G62770.1 Protein of unknown function (DUF1645)2.3e-1633.47Show/hide
Query:  DDYDDFSFVC-SNPDGSPI-SAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDT-----QDLTRVAPEIYCEW
        D+  DF+F C SN    P+ +A++ F NGQIRP+ P      + SQ  ++ TT LP P R RP   L+KL  ED +  S++     +DLT V PE YC W
Subjt:  DDYDDFSFVC-SNPDGSPI-SAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDT-----QDLTRVAPEIYCEW

Query:  SPRKVGEAAPELGK----------KSYSTGFSKLWRFGDRIR-RSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHY
         P++      +L +          KS+S GFSK W+  + +  RSSS+G +  VF           A +K N  + S        + E E  S       
Subjt:  SPRKVGEAAPELGK----------KSYSTGFSKLWRFGDRIR-RSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHY

Query:  ARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNGLTRNPHSF
         R R E    KR++Y+PYR +++G   +V      NGL+R+   F
Subjt:  ARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNGLTRNPHSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATGACTATGATGATTTTTCTTTTGTTTGTTCAAATCCGGACGGCTCTCCGATCAGTGCAGAAGATGCCTTCGTTAACGGCCAGATTCGCCCCGTCTTCCCCGTGTTCGA
CCAACGCCTTCTGTTTTCCCAGACGAAGGCGGAGGAAACGACGTCGCTTCCGCTTCCGGTTCGAGTTCGACCCCAGGCCCAGTTGAAGAAGCTATTCATGGAAGACGAAG
AACTTCGTTCGGATACTCAGGATTTGACACGTGTTGCGCCGGAAATATACTGTGAGTGGTCCCCGCGGAAGGTGGGGGAGGCGGCGCCGGAGCTGGGGAAGAAGAGCTAC
TCCACGGGGTTCTCGAAGCTGTGGAGGTTCGGGGACAGGATTCGCCGGAGCAGCAGCGACGGCAAGGAGGCGTTCGTGTTCTTGAGAAGTCCGTCGTCGGGCGGCGGCGT
CGAGGCGAAGTTAAAGGCGAATGGAGGGTCGAGGTCAGGGTCAGGGTCAGGGAGGCGGACGAAAGGGGAAAGGGAAACGGCGTCGTGTTATCACGAGCGGCATTACGCGA
GAAATAGAGCGGAGAACGAGATGAATAAACGGAAATCGTATCTGCCGTACAGGAGTAATCTCGTGGGCTTCTTCACCAGCGTTAATAATAATAATAATAATAATGGTCTC
ACAAGAAATCCTCACTCCTTC
mRNA sequenceShow/hide mRNA sequence
GATGACTATGATGATTTTTCTTTTGTTTGTTCAAATCCGGACGGCTCTCCGATCAGTGCAGAAGATGCCTTCGTTAACGGCCAGATTCGCCCCGTCTTCCCCGTGTTCGA
CCAACGCCTTCTGTTTTCCCAGACGAAGGCGGAGGAAACGACGTCGCTTCCGCTTCCGGTTCGAGTTCGACCCCAGGCCCAGTTGAAGAAGCTATTCATGGAAGACGAAG
AACTTCGTTCGGATACTCAGGATTTGACACGTGTTGCGCCGGAAATATACTGTGAGTGGTCCCCGCGGAAGGTGGGGGAGGCGGCGCCGGAGCTGGGGAAGAAGAGCTAC
TCCACGGGGTTCTCGAAGCTGTGGAGGTTCGGGGACAGGATTCGCCGGAGCAGCAGCGACGGCAAGGAGGCGTTCGTGTTCTTGAGAAGTCCGTCGTCGGGCGGCGGCGT
CGAGGCGAAGTTAAAGGCGAATGGAGGGTCGAGGTCAGGGTCAGGGTCAGGGAGGCGGACGAAAGGGGAAAGGGAAACGGCGTCGTGTTATCACGAGCGGCATTACGCGA
GAAATAGAGCGGAGAACGAGATGAATAAACGGAAATCGTATCTGCCGTACAGGAGTAATCTCGTGGGCTTCTTCACCAGCGTTAATAATAATAATAATAATAATGGTCTC
ACAAGAAATCCTCACTCCTTC
Protein sequenceShow/hide protein sequence
DDYDDFSFVCSNPDGSPISAEDAFVNGQIRPVFPVFDQRLLFSQTKAEETTSLPLPVRVRPQAQLKKLFMEDEELRSDTQDLTRVAPEIYCEWSPRKVGEAAPELGKKSY
STGFSKLWRFGDRIRRSSSDGKEAFVFLRSPSSGGGVEAKLKANGGSRSGSGSGRRTKGERETASCYHERHYARNRAENEMNKRKSYLPYRSNLVGFFTSVNNNNNNNGL
TRNPHSF