; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g0510 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g0510
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionProtamine P1 family protein
Genome locationMC01:11542408..11543256
RNA-Seq ExpressionMC01g0510
SyntenyMC01g0510
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143963.1 uncharacterized protein LOC111013748 [Momordica charantia]5.98e-193100Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
        MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
        NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD

Query:  GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNN
        GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNN
Subjt:  GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNN

XP_022925671.1 uncharacterized protein LOC111433021 [Cucurbita moschata]1.36e-8862.55Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKNV AIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRR CRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F  NRE RRKSSI+  +SRV +EAEDS++ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS     Y  S  RSD T    GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
        +TE +  N A S+IES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

XP_022977257.1 uncharacterized protein LOC111477626 [Cucurbita maxima]5.42e-8862.36Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKNV  IETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER
        WN    + F    E RRKSSI+  +SRV +EAEDS++ +E  G  ARDA FASS PSPPKNALILTRCRSAP RSS  G   RS     +G G       
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER

Query:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
          GNRA S+IES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI EKLYG
Subjt:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

XP_023543942.1 uncharacterized protein LOC111803666 [Cucurbita pepo subsp. pepo]1.18e-8962.55Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKNV AIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F   RE RRKSSI+  +SRV +EAEDS++ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS  G  Y  S  RSD      GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
        +TE + GN A S+ ES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

XP_038882779.1 uncharacterized protein LOC120073931 [Benincasa hispida]6.14e-9764.03Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
        MK+  K ISSP+RTD FPPPLM+FL+AD G+RS+SGRSRSSP+FV KKNVVAIETQEPSSPKVTCMGQVRA  S+   AARCRWIR+VL+FNRR+CRT W
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEER-RTER
        N    + F R  E RRKSSI   +SRVGNEAEDS E+DEE   GARDA F+SS+PSPPKNALILTRCRSAP R+S  G  YRS P  SDG+GEE  + E 
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEER-RTER

Query:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE
        D GN A S+IE           EN+ G+G    V+ KE  ++E+ +   +R+L L RCKSEP RIAEK+YGELNLREE
Subjt:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE

TrEMBL top hitse value%identityAlignment
A0A1S3BN59 uncharacterized protein LOC1034916512.43e-8661.03Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQS--AARCRWIRTVLAFNRRHCRT
        MKQ  K ISSP+RTD FPPPLM+FL+AD G+RS+S RSRSSP+F+RKKNV AIET+EPSSPKVTCMGQVR  + S+    A RCRWIR+VL+FNRRHCRT
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQS--AARCRWIRTVLAFNRRHCRT

Query:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASS-IPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTG--EER
         WN   +L  G+ RE RR     IS+SRVGNEAEDS++++E+     RDA +ASS +PSPPKNALILTRCRS P RSS +   YRSS   SDG+   EE 
Subjt:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASS-IPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTG--EER

Query:  RTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE
        +TER  GN   S+IE  NS  E +   + S +   D + V G+ R L L RCKSEP RIAEKLYGELNL+EE
Subjt:  RTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE

A0A5A7TVU1 Uncharacterized protein3.83e-8661.03Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQS--AARCRWIRTVLAFNRRHCRT
        MKQ  K ISSP+RTD FPPPLM+FL+AD G+RS+S RSRSSP+F+RKKNV AIET+EPSSPKVTCMGQVR  + S+    A RCRWIR+VL+FNRRHCRT
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQS--AARCRWIRTVLAFNRRHCRT

Query:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASS-IPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTG--EER
         WN   +L  G+ RE RR     IS+SRVGNEAEDS++++E+     RDA +ASS +PSPPKNALILTRCRS P RSS +   YRSS   SDG+   EE 
Subjt:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASS-IPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTG--EER

Query:  RTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE
        +TER  GN   S+IE  NS  E +   + S +   D + V G+ R L L RCKSEP RIAEKLYGELNL+EE
Subjt:  RTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE

A0A6J1CS06 uncharacterized protein LOC1110137482.90e-193100Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
        MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
        NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD

Query:  GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNN
        GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNN
Subjt:  GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNN

A0A6J1ECV0 uncharacterized protein LOC1114330216.59e-8962.55Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKNV AIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRR CRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F  NRE RRKSSI+  +SRV +EAEDS++ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS     Y  S  RSD T    GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
        +TE +  N A S+IES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

A0A6J1IHY6 uncharacterized protein LOC1114776262.62e-8862.36Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKNV  IETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER
        WN    + F    E RRKSSI+  +SRV +EAEDS++ +E  G  ARDA FASS PSPPKNALILTRCRSAP RSS  G   RS     +G G       
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER

Query:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
          GNRA S+IES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI EKLYG
Subjt:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37100.1 protamine P1 family protein1.9e-2033.92Show/hide
Query:  KPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNV-VAIETQEPSSPKVTCMGQVRARRSS----------------AQSAARCRWIRTV
        +P+SSP RT+  PP LM FL+    SRSRS RSR  P+F R+KN   A ETQEP+SPKVTCMGQVR  RS                  + + RC W++  
Subjt:  KPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNV-VAIETQEPSSPKVTCMGQVRARRSS----------------AQSAARCRWIRTV

Query:  LAFN------RRHC-RTLWNGWKVLMFGR-NRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARD---AAFASSIPSPPKNALILTRCRSAPQRSSVS
           +      +  C   +W  WK       +++S ++SS S S+   G    + +E +E R E  ++   ++  S   +PP+NA +LTRCRSAP RS  S
Subjt:  LAFN------RRHC-RTLWNGWKVLMFGR-NRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARD---AAFASSIPSPPKNALILTRCRSAPQRSSVS

Query:  GYGYRSSPARSDGTGEERRTERDGGNRA----TSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRR-LNLKRCKSEPGRIAEKL
                  ++    +R    +  + +    TS  E+   + +    S  S+E K     V GS R+ L L RC SEP R+  ++
Subjt:  GYGYRSSPARSDGTGEERRTERDGGNRA----TSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRR-LNLKRCKSEPGRIAEKL

AT5G03110.1 FUNCTIONS IN: molecular_function unknown1.1e-2335.49Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFL--KADVGSRSRS-----GRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQ---------SAARCRW
        M  + +P+SSP R + +PPP M FL  K++ GS SRS     GRSR+SP+FVR+    A   QEPSSPKVTCMGQVR  RS  +         +  RC W
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFL--KADVGSRSRS-----GRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQ---------SAARCRW

Query:  IRTVLAFN----RRHCRTLWNGWKVLMFG-RNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARD----AAFASSIPSPPKNALILTRCRSAPQRSS
        +R    +N    +    T W  W++  F    R  + K S    +S++ +   +S  E +E  EG  +      F S   +PP NAL+LTR RSAP RS 
Subjt:  IRTVLAFN----RRHCRTLWNGWKVLMFG-RNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARD----AAFASSIPSPPKNALILTRCRSAPQRSS

Query:  VSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEG
         S   +R     +    E ++  R     +   IE  N   E     V+  E +  + +     R+  L R KSEP RI EK+   L   EEG
Subjt:  VSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAAACTATGAAACCGATCTCTAGCCCTAATCGGACCGATTACTTCCCGCCGCCATTGATGAACTTTTTGAAGGCCGATGTTGGAAGCCGGAGTAGAAGCGGCAG
GTCGCGTTCCAGCCCTATGTTCGTCAGGAAGAAGAACGTCGTCGCCATTGAAACTCAAGAGCCGTCTTCTCCCAAGGTTACCTGTATGGGCCAAGTCCGAGCCAGACGCT
CTTCCGCCCAAAGCGCCGCCAGATGCCGCTGGATTCGAACCGTATTGGCTTTCAATCGACGCCATTGTCGAACCTTGTGGAACGGGTGGAAGGTGCTGATGTTCGGAAGA
AATCGTGAAAGCAGACGAAAATCATCGATCTCGATCTCTCAATCTCGCGTTGGAAATGAAGCGGAAGATTCGAAGGAGGAAGATGAAGAAAGAGGCGAAGGAGCGAGAGA
TGCGGCGTTCGCGTCCTCGATTCCATCGCCGCCGAAGAACGCTCTCATTCTGACGAGGTGTAGATCTGCGCCGCAACGGTCGTCGGTTTCCGGCTATGGATACCGGAGTT
CGCCGGCGAGAAGCGACGGTACTGGAGAAGAGCGGAGAACAGAGCGCGATGGTGGAAACAGAGCCACGTCCCAAATTGAGTCTGAAAACTCAATCGGAGAAGGAATTTCT
AACTCTGTAAATAGCAAAGAAAGCAAAATCGATGAAGAAAAAGTAGGCGGCTCTTCGCGGCGGTTGAATCTGAAGAGGTGTAAATCGGAACCTGGTAGAATTGCAGAGAA
ACTTTACGGAGAATTGAATCTCCGGGAAGAAGGAAGTTCGTCGGGTATGGTAACGAACGATTCTTGCTTGCCTAACAAC
mRNA sequenceShow/hide mRNA sequence
ATGAAGCAAACTATGAAACCGATCTCTAGCCCTAATCGGACCGATTACTTCCCGCCGCCATTGATGAACTTTTTGAAGGCCGATGTTGGAAGCCGGAGTAGAAGCGGCAG
GTCGCGTTCCAGCCCTATGTTCGTCAGGAAGAAGAACGTCGTCGCCATTGAAACTCAAGAGCCGTCTTCTCCCAAGGTTACCTGTATGGGCCAAGTCCGAGCCAGACGCT
CTTCCGCCCAAAGCGCCGCCAGATGCCGCTGGATTCGAACCGTATTGGCTTTCAATCGACGCCATTGTCGAACCTTGTGGAACGGGTGGAAGGTGCTGATGTTCGGAAGA
AATCGTGAAAGCAGACGAAAATCATCGATCTCGATCTCTCAATCTCGCGTTGGAAATGAAGCGGAAGATTCGAAGGAGGAAGATGAAGAAAGAGGCGAAGGAGCGAGAGA
TGCGGCGTTCGCGTCCTCGATTCCATCGCCGCCGAAGAACGCTCTCATTCTGACGAGGTGTAGATCTGCGCCGCAACGGTCGTCGGTTTCCGGCTATGGATACCGGAGTT
CGCCGGCGAGAAGCGACGGTACTGGAGAAGAGCGGAGAACAGAGCGCGATGGTGGAAACAGAGCCACGTCCCAAATTGAGTCTGAAAACTCAATCGGAGAAGGAATTTCT
AACTCTGTAAATAGCAAAGAAAGCAAAATCGATGAAGAAAAAGTAGGCGGCTCTTCGCGGCGGTTGAATCTGAAGAGGTGTAAATCGGAACCTGGTAGAATTGCAGAGAA
ACTTTACGGAGAATTGAATCTCCGGGAAGAAGGAAGTTCGTCGGGTATGGTAACGAACGATTCTTGCTTGCCTAACAAC
Protein sequenceShow/hide protein sequence
MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLWNGWKVLMFGR
NRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESENSIGEGIS
NSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNN