; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS015856 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS015856
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtamine P1 family protein
Genome locationscaffold943_2:462283..463134
RNA-Seq ExpressionMS015856
SyntenyMS015856
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143963.1 uncharacterized protein LOC111013748 [Momordica charantia]1.9e-14798.24Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAKSAARCRWIRTVLAFNRRHCRTLW
        MKQ+MKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSA+SAARCRWIRTVLAFNRRHCRTLW
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAKSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
        NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDS+EEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD

Query:  GGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNK
        GGNRATSQIES+NSIGEGISNSVNSKESKIDEEKVGGS RRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNK
Subjt:  GGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNK

XP_022925671.1 uncharacterized protein LOC111433021 [Cucurbita moschata]1.2e-6962.91Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL
        MKQS+KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN VAIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRR CRT 
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F  NRE RRKS  SI++SRV +EAEDSE+ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS     Y  S  RSD T    GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------KNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG
        +TE +  N A S+IES          +NS GE   +SVN+KESKI+E  +  S R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------KNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG

XP_022977257.1 uncharacterized protein LOC111477626 [Cucurbita maxima]6.1e-6962.73Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL
        MKQS+KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN V IETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER
        WN    + F    E RRKS  SI++SRV +EAEDSE+ +E  G  ARDA FASS PSPPKNALILTRCRSAP RSS  G   RS     +G G       
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER

Query:  DGGNRATSQIESK----------NSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG
          GNRA S+IES+          NS GE   +SVN+KESKI+E  +  S R L L RCKSEPGRI EKLYG
Subjt:  DGGNRATSQIESK----------NSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG

XP_023543942.1 uncharacterized protein LOC111803666 [Cucurbita pepo subsp. pepo]1.9e-7062.91Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL
        MKQS+KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN VAIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F   RE RRKS  SI++SRV +EAEDSE+ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS  G  Y  S  RSD      GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------KNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG
        +TE + GN A S+ ES          +NS GE   +SVN+KESKI+E  +  S R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------KNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG

XP_038882779.1 uncharacterized protein LOC120073931 [Benincasa hispida]2.5e-7563.67Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAKSAARCRWIRTVLAFNRRHCRTLW
        MK+  K ISSP+RTD FPPPLM+FL+AD G+RS+SGRSRSSP+FV KKNVVAIETQEPSSPKVTCMGQVRA  S+   AARCRWIR+VL+FNRR+CRT W
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAKSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTG-EERRTER
        N    + F R  E RRKS  SI +SRVGNEAEDS E+DEE   GARDA F+SS+PSPPKNALILTRCRSAP R+S  G  YRS P  SDG+G EE + E 
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTG-EERRTER

Query:  DGGNRATSQIESK----------NSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREE
        D GN A S+IE +          N+ G+G    V+ KE  ++E+ +    R+L L RCKSEP RIAEK+YGELNLREE
Subjt:  DGGNRATSQIESK----------NSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREE

TrEMBL top hitse value%identityAlignment
A0A1S3BN59 uncharacterized protein LOC1034916515.0e-6962.13Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKS-AARCRWIRTVLAFNRRHCRT
        MKQ  K ISSP+RTD FPPPLM+FL+AD G+RS+S RSRSSP+F+RKKN VAIET+EPSSPKVTCMGQVR  +RSS K+ A RCRWIR+VL+FNRRHCRT
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKS-AARCRWIRTVLAFNRRHCRT

Query:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFA-SSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT--GEER
         WN    ++F   RE RR     IS+SRVGNEAEDSE+++E+     RDA +A SS+PSPPKNALILTRCRS P RSS +   YRSS   SDG+   EE 
Subjt:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFA-SSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT--GEER

Query:  RTERDGGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREE
        +TER  GN   S+IE +NS  E +   + S +   D + V G+ R L L RCKSEP RIAEKLYGELNL+EE
Subjt:  RTERDGGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREE

A0A5A7TVU1 Uncharacterized protein5.0e-6962.13Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKS-AARCRWIRTVLAFNRRHCRT
        MKQ  K ISSP+RTD FPPPLM+FL+AD G+RS+S RSRSSP+F+RKKN VAIET+EPSSPKVTCMGQVR  +RSS K+ A RCRWIR+VL+FNRRHCRT
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKS-AARCRWIRTVLAFNRRHCRT

Query:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFA-SSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT--GEER
         WN    ++F   RE RR     IS+SRVGNEAEDSE+++E+     RDA +A SS+PSPPKNALILTRCRS P RSS +   YRSS   SDG+   EE 
Subjt:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFA-SSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT--GEER

Query:  RTERDGGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREE
        +TER  GN   S+IE +NS  E +   + S +   D + V G+ R L L RCKSEP RIAEKLYGELNL+EE
Subjt:  RTERDGGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREE

A0A6J1CS06 uncharacterized protein LOC1110137489.3e-14898.24Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAKSAARCRWIRTVLAFNRRHCRTLW
        MKQ+MKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSA+SAARCRWIRTVLAFNRRHCRTLW
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAKSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
        NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDS+EEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD

Query:  GGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNK
        GGNRATSQIES+NSIGEGISNSVNSKESKIDEEKVGGS RRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNK
Subjt:  GGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNK

A0A6J1ECV0 uncharacterized protein LOC1114330215.9e-7062.91Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL
        MKQS+KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN VAIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRR CRT 
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F  NRE RRKS  SI++SRV +EAEDSE+ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS     Y  S  RSD T    GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------KNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG
        +TE +  N A S+IES          +NS GE   +SVN+KESKI+E  +  S R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------KNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG

A0A6J1IHY6 uncharacterized protein LOC1114776262.9e-6962.73Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL
        MKQS+KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN V IETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAKSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER
        WN    + F    E RRKS  SI++SRV +EAEDSE+ +E  G  ARDA FASS PSPPKNALILTRCRSAP RSS  G   RS     +G G       
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER

Query:  DGGNRATSQIESK----------NSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG
          GNRA S+IES+          NS GE   +SVN+KESKI+E  +  S R L L RCKSEPGRI EKLYG
Subjt:  DGGNRATSQIESK----------NSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37100.1 protamine P1 family protein7.8e-2235.76Show/hide
Query:  SMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNV-VAIETQEPSSPKVTCMGQVRARRSS----------------AKSAARCRWIR
        S +P+SSP RT+  PP LM FL+    SRSRS RSR  P+F R+KN   A ETQEP+SPKVTCMGQVR  RS                  + + RC W++
Subjt:  SMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNV-VAIETQEPSSPKVTCMGQVRARRSS----------------AKSAARCRWIR

Query:  TVLAFN------RRHC-RTLWNGWKVLMFGR-NRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARD---AAFASSIPSPPKNALILTRCRSAPQRSS
             +      +  C   +W  WK       +++S ++SS S S+   G    + EE +E R E  ++   ++  S   +PP+NA +LTRCRSAP RS 
Subjt:  TVLAFN------RRHC-RTLWNGWKVLMFGR-NRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARD---AAFASSIPSPPKNALILTRCRSAPQRSS

Query:  VSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESKNSIGEGISNSVNS-KESKIDEE---KVGGSLRR-LNLKRCKSEPGRIAEKL
         S          ++    +R    +  N + S+ E K S+ E      +S +ES   EE    V GS R+ L L RC SEP R+  ++
Subjt:  VSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESKNSIGEGISNSVNS-KESKIDEE---KVGGSLRR-LNLKRCKSEPGRIAEKL

AT5G03110.1 FUNCTIONS IN: molecular_function unknown5.7e-2536.18Show/hide
Query:  MKQSMKPISSPNRTDYFPPPLMNFL--KADVGSRSRS-----GRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAK---------SAARCRW
        M  S +P+SSP R + +PPP M FL  K++ GS SRS     GRSR+SP+FVR+    A   QEPSSPKVTCMGQVR  RS  K         +  RC W
Subjt:  MKQSMKPISSPNRTDYFPPPLMNFL--KADVGSRSRS-----GRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAK---------SAARCRW

Query:  IRTVLAFN----RRHCRTLWNGWKVLMFG-RNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARD----AAFASSIPSPPKNALILTRCRSAPQRSS
        +R    +N    +    T W  W++  F    R  + K S    +S++ +   +S  E +E  EG  +      F S   +PP NAL+LTR RSAP RS 
Subjt:  IRTVLAFN----RRHCRTLWNGWKVLMFG-RNRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARD----AAFASSIPSPPKNALILTRCRSAPQRSS

Query:  VSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREEG
         S   +R     +    E ++  R     +   IE  N   E     V+  E +  + +    +R+  L R KSEP RI EK+   L   EEG
Subjt:  VSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESKNSIGEGISNSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAATCTATGAAACCGATCTCTAGCCCTAATCGGACCGATTACTTCCCGCCGCCATTGATGAACTTTTTGAAGGCCGATGTTGGAAGCCGGAGTAGAAGCGGCAG
GTCGCGTTCCAGCCCTATGTTCGTCAGGAAGAAGAACGTCGTCGCCATTGAAACTCAAGAGCCGTCTTCTCCCAAGGTTACCTGTATGGGCCAAGTCCGAGCCAGACGCT
CTTCCGCCAAAAGCGCCGCCAGATGCCGCTGGATTCGAACCGTATTGGCTTTCAATCGACGCCATTGTCGAACCTTGTGGAACGGGTGGAAGGTGCTGATGTTCGGAAGA
AATCGTGAAAGCAGACGAAAATCATCGATCTCGATCTCTCAATCTCGCGTTGGAAATGAAGCGGAAGATTCGGAGGAGGAAGATGAAGAAAGAGGCGAAGGAGCGAGAGA
TGCGGCGTTCGCGTCCTCGATTCCATCGCCGCCGAAGAACGCTCTCATTCTGACGAGGTGTAGATCTGCGCCGCAACGGTCGTCGGTTTCCGGCTATGGATACCGGAGTT
CGCCGGCGAGAAGCGACGGTACTGGAGAAGAGCGGAGAACAGAGCGCGACGGTGGAAACAGAGCCACGTCCCAAATTGAGTCAAAAAACTCAATCGGAGAAGGAATTTCT
AACTCTGTAAATAGCAAAGAAAGCAAAATCGATGAAGAAAAAGTAGGCGGCTCTTTGCGGCGGTTGAATCTGAAGAGGTGTAAATCGGAACCTGGTAGAATTGCAGAGAA
ACTTTACGGAGAATTGAATCTCCGGGAAGAAGGAAGTTCGTCGGGTATGGTAACGAACGATTCTTGCTTACCTAACAACAAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGCAATCTATGAAACCGATCTCTAGCCCTAATCGGACCGATTACTTCCCGCCGCCATTGATGAACTTTTTGAAGGCCGATGTTGGAAGCCGGAGTAGAAGCGGCAG
GTCGCGTTCCAGCCCTATGTTCGTCAGGAAGAAGAACGTCGTCGCCATTGAAACTCAAGAGCCGTCTTCTCCCAAGGTTACCTGTATGGGCCAAGTCCGAGCCAGACGCT
CTTCCGCCAAAAGCGCCGCCAGATGCCGCTGGATTCGAACCGTATTGGCTTTCAATCGACGCCATTGTCGAACCTTGTGGAACGGGTGGAAGGTGCTGATGTTCGGAAGA
AATCGTGAAAGCAGACGAAAATCATCGATCTCGATCTCTCAATCTCGCGTTGGAAATGAAGCGGAAGATTCGGAGGAGGAAGATGAAGAAAGAGGCGAAGGAGCGAGAGA
TGCGGCGTTCGCGTCCTCGATTCCATCGCCGCCGAAGAACGCTCTCATTCTGACGAGGTGTAGATCTGCGCCGCAACGGTCGTCGGTTTCCGGCTATGGATACCGGAGTT
CGCCGGCGAGAAGCGACGGTACTGGAGAAGAGCGGAGAACAGAGCGCGACGGTGGAAACAGAGCCACGTCCCAAATTGAGTCAAAAAACTCAATCGGAGAAGGAATTTCT
AACTCTGTAAATAGCAAAGAAAGCAAAATCGATGAAGAAAAAGTAGGCGGCTCTTTGCGGCGGTTGAATCTGAAGAGGTGTAAATCGGAACCTGGTAGAATTGCAGAGAA
ACTTTACGGAGAATTGAATCTCCGGGAAGAAGGAAGTTCGTCGGGTATGGTAACGAACGATTCTTGCTTACCTAACAACAAG
Protein sequenceShow/hide protein sequence
MKQSMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAKSAARCRWIRTVLAFNRRHCRTLWNGWKVLMFGR
NRESRRKSSISISQSRVGNEAEDSEEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESKNSIGEGIS
NSVNSKESKIDEEKVGGSLRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNK