; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g21960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g21960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtamine P1 family protein
Genome locationchr1:15243251..15244117
RNA-Seq ExpressionMoc01g21960
SyntenyMoc01g21960
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143963.1 uncharacterized protein LOC111013748 [Momordica charantia]4.5e-152100Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
        MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
        NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD

Query:  GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNKRLND
        GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNKRLND
Subjt:  GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNKRLND

XP_022925671.1 uncharacterized protein LOC111433021 [Cucurbita moschata]1.2e-6962.55Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN VAIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRR CRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F  NRE RRKS  SI++SRV +EAEDS++ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS     Y  S  RSD T    GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
        +TE +  N A S+IES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

XP_022977257.1 uncharacterized protein LOC111477626 [Cucurbita maxima]8.1e-6962.36Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN V IETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER
        WN    + F    E RRKS  SI++SRV +EAEDS++ +E  G  ARDA FASS PSPPKNALILTRCRSAP RSS  G   RS     +G G       
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER

Query:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
          GNRA S+IES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI EKLYG
Subjt:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

XP_023543942.1 uncharacterized protein LOC111803666 [Cucurbita pepo subsp. pepo]1.9e-7062.55Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN VAIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F   RE RRKS  SI++SRV +EAEDS++ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS  G  Y  S  RSD      GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
        +TE + GN A S+ ES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

XP_038882779.1 uncharacterized protein LOC120073931 [Benincasa hispida]5.2e-7664.03Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
        MK+  K ISSP+RTD FPPPLM+FL+AD G+RS+SGRSRSSP+FV KKNVVAIETQEPSSPKVTCMGQVRA  S+   AARCRWIR+VL+FNRR+CRT W
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTG-EERRTER
        N    + F R  E RRKS  SI +SRVGNEAEDS E+DEE   GARDA F+SS+PSPPKNALILTRCRSAP R+S  G  YRS P  SDG+G EE + E 
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTG-EERRTER

Query:  DGGNRATSQIE----------SENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE
        D GN A S+IE           EN+ G+G    V+ KE  ++E+ +   +R+L L RCKSEP RIAEK+YGELNLREE
Subjt:  DGGNRATSQIE----------SENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE

TrEMBL top hitse value%identityAlignment
A0A1S3BN59 uncharacterized protein LOC1034916513.3e-6861.03Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA--RRSSAQSAARCRWIRTVLAFNRRHCRT
        MKQ  K ISSP+RTD FPPPLM+FL+AD G+RS+S RSRSSP+F+RKKN VAIET+EPSSPKVTCMGQVR   R S+   A RCRWIR+VL+FNRRHCRT
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA--RRSSAQSAARCRWIRTVLAFNRRHCRT

Query:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFA-SSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT--GEER
         WN    ++F   RE RR     IS+SRVGNEAEDS++++E+     RDA +A SS+PSPPKNALILTRCRS P RSS +   YRSS   SDG+   EE 
Subjt:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFA-SSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT--GEER

Query:  RTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE
        +TER  GN   S+IE  NS  E +   + S +   D + V G +R L L RCKSEP RIAEKLYGELNL+EE
Subjt:  RTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE

A0A5A7TVU1 Uncharacterized protein3.3e-6861.03Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA--RRSSAQSAARCRWIRTVLAFNRRHCRT
        MKQ  K ISSP+RTD FPPPLM+FL+AD G+RS+S RSRSSP+F+RKKN VAIET+EPSSPKVTCMGQVR   R S+   A RCRWIR+VL+FNRRHCRT
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA--RRSSAQSAARCRWIRTVLAFNRRHCRT

Query:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFA-SSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT--GEER
         WN    ++F   RE RR     IS+SRVGNEAEDS++++E+     RDA +A SS+PSPPKNALILTRCRS P RSS +   YRSS   SDG+   EE 
Subjt:  LWNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFA-SSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT--GEER

Query:  RTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE
        +TER  GN   S+IE  NS  E +   + S +   D + V G +R L L RCKSEP RIAEKLYGELNL+EE
Subjt:  RTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREE

A0A6J1CS06 uncharacterized protein LOC1110137482.2e-152100Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
        MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLW

Query:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
        NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD
Subjt:  NGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERD

Query:  GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNKRLND
        GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNKRLND
Subjt:  GGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNKRLND

A0A6J1ECV0 uncharacterized protein LOC1114330216.0e-7062.55Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN VAIETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRR CRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER
        WN    + F  NRE RRKS  SI++SRV +EAEDS++ +E  G  ARD  FASS PSPPKNALILTRCRSAP RSS     Y  S  RSD T    GE  
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGT----GEER

Query:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
        +TE +  N A S+IES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI E+LYG
Subjt:  RTERDGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

A0A6J1IHY6 uncharacterized protein LOC1114776263.9e-6962.36Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL
        MKQ++KPISSP+R D FPPPLM+FL+AD G+RS+SGRSRSSP+F+RKKN V IETQEPSSPKVTCMGQVR  +RSS   A RCRWIR+VL+FNRRHCRT 
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRA-RRSSAQSAARCRWIRTVLAFNRRHCRTL

Query:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER
        WN    + F    E RRKS  SI++SRV +EAEDS++ +E  G  ARDA FASS PSPPKNALILTRCRSAP RSS  G   RS     +G G       
Subjt:  WNGWKVLMFGRNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTER

Query:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG
          GNRA S+IES          ENS GE   +SVN+KESKI+E  +  S+R L L RCKSEPGRI EKLYG
Subjt:  DGGNRATSQIES----------ENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G37100.1 protamine P1 family protein1.9e-2033.92Show/hide
Query:  KPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNV-VAIETQEPSSPKVTCMGQVRARRSS----------------AQSAARCRWIRTV
        +P+SSP RT+  PP LM FL+    SRSRS RSR  P+F R+KN   A ETQEP+SPKVTCMGQVR  RS                  + + RC W++  
Subjt:  KPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNV-VAIETQEPSSPKVTCMGQVRARRSS----------------AQSAARCRWIRTV

Query:  LAFN------RRHC-RTLWNGWKVLMFGR-NRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARD---AAFASSIPSPPKNALILTRCRSAPQRSSVS
           +      +  C   +W  WK       +++S ++SS S S+   G    + +E +E R E  ++   ++  S   +PP+NA +LTRCRSAP RS  S
Subjt:  LAFN------RRHC-RTLWNGWKVLMFGR-NRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARD---AAFASSIPSPPKNALILTRCRSAPQRSSVS

Query:  GYGYRSSPARSDGTGEERRTERDGGNRA----TSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRR-LNLKRCKSEPGRIAEKL
                  ++    +R    +  + +    TS  E+   + +    S  S+E K     V GS R+ L L RC SEP R+  ++
Subjt:  GYGYRSSPARSDGTGEERRTERDGGNRA----TSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRR-LNLKRCKSEPGRIAEKL

AT5G03110.1 FUNCTIONS IN: molecular_function unknown1.1e-2335.49Show/hide
Query:  MKQTMKPISSPNRTDYFPPPLMNFL--KADVGSRSRS-----GRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQ---------SAARCRW
        M  + +P+SSP R + +PPP M FL  K++ GS SRS     GRSR+SP+FVR+    A   QEPSSPKVTCMGQVR  RS  +         +  RC W
Subjt:  MKQTMKPISSPNRTDYFPPPLMNFL--KADVGSRSRS-----GRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQ---------SAARCRW

Query:  IRTVLAFN----RRHCRTLWNGWKVLMFG-RNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARD----AAFASSIPSPPKNALILTRCRSAPQRSS
        +R    +N    +    T W  W++  F    R  + K S    +S++ +   +S  E +E  EG  +      F S   +PP NAL+LTR RSAP RS 
Subjt:  IRTVLAFN----RRHCRTLWNGWKVLMFG-RNRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARD----AAFASSIPSPPKNALILTRCRSAPQRSS

Query:  VSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEG
         S   +R     +    E ++  R     +   IE  N   E     V+  E +  + +     R+  L R KSEP RI EK+   L   EEG
Subjt:  VSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESENSIGEGISNSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAAACTATGAAACCGATCTCTAGCCCTAATCGGACCGATTACTTCCCGCCGCCATTGATGAACTTTTTGAAGGCCGATGTTGGAAGCCGGAGTAGAAGCGGCAG
GTCGCGTTCCAGCCCTATGTTCGTCAGGAAGAAGAACGTCGTCGCCATTGAAACTCAAGAGCCGTCTTCTCCCAAGGTTACCTGTATGGGCCAAGTCCGAGCCAGACGCT
CTTCCGCCCAAAGCGCCGCCAGATGCCGCTGGATTCGAACCGTATTGGCTTTCAATCGACGCCATTGTCGAACCTTGTGGAACGGGTGGAAGGTGCTGATGTTCGGAAGA
AATCGTGAAAGCAGACGAAAATCATCGATCTCGATCTCTCAATCTCGCGTTGGAAATGAAGCGGAAGATTCGAAGGAGGAAGATGAAGAAAGAGGCGAAGGAGCGAGAGA
TGCGGCGTTCGCGTCCTCGATTCCATCGCCGCCGAAGAACGCTCTCATTCTGACGAGGTGTAGATCTGCGCCGCAACGGTCGTCGGTTTCCGGCTATGGATACCGGAGTT
CGCCGGCGAGAAGCGACGGTACTGGAGAAGAGCGGAGAACAGAGCGCGATGGTGGAAACAGAGCCACGTCCCAAATTGAGTCTGAAAACTCAATCGGAGAAGGAATTTCT
AACTCTGTAAATAGCAAAGAAAGCAAAATCGATGAAGAAAAAGTAGGCGGCTCTTCGCGGCGGTTGAATCTGAAGAGGTGTAAATCGGAACCTGGTAGAATTGCAGAGAA
ACTTTACGGAGAATTGAATCTCCGGGAAGAAGGAAGTTCGTCGGGTATGGTAACGAACGATTCTTGCTTGCCTAACAACAAGCGATTAAACGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCAAACTATGAAACCGATCTCTAGCCCTAATCGGACCGATTACTTCCCGCCGCCATTGATGAACTTTTTGAAGGCCGATGTTGGAAGCCGGAGTAGAAGCGGCAG
GTCGCGTTCCAGCCCTATGTTCGTCAGGAAGAAGAACGTCGTCGCCATTGAAACTCAAGAGCCGTCTTCTCCCAAGGTTACCTGTATGGGCCAAGTCCGAGCCAGACGCT
CTTCCGCCCAAAGCGCCGCCAGATGCCGCTGGATTCGAACCGTATTGGCTTTCAATCGACGCCATTGTCGAACCTTGTGGAACGGGTGGAAGGTGCTGATGTTCGGAAGA
AATCGTGAAAGCAGACGAAAATCATCGATCTCGATCTCTCAATCTCGCGTTGGAAATGAAGCGGAAGATTCGAAGGAGGAAGATGAAGAAAGAGGCGAAGGAGCGAGAGA
TGCGGCGTTCGCGTCCTCGATTCCATCGCCGCCGAAGAACGCTCTCATTCTGACGAGGTGTAGATCTGCGCCGCAACGGTCGTCGGTTTCCGGCTATGGATACCGGAGTT
CGCCGGCGAGAAGCGACGGTACTGGAGAAGAGCGGAGAACAGAGCGCGATGGTGGAAACAGAGCCACGTCCCAAATTGAGTCTGAAAACTCAATCGGAGAAGGAATTTCT
AACTCTGTAAATAGCAAAGAAAGCAAAATCGATGAAGAAAAAGTAGGCGGCTCTTCGCGGCGGTTGAATCTGAAGAGGTGTAAATCGGAACCTGGTAGAATTGCAGAGAA
ACTTTACGGAGAATTGAATCTCCGGGAAGAAGGAAGTTCGTCGGGTATGGTAACGAACGATTCTTGCTTGCCTAACAACAAGCGATTAAACGATTGA
Protein sequenceShow/hide protein sequence
MKQTMKPISSPNRTDYFPPPLMNFLKADVGSRSRSGRSRSSPMFVRKKNVVAIETQEPSSPKVTCMGQVRARRSSAQSAARCRWIRTVLAFNRRHCRTLWNGWKVLMFGR
NRESRRKSSISISQSRVGNEAEDSKEEDEERGEGARDAAFASSIPSPPKNALILTRCRSAPQRSSVSGYGYRSSPARSDGTGEERRTERDGGNRATSQIESENSIGEGIS
NSVNSKESKIDEEKVGGSSRRLNLKRCKSEPGRIAEKLYGELNLREEGSSSGMVTNDSCLPNNKRLND