CuGenDBv2

Sgr026291 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr026291
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC103484772
Genome location: tig00153031:3687773..3689762
RNA-Seq Expression: Sgr026291
Synteny: Sgr026291
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004142008.1 uncharacterized protein LOC101218305 isoform X1 [Cucumis sativus]; e value: 1.5e-263; % identity: 75.75
Query:  MANPGVGTKFVSVNLNKSYGQA----HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGG
        MANPGVGTKFVSVNLNKSYGQ     HHHHSSH NSYGSNR RPG HG GGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSG GP GGG
Subjt:  MANPGVGTKFVSVNLNKSYGQA----HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGG

Query:  VLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPS
        VLGNGQRPTSAGMGWTKPRTNDL +KEG S    D+ DPSLRSVDGVSGGSSVYMPPSARAGM GPVVS S SS VH  VEK+ VLRGEDFPSLQATLPS
Subjt:  VLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPS

Query:  AAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERD
        AA PSQKQ+DG +SKLKH +E                                                E SRKQE  FPGPLPLVSMNPRSDWADDERD
Subjt:  AAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERD

Query:  TSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGN
        TSHGLIDR RDRG PKSEAYWERDFDMPRVS+LPHKP HNFSQRWNLRDDESGKFHSSDIHKVDPYGRDAR  SREGWEGNFRKNNP+PKDGFG SD+ N
Subjt:  TSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGN

Query:  DRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-------------------------
        DRN IA R TS DRETNADN HVSHFREH  KD GRRDTG+GQ GRQ WN  T+SYS QEP+R ++DKYG+EQHN                         
Subjt:  DRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-------------------------

Query:  --------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARR
                RDRRS+AKIEKPYMEDPFMKDFGAS FDGRDPF AGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELARR
Subjt:  --------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARR

Query:  EEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        EEEER+RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIARE+EKQRI LE+    +
Subjt:  EEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR

XP_022133325.1 uncharacterized protein LOC111005936 isoform X1 [Momordica charantia]; e value: 3.2e-274; % identity: 78.18
Query:  MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN
        MANPGVGTKFVSVNLNKSYGQ HHHHSSHPNSYGSNR RPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN
Subjt:  MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN

Query:  GQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGP
        GQRPTSAGMGWTKPRTNDL +KEGLS   ADR DPSLR+VDG SGGSSVYMPPSARAGM GPVV+ S SSQV+ AVEKA VLRGEDFPSLQATLPSAAGP
Subjt:  GQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGP

Query:  SQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSHG
        SQK KDG +SKLK  AE                                                E SRKQE  FPGPLPLVSMNPRSDWADDERDTSHG
Subjt:  SQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSHG

Query:  LIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRND
        LIDRGRDRG PKSEAYWERDFDMPRVSALPHKP+ NFSQRWNLRDDESGKFHS+DIHKVDPYGRDARTPSREGWEGNFR+N PIPKDGFG SDS NDRND
Subjt:  LIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRND

Query:  IAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-----------------------------
        IAAR T+ DRETNA++MHVSHFREH +KDPGRRDTGYGQ+GRQ WN   +SYS QEP+RVIRDKYG+EQHN                             
Subjt:  IAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-----------------------------

Query:  ----RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEE
            R+RRSFAKIEKPYMEDPFMKDFGASGFDGRDPF  G+VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELARREEEE
Subjt:  ----RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEE

Query:  RKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        RK LAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIARE+EKQRI+LE+    +
Subjt:  RKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR

XP_022133326.1 uncharacterized protein LOC111005936 isoform X2 [Momordica charantia]; e value: 2.5e-279; % identity: 82.3
Query:  MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN
        MANPGVGTKFVSVNLNKSYGQ HHHHSSHPNSYGSNR RPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN
Subjt:  MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN

Query:  GQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGP
        GQRPTSAGMGWTKPRTNDL +KEGLS   ADR DPSLR+VDG SGGSSVYMPPSARAGM GPVV+ S SSQV+ AVEKA VLRGEDFPSLQATLPSAAGP
Subjt:  GQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGP

Query:  SQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSHG
        SQK KDG +SKLK  AE                                                E SRKQE  FPGPLPLVSMNPRSDWADDERDTSHG
Subjt:  SQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSHG

Query:  LIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRND
        LIDRGRDRG PKSEAYWERDFDMPRVSALPHKP+ NFSQRWNLRDDESGKFHS+DIHKVDPYGRDARTPSREGWEGNFR+N PIPKDGFG SDS NDRND
Subjt:  LIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRND

Query:  IAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHNRDRRSFAKIEKPYMEDPFMKDFGASGFDG
        IAAR T+ DRETNA++MHVSHFREH +KDPGRRDTGYGQ+GRQ WN   +SYS QEP+RVIRDKYG+EQHNR+RRSFAKIEKPYMEDPFMKDFGASGFDG
Subjt:  IAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHNRDRRSFAKIEKPYMEDPFMKDFGASGFDG

Query:  RDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEA
        RDPF  G+VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELARREEEERK LAREHEERQRRAEEEAREAAWRAEQERLEA
Subjt:  RDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEA

Query:  IQKAEELRIAREDEKQRILLEKREESR
        IQKAEELRIARE+EKQRI+LE+    +
Subjt:  IQKAEELRIAREDEKQRILLEKREESR

XP_031742371.1 uncharacterized protein LOC101218305 isoform X2 [Cucumis sativus]; e value: 1.5e-263; % identity: 75.75
Query:  MANPGVGTKFVSVNLNKSYGQA----HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGG
        MANPGVGTKFVSVNLNKSYGQ     HHHHSSH NSYGSNR RPG HG GGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSG GP GGG
Subjt:  MANPGVGTKFVSVNLNKSYGQA----HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGG

Query:  VLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPS
        VLGNGQRPTSAGMGWTKPRTNDL +KEG S    D+ DPSLRSVDGVSGGSSVYMPPSARAGM GPVVS S SS VH  VEK+ VLRGEDFPSLQATLPS
Subjt:  VLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPS

Query:  AAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERD
        AA PSQKQ+DG +SKLKH +E                                                E SRKQE  FPGPLPLVSMNPRSDWADDERD
Subjt:  AAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERD

Query:  TSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGN
        TSHGLIDR RDRG PKSEAYWERDFDMPRVS+LPHKP HNFSQRWNLRDDESGKFHSSDIHKVDPYGRDAR  SREGWEGNFRKNNP+PKDGFG SD+ N
Subjt:  TSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGN

Query:  DRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-------------------------
        DRN IA R TS DRETNADN HVSHFREH  KD GRRDTG+GQ GRQ WN  T+SYS QEP+R ++DKYG+EQHN                         
Subjt:  DRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-------------------------

Query:  --------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARR
                RDRRS+AKIEKPYMEDPFMKDFGAS FDGRDPF AGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELARR
Subjt:  --------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARR

Query:  EEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        EEEER+RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIARE+EKQRI LE+    +
Subjt:  EEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR

XP_038883483.1 uncharacterized protein LOC120074436 [Benincasa hispida]; e value: 2.9e-267; % identity: 77.61
Query:  MANPGVGTKFVSVNLNKSYGQA-HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLG
        MANPGVGTKFVSVNLNKSYGQA HHHHSSH NSYGSNR RPG HGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGP GGGVLG
Subjt:  MANPGVGTKFVSVNLNKSYGQA-HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLG

Query:  NGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAG
        NGQRPTSAGMGWTKPRTNDL +KEGLS    D+ DPSLRSVDGVSGGSSVYMPPSARAGM GPVVS S SSQV TAVEKA VLRGEDFPSLQATLPSAA 
Subjt:  NGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAG

Query:  PSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSH
        PSQKQ+DG +SKLKH  E                                                ELS KQ+  FPGPLPLVSMNPRSDWADDERDTSH
Subjt:  PSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSH

Query:  GLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRN
        GLIDR RDRG PKSEAYWERDFDMPRVS+LPHK  HNFSQRWNLRDDESGKFHSSDIHK+DPYGRDART SREGWEGNFR+NNPIPKDGFG SDSGNDRN
Subjt:  GLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRN

Query:  DIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN----------------------------
        DIA R TS DRETNADNMHVSHFREH  KD GRRDTG+GQ GRQ WN  T+SYS QEP+R +RDKY +EQHN                            
Subjt:  DIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN----------------------------

Query:  -----RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEE
             RDRRSFAKIEKPYMEDPFMKDFGAS FDGRDPF AGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELARREEE
Subjt:  -----RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEE

Query:  ERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        ER+RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELR+ARE+EKQRILLE+    +
Subjt:  ERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KLC4 Uncharacterized protein; e value: 7.2e-264; % identity: 75.75
Query:  MANPGVGTKFVSVNLNKSYGQA----HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGG
        MANPGVGTKFVSVNLNKSYGQ     HHHHSSH NSYGSNR RPG HG GGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSG GP GGG
Subjt:  MANPGVGTKFVSVNLNKSYGQA----HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGG

Query:  VLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPS
        VLGNGQRPTSAGMGWTKPRTNDL +KEG S    D+ DPSLRSVDGVSGGSSVYMPPSARAGM GPVVS S SS VH  VEK+ VLRGEDFPSLQATLPS
Subjt:  VLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPS

Query:  AAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERD
        AA PSQKQ+DG +SKLKH +E                                                E SRKQE  FPGPLPLVSMNPRSDWADDERD
Subjt:  AAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERD

Query:  TSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGN
        TSHGLIDR RDRG PKSEAYWERDFDMPRVS+LPHKP HNFSQRWNLRDDESGKFHSSDIHKVDPYGRDAR  SREGWEGNFRKNNP+PKDGFG SD+ N
Subjt:  TSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGN

Query:  DRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-------------------------
        DRN IA R TS DRETNADN HVSHFREH  KD GRRDTG+GQ GRQ WN  T+SYS QEP+R ++DKYG+EQHN                         
Subjt:  DRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-------------------------

Query:  --------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARR
                RDRRS+AKIEKPYMEDPFMKDFGAS FDGRDPF AGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELARR
Subjt:  --------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARR

Query:  EEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        EEEER+RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIARE+EKQRI LE+    +
Subjt:  EEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR

A0A1S3B1H0 LOW QUALITY PROTEIN: uncharacterized protein LOC103484772; e value: 1.0e-262; % identity: 75.56
Query:  MANPGVGTKFVSVNLNKSYGQA------HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAG
        MANPGVGTKFVSVNLNKSYGQ       HHHHSSH NSYGSNR RPG HG GGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSG GP G
Subjt:  MANPGVGTKFVSVNLNKSYGQA------HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAG

Query:  GGVLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATL
        GGVLGNGQRPTSAGMGWTKPRTNDL +KEG S    D+ DPSLRSVDGVSGGSSVYMPPSARAGM GPVVS S SSQVH AVEK+ VLRGEDFPSLQATL
Subjt:  GGVLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATL

Query:  PSAAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDE
        PSAA PSQKQ+DG +SKLKHV+E                                                E SRKQE  FPGPLPLVSMNPRSDWADDE
Subjt:  PSAAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDE

Query:  RDTSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWE-GNFRKNNPIPKDGFGCSD
        RDTSHGLIDR RDRG PKSEAYWERDFDMPRVS+LPHKP HNFSQRWNL DDESGKFHSSDIHKVDPYGRD+R  SR+GWE GNFRKNNP+PKDGFG SD
Subjt:  RDTSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWE-GNFRKNNPIPKDGFGCSD

Query:  SGNDRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN----------------------
        +GNDRN IA R TS DRETNADNMHVSHFREH  KD GRRD G+GQ GRQ WN  T+SYS QEP+R ++DKYG+EQH+                      
Subjt:  SGNDRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN----------------------

Query:  -----------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALEL
                   RDRRSFAKIEKPYMEDPFMKDFGAS FDGRDPF AGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALEL
Subjt:  -----------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALEL

Query:  ARREEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        ARREEEER+RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIARE+EKQRI LE+    +
Subjt:  ARREEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR

A0A5D3CNG4 Uncharacterized protein; e value: 1.2e-263; % identity: 75.83
Query:  MANPGVGTKFVSVNLNKSYGQA-----HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGG
        MANPGVGTKFVSVNLNKSYGQ      HHHHSSH NSYGSNR RPG HG GGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSG GP GG
Subjt:  MANPGVGTKFVSVNLNKSYGQA-----HHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGG

Query:  GVLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLP
        GVLGNGQRPTSAGMGWTKPRTNDL +KEG S    D+ DPSLRSVDGVSGGSSVYMPPSARAGM GPVVS S SSQVH AVEK+ VLRGEDFPSLQATLP
Subjt:  GVLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLP

Query:  SAAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDER
        SAA PSQKQ+DG +SKLKHV+E                                                E SRKQE  FPGPLPLVSMNPRSDWADDER
Subjt:  SAAGPSQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDER

Query:  DTSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWE-GNFRKNNPIPKDGFGCSDS
        DTSHGLIDR RDRG PKSEAYWERDFDMPRVS+LPHKP HNFSQRWNLRDDESGKFHSSDIHKVDPYGRD+R  SR+GWE GNFRKNNP+PKDGFG SD+
Subjt:  DTSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWE-GNFRKNNPIPKDGFGCSDS

Query:  GNDRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-----------------------
        GNDRN IA R TS DRETNADNMHVSHFREH  KD GRRD G+GQ GRQ WN  T+SYS QEP+R ++DKYG+EQH+                       
Subjt:  GNDRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-----------------------

Query:  ----------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELA
                  RDRRSFAKIEKPYMEDPFMKDFGAS FDGRDPF AGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELA
Subjt:  ----------RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELA

Query:  RREEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        RREEEER+RLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIARE+EKQRI LE+    +
Subjt:  RREEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR

A0A6J1BUR5 uncharacterized protein LOC111005936 isoform X2; e value: 1.2e-279; % identity: 82.3
Query:  MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN
        MANPGVGTKFVSVNLNKSYGQ HHHHSSHPNSYGSNR RPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN
Subjt:  MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN

Query:  GQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGP
        GQRPTSAGMGWTKPRTNDL +KEGLS   ADR DPSLR+VDG SGGSSVYMPPSARAGM GPVV+ S SSQV+ AVEKA VLRGEDFPSLQATLPSAAGP
Subjt:  GQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGP

Query:  SQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSHG
        SQK KDG +SKLK  AE                                                E SRKQE  FPGPLPLVSMNPRSDWADDERDTSHG
Subjt:  SQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSHG

Query:  LIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRND
        LIDRGRDRG PKSEAYWERDFDMPRVSALPHKP+ NFSQRWNLRDDESGKFHS+DIHKVDPYGRDARTPSREGWEGNFR+N PIPKDGFG SDS NDRND
Subjt:  LIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRND

Query:  IAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHNRDRRSFAKIEKPYMEDPFMKDFGASGFDG
        IAAR T+ DRETNA++MHVSHFREH +KDPGRRDTGYGQ+GRQ WN   +SYS QEP+RVIRDKYG+EQHNR+RRSFAKIEKPYMEDPFMKDFGASGFDG
Subjt:  IAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHNRDRRSFAKIEKPYMEDPFMKDFGASGFDG

Query:  RDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEA
        RDPF  G+VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELARREEEERK LAREHEERQRRAEEEAREAAWRAEQERLEA
Subjt:  RDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEERKRLAREHEERQRRAEEEAREAAWRAEQERLEA

Query:  IQKAEELRIAREDEKQRILLEKREESR
        IQKAEELRIARE+EKQRI+LE+    +
Subjt:  IQKAEELRIAREDEKQRILLEKREESR

A0A6J1BUX9 uncharacterized protein LOC111005936 isoform X1; e value: 1.5e-274; % identity: 78.18
Query:  MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN
        MANPGVGTKFVSVNLNKSYGQ HHHHSSHPNSYGSNR RPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN
Subjt:  MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGN

Query:  GQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGP
        GQRPTSAGMGWTKPRTNDL +KEGLS   ADR DPSLR+VDG SGGSSVYMPPSARAGM GPVV+ S SSQV+ AVEKA VLRGEDFPSLQATLPSAAGP
Subjt:  GQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGP

Query:  SQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSHG
        SQK KDG +SKLK  AE                                                E SRKQE  FPGPLPLVSMNPRSDWADDERDTSHG
Subjt:  SQKQKDGSNSKLKHVAE------------------------------------------------ELSRKQEYFFPGPLPLVSMNPRSDWADDERDTSHG

Query:  LIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRND
        LIDRGRDRG PKSEAYWERDFDMPRVSALPHKP+ NFSQRWNLRDDESGKFHS+DIHKVDPYGRDARTPSREGWEGNFR+N PIPKDGFG SDS NDRND
Subjt:  LIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNFRKNNPIPKDGFGCSDSGNDRND

Query:  IAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-----------------------------
        IAAR T+ DRETNA++MHVSHFREH +KDPGRRDTGYGQ+GRQ WN   +SYS QEP+RVIRDKYG+EQHN                             
Subjt:  IAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHN-----------------------------

Query:  ----RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEE
            R+RRSFAKIEKPYMEDPFMKDFGASGFDGRDPF  G+VGVVKRKKDVIKQTDFHDPVRESFEAELERVQQ+QEQERQRIIEEQERALELARREEEE
Subjt:  ----RDRRSFAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEE

Query:  RKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        RK LAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIARE+EKQRI+LE+    +
Subjt:  RKRLAREHEERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G50370.1 unknown protein; e value: 1.3e-92; % identity: 44.46
Query:  EHERLDSLGSGAGPAGGGVLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEK
        EHER+DS GS +  +GGG+ G+G RP S+G+GW+KP              TA   D    + +GV+ GS+          +   V +A    +    VEK
Subjt:  EHERLDSLGSGAGPAGGGVLGNGQRPTSAGMGWTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEK

Query:  ATVLRGEDFPSLQATLPSAAGPSQKQKDGSNSKLKHVA---------------------------------EELS---------------RKQEYFFPGP
           LRGEDFPSL+A+LPSA+   QKQK+G N K K  A                                  ELS               RK+EY F GP
Subjt:  ATVLRGEDFPSLQATLPSAAGPSQKQKDGSNSKLKHVA---------------------------------EELS---------------RKQEYFFPGP

Query:  LPLVSMNPRSDWADDERDTSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHK-PMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGN
        LPLV + PRSDWADDERDTSHGL DR RD G  K+E +W+R FD+ R   LP K    N   +   R++E  K   + +  V   GR+A           
Subjt:  LPLVSMNPRSDWADDERDTSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHK-PMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGN

Query:  FRKNNPIPKDGFGCSDSGNDRNDIAARHTSFDRE-TNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHNRDRRS
        +R ++P+        + G + N+  AR +S  RE     N  +S  RE+ + + G R+  Y   GRQ WN   DS S +      RD YG E  NRD+RS
Subjt:  FRKNNPIPKDGFGCSDSGNDRNDIAARHTSFDRE-TNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHNRDRRS

Query:  FAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEERKRLAREHE
        F K +KP++EDPFMKDFG SGFD  DPF   ++GV K+KK+ +KQT+FHDPVRESFEAELERVQ++QE+ER+RIIEEQER +ELAR EEEER RLARE +
Subjt:  FAKIEKPYMEDPFMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEERKRLAREHE

Query:  ERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR
        ERQRR EEEAREAA+R EQERLEA ++AEELR ++E+EK R+ +E+    +
Subjt:  ERQRRAEEEAREAAWRAEQERLEAIQKAEELRIAREDEKQRILLEKREESR


Sequences
CDS sequence
ATGGCTAATCCTGGCGTCGGGACCAAGTTTGTGTCCGTGAATCTGAACAAATCGTATGGGCAGGCTCATCATCATCATTCATCTCATCCCAACTCTTATGGATCAAATCG
AATGCGACCTGGTAGTCACGGGGCCGGAGGAGGAATGGTGGTCCTTTCGAGGCCCCGCAGTTCCCAGAAACCTGGGCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTGC
CTTCATTGCGGAAGGAGCATGAGAGACTTGATTCTTTGGGTTCAGGTGCTGGGCCTGCCGGTGGAGGGGTTTTGGGAAATGGGCAGAGGCCAACTTCAGCTGGTATGGGT
TGGACGAAGCCGCGCACAAACGATTTGCAACAGAAAGAGGGGCTTAGTGGTATTACAGCCGATAGAACTGATCCATCTTTGCGAAGTGTTGATGGGGTGAGCGGTGGGAG
CAGTGTGTACATGCCTCCTTCTGCTCGTGCTGGCATGATAGGACCGGTTGTGTCTGCTTCTGTTTCCTCTCAGGTGCATACTGCAGTTGAAAAAGCCACAGTTTTGAGAG
GTGAGGATTTTCCTTCTTTGCAGGCAACTTTACCATCTGCTGCTGGGCCTTCCCAGAAACAGAAAGATGGTTCGAATTCTAAATTGAAGCATGTGGCTGAAGAGTTATCT
CGAAAGCAGGAATATTTTTTCCCGGGTCCTTTACCTCTCGTCTCAATGAATCCAAGATCAGACTGGGCTGATGATGAACGTGACACAAGCCATGGTTTGATTGACAGGGG
AAGGGATCGAGGCCGCCCAAAGAGTGAGGCTTATTGGGAGAGAGACTTCGATATGCCTCGGGTTAGTGCTCTTCCTCACAAGCCCATGCATAATTTTTCTCAGAGATGGA
ATCTGCGGGATGATGAATCTGGGAAGTTTCATTCCAGTGACATTCATAAAGTGGACCCTTATGGTCGGGATGCAAGAACACCTAGTAGAGAAGGCTGGGAAGGAAATTTC
CGGAAAAACAATCCTATCCCAAAAGATGGTTTTGGTTGTTCAGACAGTGGAAATGATAGAAATGATATTGCAGCAAGGCATACTAGCTTTGATCGAGAAACAAATGCTGA
TAACATGCATGTTTCACATTTTCGAGAACATGGTTATAAAGATCCTGGGAGGAGAGATACTGGATATGGACAGATAGGGCGGCAAGCCTGGAATGGTACAACAGACTCTT
ACAGCTACCAGGAACCAGAACGGGTTATAAGAGACAAGTATGGTAATGAGCAACACAACAGGGATAGACGTTCTTTTGCAAAGATTGAGAAACCTTATATGGAAGATCCT
TTTATGAAAGATTTTGGAGCCTCTGGTTTTGATGGACGAGATCCTTTTAATGCTGGTCTTGTTGGGGTGGTTAAAAGGAAGAAGGATGTGATTAAGCAGACTGATTTTCA
TGACCCTGTTAGGGAATCTTTTGAGGCAGAGCTTGAGAGAGTTCAACAGCTCCAAGAACAGGAACGGCAACGAATTATTGAGGAGCAAGAAAGAGCTCTGGAACTAGCCA
GGAGAGAAGAGGAAGAAAGAAAGAGGCTTGCAAGGGAACATGAAGAAAGGCAGAGAAGAGCTGAAGAAGAAGCCAGAGAAGCAGCATGGAGAGCTGAACAAGAACGACTG
GAGGCTATCCAAAAGGCTGAAGAACTTCGGATAGCTAGAGAGGACGAAAAACAGAGGATTCTTTTGGAGAAGAGAGAAGAAAGCAGGCTGCTAAGCTAA
mRNA sequence
ATGGCTAATCCTGGCGTCGGGACCAAGTTTGTGTCCGTGAATCTGAACAAATCGTATGGGCAGGCTCATCATCATCATTCATCTCATCCCAACTCTTATGGATCAAATCG
AATGCGACCTGGTAGTCACGGGGCCGGAGGAGGAATGGTGGTCCTTTCGAGGCCCCGCAGTTCCCAGAAACCTGGGCCGAAGCTTTCTGTTCCACCCCCCTTGAATCTGC
CTTCATTGCGGAAGGAGCATGAGAGACTTGATTCTTTGGGTTCAGGTGCTGGGCCTGCCGGTGGAGGGGTTTTGGGAAATGGGCAGAGGCCAACTTCAGCTGGTATGGGT
TGGACGAAGCCGCGCACAAACGATTTGCAACAGAAAGAGGGGCTTAGTGGTATTACAGCCGATAGAACTGATCCATCTTTGCGAAGTGTTGATGGGGTGAGCGGTGGGAG
CAGTGTGTACATGCCTCCTTCTGCTCGTGCTGGCATGATAGGACCGGTTGTGTCTGCTTCTGTTTCCTCTCAGGTGCATACTGCAGTTGAAAAAGCCACAGTTTTGAGAG
GTGAGGATTTTCCTTCTTTGCAGGCAACTTTACCATCTGCTGCTGGGCCTTCCCAGAAACAGAAAGATGGTTCGAATTCTAAATTGAAGCATGTGGCTGAAGAGTTATCT
CGAAAGCAGGAATATTTTTTCCCGGGTCCTTTACCTCTCGTCTCAATGAATCCAAGATCAGACTGGGCTGATGATGAACGTGACACAAGCCATGGTTTGATTGACAGGGG
AAGGGATCGAGGCCGCCCAAAGAGTGAGGCTTATTGGGAGAGAGACTTCGATATGCCTCGGGTTAGTGCTCTTCCTCACAAGCCCATGCATAATTTTTCTCAGAGATGGA
ATCTGCGGGATGATGAATCTGGGAAGTTTCATTCCAGTGACATTCATAAAGTGGACCCTTATGGTCGGGATGCAAGAACACCTAGTAGAGAAGGCTGGGAAGGAAATTTC
CGGAAAAACAATCCTATCCCAAAAGATGGTTTTGGTTGTTCAGACAGTGGAAATGATAGAAATGATATTGCAGCAAGGCATACTAGCTTTGATCGAGAAACAAATGCTGA
TAACATGCATGTTTCACATTTTCGAGAACATGGTTATAAAGATCCTGGGAGGAGAGATACTGGATATGGACAGATAGGGCGGCAAGCCTGGAATGGTACAACAGACTCTT
ACAGCTACCAGGAACCAGAACGGGTTATAAGAGACAAGTATGGTAATGAGCAACACAACAGGGATAGACGTTCTTTTGCAAAGATTGAGAAACCTTATATGGAAGATCCT
TTTATGAAAGATTTTGGAGCCTCTGGTTTTGATGGACGAGATCCTTTTAATGCTGGTCTTGTTGGGGTGGTTAAAAGGAAGAAGGATGTGATTAAGCAGACTGATTTTCA
TGACCCTGTTAGGGAATCTTTTGAGGCAGAGCTTGAGAGAGTTCAACAGCTCCAAGAACAGGAACGGCAACGAATTATTGAGGAGCAAGAAAGAGCTCTGGAACTAGCCA
GGAGAGAAGAGGAAGAAAGAAAGAGGCTTGCAAGGGAACATGAAGAAAGGCAGAGAAGAGCTGAAGAAGAAGCCAGAGAAGCAGCATGGAGAGCTGAACAAGAACGACTG
GAGGCTATCCAAAAGGCTGAAGAACTTCGGATAGCTAGAGAGGACGAAAAACAGAGGATTCTTTTGGAGAAGAGAGAAGAAAGCAGGCTGCTAAGCTAA
Protein sequence
MANPGVGTKFVSVNLNKSYGQAHHHHSSHPNSYGSNRMRPGSHGAGGGMVVLSRPRSSQKPGPKLSVPPPLNLPSLRKEHERLDSLGSGAGPAGGGVLGNGQRPTSAGMG
WTKPRTNDLQQKEGLSGITADRTDPSLRSVDGVSGGSSVYMPPSARAGMIGPVVSASVSSQVHTAVEKATVLRGEDFPSLQATLPSAAGPSQKQKDGSNSKLKHVAEELS
RKQEYFFPGPLPLVSMNPRSDWADDERDTSHGLIDRGRDRGRPKSEAYWERDFDMPRVSALPHKPMHNFSQRWNLRDDESGKFHSSDIHKVDPYGRDARTPSREGWEGNF
RKNNPIPKDGFGCSDSGNDRNDIAARHTSFDRETNADNMHVSHFREHGYKDPGRRDTGYGQIGRQAWNGTTDSYSYQEPERVIRDKYGNEQHNRDRRSFAKIEKPYMEDP
FMKDFGASGFDGRDPFNAGLVGVVKRKKDVIKQTDFHDPVRESFEAELERVQQLQEQERQRIIEEQERALELARREEEERKRLAREHEERQRRAEEEAREAAWRAEQERL
EAIQKAEELRIAREDEKQRILLEKREESRLLS