CuGenDBv2

HG10015059 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10015059
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF810)
Genome location: Chr02:23510350..23530705
RNA-Seq Expression: HG10015059
Synteny: HG10015059
Gene Ontology terms: NA
InterPro domains: IPR008528 - Protein unc-13 homologue
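The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'Chrom:start..end' string into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("Chr02:23510350..23530705")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans about 20.4 kb of Chr02, far longer than the 1,623-nt CDS listed under Sequences, which would be consistent with a multi-exon gene model.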


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573790.1 Protein unc-13-like protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 2.3e-234; identity 96.78%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPP PAFTPPPVYTPPAVIAP P+PT
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSLTETNVSRSESFESS ARELT+DDIEDF+DDED+EVNS RMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKD+R
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVE QRAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+SVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ+
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH

KAG7012866.1 hypothetical protein SDJN02_25619 [Cucurbita argyrosperma subsp. argyrosperma]; e value 2.3e-234; identity 96.78%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPP PAFTPPPVYTPPAVIAP P+PT
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSLTETNVSRSESFESS ARELT+DDIEDF+DDED+EVNS RMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKD+R
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVE QRAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+SVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ+
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH

XP_022945388.1 uncharacterized protein LOC111449639 isoform X1 [Cucurbita moschata]; e value 2.3e-234; identity 96.78%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPP PAFTPPPVYTPPAVIAP P+PT
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSLTETNVSRSESFESS ARELT+DDIEDF+DDED+EVNS RMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKD+R
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVE QRAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+SVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ+
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH

XP_023541615.1 uncharacterized protein LOC111801729 isoform X1 [Cucurbita pepo subsp. pepo]; e value 1.1e-233; identity 96.55%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQM+NSGSGDEFFLVTDLDSSGSPPKRAPPP PAFTPPPVYTPPAVIAP P+PT
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSLTETNVSRSESFESS ARELT+DDIEDFE+DED+EVNS RMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKD+R
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVE QRAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+SVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ+
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH

XP_038891920.1 protein unc-13 homolog isoform X1 [Benincasa hispida]; e value 3.5e-235; identity 97.7%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFF+VTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIA P +PT
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSLTETNVSRSESFESSQARELT+DDIEDFEDDEDIEVNS RMSRRNPND ADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVEPQRAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQ+RQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWZ4 Uncharacterized protein; e value 1.5e-231; identity 95.85%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELS+AIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPM T
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSL E NVSRSESFESSQARELT+DDI+DFEDDED+EVNS RMSRRNPND ADLALKLPSFS+GITDDDLRETAYEVLLACAGASGGLIVPS EKKKDK+
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVEP RAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREI+ISLAERPARGDLTGEVCHWADGY LNVRLYEKLL SVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ

A0A1S3BH15 uncharacterized protein LOC103489571; e value 5.7e-231; identity 95.62%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSG GDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPM T
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSL E NVSRSESFESSQARELT+DDI+DFEDDED+EVNS RMSRRNPND ADLALKLPSFS+GITDDDLRETAYEVLLACAGASGGLIVPS EKKKDK+
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVEP RAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREI+ISLA+RPARGDLTGEVCHWADGY LNVRLYEKLL SVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ

A0A6J1CB47 uncharacterized protein LOC111010053; e value 4.2e-226; identity 93.81%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFT-PPPVYTPPAVIAPPPMP
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELSEAIRDYHDLT FPQMNNSGS DEFFL TDLDSSGSPPKRAPPP PAFT PPPVYTPPAVI PPP+ 
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFT-PPPVYTPPAVIAPPPMP

Query:  TPSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDK
         PSL ETNVSRSES ESSQ RELT+DDIEDFEDDED+EVNS RMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDK
Subjt:  TPSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDK

Query:  RSKLMRKLGRSSKSG-IVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILE
        RSKLMRKLGRSSK+G +V EP RAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILE
Subjt:  RSKLMRKLGRSSKSG-IVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILE

Query:  EGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKL
        EGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREI+ISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+SVFDMLDEGKL
Subjt:  EGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKL

Query:  TEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
        TEE+EEILELLKSTWR+LGITETIHYTCF WVLFRQ
Subjt:  TEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ

A0A6J1G0P4 uncharacterized protein LOC111449639 isoform X1; e value 1.1e-234; identity 96.78%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPP PAFTPPPVYTPPAVIAP P+PT
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSLTETNVSRSESFESS ARELT+DDIEDF+DDED+EVNS RMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKD+R
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVE QRAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+SVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ+
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH

A0A6J1HVX9 uncharacterized protein LOC111466644 isoform X1; e value 7.2e-234; identity 96.55%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        MPPGAVTLDDVDLDQ+SVDYVLNCAKKGA+LELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPP PAFTPPPVYTPPAVIAP P+PT
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
        PSLTETNVSRSESFESS ARELT+DDIEDF+DD D+EVNS RMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKR

Query:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
        SKLMRKLGRSSKSGIVVE QRAPGLVGLLE+MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG
Subjt:  SKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEG

Query:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE
        LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+SVFDMLDEGKLTE
Subjt:  LINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTE

Query:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
        EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ+
Subjt:  EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH

SwissProt top hits (e value, % identity, alignment)
Q8RX56 Protein unc-13 homolog; e value 5.6e-175; identity 73.35%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAF--TPPPVYTPPAVIAPPPM
        MPPGAVTLDDVDLDQ+SVDYV+NCAKKG MLEL+EAIRDYHD  G P MN+ G+ DEFFL T  +SSGSPPKRAPPP P    +  P+ T P     P  
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAF--TPPPVYTPPAVIAPPPM

Query:  PTPSLTETNVSRSESFESSQARELTLDDIEDFEDDEDI-EVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKK
        P        + RSESF+S +A+ELT+DDI+DFEDD+D+ EV + R+SRR  NDAADL  +LPSF+TGITDDDLRETA+E+LLACAGASGGLIVPS EKKK
Subjt:  PTPSLTETNVSRSESFESSQARELTLDDIEDFEDDEDI-EVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKK

Query:  DK-RSKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNI
        +K RS+L++KLGR S+S  V + Q + GLV LLE MR QMEISE+MD+RTR+GLLNAL+GKVGKRMD+LLVPLELL C+S+TEFSD+KA+LRWQKRQLN+
Subjt:  DK-RSKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNI

Query:  LEEGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEG
        L EGLIN+PVVGFGESGRKA++L+ LL +IEESESLP S GE+QR ECL+SLRE+AISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+ VFD+L++G
Subjt:  LEEGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEG

Query:  KLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
        KLTEEVEEILELLKSTWRVLGITETIHYTC+ WVLFRQ+
Subjt:  KLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH

Arabidopsis top hits (e value, % identity, alignment)
AT2G20010.1 Protein of unknown function (DUF810); e value 8.9e-43; identity 36.5%
Query:  MEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEGLINHPVVGFGESGRKASELR-ILLSKIEESESLPP
        M ISE +D R R+ LL   SG++G+R++ +++PLELL  +  ++F D++ +  WQ+R L +LE GLI +P V   +S +   +L+ I+ S +E       
Subjt:  MEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEGLINHPVVGFGESGRKASELR-ILLSKIEESESLPP

Query:  STGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
         TGE Q      +LR + +SLA R     +  E CHWADG+ LN+R+Y+ LL S FD+ DE  + EEV+E+LEL+K TW VLGI + IH  CF WVL  +
Subjt:  STGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ

Query:  HF--GDIE---VLDPPNIV----HDALFVSDFSNSVDLFRLKQVMQDEGIDQTL---QNPNVDFYESIQNFPSI
        +   G +E   ++   N++    +DA+  +D   S  L  +  ++ D G  + L      N+D  E+++   S+
Subjt:  HF--GDIE---VLDPPNIV----HDALFVSDFSNSVDLFRLKQVMQDEGIDQTL---QNPNVDFYESIQNFPSI

AT2G20010.2 Protein of unknown function (DUF810); e value 4.1e-48; identity 32.8%
Query:  ITDDDLRETAYEVLLACAGASGG---LIVPSTEKK----------------------KDKRSKLMRKLGRSSKSG--------IVVEPQRAPGLVGLLES
        +++ +LRETAYE+L+A   ++G      +P + K                           SK+ + LG   + G           +P R+   V + E 
Subjt:  ITDDDLRETAYEVLLACAGASGG---LIVPSTEKK----------------------KDKRSKLMRKLGRSSKSG--------IVVEPQRAPGLVGLLES

Query:  MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEGLINHPVVGFGESGRKASELR-ILLSKIEESE
        +RVQM ISE +D R R+ LL   SG++G+R++ +++PLELL  +  ++F D++ +  WQ+R L +LE GLI +P V   +S +   +L+ I+ S +E   
Subjt:  MRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEGLINHPVVGFGESGRKASELR-ILLSKIEESE

Query:  SLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTEEVEEILELLKSTWRVLGITETIHYTCFTWV
             TGE Q      +LR + +SLA R     +  E CHWADG+ LN+R+Y+ LL S FD+ DE  + EEV+E+LEL+K TW VLGI + IH  CF WV
Subjt:  SLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTEEVEEILELLKSTWRVLGITETIHYTCFTWV

Query:  LFRQHF--GDIE---VLDPPNIV----HDALFVSDFSNSVDLFRLKQVMQDEGIDQTL---QNPNVDFYESIQNFPSI
        L  ++   G +E   ++   N++    +DA+  +D   S  L  +  ++ D G  + L      N+D  E+++   S+
Subjt:  LFRQHF--GDIE---VLDPPNIV----HDALFVSDFSNSVDLFRLKQVMQDEGIDQTL---QNPNVDFYESIQNFPSI

AT2G33420.1 Protein of unknown function (DUF810); e value 5.0e-46; identity 42.53%
Query:  IVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEGLINHPVVGFGESGR
        + V+P R    +   E MR QM+++E  D R RK LL  L G+ G+R +T+++PLELL  +  +EF D   +  WQ+RQL +LE GL+ HP +   ++  
Subjt:  IVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEGLINHPVVGFGESGR

Query:  KASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTEEVEEILELLKSTWR
         A  LR     + +SE+ P  T +    + +R+L  + +SL+ R   G+ T +VCHWADGY LN+ LY  LL S+FD+ DE  + +E++E+LEL+K TW 
Subjt:  KASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTEEVEEILELLKSTWR

Query:  VLGITETIHYTCFTWVLFRQH
         LGIT  IH  CFTWVLF Q+
Subjt:  VLGITETIHYTCFTWVLFRQH

AT4G11670.1 Protein of unknown function (DUF810); e value 1.7e-54; identity 33.26%
Query:  PPGAVT-LDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT
        P G+ T L   DLD +S DYVL+C K G ++++S+    Y+  + +P   +S SGD +FLV+  D +GSPP R P       PPPV              
Subjt:  PPGAVT-LDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPT

Query:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGL-----IVPSTEK
              N+ +S +  +  +R +   +     D+   +  +  +    P     + L LP   TG++DDDLRE AYE+++A    S  L       P+  +
Subjt:  PSLTETNVSRSESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGL-----IVPSTEK

Query:  KKDKRSKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLN
        K +K S+LM  L R  K  +  +PQ +              EIS  MD   R+ L+   + + G+++D   + L LL  I K++F + K +++W+ RQ N
Subjt:  KKDKRSKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLN

Query:  ILEEGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDE
        +LEE L   P +   E   +A+ +R  L+ I +S+          RIE L S+R++A  L+  P R  +  E  +W   YHLN+RLYEKLL  VFD LDE
Subjt:  ILEEGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDE

Query:  GKLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ
        G++ E+   +L  +KS W  LGITE +H   + WVLF+Q
Subjt:  GKLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ

AT5G06970.1 Protein of unknown function (DUF810); e value 4.0e-176; identity 73.35%
Query:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAF--TPPPVYTPPAVIAPPPM
        MPPGAVTLDDVDLDQ+SVDYV+NCAKKG MLEL+EAIRDYHD  G P MN+ G+ DEFFL T  +SSGSPPKRAPPP P    +  P+ T P     P  
Subjt:  MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAF--TPPPVYTPPAVIAPPPM

Query:  PTPSLTETNVSRSESFESSQARELTLDDIEDFEDDEDI-EVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKK
        P        + RSESF+S +A+ELT+DDI+DFEDD+D+ EV + R+SRR  NDAADL  +LPSF+TGITDDDLRETA+E+LLACAGASGGLIVPS EKKK
Subjt:  PTPSLTETNVSRSESFESSQARELTLDDIEDFEDDEDI-EVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKK

Query:  DK-RSKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNI
        +K RS+L++KLGR S+S  V + Q + GLV LLE MR QMEISE+MD+RTR+GLLNAL+GKVGKRMD+LLVPLELL C+S+TEFSD+KA+LRWQKRQLN+
Subjt:  DK-RSKLMRKLGRSSKSGIVVEPQRAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNI

Query:  LEEGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEG
        L EGLIN+PVVGFGESGRKA++L+ LL +IEESESLP S GE+QR ECL+SLRE+AISLAERPARGDLTGEVCHWADGYHLNVRLYEKLL+ VFD+L++G
Subjt:  LEEGLINHPVVGFGESGRKASELRILLSKIEESESLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEG

Query:  KLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
        KLTEEVEEILELLKSTWRVLGITETIHYTC+ WVLFRQ+
Subjt:  KLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQH
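The middle line of each alignment block can be tallied to get a rough identity count for that segment. The sketch below assumes the conventional BLAST midline notation (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); the function name `midline_stats` is hypothetical. The example string is the final midline segment of the first GenBank hit above. A single segment will not reproduce the % identity listed for a hit, since that figure presumably covers the whole alignment.

```python
def midline_stats(midline: str):
    """Return (identical, positive, length) for one BLAST midline segment."""
    # Letters mark identical residues.
    identical = sum(ch.isalpha() for ch in midline)
    # '+' marks conservative substitutions; positives include identities.
    positive = identical + midline.count("+")
    return identical, positive, len(midline)


segment = "EVEEILELLKSTWRVLGITETIHYTCFTWVLFRQ+"
identical, positive, length = midline_stats(segment)
print(f"identity in this segment: {identical}/{length} "
      f"({100 * identical / length:.2f}%)")
```

Summing the three counts over all segments of a hit before dividing gives the per-hit figure. Leading indentation must be stripped consistently from the midline before counting, because a space at the start of a segment can be a genuine mismatch.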


Sequences
CDS sequence
ATGCCTCCTGGTGCGGTTACCCTCGACGATGTGGATCTAGACCAAATTAGTGTGGATTATGTTCTTAACTGTGCTAAAAAAGGTGCAATGCTCGAACTATCTGAAGCCAT
TAGAGATTATCATGACCTTACGGGGTTTCCTCAGATGAATAATTCAGGTTCTGGGGATGAATTTTTCTTGGTTACGGATTTAGATTCTTCAGGGTCACCTCCAAAAAGGG
CACCACCACCTGCTCCTGCTTTCACACCTCCTCCTGTCTATACACCTCCAGCAGTAATTGCACCACCACCTATGCCCACACCTTCGCTAACTGAAACAAATGTATCAAGA
TCAGAGTCTTTTGAGTCCTCACAAGCTCGGGAATTGACCTTGGATGACATAGAAGATTTTGAGGATGATGAAGATATTGAGGTTAATAGTGCGAGGATGTCAAGAAGAAA
CCCAAATGATGCAGCTGATCTTGCTCTCAAATTGCCTTCTTTTTCAACAGGAATCACAGATGACGACCTTCGAGAAACAGCATATGAGGTTCTTTTGGCTTGTGCTGGGG
CCTCTGGGGGTCTTATTGTACCATCAACGGAGAAGAAGAAAGACAAAAGGTCTAAGTTGATGAGGAAGCTTGGACGTAGTAGTAAAAGTGGAATTGTTGTTGAACCTCAA
CGTGCACCTGGGTTAGTTGGGTTGTTGGAGTCCATGCGAGTACAGATGGAGATATCTGAGTCCATGGATGTAAGAACACGGAAAGGCCTCCTCAATGCCCTTTCAGGAAA
AGTAGGAAAAAGAATGGACACCCTCTTAGTTCCTCTGGAATTGTTGTCTTGTATCTCAAAAACAGAATTTTCTGATAGAAAAGCATTTTTACGCTGGCAAAAAAGGCAGC
TGAACATATTGGAGGAGGGGCTTATTAATCACCCTGTTGTGGGATTTGGAGAGTCAGGGCGCAAGGCAAGTGAGTTGAGAATTCTATTGTCAAAGATTGAGGAATCTGAG
TCTCTCCCACCTTCCACAGGGGAACTTCAACGAATAGAATGCCTGAGATCGCTTCGAGAGATTGCCATTTCACTCGCTGAGAGGCCAGCTCGGGGTGACTTAACAGGTGA
AGTTTGTCACTGGGCTGACGGTTATCATCTAAATGTCAGGCTCTACGAGAAACTTCTTGTTAGTGTCTTTGATATGTTAGATGAGGGAAAGTTGACTGAGGAGGTAGAAG
AAATCCTCGAACTCTTGAAGTCAACCTGGCGTGTTCTTGGAATCACAGAGACCATCCATTACACCTGCTTTACTTGGGTACTGTTTCGCCAGCACTTTGGTGACATTGAA
GTTCTGGACCCTCCAAATATAGTTCACGACGCTTTATTCGTGAGTGATTTTTCCAATTCTGTTGATCTTTTTCGTTTGAAGCAAGTTATGCAAGACGAAGGCATTGATCA
AACCCTTCAAAATCCAAATGTCGACTTTTATGAATCAATTCAAAACTTTCCATCAATTCCAATGGATCCCTTTGAGTCATTAAAAACTTTGAACAAACCTTACAAGAATT
GTTCGGCTGATGATGAAGTTGAGTTACCAGATTCATCTTTGCCTAAAGGTTGTGGTTTAATTGAGAAAGTTGGTGTTTTTTAG
mRNA sequence
ATGCCTCCTGGTGCGGTTACCCTCGACGATGTGGATCTAGACCAAATTAGTGTGGATTATGTTCTTAACTGTGCTAAAAAAGGTGCAATGCTCGAACTATCTGAAGCCAT
TAGAGATTATCATGACCTTACGGGGTTTCCTCAGATGAATAATTCAGGTTCTGGGGATGAATTTTTCTTGGTTACGGATTTAGATTCTTCAGGGTCACCTCCAAAAAGGG
CACCACCACCTGCTCCTGCTTTCACACCTCCTCCTGTCTATACACCTCCAGCAGTAATTGCACCACCACCTATGCCCACACCTTCGCTAACTGAAACAAATGTATCAAGA
TCAGAGTCTTTTGAGTCCTCACAAGCTCGGGAATTGACCTTGGATGACATAGAAGATTTTGAGGATGATGAAGATATTGAGGTTAATAGTGCGAGGATGTCAAGAAGAAA
CCCAAATGATGCAGCTGATCTTGCTCTCAAATTGCCTTCTTTTTCAACAGGAATCACAGATGACGACCTTCGAGAAACAGCATATGAGGTTCTTTTGGCTTGTGCTGGGG
CCTCTGGGGGTCTTATTGTACCATCAACGGAGAAGAAGAAAGACAAAAGGTCTAAGTTGATGAGGAAGCTTGGACGTAGTAGTAAAAGTGGAATTGTTGTTGAACCTCAA
CGTGCACCTGGGTTAGTTGGGTTGTTGGAGTCCATGCGAGTACAGATGGAGATATCTGAGTCCATGGATGTAAGAACACGGAAAGGCCTCCTCAATGCCCTTTCAGGAAA
AGTAGGAAAAAGAATGGACACCCTCTTAGTTCCTCTGGAATTGTTGTCTTGTATCTCAAAAACAGAATTTTCTGATAGAAAAGCATTTTTACGCTGGCAAAAAAGGCAGC
TGAACATATTGGAGGAGGGGCTTATTAATCACCCTGTTGTGGGATTTGGAGAGTCAGGGCGCAAGGCAAGTGAGTTGAGAATTCTATTGTCAAAGATTGAGGAATCTGAG
TCTCTCCCACCTTCCACAGGGGAACTTCAACGAATAGAATGCCTGAGATCGCTTCGAGAGATTGCCATTTCACTCGCTGAGAGGCCAGCTCGGGGTGACTTAACAGGTGA
AGTTTGTCACTGGGCTGACGGTTATCATCTAAATGTCAGGCTCTACGAGAAACTTCTTGTTAGTGTCTTTGATATGTTAGATGAGGGAAAGTTGACTGAGGAGGTAGAAG
AAATCCTCGAACTCTTGAAGTCAACCTGGCGTGTTCTTGGAATCACAGAGACCATCCATTACACCTGCTTTACTTGGGTACTGTTTCGCCAGCACTTTGGTGACATTGAA
GTTCTGGACCCTCCAAATATAGTTCACGACGCTTTATTCGTGAGTGATTTTTCCAATTCTGTTGATCTTTTTCGTTTGAAGCAAGTTATGCAAGACGAAGGCATTGATCA
AACCCTTCAAAATCCAAATGTCGACTTTTATGAATCAATTCAAAACTTTCCATCAATTCCAATGGATCCCTTTGAGTCATTAAAAACTTTGAACAAACCTTACAAGAATT
GTTCGGCTGATGATGAAGTTGAGTTACCAGATTCATCTTTGCCTAAAGGTTGTGGTTTAATTGAGAAAGTTGGTGTTTTTTAG
Protein sequence
MPPGAVTLDDVDLDQISVDYVLNCAKKGAMLELSEAIRDYHDLTGFPQMNNSGSGDEFFLVTDLDSSGSPPKRAPPPAPAFTPPPVYTPPAVIAPPPMPTPSLTETNVSR
SESFESSQARELTLDDIEDFEDDEDIEVNSARMSRRNPNDAADLALKLPSFSTGITDDDLRETAYEVLLACAGASGGLIVPSTEKKKDKRSKLMRKLGRSSKSGIVVEPQ
RAPGLVGLLESMRVQMEISESMDVRTRKGLLNALSGKVGKRMDTLLVPLELLSCISKTEFSDRKAFLRWQKRQLNILEEGLINHPVVGFGESGRKASELRILLSKIEESE
SLPPSTGELQRIECLRSLREIAISLAERPARGDLTGEVCHWADGYHLNVRLYEKLLVSVFDMLDEGKLTEEVEEILELLKSTWRVLGITETIHYTCFTWVLFRQHFGDIE
VLDPPNIVHDALFVSDFSNSVDLFRLKQVMQDEGIDQTLQNPNVDFYESIQNFPSIPMDPFESLKTLNKPYKNCSADDEVELPDSSLPKGCGLIEKVGVF
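The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly; `translate` is a hypothetical helper, and only the first 21 and last 15 nucleotides of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCCTCCTGGTGCGGTTACC"  # first 21 nt of the CDS above
cds_end = "GTTGGTGTTTTTTAG"          # last 15 nt, including the stop codon
print(translate(cds_start), translate(cds_end))
```

The CDS above is 1,623 nt long, that is 540 codons plus a stop codon, which matches the 540-residue protein sequence.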