CuGenDBv2

HG10015577 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10015577
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Genome location: Chr02:27838780..27843364
RNA-Seq Expression: HG10015577
Synteny: HG10015577
Gene Ontology terms: NA
InterPro domains: NA
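
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, the following splits such a string into its parts and computes the span length. The function names `parse_location` and `span_length` are hypothetical helpers, and the 1-based inclusive coordinate convention is an assumption; the page does not state it.

```python
import re

def parse_location(loc):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (not stated on the page).
    return end - start + 1

chrom, start, end = parse_location("Chr02:27838780..27843364")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 4585 bp on Chr02.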


Homology
GenBank top hits
XP_004135379.1 uncharacterized protein LOC101207070 [Cucumis sativus]; e value: 1.8e-234; % identity: 82.98
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRG STS+T SSS G STSGKDSKQSGNSPKK TIRRSRSVSAFSRSSTADVS DFSNSRDNPLFWSNGS   EE+R VNLESDGSSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGL
        S G  KRVSS GVENTRGRSVSRSSDSGSI SG+RK GGRSLSRVGTERRERSAS+TRYPVSSQS NSESEAERDSRY+TKFNNRKTPDSVLH RRE GL
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGL

Query:  VRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHN
        VRTRSSSS+ALQ+ KGLRDRS+H S FD SDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQM S+ GD LQGHSS  DIYDIIQYEVRRAVQDIH+
Subjt:  VRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHN

Query:  DLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD
        DLLNAPQ S+DATG+  IDIPPELV        DLR+EY KKLEQS ERA+KLR DLAVEE+R LELSRILREVIP+PKTSMRRKASIERRRMSKRLTDD
Subjt:  DLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD

Query:  ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQ
        ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT        QEPAIGTS+EI+Q NL HTSY++SNLSKLGEGKAQFSFTKKPHESYG K DIGKYIQ
Subjt:  ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQ

Query:  K-DENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
        K D  +SRVVSVKHC+ +ND +LK   E +L DRVV R+RIESGSLLLCGV+SA  SSYYASII
Subjt:  K-DENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

XP_008446704.1 PREDICTED: uncharacterized protein LOC103489346 [Cucumis melo]; e value: 3.4e-241; % identity: 84.07
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRGSSTS+T SSS G S SG+DSKQSG+SPKK TIRRSRSVSAFSRSSTADVS DFSNSRDNPLFWSNGS P EE+R VNLESDGSSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTR-YPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG
        + G  KRVSS GVENTRGRSVSRSSDSGSIGSG+RK GGRSLSRVGTERRERSAS+TR YPVSSQS NSESEAERDSRY+TKFNNRKTPDS LH RRE G
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTR-YPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG

Query:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH
        LVRTRSSSS+AL++ KGLRDRS+H    D SDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQM S+ GD LQGHSS  DIYDIIQYEVRRAVQDIH
Subjt:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTD
        +D+LNAPQ SADATG+S IDIPPELVNPGAIEL DLRSEY KKLEQS ERARKLRADLAVEEHR LELSRILREVIPAPKTSMRRKASIERRRMSKRLTD
Subjt:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTD

Query:  DALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI
        DALAYFDECVSLSTFDGSDFSSLEETPPIHQ SSTT        QEPAIGTS+E++Q NL  TSY+SSNLSKLGEGKAQFSFTKKPHESYG K DIGKYI
Subjt:  DALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI

Query:  QKDE-NKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
        QKD+  +S+VVS KHC+ +NDTNLK   E +L DRVVF++RIESGSLLLCGV+S ACSSYYASII
Subjt:  QKDE-NKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

XP_022150606.1 uncharacterized protein LOC111018702 [Momordica charantia]; e value: 6.4e-224; % identity: 79.89
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRG STSA PSSSSGVSTSGKDSKQ  NSPKK T+RRSRSVSAFSRS+TAD+SADFSNSRDNPLFWSNGSPP +E+R VNLE D SSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQS-LNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG
        SA SSKRVS  GVE+TRGRSVSR+S SGS G G+RK GGRSLSRVGTERR+RSAS++RYPVSSQS +NSESEAERDSRYT K NNRKTPDSVLHGRREVG
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQS-LNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG

Query:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH
        L R+ S SS  L   KGLR RSS LSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQG +S+SDIYDIIQYEVRRAVQDIH
Subjt:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELA-DLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLT
        NDLLNAPQ SADA GSS IDIPPELVNP A+EL  DLRSEY+KKLEQSQERARKLRADLAVE+HR LELSRILREVIPAPKTSMRRKASIERR+MSKRLT
Subjt:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELA-DLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLT

Query:  DDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT-------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI
        DDALAYFDECVSLSTFDGSDFSSLEE PPIHQVSSTT       QEP IGTSS  +Q N        SNLS+LG+ K+QFSFT+KPHE  GI+QDIGKYI
Subjt:  DDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT-------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI

Query:  ---QKDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
           +KD N+SRV++ K     ND NL+KP ES+LFDR++ RSRIESG LLLCGVSSA CSSYY S I
Subjt:  ---QKDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

XP_038892273.1 uncharacterized protein LOC120081462 isoform X1 [Benincasa hispida]; e value: 6.4e-264; % identity: 90.23
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRG STS TPSSSSG STSGKDSK SGNSPKK TIRRSRSVSAFSRSS ADVSADFSNSRDNPLFWSNGSPP EE+RTVNLESDGSSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGL
        SAGSSKRVSS GVEN+RGRSVSRSSDSGS+GSG+RKTGGRSLSRVGTERRERSAS+TRYPVSS SLNSESEAERDSRY+TKFNNRKTPDS+LHGRREVGL
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGL

Query:  VRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHN
        VRTRSSSSDALQ+ KGLRDRSSH  PFDLSDNCDVSVSCSFEDRLSTASSLSEAEE+T+RAVCEQMKS+KGDCLQGHSS SDIYDIIQYEVRRAVQDIHN
Subjt:  VRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHN

Query:  DLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD
        DLL+APQ SAD TGSS IDIPPELVNPGAIEL DLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD
Subjt:  DLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD

Query:  ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQ
        ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT        QEPAIGTSSEIDQ NL  TSYRS+NLSK G+GKAQFSFTKKPHESYGIKQDIGKYIQ
Subjt:  ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQ

Query:  KDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
        KD+N+S+VVS+KHCD MNDTNL+K +ESLL DRVVFRSRIESGSLLLCGVSS  CSSYYASII
Subjt:  KDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

XP_038892274.1 uncharacterized protein LOC120081462 isoform X2 [Benincasa hispida]; e value: 2.3e-237; % identity: 84.19
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRG STS TPSSSSG STSGKDSK SGNSPKK TIRRSRSVSAFSRSS ADVSADFSNSRDNPLFWSNGSPP EE+RTVNLESDGSSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGL
        SAGSSKRVSS GVEN+RGRSVSRSSDSGS+GSG+RKTGGRSLSRVGTERRERSAS+TRYPVSS SLNSESEAERDSRY+TKFNNRKTPDS+LHGRREVGL
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGL

Query:  VRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHN
        VRTRSSSSDALQ+ KGLRDRSSH  PFDLSDNCDVSVSCSFEDRLSTASSLSEAEE+T+RAVCEQMK                                 
Subjt:  VRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHN

Query:  DLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD
            APQ SAD TGSS IDIPPELVNPGAIEL DLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD
Subjt:  DLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD

Query:  ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQ
        ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT        QEPAIGTSSEIDQ NL  TSYRS+NLSK G+GKAQFSFTKKPHESYGIKQDIGKYIQ
Subjt:  ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQ

Query:  KDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
        KD+N+S+VVS+KHCD MNDTNL+K +ESLL DRVVFRSRIESGSLLLCGVSS  CSSYYASII
Subjt:  KDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

TrEMBL top hits
A0A0A0KWJ7 Uncharacterized protein; e value: 8.7e-235; % identity: 82.98
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRG STS+T SSS G STSGKDSKQSGNSPKK TIRRSRSVSAFSRSSTADVS DFSNSRDNPLFWSNGS   EE+R VNLESDGSSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGL
        S G  KRVSS GVENTRGRSVSRSSDSGSI SG+RK GGRSLSRVGTERRERSAS+TRYPVSSQS NSESEAERDSRY+TKFNNRKTPDSVLH RRE GL
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGL

Query:  VRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHN
        VRTRSSSS+ALQ+ KGLRDRS+H S FD SDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQM S+ GD LQGHSS  DIYDIIQYEVRRAVQDIH+
Subjt:  VRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHN

Query:  DLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD
        DLLNAPQ S+DATG+  IDIPPELV        DLR+EY KKLEQS ERA+KLR DLAVEE+R LELSRILREVIP+PKTSMRRKASIERRRMSKRLTDD
Subjt:  DLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDD

Query:  ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQ
        ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT        QEPAIGTS+EI+Q NL HTSY++SNLSKLGEGKAQFSFTKKPHESYG K DIGKYIQ
Subjt:  ALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQ

Query:  K-DENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
        K D  +SRVVSVKHC+ +ND +LK   E +L DRVV R+RIESGSLLLCGV+SA  SSYYASII
Subjt:  K-DENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

A0A1S3BGD4 uncharacterized protein LOC103489346; e value: 1.6e-241; % identity: 84.07
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRGSSTS+T SSS G S SG+DSKQSG+SPKK TIRRSRSVSAFSRSSTADVS DFSNSRDNPLFWSNGS P EE+R VNLESDGSSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTR-YPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG
        + G  KRVSS GVENTRGRSVSRSSDSGSIGSG+RK GGRSLSRVGTERRERSAS+TR YPVSSQS NSESEAERDSRY+TKFNNRKTPDS LH RRE G
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTR-YPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG

Query:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH
        LVRTRSSSS+AL++ KGLRDRS+H    D SDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQM S+ GD LQGHSS  DIYDIIQYEVRRAVQDIH
Subjt:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTD
        +D+LNAPQ SADATG+S IDIPPELVNPGAIEL DLRSEY KKLEQS ERARKLRADLAVEEHR LELSRILREVIPAPKTSMRRKASIERRRMSKRLTD
Subjt:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTD

Query:  DALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI
        DALAYFDECVSLSTFDGSDFSSLEETPPIHQ SSTT        QEPAIGTS+E++Q NL  TSY+SSNLSKLGEGKAQFSFTKKPHESYG K DIGKYI
Subjt:  DALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI

Query:  QKDE-NKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
        QKD+  +S+VVS KHC+ +NDTNLK   E +L DRVVF++RIESGSLLLCGV+S ACSSYYASII
Subjt:  QKDE-NKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

A0A5A7SZ95 Uncharacterized protein; e value: 1.6e-241; % identity: 84.07
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRGSSTS+T SSS G S SG+DSKQSG+SPKK TIRRSRSVSAFSRSSTADVS DFSNSRDNPLFWSNGS P EE+R VNLESDGSSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTR-YPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG
        + G  KRVSS GVENTRGRSVSRSSDSGSIGSG+RK GGRSLSRVGTERRERSAS+TR YPVSSQS NSESEAERDSRY+TKFNNRKTPDS LH RRE G
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTR-YPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG

Query:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH
        LVRTRSSSS+AL++ KGLRDRS+H    D SDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQM S+ GD LQGHSS  DIYDIIQYEVRRAVQDIH
Subjt:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTD
        +D+LNAPQ SADATG+S IDIPPELVNPGAIEL DLRSEY KKLEQS ERARKLRADLAVEEHR LELSRILREVIPAPKTSMRRKASIERRRMSKRLTD
Subjt:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTD

Query:  DALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI
        DALAYFDECVSLSTFDGSDFSSLEETPPIHQ SSTT        QEPAIGTS+E++Q NL  TSY+SSNLSKLGEGKAQFSFTKKPHESYG K DIGKYI
Subjt:  DALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT--------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI

Query:  QKDE-NKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
        QKD+  +S+VVS KHC+ +NDTNLK   E +L DRVVF++RIESGSLLLCGV+S ACSSYYASII
Subjt:  QKDE-NKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

A0A6J1DC05 uncharacterized protein LOC111018702; e value: 3.1e-224; % identity: 79.89
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRG STSA PSSSSGVSTSGKDSKQ  NSPKK T+RRSRSVSAFSRS+TAD+SADFSNSRDNPLFWSNGSPP +E+R VNLE D SSTRI
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQS-LNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG
        SA SSKRVS  GVE+TRGRSVSR+S SGS G G+RK GGRSLSRVGTERR+RSAS++RYPVSSQS +NSESEAERDSRYT K NNRKTPDSVLHGRREVG
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQS-LNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG

Query:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH
        L R+ S SS  L   KGLR RSS LSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQG +S+SDIYDIIQYEVRRAVQDIH
Subjt:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELA-DLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLT
        NDLLNAPQ SADA GSS IDIPPELVNP A+EL  DLRSEY+KKLEQSQERARKLRADLAVE+HR LELSRILREVIPAPKTSMRRKASIERR+MSKRLT
Subjt:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELA-DLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLT

Query:  DDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT-------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI
        DDALAYFDECVSLSTFDGSDFSSLEE PPIHQVSSTT       QEP IGTSS  +Q N        SNLS+LG+ K+QFSFT+KPHE  GI+QDIGKYI
Subjt:  DDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTT-------QEPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYI

Query:  ---QKDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
           +KD N+SRV++ K     ND NL+KP ES+LFDR++ RSRIESG LLLCGVSSA CSSYY S I
Subjt:  ---QKDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

A0A6J1G148 uncharacterized protein LOC111449721 isoform X2; e value: 5.7e-218; % identity: 79.79
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI
        MAM+AFKSSSRRGSSTSATPSSSSGVSTS KDSKQ+ NS KK T+RRSRSVSAF RSST DVSADFSNSRDNPLFWSNGSPP EE+R+VNLE D SS R+
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRI

Query:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQS-LNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG
         +GSSKRVS  GVE+TRGRSVSR+SDSGS+GSGNRKTG RSLSRVG ERR+RSAS++RY VSSQS +NSESEAER++ Y+TK N+RKTPDSVL GRRE G
Subjt:  SAGSSKRVSSVGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQS-LNSESEAERDSRYTTKFNNRKTPDSVLHGRREVG

Query:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH
         VR   SSSDALQ+ KGL+ RSS LSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQG SSASDIYDIIQYEVRRAVQDIH
Subjt:  LVRTRSSSSDALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELA-DLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLT
        NDLLNA Q  ADA GSS IDIP ELVNPGA+E+  DLRSEY+KKLE SQ+RARKLRADLAVEE R LELSRILREVIPAPKTSMRRKASIERRRMSKRLT
Subjt:  NDLLNAPQCSADATGSSIIDIPPELVNPGAIELA-DLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLT

Query:  DDALAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQ----------EPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYG-IKQD
        DDALAYFDECVSLSTFDGSDFSS+EET PPIHQVSSTTQ            AI T+SE +Q NL  TSYRSSNLS     K+QFSF+ KP E+YG I+QD
Subjt:  DDALAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQ----------EPAIGTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYG-IKQD

Query:  IGKYIQ---KDENKSRVVSV-KHCD-TMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII
        IGKYIQ   KD NKSRVVS+ K CD  MND N++K  ESLLFDR+VFR+RIESGS+LLCGVSS+A SSYYASII
Subjt:  IGKYIQ---KDENKSRVVSV-KHCD-TMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSYYASII

SwissProt top hits
No hits found
Arabidopsis top hits
AT5G50350.1 unknown protein; e value: 1.2e-05; % identity: 26.24
Query:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFS-RSSTADV------SADFSNSRDNPLFWSNGSPPSEESRTVNLES
        MA SAF S+ +R  +TS   SS SG          S  S ++   RR RS+S FS R    D+         F N+     F   G    ++      ES
Subjt:  MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFS-RSSTADV------SADFSNSRDNPLFWSNGSPPSEESRTVNLES

Query:  DGSSTRISAGSSKRVSSVGVENTRGRSVSRSSDSGSIGS-GNRKTGGRSLSRVG--------------TERRERSASMTRYPVSS--------QSLNSES
            + IS+G             RGRS  R+S  G+ G   N +  GRS+SRVG              TE   R  S++R P S+         S+++ S
Subjt:  DGSSTRISAGSSKRVSSVGVENTRGRSVSRSSDSGSIGS-GNRKTGGRSLSRVG--------------TERRERSASMTRYPVSS--------QSLNSES

Query:  EAERD-SRYTTKFNNRKTPDSVLHG---------RREVGLVRTRSSSS----DALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEE
           R  SR   +    +    V+ G         RR + +VR R  +S    D +Q     RD  S +S    S N     S + ++R     S S+   
Subjt:  EAERD-SRYTTKFNNRKTPDSVLHG---------RREVGLVRTRSSSS----DALQKWKGLRDRSSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEE

Query:  KTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRAD
        K       Q  ++  D  +G  S+S      ++   R ++ ++      P+   ++ G+S      + ++     ++     Y  KL++S+ER R+L A+
Subjt:  KTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDLLNAPQCSADATGSSIIDIPPELVNPGAIELADLRSEYTKKLEQSQERARKLRAD

Query:  LAVEEHRELELSRILREVI------PAPKTSMRRKASIER-RRMSKRLTDDALAYFDECVSLSTFDGSDFSSLEETPP--------IHQVSSTTQEPAIG
        + +EE R  ELS  L+E++         K    RK S +R RRMS  LTD+A  + DE +  S  + +DFSSLE+           I   SS +      
Subjt:  LAVEEHRELELSRILREVI------PAPKTSMRRKASIER-RRMSKRLTDDALAYFDECVSLSTFDGSDFSSLEETPP--------IHQVSSTTQEPAIG

Query:  TSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPH----------------------ESYGIKQDIGKYIQKDENKSRVVSVKHCDTMNDTNLKKPRE
        TS E+D   L    + + ++S      A     K PH                       S G     G +   D     V+  K         LK+P  
Subjt:  TSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPH----------------------ESYGIKQDIGKYIQKDENKSRVVSVKHCDTMNDTNLKKPRE

Query:  S-LLFDRVVFRSRIESGSLLLCGVS
        S +L +    R RI SGSL+LC  S
Subjt:  S-LLFDRVVFRSRIESGSLLLCGVS


Sequences
CDS sequence
ATGGCGATGTCTGCCTTCAAATCCTCATCTAGAAGAGGAAGTTCGACTTCAGCGACACCTTCTTCGAGCAGTGGCGTGTCAACATCGGGTAAAGATAGCAAGCAGAGTGG
TAATTCTCCCAAGAAAGGTACTATCCGTAGATCCCGAAGCGTGAGTGCTTTTTCCAGATCTAGCACAGCGGATGTTTCGGCAGATTTTTCGAATAGTAGAGATAATCCGC
TCTTCTGGAGCAATGGTTCGCCTCCATCGGAAGAATCTCGTACTGTTAACCTTGAATCTGATGGAAGTTCCACTAGAATTAGTGCAGGAAGTTCGAAACGTGTGAGTTCG
GTTGGTGTTGAGAATACGAGGGGACGATCAGTGTCGAGAAGCTCCGATTCTGGAAGTATAGGTTCAGGAAACAGGAAAACCGGTGGCCGAAGCTTGTCGAGGGTAGGCAC
TGAACGGCGGGAACGCTCGGCGTCTATGACTCGATATCCCGTCTCATCGCAGTCTTTGAACTCCGAGAGTGAGGCTGAGCGAGATAGTCGTTATACTACGAAATTCAATA
ATAGAAAGACTCCAGATTCTGTGCTTCATGGTCGGAGAGAGGTTGGCTTAGTTAGAACTAGAAGTAGTTCTTCAGATGCTTTGCAGAAATGGAAAGGCCTGCGAGATCGG
TCGAGTCATCTTTCACCATTTGATTTATCAGATAACTGTGATGTATCAGTGTCTTGTAGTTTTGAAGATAGGCTGTCCACCGCGAGCTCTTTATCTGAAGCTGAAGAGAA
AACTATAAGAGCTGTTTGTGAACAAATGAAGTCAATGAAGGGGGATTGTTTGCAAGGACATTCCAGTGCTAGTGACATATATGACATTATTCAATATGAAGTAAGACGTG
CTGTCCAAGATATCCACAACGACCTTCTCAATGCTCCACAATGCAGTGCTGATGCCACAGGAAGTTCTATCATTGATATCCCTCCTGAATTGGTGAATCCAGGTGCAATT
GAATTGGCGGATTTGAGAAGTGAGTATACGAAGAAGCTTGAGCAGTCACAAGAGCGGGCTAGAAAACTTCGGGCAGACTTGGCAGTGGAAGAGCATCGTGAATTAGAGCT
CAGTAGAATTTTGCGGGAAGTAATACCAGCTCCTAAGACTTCTATGAGACGAAAAGCCAGCATTGAAAGAAGAAGGATGTCAAAGCGTTTAACTGATGATGCCTTGGCAT
ATTTTGACGAGTGTGTTTCGTTATCAACATTTGATGGTTCTGATTTCTCATCACTGGAAGAAACACCCCCAATCCACCAAGTTTCTTCCACTACCCAGGAACCAGCAATT
GGAACTTCATCTGAGATTGACCAATGTAATTTAGAGCATACATCTTACAGAAGCAGTAACCTGAGCAAACTTGGAGAAGGCAAAGCTCAGTTTTCCTTCACAAAGAAACC
ACATGAAAGCTATGGAATCAAACAGGACATTGGGAAGTACATTCAGAAAGATGAAAACAAATCACGAGTTGTAAGCGTGAAGCATTGCGATACGATGAATGATACAAATT
TGAAAAAACCAAGAGAAAGCCTATTGTTTGATCGAGTTGTTTTCAGAAGCAGAATAGAATCGGGTAGTTTACTGCTCTGCGGTGTAAGTTCTGCAGCTTGTTCCTCTTAT
TATGCTTCTATCATCTGA
mRNA sequence
ATGGCGATGTCTGCCTTCAAATCCTCATCTAGAAGAGGAAGTTCGACTTCAGCGACACCTTCTTCGAGCAGTGGCGTGTCAACATCGGGTAAAGATAGCAAGCAGAGTGG
TAATTCTCCCAAGAAAGGTACTATCCGTAGATCCCGAAGCGTGAGTGCTTTTTCCAGATCTAGCACAGCGGATGTTTCGGCAGATTTTTCGAATAGTAGAGATAATCCGC
TCTTCTGGAGCAATGGTTCGCCTCCATCGGAAGAATCTCGTACTGTTAACCTTGAATCTGATGGAAGTTCCACTAGAATTAGTGCAGGAAGTTCGAAACGTGTGAGTTCG
GTTGGTGTTGAGAATACGAGGGGACGATCAGTGTCGAGAAGCTCCGATTCTGGAAGTATAGGTTCAGGAAACAGGAAAACCGGTGGCCGAAGCTTGTCGAGGGTAGGCAC
TGAACGGCGGGAACGCTCGGCGTCTATGACTCGATATCCCGTCTCATCGCAGTCTTTGAACTCCGAGAGTGAGGCTGAGCGAGATAGTCGTTATACTACGAAATTCAATA
ATAGAAAGACTCCAGATTCTGTGCTTCATGGTCGGAGAGAGGTTGGCTTAGTTAGAACTAGAAGTAGTTCTTCAGATGCTTTGCAGAAATGGAAAGGCCTGCGAGATCGG
TCGAGTCATCTTTCACCATTTGATTTATCAGATAACTGTGATGTATCAGTGTCTTGTAGTTTTGAAGATAGGCTGTCCACCGCGAGCTCTTTATCTGAAGCTGAAGAGAA
AACTATAAGAGCTGTTTGTGAACAAATGAAGTCAATGAAGGGGGATTGTTTGCAAGGACATTCCAGTGCTAGTGACATATATGACATTATTCAATATGAAGTAAGACGTG
CTGTCCAAGATATCCACAACGACCTTCTCAATGCTCCACAATGCAGTGCTGATGCCACAGGAAGTTCTATCATTGATATCCCTCCTGAATTGGTGAATCCAGGTGCAATT
GAATTGGCGGATTTGAGAAGTGAGTATACGAAGAAGCTTGAGCAGTCACAAGAGCGGGCTAGAAAACTTCGGGCAGACTTGGCAGTGGAAGAGCATCGTGAATTAGAGCT
CAGTAGAATTTTGCGGGAAGTAATACCAGCTCCTAAGACTTCTATGAGACGAAAAGCCAGCATTGAAAGAAGAAGGATGTCAAAGCGTTTAACTGATGATGCCTTGGCAT
ATTTTGACGAGTGTGTTTCGTTATCAACATTTGATGGTTCTGATTTCTCATCACTGGAAGAAACACCCCCAATCCACCAAGTTTCTTCCACTACCCAGGAACCAGCAATT
GGAACTTCATCTGAGATTGACCAATGTAATTTAGAGCATACATCTTACAGAAGCAGTAACCTGAGCAAACTTGGAGAAGGCAAAGCTCAGTTTTCCTTCACAAAGAAACC
ACATGAAAGCTATGGAATCAAACAGGACATTGGGAAGTACATTCAGAAAGATGAAAACAAATCACGAGTTGTAAGCGTGAAGCATTGCGATACGATGAATGATACAAATT
TGAAAAAACCAAGAGAAAGCCTATTGTTTGATCGAGTTGTTTTCAGAAGCAGAATAGAATCGGGTAGTTTACTGCTCTGCGGTGTAAGTTCTGCAGCTTGTTCCTCTTAT
TATGCTTCTATCATCTGA
Protein sequence
MAMSAFKSSSRRGSSTSATPSSSSGVSTSGKDSKQSGNSPKKGTIRRSRSVSAFSRSSTADVSADFSNSRDNPLFWSNGSPPSEESRTVNLESDGSSTRISAGSSKRVSS
VGVENTRGRSVSRSSDSGSIGSGNRKTGGRSLSRVGTERRERSASMTRYPVSSQSLNSESEAERDSRYTTKFNNRKTPDSVLHGRREVGLVRTRSSSSDALQKWKGLRDR
SSHLSPFDLSDNCDVSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSMKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDLLNAPQCSADATGSSIIDIPPELVNPGAI
ELADLRSEYTKKLEQSQERARKLRADLAVEEHRELELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQEPAI
GTSSEIDQCNLEHTSYRSSNLSKLGEGKAQFSFTKKPHESYGIKQDIGKYIQKDENKSRVVSVKHCDTMNDTNLKKPRESLLFDRVVFRSRIESGSLLLCGVSSAACSSY
YASII
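
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code; `translate` and `CODON_TABLE` are hypothetical names introduced here for illustration and are not part of CuGenDB.

```python
# Standard genetic code, with codons ordered by base in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight and last six codons of the CDS listed above.
print(translate("ATGGCGATGTCTGCCTTCAAATCC"))  # MAMSAFKS
print(translate("TATGCTTCTATCATCTGA"))        # YASII
```

Both fragments agree with the start and end of the listed protein sequence. Passing the full CDS with its line breaks intact should work the same way, since whitespace is stripped before translation.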