; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1463 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1463
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUnknown protein
Genome locationMC09:20573122..20578584
RNA-Seq ExpressionMC09g1463
SyntenyMC09g1463
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150606.1 uncharacterized protein LOC111018702 [Momordica charantia]0.098.74Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
        LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQ-------EPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR
        LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQ       EPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQ-------EPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR

Query:  VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
Subjt:  VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

XP_022945507.1 uncharacterized protein LOC111449721 isoform X2 [Cucurbita moschata]5.37e-27577.41Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNS SGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGI-QQDIGK
        LAYFDECVSLSTFDGSDFSS+EE PP IHQVSSTTQ             I T+S SEQYN        SNLS     KSQFSF+ KP E  GI QQDIGK
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGI-QQDIGK

Query:  YIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        YI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  YIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

XP_022968380.1 uncharacterized protein LOC111467644 isoform X1 [Cucurbita maxima]1.87e-27777.18Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNS SGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ------------------EPTIGTSSVSEQYNSNLSELGDAK---SQFSFTRKPHEICGI-QQD
        LAYFDECVSLSTFDGSDFSS+EE PP IHQVSSTTQ                    TI T+S SEQYN   +    +K   SQFSF+ KP E  GI QQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ------------------EPTIGTSSVSEQYNSNLSELGDAK---SQFSFTRKPHEICGI-QQD

Query:  IGKYIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        IGKYI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  IGKYIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

XP_022968381.1 uncharacterized protein LOC111467644 isoform X2 [Cucurbita maxima]8.48e-27978.27Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNS SGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ----------EPTIGTSSVSEQYNSNLSELGDAK---SQFSFTRKPHEICGI-QQDIGKYIHNC
        LAYFDECVSLSTFDGSDFSS+EE PP IHQVSSTTQ            TI T+S SEQYN   +    +K   SQFSF+ KP E  GI QQDIGKYI  C
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ----------EPTIGTSSVSEQYNSNLSELGDAK---SQFSFTRKPHEICGI-QQDIGKYIHNC

Query:  EKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        EKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  EKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

XP_038892273.1 uncharacterized protein LOC120081462 isoform X1 [Benincasa hispida]1.02e-27677.99Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRGGSTS  PSSSSG STSGKDSK   NSPKKAT+RRSRSVSAFSRS+ AD+SADFSNSRDNPLFWSNGSPP +EAR VNLE D SSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SA SSKRVS GGVE++RGRSVSR+S SGS G GSRK GGRSLSRVGTERR+RSASV+RYPVSS S +NSESEAERDSRY+ K NNRKTPDS+LHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LAR---SNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIH
        L R   S+SD+ +Q KGLR RSS   PFDLSDNCD SVSCSFEDRLSTASSLSEAEE+T++AVCEQM+S+KGDCLQG +S SDIYDIIQYEVRRAVQDIH
Subjt:  LAR---SNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLT
        NDLL+APQ+SAD  GSS+IDIPPELVNP A+ELV DLRSEY+KKLEQSQERARKLRADLAVE+HR LELSRILREVIPAPKTSMRRKASIERR+MSKRLT
Subjt:  NDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLT

Query:  DDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQ--------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGIQQDIGKY
        DDALAYFDECVSLSTFDGSDFSSLEE PPIHQVSSTTQ        EP IGTSS  +QYN        +NLS+ G  K+QFSFT+KPHE  GI+QDIGKY
Subjt:  DDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQ--------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGIQQDIGKY

Query:  IHNCEKDGNESRVLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        I   +KD NES+V++ K     ND NLQK  ES+L DR++ RSRIESG LLLCGVSS  CSSYY S I
Subjt:  IHNCEKDGNESRVLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

TrEMBL top hitse value%identityAlignment
A0A6J1DC05 uncharacterized protein LOC1110187020.098.74Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
        LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQ-------EPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR
        LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQ       EPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQ-------EPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR

Query:  VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
Subjt:  VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

A0A6J1G123 uncharacterized protein LOC111449721 isoform X15.73e-27476.34Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNS SGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ------------------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG
        LAYFDECVSLSTFDGSDFSS+EE PP IHQVSSTTQ                     I T+S SEQYN        SNLS     KSQFSF+ KP E  G
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ------------------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG

Query:  I-QQDIGKYIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        I QQDIGKYI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  I-QQDIGKYIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

A0A6J1G148 uncharacterized protein LOC111449721 isoform X22.60e-27577.41Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNS SGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGI-QQDIGK
        LAYFDECVSLSTFDGSDFSS+EE PP IHQVSSTTQ             I T+S SEQYN        SNLS     KSQFSF+ KP E  GI QQDIGK
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGI-QQDIGK

Query:  YIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        YI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  YIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

A0A6J1HUP7 uncharacterized protein LOC111467644 isoform X24.11e-27978.27Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNS SGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ----------EPTIGTSSVSEQYNSNLSELGDAK---SQFSFTRKPHEICGI-QQDIGKYIHNC
        LAYFDECVSLSTFDGSDFSS+EE PP IHQVSSTTQ            TI T+S SEQYN   +    +K   SQFSF+ KP E  GI QQDIGKYI  C
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ----------EPTIGTSSVSEQYNSNLSELGDAK---SQFSFTRKPHEICGI-QQDIGKYIHNC

Query:  EKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        EKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  EKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

A0A6J1HX20 uncharacterized protein LOC111467644 isoform X19.06e-27877.18Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNS SGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ------------------EPTIGTSSVSEQYNSNLSELGDAK---SQFSFTRKPHEICGI-QQD
        LAYFDECVSLSTFDGSDFSS+EE PP IHQVSSTTQ                    TI T+S SEQYN   +    +K   SQFSF+ KP E  GI QQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPP-IHQVSSTTQ------------------EPTIGTSSVSEQYNSNLSELGDAK---SQFSFTRKPHEICGI-QQD

Query:  IGKYIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        IGKYI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  IGKYIHNCEKDGNESRVLTTKQYCNT--NDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50350.1 unknown protein6.1e-0728.06Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFS-RSNTADI------SADFSNSRDNPLFWSNGSPPCDEARAVNLEF
        MA +AF S+ +R  +TS A SS SG           S S +++  RR RS+S FS R    DI         F N+     F   G    D+   + +EF
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFS-RSNTADI------SADFSNSRDNPLFWSNGSPPCDEARAVNLEF

Query:  DESSTRISAASSKRVSCGGVESTRGRSVSRNSGSGSNG-LGSRKVGGRSLSRVG--------------TERRDRSASVSRYPVSS--------QSFVNSE
         ES +  S  SS           RGRS  RNSG G+ G + + +  GRS+SRVG              TE   R  S+SR P S+         S  N+ 
Subjt:  DESSTRISAASSKRVSCGGVESTRGRSVSRNSGSGSNG-LGSRKVGGRSLSRVG--------------TERRDRSASVSRYPVSS--------QSFVNSE

Query:  SEAERDSRYTAKSNNRKTPDSVLHGRREVGLAR------------SNSDSSKQLKGLRTRSSQLSPF--DLSDNCDASVSCSFEDRLSTASSLSEAEEKT
        S     SR   +    +    V+ G RE   +R             NS+S        + S  +  F    S N  +  S + ++R     S S+   K 
Subjt:  SEAERDSRYTAKSNNRKTPDSVLHGRREVGLAR------------SNSDSSKQLKGLRTRSSQLSPF--DLSDNCDASVSCSFEDRLSTASSLSEAEEKT

Query:  IKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADL
              Q  ++  D  +G+ SSS      ++   R ++ ++      P+   +++G+S      + ++      V      Y+ KL++S+ER R+L A++
Subjt:  IKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADL

Query:  AVEDHRGLELSRILREVI------PAPKTSMRRKASIER-RKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEE
         +E+ RG ELS  L+E++         K    RK S +R R+MS  LTD+A  + DE +  S  + +DFSSLE+
Subjt:  AVEDHRGLELSRILREVI------PAPKTSMRRKASIER-RKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATGGCTGCCTTCAAATCGTCGTCTAGAAGAGGGGGTTCGACTTCGGCAGCACCTTCTTCGAGTAGCGGCGTGTCAACATCGGGTAAAGATAGCAAGCAGGGGAG
TAATTCACCGAAGAAAGCTACTATGCGGAGATCACGAAGCGTCAGTGCCTTTTCCAGATCCAACACAGCGGATATTTCCGCCGATTTTTCGAATAGTAGAGATAATCCGC
TGTTCTGGAGCAATGGTTCGCCCCCTTGCGACGAAGCTCGTGCTGTTAACCTTGAATTCGATGAAAGTTCCACCAGAATTAGCGCAGCAAGTTCAAAACGTGTGAGTTGT
GGTGGTGTTGAGAGTACCAGGGGACGATCGGTGTCGAGAAATTCTGGTTCTGGAAGTAATGGTTTAGGAAGCAGGAAGGTCGGCGGTCGAAGCTTGTCACGGGTAGGCAC
TGAACGGCGGGACCGCTCTGCGTCTGTGTCTCGATATCCCGTCTCATCGCAATCCTTTGTGAACTCTGAGAGTGAGGCTGAGCGAGATAGTCGTTATACTGCGAAATCCA
ACAATAGAAAGACTCCAGATTCGGTTCTTCACGGTCGAAGAGAGGTTGGTTTAGCTAGAAGTAATTCAGATTCTTCGAAGCAATTGAAAGGCCTGCGAACACGGTCCAGT
CAACTTTCACCTTTTGATTTATCCGATAACTGCGATGCATCAGTGTCTTGTAGTTTTGAAGATAGGCTGTCCACTGCGAGTTCTCTATCAGAAGCCGAAGAGAAAACAAT
AAAAGCTGTTTGTGAGCAAATGAGATCAATGAAGGGGGATTGTTTGCAAGGACAAACCAGTAGTAGTGACATATATGACATTATTCAATATGAAGTTAGACGTGCTGTCC
AAGATATCCATAATGACCTTCTCAATGCTCCACAAAACAGTGCTGATGCTATAGGAAGTTCAAATATTGATATCCCTCCCGAATTGGTGAATCCAGCTGCTGTTGAATTG
GTGATGGACTTGAGAAGCGAGTATAGCAAGAAGCTTGAGCAGTCACAAGAGCGAGCTAGAAAACTTCGGGCAGACTTGGCAGTTGAGGATCATCGTGGATTAGAGCTCAG
TAGAATTTTGCGGGAAGTAATACCAGCTCCTAAGACCTCTATGAGACGAAAAGCTAGCATTGAAAGAAGAAAGATGTCAAAACGTCTAACTGACGACGCCTTGGCATATT
TTGACGAGTGTGTATCATTATCAACATTTGATGGTTCCGACTTCTCATCATTGGAAGAAGCACCCCCTATCCACCAAGTTTCTTCCACTACCCAGGAACCAACCATTGGA
ACTTCATCCGTGAGTGAGCAATATAATAGTAATCTCAGCGAACTTGGGGATGCCAAATCTCAGTTTTCCTTCACTAGAAAACCCCATGAAATTTGTGGAATTCAACAGGA
CATTGGGAAGTACATTCATAACTGTGAGAAAGATGGCAACGAATCAAGGGTTTTAACGACGAAGCAATATTGCAATACGAATGATGCAAATTTGCAGAAGCCAACAGAGA
GCGTTTTGTTTGACCGACTTCTTTTGAGAAGCAGAATCGAGTCGGGCGGTCTACTACTGTGCGGTGTTAGCTCTGCAACTTGCTCCTCTTATTATGGTTCCTTCATCTGA
mRNA sequenceShow/hide mRNA sequence
GTTTAACATTCCGTCTGAAAAATACGTTTGATCTTTGTAGGACGAGAACGAAAGGCATAAAGCACGTGCCGTCTCAATGGTCTGCGTTTAACACGGTTCATTTTTCCGAT
CAACGTGGCGGTGGAATCAAGAAACTTGGCGGAAGCACAGCCGTCGCCGCCGTCGCATTTCTCTCTCTCGGCTGTTCTTCCGCATAAATTATTGCATCTTCTTCCTCTCT
ATTTTCTCTTTGCAATCAATCATATTCCGCGTTCGCAATCGTCGCTTTAACTCCTATATTTTACTCTTATGAATTCTTCATCGAAGCCTCACCACTGCGATTATTCTCGC
CTACCTTATTTTTATGCATTCTGAAGTCTTCTCGATTCTCCATTTCCGATTCCTTTGCTCTTCTCTTCCCCTATTTGTGTTGGTTTCATTGTTATCTGATTGTTAATTAG
CAATGGCGATGGCTGCCTTCAAATCGTCGTCTAGAAGAGGGGGTTCGACTTCGGCAGCACCTTCTTCGAGTAGCGGCGTGTCAACATCGGGTAAAGATAGCAAGCAGGGG
AGTAATTCACCGAAGAAAGCTACTATGCGGAGATCACGAAGCGTCAGTGCCTTTTCCAGATCCAACACAGCGGATATTTCCGCCGATTTTTCGAATAGTAGAGATAATCC
GCTGTTCTGGAGCAATGGTTCGCCCCCTTGCGACGAAGCTCGTGCTGTTAACCTTGAATTCGATGAAAGTTCCACCAGAATTAGCGCAGCAAGTTCAAAACGTGTGAGTT
GTGGTGGTGTTGAGAGTACCAGGGGACGATCGGTGTCGAGAAATTCTGGTTCTGGAAGTAATGGTTTAGGAAGCAGGAAGGTCGGCGGTCGAAGCTTGTCACGGGTAGGC
ACTGAACGGCGGGACCGCTCTGCGTCTGTGTCTCGATATCCCGTCTCATCGCAATCCTTTGTGAACTCTGAGAGTGAGGCTGAGCGAGATAGTCGTTATACTGCGAAATC
CAACAATAGAAAGACTCCAGATTCGGTTCTTCACGGTCGAAGAGAGGTTGGTTTAGCTAGAAGTAATTCAGATTCTTCGAAGCAATTGAAAGGCCTGCGAACACGGTCCA
GTCAACTTTCACCTTTTGATTTATCCGATAACTGCGATGCATCAGTGTCTTGTAGTTTTGAAGATAGGCTGTCCACTGCGAGTTCTCTATCAGAAGCCGAAGAGAAAACA
ATAAAAGCTGTTTGTGAGCAAATGAGATCAATGAAGGGGGATTGTTTGCAAGGACAAACCAGTAGTAGTGACATATATGACATTATTCAATATGAAGTTAGACGTGCTGT
CCAAGATATCCATAATGACCTTCTCAATGCTCCACAAAACAGTGCTGATGCTATAGGAAGTTCAAATATTGATATCCCTCCCGAATTGGTGAATCCAGCTGCTGTTGAAT
TGGTGATGGACTTGAGAAGCGAGTATAGCAAGAAGCTTGAGCAGTCACAAGAGCGAGCTAGAAAACTTCGGGCAGACTTGGCAGTTGAGGATCATCGTGGATTAGAGCTC
AGTAGAATTTTGCGGGAAGTAATACCAGCTCCTAAGACCTCTATGAGACGAAAAGCTAGCATTGAAAGAAGAAAGATGTCAAAACGTCTAACTGACGACGCCTTGGCATA
TTTTGACGAGTGTGTATCATTATCAACATTTGATGGTTCCGACTTCTCATCATTGGAAGAAGCACCCCCTATCCACCAAGTTTCTTCCACTACCCAGGAACCAACCATTG
GAACTTCATCCGTGAGTGAGCAATATAATAGTAATCTCAGCGAACTTGGGGATGCCAAATCTCAGTTTTCCTTCACTAGAAAACCCCATGAAATTTGTGGAATTCAACAG
GACATTGGGAAGTACATTCATAACTGTGAGAAAGATGGCAACGAATCAAGGGTTTTAACGACGAAGCAATATTGCAATACGAATGATGCAAATTTGCAGAAGCCAACAGA
GAGCGTTTTGTTTGACCGACTTCTTTTGAGAAGCAGAATCGAGTCGGGCGGTCTACTACTGTGCGGTGTTAGCTCTGCAACTTGCTCCTCTTATTATGGTTCCTTCATCT
GAATGTCTAAGACATGTTTTGGTTTAATTGTTATAAAAAACCATAGATTTCAGAAATATTGAATTGGCTTTCTTAGGATCATGAATTCCAACAGTTCCTAGTTTGATTTG
TGGGTACGATCATGATCAAGATGGATTGCCAAGTGGAAAGTTGTCCACATGAAAATTTAGGGATGCGAAAAATTGAATCAACTCGAACCGGGTTGTTCTGATTCAATTTT
TCATGTCGGTCTAGAACCGACTATCGTGATTTTGACATATCAGATGTAAAACTACTGCATTGTTGTCATGATATATATGTATTATATATATATATATATATATATATATG
CACATAAATGTCATAAATTTACTCCATTTGTTTTTAGGAAAATGTATATGTCATGTCATTAATTTTTATAAGTTTTATTAGCTCAGCAACGAG
Protein sequenceShow/hide protein sequence
MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRISAASSKRVSC
GGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVGLARSNSDSSKQLKGLRTRSS
QLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVEL
VMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQEPTIG
TSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESRVLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI