; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g37100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g37100
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr9:28568982..28573581
RNA-Seq ExpressionMoc09g37100
SyntenyMoc09g37100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150606.1 uncharacterized protein LOC111018702 [Momordica charantia]9.7e-297100Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
        LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGTPQEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR
        LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGTPQEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGTPQEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR

Query:  VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
Subjt:  VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

XP_022945507.1 uncharacterized protein LOC111449721 isoform X2 [Cucurbita moschata]7.1e-22378.46Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNS SGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ---EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG-IQQDIGK
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQV DGTPQ      I T+S SEQYN        SNLS     KSQFSF+ KP E  G IQQDIGK
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ---EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG-IQQDIGK

Query:  YIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        YI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  YIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

XP_022968380.1 uncharacterized protein LOC111467644 isoform X1 [Cucurbita maxima]1.0e-22478.22Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNS SGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ-----------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQD
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQV DGTPQ             TI T+S SEQYN   +       +KSQFSF+ KP E  G IQQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ-----------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQD

Query:  IGKYIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        IGKYI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  IGKYIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

XP_022968381.1 uncharacterized protein LOC111467644 isoform X2 [Cucurbita maxima]1.2e-22579.33Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNS SGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ---EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQDIGKYIHNC
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQV DGTPQ     TI T+S SEQYN   +       +KSQFSF+ KP E  G IQQDIGKYI  C
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ---EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQDIGKYIHNC

Query:  EKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        EKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  EKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

XP_038892273.1 uncharacterized protein LOC120081462 isoform X1 [Benincasa hispida]2.2e-22479.05Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRGGSTS  PSSSSG STSGKDSK   NSPKKAT+RRSRSVSAFSRS+ AD+SADFSNSRDNPLFWSNGSPP +EAR VNLE D SSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SA SSKRVS GGVE++RGRSVSR+S SGS G GSRK GGRSLSRVGTERR+RSASV+RYPVSS S +NSESEAERDSRY+ K NNRKTPDS+LHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LAR---SNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIH
        L R   S+SD+ +Q KGLR RSS   PFDLSDNCD SVSCSFEDRLSTASSLSEAEE+T++AVCEQM+S+KGDCLQG +S SDIYDIIQYEVRRAVQDIH
Subjt:  LAR---SNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLT
        NDLL+APQ+SAD  GSS+IDIPPELVNP A+ELV DLRSEY+KKLEQSQERARKLRADLAVE+HR LELSRILREVIPAPKTSMRRKASIERR+MSKRLT
Subjt:  NDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLT

Query:  DDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGT-PQEPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGIQQDIGKY
        DDALAYFDECVSLSTFDGSDFSSLEE PPIHQVSSTTQV DGT PQEP IGTSS  +QYN        +NLS+ G  K+QFSFT+KPHE  GI+QDIGKY
Subjt:  DDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGT-PQEPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGIQQDIGKY

Query:  IHNCEKDGNESRVLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        I   +KD NES+V++ K     ND NLQK  ES+L DR++ RSRIESG LLLCGVSS  CSSYY S I
Subjt:  IHNCEKDGNESRVLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

TrEMBL top hitse value%identityAlignment
A0A6J1DC05 uncharacterized protein LOC1110187024.7e-297100Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
        LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGTPQEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR
        LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGTPQEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGTPQEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESR

Query:  VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
Subjt:  VLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

A0A6J1G123 uncharacterized protein LOC111449721 isoform X12.9e-22277.37Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNS SGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ-----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQV DGTPQ              I T+S SEQYN        SNLS     KSQFSF+ KP E  G
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ-----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG

Query:  -IQQDIGKYIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
         IQQDIGKYI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  -IQQDIGKYIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

A0A6J1G148 uncharacterized protein LOC111449721 isoform X23.5e-22378.46Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNS SGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ---EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG-IQQDIGK
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQV DGTPQ      I T+S SEQYN        SNLS     KSQFSF+ KP E  G IQQDIGK
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ---EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG-IQQDIGK

Query:  YIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        YI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  YIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

A0A6J1HUP7 uncharacterized protein LOC111467644 isoform X25.7e-22679.33Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNS SGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ---EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQDIGKYIHNC
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQV DGTPQ     TI T+S SEQYN   +       +KSQFSF+ KP E  G IQQDIGKYI  C
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ---EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQDIGKYIHNC

Query:  EKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        EKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  EKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

A0A6J1HX20 uncharacterized protein LOC111467644 isoform X14.8e-22578.22Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RS+T D+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNS SGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVE+ RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ-----------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQD
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQV DGTPQ             TI T+S SEQYN   +       +KSQFSF+ KP E  G IQQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQVGDGTPQ-----------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQD

Query:  IGKYIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI
        IGKYI  CEKDGN+SRV++ K+ C+   ND N++K +ES+LFDRL+ R+RIESG +LLCGVSS+  SSYY S I
Subjt:  IGKYIHNCEKDGNESRVLTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSSYYGSFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50350.1 unknown protein6.2e-0728.06Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFS-RSNTADI------SADFSNSRDNPLFWSNGSPPCDEARAVNLEF
        MA +AF S+ +R  +TS A SS SG           S S +++  RR RS+S FS R    DI         F N+     F   G    D+   + +EF
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFS-RSNTADI------SADFSNSRDNPLFWSNGSPPCDEARAVNLEF

Query:  DESSTRISAASSKRVSCGGVESTRGRSVSRNSGSGSNG-LGSRKVGGRSLSRVG--------------TERRDRSASVSRYPVSS--------QSFVNSE
         ES +  S  SS           RGRS  RNSG G+ G + + +  GRS+SRVG              TE   R  S+SR P S+         S  N+ 
Subjt:  DESSTRISAASSKRVSCGGVESTRGRSVSRNSGSGSNG-LGSRKVGGRSLSRVG--------------TERRDRSASVSRYPVSS--------QSFVNSE

Query:  SEAERDSRYTAKSNNRKTPDSVLHGRREVGLAR------------SNSDSSKQLKGLRTRSSQLSPF--DLSDNCDASVSCSFEDRLSTASSLSEAEEKT
        S     SR   +    +    V+ G RE   +R             NS+S        + S  +  F    S N  +  S + ++R     S S+   K 
Subjt:  SEAERDSRYTAKSNNRKTPDSVLHGRREVGLAR------------SNSDSSKQLKGLRTRSSQLSPF--DLSDNCDASVSCSFEDRLSTASSLSEAEEKT

Query:  IKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADL
              Q  ++  D  +G+ SSS      ++   R ++ ++      P+   +++G+S      + ++      V      Y+ KL++S+ER R+L A++
Subjt:  IKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADL

Query:  AVEDHRGLELSRILREVI------PAPKTSMRRKASIER-RKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEE
         +E+ RG ELS  L+E++         K    RK S +R R+MS  LTD+A  + DE +  S  + +DFSSLE+
Subjt:  AVEDHRGLELSRILREVI------PAPKTSMRRKASIER-RKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATGGCTGCCTTCAAATCGTCGTCTAGAAGAGGGGGTTCGACTTCGGCAGCACCTTCTTCGAGTAGCGGCGTGTCAACATCGGGTAAAGATAGCAAGCAGGGGAG
TAATTCACCGAAGAAAGCTACTATGCGGAGATCACGAAGCGTCAGTGCCTTTTCCAGATCCAACACAGCGGATATTTCCGCCGATTTTTCGAATAGTAGAGATAATCCGC
TGTTCTGGAGCAATGGTTCGCCCCCTTGCGACGAAGCTCGTGCTGTTAACCTTGAATTCGATGAAAGTTCCACCAGAATTAGCGCAGCAAGTTCAAAACGTGTGAGTTGT
GGTGGTGTTGAGAGTACCAGGGGACGATCGGTGTCGAGAAATTCTGGTTCTGGAAGTAATGGTTTAGGAAGCAGGAAGGTCGGCGGTCGAAGCTTGTCACGGGTAGGCAC
TGAACGGCGGGACCGCTCTGCGTCTGTGTCTCGATATCCCGTCTCATCGCAATCCTTTGTGAACTCTGAGAGTGAGGCTGAGCGAGATAGTCGTTATACTGCGAAATCCA
ACAATAGAAAGACTCCAGATTCGGTTCTTCACGGTCGAAGAGAGGTTGGTTTAGCTAGAAGTAATTCAGATTCTTCGAAGCAATTGAAAGGCCTGCGAACACGGTCCAGT
CAACTTTCACCTTTTGATTTATCCGATAACTGCGATGCATCAGTGTCTTGTAGTTTTGAAGATAGGCTGTCCACTGCGAGTTCTCTATCAGAAGCCGAAGAGAAAACAAT
AAAAGCTGTTTGTGAGCAAATGAGATCAATGAAGGGGGATTGTTTGCAAGGACAAACCAGTAGTAGTGACATATATGACATTATTCAATATGAAGTTAGACGTGCTGTCC
AAGATATCCATAATGACCTTCTCAATGCTCCACAAAACAGTGCTGATGCTATAGGAAGTTCAAATATTGATATCCCTCCCGAATTGGTGAATCCAGCTGCTGTTGAATTG
GTGATGGACTTGAGAAGCGAGTATAGCAAGAAGCTTGAGCAGTCACAAGAGCGAGCTAGAAAACTTCGGGCAGACTTGGCAGTTGAGGATCATCGTGGATTAGAGCTCAG
TAGAATTTTGCGGGAAGTAATACCAGCTCCTAAGACCTCTATGAGACGAAAAGCTAGCATTGAAAGAAGAAAGATGTCAAAACGTCTAACTGACGACGCCTTGGCATATT
TTGACGAGTGTGTATCATTATCAACATTTGATGGTTCCGACTTCTCATCATTGGAAGAAGCACCCCCTATCCACCAAGTTTCTTCCACTACCCAGGTGGGAGATGGAACC
CCTCAGGAACCAACCATTGGAACTTCATCCGTGAGTGAGCAATATAATAGTAATCTCAGCGAACTTGGGGATGCCAAATCTCAGTTTTCCTTCACTAGAAAACCCCATGA
AATTTGTGGAATTCAACAGGACATTGGGAAGTACATTCATAACTGTGAGAAAGATGGCAACGAATCAAGGGTTTTAACGACGAAGCAATATTGCAATACGAATGATGCAA
ATTTGCAGAAGCCAACAGAGAGCGTTTTGTTTGACCGACTTCTTTTGAGAAGCAGAATCGAGTCGGGCGGTCTACTACTGTGCGGTGTTAGCTCTGCAACTTGCTCCTCT
TATTATGGTTCCTTCATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATGGCTGCCTTCAAATCGTCGTCTAGAAGAGGGGGTTCGACTTCGGCAGCACCTTCTTCGAGTAGCGGCGTGTCAACATCGGGTAAAGATAGCAAGCAGGGGAG
TAATTCACCGAAGAAAGCTACTATGCGGAGATCACGAAGCGTCAGTGCCTTTTCCAGATCCAACACAGCGGATATTTCCGCCGATTTTTCGAATAGTAGAGATAATCCGC
TGTTCTGGAGCAATGGTTCGCCCCCTTGCGACGAAGCTCGTGCTGTTAACCTTGAATTCGATGAAAGTTCCACCAGAATTAGCGCAGCAAGTTCAAAACGTGTGAGTTGT
GGTGGTGTTGAGAGTACCAGGGGACGATCGGTGTCGAGAAATTCTGGTTCTGGAAGTAATGGTTTAGGAAGCAGGAAGGTCGGCGGTCGAAGCTTGTCACGGGTAGGCAC
TGAACGGCGGGACCGCTCTGCGTCTGTGTCTCGATATCCCGTCTCATCGCAATCCTTTGTGAACTCTGAGAGTGAGGCTGAGCGAGATAGTCGTTATACTGCGAAATCCA
ACAATAGAAAGACTCCAGATTCGGTTCTTCACGGTCGAAGAGAGGTTGGTTTAGCTAGAAGTAATTCAGATTCTTCGAAGCAATTGAAAGGCCTGCGAACACGGTCCAGT
CAACTTTCACCTTTTGATTTATCCGATAACTGCGATGCATCAGTGTCTTGTAGTTTTGAAGATAGGCTGTCCACTGCGAGTTCTCTATCAGAAGCCGAAGAGAAAACAAT
AAAAGCTGTTTGTGAGCAAATGAGATCAATGAAGGGGGATTGTTTGCAAGGACAAACCAGTAGTAGTGACATATATGACATTATTCAATATGAAGTTAGACGTGCTGTCC
AAGATATCCATAATGACCTTCTCAATGCTCCACAAAACAGTGCTGATGCTATAGGAAGTTCAAATATTGATATCCCTCCCGAATTGGTGAATCCAGCTGCTGTTGAATTG
GTGATGGACTTGAGAAGCGAGTATAGCAAGAAGCTTGAGCAGTCACAAGAGCGAGCTAGAAAACTTCGGGCAGACTTGGCAGTTGAGGATCATCGTGGATTAGAGCTCAG
TAGAATTTTGCGGGAAGTAATACCAGCTCCTAAGACCTCTATGAGACGAAAAGCTAGCATTGAAAGAAGAAAGATGTCAAAACGTCTAACTGACGACGCCTTGGCATATT
TTGACGAGTGTGTATCATTATCAACATTTGATGGTTCCGACTTCTCATCATTGGAAGAAGCACCCCCTATCCACCAAGTTTCTTCCACTACCCAGGTGGGAGATGGAACC
CCTCAGGAACCAACCATTGGAACTTCATCCGTGAGTGAGCAATATAATAGTAATCTCAGCGAACTTGGGGATGCCAAATCTCAGTTTTCCTTCACTAGAAAACCCCATGA
AATTTGTGGAATTCAACAGGACATTGGGAAGTACATTCATAACTGTGAGAAAGATGGCAACGAATCAAGGGTTTTAACGACGAAGCAATATTGCAATACGAATGATGCAA
ATTTGCAGAAGCCAACAGAGAGCGTTTTGTTTGACCGACTTCTTTTGAGAAGCAGAATCGAGTCGGGCGGTCTACTACTGTGCGGTGTTAGCTCTGCAACTTGCTCCTCT
TATTATGGTTCCTTCATCTGA
Protein sequenceShow/hide protein sequence
MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSNTADISADFSNSRDNPLFWSNGSPPCDEARAVNLEFDESSTRISAASSKRVSC
GGVESTRGRSVSRNSGSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVGLARSNSDSSKQLKGLRTRSS
QLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVEL
VMDLRSEYSKKLEQSQERARKLRADLAVEDHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQVGDGT
PQEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIHNCEKDGNESRVLTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCGVSSATCSS
YYGSFI