; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004112 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004112
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold92:148740..153322
RNA-Seq ExpressionMS004112
SyntenyMS004112
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150606.1 uncharacterized protein LOC111018702 [Momordica charantia]7.2e-28497.13Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRS+TADISADFSNSRDNPLFWSNGSPPCDEAR+VNLEFDESSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SAASSKRVSCGGVESTRGRSVSRNS SGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
        LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVE+HRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTT-------QEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIQNCEKDGNESR
        LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTT       QEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYI NCEKDGNESR
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTT-------QEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIQNCEKDGNESR

Query:  VVVTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
         V+TTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLC VSS TCSSYYGSFI
Subjt:  VVVTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

XP_022945507.1 uncharacterized protein LOC111449721 isoform X2 [Cucurbita moschata]8.1e-21978.32Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RSST D+SADFSNSRDNPLFWSNGSPP +EARSVNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNSDSGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG-IQQDIGK
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQ             I T+S SEQYN        SNLS     KSQFSF+ KP E  G IQQDIGK
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG-IQQDIGK

Query:  YIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
        YIQ CEKDGN+SRVV   KQ C+   ND N++K +ES+LFDRL+ R+RIESG +LLC VSS   SSYY S I
Subjt:  YIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

XP_022968380.1 uncharacterized protein LOC111467644 isoform X1 [Cucurbita maxima]8.6e-22178.09Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RSST D+SADFSNSRDNPLFWSNGSPP +EARSVNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNSDSGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ------------------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQD
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQ                    TI T+S SEQYN   +       +KSQFSF+ KP E  G IQQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ------------------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQD

Query:  IGKYIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
        IGKYIQ CEKDGN+SRVV   KQ C+   ND N++K +ES+LFDRL+ R+RIESG +LLC VSS   SSYY S I
Subjt:  IGKYIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

XP_022968381.1 uncharacterized protein LOC111467644 isoform X2 [Cucurbita maxima]1.0e-22179.19Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RSST D+SADFSNSRDNPLFWSNGSPP +EARSVNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNSDSGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ----------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQDIGKYIQNC
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQ            TI T+S SEQYN   +       +KSQFSF+ KP E  G IQQDIGKYIQ C
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ----------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQDIGKYIQNC

Query:  EKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
        EKDGN+SRVV   KQ C+   ND N++K +ES+LFDRL+ R+RIESG +LLC VSS   SSYY S I
Subjt:  EKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

XP_038892273.1 uncharacterized protein LOC120081462 isoform X1 [Benincasa hispida]1.9e-22078.42Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRGGSTS  PSSSSG STSGKDSK   NSPKKAT+RRSRSVSAFSRSS AD+SADFSNSRDNPLFWSNGSPP +EAR+VNLE D SSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SA SSKRVS GGVE++RGRSVSR+SDSGS G GSRK GGRSLSRVGTERR+RSASV+RYPVSS S +NSESEAERDSRY+ K NNRKTPDS+LHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LAR---SNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIH
        L R   S+SD+ +Q KGLR RSS   PFDLSDNCD SVSCSFEDRLSTASSLSEAEE+T++AVCEQM+S+KGDCLQG +S SDIYDIIQYEVRRAVQDIH
Subjt:  LAR---SNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIH

Query:  NDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLT
        NDLL+APQ+SAD  GSS+IDIPPELVNP A+ELV DLRSEY+KKLEQSQERARKLRADLAVEEHR LELSRILREVIPAPKTSMRRKASIERR+MSKRLT
Subjt:  NDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLT

Query:  DDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTT--------QEPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGIQQDIGKY
        DDALAYFDECVSLSTFDGSDFSSLEE PPIHQVSSTT        QEP IGTSS  +QYN        +NLS+ G  K+QFSFT+KPHE  GI+QDIGKY
Subjt:  DDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTT--------QEPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICGIQQDIGKY

Query:  IQNCEKDGNESRVVVTTKQYCN-TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
        IQ   KD NES+VV  + ++C+  ND NLQK  ES+L DR++ RSRIESG LLLC VSS+ CSSYY S I
Subjt:  IQNCEKDGNESRVVVTTKQYCN-TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

TrEMBL top hitse value%identityAlignment
A0A6J1DC05 uncharacterized protein LOC1110187023.5e-28497.13Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRS+TADISADFSNSRDNPLFWSNGSPPCDEAR+VNLEFDESSTRI
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
        SAASSKRVSCGGVESTRGRSVSRNS SGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
        LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVE+HRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTT-------QEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIQNCEKDGNESR
        LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTT       QEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYI NCEKDGNESR
Subjt:  LAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTT-------QEPTIGTSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIQNCEKDGNESR

Query:  VVVTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
         V+TTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLC VSS TCSSYYGSFI
Subjt:  VVVTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

A0A6J1G123 uncharacterized protein LOC111449721 isoform X13.3e-21877.24Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RSST D+SADFSNSRDNPLFWSNGSPP +EARSVNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNSDSGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ------------------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQ                     I T+S SEQYN        SNLS     KSQFSF+ KP E  G
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ------------------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG

Query:  -IQQDIGKYIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
         IQQDIGKYIQ CEKDGN+SRVV   KQ C+   ND N++K +ES+LFDRL+ R+RIESG +LLC VSS   SSYY S I
Subjt:  -IQQDIGKYIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

A0A6J1G148 uncharacterized protein LOC111449721 isoform X23.9e-21978.32Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RSST D+SADFSNSRDNPLFWSNGSPP +EARSVNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
         + SSKRVSCGGVESTRGRSVSRNSDSGS G G+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER++ Y+ KSN+RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ ++ KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIP ELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG-IQQDIGK
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQ             I T+S SEQYN        SNLS     KSQFSF+ KP E  G IQQDIGK
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ----------EPTIGTSSVSEQYN--------SNLSELGDAKSQFSFTRKPHEICG-IQQDIGK

Query:  YIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
        YIQ CEKDGN+SRVV   KQ C+   ND N++K +ES+LFDRL+ R+RIESG +LLC VSS   SSYY S I
Subjt:  YIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

A0A6J1HUP7 uncharacterized protein LOC111467644 isoform X24.9e-22279.19Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RSST D+SADFSNSRDNPLFWSNGSPP +EARSVNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNSDSGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ----------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQDIGKYIQNC
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQ            TI T+S SEQYN   +       +KSQFSF+ KP E  G IQQDIGKYIQ C
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ----------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQDIGKYIQNC

Query:  EKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
        EKDGN+SRVV   KQ C+   ND N++K +ES+LFDRL+ R+RIESG +LLC VSS   SSYY S I
Subjt:  EKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

A0A6J1HX20 uncharacterized protein LOC111467644 isoform X14.2e-22178.09Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI
        MAMAAFKSSSRRG STSA PSSSSGVSTS KDSKQ  NS KKATMRRSRSVSAF RSST D+SADFSNSRDNPLFWSNGSPP +EARSVNLE D+SS R+
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRI

Query:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG
           SSKRVSCGGVESTRGRSVSRNSDSGS GLG+RK G RSLSRVG ERRDRSASVSRY VSSQS VNSESEAER+S Y+ KSN RKTPDSVL GRRE G
Subjt:  SAASSKRVSCGGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVG

Query:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL
          RS+SD+ +Q KGL+TRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+SMKGDCLQGQ+S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LARSNSDSSKQLKGLRTRSSQLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDL

Query:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA
        LNA Q+ ADA+GSSNIDIPPELVNP AVE+VMDLRSEYSKKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ------------------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQD
        LAYFDECVSLSTFDGSDFSS+EE  PPIHQVSSTTQ                    TI T+S SEQYN   +       +KSQFSF+ KP E  G IQQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTTQ------------------EPTIGTSSVSEQYNSNLSELGD---AKSQFSFTRKPHEICG-IQQD

Query:  IGKYIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI
        IGKYIQ CEKDGN+SRVV   KQ C+   ND N++K +ES+LFDRL+ R+RIESG +LLC VSS   SSYY S I
Subjt:  IGKYIQNCEKDGNESRVVVTTKQYCN--TNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50350.1 unknown protein1.8e-0628.06Show/hide
Query:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFS-RSSTADI------SADFSNSRDNPLFWSNGSPPCDEARSVNLEF
        MA +AF S+ +R  +TS A SS SG           S S +++  RR RS+S FS R    DI         F N+     F   G    D+   + +EF
Subjt:  MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFS-RSSTADI------SADFSNSRDNPLFWSNGSPPCDEARSVNLEF

Query:  DESSTRISAASSKRVSCGGVESTRGRSVSRNSDSGSNG-LGSRKVGGRSLSRVG--------------TERRDRSASVSRYPVSS--------QSFVNSE
         ES +  S  SS           RGRS  RNS  G+ G + + +  GRS+SRVG              TE   R  S+SR P S+         S  N+ 
Subjt:  DESSTRISAASSKRVSCGGVESTRGRSVSRNSDSGSNG-LGSRKVGGRSLSRVG--------------TERRDRSASVSRYPVSS--------QSFVNSE

Query:  SEAERDSRYTAKSNNRKTPDSVLHGRREVGLAR------------SNSDSSKQLKGLRTRSSQLSPF--DLSDNCDASVSCSFEDRLSTASSLSEAEEKT
        S     SR   +    +    V+ G RE   +R             NS+S        + S  +  F    S N  +  S + ++R     S S+   K 
Subjt:  SEAERDSRYTAKSNNRKTPDSVLHGRREVGLAR------------SNSDSSKQLKGLRTRSSQLSPF--DLSDNCDASVSCSFEDRLSTASSLSEAEEKT

Query:  IKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADL
              Q  ++  D  +G+ SSS      ++   R ++ ++      P+   +++G+S      + ++      V      Y+ KL++S+ER R+L A++
Subjt:  IKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVELVMDLRSEYSKKLEQSQERARKLRADL

Query:  AVEEHRGLELSRILREVI------PAPKTSMRRKASIER-RKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEE
         +EE RG ELS  L+E++         K    RK S +R R+MS  LTD+A  + DE +  S  + +DFSSLE+
Subjt:  AVEEHRGLELSRILREVI------PAPKTSMRRKASIER-RKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATGGCTGCCTTCAAATCGTCGTCTAGAAGAGGGGGTTCGACTTCGGCAGCACCTTCTTCGAGTAGCGGCGTGTCAACATCGGGTAAAGATAGCAAGCAGGGGAG
TAATTCACCGAAGAAAGCTACTATGCGGAGATCACGAAGCGTCAGTGCCTTTTCCAGATCCAGCACAGCGGATATTTCCGCCGATTTTTCAAATAGTAGAGATAATCCGC
TGTTCTGGAGCAATGGTTCGCCCCCTTGCGACGAAGCTCGTTCTGTTAACCTTGAATTCGATGAAAGTTCCACCAGAATTAGCGCAGCAAGTTCAAAACGTGTGAGTTGT
GGTGGTGTTGAGAGTACCAGGGGACGGTCGGTGTCGAGAAATTCTGATTCTGGAAGTAATGGTTTAGGAAGCAGGAAGGTCGGCGGTCGAAGCTTGTCACGGGTAGGCAC
TGAACGGCGGGACCGCTCTGCGTCTGTGTCTCGATATCCCGTCTCATCGCAATCCTTTGTGAACTCTGAGAGTGAGGCTGAGCGAGATAGTCGTTATACTGCGAAATCCA
ACAATAGAAAGACTCCAGATTCGGTTCTTCACGGTCGAAGAGAGGTTGGTTTAGCTAGAAGTAATTCAGATTCTTCGAAGCAATTGAAAGGCCTGCGAACACGGTCCAGT
CAACTTTCACCTTTTGATTTATCCGATAACTGCGATGCATCAGTGTCTTGTAGTTTTGAAGATAGGCTGTCCACTGCGAGTTCTCTATCAGAAGCCGAAGAGAAAACAAT
AAAAGCTGTTTGTGAGCAAATGAGATCAATGAAGGGGGATTGTTTGCAAGGACAAACCAGTAGTAGTGACATATATGACATTATTCAATATGAAGTTAGACGTGCTGTCC
AAGATATCCATAATGACCTTCTCAATGCTCCACAAAACAGTGCTGATGCTATAGGAAGTTCAAATATTGATATCCCTCCCGAATTGGTGAATCCAGCTGCTGTTGAATTG
GTGATGGACTTGAGAAGCGAGTATAGCAAGAAGCTTGAGCAGTCACAAGAGCGAGCTAGAAAACTTCGGGCAGACTTGGCAGTTGAGGAGCATCGTGGATTAGAGCTCAG
TAGAATTTTGCGGGAAGTAATACCAGCTCCTAAGACCTCTATGAGACGAAAAGCTAGCATTGAAAGAAGAAAGATGTCAAAACGTCTAACTGACGACGCCTTGGCATATT
TTGACGAGTGTGTATCATTATCAACATTTGATGGTTCCGACTTCTCATCACTGGAAGAAGCACCCCCTATCCACCAAGTTTCTTCCACTACCCAGGAACCAACCATTGGA
ACTTCATCCGTGAGTGAGCAATATAATAGTAATCTCAGCGAACTTGGGGATGCCAAATCTCAGTTTTCCTTCACTAGAAAACCCCATGAAATTTGTGGAATTCAACAGGA
CATTGGGAAGTACATTCAGAACTGTGAGAAAGATGGCAACGAATCAAGGGTTGTTGTAACGACGAAGCAATATTGCAATACGAATGATGCAAATTTGCAGAAGCCAACAG
AGAGCGTTTTGTTTGACCGACTTCTTTTGAGAAGCAGAATCGAGTCGGGCGGTCTACTACTGTGCAGTGTTAGCTCTGTAACTTGCTCCTCTTATTATGGTTCCTTCATC
mRNA sequenceShow/hide mRNA sequence
ATGGCGATGGCTGCCTTCAAATCGTCGTCTAGAAGAGGGGGTTCGACTTCGGCAGCACCTTCTTCGAGTAGCGGCGTGTCAACATCGGGTAAAGATAGCAAGCAGGGGAG
TAATTCACCGAAGAAAGCTACTATGCGGAGATCACGAAGCGTCAGTGCCTTTTCCAGATCCAGCACAGCGGATATTTCCGCCGATTTTTCAAATAGTAGAGATAATCCGC
TGTTCTGGAGCAATGGTTCGCCCCCTTGCGACGAAGCTCGTTCTGTTAACCTTGAATTCGATGAAAGTTCCACCAGAATTAGCGCAGCAAGTTCAAAACGTGTGAGTTGT
GGTGGTGTTGAGAGTACCAGGGGACGGTCGGTGTCGAGAAATTCTGATTCTGGAAGTAATGGTTTAGGAAGCAGGAAGGTCGGCGGTCGAAGCTTGTCACGGGTAGGCAC
TGAACGGCGGGACCGCTCTGCGTCTGTGTCTCGATATCCCGTCTCATCGCAATCCTTTGTGAACTCTGAGAGTGAGGCTGAGCGAGATAGTCGTTATACTGCGAAATCCA
ACAATAGAAAGACTCCAGATTCGGTTCTTCACGGTCGAAGAGAGGTTGGTTTAGCTAGAAGTAATTCAGATTCTTCGAAGCAATTGAAAGGCCTGCGAACACGGTCCAGT
CAACTTTCACCTTTTGATTTATCCGATAACTGCGATGCATCAGTGTCTTGTAGTTTTGAAGATAGGCTGTCCACTGCGAGTTCTCTATCAGAAGCCGAAGAGAAAACAAT
AAAAGCTGTTTGTGAGCAAATGAGATCAATGAAGGGGGATTGTTTGCAAGGACAAACCAGTAGTAGTGACATATATGACATTATTCAATATGAAGTTAGACGTGCTGTCC
AAGATATCCATAATGACCTTCTCAATGCTCCACAAAACAGTGCTGATGCTATAGGAAGTTCAAATATTGATATCCCTCCCGAATTGGTGAATCCAGCTGCTGTTGAATTG
GTGATGGACTTGAGAAGCGAGTATAGCAAGAAGCTTGAGCAGTCACAAGAGCGAGCTAGAAAACTTCGGGCAGACTTGGCAGTTGAGGAGCATCGTGGATTAGAGCTCAG
TAGAATTTTGCGGGAAGTAATACCAGCTCCTAAGACCTCTATGAGACGAAAAGCTAGCATTGAAAGAAGAAAGATGTCAAAACGTCTAACTGACGACGCCTTGGCATATT
TTGACGAGTGTGTATCATTATCAACATTTGATGGTTCCGACTTCTCATCACTGGAAGAAGCACCCCCTATCCACCAAGTTTCTTCCACTACCCAGGAACCAACCATTGGA
ACTTCATCCGTGAGTGAGCAATATAATAGTAATCTCAGCGAACTTGGGGATGCCAAATCTCAGTTTTCCTTCACTAGAAAACCCCATGAAATTTGTGGAATTCAACAGGA
CATTGGGAAGTACATTCAGAACTGTGAGAAAGATGGCAACGAATCAAGGGTTGTTGTAACGACGAAGCAATATTGCAATACGAATGATGCAAATTTGCAGAAGCCAACAG
AGAGCGTTTTGTTTGACCGACTTCTTTTGAGAAGCAGAATCGAGTCGGGCGGTCTACTACTGTGCAGTGTTAGCTCTGTAACTTGCTCCTCTTATTATGGTTCCTTCATC
Protein sequenceShow/hide protein sequence
MAMAAFKSSSRRGGSTSAAPSSSSGVSTSGKDSKQGSNSPKKATMRRSRSVSAFSRSSTADISADFSNSRDNPLFWSNGSPPCDEARSVNLEFDESSTRISAASSKRVSC
GGVESTRGRSVSRNSDSGSNGLGSRKVGGRSLSRVGTERRDRSASVSRYPVSSQSFVNSESEAERDSRYTAKSNNRKTPDSVLHGRREVGLARSNSDSSKQLKGLRTRSS
QLSPFDLSDNCDASVSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGQTSSSDIYDIIQYEVRRAVQDIHNDLLNAPQNSADAIGSSNIDIPPELVNPAAVEL
VMDLRSEYSKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKRLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTTQEPTIG
TSSVSEQYNSNLSELGDAKSQFSFTRKPHEICGIQQDIGKYIQNCEKDGNESRVVVTTKQYCNTNDANLQKPTESVLFDRLLLRSRIESGGLLLCSVSSVTCSSYYGSFI