; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018096 (gene) of Snake gourd v1 genome

Gene IDTan0018096
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG04:6490496..6497549
RNA-Seq ExpressionTan0018096
SyntenyTan0018096
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150606.1 uncharacterized protein LOC111018702 [Momordica charantia]1.2e-24184.67Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG STSA PSS+SGVST+GKDSKQ  NSPKK TMRRSRSVSAFSRS+ AD+SADFSNSRDNPLFWSNG  P +EARAVNLEFD+SS RI
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
        SA SSKRV C GVESTRGRSVSRNS SGS G GSRK GGRSLSRVG ERRDRSASVSRYPVSSQS VNSESEAERDSRY+ KSNNRKTPDSVLHGRREVG
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
        L RS+SD+  Q KGLRTRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+S+KGDCLQG +S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        L AP +SADA+GSSNIDIPPELVNP AVELVMDLR+EY+KKLEQSQERARKLRADLAVE+HRGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQVGDGTPQEPAIGTSSYSS--SSNPSELAGSKSQFSFANKPHKTYGLQQDIGKYIQNCEKFDGNES
        LAYFDECVSLSTFDGSDFSSLEE PPIHQVSSTTQVGDGTPQEP IGTSS S   +SN SEL  +KSQFSF  KPH+  G+QQDIGKYI NCEK DGNES
Subjt:  LAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQVGDGTPQEPAIGTSSYSS--SSNPSELAGSKSQFSFANKPHKTYGLQQDIGKYIQNCEKFDGNES

Query:  RGIVSMKFCDMNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        R + + ++C+ N+ NLQKPTE +LFDRLLLRSRIESG +LLCGVSSA+
Subjt:  RGIVSMKFCDMNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

XP_022968380.1 uncharacterized protein LOC111467644 isoform X1 [Cucurbita maxima]4.4e-22580.74Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG+STSATPSS+SGVST+ KDSKQ+DNS KK TMRRSRSVSAF RSS  DVSADFSNSRDNPLFWSNG  P EEAR+VNLE DDSS R+
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
          GSSKRV C GVESTRGRSVSRNSDSGSVG G+RKTG RSLSRVGAERRDRSASVSRY VSSQS+VNSESEAER+S YSTKSN RKTPDSVL GRRE G
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
         VRSSSDAL QSKGL+TRSSQLSPFDLSDNCD+SVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKS+KGDCLQG SSASDIYDIIQYEVRRAVQDIHNDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        L A  S ADAVGSSNIDIPPELVNPGAVE+VMDLR+EY+KKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ-----------EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQD
        LAYFDECVSLSTFDGSDFSS+EET PPIHQVSSTTQV DGTPQ              I T+S S   N  + +      SKSQFSF+NKP +TYG +QQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ-----------EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQD

Query:  IGKYIQNCEKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        IGKYIQ CEK DGN+SR +   K CD  MN+ N++K +E LLFDRL+ R+RIESGSILLCGVSS++
Subjt:  IGKYIQNCEKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

XP_022968381.1 uncharacterized protein LOC111467644 isoform X2 [Cucurbita maxima]5.2e-22681.9Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG+STSATPSS+SGVST+ KDSKQ+DNS KK TMRRSRSVSAF RSS  DVSADFSNSRDNPLFWSNG  P EEAR+VNLE DDSS R+
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
          GSSKRV C GVESTRGRSVSRNSDSGSVG G+RKTG RSLSRVGAERRDRSASVSRY VSSQS+VNSESEAER+S YSTKSN RKTPDSVL GRRE G
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
         VRSSSDAL QSKGL+TRSSQLSPFDLSDNCD+SVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKS+KGDCLQG SSASDIYDIIQYEVRRAVQDIHNDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        L A  S ADAVGSSNIDIPPELVNPGAVE+VMDLR+EY+KKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ---EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQDIGKYIQNC
        LAYFDECVSLSTFDGSDFSS+EET PPIHQVSSTTQV DGTPQ      I T+S S   N  + +      SKSQFSF+NKP +TYG +QQDIGKYIQ C
Subjt:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ---EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQDIGKYIQNC

Query:  EKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        EK DGN+SR +   K CD  MN+ N++K +E LLFDRL+ R+RIESGSILLCGVSS++
Subjt:  EKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

XP_023541137.1 uncharacterized protein LOC111801390 [Cucurbita pepo subsp. pepo]4.4e-22581.54Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG+STSATPSS+SGVST+ KDSKQ+DNS KK TMRRSRSVSAF RSS  DVSADFSNSRDNPLFWSNG  PSEEAR+VNLE DDSS R+
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
         +GSSKRV   GVESTRGRSVSRNSDSGSVGSG+RKTG RSLSRVGAERRDRSASVSRY VSSQS+VNSESEAER+S YSTKSN+RKTPDSVL GRRE G
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
         VRSSSDAL +SKGL+ RSSQLSPFDLSDNCD+SVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKS+KGDCLQG SSASDIYDIIQYEVRRAVQDI NDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        LIA  S ADA+GSSNIDIPPELVNPGAVE+VMDLR+EY+KKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ---EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQDIGKYIQNC
        LAYFDECVSLSTFDGSDFSS+EET PPIHQVSSTTQV DGTPQ      I T+S S   N  + +      SKSQFSF+NKP +TYG +QQDIGKYIQ C
Subjt:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ---EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQDIGKYIQNC

Query:  EKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        EK DGN+SR +   K CD  MN+ N++K +E LLFDRL+ R+RIESGSILLCGVSS++
Subjt:  EKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

XP_038892273.1 uncharacterized protein LOC120081462 isoform X1 [Benincasa hispida]2.6e-22582.29Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG STS TPSS+SG ST+GKDSK S NSPKK T+RRSRSVSAFSRSS ADVSADFSNSRDNPLFWSNG  P EEAR VNLE D SS RI
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
        SAGSSKRV   GVE++RGRSVSR+SDSGSVGSGSRKTGGRSLSRVG ERR+RSASV+RYPVSS SL NSESEAERDSRYSTK NNRKTPDS+LHGRREVG
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVR---SSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIH
        LVR   SSSDAL QSKGLR RSS   PFDLSDNCD+SVSCSFEDRLSTASSLSEAEE+T+RAVCEQMKSIKGDCLQGHSS SDIYDIIQYEVRRAVQDIH
Subjt:  LVR---SSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIH

Query:  NDLLIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLT
        NDLL AP SSAD  GSS+IDIPPELVNPGA+ELV DLR+EYTKKLEQSQERARKLRADLAVEEHR LELSRILREVIPAPKTSMRRKASIERRRMSKRLT
Subjt:  NDLLIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLT

Query:  DDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQVGDGT-PQEPAIGTSS----YS------SSSNPSELAGSKSQFSFANKPHKTYGLQQDIGKY
        DDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQV DGT PQEPAIGTSS    Y+       S+N S+    K+QFSF  KPH++YG++QDIGKY
Subjt:  DDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQVGDGT-PQEPAIGTSS----YS------SSSNPSELAGSKSQFSFANKPHKTYGLQQDIGKY

Query:  IQNCEKFDGNESRGIVSMKFCD-MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSS
        IQ     D NES+ +VSMK CD MN+TNLQK  E LL DR++ RSRIESGS+LLCGVSS
Subjt:  IQNCEKFDGNESRGIVSMKFCD-MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSS

TrEMBL top hitse value%identityAlignment
A0A6J1DC05 uncharacterized protein LOC1110187025.6e-24284.67Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG STSA PSS+SGVST+GKDSKQ  NSPKK TMRRSRSVSAFSRS+ AD+SADFSNSRDNPLFWSNG  P +EARAVNLEFD+SS RI
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
        SA SSKRV C GVESTRGRSVSRNS SGS G GSRK GGRSLSRVG ERRDRSASVSRYPVSSQS VNSESEAERDSRY+ KSNNRKTPDSVLHGRREVG
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
        L RS+SD+  Q KGLRTRSSQLSPFDLSDNCD SVSCSFEDRLSTASSLSEAEEKTI+AVCEQM+S+KGDCLQG +S+SDIYDIIQYEVRRAVQDIHNDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        L AP +SADA+GSSNIDIPPELVNP AVELVMDLR+EY+KKLEQSQERARKLRADLAVE+HRGLELSRILREVIPAPKTSMRRKASIERR+MSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQVGDGTPQEPAIGTSSYSS--SSNPSELAGSKSQFSFANKPHKTYGLQQDIGKYIQNCEKFDGNES
        LAYFDECVSLSTFDGSDFSSLEE PPIHQVSSTTQVGDGTPQEP IGTSS S   +SN SEL  +KSQFSF  KPH+  G+QQDIGKYI NCEK DGNES
Subjt:  LAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQVGDGTPQEPAIGTSSYSS--SSNPSELAGSKSQFSFANKPHKTYGLQQDIGKYIQNCEKFDGNES

Query:  RGIVSMKFCDMNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        R + + ++C+ N+ NLQKPTE +LFDRLLLRSRIESG +LLCGVSSA+
Subjt:  RGIVSMKFCDMNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

A0A6J1G123 uncharacterized protein LOC111449721 isoform X12.4e-22480.39Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG+STSATPSS+SGVST+ KDSKQ++NS KK TMRRSRSVSAF RSS  DVSADFSNSRDNPLFWSNG  P EEAR+VNLE DDSS R+
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
         +GSSKRV C GVESTRGRSVSRNSDSGSVGSG+RKTG RSLSRVGAERRDRSASVSRY VSSQS+VNSESEAER++ YSTKSN+RKTPDSVL GRRE G
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
         VRSSSDAL +SKGL+TRSSQLSPFDLSDNCD+SVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKS+KGDCLQG SSASDIYDIIQYEVRRAVQDIHNDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        L A  S ADAVGSSNIDIP ELVNPGAVE+VMDLR+EY+KKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ-----------EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQD
        LAYFDECVSLSTFDGSDFSS+EET PPIHQVSSTTQV DGTPQ             AI T+S S   N  + +      SKSQFSF+NKP +TYG +QQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ-----------EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQD

Query:  IGKYIQNCEKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        IGKYIQ CEK DGN+SR +   K CD  MN+ N++K +E LLFDRL+ R+RIESGSILLCGVSS++
Subjt:  IGKYIQNCEKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

A0A6J1G148 uncharacterized protein LOC111449721 isoform X22.8e-22581.54Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG+STSATPSS+SGVST+ KDSKQ++NS KK TMRRSRSVSAF RSS  DVSADFSNSRDNPLFWSNG  P EEAR+VNLE DDSS R+
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
         +GSSKRV C GVESTRGRSVSRNSDSGSVGSG+RKTG RSLSRVGAERRDRSASVSRY VSSQS+VNSESEAER++ YSTKSN+RKTPDSVL GRRE G
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
         VRSSSDAL +SKGL+TRSSQLSPFDLSDNCD+SVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKS+KGDCLQG SSASDIYDIIQYEVRRAVQDIHNDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        L A  S ADAVGSSNIDIP ELVNPGAVE+VMDLR+EY+KKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ---EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQDIGKYIQNC
        LAYFDECVSLSTFDGSDFSS+EET PPIHQVSSTTQV DGTPQ     AI T+S S   N  + +      SKSQFSF+NKP +TYG +QQDIGKYIQ C
Subjt:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ---EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQDIGKYIQNC

Query:  EKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        EK DGN+SR +   K CD  MN+ N++K +E LLFDRL+ R+RIESGSILLCGVSS++
Subjt:  EKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

A0A6J1HUP7 uncharacterized protein LOC111467644 isoform X22.5e-22681.9Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG+STSATPSS+SGVST+ KDSKQ+DNS KK TMRRSRSVSAF RSS  DVSADFSNSRDNPLFWSNG  P EEAR+VNLE DDSS R+
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
          GSSKRV C GVESTRGRSVSRNSDSGSVG G+RKTG RSLSRVGAERRDRSASVSRY VSSQS+VNSESEAER+S YSTKSN RKTPDSVL GRRE G
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
         VRSSSDAL QSKGL+TRSSQLSPFDLSDNCD+SVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKS+KGDCLQG SSASDIYDIIQYEVRRAVQDIHNDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        L A  S ADAVGSSNIDIPPELVNPGAVE+VMDLR+EY+KKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ---EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQDIGKYIQNC
        LAYFDECVSLSTFDGSDFSS+EET PPIHQVSSTTQV DGTPQ      I T+S S   N  + +      SKSQFSF+NKP +TYG +QQDIGKYIQ C
Subjt:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ---EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQDIGKYIQNC

Query:  EKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        EK DGN+SR +   K CD  MN+ N++K +E LLFDRL+ R+RIESGSILLCGVSS++
Subjt:  EKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

A0A6J1HX20 uncharacterized protein LOC111467644 isoform X12.1e-22580.74Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI
        MAMAAFKSSSRRG+STSATPSS+SGVST+ KDSKQ+DNS KK TMRRSRSVSAF RSS  DVSADFSNSRDNPLFWSNG  P EEAR+VNLE DDSS R+
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRI

Query:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG
          GSSKRV C GVESTRGRSVSRNSDSGSVG G+RKTG RSLSRVGAERRDRSASVSRY VSSQS+VNSESEAER+S YSTKSN RKTPDSVL GRRE G
Subjt:  SAGSSKRVDCSGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVG

Query:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL
         VRSSSDAL QSKGL+TRSSQLSPFDLSDNCD+SVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKS+KGDCLQG SSASDIYDIIQYEVRRAVQDIHNDL
Subjt:  LVRSSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDL

Query:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
        L A  S ADAVGSSNIDIPPELVNPGAVE+VMDLR+EY+KKLE SQ+RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA
Subjt:  LIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDA

Query:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ-----------EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQD
        LAYFDECVSLSTFDGSDFSS+EET PPIHQVSSTTQV DGTPQ              I T+S S   N  + +      SKSQFSF+NKP +TYG +QQD
Subjt:  LAYFDECVSLSTFDGSDFSSLEET-PPIHQVSSTTQVGDGTPQ-----------EPAIGTSSYSSSSNPSELAG-----SKSQFSFANKPHKTYG-LQQD

Query:  IGKYIQNCEKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS
        IGKYIQ CEK DGN+SR +   K CD  MN+ N++K +E LLFDRL+ R+RIESGSILLCGVSS++
Subjt:  IGKYIQNCEKFDGNESRGIVSMKFCD--MNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50350.1 unknown protein9.4e-0827.02Show/hide
Query:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFS-RSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIR
        MA +AF S+ +R  +TS   SS SG          S  S ++   RR RS+S FS R    D+          P+     ++    +    +  DD ++ 
Subjt:  MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFS-RSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIR

Query:  ISAGSSKRVDCSGVESTRGRSVSRNSDSGSVGS-GSRKTGGRSLSRVGA--------------ERRDRSASVSRYPVSS--------QSLVNSESEAERD
             S   + S  E  RGRS  RNS  G+ G   + +  GRS+SRVG+              E   R  S+SR P S+         S+ N+ S     
Subjt:  ISAGSSKRVDCSGVESTRGRSVSRNSDSGSVGS-GSRKTGGRSLSRVGA--------------ERRDRSASVSRYPVSS--------QSLVNSESEAERD

Query:  SRYSTKSNNRKTPDSVLHG---------RREVGLVR-------SSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAV
        SR   +    +    V+ G         RR + +VR       S  D +  S   R   S +S    S N     S + ++R     S S+   K     
Subjt:  SRYSTKSNNRKTPDSVLHG---------RREVGLVR-------SSSDALHQSKGLRTRSSQLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAV

Query:  CEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDLLIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEE
          Q  ++  D  +G  S+S      ++   R ++ ++     A P   +++G+S      + ++      V      Y  KL++S+ER R+L A++ +EE
Subjt:  CEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDLLIAPPSSADAVGSSNIDIPPELVNPGAVELVMDLRNEYTKKLEQSQERARKLRADLAVEE

Query:  HRGLELSRILREVI------PAPKTSMRRKASIER-RRMSKRLTDDALAYFDECVSLSTFDGSDFSSLEE
         RG ELS  L+E++         K    RK S +R RRMS  LTD+A  + DE +  S  + +DFSSLE+
Subjt:  HRGLELSRILREVI------PAPKTSMRRKASIER-RRMSKRLTDDALAYFDECVSLSTFDGSDFSSLEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATGGCTGCCTTCAAATCGTCGTCTAGAAGAGGCAATTCGACTTCGGCGACACCTTCTTCGACTAGCGGCGTGTCAACAACGGGTAAAGATAGCAAGCAGAGTGA
CAATTCTCCCAAGAAAGGTACTATGCGTAGATCACGAAGCGTGAGTGCTTTTTCCAGATCTAGCAAAGCGGATGTTTCGGCAGATTTTTCTAATAGTAGAGATAATCCGC
TCTTCTGGAGTAATGGTTTGTCTCCGTCGGAGGAAGCTCGCGCTGTTAACCTTGAATTCGACGACAGTTCCATCAGAATCAGTGCAGGAAGTTCGAAACGTGTGGATTGT
AGTGGTGTTGAGAGTACAAGGGGACGATCTGTGTCCAGAAATTCTGATTCTGGAAGTGTAGGTTCAGGAAGCAGGAAAACCGGTGGTCGAAGCTTATCAAGGGTTGGCGC
TGAACGGCGGGATCGCTCGGCGTCTGTGTCTCGGTATCCCGTTTCATCACAGTCACTTGTGAACTCTGAGAGTGAGGCTGAGCGAGATAGTCGTTATAGTACGAAATCCA
ATAACAGAAAGACTCCAGATTCTGTTCTTCATGGTCGGAGAGAGGTTGGTTTAGTTAGAAGTAGTTCAGATGCTTTGCATCAATCGAAAGGCCTCCGGACTCGGTCCAGT
CAACTTTCGCCCTTTGATTTATCAGATAACTGCGATTTATCAGTGTCTTGTAGTTTTGAGGATAGGCTATCCACCGCGAGTTCTTTATCTGAAGCTGAAGAGAAAACTAT
TCGAGCTGTTTGTGAACAAATGAAGTCAATAAAGGGGGATTGTTTGCAAGGACATTCCAGTGCTAGTGACATATATGACATTATCCAATATGAAGTTAGACGTGCTGTCC
AAGATATTCACAATGACCTTCTCATTGCTCCACCAAGTAGTGCTGATGCTGTAGGAAGTTCAAATATTGACATCCCTCCGGAATTGGTGAATCCAGGTGCAGTTGAATTG
GTGATGGACTTGAGAAATGAGTATACCAAGAAGCTCGAACAGTCACAAGAACGAGCTAGAAAACTTCGGGCAGACTTGGCAGTAGAGGAGCATCGTGGATTAGAGCTAAG
TAGAATTTTGAGGGAAGTAATACCAGCTCCTAAGACCTCTATGAGACGAAAAGCTAGCATTGAAAGAAGAAGGATGTCAAAACGTCTAACTGACGATGCCTTGGCTTATT
TTGATGAGTGTGTATCATTATCAACATTTGATGGTTCTGACTTCTCATCACTGGAAGAAACACCCCCAATTCACCAAGTTTCTTCCACTACCCAGGTGGGAGATGGTACC
CCCCAGGAACCAGCGATTGGAACTTCATCTTACAGCAGCAGCAGCAATCCCAGCGAACTGGCAGGCAGCAAATCTCAGTTTTCGTTCGCTAATAAACCACACAAAACTTA
TGGGTTGCAACAGGACATTGGGAAGTACATTCAGAACTGCGAGAAATTCGATGGCAATGAATCACGGGGGATTGTAAGCATGAAGTTTTGCGATATGAATGAAACAAATC
TGCAGAAGCCAACAGAAAGACTCTTGTTTGACCGACTTCTTCTCAGAAGCAGAATAGAGTCGGGTAGCATACTACTCTGTGGTGTAAGCTCAGCTTCATTGTGGTAA
mRNA sequenceShow/hide mRNA sequence
CTTTTACAAGACGAGAAAGAAACGCACAGAGCACGTGTTGTCTCGGTGGTTTGTGCTTAACACGATTCGTTTTTCCGATCAGCGGCGCAATCAAGAAACTTGGCGGAAGC
AGAACGGTCGCCGTCGCCGTCGCCTTCTTCCCTCTCGGCATCATTCTGTATAAATCATTGCATCTTCTTCCTTTACATTTTTTAATCAATCATACTCCGTAATCTCCGCC
TACTACTATTTTTTTTTACTCTTATGAATGCTTCATCGAAGCCTCACCGCTTTGACTACTCTCGCCTACCTTTTTTAGATGCATTCTAAAATCCTCTCAACTCGCCATTT
CCGATTCTTGTGTTAGTTTCATTGTTATTTTGATTGTTAATTAGTAATGGCGATGGCTGCCTTCAAATCGTCGTCTAGAAGAGGCAATTCGACTTCGGCGACACCTTCTT
CGACTAGCGGCGTGTCAACAACGGGTAAAGATAGCAAGCAGAGTGACAATTCTCCCAAGAAAGGTACTATGCGTAGATCACGAAGCGTGAGTGCTTTTTCCAGATCTAGC
AAAGCGGATGTTTCGGCAGATTTTTCTAATAGTAGAGATAATCCGCTCTTCTGGAGTAATGGTTTGTCTCCGTCGGAGGAAGCTCGCGCTGTTAACCTTGAATTCGACGA
CAGTTCCATCAGAATCAGTGCAGGAAGTTCGAAACGTGTGGATTGTAGTGGTGTTGAGAGTACAAGGGGACGATCTGTGTCCAGAAATTCTGATTCTGGAAGTGTAGGTT
CAGGAAGCAGGAAAACCGGTGGTCGAAGCTTATCAAGGGTTGGCGCTGAACGGCGGGATCGCTCGGCGTCTGTGTCTCGGTATCCCGTTTCATCACAGTCACTTGTGAAC
TCTGAGAGTGAGGCTGAGCGAGATAGTCGTTATAGTACGAAATCCAATAACAGAAAGACTCCAGATTCTGTTCTTCATGGTCGGAGAGAGGTTGGTTTAGTTAGAAGTAG
TTCAGATGCTTTGCATCAATCGAAAGGCCTCCGGACTCGGTCCAGTCAACTTTCGCCCTTTGATTTATCAGATAACTGCGATTTATCAGTGTCTTGTAGTTTTGAGGATA
GGCTATCCACCGCGAGTTCTTTATCTGAAGCTGAAGAGAAAACTATTCGAGCTGTTTGTGAACAAATGAAGTCAATAAAGGGGGATTGTTTGCAAGGACATTCCAGTGCT
AGTGACATATATGACATTATCCAATATGAAGTTAGACGTGCTGTCCAAGATATTCACAATGACCTTCTCATTGCTCCACCAAGTAGTGCTGATGCTGTAGGAAGTTCAAA
TATTGACATCCCTCCGGAATTGGTGAATCCAGGTGCAGTTGAATTGGTGATGGACTTGAGAAATGAGTATACCAAGAAGCTCGAACAGTCACAAGAACGAGCTAGAAAAC
TTCGGGCAGACTTGGCAGTAGAGGAGCATCGTGGATTAGAGCTAAGTAGAATTTTGAGGGAAGTAATACCAGCTCCTAAGACCTCTATGAGACGAAAAGCTAGCATTGAA
AGAAGAAGGATGTCAAAACGTCTAACTGACGATGCCTTGGCTTATTTTGATGAGTGTGTATCATTATCAACATTTGATGGTTCTGACTTCTCATCACTGGAAGAAACACC
CCCAATTCACCAAGTTTCTTCCACTACCCAGGTGGGAGATGGTACCCCCCAGGAACCAGCGATTGGAACTTCATCTTACAGCAGCAGCAGCAATCCCAGCGAACTGGCAG
GCAGCAAATCTCAGTTTTCGTTCGCTAATAAACCACACAAAACTTATGGGTTGCAACAGGACATTGGGAAGTACATTCAGAACTGCGAGAAATTCGATGGCAATGAATCA
CGGGGGATTGTAAGCATGAAGTTTTGCGATATGAATGAAACAAATCTGCAGAAGCCAACAGAAAGACTCTTGTTTGACCGACTTCTTCTCAGAAGCAGAATAGAGTCGGG
TAGCATACTACTCTGTGGTGTAAGCTCAGCTTCATTGTGGTAAAAGCATAAAATCCGAAAGCGAAGAAAGGATTTGCAATAATTTTAATTGTATTGGTTAAGCTGTTGGC
ATATTTCATAGGAATGAGCTAAATTTCTAACAGTCCCAACTTTGATCGTAATAACATTGTGATGGCTGGATCTTAGGTGTGGGTAAGATCATGATCGAGACAGTTCAGTT
CAGCGCTCAGCAGTGGTTAAGTAAATTGCACGGTGAAAAGTTGGCTACTTTCAAAGTCTATAAAAGAGGTAAAGAATCGAACTAAATTGAAGAACTGAATTGAATAGACC
CTTAACCGAGTTGATTTGCTTCTGTTTTTCGCGTTGGTCTAGTTGGAAATCATGTACAATTGAAATGACATCGATGACATTATGCATGTATATTCAAACATCTATTTCAT
TTTTAACATGTTATGAATCGAGTAATTTTAGAGAATAAATTATAACTCTTAAAATAATAAAGTAACTAAATAAATATTAGTCCTCCTAATATATGGAAATTGAACGGATG
GATTCTGTTCATTTCGAGGAAAAAACAGAAGACCAGACCCAACAGAGCAGCTCTGCTTTGAAAAACCAACGATCATATAAGCATTTAATAAATTTCATTTCTAAACATAT
AAAAATAGTTTGGAGTGTTCGTTAATTTAAATGAAATTATTGTAATTTAGTTATAACAACCATAGAATTTTTATAGTTTAAATTGCCAAGTATATCTTTGTTATTACTTC
AAATTATTCGCTAATGAAGATTTCTTTTGAATTTTCGATTTATAACTATTTCAAAATCTAGCATCCCATATTATCCCTCCCTTCATTTTAAAACTTATCCCTTTTTTATT
TCAAAAGTCAATATTAAGTCTAAATAAAAAATCAAGAAATAAAATTAGATAATTAGATTAAAAAAAAATCACAGAAACCAGATAATGTAA
Protein sequenceShow/hide protein sequence
MAMAAFKSSSRRGNSTSATPSSTSGVSTTGKDSKQSDNSPKKGTMRRSRSVSAFSRSSKADVSADFSNSRDNPLFWSNGLSPSEEARAVNLEFDDSSIRISAGSSKRVDC
SGVESTRGRSVSRNSDSGSVGSGSRKTGGRSLSRVGAERRDRSASVSRYPVSSQSLVNSESEAERDSRYSTKSNNRKTPDSVLHGRREVGLVRSSSDALHQSKGLRTRSS
QLSPFDLSDNCDLSVSCSFEDRLSTASSLSEAEEKTIRAVCEQMKSIKGDCLQGHSSASDIYDIIQYEVRRAVQDIHNDLLIAPPSSADAVGSSNIDIPPELVNPGAVEL
VMDLRNEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRRMSKRLTDDALAYFDECVSLSTFDGSDFSSLEETPPIHQVSSTTQVGDGT
PQEPAIGTSSYSSSSNPSELAGSKSQFSFANKPHKTYGLQQDIGKYIQNCEKFDGNESRGIVSMKFCDMNETNLQKPTERLLFDRLLLRSRIESGSILLCGVSSASLW