; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028907 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028907
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00153210:1431479..1438049
RNA-Seq ExpressionSgr028907
SyntenySgr028907
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150606.1 uncharacterized protein LOC111018702 [Momordica charantia]7.2e-21883.27Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS TADISADFSNSRDNPLFWSNGSPP DE R+VNLEFDESSTRI A SS R+S  GVES RGRSVSRN  SGS G GSRK GGRSLSRVGT
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRYPVSSQSF+NS SEAERDSRY+ KSNNRKTPDSVLHGRREVGLARS++D+ +Q KGLRTRSSQLSPFDLSDNCD S SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTIKAVCEQMRSMKGDCLQG TS+S IYDIIQYEVRRA+Q+IHNDLLN+PQ+SA+AI SSNIDIPPE+VNP AVELVMDLRSEY+KKLEQSQE
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGTTQEPTIG
        RARKLRADLAVE+HRGLELSRILREVIPAPKTSMRRKASIERRKMSK LTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSST QVGDGT QEPTIG
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGTTQEPTIG

Query:  TSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYGIQQDIGKYIQNCEKDGNEARVVSMK-YSDMNNSNLQNPTESLLFDRLLFRSRIESGSLL
        TSS+ ++YN        S+LS+LGD KSQFSFT K HE  GIQQDIGKYI NCEKDGNE+RV++ K Y + N++NLQ PTES+LFDRLL RSRIESG LL
Subjt:  TSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYGIQQDIGKYIQNCEKDGNEARVVSMK-YSDMNNSNLQNPTESLLFDRLLFRSRIESGSLL

Query:  LC
        LC
Subjt:  LC

XP_022968380.1 uncharacterized protein LOC111467644 isoform X1 [Cucurbita maxima]1.3e-19876.6Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS+T D+SADFSNSRDNPLFWSNGSPP +E RSVNLE D+SS R+R GSS R+S  GVES RGRSVSRN DSGS G G+RKTG RSLSRVG 
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRY VSSQS +NS SEAER+S YSTKSN RKTPDSVL GRRE G  RSS+DALQQSKGL+TRSSQLSPFDLSDNCDVS SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTI+AVCEQM+SMKGDCLQG +S S IYDIIQYEVRRA+Q+IHNDLLN+ QS A+A+ SSNIDIPPE+VNPGAVE+VMDLRSEY+KKLE SQ+
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ----
        RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSK LTDDALAYFDECVSLSTFDGSDFSS+EE  PPIHQVSST QV DGT Q    
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ----

Query:  -------EPTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLF
                 TI T+S  ++YNLG+T+Y SS LS     KSQFSF+NK  ETYG IQQDIGKYIQ CEKDGN++RVVSMK      MN+ N++  +ESLLF
Subjt:  -------EPTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLF

Query:  DRLLFRSRIESGSLLLC
        DRL+FR+RIESGS+LLC
Subjt:  DRLLFRSRIESGSLLLC

XP_022968381.1 uncharacterized protein LOC111467644 isoform X2 [Cucurbita maxima]1.5e-19977.8Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS+T D+SADFSNSRDNPLFWSNGSPP +E RSVNLE D+SS R+R GSS R+S  GVES RGRSVSRN DSGS G G+RKTG RSLSRVG 
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRY VSSQS +NS SEAER+S YSTKSN RKTPDSVL GRRE G  RSS+DALQQSKGL+TRSSQLSPFDLSDNCDVS SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTI+AVCEQM+SMKGDCLQG +S S IYDIIQYEVRRA+Q+IHNDLLN+ QS A+A+ SSNIDIPPE+VNPGAVE+VMDLRSEY+KKLE SQ+
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ---E
        RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSK LTDDALAYFDECVSLSTFDGSDFSS+EE  PPIHQVSST QV DGT Q    
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ---E

Query:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLFDRLLFRSR
         TI T+S  ++YNLG+T+Y SS LS     KSQFSF+NK  ETYG IQQDIGKYIQ CEKDGN++RVVSMK      MN+ N++  +ESLLFDRL+FR+R
Subjt:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLFDRLLFRSR

Query:  IESGSLLLC
        IESGS+LLC
Subjt:  IESGSLLLC

XP_023541137.1 uncharacterized protein LOC111801390 [Cucurbita pepo subsp. pepo]9.8e-19977.6Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS+T D+SADFSNSRDNPLFWSNGSPPS+E RSVNLE D+SS R+R+GSS R+S  GVES RGRSVSRN DSGS GSG+RKTG RSLSRVG 
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRY VSSQS +NS SEAER+S YSTKSN+RKTPDSVL GRRE G  RSS+DALQ+SKGL+ RSSQLSPFDLSDNCDVS SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTI+AVCEQM+SMKGDCLQG +S S IYDIIQYEVRRA+Q+I NDLL + QS A+AI SSNIDIPPE+VNPGAVE+VMDLRSEY+KKLE SQ+
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ---E
        RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSK LTDDALAYFDECVSLSTFDGSDFSS+EE  PPIHQVSST QV DGT Q    
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ---E

Query:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLFDRLLFRSR
         TI T+S  ++YNLG+T+Y SS+LS     KSQFSF+NK  ETYG IQQDIGKYIQ CEKDGN++RVVSMK      MN+ N++  +ESLLFDRL+FR+R
Subjt:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLFDRLLFRSR

Query:  IESGSLLLC
        IESGS+LLC
Subjt:  IESGSLLLC

XP_038892273.1 uncharacterized protein LOC120081462 isoform X1 [Benincasa hispida]1.1e-20579.18Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS+ AD+SADFSNSRDNPLFWSNGSPP +E R+VNLE D SSTRI AGSS R+S  GVE+ RGRSVSR+ DSGS GSGSRKTGGRSLSRVGT
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLAR---SSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLS
        ERR+RSASV+RYPVSS S LNS SEAERDSRYSTK NNRKTPDS+LHGRREVGL R   SS+DALQQSKGLR RSS   PFDLSDNCDVS SCSFEDRLS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLAR---SSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLS

Query:  TASSLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQ
        TASSLSEAEE+T++AVCEQM+S+KGDCLQGH+S S IYDIIQYEVRRA+Q+IHNDLL++PQSSA+   SS+IDIPPE+VNPGA+ELV DLRSEYTKKLEQ
Subjt:  TASSLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQ

Query:  SQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGT-TQE
        SQERARKLRADLAVEEHR LELSRILREVIPAPKTSMRRKASIERR+MSK LTDDALAYFDECVSLSTFDGSDFSSLEE PPIHQVSST QV DGT  QE
Subjt:  SQERARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGT-TQE

Query:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYGIQQDIGKYIQNCEKDGNEARVVSMKYSD-MNNSNLQNPTESLLFDRLLFRSRIES
        P IGTSS ID+YNLG+T+Y S++LSK G GK+QFSFT K HE+YGI+QDIGKYIQ   KD NE++VVSMK+ D MN++NLQ   ESLL DR++FRSRIES
Subjt:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYGIQQDIGKYIQNCEKDGNEARVVSMKYSD-MNNSNLQNPTESLLFDRLLFRSRIES

Query:  GSLLLCSGDNDINC
        GSLLLC G + +NC
Subjt:  GSLLLCSGDNDINC

TrEMBL top hitse value%identityAlignment
A0A6J1DC05 uncharacterized protein LOC1110187023.5e-21883.27Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS TADISADFSNSRDNPLFWSNGSPP DE R+VNLEFDESSTRI A SS R+S  GVES RGRSVSRN  SGS G GSRK GGRSLSRVGT
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRYPVSSQSF+NS SEAERDSRY+ KSNNRKTPDSVLHGRREVGLARS++D+ +Q KGLRTRSSQLSPFDLSDNCD S SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTIKAVCEQMRSMKGDCLQG TS+S IYDIIQYEVRRA+Q+IHNDLLN+PQ+SA+AI SSNIDIPPE+VNP AVELVMDLRSEY+KKLEQSQE
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGTTQEPTIG
        RARKLRADLAVE+HRGLELSRILREVIPAPKTSMRRKASIERRKMSK LTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSST QVGDGT QEPTIG
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGTTQEPTIG

Query:  TSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYGIQQDIGKYIQNCEKDGNEARVVSMK-YSDMNNSNLQNPTESLLFDRLLFRSRIESGSLL
        TSS+ ++YN        S+LS+LGD KSQFSFT K HE  GIQQDIGKYI NCEKDGNE+RV++ K Y + N++NLQ PTES+LFDRLL RSRIESG LL
Subjt:  TSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYGIQQDIGKYIQNCEKDGNEARVVSMK-YSDMNNSNLQNPTESLLFDRLLFRSRIESGSLL

Query:  LC
        LC
Subjt:  LC

A0A6J1G123 uncharacterized protein LOC111449721 isoform X11.5e-19776.02Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS+T D+SADFSNSRDNPLFWSNGSPP +E RSVNLE D+SS R+R+GSS R+S  GVES RGRSVSRN DSGS GSG+RKTG RSLSRVG 
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRY VSSQS +NS SEAER++ YSTKSN+RKTPDSVL GRRE G  RSS+DALQ+SKGL+TRSSQLSPFDLSDNCDVS SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTI+AVCEQM+SMKGDCLQG +S S IYDIIQYEVRRA+Q+IHNDLLN+ QS A+A+ SSNIDIP E+VNPGAVE+VMDLRSEY+KKLE SQ+
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ----
        RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSK LTDDALAYFDECVSLSTFDGSDFSS+EE  PPIHQVSST QV DGT Q    
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ----

Query:  -------EPTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLF
                  I T+S  ++YNLG+T+Y SS+LS     KSQFSF+NK  ETYG IQQDIGKYIQ CEKDGN++RVVSMK      MN+ N++  +ESLLF
Subjt:  -------EPTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLF

Query:  DRLLFRSRIESGSLLLC
        DRL+FR+RIESGS+LLC
Subjt:  DRLLFRSRIESGSLLLC

A0A6J1G148 uncharacterized protein LOC111449721 isoform X21.8e-19877.21Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS+T D+SADFSNSRDNPLFWSNGSPP +E RSVNLE D+SS R+R+GSS R+S  GVES RGRSVSRN DSGS GSG+RKTG RSLSRVG 
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRY VSSQS +NS SEAER++ YSTKSN+RKTPDSVL GRRE G  RSS+DALQ+SKGL+TRSSQLSPFDLSDNCDVS SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTI+AVCEQM+SMKGDCLQG +S S IYDIIQYEVRRA+Q+IHNDLLN+ QS A+A+ SSNIDIP E+VNPGAVE+VMDLRSEY+KKLE SQ+
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ---E
        RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSK LTDDALAYFDECVSLSTFDGSDFSS+EE  PPIHQVSST QV DGT Q    
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ---E

Query:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLFDRLLFRSR
          I T+S  ++YNLG+T+Y SS+LS     KSQFSF+NK  ETYG IQQDIGKYIQ CEKDGN++RVVSMK      MN+ N++  +ESLLFDRL+FR+R
Subjt:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLFDRLLFRSR

Query:  IESGSLLLC
        IESGS+LLC
Subjt:  IESGSLLLC

A0A6J1HUP7 uncharacterized protein LOC111467644 isoform X27.3e-20077.8Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS+T D+SADFSNSRDNPLFWSNGSPP +E RSVNLE D+SS R+R GSS R+S  GVES RGRSVSRN DSGS G G+RKTG RSLSRVG 
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRY VSSQS +NS SEAER+S YSTKSN RKTPDSVL GRRE G  RSS+DALQQSKGL+TRSSQLSPFDLSDNCDVS SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTI+AVCEQM+SMKGDCLQG +S S IYDIIQYEVRRA+Q+IHNDLLN+ QS A+A+ SSNIDIPPE+VNPGAVE+VMDLRSEY+KKLE SQ+
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ---E
        RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSK LTDDALAYFDECVSLSTFDGSDFSS+EE  PPIHQVSST QV DGT Q    
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ---E

Query:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLFDRLLFRSR
         TI T+S  ++YNLG+T+Y SS LS     KSQFSF+NK  ETYG IQQDIGKYIQ CEKDGN++RVVSMK      MN+ N++  +ESLLFDRL+FR+R
Subjt:  PTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLFDRLLFRSR

Query:  IESGSLLLC
        IESGS+LLC
Subjt:  IESGSLLLC

A0A6J1HX20 uncharacterized protein LOC111467644 isoform X16.2e-19976.6Show/hide
Query:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT
        SR  +   RS+T D+SADFSNSRDNPLFWSNGSPP +E RSVNLE D+SS R+R GSS R+S  GVES RGRSVSRN DSGS G G+RKTG RSLSRVG 
Subjt:  SRVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGT

Query:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS
        ERRDRSASVSRY VSSQS +NS SEAER+S YSTKSN RKTPDSVL GRRE G  RSS+DALQQSKGL+TRSSQLSPFDLSDNCDVS SCSFEDRLSTAS
Subjt:  ERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTAS

Query:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE
        SLSEAEEKTI+AVCEQM+SMKGDCLQG +S S IYDIIQYEVRRA+Q+IHNDLLN+ QS A+A+ SSNIDIPPE+VNPGAVE+VMDLRSEY+KKLE SQ+
Subjt:  SLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQE

Query:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ----
        RARKLRADLAVEE RGLELSRILREVIPAPKTSMRRKASIERR+MSK LTDDALAYFDECVSLSTFDGSDFSS+EE  PPIHQVSST QV DGT Q    
Subjt:  RARKLRADLAVEEHRGLELSRILREVIPAPKTSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEA-PPIHQVSSTAQVGDGTTQ----

Query:  -------EPTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLF
                 TI T+S  ++YNLG+T+Y SS LS     KSQFSF+NK  ETYG IQQDIGKYIQ CEKDGN++RVVSMK      MN+ N++  +ESLLF
Subjt:  -------EPTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG-IQQDIGKYIQNCEKDGNEARVVSMKYS---DMNNSNLQNPTESLLF

Query:  DRLLFRSRIESGSLLLC
        DRL+FR+RIESGS+LLC
Subjt:  DRLLFRSRIESGSLLLC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G50350.1 unknown protein2.4e-0927.67Show/hide
Query:  GSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGTERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLA
        G   RL  D   S R RS+SR   S ++G  S+     S+S   + RR    SVSR P       NS    E D R +  S  R++   V   RR +  +
Subjt:  GSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGTERRDRSASVSRYPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLA

Query:  RSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTS---GIYDIIQ--YEVRRAIQNIH
         S  D +Q S   R   S +S    S N     S + ++R     S S+   K       Q  ++  D  +G  S+S   G   II+  Y   +A     
Subjt:  RSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMKGDCLQGHTSTS---GIYDIIQ--YEVRRAIQNIH

Query:  NDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVI------PAPKTSMRRKASIER-R
          L NS   S       N  +                   Y  KL++S+ER R+L A++ +EE RG ELS  L+E++         K    RK S +R R
Subjt:  NDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVI------PAPKTSMRRKASIER-R

Query:  KMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGTTQEPTI--GTSSMIDRYNLGKTAYSSSDLS-------KLGDGKSQFSFTN
        +MS CLTD+A  + DE +  S  + +DFSSLE+       SS   +   ++Q  +    TS  +D   L    + + D+S       K     +  S   
Subjt:  KMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGTTQEPTI--GTSSMIDRYNLGKTAYSSSDLS-------KLGDGKSQFSFTN

Query:  KQHETYGIQQDIGKYIQNCEKDGNEARVVSMKYSDMNNSN--------LQNPTES-LLFDRLLFRSRIESGSLLLCS
        +   T       G  I      G+ +   S+    M            L+ P  S +L +    R RI SGSL+LCS
Subjt:  KQHETYGIQQDIGKYIQNCEKDGNEARVVSMKYSDMNNSN--------LQNPTES-LLFDRLLFRSRIESGSLLLCS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTTTACAAGACGAAAATGAAACGCTTAAAGCACCGCACGTGTCGTCAGCGTTTAACACGGATTTCGTTTTTCCGATCAGCGTGGCGGTGAAATCAAGAAACTTCGT
ATATTTTACTCTATGGACACTTCATCGAAGCCTCACCGCTGCCACTATTCTCGCCTCCCTTTTTTTATGTCTTCTGAAGTCTTCTCTGCTCACCATTCTGATTCTTTACT
CTTCTCTTCCCATATTTGTGGTAGTTTCATCGTTGTTTGGTTGTTGTAATGGCGGTGGCCGCCTTCAAATCGTCGTCCAGAAGAGGGGGTTCGACTTCGGCAACGCGTCT
CGAGTGGCGGCGTGCGACATCAGATCCACCACAGCGGATATTTCCGCTGATTTTTCTAATAGTAGAGATAATCCGCTCTTCTGGAGCAATGGTTCGCCCCCATCGGATGA
AGTTCGTTCTGTTAACCTCGAATTCGACGAAAGTTCCACCAGAATTAGGGCAGGAAGTTCGACACGGTTGAGTCGTGATGGTGTTGAGAGTATGAGGGGACGATCGGTGT
CTAGAAATTATGATTCTGGAAGTAAGGGTTCAGGAAGCAGGAAGACCGGTGGCCGAAGCTTGTCGAGGGTAGGCACTGAACGGCGGGACCGCTCGGCGTCTGTGTCTCGA
TATCCCGTCTCATCGCAGTCGTTTCTGAACTCTGCGAGTGAGGCAGAGCGAGATAGTCGTTATAGTACGAAATCCAATAATAGAAAGACTCCAGATTCGGTGCTTCATGG
TCGAAGAGAGGTTGGTTTAGCTAGAAGTAGTACGGATGCTTTGCAGCAATCCAAAGGCCTGCGAACACGGTCCAGTCAACTTTCACCCTTTGATTTATCAGATAACTGCG
ATGTATCAGGGTCTTGTAGTTTTGAGGATAGGCTGTCCACTGCGAGTTCTTTATCTGAAGCCGAAGAGAAAACAATAAAAGCTGTTTGCGAACAAATGAGGTCAATGAAG
GGGGATTGTTTGCAAGGACATACCAGTACTAGTGGCATATATGACATTATTCAATATGAAGTAAGACGTGCTATCCAAAATATCCATAACGACCTTCTTAATTCTCCACA
AAGCAGTGCCAATGCTATAGCGAGTTCAAATATTGATATCCCTCCTGAAATGGTGAATCCAGGTGCAGTTGAACTGGTGATGGACTTGAGGAGCGAGTATACCAAGAAGC
TTGAGCAGTCACAAGAGCGAGCTAGAAAACTTCGGGCAGACTTGGCAGTTGAGGAGCATCGTGGGTTAGAGCTCAGTAGAATTTTGCGGGAAGTAATACCAGCTCCTAAG
ACCTCTATGAGACGAAAGGCAAGCATTGAAAGAAGAAAGATGTCAAAATGTCTTACTGACGATGCCTTGGCATATTTTGATGAGTGTGTATCATTATCAACATTTGATGG
TTCTGACTTTTCATCACTGGAGGAAGCACCCCCAATTCACCAAGTTTCTTCCACTGCCCAGGTGGGAGATGGTACAACCCAGGAACCAACCATTGGAACTTCATCCATGA
TTGACCGATATAATTTAGGAAAGACAGCTTACAGCAGCAGTGATCTCAGCAAACTGGGGGATGGAAAATCTCAGTTTTCCTTCACTAATAAACAACACGAGACTTATGGA
ATTCAACAGGACATTGGGAAGTACATTCAGAACTGTGAGAAAGATGGCAACGAAGCACGGGTTGTAAGCATGAAGTATAGCGACATGAATAATTCAAATCTGCAGAATCC
AACAGAAAGCCTCTTGTTTGATCGGCTTCTTTTCAGAAGCAGAATAGAGTCGGGTAGTCTACTTCTCTGCAGTGGTGATAATGACATCAATTGCCTGAAACGGATCTTAA
TGGGTATTAGGTGTCGGTCGGATCATGATAAAGAGGTTGGTAGGTGGTTATGTAATTCTCAAATCGTATACTTGCCAAGAGTCCGAGACAGACAAGAATTGAGGCTAAAA
TTTGAGAATTCAGTTGATGTATCTGATAAACGAAGGCGAAAGATTTGGCAAAAAGCATACATTCCTATTATCACTGAAATTGCGAGCAAGATTATGTCACAAGATTATTT
AGCTTCAAGAGCTGCAATCTCATATCTCACCTCCTCCATGTGCTCATTTATGGTTGTCACAGTATCCTGCACGCCTTTGATAGCTTCTTTGAGAGCATTGTCATATTCGG
CATCTGCTTCTGCTTCTTCAGTACCAGAAGCTTCGGTTATCTCACGGACGTGCACGCACTCGTGCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTTTACAAGACGAAAATGAAACGCTTAAAGCACCGCACGTGTCGTCAGCGTTTAACACGGATTTCGTTTTTCCGATCAGCGTGGCGGTGAAATCAAGAAACTTCGT
ATATTTTACTCTATGGACACTTCATCGAAGCCTCACCGCTGCCACTATTCTCGCCTCCCTTTTTTTATGTCTTCTGAAGTCTTCTCTGCTCACCATTCTGATTCTTTACT
CTTCTCTTCCCATATTTGTGGTAGTTTCATCGTTGTTTGGTTGTTGTAATGGCGGTGGCCGCCTTCAAATCGTCGTCCAGAAGAGGGGGTTCGACTTCGGCAACGCGTCT
CGAGTGGCGGCGTGCGACATCAGATCCACCACAGCGGATATTTCCGCTGATTTTTCTAATAGTAGAGATAATCCGCTCTTCTGGAGCAATGGTTCGCCCCCATCGGATGA
AGTTCGTTCTGTTAACCTCGAATTCGACGAAAGTTCCACCAGAATTAGGGCAGGAAGTTCGACACGGTTGAGTCGTGATGGTGTTGAGAGTATGAGGGGACGATCGGTGT
CTAGAAATTATGATTCTGGAAGTAAGGGTTCAGGAAGCAGGAAGACCGGTGGCCGAAGCTTGTCGAGGGTAGGCACTGAACGGCGGGACCGCTCGGCGTCTGTGTCTCGA
TATCCCGTCTCATCGCAGTCGTTTCTGAACTCTGCGAGTGAGGCAGAGCGAGATAGTCGTTATAGTACGAAATCCAATAATAGAAAGACTCCAGATTCGGTGCTTCATGG
TCGAAGAGAGGTTGGTTTAGCTAGAAGTAGTACGGATGCTTTGCAGCAATCCAAAGGCCTGCGAACACGGTCCAGTCAACTTTCACCCTTTGATTTATCAGATAACTGCG
ATGTATCAGGGTCTTGTAGTTTTGAGGATAGGCTGTCCACTGCGAGTTCTTTATCTGAAGCCGAAGAGAAAACAATAAAAGCTGTTTGCGAACAAATGAGGTCAATGAAG
GGGGATTGTTTGCAAGGACATACCAGTACTAGTGGCATATATGACATTATTCAATATGAAGTAAGACGTGCTATCCAAAATATCCATAACGACCTTCTTAATTCTCCACA
AAGCAGTGCCAATGCTATAGCGAGTTCAAATATTGATATCCCTCCTGAAATGGTGAATCCAGGTGCAGTTGAACTGGTGATGGACTTGAGGAGCGAGTATACCAAGAAGC
TTGAGCAGTCACAAGAGCGAGCTAGAAAACTTCGGGCAGACTTGGCAGTTGAGGAGCATCGTGGGTTAGAGCTCAGTAGAATTTTGCGGGAAGTAATACCAGCTCCTAAG
ACCTCTATGAGACGAAAGGCAAGCATTGAAAGAAGAAAGATGTCAAAATGTCTTACTGACGATGCCTTGGCATATTTTGATGAGTGTGTATCATTATCAACATTTGATGG
TTCTGACTTTTCATCACTGGAGGAAGCACCCCCAATTCACCAAGTTTCTTCCACTGCCCAGGTGGGAGATGGTACAACCCAGGAACCAACCATTGGAACTTCATCCATGA
TTGACCGATATAATTTAGGAAAGACAGCTTACAGCAGCAGTGATCTCAGCAAACTGGGGGATGGAAAATCTCAGTTTTCCTTCACTAATAAACAACACGAGACTTATGGA
ATTCAACAGGACATTGGGAAGTACATTCAGAACTGTGAGAAAGATGGCAACGAAGCACGGGTTGTAAGCATGAAGTATAGCGACATGAATAATTCAAATCTGCAGAATCC
AACAGAAAGCCTCTTGTTTGATCGGCTTCTTTTCAGAAGCAGAATAGAGTCGGGTAGTCTACTTCTCTGCAGTGGTGATAATGACATCAATTGCCTGAAACGGATCTTAA
TGGGTATTAGGTGTCGGTCGGATCATGATAAAGAGGTTGGTAGGTGGTTATGTAATTCTCAAATCGTATACTTGCCAAGAGTCCGAGACAGACAAGAATTGAGGCTAAAA
TTTGAGAATTCAGTTGATGTATCTGATAAACGAAGGCGAAAGATTTGGCAAAAAGCATACATTCCTATTATCACTGAAATTGCGAGCAAGATTATGTCACAAGATTATTT
AGCTTCAAGAGCTGCAATCTCATATCTCACCTCCTCCATGTGCTCATTTATGGTTGTCACAGTATCCTGCACGCCTTTGATAGCTTCTTTGAGAGCATTGTCATATTCGG
CATCTGCTTCTGCTTCTTCAGTACCAGAAGCTTCGGTTATCTCACGGACGTGCACGCACTCGTGCTCTTGA
Protein sequenceShow/hide protein sequence
MCLQDENETLKAPHVSSAFNTDFVFPISVAVKSRNFVYFTLWTLHRSLTAATILASLFLCLLKSSLLTILILYSSLPIFVVVSSLFGCCNGGGRLQIVVQKRGFDFGNAS
RVAACDIRSTTADISADFSNSRDNPLFWSNGSPPSDEVRSVNLEFDESSTRIRAGSSTRLSRDGVESMRGRSVSRNYDSGSKGSGSRKTGGRSLSRVGTERRDRSASVSR
YPVSSQSFLNSASEAERDSRYSTKSNNRKTPDSVLHGRREVGLARSSTDALQQSKGLRTRSSQLSPFDLSDNCDVSGSCSFEDRLSTASSLSEAEEKTIKAVCEQMRSMK
GDCLQGHTSTSGIYDIIQYEVRRAIQNIHNDLLNSPQSSANAIASSNIDIPPEMVNPGAVELVMDLRSEYTKKLEQSQERARKLRADLAVEEHRGLELSRILREVIPAPK
TSMRRKASIERRKMSKCLTDDALAYFDECVSLSTFDGSDFSSLEEAPPIHQVSSTAQVGDGTTQEPTIGTSSMIDRYNLGKTAYSSSDLSKLGDGKSQFSFTNKQHETYG
IQQDIGKYIQNCEKDGNEARVVSMKYSDMNNSNLQNPTESLLFDRLLFRSRIESGSLLLCSGDNDINCLKRILMGIRCRSDHDKEVGRWLCNSQIVYLPRVRDRQELRLK
FENSVDVSDKRRRKIWQKAYIPIITEIASKIMSQDYLASRAAISYLTSSMCSFMVVTVSCTPLIASLRALSYSASASASSVPEASVISRTCTHSCS