; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0024665 (gene) of Chayote v1 genome

Gene IDSed0024665
OrganismSechium edule (Chayote v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG08:604080..609635
RNA-Seq ExpressionSed0024665
SyntenySed0024665
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN63561.1 hypothetical protein Csa_013107 [Cucumis sativus]1.0e-21888.11Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVA FL  STS +L SAPPI  AENF+P    AA E+QK N+V+AYL NINKPP+K IQSPDGDLIDCV SHLQPAFDHH LKGQLPLD P+RP 
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        GYNSS DSV E FQLWRQTGESCPEGT+PIRRTT+QDILRASSVQ FG+KP KS++RDS+G+GHEHAVVFVNGE+YYGAKANINVWAPHVSDQYEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLM+WKDPRHGNWWLE GQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        GLLVGYWPA LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADH NCYDI+QGKNKLWG YF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

XP_008464544.1 PREDICTED: uncharacterized protein LOC103502395 [Cucumis melo]1.1e-21787.62Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVA FL  STS +L SAPPI  AENF+P    A  E+QK N+V+AYL NINKPP+K IQSPDGDLIDCV SHLQPAFDHH LKGQLPLD P+RP 
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        GYNSS DSV E FQLWRQTGESCPEGT+PIRRTT+QDILRASSVQ FG+KP KS++RDS+G+GHEHAVVFVNGE+YYGAKANINVWAPHVSDQYEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFN+DLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLM+WKDPRHGNWWLE GQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        GLLVGYWPA LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADH NCYDI+QGKNKLWG YF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

XP_022988380.1 uncharacterized protein LOC111485640 isoform X1 [Cucurbita maxima]1.8e-21587.14Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVALFL  ST     SAPPI AAENF PT    AEE+QK   ++AYL NINKPPVKTIQSPDGDLIDCV SHLQPAFDH+ LKGQLPLD P+RP+
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        G+NSS  SV E FQLWRQTGESCP+GT+PIRRTT+QDILRASSVQ FG+KP KS++RDSS +GHEHAVVFVNGE+YYGAKANINVWAPHVSDQYEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLM+WKDPRHGNWWLE GQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        GLLVGYWPA LFSHLGSHASMIQFGGEIVNTRSTG HTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADH NCYDI+QGKNK WGTYF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

XP_023529935.1 uncharacterized protein LOC111792632 isoform X1 [Cucurbita pepo subsp. pepo]2.4e-21586.89Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVALFL VSTSSVL +  P  AAE+FRP    AA EFQK + VEA+L NINKP VKTIQSPDGDLIDCV SHLQPAFDH+ LK QLPLD PDRP+
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        GYNSS DS+   FQLWRQ+GESCPEGTIPIRRTT+QDILRASSVQ FG+KP KS++RDSSG+GHEHAVVFVNGE+YYGAKANINVWAPHVSD+YEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAY+ TGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFD+GLM+WKDPRHGNWWLEFGQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        G+LVGYWPA LFSHLGSHASMIQFGGE+VNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNL LLADH NCYDI+QGKNKLWGTYF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

XP_038880687.1 uncharacterized protein LOC120072305 [Benincasa hispida]1.2e-21988.59Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  F+ALFL VSTS +L SAPPI AAENFRP    AAEE+QK N VEAYL N+NKPPVK IQSPDGDLIDCV SHLQPAFDHH LKGQ P D P+RP+
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        GYNSS DSV E FQLWRQTGESCPEGT+PIRRTT+QDILRASSVQ FG+KP KS++RDS+G+GHEHAVVFVNGE+YYGAKANINVWAPHVSDQYEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLM+WKDPRHGNWWLE GQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        GLLVGYWPA LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLAD  NCYDI+QGKNKLWG YF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

TrEMBL top hitse value%identityAlignment
A0A0A0LU89 Uncharacterized protein5.0e-21988.11Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVA FL  STS +L SAPPI  AENF+P    AA E+QK N+V+AYL NINKPP+K IQSPDGDLIDCV SHLQPAFDHH LKGQLPLD P+RP 
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        GYNSS DSV E FQLWRQTGESCPEGT+PIRRTT+QDILRASSVQ FG+KP KS++RDS+G+GHEHAVVFVNGE+YYGAKANINVWAPHVSDQYEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLM+WKDPRHGNWWLE GQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        GLLVGYWPA LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADH NCYDI+QGKNKLWG YF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

A0A1S3CN98 uncharacterized protein LOC1035023955.5e-21887.62Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVA FL  STS +L SAPPI  AENF+P    A  E+QK N+V+AYL NINKPP+K IQSPDGDLIDCV SHLQPAFDHH LKGQLPLD P+RP 
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        GYNSS DSV E FQLWRQTGESCPEGT+PIRRTT+QDILRASSVQ FG+KP KS++RDS+G+GHEHAVVFVNGE+YYGAKANINVWAPHVSDQYEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFN+DLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLM+WKDPRHGNWWLE GQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        GLLVGYWPA LFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADH NCYDI+QGKNKLWG YF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

A0A6J1EFF9 uncharacterized protein LOC1114328581.5e-21586.89Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVALFL VSTSSVL +  P  AAE+FRPTA     EFQK + VEA+L NINKP VK IQSPDGDLIDCV SHLQPAFDH+ LK QLPLD PDRP+
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        GYNSS DS+   FQLWRQ+GESCPEGTIPIRRTT+QDILRASSVQ FGKKP KS++RDSSG+GHEHAVVFVNGE+YYGAKANINVWAPHVSD+YEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAY+ TGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFD+GLM+WKDPRHGNWWLEFGQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        G+LVGYWPA LFSHLGSHASMIQFGGE+VNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNL LLADH NCYDI+QGKNKLWGTYF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

A0A6J1HMJ7 uncharacterized protein LOC111464977 isoform X11.3e-21486.65Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVALFL  ST     SAPPI AAE+F PT    AEE+QK   ++AYL NINKPPVKTIQSPDGDLIDCV SHLQPAFDH  L+GQLPLD P+RPE
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        G+NSS  SV E FQLWRQTGESCP+GT+PIRRTT+QDILRASSVQ FG+KP KS++RDSS +GHEHAVVFVNGE+YYGAKANINVWAPHVSDQYEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLM+WKDPRHGNWWLE GQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        GLLVGYWPA LFSHLGSHASMIQFGGEIVNTRSTG HTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLP+SNLHLLADH NCYDI+QG NKLWGTYF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

A0A6J1JJE3 uncharacterized protein LOC111485640 isoform X18.8e-21687.14Show/hide
Query:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE
        M+L  FVALFL  ST     SAPPI AAENF PT    AEE+QK   ++AYL NINKPPVKTIQSPDGDLIDCV SHLQPAFDH+ LKGQLPLD P+RP+
Subjt:  MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPE

Query:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ
        G+NSS  SV E FQLWRQTGESCP+GT+PIRRTT+QDILRASSVQ FG+KP KS++RDSS +GHEHAVVFVNGE+YYGAKANINVWAPHVSDQYEFSLSQ
Subjt:  GYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQ

Query:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ
        IW+ISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLM+WKDPRHGNWWLE GQ
Subjt:  IWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQ

Query:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF
        GLLVGYWPA LFSHLGSHASMIQFGGEIVNTRSTG HTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADH NCYDI+QGKNK WGTYF
Subjt:  GLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYF

Query:  YYGGPGRNVHCP
        YYGGPGRNVHCP
Subjt:  YYGGPGRNVHCP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10750.1 Protein of Unknown Function (DUF239)2.8e-16665.07Show/hide
Query:  VDFVALF-------LFVSTSSVLSSAPPIAAAENFRP--TAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLD
        +D + LF       LF+S   +LSS+     +EN  P    +   +E  K  ++  +L  INKP +KTI SPDGD+IDCV  H QPAFDH  L+GQ PLD
Subjt:  VDFVALF-------LFVSTSSVLSSAPPIAAAENFRP--TAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLD

Query:  LPDRPEGYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQY
         P+RP G+N       + FQLW   GE+CPEGT+PIRRT ++DILRA+SV +FGKK  +  +RD+S  GHEHAV +V+GEKYYGAKA+INVWAP V +QY
Subjt:  LPDRPEGYNSSTDSVVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQY

Query:  EFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNW
        EFSLSQIW+ISGSF NDLNTIEAGWQVSPELYGDNYPRFFTYWT DAYQATGCYNLLCSGFVQTN++IAIGAAISP S Y G QFD+ L++WKDP+HGNW
Subjt:  EFSLSQIWLISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNW

Query:  WLEFGQGLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNK
        WLEFG G+LVGYWP+ LF+HL  HASM+Q+GGEIVN+   G HTSTQMGSGHFAEEG+ K+SYFRN+Q++DWDN+L+P  NL +LADH NCYDI+ G N+
Subjt:  WLEFGQGLLVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNK

Query:  LWGTYFYYGGPGRNVHCP
         WG+YFYYGGPG+N  CP
Subjt:  LWGTYFYYGGPGRNVHCP

AT1G23340.1 Protein of Unknown Function (DUF239)6.6e-16365.84Show/hide
Query:  FLFVSTSSVLSS-APPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYNSSTDS
        F F+   S+ SS A P  +     P  +    E QK   +   L  INKP +KTI S DGD IDCV SH QPAFDH +L+GQ P+D P+ P GY+   +S
Subjt:  FLFVSTSSVLSS-APPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYNSSTDS

Query:  VVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIWLISGSF
          E FQLW   GESCPEGTIPIRRTT+QD+LRA+SV+ FG+K  + ++RDSS  GHEHAV +V+G +YYGAKA+INVW P V  QYEFSLSQIW+I+GSF
Subjt:  VVEEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIWLISGSF

Query:  NNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGLLVGYWP
          DLNTIEAGWQ+SPELYGD  PRFFTYWT+DAYQATGCYNLLCSGFVQTNN+IAIGAAISP S Y G QFD+ L++WKDP+HG+WWL+FG G LVGYWP
Subjt:  NNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGLLVGYWP

Query:  AVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYYGGPGRN
          LF+HL  H +M+QFGGEIVNTR  G HTSTQMGSGHFA EG+GKASYFRNLQ++DWDN+L+P+SNL +LADH NCYDI+ G N++WG +FYYGGPG+N
Subjt:  AVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYYGGPGRN

Query:  VHCP
          CP
Subjt:  VHCP

AT1G70550.1 Protein of Unknown Function (DUF239)1.0e-16868.42Show/hide
Query:  STSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYNSSTDSVVEEF
        S SS  SS+   AA +  RP      EE QK   +   L+ INKP VKTIQS DGD IDCV +H QPAFDH +L+GQ PLD P+ P+GY S  D   E  
Subjt:  STSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYNSSTDSVVEEF

Query:  QLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFNNDLN
        QLW  +GESCPEGTIPIRRTT+QD+LRASSVQ FG+K  + +KRDS+  GHEHAV +V G +YYGAKA+INVW+P V+ QYEFSLSQIW+I+GSF +DLN
Subjt:  QLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFNNDLN

Query:  TIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGLLVGYWPAVLFS
        TIEAGWQ+SPELYGD YPRFFTYWT+DAY+ TGCYNLLCSGFVQTN +IAIGAAISPRS Y G QFD+ L++WKDP+HG+WWL+FG G LVGYWPA LF+
Subjt:  TIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGLLVGYWPAVLFS

Query:  HLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYYGGPGRNVHCP
        HL  H SM+QFGGEIVN R  G HT+TQMGSGHFA EG+GKASYFRNLQI+DWDN+L+P SNL +LADH NCYDI+ G N++WG YFYYGGPG+N  CP
Subjt:  HLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYYGGPGRNVHCP

AT1G70550.2 Protein of Unknown Function (DUF239)1.0e-16868.42Show/hide
Query:  STSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYNSSTDSVVEEF
        S SS  SS+   AA +  RP      EE QK   +   L+ INKP VKTIQS DGD IDCV +H QPAFDH +L+GQ PLD P+ P+GY S  D   E  
Subjt:  STSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYNSSTDSVVEEF

Query:  QLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFNNDLN
        QLW  +GESCPEGTIPIRRTT+QD+LRASSVQ FG+K  + +KRDS+  GHEHAV +V G +YYGAKA+INVW+P V+ QYEFSLSQIW+I+GSF +DLN
Subjt:  QLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFNNDLN

Query:  TIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGLLVGYWPAVLFS
        TIEAGWQ+SPELYGD YPRFFTYWT+DAY+ TGCYNLLCSGFVQTN +IAIGAAISPRS Y G QFD+ L++WKDP+HG+WWL+FG G LVGYWPA LF+
Subjt:  TIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGLLVGYWPAVLFS

Query:  HLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYYGGPGRNVHCP
        HL  H SM+QFGGEIVN R  G HT+TQMGSGHFA EG+GKASYFRNLQI+DWDN+L+P SNL +LADH NCYDI+ G N++WG YFYYGGPG+N  CP
Subjt:  HLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYYGGPGRNVHCP

AT5G50150.1 Protein of Unknown Function (DUF239)1.3e-17971.95Show/hide
Query:  FVALFLFVS--TSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYN
        F+ L L +S     +L SA  +    +FRP       E QK   VEAYL+ INKP +KTI SPDGD+I+CV SHLQPAFDH  L+GQ PLD P RP   N
Subjt:  FVALFLFVS--TSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYN

Query:  SSTDSVVEEF-QLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIW
         +T    E F QLW  +GESCP G+IPIR+TT+ D+LRA+SV+ FG+K  + ++RDSSG GHEHAVVFVNGE+YYGAKA+INVWAP V+D YEFSLSQIW
Subjt:  SSTDSVVEEF-QLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIW

Query:  LISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGL
        LISGSF +DLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRS YNGRQFD+GLM+WKDP+HG+WWLE G GL
Subjt:  LISGSFNNDLNTIEAGWQVSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGL

Query:  LVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYY
        LVGYWPA LFSHL SHASM+QFGGE+VN+RS+G HT TQMGSGHFA+EG+ KA+YFRNLQ++DWDN+LLP+ NLH+LADH  CYDI+QGKN +WGTYFYY
Subjt:  LVGYWPAVLFSHLGSHASMIQFGGEIVNTRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYY

Query:  GGPGRNVHCP
        GGPGRN  CP
Subjt:  GGPGRNVHCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATTAGTGGATTTTGTTGCTTTGTTTCTCTTCGTTTCAACCTCCTCTGTTCTGTCTTCGGCGCCGCCCATTGCGGCGGCGGAGAATTTCAGACCCACCGCCGTCGC
CGCCGCTGAAGAGTTTCAGAAATTTAACAGCGTTGAAGCCTACTTGAACAACATTAATAAACCCCCCGTCAAAACAATTCAGAGCCCAGATGGGGATTTAATCGATTGTG
TGTTTTCTCATCTTCAACCTGCTTTTGATCACCATATGCTCAAAGGGCAGCTGCCATTGGATCTCCCAGATAGGCCAGAAGGTTACAACTCATCAACCGATTCAGTTGTG
GAAGAATTCCAGCTATGGCGGCAGACTGGTGAATCTTGCCCTGAAGGAACTATTCCCATTAGAAGAACTACACAACAAGACATTTTAAGAGCAAGCTCTGTTCAAACTTT
TGGGAAAAAGCCATTCAAATCTCTCAAAAGGGACTCATCAGGAACTGGCCATGAGCATGCTGTTGTATTTGTTAATGGAGAAAAATACTATGGAGCAAAAGCAAACATTA
ATGTTTGGGCACCCCATGTAAGTGATCAATATGAGTTCAGCTTGTCTCAAATTTGGCTTATTTCTGGGTCATTCAACAATGATCTAAACACCATTGAAGCTGGTTGGCAG
GTTAGTCCTGAATTATATGGAGATAATTATCCAAGATTCTTCACATATTGGACGACAGATGCATACCAAGCAACAGGCTGCTACAACTTACTTTGCTCTGGTTTTGTCCA
AACTAATAACAAAATTGCCATAGGAGCTGCAATATCTCCAAGATCTTATTACAATGGCAGACAATTTGATGTGGGTTTAATGGTTTGGAAGGATCCGAGACACGGAAACT
GGTGGCTGGAATTTGGGCAGGGTCTGTTGGTAGGGTACTGGCCAGCAGTTTTGTTCAGTCACTTGGGAAGCCATGCCAGCATGATCCAGTTTGGAGGAGAAATAGTGAAC
ACAAGATCAACAGGGTTCCACACATCAACACAAATGGGAAGTGGCCATTTTGCAGAAGAAGGCTATGGAAAAGCTTCTTATTTCAGAAATCTCCAAATAATTGATTGGGA
CAACAGTTTGCTTCCTGTCTCAAACCTTCATCTATTGGCTGATCATTCAAATTGTTATGACATAAAACAAGGAAAAAACAAGCTTTGGGGCACTTATTTTTACTATGGAG
GTCCAGGCAGAAATGTACATTGTCCATAA
mRNA sequenceShow/hide mRNA sequence
AAGAAAAAGGAAAGAAAGTGTACTGCAACAACTGTATCAGACATGAACCAGTTTCTGCAGGTGAAATTAACAGAACATTAAACAAGTTTCAGGCCACGTTTGTCTCATTG
ATTCCCCTCTCTGTTTCAACCATCGCCAAAATTCTCCAAAACTAAAATCCCAAAACATAAAATCATAATCAAAACCCACCTCAATTTCATCCATCTTCTTCACAAATCAA
AACCCAAATCACCAAATCAAACGCCCATTTCTTCTTCTATCTTTCTTTTGGAAATCAAAAAATCGAAACTTCCCCCACTATATCCGGATAACCATGTTCGTCCCTAGGCC
AGACACTAGGAGACATCGAAGAACTGGAGACGCTGAAATTGTCTACAAATTAATGTCATAGTCGAGTCTCGAGAGCACACTTTCAAAGTCCAAGCTTTCGACCATGGTAT
CACCGGCGACTTCTTCTTTCTCTTTTCCATCCCTGCAAAAAAGCAAAAGAAAATAACCCATAATTTCTTTCATCACTTTCGCAACCCTTTTGACCACTTTCCTATTCATA
AATACCCATTTTGCAGTTCTCAGTCTGTCAGAAAGCCCGCCATTAACGAAGCTTAATTTCTATTCGTTAAAGAGTTGAAAAAAAAAAATCATGTGGGTTTTTTATTCCGA
AGAGAATGGTTTCATTTGTTCTGTTTAGTGAGAATGTGCAGAGGAATCGAAGCAGAGGAGGAAGAAACCAAGTCTCTGTGATTTTTTCTTTTCATTTTTTTCCATAATAC
GACAATCCCAATACGTACAAACACACGTATTTCTTTTTATCTTTATTTTTCTCGATGATATTAGTGGATTTTGTTGCTTTGTTTCTCTTCGTTTCAACCTCCTCTGTTCT
GTCTTCGGCGCCGCCCATTGCGGCGGCGGAGAATTTCAGACCCACCGCCGTCGCCGCCGCTGAAGAGTTTCAGAAATTTAACAGCGTTGAAGCCTACTTGAACAACATTA
ATAAACCCCCCGTCAAAACAATTCAGAGCCCAGATGGGGATTTAATCGATTGTGTGTTTTCTCATCTTCAACCTGCTTTTGATCACCATATGCTCAAAGGGCAGCTGCCA
TTGGATCTCCCAGATAGGCCAGAAGGTTACAACTCATCAACCGATTCAGTTGTGGAAGAATTCCAGCTATGGCGGCAGACTGGTGAATCTTGCCCTGAAGGAACTATTCC
CATTAGAAGAACTACACAACAAGACATTTTAAGAGCAAGCTCTGTTCAAACTTTTGGGAAAAAGCCATTCAAATCTCTCAAAAGGGACTCATCAGGAACTGGCCATGAGC
ATGCTGTTGTATTTGTTAATGGAGAAAAATACTATGGAGCAAAAGCAAACATTAATGTTTGGGCACCCCATGTAAGTGATCAATATGAGTTCAGCTTGTCTCAAATTTGG
CTTATTTCTGGGTCATTCAACAATGATCTAAACACCATTGAAGCTGGTTGGCAGGTTAGTCCTGAATTATATGGAGATAATTATCCAAGATTCTTCACATATTGGACGAC
AGATGCATACCAAGCAACAGGCTGCTACAACTTACTTTGCTCTGGTTTTGTCCAAACTAATAACAAAATTGCCATAGGAGCTGCAATATCTCCAAGATCTTATTACAATG
GCAGACAATTTGATGTGGGTTTAATGGTTTGGAAGGATCCGAGACACGGAAACTGGTGGCTGGAATTTGGGCAGGGTCTGTTGGTAGGGTACTGGCCAGCAGTTTTGTTC
AGTCACTTGGGAAGCCATGCCAGCATGATCCAGTTTGGAGGAGAAATAGTGAACACAAGATCAACAGGGTTCCACACATCAACACAAATGGGAAGTGGCCATTTTGCAGA
AGAAGGCTATGGAAAAGCTTCTTATTTCAGAAATCTCCAAATAATTGATTGGGACAACAGTTTGCTTCCTGTCTCAAACCTTCATCTATTGGCTGATCATTCAAATTGTT
ATGACATAAAACAAGGAAAAAACAAGCTTTGGGGCACTTATTTTTACTATGGAGGTCCAGGCAGAAATGTACATTGTCCATAAGGGCATAGAAGTAAATTTCATTTCTTT
TTCTTTTTTAATTTTAAATTTTTGGGGGTTTTGTTTCTTACCCACTTTCTTTATTGGCCTTAGAATGATTTGGGTTGCTTTTATGATTATAATTTTGATTTTAGTTTTTT
TAAATCCCTATGTGGCCCCATTTTGGGATATTGTAAATTTTGAAGATAATAGGATGTATTCAAAATTATGTGCGTGGG
Protein sequenceShow/hide protein sequence
MILVDFVALFLFVSTSSVLSSAPPIAAAENFRPTAVAAAEEFQKFNSVEAYLNNINKPPVKTIQSPDGDLIDCVFSHLQPAFDHHMLKGQLPLDLPDRPEGYNSSTDSVV
EEFQLWRQTGESCPEGTIPIRRTTQQDILRASSVQTFGKKPFKSLKRDSSGTGHEHAVVFVNGEKYYGAKANINVWAPHVSDQYEFSLSQIWLISGSFNNDLNTIEAGWQ
VSPELYGDNYPRFFTYWTTDAYQATGCYNLLCSGFVQTNNKIAIGAAISPRSYYNGRQFDVGLMVWKDPRHGNWWLEFGQGLLVGYWPAVLFSHLGSHASMIQFGGEIVN
TRSTGFHTSTQMGSGHFAEEGYGKASYFRNLQIIDWDNSLLPVSNLHLLADHSNCYDIKQGKNKLWGTYFYYGGPGRNVHCP