CuGenDBv2

PI0009760 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0009760
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: chr07:24714958..24720043
RNA-Seq Expression: PI0009760
Synteny: PI0009760
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
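The genome location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the `chrom:start..end` string uses 1-based inclusive coordinates (an assumption; the page does not state its convention). The helper name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'chrom:start..end' into (chrom, start, end, span_length).

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return chrom, start, end, end - start + 1


# Location string taken from the record above.
chrom, start, end, span = parse_location("chr07:24714958..24720043")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 5086 bp on chr07.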


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064685.1 uncharacterized protein E6C27_scaffold255G004100 [Cucumis melo var. makuwa] (e value: 3.3e-253; % identity: 95.76)
Query:  GSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
        G E N       GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
Subjt:  GSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA

Query:  NSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
        NSPGGPSST+YATGPYAH+ QPVSPPVFSAFTTEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
Subjt:  NSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS

Query:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
        PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
Subjt:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN

Query:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPAL
        RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS+HTTLQ+QRSIKSAPEVV KETCTEVPAL
Subjt:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPAL

Query:  CSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        C+GYKDNKLQRQPGDI GSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  CSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

TYK19905.1 uncharacterized protein E5676_scaffold134G00170 [Cucumis melo var. makuwa] (e value: 1.5e-253; % identity: 97.61)
Query:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMY
        RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST+Y
Subjt:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMY

Query:  ATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAH+ QPVSPPVFSAFTTEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
        SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPALCSGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS+HTTLQ+QRSIKSAPEVV KETCTEVPALC+GYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPALCSGYKDNKLQR

Query:  QPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        QPGDI GSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  QPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_004150969.1 uncharacterized protein At1g76660 [Cucumis sativus] (e value: 1.1e-256; % identity: 96.83)
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMYATGPYAHD Q VSPPVFSAF TEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS+HTTLQSQRSIKSAPE    ETCTE+PA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA

Query:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LC+GYKDNKLQRQPGDISGSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_008453041.1 PREDICTED: uncharacterized protein At1g76660 [Cucumis melo] (e value: 5.6e-261; % identity: 97.67)
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST+YATGPYAH+ QPVSPPVFSAFTTEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS+HTTLQ+QRSIKSAPEVV KETCTEVPA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA

Query:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LC+GYKDNKLQRQPGDI GSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_038899313.1 uncharacterized protein At1g76660 [Benincasa hispida] (e value: 2.1e-244; % identity: 93.21)
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+ Q VSPPVFSAFTTEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SS DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGSGRLFGNEKA TSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSD YSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA
        NRH+KSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTST+LSAEES EPPLLGEKLKSTHTTLQSQRSIKSAPEVV KETCTEV A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA

Query:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSW
        LC+GYKDNKLQRQPG++SGSSTS+QVEKD+FSRIGS KNSRKY+LGLS SDAEVDYRRGRSLREAKG+ SW
Subjt:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L1G3 Uncharacterized protein (e value: 5.3e-257; % identity: 96.83)
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMYATGPYAHD Q VSPPVFSAF TEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS+HTTLQSQRSIKSAPE    ETCTE+PA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA

Query:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LC+GYKDNKLQRQPGDISGSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A1S3BV86 uncharacterized protein At1g76660 (e value: 2.7e-261; % identity: 97.67)
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST+YATGPYAH+ QPVSPPVFSAFTTEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS+HTTLQ+QRSIKSAPEVV KETCTEVPA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA

Query:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LC+GYKDNKLQRQPGDI GSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A5A7VFM0 Uncharacterized protein (e value: 1.6e-253; % identity: 95.76)
Query:  GSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
        G E N       GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
Subjt:  GSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA

Query:  NSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
        NSPGGPSST+YATGPYAH+ QPVSPPVFSAFTTEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
Subjt:  NSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS

Query:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
        PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
Subjt:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN

Query:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPAL
        RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS+HTTLQ+QRSIKSAPEVV KETCTEVPAL
Subjt:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPAL

Query:  CSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        C+GYKDNKLQRQPGDI GSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  CSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A5D3D8J8 Uncharacterized protein (e value: 7.2e-254; % identity: 97.61)
Query:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMY
        RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST+Y
Subjt:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMY

Query:  ATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAH+ QPVSPPVFSAFTTEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
        SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPALCSGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKS+HTTLQ+QRSIKSAPEVV KETCTEVPALC+GYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPALCSGYKDNKLQR

Query:  QPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        QPGDI GSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  QPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A6J1BVS7 uncharacterized protein At1g76660 (e value: 3.1e-233; % identity: 89.68)
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+ Q VSPPVFSAFTTEPSTAP+TPPPELAHLTTPSSPDVPFAQFL+SS DLKGTGK NYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN S+S QDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKST TT+QSQRS+K A +VV KETC EV  
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPA

Query:  LCSGYKDNKLQRQPGDISGSSTS-DQVE-KDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LC+G +DNKLQRQPG++SGSS+S +QVE +DVFSRI   KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  LCSGYKDNKLQRQPGDISGSSTS-DQVE-KDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

SwissProt top hits (e value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660 (e value: 2.1e-125; % identity: 58.02)
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL
        MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL

Query:  SLSANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+MYATGPYAH+ Q VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL SS DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS
        +L SPISR SGD L S                 Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD + + P  GGRLSVSKDSDVY  + 
Subjt:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS

Query:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL      L SQ S KS  ++  +  
Subjt:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKET

Query:  CTEVPALCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN
          + P   + YKD+K + +             E+ + SR+GS K SR Y   +S SDAEV+YRRGRSLRE++ N
Subjt:  CTEVPALCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN

Arabidopsis top hits (e value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) (e value: 3.0e-26; % identity: 40.1)
Query:  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMYAT
        ++W   W  L CF S +  KRI  +  +PE   V+   +    +    ++ + T   +APPSSPASF  S  PS  QSP   LS S   P     +++A 
Subjt:  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMYAT

Query:  GPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHL----TTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDC
        GPYAH+ Q VSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ   S+      G    ++S+     Y L PGSP   L+SP   + G  
Subjt:  GPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHL----TTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDC

Query:  LSSSFPE
         +S FP+
Subjt:  LSSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown (e value: 1.5e-126; % identity: 58.02)
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL
        MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL

Query:  SLSANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+MYATGPYAH+ Q VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL SS DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS
        +L SPISR SGD L S                 Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD + + P  GGRLSVSKDSDVY  + 
Subjt:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS

Query:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL      L SQ S KS  ++  +  
Subjt:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKET

Query:  CTEVPALCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN
          + P   + YKD+K + +             E+ + SR+GS K SR Y   +S SDAEV+YRRGRSLRE++ N
Subjt:  CTEVPALCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN

AT4G25620.1 hydroxyproline-rich glycoprotein family protein (e value: 1.7e-26; % identity: 34.19)
Query:  SEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFLSLS
        S ++R       K+ G  W    CF S+K  KRI  A  +PE    +     P     +N  ++  P  +APPSSPASF  S  PS      P    SL+
Subjt:  SEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASS-----EDLKGTGKANYIASNDLQAAYSLYPGSP
         N P  PS+  +  GPYAH+ QPV+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L SS      +  G     + A++    +  +YPGSP
Subjt:  ANSPGGPSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASS-----EDLKGTGKANYIASNDLQAAYSLYPGSP

Query:  ASSLVSPISRTS----GDCL--------------SSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFP
          +L+SP S TS    G C                  F  R +  ++ S +    G+  R GSG L  +   G+ L S         T  +    N T P
Subjt:  ASSLVSPISRTS----GDCL--------------SSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFP

Query:  HTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEI
          G  L  S+ S+V S   +     H  S   D   +  +R SF  + +++
Subjt:  HTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEI

AT5G52430.1 hydroxyproline-rich glycoprotein family protein (e value: 3.1e-31; % identity: 45.62)
Query:  PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGG
        P   +  RWG CW   SCF +QK  KRI  A  +PE   VT+              TV+ P  +APPSSPASF  S   S   SP   LSL++N  SP  
Subjt:  PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGG

Query:  PSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELA-HLTTPSSPDVPFAQFLASSEDL----KGTGKANYIASNDLQ-AAYSLYPGSP-ASSL
        P S ++  GPYA++ QPV+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L SS +L      +G     +S+  +  +  + PGSP   +L
Subjt:  PSSTMYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELA-HLTTPSSPDVPFAQFLASSEDL----KGTGKANYIASNDLQ-AAYSLYPGSP-ASSL

Query:  VSPISRTSGDCLSSSFP
        +SP S  S    SS +P
Subjt:  VSPISRTSGDCLSSSFP
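The alignment blocks above appear to follow the usual BLAST pairwise layout, in which the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. Assuming that convention holds here, the sketch below tallies those columns for a single block. The function name `midline_stats` is hypothetical, and the example midline is copied from the final block of the AT1G63720.1 hit with its leading indentation removed.

```python
def midline_stats(midline: str) -> dict[str, int]:
    """Count column types in a BLAST-style midline.

    Letters are treated as identities, '+' as conservative substitutions,
    and everything else (spaces) as mismatches or gaps.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "columns": len(midline),
        "identical": identical,
        "positives": identical + conservative,
    }


# Final block of the AT1G63720.1 hit: query segment and its midline.
query = "LSSSFPE"
midline = " +S FP+"
assert len(query) == len(midline)  # one midline character per alignment column

stats = midline_stats(midline)
print(stats)
```

The percent identity reported for each hit covers the whole alignment, so a per-block tally like this one only reproduces that figure once the counts from all blocks of a hit are summed.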


Sequences
CDS sequence
ATGGGGTCCGAGCAGAACAGATTTCCTCAGCAGGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTGTCTTGTTTTCACTCTCAGAAAGGAGAGAAGCGCATAGT
ACCTGCATCACGTTTACCTGAGGGCAATGTTGTGACAACCCAGCCCAATGGACCTCAAGCTGCGGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCC
CACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCTTGTCGTTATCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTATGCTACTGGGCCATACGCACATGATTTACAACCGGTTTCTCCTCCTGTTTTTTCAGCCTTCACCACTGAACCGTCAACTGCTCCAGTCACCCCCCCACCTGAACT
AGCCCACCTAACCACACCTTCTTCTCCGGATGTACCGTTTGCTCAGTTCCTTGCCTCATCAGAGGATCTGAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATC
TTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTTGTGTCACCAATTTCAAGAACCTCTGGCGATTGTTTATCATCTTCATTTCCTGAGAGGGACTTC
CGACCACAGTGGAATTCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGTTATTTGGAAATGAGAAAGCCGGTACATCTTTGGCATCTCAGGA
TTCCAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATACAACATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTCTACTCGT
CTTGTGGGAACGGATACCAGAACAGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCGTTTGGTTTCAGTGCAGATGAAATTATAACT
ACTACACAATACGTGGAGATATCTGATGTAATGGAGGATTCCTTTACCATGAGACCTTTTACCTCAACTAGTCTGTCAGCAGAAGAAAGTACTGAACCTCCACTGTTGGG
TGAAAAACTAAAATCCACGCATACAACTTTACAAAGTCAGAGAAGTATTAAATCAGCCCCTGAGGTTGTCGTAAAGGAAACCTGCACTGAAGTGCCGGCATTATGCAGTG
GTTATAAAGACAATAAATTGCAAAGACAACCTGGTGACATATCAGGATCAAGTACTTCAGACCAAGTTGAAAAAGACGTATTCTCAAGGATAGGGTCATCCAAAAATAGT
CGCAAGTATGATCTTGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGAAGCCAAGGGAAATGGTTCATGGCACGACTAA
mRNA sequence
CTTGAAAGAAAAGTAAACCATAATTCTATCCTTAATTGAGGTATACGGAGAAGCCTTACCGGTGGTGTTTTGTATGAGTTTTTTTTTAAGATCTGGAGAAGAATTAAAGA
ACAGTACTTCAAAGTGTTGAGTGAAATGAACCGAACTGGTTGGGCTTCTCACAGAGTTTCAGATTGAACCGAGAGGCATCGATTCGGTTATCAGTTTGAACTGAACCGAC
CCCATGGTAGGTGTTGCTTTTGGCTTTAGGGTTTCTCCAATTTAGTATCTCTCCTTCTACTGTCAGCCAGCGGCCTCTCTCTTCTCTGTCTTTCGCATTTGCTGACTCAA
TCACACTCCGCACCCTCGCTCATCCGCCTTTGCCGTCGATTACTCAGCCACCGTTGCTTTCGACTACGCAGGCACCGTCTCCGTCGATTTCTTCCGCCCTCATTTCCTAT
CGGAAATTAAAGTATATTGGAGTGTGTTTTTCTCTTTCTTTGTTAGCTACTCTTTTCAGCGTCGAGAGACCAAACATATATGCTGTCAGATTTTGCTCTGTGAATGGAGA
AATGTCAAGGTGGATATAGCCTGACTCGCATTAGTCCTTAAATGGATGAAGTTGGCGGAGGAAGAGGAGGAGAAGGAGGGCCTTGAGCTTGTAGAAAGAGCCAGCTTAAG
ACACTAAAAGATAAGTAGAGACTTCGTTGAGCTACTGGTTACGAATGGGGTCCGAGCAGAACAGATTTCCTCAGCAGGAACGGGGAAAGAGATGGGGTGGATGTTGGGGT
GCATTGTCTTGTTTTCACTCTCAGAAAGGAGAGAAGCGCATAGTACCTGCATCACGTTTACCTGAGGGCAATGTTGTGACAACCCAGCCCAATGGACCTCAAGCTGCGGG
AATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTT
TCTTGTCGTTATCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTATGCTACTGGGCCATACGCACATGATTTACAACCGGTTTCTCCTCCTGTTTTTTCAGCCTTC
ACCACTGAACCGTCAACTGCTCCAGTCACCCCCCCACCTGAACTAGCCCACCTAACCACACCTTCTTCTCCGGATGTACCGTTTGCTCAGTTCCTTGCCTCATCAGAGGA
TCTGAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTTGTGTCACCAATTTCAAGAA
CCTCTGGCGATTGTTTATCATCTTCATTTCCTGAGAGGGACTTCCGACCACAGTGGAATTCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGG
TTATTTGGAAATGAGAAAGCCGGTACATCTTTGGCATCTCAGGATTCCAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATACAACATTCCCTCATAC
TGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTCTACTCGTCTTGTGGGAACGGATACCAGAACAGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAG
CTTACCGAGCATCGTTTGGTTTCAGTGCAGATGAAATTATAACTACTACACAATACGTGGAGATATCTGATGTAATGGAGGATTCCTTTACCATGAGACCTTTTACCTCA
ACTAGTCTGTCAGCAGAAGAAAGTACTGAACCTCCACTGTTGGGTGAAAAACTAAAATCCACGCATACAACTTTACAAAGTCAGAGAAGTATTAAATCAGCCCCTGAGGT
TGTCGTAAAGGAAACCTGCACTGAAGTGCCGGCATTATGCAGTGGTTATAAAGACAATAAATTGCAAAGACAACCTGGTGACATATCAGGATCAAGTACTTCAGACCAAG
TTGAAAAAGACGTATTCTCAAGGATAGGGTCATCCAAAAATAGTCGCAAGTATGATCTTGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTA
AGGGAAGCCAAGGGAAATGGTTCATGGCACGACTAACACAACCTCTCTGGAATAATTTGCAGTATGTTTGTATCTGTTTTTCTGCTTTGCAAGGTTTCCATGGAATGTCT
AACGTATGATCTAACCTGTTGTCTCCCTGAGTTATGACTTGGCCACGGGATGGAAGAACGATTATAATTTGTGTTCTTTATCGTGCCAATTTAGATATGGAATGGGTCAG
ATTTCTGTATTTGGTAGACAAAGTTGACTTTTTTCGATGAATTGTATACGTTGTATAATCTGCTTGCCTATTCTGTGCCCCAATGTGTATGTCTTGCATTTCTAGAATTT
GGCATGTCATCATGTTCCTATCAAGAGCAACCTTATTATCCGATCCAATCCATGGGTGTTAACTTTTGTTGTAAAGAGCCTTCCAGACAGAATTTTTACCTTTTTGGGGA
TTTTAATCTTTTACAGAACTAATTGGAACAACAGGATGAGGGAGGGATTACTAAATTTACAGGAAAAGGATTTGCACATGAACCCTTCTGAAGGATTAGTAATCCAAGTG
CGAAGATCCTCCCTAGACAAACAGAAATTCCCTAGTGATGATAAACGGGTCAAAATATCTGTTGTTTCTCGTGGAATAATTGCCTTCGAAATCCAAGGTCAAGGTAGAAT
CAAAACCTGGGGAAGAAAGGACGAAA
Protein sequence
MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST
MYATGPYAHDLQPVSPPVFSAFTTEPSTAPVTPPPELAHLTTPSSPDVPFAQFLASSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF
RPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIIT
TTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSTHTTLQSQRSIKSAPEVVVKETCTEVPALCSGYKDNKLQRQPGDISGSSTSDQVEKDVFSRIGSSKNS
RKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
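
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly, and uses a hypothetical helper named `translate`. It is exercised only on the first 30 and last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored. Codons with ambiguous bases
    (for example N) raise KeyError in this simple sketch.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt and last 18 nt of the CDS sequence listed above.
print(translate("ATGGGGTCCGAGCAGAACAGATTTCCTCAG"))  # MGSEQNRFPQ
print(translate("GGTTCATGGCACGACTAA"))              # GSWHD
```

Both fragments agree with the start (`MGSEQNRFPQ`) and end (`GSWHD`) of the protein sequence shown above. Passing the full CDS, with its line breaks, to `translate` would extend the same check to every residue.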