CuGenDBv2

CmoCh01G021140 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh01G021140
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: Cmo_Chr01:14614189..14626416
RNA-Seq Expression: CmoCh01G021140
Synteny: CmoCh01G021140
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
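
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so), the string can be parsed and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end).

    The pattern is inferred from the location shown on this page.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("Cmo_Chr01:14614189..14626416")
print(chrom, span_length(start, end))  # Cmo_Chr01 12228
```

Under that assumption the gene spans 12,228 bp on Cmo_Chr01. If the coordinates were 0-based or half-open, the result would differ by one.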


Homology
GenBank top hits    e value    %identity    Alignment
KAG6608673.1 hypothetical protein SDJN03_02015, partial [Cucurbita argyrosperma subsp. sororia]    2.0e-258    98.72
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSL AEESIQPPLVGEKLKSTQATLQSQRSIKSAS+ VEKETCSEVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA

Query:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKG
        LCNGCK+DKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGE  G
Subjt:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKG

KAG7037989.1 hypothetical protein SDJN02_01622, partial [Cucurbita argyrosperma subsp. argyrosperma]    4.4e-258    99.35
Query:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMF
        +GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMF
Subjt:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSSTMF

Query:  ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVE
        SSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLALCNGCKDDKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSL AEESIQPPLVGEKLKSTQATLQSQRSIKSASD VEKETCSEVLALCNGCKDDKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLALCNGCKDDKLQR

Query:  QPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        QPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Subjt:  QPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

XP_022940824.1 uncharacterized protein At1g76660 [Cucurbita moschata]    8.1e-268    100
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLAL
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLAL
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLAL

Query:  CNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        CNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Subjt:  CNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

XP_022981333.1 uncharacterized protein At1g76660-like [Cucurbita maxima]    1.0e-262    98.73
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEK KSTQATL SQRSIKSASD VEKETCSEVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA

Query:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        LCNGCKD+KLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Subjt:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

XP_023524439.1 uncharacterized protein At1g76660 [Cucurbita pepo subsp. pepo]    1.7e-265    99.37
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD VEKETCSEVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA

Query:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Subjt:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

TrEMBL top hits    e value    %identity    Alignment
A0A0A0L1G3 Uncharacterized protein    4.2e-230    88.16
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGP AAG+ NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGSGRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVY+S GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLAL
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES +PPL+GEKLKS+  TLQSQRSIKSA +   ETC+E+ AL
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLAL

Query:  CNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        CNG KD+KLQRQPG++ GSSTS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+  WHD
Subjt:  CNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

A0A1S3BV86 uncharacterized protein At1g76660    4.8e-234    89.45
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGP AAG+ NQATVI PSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST++ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGSGRLFG+EK GT LASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVY+S GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES +PPL+GEKLKS+  TLQ+QRSIKSA + VEKETC+EV A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA

Query:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        LCNG KD+KLQRQPG++ GSSTS    +D+FSRIGSSKNSRKY+  LSCSDAEVDYRRGRSLR E KG+  WHD
Subjt:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

A0A6J1BVS7 uncharacterized protein At1g76660    1.6e-240    90.97
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNR PQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGP AAG+ NQATVIAPSLLAPPSSPASFTNSALPST QSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGK NY+ASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPS+S QDGKYPRSGSGRLFGHEKTGT LASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVY+SGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESI+PPL+GEKLKST+ T+QSQRS+K ASD VEKETC+EVL 
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA

Query:  LCNGCKDDKLQRQPGNLPGSSTS--QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        LCNGC+D+KLQRQPGN+ GSS+S  Q ETED+FSRI   KNSRKYN  LSCSDAEVDYRRGRSLR EVKGDF WHD
Subjt:  LCNGCKDDKLQRQPGNLPGSSTS--QGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

A0A6J1FRS7 uncharacterized protein At1g76660    3.9e-268    100
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLAL
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLAL
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLAL

Query:  CNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        CNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Subjt:  CNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

A0A6J1IZ74 uncharacterized protein At1g76660-like    5.0e-263    98.73
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
        MGSEQNRIPQQERGKRWGGCWGALSCF SQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLS

Query:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYVASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
        SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEK KSTQATL SQRSIKSASD VEKETCSEVLA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASD-VEKETCSEVLA

Query:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
        LCNGCKD+KLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
Subjt:  LCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD

SwissProt top hits    e value    %identity    Alignment
Q9SRE5 Uncharacterized protein At1g76660    1.5e-128    59.19
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPHAAGIANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL
        MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG H AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPHAAGIANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SSMDLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASG-
        +L SPISR SGD L S                 Q+GK  RS SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Subjt:  SLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASG-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETC
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +A L SQ S KS +D++ +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETC

Query:  SEVLALCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR
               +    D  QR        +    + E L SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Subjt:  SEVLALCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR

Arabidopsis top hits    e value    %identity    Alignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)    2.7e-27    39.73
Query:  MGSEQNRIPQQ---ERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL
        + S  +R+ Q     + ++W   W  L CF S +  KRI  +  +PE   +++  +    +G  +  T +    +APPSSPASF  S  PS  QSP   L
Subjt:  MGSEQNRIPQQ---ERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPG
        S S   P     ++FA GPYAHETQ VSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PG
Subjt:  SLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPG

Query:  SPASSLVSPISRTSGDCLSSSFPE
        SP   L+SP   + G   +S FP+
Subjt:  SPASSLVSPISRTSGDCLSSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown    1.1e-129    59.19
Query:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPHAAGIANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL
        MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG H AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPHAAGIANQ--ATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+M+ATGPYAHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SSMDLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASG-
        +L SPISR SGD L S                 Q+GK  RS SG  FG++  G     Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY +  
Subjt:  SLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYASG-

Query:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETC
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL   +A L SQ S KS +D++ +  
Subjt:  -GNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETC

Query:  SEVLALCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR
               +    D  QR        +    + E L SR+GS K SR Y+  +S SDAEV+YRRGRSLR
Subjt:  SEVLALCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSRKYNHALSCSDAEVDYRRGRSLR

AT4G25620.1 hydroxyproline-rich glycoprotein family protein    2.9e-29    44.19
Query:  SEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIAN---QATVIAPSLLAPPSSPASFTNSALPSTAQS--PSCFL
        S ++R       K+ G  W    CF S+K  KRI  A  +PE        +G   A + N    +T I    +APPSSPASF  S  PS + +  P    
Subjt:  SEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIAN---QATVIAPSLLAPPSSPASFTNSALPSTAQS--PSCFL

Query:  SLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLK-----GTGKANYVASNDLQAAYSLYP
        SL+ N P  PS+  F  GPYAHETQPV+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++    +  +YP
Subjt:  SLSANSPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLK-----GTGKANYVASNDLQAAYSLYP

Query:  GSPASSLVSPISRTS
        GSP  +L+SP S TS
Subjt:  GSPASSLVSPISRTS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein    3.9e-34    42.69
Query:  PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE----GNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSAN--
        P   +  RWG CW   SCF +QK  KRI  A  +PE    G  V T  N       A   TV+ P  +APPSSPASF  S   S + SP   LSL++N  
Subjt:  PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE----GNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSAN--

Query:  SPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAY-----SLYPGSP-
        SP  P S +F  GPYA+ETQPV+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      + +      + Y      + PGSP 
Subjt:  SPGGPSSTMFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAY-----SLYPGSP-

Query:  ASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLAS
          +L+SP S  S    SS +P +      +P    + G+ P+      F   K G+   S
Subjt:  ASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLAS
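
In the alignments above, the unlabeled middle line of each block appears to follow the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption, the helper below (a hypothetical `midline_stats`, not part of any BLAST distribution) tallies those categories for one midline segment. The sample string is the first ten midline characters of the last block of the KAG6608673.1 alignment.

```python
def midline_stats(midline: str) -> dict[str, float]:
    """Summarize a BLAST-style midline segment.

    Assumed convention: letter = identical residue, '+' = similar
    residue, space = mismatch or gap. Do not strip the string first,
    because trailing spaces count as alignment columns.
    """
    length = len(midline)
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    pct_identity = round(100.0 * identical / length, 2) if length else 0.0
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "pct_identity": pct_identity,
    }


print(midline_stats("LCNGCK+DKL"))
# {'length': 10, 'identical': 9, 'positives': 10, 'pct_identity': 90.0}
```

The %identity values listed with each hit are reported for the whole alignment by the search tool, so a per-segment figure like this one is only a local check and will not generally match them.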


Sequences
CDS sequence
ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGT
GCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCC
CACCTTCTTCACCAGCATCATTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACT
AGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGATGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATC
TTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTT
CCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGA
TTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTT
CTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAACT
ACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGG
TGAAAAACTGAAATCCACGCAGGCAACATTACAGAGTCAAAGAAGTATTAAATCAGCATCTGACGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCT
GTAAAGACGATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGC
AAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAA
mRNA sequence
ATGGGGTCCGAGCAGAATAGAATCCCTCAACAAGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAAGGAGAAAAGCGCATTGT
GCCTGCATCTCGTTTACCTGAGGGCAATGTCGTGACAACCCAGCCAAATGGACCTCATGCAGCAGGAATAGCCAATCAGGCTACTGTGATAGCTCCATCCCTTTTAGCCC
CACCTTCTTCACCAGCATCATTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCTAGCTGTTTCTTGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTTCGCTACAGGGCCATATGCGCATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCACCCGAACT
AGCTCACCTAACCACTCCTTCTTCCCCTGATGTGCCTTTTGCTCAGTTCCTATCCTCATCGATGGATCTCAAAGGAACTGGAAAGGCAAATTACGTTGCTTCCAATGATC
TTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGCGATTGCTTATCATCTTCTTTTCCTGAGAGGGACTTT
CCACCACAGTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGACATGAGAAAACTGGCACACCTCTTGCTTCTCAGGA
TTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTTTACGCTT
CTGGTGGGAATGGATACCAAAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCTTTTGGTTTCAGTGCCGATGAAATTATAACT
ACCACACAATACGTGGAGATATCTGATGTAATGGAAGATTCCTTTACTATGAGACCTTTTACTTCAACTAGTCTGTCTGCAGAAGAAAGTATTCAACCTCCATTAGTGGG
TGAAAAACTGAAATCCACGCAGGCAACATTACAGAGTCAAAGAAGTATTAAATCAGCATCTGACGTTGAAAAAGAAACCTGCTCTGAAGTCCTGGCATTATGCAATGGCT
GTAAAGACGATAAATTGCAAAGACAACCTGGTAACTTGCCAGGATCAAGTACTTCCCAAGGTGAAACAGAAGACCTATTCTCAAGAATAGGGTCGTCCAAAAATAGCCGC
AAGTATAATCATGCTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTGAGGGGGGAGGTCAAGGGAGATTTTTTATGGCATGACTAAGAGCCATT
TCTGAAATAGTTTTTAGTATGTGTATCTGTTCTTTGCTTTGCGGGTTTCCATGGAATGTCTAACCTATGACCTAACCTGTTCTCTGAGTTGACTTGATCAGTGGATGGAT
GCATATAGTTTGTGTTCTTTATCGTTCCAGTTTAGAAATGGAATGGGTCAAATCTCTGTATTTGGTAGACGAAGTTTGTCTTTTTTCCAATGAATTATATAATAGCATGT
ACAATCTTGCCCATCTCGTGCATACGTCATCATTATGCATATCCAAACTCCCGTTAACAGT
Protein sequence
MGSEQNRIPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPHAAGIANQATVIAPSLLAPPSSPASFTNSALPSTAQSPSCFLSLSANSPGGPSST
MFATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSMDLKGTGKANYVASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF
PPQWNPSASLQDGKYPRSGSGRLFGHEKTGTPLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYASGGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIIT
TTQYVEISDVMEDSFTMRPFTSTSLSAEESIQPPLVGEKLKSTQATLQSQRSIKSASDVEKETCSEVLALCNGCKDDKLQRQPGNLPGSSTSQGETEDLFSRIGSSKNSR
KYNHALSCSDAEVDYRRGRSLRGEVKGDFLWHD
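
The protein sequence above should correspond to a codon-by-codon translation of the CDS sequence. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used. The name `translate` is a hypothetical helper, and the two inputs are the first 18 and last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGGTCCGAGCAGAAT"))  # MGSEQN (start of the protein)
print(translate("TGGCATGACTAA"))        # WHD (end of the protein, before TAA)
```

Both fragments agree with the first and last residues of the protein sequence shown on this page. Translating the full CDS would require joining its lines into one string first.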