; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0008423 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0008423
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationchr07:87909..92723
RNA-Seq ExpressionIVF0008423
SyntenyIVF0008423
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064685.1 uncharacterized protein E6C27_scaffold255G004100 [Cucumis melo var. makuwa]0.098.09Show/hide
Query:  GSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
        G E N       GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
Subjt:  GSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA

Query:  NSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
        NSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
Subjt:  NSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS

Query:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
        PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
Subjt:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN

Query:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPAL
        RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPAL
Subjt:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPAL

Query:  CNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        CNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  CNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

TYK19905.1 uncharacterized protein E5676_scaffold134G00170 [Cucumis melo var. makuwa]0.0100Show/hide
Query:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIY
        RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIY
Subjt:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIY

Query:  ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
        SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQR

Query:  QPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        QPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  QPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_004150969.1 uncharacterized protein At1g76660 [Cucumis sativus]0.096.83Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST+YATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQ+QRSIKSAPE    ETCTE+PA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA

Query:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LCNGYKDNKLQRQPGDI GSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_008453041.1 PREDICTED: uncharacterized protein At1g76660 [Cucumis melo]0.0100Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA

Query:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_038899313.1 uncharacterized protein At1g76660 [Benincasa hispida]2.44e-31493.63Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST++ATGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGSGRLFGNEKA TSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSD YSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA
        NRH+KSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTST+LSAEES EPPLLGEKLKS+HTTLQ+QRSIKSAPEVVEKETCTEV A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA

Query:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSW
        LCNGYKDNKLQRQPG++ GSSTS+QVEKD+FSRIGS KNSRKY+LGLS SDAEVDYRRGRSLREAKG+ SW
Subjt:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSW

TrEMBL top hitse value%identityAlignment
A0A0A0L1G3 Uncharacterized protein4.1e-25796.83Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST+YATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQ+QRSIKSAPE    ETCTE+PA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA

Query:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LCNGYKDNKLQRQPGDI GSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A1S3BV86 uncharacterized protein At1g766605.7e-267100Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA

Query:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A5A7VFM0 Uncharacterized protein3.3e-25998.09Show/hide
Query:  GSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
        G E N       GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
Subjt:  GSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA

Query:  NSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
        NSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
Subjt:  NSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS

Query:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
        PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
Subjt:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN

Query:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPAL
        RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPAL
Subjt:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPAL

Query:  CNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        CNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  CNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A5D3D8J8 Uncharacterized protein1.5e-259100Show/hide
Query:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIY
        RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIY
Subjt:  RGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIY

Query:  ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
        SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQR

Query:  QPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        QPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  QPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A6J1BVS7 uncharacterized protein At1g766606.3e-23489.89Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQQER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST++ATGPYAHE Q VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGK NYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN S+S QDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+ TT+Q+QRS+K A +VVEKETC EV  
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPA

Query:  LCNGYKDNKLQRQPGDILGSSTS-DQVE-KDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LCNG +DNKLQRQPG++ GSS+S +QVE +DVFSRI   KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  LCNGYKDNKLQRQPGDILGSSTS-DQVE-KDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766601.9e-12658.23Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL
        MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL

Query:  SLSANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS++YATGPYAHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS
        +L SPISR SGD L S                 Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD + + P  GGRLSVSKDSDVY  + 
Subjt:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS

Query:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL      L +Q S KS  ++  +  
Subjt:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKET

Query:  CTEVPALCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN
          + P   N YKD+K + +             E+ + SR+GS K SR Y   +S SDAEV+YRRGRSLRE++ N
Subjt:  CTEVPALCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)1.6e-2741.55Show/hide
Query:  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIYAT
        ++W   W  L CF S +  KRI  +  +PE   V+   +    +    ++ + T   +APPSSPASF  S  PS  QSP   LS S   P     +I+A 
Subjt:  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTIYAT

Query:  GPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDC
        GPYAHETQ VSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PGSP   L+SP   + G  
Subjt:  GPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDC

Query:  LSSSFPE
         +S FP+
Subjt:  LSSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.3e-12758.23Show/hide
Query:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL
        MGSE      Q++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL

Query:  SLSANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS++YATGPYAHETQ VSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS
        +L SPISR SGD L S                 Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD + + P  GGRLSVSKDSDVY  + 
Subjt:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS

Query:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKET
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL      L +Q S KS  ++  +  
Subjt:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKET

Query:  CTEVPALCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN
          + P   N YKD+K + +             E+ + SR+GS K SR Y   +S SDAEV+YRRGRSLRE++ N
Subjt:  CTEVPALCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN

AT4G25620.1 hydroxyproline-rich glycoprotein family protein2.7e-2734.76Show/hide
Query:  SEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFLSLS
        S ++R       K+ G  W    CF S+K  KRI  A  +PE    +     P     +N  ++  P  +APPSSPASF  S  PS      P    SL+
Subjt:  SEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFLSLS

Query:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS-----DDLKGTGKANYIASNDLQAAYSLYPGSP
         N P  PS+  +  GPYAHETQPV+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS      +  G     + A++    +  +YPGSP
Subjt:  ANSPGGPSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS-----DDLKGTGKANYIASNDLQAAYSLYPGSP

Query:  ASSLVSPISRTS----GDCL--------------SSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFP
          +L+SP S TS    G C                  F  R +  ++ S +    G+  R GSG L  +   G+ L S         T  +    N T P
Subjt:  ASSLVSPISRTS----GDCL--------------SSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFP

Query:  HTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEI
          G  L  S+ S+V S   +     H  S   D   +  +R SF  + +++
Subjt:  HTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEI

AT5G52430.1 hydroxyproline-rich glycoprotein family protein2.8e-3246.54Show/hide
Query:  PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGG
        P   +  RWG CW   SCF +QK  KRI  A  +PE   VT+              TV+ P  +APPSSPASF  S   S   SP   LSL++N  SP  
Subjt:  PQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGG

Query:  PSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSDDL----KGTGKANYIASNDLQ-AAYSLYPGSP-ASSL
        P S ++  GPYA+ETQPV+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS +L      +G     +S+  +  +  + PGSP   +L
Subjt:  PSSTIYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSDDL----KGTGKANYIASNDLQ-AAYSLYPGSP-ASSL

Query:  VSPISRTSGDCLSSSFP
        +SP S  S    SS +P
Subjt:  VSPISRTSGDCLSSSFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCCGAGCAGAACAGATTCCCTCAGCAGGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTGTCTTGTTTTCACTCTCAGAAAGGAGAGAAGCGCATTGT
ACCTGCATCTCGTTTACCTGAGGGCAATGTTGTGACAACCCAACCTAATGGACCTCAAGCTGCGGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCC
CACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCTTGTCGTTATCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATTTATGCTACGGGGCCATATGCACATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTTACCACTGAACCGTCGACTGCTCCACTCACCCCCCCACCTGAACT
AGCCCACCTAACCACACCTTCTTCTCCTGATGTGCCGTTTGCTCAGTTCCTTTCCTCATCAGATGATCTGAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATC
TTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCTAGTAGCCTCGTGTCACCAATTTCAAGAACCTCTGGCGATTGTTTATCATCTTCATTTCCTGAGAGGGACTTC
CGACCACAGTGGAATTCTTCAGCTTCTCTCCAAGATGGAAAATATCCTAGAAGTGGTTCTGGTCGGTTATTTGGAAATGAGAAAGCTGGTACATCTTTGGCATCTCAGGA
TTCCAATTTCTTTTGCCCTGCTACGTTTGCACAATTCTATCTGGACAATACAACATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTCTACTCGT
CTTGTGGGAACGGATACCAGAACAGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCGTTTGGTTTCAGTGCAGATGAAATTATAACT
ACAACACAATATGTAGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTGTCAGCAGAAGAAAGTACTGAACCTCCACTGTTGGG
TGAAAAACTAAAATCCTCGCATACAACTTTGCAAAATCAGAGAAGTATTAAATCAGCGCCTGAGGTTGTTGAAAAGGAAACCTGCACTGAAGTGCCGGCATTATGCAATG
GTTATAAAGACAATAAACTGCAAAGACAACCTGGTGACATATTAGGATCAAGTACTTCAGACCAAGTTGAAAAAGACGTATTCTCGAGGATAGGGTCATCCAAAAATAGT
CGTAAGTATGATCTTGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGAAGCCAAGGGAAATGGTTCATGGCACGACTAA
mRNA sequenceShow/hide mRNA sequence
AATGAACCGAGCTGGTCGGGCTTCTCGCAGAGTTTCAGATTGAACCGAGAGGCATCGATTCGGTTATCAGTTTGAACTGAACCGACCCCATGGTAAGTGTTGCTTTTGGC
TTTAGGGTTTCTCCAATTTAGTATCTCTCCTTCTACTGTCAGCCAGCGGCCTCTCTCTTCTCTGTCTTTCGCATTTGCTGACTCAATCACACTCCGCACCCTTGCTCATC
CGCCTTTGCCGTCGATTACTCAGCCACCGTTGCTTTCGACTACGCAGGCACCGTCTCCGTCGATTTCTTCCGCCCTCATTTCCCATCGGAAATTATAGTACATTGGAGTG
TGTTTTTCTCTTTCTTTATTAGCTACTCTTTTCAGCGTCGAGAGACCAAATATATATGCTGTCAGATTTTGCTCTGTGAATGGAGAAATGCCAGGGTGGATATGGCCTGA
CTCGCATTACTCCTTAAATGGATGAAGTTGGCGGAGGTAGAGGAGGAGGAGGGCCTTGAGTCTTGTAGGAAGAGGTAGCTTAAGACACTAAAAGATAAGTAGAGGCTTCG
TTGAGCTACTGGTTACGAATGGGATCCGAGCAGAACAGATTCCCTCAGCAGGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTGTCTTGTTTTCACTCTCAGAA
AGGAGAGAAGCGCATTGTACCTGCATCTCGTTTACCTGAGGGCAATGTTGTGACAACCCAACCTAATGGACCTCAAGCTGCGGGAATGACCAACCAGGCTACAGTGATAA
CTCCATCCCTTCTAGCCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCTTGTCGTTATCTGCCAACTCACCT
GGAGGTCCTTCATCCACAATTTATGCTACGGGGCCATATGCACATGAAACACAACCGGTTTCTCCTCCTGTTTTCTCAGCCTTTACCACTGAACCGTCGACTGCTCCACT
CACCCCCCCACCTGAACTAGCCCACCTAACCACACCTTCTTCTCCTGATGTGCCGTTTGCTCAGTTCCTTTCCTCATCAGATGATCTGAAAGGAACTGGAAAGGCCAATT
ACATTGCTTCAAATGATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCTAGTAGCCTCGTGTCACCAATTTCAAGAACCTCTGGCGATTGTTTATCATCTTCA
TTTCCTGAGAGGGACTTCCGACCACAGTGGAATTCTTCAGCTTCTCTCCAAGATGGAAAATATCCTAGAAGTGGTTCTGGTCGGTTATTTGGAAATGAGAAAGCTGGTAC
ATCTTTGGCATCTCAGGATTCCAATTTCTTTTGCCCTGCTACGTTTGCACAATTCTATCTGGACAATACAACATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGG
ATTCAGATGTCTACTCGTCTTGTGGGAACGGATACCAGAACAGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCGTTTGGTTTCAGT
GCAGATGAAATTATAACTACAACACAATATGTAGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTGTCAGCAGAAGAAAGTAC
TGAACCTCCACTGTTGGGTGAAAAACTAAAATCCTCGCATACAACTTTGCAAAATCAGAGAAGTATTAAATCAGCGCCTGAGGTTGTTGAAAAGGAAACCTGCACTGAAG
TGCCGGCATTATGCAATGGTTATAAAGACAATAAACTGCAAAGACAACCTGGTGACATATTAGGATCAAGTACTTCAGACCAAGTTGAAAAAGACGTATTCTCGAGGATA
GGGTCATCCAAAAATAGTCGTAAGTATGATCTTGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGGGAAGCCAAGGGAAATGGTTCATG
GCACGACTAAGACAACCTCTCTGGAATAATTTGCAGTATGTTTGTGTATCTTTTTTTCTGCTTTGCAAGGTTTCCATGGAATGTCTAACGTATGATCTAACCGGTTGTCT
CTCTGAGTTATGACTTGGCCACGGGATGGAAGAACGATTATAATTTGTGTTCTTTATCGTGCCAATTTAGATATGGAATGGGTCAGATTTCTGTATTTGGTAGACAAAGT
TGACTTTTTTCGATGAATTGTATACGTTGTATAATCTGCTTGCCTATTCTGTGCCCCAATGTGTATGTCTTGCATTTCTAGAATTTGGCATGTCATCATGTTCCTATCAA
GAGCCACCTTATTATCTGATCCAATCCATGGGTGTTATTGTAAAGAGCGTTCCAGACAGAATTTTTACCTTTGTGGGGATTTTAATCTTTTACAGAACTAATTGGAACAA
CAGGATGAGGGAGGGATTGCCAAATTTACAGGAAAAGGATTTGC
Protein sequenceShow/hide protein sequence
MGSEQNRFPQQERGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST
IYATGPYAHETQPVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSDDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF
RPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIIT
TTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQNQRSIKSAPEVVEKETCTEVPALCNGYKDNKLQRQPGDILGSSTSDQVEKDVFSRIGSSKNS
RKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD