CuGenDBv2

CSPI04G27420 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G27420
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: Chr4:24087698..24092177
RNA-Seq Expression: CSPI04G27420
Synteny: CSPI04G27420
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
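
The genome location field follows a chromosome:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the span of the locus could be computed as below. The function name parse_location is hypothetical and is not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chromosome = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so both endpoints count.
    return chromosome, start, end, end - start + 1

print(parse_location("Chr4:24087698..24092177"))
```

Under that assumption the locus spans 4480 bases on Chr4.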


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064685.1 uncharacterized protein E6C27_scaffold255G004100 [Cucumis melo var. makuwa] (e value: 4.4e-250; % identity: 95.13)
Query:  GSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
        G E N       GKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
Subjt:  GSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA

Query:  NSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
        NSPGGPSST+YATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
Subjt:  NSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS

Query:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
        PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
Subjt:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN

Query:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPAL
        RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQ+QRSIKSAPE    ETCTE+PAL
Subjt:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPAL

Query:  CNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        CNGYKDNKLQRQPGDI GSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  CNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

TYK19905.1 uncharacterized protein E5676_scaffold134G00170 [Cucumis melo var. makuwa] (e value: 2.0e-250; % identity: 96.96)
Query:  RGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMY
        RGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST+Y
Subjt:  RGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMY

Query:  ATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
        SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQ+QRSIKSAPE    ETCTE+PALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKLQR

Query:  QPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        QPGDI GSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  QPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_004150969.1 uncharacterized protein At1g76660 [Cucumis sativus] (e value: 2.2e-265; % identity: 100)
Query:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCTEMPALCNG
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCTEMPALCNG
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCTEMPALCNG

Query:  YKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        YKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  YKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_008453041.1 PREDICTED: uncharacterized protein At1g76660 [Cucumis melo] (e value: 2.2e-257; % identity: 96.83)
Query:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST+YATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQ+QRSIKSAPE    ETCTE+PA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPA

Query:  LCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LCNGYKDNKLQRQPGDI GSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

XP_038899313.1 uncharacterized protein At1g76660 [Benincasa hispida] (e value: 4.0e-243; % identity: 92.78)
Query:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLS
Subjt:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+TQLVSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN SASLQDGKYPRSGSGRLFGNEKA TSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSD YSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPA
        NRH+KSPKQDVEEIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTST+LSAEES EPPLLGEKLKS+HTTLQSQRSIKSAPE    ETCTE+ A
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPA

Query:  LCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSW
        LCNGYKDNKLQRQPG++SGSSTSNQVEKD+FSRIGS KNSRKY+LGLS SDAEVDYRRGRSLREAKG+ SW
Subjt:  LCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L1G3 Uncharacterized protein (e value: 1.1e-265; % identity: 100)
Query:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCTEMPALCNG
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCTEMPALCNG
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCTEMPALCNG

Query:  YKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        YKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  YKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A1S3BV86 uncharacterized protein At1g76660 (e value: 1.1e-257; % identity: 96.83)
Query:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ERGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSST+YATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQ+QRSIKSAPE    ETCTE+PA
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPA

Query:  LCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LCNGYKDNKLQRQPGDI GSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  LCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A5A7VFM0 Uncharacterized protein (e value: 2.1e-250; % identity: 95.13)
Query:  GSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
        G E N       GKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA
Subjt:  GSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSA

Query:  NSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
        NSPGGPSST+YATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLVS
Subjt:  NSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVS

Query:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
        PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN
Subjt:  PISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQN

Query:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPAL
        RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQ+QRSIKSAPE    ETCTE+PAL
Subjt:  RHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPAL

Query:  CNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        CNGYKDNKLQRQPGDI GSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  CNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A5D3D8J8 Uncharacterized protein (e value: 9.6e-251; % identity: 96.96)
Query:  RGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMY
        RGKRWGGCWGALSCFHSQKG+KRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST+Y
Subjt:  RGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSSTMY

Query:  ATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAH+TQ VSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS+DLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
        SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQ+QRSIKSAPE    ETCTE+PALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPALCNGYKDNKLQR

Query:  QPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        QPGDI GSSTS+QVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
Subjt:  QPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

A0A6J1BVS7 uncharacterized protein At1g76660 (e value: 5.9e-232; % identity: 89.26)
Query:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
        MGSEQNRFPQ ER KRWGGCWGALSCFHSQKG KRIVPASRLPEGN VTTQPNGPQAAGMTNQATVI PSLLAPPSSPASFTNSALPSTVQSPSCFLSLS
Subjt:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV
        ANSPGGPSSTM+ATGPYAH+ QLVSPPVFSAF TEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGK NYIASNDLQAAYSLYPGSPASSLV
Subjt:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLV

Query:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ
        SPISRTSGDCLSSSFPERDF PQWN S+S QDGKYPRSGSGRLFG+EK GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQ
Subjt:  SPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQ

Query:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPA
        NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+ TT+QSQRS+K A +    ETC E+  
Subjt:  NRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPE----ETCTEMPA

Query:  LCNGYKDNKLQRQPGDISGSSTS-NQVE-KDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD
        LCNG +DNKLQRQPG++SGSS+S NQVE +DVFSRI   KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  LCNGYKDNKLQRQPGDISGSSTS-NQVE-KDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGNGSWHD

SwissProt top hits (e value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660 (e value: 4.9e-127; % identity: 58.44)
Query:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL
        MGSEQ      ++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL

Query:  SLSANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+MYATGPYAH+TQLVSPPVFS F TEPSTAP TPPPELA LT PSSPDVP+A+FL+SS DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS
        +L SPISR SGD L S                 Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD + + P  GGRLSVSKDSDVY  + 
Subjt:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS

Query:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCT--
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL      L SQ S KS  +      
Subjt:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCT--

Query:  --EMPALCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN
          + P   N YKD+K + +          +  E+ + SR+GS K SR Y   +S SDAEV+YRRGRSLRE++ N
Subjt:  --EMPALCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN

Arabidopsis top hits (e value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) (e value: 4.1e-28; % identity: 40.38)
Query:  PQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPS
        P H++ ++W   W  L CF S +  KRI  +  +PE   V+   +    +    ++ + T   +APPSSPASF  S  PS  QSP   LS S   P    
Subjt:  PQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPS

Query:  STMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPIS
         +++A GPYAH+TQLVSPPVFS + TEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PGSP   L+SP  
Subjt:  STMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPIS

Query:  RTSGDCLSSSFPE
         + G   +S FP+
Subjt:  RTSGDCLSSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown (e value: 3.5e-128; % identity: 58.44)
Query:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL
        MGSEQ      ++ KRWGGC G  SCF SQKG KRIVPASR+PE GNV  +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPE-GNVVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFL

Query:  SLSANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPAS
        SL+ANSPGGPSS+MYATGPYAH+TQLVSPPVFS F TEPSTAP TPPPELA LT PSSPDVP+A+FL+SS DLK +GK +Y   NDLQA YSLYPGSPAS
Subjt:  SLSANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPAS

Query:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS
        +L SPISR SGD L S                 Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD + + P  GGRLSVSKDSDVY  + 
Subjt:  SLVSPISRTSGDCLSSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NTTFPHTGGRLSVSKDSDVY--SS

Query:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCT--
         GNG QNR ++SPKQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL      L SQ S KS  +      
Subjt:  CGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCT--

Query:  --EMPALCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN
          + P   N YKD+K + +          +  E+ + SR+GS K SR Y   +S SDAEV+YRRGRSLRE++ N
Subjt:  --EMPALCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYDLGLSCSDAEVDYRRGRSLREAKGN

AT4G25620.1 hydroxyproline-rich glycoprotein family protein (e value: 3.0e-26; % identity: 33.9)
Query:  SEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFLSLS
        S ++R       K+ G  W    CF S+K +KRI  A  +PE    +     P     +N  ++  P  +APPSSPASF  S  PS      P    SL+
Subjt:  SEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFLSLS

Query:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS-----EDLKGTGKANYIASNDLQAAYSLYPGSP
         N P  PS+  +  GPYAH+TQ V+PPVFSAF TEPSTAP TPPPE     +PSSP+VPFAQ L+SS      +  G     + A++    +  +YPGSP
Subjt:  ANSPGGPSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS-----EDLKGTGKANYIASNDLQAAYSLYPGSP

Query:  ASSLVSPISRTS----GDCL--------------SSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFP
          +L+SP S TS    G C                  F  R +  ++ S +    G+  R GSG L  +   G+ L S         T  +    N T P
Subjt:  ASSLVSPISRTS----GDCL--------------SSSFPERDFRPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFP

Query:  HTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEI
          G  L  S+ S+V S   +     H  S   D   +  +R SF  + +++
Subjt:  HTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEI

AT5G52430.1 hydroxyproline-rich glycoprotein family protein (e value: 3.1e-31; % identity: 45.62)
Query:  PQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGG
        P   +  RWG CW   SCF +QK +KRI  A  +PE   VT+              TV+ P  +APPSSPASF  S   S   SP   LSL++N  SP  
Subjt:  PQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSAN--SPGG

Query:  PSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSEDL----KGTGKANYIASNDLQ-AAYSLYPGSP-ASSL
        P S ++  GPYA++TQ V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS +L      +G     +S+  +  +  + PGSP   +L
Subjt:  PSSTMYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSEDL----KGTGKANYIASNDLQ-AAYSLYPGSP-ASSL

Query:  VSPISRTSGDCLSSSFP
        +SP S  S    SS +P
Subjt:  VSPISRTSGDCLSSSFP
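
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the final block of the AT5G52430.1 alignment can be tallied as below. The helper name midline_stats is hypothetical, and because the snippet covers only one 17-column block, its percentage is not expected to match the whole-hit % identity reported for that entry.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style match line."""
    # Letters are identical residues; '+' marks a conservative substitution.
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Final block of the AT5G52430.1 alignment, copied from the record.
query   = "VSPISRTSGDCLSSSFP"
midline = "+SP S  S    SS +P"

identities, positives, length = midline_stats(midline)
percent = 100 * identities / length
print(f"{identities}/{length} identical ({percent:.2f}%), {positives} positives")
```

Summing identities and aligned columns over every block of a hit, rather than a single block, would be the natural extension for approximating the per-hit value.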


Sequences
CDS sequence
ATGGGGTCCGAGCAGAACAGATTCCCTCAGCACGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTGTCTTGTTTTCACTCTCAGAAAGGAGATAAGCGCATTGT
ACCTGCATCTCGTTTACCTGAGGGCAATGTTGTGACAACCCAGCCTAATGGACCTCAAGCTGCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCC
CACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCTTGTCGCTATCTGCCAACTCACCTGGAGGTCCTTCATCCACA
ATGTATGCTACAGGGCCATACGCACATGACACACAACTGGTTTCTCCTCCTGTTTTCTCAGCCTTCAACACTGAACCTTCAACTGCTCCACTCACCCCCCCACCTGAACT
AGCCCACCTAACCACACCTTCTTCCCCTGATGTACCTTTTGCTCAGTTCCTTTCCTCATCAGAGGATCTGAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATC
TTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGAACCTCTGGCGATTGTTTATCATCTTCATTTCCTGAGAGGGACTTC
CGACCACAGTGGAATTCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGTTATTTGGAAATGAGAAAGCTGGTACATCTTTGGCATCTCAGGA
TTCCAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATACAACATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTCTACTCGT
CTTGTGGGAATGGGTACCAGAACAGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCATTTGGTTTCAGTGCAGATGAAATTATAACT
ACTACACAATATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTTTGTCAGCAGAAGAAAGTACTGAACCTCCACTGTTGGG
TGAAAAACTAAAATCCTCGCATACAACTTTACAAAGTCAGAGAAGTATTAAATCAGCACCTGAGGAAACTTGCACTGAAATGCCGGCATTATGCAATGGTTATAAAGACA
ATAAATTGCAAAGACAACCTGGTGACATATCAGGATCAAGTACCTCAAACCAAGTTGAAAAAGACGTATTCTCAAGGATAGGGTCATCCAAAAATAGTCGCAAGTATGAT
CTTGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGGAGGAGCCTAAGGGAAGCCAAGGGAAATGGTTCATGGCACGACTAA
mRNA sequence
CTCACATAGTTTCAGATTGAACCGAGAGGCATCGATTCGGTTATCAGTTTGAATTGAACCGACCCCATGGTGGGTGTTGCTTTTGGCTTTAGGGTTTCTCCAATTTAGTA
TCTCTCCTCCAATTTAGTATCAGCCAGCGGCCTCTCTCTTCTCTGTCTTTTGCATTTGCTGACTCAATCACACTCACTCCGCACCCTCGCTCATCTGTCTTTGCCGTCGA
TTACTCAGCCACCGTTGCTTTCGACTACGCAGGCACCGTCTCCGTCGATTTCTTCCGCCCTCATTTCCTATCGGAAATTAAAGTACATTGGAGTGTGTTTTTCTCTTTCT
TTATTAGCTACTCTTTTCAGCGTCGAGAGACCGAATACATATGCTGTCAGATTTTGCTCTGTGAATGGAGAAATGCCAGGGTGGACATGGCCTGACTCTCATTACTCCTT
AAATGGATGAAGTTGGCGGAGGAAGAGGAGGAGGAGGGCCTTGAGTCTTGTAGGAAGAGGTAGCTTAAGACACTAAAAGATAAGTACAGGCTTCGTTGAGCTACTGGTTA
CGAATGGGGTCCGAGCAGAACAGATTCCCTCAGCACGAACGGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTGTCTTGTTTTCACTCTCAGAAAGGAGATAAGCGCAT
TGTACCTGCATCTCGTTTACCTGAGGGCAATGTTGTGACAACCCAGCCTAATGGACCTCAAGCTGCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAG
CCCCACCTTCTTCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCTTGTCGCTATCTGCCAACTCACCTGGAGGTCCTTCATCC
ACAATGTATGCTACAGGGCCATACGCACATGACACACAACTGGTTTCTCCTCCTGTTTTCTCAGCCTTCAACACTGAACCTTCAACTGCTCCACTCACCCCCCCACCTGA
ACTAGCCCACCTAACCACACCTTCTTCCCCTGATGTACCTTTTGCTCAGTTCCTTTCCTCATCAGAGGATCTGAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATG
ATCTTCAAGCAGCATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGAACCTCTGGCGATTGTTTATCATCTTCATTTCCTGAGAGGGAC
TTCCGACCACAGTGGAATTCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGTTATTTGGAAATGAGAAAGCTGGTACATCTTTGGCATCTCA
GGATTCCAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATACAACATTCCCTCATACTGGTGGGAGGTTAAGTGTATCAAAGGATTCAGATGTCTACT
CGTCTTGTGGGAATGGGTACCAGAACAGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCATTTGGTTTCAGTGCAGATGAAATTATA
ACTACTACACAATATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTTTGTCAGCAGAAGAAAGTACTGAACCTCCACTGTT
GGGTGAAAAACTAAAATCCTCGCATACAACTTTACAAAGTCAGAGAAGTATTAAATCAGCACCTGAGGAAACTTGCACTGAAATGCCGGCATTATGCAATGGTTATAAAG
ACAATAAATTGCAAAGACAACCTGGTGACATATCAGGATCAAGTACCTCAAACCAAGTTGAAAAAGACGTATTCTCAAGGATAGGGTCATCCAAAAATAGTCGCAAGTAT
GATCTTGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGAGGGAGGAGCCTAAGGGAAGCCAAGGGAAATGGTTCATGGCACGACTAAGACAACCTCTCTGGAAC
GGTTTGCAGTATGTTTGTGTATCTGTTTTTCTGCTTTGCAAGGTTTCCATGGAATGTCTAACGTATGATCTAACCTGTTGTCTCCCTGAGTTATGACCTGGCCACGGGAT
GGAAGAACGATTATAATTTGTGTTCTTTATCGTGCCAATTTAGATATGGAATGGGTCAGATTTCTGTATTTGGTAGACAAAGTTGACTTTTTCCGATGAATTGTATACGC
TGTATAATCTGCTTGCCTATTCTGTGCCTCAATGTGTATGTCCTGCATTTCTAGAATTTGGCATGTCATCATGGTCCTATCAAGAGCAACCTTGCTATCCGATCCAATCC
ATAGGTGTTAACTTTGTTGTAAAGAGCCTTCCGG
Protein sequence
MGSEQNRFPQHERGKRWGGCWGALSCFHSQKGDKRIVPASRLPEGNVVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFLSLSANSPGGPSST
MYATGPYAHDTQLVSPPVFSAFNTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSEDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDF
RPQWNSSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNTTFPHTGGRLSVSKDSDVYSSCGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIIT
TTQYVEISDVMEDSFTMRPFTSTSLSAEESTEPPLLGEKLKSSHTTLQSQRSIKSAPEETCTEMPALCNGYKDNKLQRQPGDISGSSTSNQVEKDVFSRIGSSKNSRKYD
LGLSCSDAEVDYRRGRSLREAKGNGSWHD
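
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (the record does not name a translation table) and uses only the first 60 and last 12 nucleotides of the CDS, copied from the record, rather than the full sequence. The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino = CODON_TABLE[cds[i:i + 3].upper()]
        if amino == "*":  # stop codon ends the protein
            break
        protein.append(amino)
    return "".join(protein)

# First 60 nucleotides of the CDS sequence listed above.
cds_start = "ATGGGGTCCGAGCAGAACAGATTCCCTCAGCACGAACGGGGAAAGAGATGGGGTGGATGT"
# Last 12 nucleotides of the CDS sequence, ending in the TAA stop codon.
cds_end = "TGGCACGACTAA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first 20 residues (MGSEQNRFPQHERGKRWGGC) and the last three residues (WHD) of the protein sequence shown above.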