; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G20650 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G20650
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationClcChr11:30810582..30815361
RNA-Seq ExpressionClc11G20650
SyntenyClc11G20650
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591175.1 hypothetical protein SDJN03_13521, partial [Cucurbita argyrosperma subsp. sororia]2.0e-21982.62Show/hide
Query:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQAS------------------------------WPGKRWGGCWGALSCFHSQKGEKRIV
        M PAVN L L++WTP+IEKSSMNWICGKFLSFQKGGCLSV Q +                                GK+WGGCWGALSCFHSQKGEKRIV
Subjt:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQAS------------------------------WPGKRWGGCWGALSCFHSQKGEKRIV

Query:  PASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPS
        PASRLPEGN VTTQPN P AAGM  QATVI PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPS
Subjt:  PASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPS

Query:  TAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPR
        TAPLTPPPELAHLTTPSSPDVPFA+FLSSSVDLKGTGK NYIASNDLQ AYSLYPGSP+SSLVSPISRTSGDCL SSFPERDFP QWNPS S QDGKYPR
Subjt:  TAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPR

Query:  SGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVE
        +GSGRLFG+EKA GTSL SQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYS  GN  QNRH+KSPKQDVEE+EAYRASFGFSADEII TTQYVE
Subjt:  SGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVE

Query:  ISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ
        IS VMEDSFTM+PFTSTSLSAEES EPPLL E LKS HTTLQSQR IKS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Subjt:  ISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ

XP_022975613.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita maxima]9.5e-21785.84Show/hide
Query:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI
        M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPN P  AGM  QATVI
Subjt:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI

Query:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS
         PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSS
Subjt:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS

Query:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF
        +DLKG GK NYIASNDLQ AYSLYPGSP+SSLVSPISRTSGDCL SSFPERDFP QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Subjt:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN +QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL
Subjt:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ
         E L S HTTLQSQR IKS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Subjt:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ

XP_023521113.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo]9.5e-21786.27Show/hide
Query:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI
        M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPN P AAGM  QATVI
Subjt:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI

Query:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS
         PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSS
Subjt:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS

Query:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF
        VDLKGTGK NYIASNDLQ AYSLYPGSP+SSLVSPISRTSGDCL SSFPERDF  QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Subjt:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN  QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL
Subjt:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ
         E L S HTTLQSQR IKS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Subjt:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ

XP_023521115.1 uncharacterized protein At1g76660-like isoform X2 [Cucurbita pepo subsp. pepo]1.2e-21686.27Show/hide
Query:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI
        M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPN P AAGM  QATVI
Subjt:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI

Query:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS
         PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSS
Subjt:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS

Query:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF
        VDLKGTGK NYIASNDLQ AYSLYPGSP+SSLVSPISRTSGDCL SSFPERDF  QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Subjt:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN  QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL
Subjt:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ
         E L S HTTLQSQR IKS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Subjt:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ

XP_038899313.1 uncharacterized protein At1g76660 [Benincasa hispida]3.6e-22498.31Show/hide
Query:  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
        GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
Subjt:  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA

Query:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS
        TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS
Subjt:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS

Query:  SFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE
        SFPERDFP QWNPSASLQDGKYPRSGSGRLFGNEKA  TSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD YSSSGNGYQNRH+KSPKQDVE
Subjt:  SFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR
        EIEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTST+LSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR

Query:  QPGNMSGSSTSNQ
        QPGNMSGSSTSNQ
Subjt:  QPGNMSGSSTSNQ

TrEMBL top hitse value%identityAlignment
A0A1S3BV86 uncharacterized protein At1g766609.5e-21594.92Show/hide
Query:  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
        GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++A
Subjt:  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA

Query:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS
        TGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS
Subjt:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS

Query:  SFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE
        SFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKA GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Subjt:  SFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+HTTLQ+QRSIKSAPEVVEKETCTEV ALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR

Query:  QPGNMSGSSTSNQ
        QPG++ GSSTS+Q
Subjt:  QPGNMSGSSTSNQ

A0A5A7VFM0 Uncharacterized protein9.5e-21594.92Show/hide
Query:  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
        GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++A
Subjt:  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA

Query:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS
        TGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS
Subjt:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS

Query:  SFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE
        SFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKA GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Subjt:  SFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+HTTLQ+QRSIKSAPEVVEKETCTEV ALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR

Query:  QPGNMSGSSTSNQ
        QPG++ GSSTS+Q
Subjt:  QPGNMSGSSTSNQ

A0A5D3D8J8 Uncharacterized protein9.5e-21594.92Show/hide
Query:  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
        GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++A
Subjt:  GKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA

Query:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS
        TGPYAHETQ VSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS
Subjt:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSS

Query:  SFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE
        SFPERDF  QWN SASLQDGKYPRSGSGRLFGNEKA GTSLASQDSNFFCPATFAQFYLDN  FPHTGGRLSVSKDSDVYSS GNGYQNRHSKSPKQDVE
Subjt:  SFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+HTTLQ+QRSIKSAPEVVEKETCTEV ALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQR

Query:  QPGNMSGSSTSNQ
        QPG++ GSSTS+Q
Subjt:  QPGNMSGSSTSNQ

A0A6J1F9B9 uncharacterized protein At1g76660-like isoform X13.0e-21686.06Show/hide
Query:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI
        M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPN P AAGM  QATVI
Subjt:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI

Query:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS
         PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSS
Subjt:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS

Query:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF
        VDLKGTGK NYIASNDLQ AYSLYPGSP+SSLVSPISRTSGDCL SSFPERDFP QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Subjt:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN  QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EP LL
Subjt:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ
         E L S HTTLQS R IKS P+VV+K+TCTEVLALC  Y+DNKLQRQPGNMSGSST NQ
Subjt:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ

A0A6J1IL36 uncharacterized protein At1g76660-like isoform X14.6e-21785.84Show/hide
Query:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI
        M PAVN L L++WTP+IEKSSMNWICGKFLSFQK            GK+WGGCWGALSCFHSQKGEKRIVPASRLPEGN VTTQPN P  AGM  QATVI
Subjt:  MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVI

Query:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS
         PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSSS
Subjt:  TPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS

Query:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF
        +DLKG GK NYIASNDLQ AYSLYPGSP+SSLVSPISRTSGDCL SSFPERDFP QWNPS S QDGKYPR+GSGRLFG+EKA GTSL SQDSNFFCPATF
Subjt:  VDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFPHTGGRLSVSKDSDVYS  GN +QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL
Subjt:  AQFYLDNPPFPHTGGRLSVSKDSDVYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ
         E L S HTTLQSQR IKS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQ
Subjt:  GEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQ

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766602.2e-11561.19Show/hide
Query:  KRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTM
        KRWGGC G  SCF SQKG KRIVPASR+PE GN   +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C++SL+ANSPGGPSS+M
Subjt:  KRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTM

Query:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCL
        +ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS+L SPISR SGD L
Subjt:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCL

Query:  SSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSS--GNGYQNRHSKSP
         S                 Q+GK  RS SG  FG +   G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY ++  GNG QNR ++SP
Subjt:  SSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSS--GNGYQNRHSKSP

Query:  KQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKD
        KQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL      L SQ S KS  ++  +    +     N YKD
Subjt:  KQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKD

Query:  NK
        +K
Subjt:  NK

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)3.5e-2841.55Show/hide
Query:  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFAT
        ++W   W  L CF S +  KRI  +  +PE  ++++  +    +G   ++ + T   +APPSSPASF  S  PS  QSP   +S S   P     ++FA 
Subjt:  KRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFAT

Query:  GPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDC
        GPYAHETQLVSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PGSP   L+SP   + G  
Subjt:  GPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDC

Query:  LSSSFPE
         +S FP+
Subjt:  LSSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.5e-11661.19Show/hide
Query:  KRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTM
        KRWGGC G  SCF SQKG KRIVPASR+PE GN   +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C++SL+ANSPGGPSS+M
Subjt:  KRWGGCWGALSCFHSQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMTNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTM

Query:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCL
        +ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQA YSLYPGSPAS+L SPISR SGD L
Subjt:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAAYSLYPGSPASSLVSPISRTSGDCL

Query:  SSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSS--GNGYQNRHSKSP
         S                 Q+GK  RS SG  FG +   G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY ++  GNG QNR ++SP
Subjt:  SSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSSS--GNGYQNRHSKSP

Query:  KQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKD
        KQD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL      L SQ S KS  ++  +    +     N YKD
Subjt:  KQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYKD

Query:  NK
        +K
Subjt:  NK

AT4G25620.1 hydroxyproline-rich glycoprotein family protein7.9e-2839.92Show/hide
Query:  SVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFMSLSANS
        S  Q S   K+ G  W    CF S+K  KRI  A  +PE  A +     P     +N  ++  P  +APPSSPASF  S  PS      P    SL+ N 
Subjt:  SVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFMSLSANS

Query:  PGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYIASNDLQAAYSLYPGSPASS
        P  PS+  F  GPYAHETQ V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++    +  +YPGSP  +
Subjt:  PGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYIASNDLQAAYSLYPGSPASS

Query:  LVSPISRTS----GDCL--------------SSSFPERDFPSQWNPSASLQDGKYPRSGSGRL
        L+SP S TS    G C                  F  R + S++   +    G+  R GSG L
Subjt:  LVSPISRTS----GDCL--------------SSSFPERDFPSQWNPSASLQDGKYPRSGSGRL

AT5G52430.1 hydroxyproline-rich glycoprotein family protein2.6e-3144.16Show/hide
Query:  RWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSAN--SPGGPSSTMFA
        RWG CW   SCF +QK  KRI  A  +PE   VT+              TV+ P  +APPSSPASF  S   S   SP   +SL++N  SP  P S +F 
Subjt:  RWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSAN--SPGGPSSTMFA

Query:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDL----KGTGKANYIASNDLQ-AAYSLYPGSP-ASSLVSPISRT
         GPYA+ETQ V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      +G     +S+  +  +  + PGSP   +L+SP S  
Subjt:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDL----KGTGKANYIASNDLQ-AAYSLYPGSP-ASSLVSPISRT

Query:  SGDCLSSSFPERDFPSQWNPSASLQDGKYPR
        S    SS +P +      +P    + G+ P+
Subjt:  SGDCLSSSFPERDFPSQWNPSASLQDGKYPR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCTGCTGTTAATGCTTTGAAGTTGAACATATGGACTCCAATGATTGAGAAATCTAGCATGAACTGGATATGTGGAAAGTTCCTTTCCTTTCAGAAGGGTGGCTG
TTTATCTGTTGCTCAAGCTTCTTGGCCAGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAGGGAGAAAAGCGCATCGTACCTGCATCTC
GTTTACCTGAGGGCAATGCCGTGACAACCCAGCCTAATGGACCTCAAGCAGCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCTTCA
CCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCACTGTCTGCAAACTCACCTGGAGGTCCTTCATCCACAATGTTTGCTAC
AGGGCCATATGCGCATGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCCGAACTAGCTCATCTAA
CCACGCCTTCTTCCCCTGATGTGCCCTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCA
TATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCCTCATTTCCTGAAAGGGACTTCCCATCACAGTG
GAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTGGTACATCGTTGGCATCTCAGGATTCTAATT
TCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCGAAGGATTCAGATGTCTACTCTTCTAGTGGG
AATGGATACCAGAACCGGCACAGTAAATCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACAGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACA
GTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTATCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAAC
TAAAATCCACGCATACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGCTGGCATTATGCAATGGTTATAAA
GACAATAAATTGCAAAGACAACCTGGTAATATGTCAGGATCAAGTACTTCAAACCAAAGGTTGACTACGGAAGGGGAAGGAGCCCAAGGGAGGCCAGGGAAGATTTTTCA
TGGCATGACTAAGACATCCTCTGGAATAGTTTACAGTGTGTTTGTGTATCTGTTTTTCTGCTTTACAGGCCCTTGTTACACGGATGCTGGTTGGGCTAGTTGCCCGAATG
ATCGTCGTAGGAGTAGTGGTTTAGTGGTTTTTGCAATTTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCTGCTGTTAATGCTTTGAAGTTGAACATATGGACTCCAATGATTGAGAAATCTAGCATGAACTGGATATGTGGAAAGTTCCTTTCCTTTCAGAAGGGTGGCTG
TTTATCTGTTGCTCAAGCTTCTTGGCCAGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACTCTCAGAAGGGAGAAAAGCGCATCGTACCTGCATCTC
GTTTACCTGAGGGCAATGCCGTGACAACCCAGCCTAATGGACCTCAAGCAGCAGGAATGACCAACCAGGCTACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCTTCA
CCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCACTGTCTGCAAACTCACCTGGAGGTCCTTCATCCACAATGTTTGCTAC
AGGGCCATATGCGCATGAAACACAGCTGGTTTCTCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCCGAACTAGCTCATCTAA
CCACGCCTTCTTCCCCTGATGTGCCCTTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCAGCA
TATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCCTCATTTCCTGAAAGGGACTTCCCATCACAGTG
GAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTGGTACATCGTTGGCATCTCAGGATTCTAATT
TCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGGAGGTTAAGTGTATCGAAGGATTCAGATGTCTACTCTTCTAGTGGG
AATGGATACCAGAACCGGCACAGTAAATCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACAGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACA
GTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTCTATCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAAC
TAAAATCCACGCATACAACTTTACAGAGTCAGAGAAGTATTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGCTGGCATTATGCAATGGTTATAAA
GACAATAAATTGCAAAGACAACCTGGTAATATGTCAGGATCAAGTACTTCAAACCAAAGGTTGACTACGGAAGGGGAAGGAGCCCAAGGGAGGCCAGGGAAGATTTTTCA
TGGCATGACTAAGACATCCTCTGGAATAGTTTACAGTGTGTTTGTGTATCTGTTTTTCTGCTTTACAGGCCCTTGTTACACGGATGCTGGTTGGGCTAGTTGCCCGAATG
ATCGTCGTAGGAGTAGTGGTTTAGTGGTTTTTGCAATTTTCTAG
Protein sequenceShow/hide protein sequence
MSPAVNALKLNIWTPMIEKSSMNWICGKFLSFQKGGCLSVAQASWPGKRWGGCWGALSCFHSQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMTNQATVITPSLLAPPSS
PASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQAA
YSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPSQWNPSASLQDGKYPRSGSGRLFGNEKAGGTSLASQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSSSG
NGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTLQSQRSIKSAPEVVEKETCTEVLALCNGYK
DNKLQRQPGNMSGSSTSNQRLTTEGEGAQGRPGKIFHGMTKTSSGIVYSVFVYLFFCFTGPCYTDAGWASCPNDRRRSSGLVVFAIF