CuGenDBv2

Lsi02G030070 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G030070
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: chr02:36142910..36147617
RNA-Seq Expression: Lsi02G030070
Synteny: Lsi02G030070
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like


Homology
GenBank top hits (e value, % identity, alignment)
XP_022936803.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita moschata] | e value: 2.6e-240 | % identity: 85.21
Query:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV
        M PAVN L LD+WTP+IEKSSMNWICGKFLSFQK             GK+WGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPN P AAGM+ QATV
Subjt:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV

Query:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS
        I PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSS
Subjt:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS

Query:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF
        SVDLKGTGK NYIASNDLQ  YSLYPGSP+SSLVSPISRTSGDCL SSFPERDFPPQWNPS S QDGKYPR+GSGRLFG+EKAGTSL SQDSNFFCPATF
Subjt:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFP+TGGRLSVSKDSD YS  GN  QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EP LL
Subjt:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK
         E L S HTT QS R  KS P+VV+K+TCTEVLALC  Y+DNKLQRQPGNMSGSST NQV  DVFSRIG  KNSRKYNLGLSCSDAEVDYRRGRSLRE K
Subjt:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK

Query:  GDFSWHD
        GDF WHD
Subjt:  GDFSWHD

XP_022975613.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita maxima] | e value: 4.1e-241 | % identity: 85.01
Query:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV
        M PAVN L LD+WTP+IEKSSMNWICGKFLSFQK             GK+WGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPN P  AGM+ QATV
Subjt:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV

Query:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS
        I PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSS
Subjt:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS

Query:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF
        S+DLKG GK NYIASNDLQ  YSLYPGSP+SSLVSPISRTSGDCL SSFPERDFPPQWNPS S QDGKYPR+GSGRLFG+EKAGTSL SQDSNFFCPATF
Subjt:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFP+TGGRLSVSKDSD YS  GN +QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL
Subjt:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK
         E L S HTT QSQR  KS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQV  DVFSRIG  KNSRKYNLGLSCSDAEVDYRRGRSLRE K
Subjt:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK

Query:  GDFSWHD
        GDF WHD
Subjt:  GDFSWHD

XP_023521113.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 4.1e-241 | % identity: 85.4
Query:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV
        M PAVN L LD+WTP+IEKSSMNWICGKFLSFQK             GK+WGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPN P AAGM+ QATV
Subjt:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV

Query:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS
        I PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSS
Subjt:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS

Query:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF
        SVDLKGTGK NYIASNDLQ  YSLYPGSP+SSLVSPISRTSGDCL SSFPERDF PQWNPS S QDGKYPR+GSGRLFG+EKAGTSL SQDSNFFCPATF
Subjt:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFP+TGGRLSVSKDSD YS  GN  QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL
Subjt:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK
         E L S HTT QSQR  KS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQV  DVFSRIG  KNSRKYNLGLSCSDAEVDYRRGRSLRE K
Subjt:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK

Query:  GDFSWHD
        GDF WHD
Subjt:  GDFSWHD

XP_023521115.1 uncharacterized protein At1g76660-like isoform X2 [Cucurbita pepo subsp. pepo] | e value: 5.3e-241 | % identity: 85.4
Query:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV
        M PAVN L LD+WTP+IEKSSMNWICGKFLSFQK             GK+WGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPN P AAGM+ QATV
Subjt:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV

Query:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS
        I PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSS
Subjt:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS

Query:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF
        SVDLKGTGK NYIASNDLQ  YSLYPGSP+SSLVSPISRTSGDCL SSFPERDF PQWNPS S QDGKYPR+GSGRLFG+EKAGTSL SQDSNFFCPATF
Subjt:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFP+TGGRLSVSKDSD YS  GN  QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL
Subjt:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK
         E L S HTT QSQR  KS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQV  DVFSRIG  KNSRKYNLGLSCSDAEVDYRRGRSLRE K
Subjt:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK

Query:  GDFSWHD
        GDF WHD
Subjt:  GDFSWHD

XP_038899313.1 uncharacterized protein At1g76660 [Benincasa hispida] | e value: 2.9e-247 | % identity: 96.51
Query:  GKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
        GKRWGGCWGALSCFH+QKGEKRIVPASRLPEGNAVTTQPNGPQAAGM+NQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
Subjt:  GKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA

Query:  TGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSS
        TGPYAHETQLVSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQA YSLYPGSPASSLVSPISRTSGDCLSS
Subjt:  TGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSS

Query:  SFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEE
        SFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKA TSLASQDSNFFCPATFAQFYLDNPPFP+TGGRLSVSKDSDAYSSSGNGYQNRH+KSPKQDVEE
Subjt:  SFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEE

Query:  IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQ
        IEAYRASFGFSADEII+TTQYVEISDVMEDSFTMRPFTST+LSAEESIEPPLLGEKLKSTHTT QSQRS KSAPEVVEKETCTEVLALCNGYKDNKLQRQ
Subjt:  IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQ

Query:  PGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSW
        PGNMSGSSTSNQVEKD+FSRIGS KNSRKYNLGLS SDAEVDYRRGRSLRE KGD SW
Subjt:  PGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSW

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BV86 uncharacterized protein At1g76660 | e value: 2.0e-238 | % identity: 93.04
Query:  GKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
        GKRWGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPNGPQAAGM+NQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++A
Subjt:  GKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA

Query:  TGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSS
        TGPYAHETQ VSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQA YSLYPGSPASSLVSPISRTSGDCLSS
Subjt:  TGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSS

Query:  SFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEE
        SFPERDF PQWN SASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDN  FP+TGGRLSVSKDSD YSS GNGYQNRHSKSPKQDVEE
Subjt:  SFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEE

Query:  IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQ
        IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+HTT Q+QRS KSAPEVVEKETCTEV ALCNGYKDNKLQRQ
Subjt:  IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQ

Query:  PGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSWHD
        PG++ GSSTS+QVEKDVFSRIGS KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  PGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSWHD

A0A5A7VFM0 Uncharacterized protein | e value: 1.2e-238 | % identity: 92.84
Query:  MGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMF
        +GKRWGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPNGPQAAGM+NQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++
Subjt:  MGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMF

Query:  ATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLS
        ATGPYAHETQ VSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQA YSLYPGSPASSLVSPISRTSGDCLS
Subjt:  ATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLS

Query:  SSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVE
        SSFPERDF PQWN SASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDN  FP+TGGRLSVSKDSD YSS GNGYQNRHSKSPKQDVE
Subjt:  SSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVE

Query:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQR
        EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+HTT Q+QRS KSAPEVVEKETCTEV ALCNGYKDNKLQR
Subjt:  EIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQR

Query:  QPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSWHD
        QPG++ GSSTS+QVEKDVFSRIGS KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  QPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSWHD

A0A5D3D8J8 Uncharacterized protein | e value: 2.0e-238 | % identity: 93.04
Query:  GKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA
        GKRWGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPNGPQAAGM+NQATVITPSLLAPPSSPASFTNSALPSTVQSPSCF+SLSANSPGGPSST++A
Subjt:  GKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFA

Query:  TGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSS
        TGPYAHETQ VSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSS DLKGTGKANYIASNDLQA YSLYPGSPASSLVSPISRTSGDCLSS
Subjt:  TGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSS

Query:  SFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEE
        SFPERDF PQWN SASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDN  FP+TGGRLSVSKDSD YSS GNGYQNRHSKSPKQDVEE
Subjt:  SFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEE

Query:  IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQ
        IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEES EPPLLGEKLKS+HTT Q+QRS KSAPEVVEKETCTEV ALCNGYKDNKLQRQ
Subjt:  IEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQ

Query:  PGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSWHD
        PG++ GSSTS+QVEKDVFSRIGS KNSRKY+LGLSCSDAEVDYRRGRSLRE KG+ SWHD
Subjt:  PGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSWHD

A0A6J1F9B9 uncharacterized protein At1g76660-like isoform X1 | e value: 1.3e-240 | % identity: 85.21
Query:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV
        M PAVN L LD+WTP+IEKSSMNWICGKFLSFQK             GK+WGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPN P AAGM+ QATV
Subjt:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV

Query:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS
        I PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSS
Subjt:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS

Query:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF
        SVDLKGTGK NYIASNDLQ  YSLYPGSP+SSLVSPISRTSGDCL SSFPERDFPPQWNPS S QDGKYPR+GSGRLFG+EKAGTSL SQDSNFFCPATF
Subjt:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFP+TGGRLSVSKDSD YS  GN  QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EP LL
Subjt:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK
         E L S HTT QS R  KS P+VV+K+TCTEVLALC  Y+DNKLQRQPGNMSGSST NQV  DVFSRIG  KNSRKYNLGLSCSDAEVDYRRGRSLRE K
Subjt:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK

Query:  GDFSWHD
        GDF WHD
Subjt:  GDFSWHD

A0A6J1IL36 uncharacterized protein At1g76660-like isoform X1 | e value: 2.0e-241 | % identity: 85.01
Query:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV
        M PAVN L LD+WTP+IEKSSMNWICGKFLSFQK             GK+WGGCWGALSCFH+QKGEKRIVPASRLPEGN VTTQPN P  AGM+ QATV
Subjt:  MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATV

Query:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS
        I PSLLAPPSSPASFTNSALPST QSPSCF+S+SANSPGGPSSTMFATGPYAHETQLVSPP+FSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFA+FLSS
Subjt:  ITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSS

Query:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF
        S+DLKG GK NYIASNDLQ  YSLYPGSP+SSLVSPISRTSGDCL SSFPERDFPPQWNPS S QDGKYPR+GSGRLFG+EKAGTSL SQDSNFFCPATF
Subjt:  SVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATF

Query:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL
        AQFYLDNPPFP+TGGRLSVSKDSD YS  GN +QNRH+KSPKQDVEE+EAYRASFGFSADEIITTTQYVEIS VMEDSFTM+PFTSTSLSAEES EPPLL
Subjt:  AQFYLDNPPFPNTGGRLSVSKDSDAYSSSGNGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLL

Query:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK
         E L S HTT QSQR  KS P+VV+K+TCTEVLALC+ Y+DNKLQRQPGNMSGSST NQV  DVFSRIG  KNSRKYNLGLSCSDAEVDYRRGRSLRE K
Subjt:  GEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK

Query:  GDFSWHD
        GDF WHD
Subjt:  GDFSWHD

SwissProt top hits (e value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660 | e value: 5.0e-125 | % identity: 58.3
Query:  KRWGGCWGALSCFHAQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMSNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTM
        KRWGGC G  SCF +QKG KRIVPASR+PE GN   +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C++SL+ANSPGGPSS+M
Subjt:  KRWGGCWGALSCFHAQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMSNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTM

Query:  FATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCL
        +ATGPYAHETQLVSPP+FS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQATYSLYPGSPAS+L SPISR SGD L
Subjt:  FATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCL

Query:  SSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NPPFPNTGGRLSVSKDSDAYSSS--GNGYQNRHSKSPK
         S                 Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSD Y ++  GNG QNR ++SPK
Subjt:  SSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NPPFPNTGGRLSVSKDSDAYSSS--GNGYQNRHSKSPK

Query:  QDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDN
        QD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL        SQ S KS  ++  +    +     N YKD+
Subjt:  QDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDN

Query:  KLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK
        K + +          +  E+ + SR+GS K SR Y+  +S SDAEV+YRRGRSLRE++
Subjt:  KLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK

Arabidopsis top hits (e value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) | e value: 5.8e-28 | % identity: 40.58
Query:  KRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFAT
        ++W   W  L CF + +  KRI  +  +PE   V+   +    +    ++ + T   +APPSSPASF  S  PS  QSP   +S S   P     ++FA 
Subjt:  KRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFAT

Query:  GPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDC
        GPYAHETQLVSPP+FS +TTEPS+AP+TPP + + +    TTPSSP+VPFAQ  +S+      G    ++S+     Y L PGSP   L+SP   + G  
Subjt:  GPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDC

Query:  LSSSFPE
         +S FP+
Subjt:  LSSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown | e value: 3.5e-126 | % identity: 58.3
Query:  KRWGGCWGALSCFHAQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMSNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTM
        KRWGGC G  SCF +QKG KRIVPASR+PE GN   +QPNG   AG+ N   A  I  SLLAPPSSPASFTNSALPST QSP+C++SL+ANSPGGPSS+M
Subjt:  KRWGGCWGALSCFHAQKGEKRIVPASRLPE-GNAVTTQPNGPQAAGMSNQ--ATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTM

Query:  FATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCL
        +ATGPYAHETQLVSPP+FS FTTEPSTAP TPPPELA LT PSSPDVP+A+FL+SS+DLK +GK +Y   NDLQATYSLYPGSPAS+L SPISR SGD L
Subjt:  FATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQATYSLYPGSPASSLVSPISRTSGDCL

Query:  SSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NPPFPNTGGRLSVSKDSDAYSSS--GNGYQNRHSKSPK
         S                 Q+GK  RS SG  FG +  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSD Y ++  GNG QNR ++SPK
Subjt:  SSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLD-NPPFPNTGGRLSVSKDSDAYSSS--GNGYQNRHSKSPK

Query:  QDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDN
        QD+EE+EAYRASFGFSADEIITT+QYVEI+DVM+ SF    ++            P  G+KL        SQ S KS  ++  +    +     N YKD+
Subjt:  QDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYKDN

Query:  KLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK
        K + +          +  E+ + SR+GS K SR Y+  +S SDAEV+YRRGRSLRE++
Subjt:  KLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETK

AT4G25620.1 hydroxyproline-rich glycoprotein family protein | e value: 1.5e-28 | % identity: 44.83
Query:  SMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFMSLSANSPGGPSS
        S+ K+ G  W    CF ++K  KRI  A  +PE  A +     P     SN  ++  P  +APPSSPASF  S  PS      P    SL+ N P  PS+
Subjt:  SMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPST--VQSPSCFMSLSANSPGGPSS

Query:  TMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYIASNDLQATYSLYPGSPASSLVSPIS
          F  GPYAHETQ V+PP+FSAFTTEPSTAP TPPPE     +PSSP+VPFAQ L+SS++       G     + A++    +  +YPGSP  +L+SP S
Subjt:  TMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLK-----GTGKANYIASNDLQATYSLYPGSPASSLVSPIS

Query:  RTS
         TS
Subjt:  RTS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein | e value: 1.9e-31 | % identity: 42.17
Query:  RWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSAN--SPGGPSSTMFA
        RWG CW   SCF  QK  KRI  A  +PE   VT+          +   TV+ P  +APPSSPASF  S   S   SP   +SL++N  SP  P S +F 
Subjt:  RWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPSSPASFTNSALPSTVQSPSCFMSLSAN--SPGGPSSTMFA

Query:  TGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDL----KGTGKANYIASNDLQ-ATYSLYPGSP-ASSLVSPISRT
         GPYA+ETQ V+PP+FSAF TEPSTAP TPPPE + H+TTPSSP+VPFAQ L+SS++L      +G     +S+  +  +  + PGSP   +L+SP S  
Subjt:  TGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAQFLSSSVDL----KGTGKANYIASNDLQ-ATYSLYPGSP-ASSLVSPISRT

Query:  SGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLAS
        S    SS +P +      +P    + G+ P+      F   K G+   S
Subjt:  SGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLAS
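
Each alignment above has an unlabeled match line between the Query and Subjt lines: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch of how identity could be counted from such a match line. The function name `midline_stats` is hypothetical and not part of CuGenDB, and the sketch assumes the match line has been extracted with its spacing intact. The % identity values reported for each hit cover the full alignment, so a single block will generally give a different figure.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style match line.

    Hypothetical helper: letters are identical residues, '+' marks a
    conservative substitution, spaces are mismatches or gaps.
    """
    length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, length


# Final block of the first GenBank hit: query GDFSWHD, match line "GDF WHD".
identities, positives, length = midline_stats("GDF WHD")
print(f"{identities}/{length} identical ({100 * identities / length:.2f}%)")
```

For this seven-residue block the match line has six letters, so the sketch reports 6 of 7 positions as identical.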


Sequences
CDS sequence
ATGTCTCCTGCTGTTAATGCTTTGAAGTTGGACATATGGACTCCAATGATTGAGAAATCCAGCATGAACTGGATATGTGGAAAGTTCCTTTCCTTTCAGAAGGGTGGTTG
TTTCTGTTGCTCAAGCTTCTTGGTCAGTATGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACGCCCAGAAAGGAGAGAAGCGCATTGTACCTGCAT
CTCGTTTACCTGAGGGCAATGCTGTGACAACCCAACCTAATGGACCTCAAGCAGCAGGAATGAGCAACCAGGCCACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCT
TCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTTGC
TACAGGGCCATATGCGCACGAAACACAGCTGGTCTCTCCTCCTATTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCTGAACTAGCTCACC
TAACCACACCTTCTTCCCCTGATGTGCCATTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCA
ACATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCTTCATTTCCTGAGAGGGACTTTCCACCACA
GTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTACATCATTGGCATCTCAGGATTCTAATT
TCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTAATACTGGTGGGAGGTTAAGCGTATCGAAGGATTCAGATGCCTACTCCTCTAGTGGG
AATGGATACCAGAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACA
GTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTTTGTCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAAC
TAAAATCCACACATACAACTTCACAGAGTCAGAGAAGTAGTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGTTGGCATTATGCAATGGTTATAAA
GATAATAAATTGCAAAGACAACCTGGTAACATGTCAGGATCAAGTACTTCAAACCAAGTTGAAAAAGACGTATTCTCAAGGATAGGGTCATTCAAAAATAGTCGCAAGTA
TAATCTTGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGGGGAAGGAGCCTAAGGGAGACCAAGGGAGATTTTTCATGGCATGACTAA
mRNA sequence
ATGTCTCCTGCTGTTAATGCTTTGAAGTTGGACATATGGACTCCAATGATTGAGAAATCCAGCATGAACTGGATATGTGGAAAGTTCCTTTCCTTTCAGAAGGGTGGTTG
TTTCTGTTGCTCAAGCTTCTTGGTCAGTATGGGAAAGAGATGGGGTGGATGTTGGGGTGCATTATCTTGTTTTCACGCCCAGAAAGGAGAGAAGCGCATTGTACCTGCAT
CTCGTTTACCTGAGGGCAATGCTGTGACAACCCAACCTAATGGACCTCAAGCAGCAGGAATGAGCAACCAGGCCACAGTGATAACTCCATCCCTTCTAGCCCCACCTTCT
TCACCAGCATCTTTTACAAATTCTGCACTCCCTTCAACAGTCCAATCACCTAGCTGTTTCATGTCGTTGTCTGCCAACTCACCTGGAGGTCCTTCATCCACAATGTTTGC
TACAGGGCCATATGCGCACGAAACACAGCTGGTCTCTCCTCCTATTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCCCTCACCCCCCCACCTGAACTAGCTCACC
TAACCACACCTTCTTCCCCTGATGTGCCATTTGCTCAGTTCCTATCCTCATCGGTGGATCTCAAAGGAACTGGAAAGGCCAATTACATTGCTTCAAATGATCTTCAAGCA
ACATATTCTCTCTACCCTGGAAGTCCTGCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCTGGCGATTGCTTATCATCTTCATTTCCTGAGAGGGACTTTCCACCACA
GTGGAATCCTTCAGCTTCTCTCCAAGATGGAAAATATCCAAGAAGTGGTTCTGGTCGGCTATTTGGAAATGAGAAAGCTGGTACATCATTGGCATCTCAGGATTCTAATT
TCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTAATACTGGTGGGAGGTTAAGCGTATCGAAGGATTCAGATGCCTACTCCTCTAGTGGG
AATGGATACCAGAACCGGCACAGTAAGTCTCCAAAACAAGATGTGGAGGAAATAGAAGCTTACCGAGCATCGTTTGGTTTCAGTGCGGATGAAATTATAACTACTACACA
GTATGTGGAGATATCTGATGTAATGGAGGATTCCTTTACTATGAGACCTTTTACCTCAACTAGTTTGTCAGCAGAAGAAAGTATTGAACCTCCATTGTTGGGTGAAAAAC
TAAAATCCACACATACAACTTCACAGAGTCAGAGAAGTAGTAAATCAGCACCTGAGGTTGTCGAAAAGGAAACCTGCACTGAAGTGTTGGCATTATGCAATGGTTATAAA
GATAATAAATTGCAAAGACAACCTGGTAACATGTCAGGATCAAGTACTTCAAACCAAGTTGAAAAAGACGTATTCTCAAGGATAGGGTCATTCAAAAATAGTCGCAAGTA
TAATCTTGGTTTATCCTGCTCTGATGCAGAAGTTGACTACAGAAGGGGAAGGAGCCTAAGGGAGACCAAGGGAGATTTTTCATGGCATGACTAA
Protein sequence
MSPAVNALKLDIWTPMIEKSSMNWICGKFLSFQKGGCFCCSSFLVSMGKRWGGCWGALSCFHAQKGEKRIVPASRLPEGNAVTTQPNGPQAAGMSNQATVITPSLLAPPS
SPASFTNSALPSTVQSPSCFMSLSANSPGGPSSTMFATGPYAHETQLVSPPIFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAQFLSSSVDLKGTGKANYIASNDLQA
TYSLYPGSPASSLVSPISRTSGDCLSSSFPERDFPPQWNPSASLQDGKYPRSGSGRLFGNEKAGTSLASQDSNFFCPATFAQFYLDNPPFPNTGGRLSVSKDSDAYSSSG
NGYQNRHSKSPKQDVEEIEAYRASFGFSADEIITTTQYVEISDVMEDSFTMRPFTSTSLSAEESIEPPLLGEKLKSTHTTSQSQRSSKSAPEVVEKETCTEVLALCNGYK
DNKLQRQPGNMSGSSTSNQVEKDVFSRIGSFKNSRKYNLGLSCSDAEVDYRRGRSLRETKGDFSWHD
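
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal Python sketch under that assumption; the `translate` function and the variable names are illustrative and not part of CuGenDB. It uses only the first 48 and last 24 nucleotides of the CDS listed above, and the same function can be applied to the full sequence after joining its lines.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS sequence above (start and end of the record).
cds_start = "ATGTCTCCTGCTGTTAATGCTTTGAAGTTGGACATATGGACTCCAATG"
cds_end = "GGAGATTTTTCATGGCATGACTAA"

print(translate(cds_start))  # MSPAVNALKLDIWTPM
print(translate(cds_end))    # GDFSWHD
```

Both translated excerpts agree with the start and end of the protein sequence above, and the CDS ends in the stop codon TAA.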