; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C002575 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C002575
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionHydroxyproline-rich glycoprotein
Genome locationchr12:21704574..21709694
RNA-Seq ExpressionMELO3C002575
SyntenyMELO3C002575
Gene Ontology termsNA
InterPro domainsIPR040265 - Protein CHUP1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK26388.1 hydroxyproline-rich glycoprotein [Cucumis melo var. makuwa]1.2e-22696.23Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
        PEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
Subjt:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK

Query:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
        DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
Subjt:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA

Query:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
        IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
Subjt:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE

Query:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSP---PSTC
        RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPP  P   P  C
Subjt:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSP---PSTC

XP_008442261.1 PREDICTED: uncharacterized protein At4g04980 isoform X1 [Cucumis melo]2.5e-22794.03Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
        PEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
Subjt:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK

Query:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
        DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
Subjt:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA

Query:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
        IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
Subjt:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE

Query:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANANATTST
        RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPP PP   P +  L  QP+  N    T T
Subjt:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANANATTST

XP_031736251.1 uncharacterized protein At4g04980 isoform X1 [Cucumis sativus]2.1e-21087Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLE--------------------TMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLE                    TMK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRRKAYGLE--------------------TMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE

Query:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIID
        RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIID
Subjt:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIID

Query:  CIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHL
        CIVSMANERFDAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHL
Subjt:  CIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHL

Query:  LLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEV
        LLPRLSHCGVNVCPAPTRVAIVEES MD+DDKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NE+IEV D KEEKLNLSRTASLKADRNEEIEV
Subjt:  LLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEV

Query:  IDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTC--PSSATL
        IDIEEEK CL++ NSQ+DIAERT+DFDSQA A  QELPTSDLPTVVSKPLPLL  MAPPPPPP PP+    PS  TL
Subjt:  IDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTC--PSSATL

XP_031736252.1 uncharacterized protein At4g04980 isoform X2 [Cucumis sativus]5.4e-21490.81Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFR+KAYGLETMK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
        PEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
Subjt:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK

Query:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
        DSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
Subjt:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA

Query:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIA
        IVEES MD+DDKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NE+IEV D KEEKLNLSRTASLKADRNEEIEVIDIEEEK CL++ NSQ+DIA
Subjt:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIA

Query:  ERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTC--PSSATL
        ERT+DFDSQA A  QELPTSDLPTVVSKPLPLL  MAPPPPPP PP+    PS  TL
Subjt:  ERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTC--PSSATL

XP_031736253.1 uncharacterized protein At4g04980 isoform X3 [Cucumis sativus]2.1e-21087Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLE--------------------TMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE
        MATGGWCGLGPLLFR+KAYGLE                    TMK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLE
Subjt:  MATGGWCGLGPLLFRRKAYGLE--------------------TMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLE

Query:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIID
        RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIID
Subjt:  RSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIID

Query:  CIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHL
        CIVSMANERFDAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHL
Subjt:  CIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHL

Query:  LLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEV
        LLPRLSHCGVNVCPAPTRVAIVEES MD+DDKL SENTDAADANNEMEVCDIKEEKDLSKEASQKAD NE+IEV D KEEKLNLSRTASLKADRNEEIEV
Subjt:  LLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEV

Query:  IDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTC--PSSATL
        IDIEEEK CL++ NSQ+DIAERT+DFDSQA A  QELPTSDLPTVVSKPLPLL  MAPPPPPP PP+    PS  TL
Subjt:  IDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTC--PSSATL

TrEMBL top hitse value%identityAlignment
A0A0A0LSA3 Uncharacterized protein6.2e-19286.65Show/hide
Query:  MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRG
        MK+SSYVFSKTYSKK KLSKGARSKKSS CKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIE     
Subjt:  MKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRG

Query:  NNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSET
             QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSS+SRTSSFGKSSSSTDSCSET
Subjt:  NNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSET

Query:  NSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADA
        NSSCCSSPETPTSVLANFR+SERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEES MD+DDKL SENTDAADA
Subjt:  NSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADA

Query:  NNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLP
        NNEMEVCDIKEEKDLSKEASQKAD NE+IEV D KEEKLNLSRTASLKADRNEEIEVIDIEEEK CL++ NSQ+DIAERT+DFDSQA A  QELPTSDLP
Subjt:  NNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEK-CLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLP

Query:  TVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANAN
        TV            PPPPPP P         + +QPA A  N
Subjt:  TVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANAN

A0A1S3B5Y5 uncharacterized protein At4g04980 isoform X11.2e-22794.03Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
        PEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
Subjt:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK

Query:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
        DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
Subjt:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA

Query:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
        IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
Subjt:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE

Query:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANANATTST
        RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPP PP   P +  L  QP+  N    T T
Subjt:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANANATTST

A0A1S3B627 uncharacterized protein At4g04980 isoform X23.9e-17890.59Show/hide
Query:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVL
        PS   +     LVVGTMEDLQKLYPEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVL
Subjt:  PSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVL

Query:  GIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPID
        GIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPID
Subjt:  GIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPID

Query:  VKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNE
        VKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNE
Subjt:  VKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNE

Query:  EIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANANATTST
        EIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPP PP   P +  L  QP+  N    T T
Subjt:  EIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANANATTST

A0A5A7UTX7 Hydroxyproline-rich glycoprotein1.2e-22794.03Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
        PEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
Subjt:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK

Query:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
        DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
Subjt:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA

Query:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
        IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
Subjt:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE

Query:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANANATTST
        RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPP PP   P +  L  QP+  N    T T
Subjt:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSPPSTCPSSATLHSQPATANANATTST

A0A5D3DSA9 Hydroxyproline-rich glycoprotein6.0e-22796.23Show/hide
Query:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
        MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY
Subjt:  MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLY

Query:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
        PEIISDIQYSEMKTTCIE          QSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK
Subjt:  PEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSK

Query:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
        DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA
Subjt:  DSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVA

Query:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
        IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE
Subjt:  IVEESTMDVDDKLTSENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAE

Query:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSP---PSTC
        RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPP  P   P  C
Subjt:  RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSP---PSTC

SwissProt top hitse value%identityAlignment
Q1PEB4 Uncharacterized protein At4g049803.1e-1528.33Show/hide
Query:  MEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDA
        M+DLQKL PEI++  Q  EM+   ++          + L +F   L++IGDSW+++ +W  +SKY  S   +N S   +VE VL  +D ++    ERF  
Subjt:  MEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDA

Query:  MD-------EFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRL
        MD        F   K  S     SF +S+    S SE+N+S   SP TP SVL         S        +SP LW+LR QA+++L+P+D+K   +  L
Subjt:  MD-------EFVNSKDSSFSRTSSFGKSSSSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRL

Query:  SHCGVNVCPAPTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDIK--------------EEKDLSK------------EASQKADGNEKIEVLDT
        S    +   + T++ I EE+     + ++   E+ D +    E    +IK              E KD S+            E   + D ++ IE  +T
Subjt:  SHCGVNVCPAPTRVAIVEESTMD--VDDKLTSENTDAADANNEMEVCDIK--------------EEKDLSK------------EASQKADGNEKIEVLDT

Query:  KEEKLNLSRTASLKADRNEEIEVIDIEEEKC----LSKANSQEDIAE-RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLS--------KMAPPPPPPSP
        +   +    T   + D N+ IE  + E            N  ED +E  T++ DS  ++  +++P     T    P P +S        +  PPPPPPSP
Subjt:  KEEKLNLSRTASLKADRNEEIEVIDIEEEKC----LSKANSQEDIAE-RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLS--------KMAPPPPPPSP

Query:  PSTCPS
            P+
Subjt:  PSTCPS

Arabidopsis top hitse value%identityAlignment
AT1G11070.1 BEST Arabidopsis thaliana protein match is: Hydroxyproline-rich glycoprotein family protein (TAIR:AT1G61080.1)6.9e-1830.14Show/hide
Query:  RRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKT
        RR A   E +KN+       + +  K+S  + S  SS+   NF+ M+ELR+KI+  R IIDLP L    SI+ +V+ TM+DL KL PEII   Q  EM+ 
Subjt:  RRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKT

Query:  TCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSS
          ++ KL  N         F  ALKSIGDSW+ +HEW  KSKY  S+ ++N S   +VE VL  +D ++   NER +  +   N       +      S 
Subjt:  TCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSS

Query:  SSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLT
         ST + ++ + S    P  P +VL    S   K      +S S+ L  ++R+QA+ KL+PIDVK L +   S                         +  
Subjt:  SSTDSCSETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLT

Query:  SENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKA--------DRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFD
        S N D  D + +++  + +E+   +KEA  +   + K ++ D    K+++      K+          N  I V     E  L+ +  +E  A       
Subjt:  SENTDAADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKA--------DRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFD

Query:  SQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSP
          A      LP +    V + PLP     A PPPPP P
Subjt:  SQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPPPPSP

AT1G61080.1 Hydroxyproline-rich glycoprotein family protein3.5e-3033.11Show/hide
Query:  KGARSKKSS---RCKDNFVQMMELRKKILILRDIIDLPSLERSASINE---------LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQS
        K ARS K+S   +   NF+ M+ELR+KI   RDIIDL +L+ S SI +         +V+ TM+DLQK+ PEII      E++   ++          + 
Subjt:  KGARSKKSS---RCKDNFVQMMELRKKILILRDIIDLPSLERSASINE---------LVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQS

Query:  LAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAM--DEFVNSKDS--------SFSRTSSFGKSSSSTDSC
        L +F  ALKSIGDSW+ N EW  KSKY  SS  +N S   +VE VL  +D ++ M+ ERFD M  DE    K+S        S SR  S  +S S + S 
Subjt:  LAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAM--DEFVNSKDS--------SFSRTSSFGKSSSSTDSC

Query:  SETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDA
          + +S C SP TP SVL             +  + +S LLW++RVQA+EKL+PIDVK L +  LS                             +    
Subjt:  SETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDA

Query:  ADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQE----L
         + +N+ +V  + E      E  QK D  E I+V    EE +NL   + +     + I  I   E    SK N  E     +  F              +
Subjt:  ADANNEMEVCDIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQE----L

Query:  PTSDLPTVVSKPLPLLSKMA-----PPPPPPSPPSTCP
         T+ LP     P P ++ +A     PPPPPP PP+  P
Subjt:  PTSDLPTVVSKPLPLLSKMA-----PPPPPPSPPSTCP

AT4G04980.1 unknown protein2.3e-2930.41Show/hide
Query:  SKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSL
        SKT +  P+  K      S +C  NF+ M+ELRK I   RD+IDLPSL+ S S+ E++  TM+DLQKL PEI++  Q  EM+   ++          + L
Subjt:  SKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYSEMKTTCIEQKLRGNNVMYQSL

Query:  AYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------EFVNSKDSSFSRTSSFGKSSSSTDSCSETN
         +F   L++IGDSW+++ +W  +SKY  S   +N S   +VE VL  +D ++    ERF  MD        F   K  S     SF +S+    S SE+N
Subjt:  AYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMD-------EFVNSKDSSFSRTSSFGKSSSSTDSCSETN

Query:  SSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMD--VDDKLTSENTDAAD
        +S   SP TP SVL         S        +SP LW+LR QA+++L+P+D+K   +  LS    +   + T++ I EE+     + ++   E+ D + 
Subjt:  SSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMD--VDDKLTSENTDAAD

Query:  ANNEMEVCDIK--------------EEKDLSK------------EASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKC----LSKA
           E    +IK              E KD S+            E   + D ++ IE  +T+   +    T   + D N+ IE  + E            
Subjt:  ANNEMEVCDIK--------------EEKDLSK------------EASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKC----LSKA

Query:  NSQEDIAE-RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLS--------KMAPPPPPPSPPSTCPS
        N  ED +E  T++ DS  ++  +++P     T    P P +S        +  PPPPPPSP    P+
Subjt:  NSQEDIAE-RTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLS--------KMAPPPPPPSPPSTCPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACCGGAGGCTGGTGTGGTTTAGGCCCTTTGTTGTTTCGTAGAAAGGCCTATGGACTCGAGACAATGAAGAACTCTTCTTACGTGTTCTCGAAGACATATTCTAA
GAAACCAAAGCTATCCAAAGGTGCTAGAAGTAAGAAGTCTTCAAGATGCAAAGATAATTTTGTTCAAATGATGGAGCTAAGGAAGAAAATCTTGATTCTTAGAGATATCA
TTGATTTGCCTTCTTTAGAACGCTCTGCTTCTATAAATGAGCTGGTGGTGGGAACGATGGAAGATCTTCAAAAGCTTTATCCTGAAATCATTTCGGATATCCAATATTCC
GAAATGAAGACGACATGTATTGAACAGAAACTTAGAGGAAATAATGTCATGTATCAGAGTCTTGCCTATTTCTGCACTGCACTGAAATCGATTGGCGATTCGTGGATGCT
GAACCATGAGTGGAGGGACAAATCTAAATATAATTTGTCATCATTTCAGGAAAACTCCAGCTTCCAAGAAATTGTTGAATCTGTGTTGGGTATTATTGATTGCATCGTTA
GTATGGCGAACGAAAGGTTTGATGCGATGGACGAATTTGTTAACTCAAAGGATTCTTCTTTTTCAAGAACTAGTTCCTTTGGTAAGAGCTCGAGTTCGACAGATTCCTGC
TCCGAAACCAATAGCTCTTGCTGCTCTTCTCCAGAAACTCCGACGTCCGTCCTTGCAAACTTTCGGAGCAGTGAAAGAAAATCTTCTGAAAAGGAGAAAGTCTCTTGTAG
CTCTCCTCTTTTATGGTCTCTTAGAGTTCAAGCAGTGGAAAAGTTGAACCCCATTGATGTCAAGCACCTTTTGCTTCCTAGGTTGTCTCACTGCGGAGTAAATGTCTGCC
CCGCCCCAACCAGAGTGGCGATCGTTGAGGAATCAACGATGGATGTGGATGACAAGCTCACCTCTGAAAACACTGATGCTGCTGATGCAAATAATGAAATGGAAGTGTGT
GATATTAAAGAAGAGAAGGATTTGAGCAAGGAAGCTAGTCAAAAGGCAGATGGAAATGAGAAAATTGAAGTGTTAGATACCAAAGAAGAAAAATTGAATTTGAGCAGGAC
AGCTAGTCTAAAGGCAGATAGAAATGAAGAAATTGAAGTGATTGATATCGAAGAAGAGAAGTGTTTGAGCAAGGCAAATAGCCAAGAAGACATTGCTGAGAGAACCAATG
ATTTTGATTCCCAAGCTGCTGCAATTGCTCAAGAATTGCCTACATCAGATTTACCAACTGTTGTATCAAAGCCATTACCGCTGCTATCCAAGATGGCTCCCCCTCCCCCT
CCCCCCTCCCCCCCCTCCACCTGCCCGAGCTCTGCAACCCTCCATAGTCAACCTGCAACTGCCAACGCCAACGCCACCACCTCCACCACCACCACCAATGATGCAACAAA
ATGCAGTATTAGCTCAACAACTTTCACAGCCACCTCCTCCACCACCACCACCACCACCACCAATGATACAACAAAATGCAACATTGGTCCAACATCTTTCACAGCCACCT
CCTCCCCCACCAATGCCTCAGATGAAAGCACAGCCTGCTGCAGCAGAGTCAAATGCGCCTCCTCCACCTCCACAATTGTTAAAGGTAATCGAAACGGTGATCAAAGTCAA
TGGACCACCACCACCACCACCACCATCAAACATTACTGGAACGATGGTGAGAGCAGGTGTACCGCCACCTCCCCCTATGGTGCCCTCAAAAGGGAGTGCAGGTCCAGCAC
CCCCTCCTCGGATGGCCCAAGGCAATGGGTTTGCTCCACCACCACCTCCACCAGGTGGTGCATTACGGTCCTTGCGCCCCAAGAAAACCTCTACCAAACTAAAAAGATCT
CATCAATTGGGAAATCTTTACCGGACACTCAAAGGAAAGGTGGAGGGATGCAATCAAAATCTTAA
mRNA sequenceShow/hide mRNA sequence
TTGTCTTCCTCTTTCTTGGCTTGTCTCTTTATAGAGAAAGTTCCAATGCAGTAAATTCCATTTGGTTCTTTTTCTTTTTTTTCTTTTTTCCCTTCTTTCTTCTTCTTCTT
CTTTAGGATCTTCCATCAATCCTAACCCAAAAAGCTCCTTCCTTCCAATTCCATTTTAAATCTTCCCCTTCTGTGTTGTGCTTCAAACGAAAAAAACAAAAAGACAAAAA
ACGAAGAAAAAAAAAAAAAAAGCCTTACAATATATATCTTTTTGTTGTTTGCCGGATCTGTACATAGAACATTTTATGTAAATAAGTTTTGTGAACGGTTGGTTTTCATT
TTCCCCTTGAGAAACCTTCAGCACAGCTGACCAGAGGAATAAAAAACAAAATTTTTCTGTAAAAAATTTTGAAACATTTCCCTTCACCCCACTATATCATCTGCGTTAGT
TAAAAATATATATATTTTTTTTCTTTCTGTGCCATTCAAGTCTTTTTAAAGGCTTGTTTTGGTCACATTTCATGGCTACCGGAGGCTGGTGTGGTTTAGGCCCTTTGTTG
TTTCGTAGAAAGGCCTATGGACTCGAGACAATGAAGAACTCTTCTTACGTGTTCTCGAAGACATATTCTAAGAAACCAAAGCTATCCAAAGGTGCTAGAAGTAAGAAGTC
TTCAAGATGCAAAGATAATTTTGTTCAAATGATGGAGCTAAGGAAGAAAATCTTGATTCTTAGAGATATCATTGATTTGCCTTCTTTAGAACGCTCTGCTTCTATAAATG
AGCTGGTGGTGGGAACGATGGAAGATCTTCAAAAGCTTTATCCTGAAATCATTTCGGATATCCAATATTCCGAAATGAAGACGACATGTATTGAACAGAAACTTAGAGGA
AATAATGTCATGTATCAGAGTCTTGCCTATTTCTGCACTGCACTGAAATCGATTGGCGATTCGTGGATGCTGAACCATGAGTGGAGGGACAAATCTAAATATAATTTGTC
ATCATTTCAGGAAAACTCCAGCTTCCAAGAAATTGTTGAATCTGTGTTGGGTATTATTGATTGCATCGTTAGTATGGCGAACGAAAGGTTTGATGCGATGGACGAATTTG
TTAACTCAAAGGATTCTTCTTTTTCAAGAACTAGTTCCTTTGGTAAGAGCTCGAGTTCGACAGATTCCTGCTCCGAAACCAATAGCTCTTGCTGCTCTTCTCCAGAAACT
CCGACGTCCGTCCTTGCAAACTTTCGGAGCAGTGAAAGAAAATCTTCTGAAAAGGAGAAAGTCTCTTGTAGCTCTCCTCTTTTATGGTCTCTTAGAGTTCAAGCAGTGGA
AAAGTTGAACCCCATTGATGTCAAGCACCTTTTGCTTCCTAGGTTGTCTCACTGCGGAGTAAATGTCTGCCCCGCCCCAACCAGAGTGGCGATCGTTGAGGAATCAACGA
TGGATGTGGATGACAAGCTCACCTCTGAAAACACTGATGCTGCTGATGCAAATAATGAAATGGAAGTGTGTGATATTAAAGAAGAGAAGGATTTGAGCAAGGAAGCTAGT
CAAAAGGCAGATGGAAATGAGAAAATTGAAGTGTTAGATACCAAAGAAGAAAAATTGAATTTGAGCAGGACAGCTAGTCTAAAGGCAGATAGAAATGAAGAAATTGAAGT
GATTGATATCGAAGAAGAGAAGTGTTTGAGCAAGGCAAATAGCCAAGAAGACATTGCTGAGAGAACCAATGATTTTGATTCCCAAGCTGCTGCAATTGCTCAAGAATTGC
CTACATCAGATTTACCAACTGTTGTATCAAAGCCATTACCGCTGCTATCCAAGATGGCTCCCCCTCCCCCTCCCCCCTCCCCCCCCTCCACCTGCCCGAGCTCTGCAACC
CTCCATAGTCAACCTGCAACTGCCAACGCCAACGCCACCACCTCCACCACCACCACCAATGATGCAACAAAATGCAGTATTAGCTCAACAACTTTCACAGCCACCTCCTC
CACCACCACCACCACCACCACCAATGATACAACAAAATGCAACATTGGTCCAACATCTTTCACAGCCACCTCCTCCCCCACCAATGCCTCAGATGAAAGCACAGCCTGCT
GCAGCAGAGTCAAATGCGCCTCCTCCACCTCCACAATTGTTAAAGGTAATCGAAACGGTGATCAAAGTCAATGGACCACCACCACCACCACCACCATCAAACATTACTGG
AACGATGGTGAGAGCAGGTGTACCGCCACCTCCCCCTATGGTGCCCTCAAAAGGGAGTGCAGGTCCAGCACCCCCTCCTCGGATGGCCCAAGGCAATGGGTTTGCTCCAC
CACCACCTCCACCAGGTGGTGCATTACGGTCCTTGCGCCCCAAGAAAACCTCTACCAAACTAAAAAGATCTCATCAATTGGGAAATCTTTACCGGACACTCAAAGGAAAG
GTGGAGGGATGCAATCAAAATCTTAAGTCGGCTAACGGAAGGAAAGGTGGCGTCGGAAACAGTAACGGAGGAAAACAAGGAATGGCTGATGCATTGGCAGAGATGACAAA
AAGATCAGCATACTTCCAGCAAATTGAAGAAGATGTTAAAAAACACGCCAAATCGATCACCGCGCTTAAATCTTCCATTTCATCTTTCCAATCATCAGACATGAATGACC
TGCTCCTTTTCCACAAGCAAGTGGAATCTGTACTAGAGAATTTAACTGATGAATCACAGGTACTAGCAAGGTTTGAAGGATTTCCCATCAAAAAGTTGGAAACTTTGAGA
ATTGCAGCAGCATTATATCTAAAGTTAGATACAATTGTCTATCAACTACAGAACTGGAAGTTTGTTTCTCCCATGGGACTGCTTCTCGACCGAGTCGAAAACTACTTCTC
TAAGATCAAAGGAGAAGTCGATGCACTTGAACGAACCAAGGATGAAGAATCAAAGAGATTCCGAGGTCACGGTATTCAATTTGATTTCAGTGTGTTAATACGGATCAAGG
AATCAATGGTGGATGTTTCTTCTGGCTGCATGGAGTTGGCTCTGAAGGAAAAAAGAGAGTTGAAGGCAGCAGCAGAAAAGACACGAAAAGGAGGCCGATCTGAAAATTCG
AACAAGGCACGTTCAAAGATGCTATGGAGGGCATTCCAATTCGCATACCGAGTTTACACCTTCGCCGGTGGACACGACGAGCGTGCTGATAGACTGACCAGAGAGTTGGC
TATAGAAATAGAGAGTGAATCCCATCACCTATGATTCTCTCCTCTCTCTTTTAAAAAGTTGTTTTCCTCTCCTTTCTTCTTCTTTTTGGCTGCAAGAGAAAAAAAAAAAA
AAGAATTTTTTTTAAAACACTGGATCATGTGAGATCAGTGGTTGGTCTTACACCATATCTAGCATTTTACTGTTTGGAATGGTATTATTGAGTAATGTTTGACATTTATG
TTTAAGTGCTAAACGTAACCTTTTTCCTCTATCTGGATAGAACCCTGTAAAAAATTATGTAATTAATATGAGCTCTTACGATGCTATCCATACATATATATTATACATAT
ACAACTTATTTAC
Protein sequenceShow/hide protein sequence
MATGGWCGLGPLLFRRKAYGLETMKNSSYVFSKTYSKKPKLSKGARSKKSSRCKDNFVQMMELRKKILILRDIIDLPSLERSASINELVVGTMEDLQKLYPEIISDIQYS
EMKTTCIEQKLRGNNVMYQSLAYFCTALKSIGDSWMLNHEWRDKSKYNLSSFQENSSFQEIVESVLGIIDCIVSMANERFDAMDEFVNSKDSSFSRTSSFGKSSSSTDSC
SETNSSCCSSPETPTSVLANFRSSERKSSEKEKVSCSSPLLWSLRVQAVEKLNPIDVKHLLLPRLSHCGVNVCPAPTRVAIVEESTMDVDDKLTSENTDAADANNEMEVC
DIKEEKDLSKEASQKADGNEKIEVLDTKEEKLNLSRTASLKADRNEEIEVIDIEEEKCLSKANSQEDIAERTNDFDSQAAAIAQELPTSDLPTVVSKPLPLLSKMAPPPP
PPSPPSTCPSSATLHSQPATANANATTSTTTTNDATKCSISSTTFTATSSTTTTTTTNDTTKCNIGPTSFTATSSPTNASDESTACCSRVKCASSTSTIVKGNRNGDQSQ
WTTTTTTTIKHYWNDGESRCTATSPYGALKRECRSSTPSSDGPRQWVCSTTTSTRWCITVLAPQENLYQTKKISSIGKSLPDTQRKGGGMQSKS