; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G008860 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G008860
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationchr07:10163658..10166445
RNA-Seq ExpressionLsi07G008860
SyntenyLsi07G008860
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044829.1 mucin-2 [Cucumis melo var. makuwa]2.3e-23088.63Show/hide
Query:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP
        MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPSSE HENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP

Query:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD
        VS LQSEP S  QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQF+ P+ QK ESD
Subjt:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD

Query:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP
        +QY FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTL N+DK SIHNW+QRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD
         TSESM DHHATNESQNIQILID   K   EEEP ATNHRFSFELSDGDVL QSVGSKPLESNEL V SSPIHEPFET KENSP GDHT NV EEKTKAD
Subjt:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD

Query:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        G+EA QHQEHHS+ LGSVKEFNFDN NGSDTH   INS+WWTNAKD  TEGTT GAWSFFP  QQR
Subjt:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

XP_004146564.1 uncharacterized protein LOC101220378 isoform X1 [Cucumis sativus]3.0e-23590.34Show/hide
Query:  MRRRTDADDLRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSS
        MRRRTD DD RPV NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGS+KQRKRIGHAVLVPEPSPSSE HENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDADDLRPV-NNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSS

Query:  PVSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD
        PVS LQSEP S  QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQF+QPTL K ESD
Subjt:  PVSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD

Query:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP
        +QY FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDK SIHNW+QRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD
        QTSESMSDHHATNESQNIQILID   K  +EEEP ATNHRFSFELSDGDVLLQSVGSKPLESNELAV SSPIHEPFET KENSP GDHT NV EEKTKAD
Subjt:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD

Query:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        G+EA Q QEHHS+TLGSVKEFNFDNGNGSDTH  NINSEWW NAKD  TE T  G WSFFPM QQR
Subjt:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

XP_008452032.1 PREDICTED: uncharacterized protein LOC103493162 isoform X1 [Cucumis melo]1.7e-23089.08Show/hide
Query:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSS
        MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPSSE HENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSS

Query:  PVSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPES
        PVS LQSEP S  QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQF+ P+LQK ES
Subjt:  PVSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPES

Query:  DHQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLN
        D+QY FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDK SIHNW+QRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DHQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKA
        P TSESM DHHATNESQNIQILID   K   EEEP ATNHRFSFELSDGDVL QSVGSKPLESNEL V SSPIHEPFET KENSP GDHT NV EEKTKA
Subjt:  PQTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKA

Query:  DGEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        DG+EA QHQEHHS+ LGSVKEFNFDN NGSDTH   INS+WWTNAKD  TEGTT GAWSFFP  QQR
Subjt:  DGEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

XP_008452033.1 PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo]7.0e-23289.27Show/hide
Query:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP
        MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPSSE HENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP

Query:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD
        VS LQSEP S  QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQF+ P+LQK ESD
Subjt:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD

Query:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP
        +QY FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDK SIHNW+QRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD
         TSESM DHHATNESQNIQILID   K   EEEP ATNHRFSFELSDGDVL QSVGSKPLESNEL V SSPIHEPFET KENSP GDHT NV EEKTKAD
Subjt:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD

Query:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        G+EA QHQEHHS+ LGSVKEFNFDN NGSDTH   INS+WWTNAKD  TEGTT GAWSFFP  QQR
Subjt:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

XP_038884079.1 uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida]1.5e-24593.79Show/hide
Query:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP
        MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPE SPSSESHEN+LQSPDIVLPFAAPPSSP
Subjt:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP

Query:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD
        VSFLQSEP S TQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQFLQPTLQK ESD
Subjt:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD

Query:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP
        HQYPFPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNW+QRQSTDSCTQDSIE KSSNDFVLNP
Subjt:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPV-GDHTPNVSEEKTKA
        QTSESMSDHHATNESQNIQILIDG QK  EEE P ATNHRFSFELSDGD LLQSVGSKPL+SNE+AVASSPIHEPFETAKENSPV  DHT NV+E KTKA
Subjt:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPV-GDHTPNVSEEKTKA

Query:  DGEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        + EEA QHQEHHSITLGSVKEFNFDNGNGSDTHKAN+NSEWWTNAKDVDTEGTTNGAWSFFPM QQR
Subjt:  DGEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

TrEMBL top hitse value%identityAlignment
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X18.4e-23189.08Show/hide
Query:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSS
        MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAV QKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPSSE HENTLQSPDIVLPFAAPPSS
Subjt:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSS

Query:  PVSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPES
        PVS LQSEP S  QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQF+ P+LQK ES
Subjt:  PVSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPES

Query:  DHQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLN
        D+QY FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDK SIHNW+QRQSTDSCTQDSIEFKSSNDFVLN
Subjt:  DHQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLN

Query:  PQTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKA
        P TSESM DHHATNESQNIQILID   K   EEEP ATNHRFSFELSDGDVL QSVGSKPLESNEL V SSPIHEPFET KENSP GDHT NV EEKTKA
Subjt:  PQTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKA

Query:  DGEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        DG+EA QHQEHHS+ LGSVKEFNFDN NGSDTH   INS+WWTNAKD  TEGTT GAWSFFP  QQR
Subjt:  DGEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X23.4e-23289.27Show/hide
Query:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP
        MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPSSE HENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP

Query:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD
        VS LQSEP S  QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQF+ P+LQK ESD
Subjt:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD

Query:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP
        +QY FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDK SIHNW+QRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD
         TSESM DHHATNESQNIQILID   K   EEEP ATNHRFSFELSDGDVL QSVGSKPLESNEL V SSPIHEPFET KENSP GDHT NV EEKTKAD
Subjt:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD

Query:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        G+EA QHQEHHS+ LGSVKEFNFDN NGSDTH   INS+WWTNAKD  TEGTT GAWSFFP  QQR
Subjt:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

A0A5A7TUB1 Mucin-21.1e-23088.63Show/hide
Query:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP
        MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPSSE HENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP

Query:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD
        VS LQSEP S  QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQF+ P+ QK ESD
Subjt:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD

Query:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP
        +QY FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPL+VPPTL N+DK SIHNW+QRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD
         TSESM DHHATNESQNIQILID   K   EEEP ATNHRFSFELSDGDVL QSVGSKPLESNEL V SSPIHEPFET KENSP GDHT NV EEKTKAD
Subjt:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD

Query:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        G+EA QHQEHHS+ LGSVKEFNFDN NGSDTH   INS+WWTNAKD  TEGTT GAWSFFP  QQR
Subjt:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

A0A5D3CYQ2 Mucin-23.4e-23289.27Show/hide
Query:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP
        MRRRTD DD RPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSC SIYWCFGSLKQRKRIGHAVLVPEPSPSSE HENTLQSPDIVLPFAAPPSSP
Subjt:  MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSP

Query:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD
        VS LQSEP S  QSPTALISFTSLTANMYSPDGPSSIFAIGPFAHE QLVSPPLNFSTLTTEPST PFTPPESIHLTTPSSPEVPFAQF+ P+LQK ESD
Subjt:  VSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESD

Query:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP
        +QY FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTL NLDK SIHNW+QRQSTDSCTQDSIEFKSSNDFVLNP
Subjt:  HQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD
         TSESM DHHATNESQNIQILID   K   EEEP ATNHRFSFELSDGDVL QSVGSKPLESNEL V SSPIHEPFET KENSP GDHT NV EEKTKAD
Subjt:  QTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKAD

Query:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR
        G+EA QHQEHHS+ LGSVKEFNFDN NGSDTH   INS+WWTNAKD  TEGTT GAWSFFP  QQR
Subjt:  GEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQR

A0A6J1C828 uncharacterized protein At1g76660-like1.1e-20981.1Show/hide
Query:  MRRRTDAD---DLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPP
        MRRR DAD   DL PVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPS+E  ENTLQSPDIVLPFAAPP
Subjt:  MRRRTDAD---DLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPP

Query:  SSPVSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKP
        SSPVSFLQSEP S TQSPTA++SFTSLTANMYSPDGPSSIFA+GPFAHETQLVSPPLNFST+TT+PST PFTPPESIHLTTPSSPEVPFAQ+LQP+ QK 
Subjt:  SSPVSFLQSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKP

Query:  ESDHQY-PFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDF
        ESDHQY  FPNDDFQSYQFYPGSP+SHLISPRSVISRSGASSPLPD DF   GS F NFP+EVPPTLLNLD+ SI +W+ +QS+DSCTQ+S+ +KSSNDF
Subjt:  ESDHQY-PFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDF

Query:  VLNPQTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEK
        VLNPQTSES+SD+HA+NE  NIQIL DG Q+    +E AA NHRFSFELSD D LL+SV +KPLESNELAVASSPIHEP ETAKE S VG HT N +EE+
Subjt:  VLNPQTSESMSDHHATNESQNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEK

Query:  TKADGEEAQQHQ--EHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQ
         KADGEE   HQ  EHHS+TLG+VKEFNFDNGNG DT K NINS WW N KD +TEGTT GAWSFFP+ QQ
Subjt:  TKADGEEAQQHQ--EHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFPMAQQ

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766602.3e-3147.62Show/hide
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIV---------LPFAAPPSSPVSFLQSEPTSGTQSPTALISFTSLTANMYSP
        Q++RWG C  ++ CF S K  KRI  A  +PE    S S  N      ++         L   APPSSP SF  S   S TQSP     + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIV---------LPFAAPPSSPVSFLQSEPTSGTQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFLQPTLQKPESDHQYPFPNDDFQSYQFYPGSPISHLISPRS
         GP SS++A GP+AHETQLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFLQPTLQKPESDHQYPFPNDDFQSYQFYPGSPISHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)9.0e-5250.57Show/hide
Query:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENT----LQSPDIVLPFAAPPSSPVSFLQSEP
        NN F TI AAA AIA+ D R  +++ + +KR+W + WS+  CFGS +QRKRIG++VLVPEP   S S+  T     +S    LPF APPSSP SF QSEP
Subjt:  NNTFQTITAAADAIATVDHRFPRATAV-QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENT----LQSPDIVLPFAAPPSSPVSFLQSEP

Query:  TSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPP---ESIHL--TTPSSPEVPFAQFLQPTLQKPESDHQY
         S TQSP  ++SF+ L  N        SIFAIGP+AHETQLVSPP+ FST TTEPS+ P TPP    SI+L  TTPSSPEVPFAQ      Q     +++
Subjt:  TSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPP---ESIHL--TTPSSPEVPFAQFLQPTLQKPESDHQY

Query:  PFPND-DFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLN
        P  +  +FQ YQ  PGSP+  LISP      SG +SP PD +     S F +F +  PP LL+
Subjt:  PFPND-DFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLN

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.6e-3247.62Show/hide
Query:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIV---------LPFAAPPSSPVSFLQSEPTSGTQSPTALISFTSLTANMYSP
        Q++RWG C  ++ CF S K  KRI  A  +PE    S S  N      ++         L   APPSSP SF  S   S TQSP     + SL AN  SP
Subjt:  QKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIV---------LPFAAPPSSPVSFLQSEPTSGTQSPTALISFTSLTANMYSP

Query:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFLQPTLQKPESDHQYPFPNDDFQSYQFYPGSPISHLISPRS
         GP SS++A GP+AHETQLVSPP+ FST TTEPST PFT PPE   LT PSSP+VP+A+FL  ++    S   +   ND   +Y  YPGSP S L SP S
Subjt:  DGP-SSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFT-PPESIHLTTPSSPEVPFAQFLQPTLQKPESDHQYPFPNDDFQSYQFYPGSPISHLISPRS

Query:  VISRSGASSP
          S  G  SP
Subjt:  VISRSGASSP

AT4G25620.1 hydroxyproline-rich glycoprotein family protein2.2e-5036.31Show/hide
Query:  LRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSES----HENTLQSPDIVLPFAAPPSSPVSFL
        +R VNN +  T+ AAA AI + + R  + ++VQK+R GS WS+YWCFGS K  KRIGHAVLVPEP+ S  +      ++  S  I +PF APPSSP SFL
Subjt:  LRPVNN-TFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSES----HENTLQSPDIVLPFAAPPSSPVSFL

Query:  QSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPE------
         S P S + +P   +   SLT N      P S F IGP+AHETQ V+PP+ FS  TTEPST PFTPP      +PSSPEVPFAQ L  +L++        
Subjt:  QSEPTSGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPPESIHLTTPSSPEVPFAQFLQPTLQKPE------

Query:  SDHQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDS--------------CT
         + ++   + +F+S Q YPGSP  +LISP      SG SSP P           + F +  PP  L  +  +   W  R  + S               T
Subjt:  SDHQYPFPNDDFQSYQFYPGSPISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDS--------------CT

Query:  QDSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDGRQKE------------EEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPI
         D  +  S    V+ P  +E++      N +     L+D +  E               +E     HR SFEL+  DV  + + SK   S     AS   
Subjt:  QDSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDGRQKE------------EEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPI

Query:  HEPFETAKENSPVGDH-TPNVSEEKTKADGE-EAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDT-EGTTNGAWSFFPM
                     G+H  PN      K  GE E++Q Q+  S + GS KEF FD+ N     K  I SEWW N K     + +   +W+FFP+
Subjt:  HEPFETAKENSPVGDH-TPNVSEEKTKADGE-EAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDT-EGTTNGAWSFFPM

AT5G52430.1 hydroxyproline-rich glycoprotein family protein3.4e-5938.63Show/hide
Query:  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSS---ESHENTLQSPDIVLPFAAPPSSPVSFLQSEPT
        VNN+ +T+ AAA AI T + R  + ++ QK RWG CWS+Y CFG+ K  KRIG+AVLVPEP  S     + +N+  S  +VLPF APPSSP SFLQS+P+
Subjt:  VNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSS---ESHENTLQSPDIVLPFAAPPSSPVSFLQSEPT

Query:  SGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPP--ESIHLTTPSSPEVPFAQFLQPTLQKPESD------H
        S + SP   +   SLT+N +SP  P S+F +GP+A+ETQ V+PP+ FS   TEPST P+TPP   S+H+TTPSSPEVPFAQ L  +L+    D       
Subjt:  SGTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPST-PFTPP--ESIHLTTPSSPEVPFAQFLQPTLQKPESD------H

Query:  QYPFPNDDFQSYQFYPGSP-ISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP
        ++   + +F+S Q  PGSP   +LISP SVIS SG SSP P        S  + F +  PP  L  +  +   W  R  + S T             L P
Subjt:  QYPFPNDDFQSYQFYPGSP-ISHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNP

Query:  QTSESMSDHHATNES----QNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEK
           E +S +   N +    QN    +      +   E    +HR SFEL+  DV  + + SK   S++    +  I        E S   D   N+  EK
Subjt:  QTSESMSDHHATNES----QNIQILIDGRQKEEEEEEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEK

Query:  TKADGEEAQQH-QEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFP
           D E  Q   Q+  S ++GS KEF FD                  N KD + E     +WSFFP
Subjt:  TKADGEEAQQH-QEHHSITLGSVKEFNFDNGNGSDTHKANINSEWWTNAKDVDTEGTTNGAWSFFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACGACGTACGGATGCTGATGATTTGAGGCCTGTTAACAATACTTTCCAAACCATTACTGCCGCCGCTGATGCGATCGCCACCGTCGATCATCGTTTTCCTCGGGC
TACTGCCGTCCAGAAAAGAAGATGGGGTAGTTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCACGCTGTTTTGGTACCAGAACCAA
GTCCTTCATCTGAGTCTCATGAAAATACATTGCAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTTTCCTTCCTTCAATCTGAGCCAACTTCT
GGTACACAATCACCTACAGCTTTGATCTCTTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAAC
ACAACTAGTTTCTCCACCTCTCAATTTCTCCACTCTCACCACTGAACCATCAACTCCCTTCACTCCTCCTGAGTCTATCCACTTGACAACCCCTTCTTCCCCTGAAGTTC
CTTTTGCTCAGTTTCTTCAACCTACCCTTCAGAAACCTGAGTCTGATCATCAATATCCATTTCCTAATGATGACTTCCAATCTTACCAATTCTATCCAGGCAGTCCTATC
AGTCACCTCATATCACCACGGTCTGTCATTTCTCGTTCTGGGGCTTCCTCACCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGA
AGTTCCACCTACTTTGTTGAACCTTGACAAACAATCCATTCATAACTGGCAACAACGACAAAGTACTGATTCTTGCACTCAAGATTCTATAGAATTCAAATCAAGTAATG
ATTTTGTTTTGAATCCCCAAACTTCAGAATCTATGTCAGATCACCACGCAACAAATGAATCTCAAAATATTCAAATTCTCATTGATGGAAGGCAAAAGGAGGAGGAGGAG
GAGGAGCCAGCTGCTACTAATCATAGGTTCTCATTTGAGTTATCTGATGGAGATGTTTTATTGCAAAGCGTAGGAAGTAAGCCACTGGAATCAAATGAACTTGCAGTTGC
ATCGTCTCCAATACATGAACCATTTGAAACGGCTAAAGAAAATTCTCCTGTTGGTGACCATACTCCAAATGTTTCAGAAGAAAAGACAAAAGCAGACGGTGAAGAAGCAC
AGCAGCATCAAGAACATCATTCCATAACTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATACACATAAGGCAAATATAAATTCAGAATGGTGG
ACTAATGCAAAGGATGTTGACACAGAAGGCACGACCAACGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGA
mRNA sequenceShow/hide mRNA sequence
CTTTATTAATTACATAATAATATTGTTTATTATTTAATTTCATTTCCGTCTTCTTCCTTCTCTCTCGCTAACGAACCTTCTCTTCACTTTCTTACTGCAAAATCTCCCGT
TTTGATTATGCCTAAAAATTTCATGATCTCTTGTATATGATCGGAACTACCGATTCTTTCTATGAAACCGATCGGCGATTCTCTGGTTTCGAATAACGATAGCGATGAGA
CGACGTACGGATGCTGATGATTTGAGGCCTGTTAACAATACTTTCCAAACCATTACTGCCGCCGCTGATGCGATCGCCACCGTCGATCATCGTTTTCCTCGGGCTACTGC
CGTCCAGAAAAGAAGATGGGGTAGTTGTTGGAGTATTTATTGGTGCTTTGGATCTCTCAAACAGAGGAAAAGAATTGGGCACGCTGTTTTGGTACCAGAACCAAGTCCTT
CATCTGAGTCTCATGAAAATACATTGCAATCACCAGATATTGTGCTTCCTTTTGCTGCACCTCCCTCTTCCCCTGTTTCCTTCCTTCAATCTGAGCCAACTTCTGGTACA
CAATCACCTACAGCTTTGATCTCTTTCACTTCTCTCACTGCTAACATGTATTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACACAACT
AGTTTCTCCACCTCTCAATTTCTCCACTCTCACCACTGAACCATCAACTCCCTTCACTCCTCCTGAGTCTATCCACTTGACAACCCCTTCTTCCCCTGAAGTTCCTTTTG
CTCAGTTTCTTCAACCTACCCTTCAGAAACCTGAGTCTGATCATCAATATCCATTTCCTAATGATGACTTCCAATCTTACCAATTCTATCCAGGCAGTCCTATCAGTCAC
CTCATATCACCACGGTCTGTCATTTCTCGTTCTGGGGCTTCCTCACCTTTGCCTGACTATGATTTTGCTTCCTTTGGTTCTCAATTTTTGAATTTCCCATTAGAAGTTCC
ACCTACTTTGTTGAACCTTGACAAACAATCCATTCATAACTGGCAACAACGACAAAGTACTGATTCTTGCACTCAAGATTCTATAGAATTCAAATCAAGTAATGATTTTG
TTTTGAATCCCCAAACTTCAGAATCTATGTCAGATCACCACGCAACAAATGAATCTCAAAATATTCAAATTCTCATTGATGGAAGGCAAAAGGAGGAGGAGGAGGAGGAG
CCAGCTGCTACTAATCATAGGTTCTCATTTGAGTTATCTGATGGAGATGTTTTATTGCAAAGCGTAGGAAGTAAGCCACTGGAATCAAATGAACTTGCAGTTGCATCGTC
TCCAATACATGAACCATTTGAAACGGCTAAAGAAAATTCTCCTGTTGGTGACCATACTCCAAATGTTTCAGAAGAAAAGACAAAAGCAGACGGTGAAGAAGCACAGCAGC
ATCAAGAACATCATTCCATAACTCTTGGGTCTGTGAAGGAATTCAATTTTGATAATGGCAATGGAAGTGATACACATAAGGCAAATATAAATTCAGAATGGTGGACTAAT
GCAAAGGATGTTGACACAGAAGGCACGACCAACGGGGCCTGGTCATTCTTTCCAATGGCGCAGCAAAGATGAGCTGACTGGTGCTTACTTATACTCTGGAATTTCCTCAT
GCCCATCATCTTTTGCAGTTGCAAATTGATAGGCGAGAAAAAGAGGAATGATGGGCTTTGAAGGTATTAAAGAGGTCGTCAAATCATGAGAGAGCCAGACCAGAAGCCTT
TGTTTTTTCCCCCAACAATATGACCTAAAACAAACAAAGCCAGATATTATTAGAACGATAGAGAAATTTCTAGATTCGATAGGGCCTTATTAACAAACAATTGTGGCTCC
ACTTGAATTGTAATAGATATTAGTAGTCTAATAGAAATTGGAATTGTGTAAATATGGTAATAAAAAGTTTTTTTTTTTTCCATATTCACAATATTTCGTTTATTGATTTT
GAATTAGTTGGAACGTAGTGAGTTGGGATACCCAAAAATTGCAAAAAGTTTTTGCAATTAGAAATGTTTCATGCTATGAAATTAAT
Protein sequenceShow/hide protein sequence
MRRRTDADDLRPVNNTFQTITAAADAIATVDHRFPRATAVQKRRWGSCWSIYWCFGSLKQRKRIGHAVLVPEPSPSSESHENTLQSPDIVLPFAAPPSSPVSFLQSEPTS
GTQSPTALISFTSLTANMYSPDGPSSIFAIGPFAHETQLVSPPLNFSTLTTEPSTPFTPPESIHLTTPSSPEVPFAQFLQPTLQKPESDHQYPFPNDDFQSYQFYPGSPI
SHLISPRSVISRSGASSPLPDYDFASFGSQFLNFPLEVPPTLLNLDKQSIHNWQQRQSTDSCTQDSIEFKSSNDFVLNPQTSESMSDHHATNESQNIQILIDGRQKEEEE
EEPAATNHRFSFELSDGDVLLQSVGSKPLESNELAVASSPIHEPFETAKENSPVGDHTPNVSEEKTKADGEEAQQHQEHHSITLGSVKEFNFDNGNGSDTHKANINSEWW
TNAKDVDTEGTTNGAWSFFPMAQQR