CuGenDBv2

Carg08584 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg08584
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein.
Genome location: Carg_Chr09:473..4780
RNA-Seq Expression: Carg08584
Synteny: Carg08584
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6591175.1 hypothetical protein SDJN03_13521, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.5e-270 | identity: 96.94%
Query:  VTMEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGG
        VTMEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQ ILSRIVRLLDNFENDPCFLV LTFLPSENLLV      GKKWGG
Subjt:  VTMEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGG

Query:  CWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAH
        CWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR PA GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAH
Subjt:  CWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAH

Query:  ETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDF
        ETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDF
Subjt:  ETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDF

Query:  PPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRAS
        PPQWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRAS
Subjt:  PPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRAS

Query:  FGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        FGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENL SAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYE   L
Subjt:  FGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

KAG7024061.1 hypothetical protein SDJN02_12874 [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 0.0e+00 | identity: 100%
Query:  MGSEQNRFPQQERNLRVGFWILCSVHLDKSVTMEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFEN
        MGSEQNRFPQQERNLRVGFWILCSVHLDKSVTMEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFEN
Subjt:  MGSEQNRFPQQERNLRVGFWILCSVHLDKSVTMEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFEN

Query:  DPCFLVGLTFLPSENLLVVCCDYRGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPST
        DPCFLVGLTFLPSENLLVVCCDYRGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPST
Subjt:  DPCFLVGLTFLPSENLLVVCCDYRGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPST

Query:  AQSPSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYS
        AQSPSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYS
Subjt:  AQSPSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYS

Query:  LYPGSPSSSLVSPISRTSGDCLSSFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD
        LYPGSPSSSLVSPISRTSGDCLSSFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD
Subjt:  LYPGSPSSSLVSPISRTSGDCLSSFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSD

Query:  VYSPGGNVLQNRHNKSPKQDVEELEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVV
        VYSPGGNVLQNRHNKSPKQDVEELEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVV
Subjt:  VYSPGGNVLQNRHNKSPKQDVEELEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVV

Query:  QKDTCTEVLALCSVYEGELLFPVTMCIHLFHKMRSRKKVNT
        QKDTCTEVLALCSVYEGELLFPVTMCIHLFHKMRSRKKVNT
Subjt:  QKDTCTEVLALCSVYEGELLFPVTMCIHLFHKMRSRKKVNT

XP_022975613.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita maxima] | e-value: 1.3e-237 | identity: 87.7%
Query:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW
        ME+QESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK                                                GKKWGGCW
Subjt:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW

Query:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
        GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR P  GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
Subjt:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET

Query:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP
        QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSS+DLKG GKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP
Subjt:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP

Query:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG
        QWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNV QNRHNKSPKQDVEELEAYRASFG
Subjt:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG

Query:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        FSADEII TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYE   L
Subjt:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

XP_023521113.1 uncharacterized protein At1g76660-like isoform X1 [Cucurbita pepo subsp. pepo] | e-value: 9.0e-239 | identity: 88.32%
Query:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW
        ME+QESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK                                               +GKKWGGCW
Subjt:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW

Query:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
        GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR PA GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
Subjt:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET

Query:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP
        QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDF P
Subjt:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP

Query:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG
        QWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG
Subjt:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG

Query:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        FSADEII TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYE   L
Subjt:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

XP_023521115.1 uncharacterized protein At1g76660-like isoform X2 [Cucurbita pepo subsp. pepo] | e-value: 2.0e-238 | identity: 88.32%
Query:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW
        ME+QESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK                                                GKKWGGCW
Subjt:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW

Query:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
        GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR PA GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
Subjt:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET

Query:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP
        QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDF P
Subjt:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP

Query:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG
        QWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG
Subjt:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG

Query:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        FSADEII TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYE   L
Subjt:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1F8I2 uncharacterized protein At1g76660-like isoform X3 | e-value: 2.1e-217 | identity: 97.48%
Query:  RGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF
        RGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR PA GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF
Subjt:  RGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF

Query:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS
        ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS
Subjt:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS

Query:  SFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEE
        SFPERDFPPQWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEE
Subjt:  SFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEE

Query:  LEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        LEAYRASFGFSADEII TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEP LLAENLNSAHTTLQS RRIKSPPDVVQKDTCTEVLALC+VYE   L
Subjt:  LEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

A0A6J1F9B9 uncharacterized protein At1g76660-like isoform X1 | e-value: 8.2e-238 | identity: 87.91%
Query:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW
        ME+QESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK                                                GKKWGGCW
Subjt:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW

Query:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
        GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR PA GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
Subjt:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET

Query:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP
        QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP
Subjt:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP

Query:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG
        QWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG
Subjt:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG

Query:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        FSADEII TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEP LLAENLNSAHTTLQS RRIKSPPDVVQKDTCTEVLALC+VYE   L
Subjt:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

A0A6J1IEQ0 uncharacterized protein At1g76660-like isoform X3 | e-value: 1.6e-217 | identity: 97.23%
Query:  RGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF
        RGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR P  GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF
Subjt:  RGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF

Query:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS
        ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSS+DLKG GKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS
Subjt:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS

Query:  SFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEE
        SFPERDFPPQWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNV QNRHNKSPKQDVEE
Subjt:  SFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEE

Query:  LEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        LEAYRASFGFSADEII TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYE   L
Subjt:  LEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

A0A6J1IH75 uncharacterized protein At1g76660-like isoform X2 | e-value: 4.7e-217 | identity: 96.98%
Query:  RGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF
        +GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR P  GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF
Subjt:  RGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMF

Query:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS
        ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSS+DLKG GKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS
Subjt:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLS

Query:  SFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEE
        SFPERDFPPQWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNV QNRHNKSPKQDVEE
Subjt:  SFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEE

Query:  LEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        LEAYRASFGFSADEII TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYE   L
Subjt:  LEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

A0A6J1IL36 uncharacterized protein At1g76660-like isoform X1 | e-value: 6.3e-238 | identity: 87.7%
Query:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW
        ME+QESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQK                                                GKKWGGCW
Subjt:  MEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTFLPSENLLVVCCDYRGKKWGGCW

Query:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
        GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNR P  GMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET
Subjt:  GALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFATGPYAHET

Query:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP
        QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSS+DLKG GKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP
Subjt:  QLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFPP

Query:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG
        QWNP VSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNV QNRHNKSPKQDVEELEAYRASFG
Subjt:  QWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFG

Query:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL
        FSADEII TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYE   L
Subjt:  FSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELL

SwissProt top hits (e-value, % identity, alignment)
Q9SRE5 Uncharacterized protein At1g76660 | e-value: 3.0e-112 | identity: 62.26%
Query:  KKWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNRLPAVGMAIQ--ATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTM
        K+WGGC G  SCF SQKG KRIVPASR+PE GNV  +QPN     G+     A  I+ SLLAPPSSPASFTNSALPST QSP+C+LS++ANSPGGPSS+M
Subjt:  KKWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNRLPAVGMAIQ--ATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTM

Query:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCL
        +ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A FL+SS+DLK +GK +Y   NDLQ  YSLYPGSP+S+L SPISR SGD L
Subjt:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCL

Query:  SSFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSPG--GNVLQNRHNKSPKQ
                       +SPQ+GK  R+ SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY     GN  QNR N+SPKQ
Subjt:  SSFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSPG--GNVLQNRHNKSPKQ

Query:  DVEELEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQ
        D+EELEAYRASFGFSADEII T+QYVEI+ VM+ SF    ++ +        E  LL++    +   L SQ
Subjt:  DVEELEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQ

Arabidopsis top hits (e-value, % identity, alignment)
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) | e-value: 6.2e-28 | identity: 42.11%
Query:  YRGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTM
        ++ +KW   W  L CF S +  KRI  +  +PE   V+   +         ++ +     +APPSSPASF  S  PS  QSP   LS S   P     ++
Subjt:  YRGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTM

Query:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS
        FA GPYAHETQLVSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFA+  +S+      G +  ++S+     Y L PGSP   L+SP S  S
Subjt:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS

Query:  GDCLSSFPE
        G   S FP+
Subjt:  GDCLSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown | e-value: 2.1e-113 | identity: 62.26%
Query:  KKWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNRLPAVGMAIQ--ATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTM
        K+WGGC G  SCF SQKG KRIVPASR+PE GNV  +QPN     G+     A  I+ SLLAPPSSPASFTNSALPST QSP+C+LS++ANSPGGPSS+M
Subjt:  KKWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNRLPAVGMAIQ--ATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTM

Query:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCL
        +ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A FL+SS+DLK +GK +Y   NDLQ  YSLYPGSP+S+L SPISR SGD L
Subjt:  FATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCL

Query:  SSFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSPG--GNVLQNRHNKSPKQ
                       +SPQ+GK  R+ SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY     GN  QNR N+SPKQ
Subjt:  SSFPERDFPPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSPG--GNVLQNRHNKSPKQ

Query:  DVEELEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQ
        D+EELEAYRASFGFSADEII T+QYVEI+ VM+ SF    ++ +        E  LL++    +   L SQ
Subjt:  DVEELEAYRASFGFSADEIIITTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQ

AT4G25620.1 hydroxyproline-rich glycoprotein family protein | e-value: 4.7e-28 | identity: 44.5%
Query:  KKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQS--PSCFLSMSANSPGGPSSTMF
        KK G  W    CF S+K  KRI  A  +PE     +     P    +  +T I    +APPSSPASF  S  PS + +  P    S++ N P  PS+  F
Subjt:  KKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQS--PSCFLSMSANSPGGPSSTMF

Query:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLK-----GTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS
          GPYAHETQ V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFA+ L+SS++       G   + + A++    +  +YPGSP  +L+SP S TS
Subjt:  ATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLK-----GTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein | e-value: 6.0e-31 | identity: 46.86%
Query:  KWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSAN--SPGGPSSTMFA
        +WG CW   SCF +QK  KRI  A  +PE   VT+    +     A   TV+ P  +APPSSPASF  S   S + SP   LS+++N  SP  P S +F 
Subjt:  KWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSAN--SPGGPSSTMFA

Query:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAEFLSSSVDL----KGTGKENYIASNDLQ-TAYSLYPGSP-SSSLVSPISRT
         GPYA+ETQ V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFA+ L+SS++L      +G     +S+  +  +  + PGSP   +L+SP S  
Subjt:  TGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAEFLSSSVDL----KGTGKENYIASNDLQ-TAYSLYPGSP-SSSLVSPISRT

Query:  SGDCLSS
        S    SS
Subjt:  SGDCLSS


Sequences
CDS sequence
ATGGGGTCCGAGCAGAATAGATTCCCTCAGCAGGAACGGAATCTTAGAGTTGGATTCTGGATCCTTTGCAGTGTGCACTTGGATAAGAGTGTTACTATGGAGCACCAGGA
AAGTACAGGATTTGGGATGCCTCCTGCGGTTAATACTTTGATGTTGGACTTATGGACTCCAATAATTGAGAAATCCAGCATGAACTGGATATGTGGAAAGTTCCTTTCCT
TTCAGAAGGGTGGCTGTTTATCTGTTCCTCAACAAGTTATCTTGTCCAGGATTGTTAGGCTTCTGGATAATTTCGAGAACGACCCTTGCTTCCTAGTGGGTTTGACATTT
TTGCCTTCAGAGAATTTGCTTGTGGTATGCTGTGATTACCGGGGAAAGAAATGGGGTGGATGCTGGGGTGCATTATCTTGTTTTCACTCGCAGAAAGGAGAAAAGCGCAT
TGTACCTGCATCTCGTTTACCTGAGGGAAATGTTGTGACAACCCAGCCAAATAGACTTCCAGCAGTCGGAATGGCCATCCAGGCTACAGTGATAGATCCATCCCTACTAG
CCCCACCTTCTTCTCCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCAAGCTGTTTCTTGTCGATGTCTGCCAACTCACCTGGAGGTCCTTCATCG
ACAATGTTTGCTACAGGGCCATATGCACACGAAACACAGCTGGTTTCGCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCTCCAGA
ACTAGCTCACCTGACCACACCTTCTTCCCCCGATGTGCCGTTTGCTGAGTTCCTATCCTCATCAGTGGATCTTAAAGGAACAGGAAAGGAAAATTACATTGCTTCAAATG
ATCTTCAAACTGCATATTCTCTCTACCCTGGAAGTCCTTCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGTGATTGCTTATCATCATTTCCTGAAAGGGACTTC
CCACCGCAGTGGAATCCTCCAGTTTCTCCCCAAGATGGAAAATATCCTAGAACTGGTTCCGGTCGGCTATTTGGACATGAGAAAGCTGGTACATCTTTGGTATCTCAGGA
TTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGTAGGTTAAGTGTATCGAAGGATTCAGATGTTTACTCGC
CTGGTGGGAATGTACTCCAAAATCGGCACAATAAGTCTCCAAAACAAGATGTGGAGGAACTAGAAGCATACCGAGCATCGTTTGGTTTCAGTGCAGATGAAATTATAATT
ACTACACAATATGTGGAGATATCTGGAGTAATGGAGGATTCCTTTACTATGAAGCCTTTCACTTCAACTAGTCTGTCAGCAGAAGAAAGTTTTGAACCTCCATTGTTGGC
TGAAAATCTAAATTCCGCACATACAACCTTACAGAGTCAGAGGAGAATTAAATCACCACCTGATGTTGTCCAAAAGGATACCTGCACTGAAGTGCTGGCATTATGCAGTG
TTTATGAAGGTGAGCTGCTGTTCCCTGTTACTATGTGCATACACTTGTTCCATAAAATGAGAAGTCGAAAGAAAGTAAACACGTAA
mRNA sequence
AGGAGGAGGTGGAGGAGGAGGAAGAGGAGGTGGAGGAGGAGGGCCACGAGTCTCGTAGGAAGAGGCAGCTTAGGACACGAAAAGATAAGTAGAGACTTTGTCGGGCTACT
GGTTACGAATGGGGTCCGAGCAGAATAGATTCCCTCAGCAGGAACGGAATCTTAGAGTTGGATTCTGGATCCTTTGCAGTGTGCACTTGGATAAGAGTGTTACTATGGAG
CACCAGGAAAGTACAGGATTTGGGATGCCTCCTGCGGTTAATACTTTGATGTTGGACTTATGGACTCCAATAATTGAGAAATCCAGCATGAACTGGATATGTGGAAAGTT
CCTTTCCTTTCAGAAGGGTGGCTGTTTATCTGTTCCTCAACAAGTTATCTTGTCCAGGATTGTTAGGCTTCTGGATAATTTCGAGAACGACCCTTGCTTCCTAGTGGGTT
TGACATTTTTGCCTTCAGAGAATTTGCTTGTGGTATGCTGTGATTACCGGGGAAAGAAATGGGGTGGATGCTGGGGTGCATTATCTTGTTTTCACTCGCAGAAAGGAGAA
AAGCGCATTGTACCTGCATCTCGTTTACCTGAGGGAAATGTTGTGACAACCCAGCCAAATAGACTTCCAGCAGTCGGAATGGCCATCCAGGCTACAGTGATAGATCCATC
CCTACTAGCCCCACCTTCTTCTCCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCAAGCTGTTTCTTGTCGATGTCTGCCAACTCACCTGGAGGTC
CTTCATCGACAATGTTTGCTACAGGGCCATATGCACACGAAACACAGCTGGTTTCGCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCC
CCTCCAGAACTAGCTCACCTGACCACACCTTCTTCCCCCGATGTGCCGTTTGCTGAGTTCCTATCCTCATCAGTGGATCTTAAAGGAACAGGAAAGGAAAATTACATTGC
TTCAAATGATCTTCAAACTGCATATTCTCTCTACCCTGGAAGTCCTTCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGTGATTGCTTATCATCATTTCCTGAAA
GGGACTTCCCACCGCAGTGGAATCCTCCAGTTTCTCCCCAAGATGGAAAATATCCTAGAACTGGTTCCGGTCGGCTATTTGGACATGAGAAAGCTGGTACATCTTTGGTA
TCTCAGGATTCTAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGTAGGTTAAGTGTATCGAAGGATTCAGATGT
TTACTCGCCTGGTGGGAATGTACTCCAAAATCGGCACAATAAGTCTCCAAAACAAGATGTGGAGGAACTAGAAGCATACCGAGCATCGTTTGGTTTCAGTGCAGATGAAA
TTATAATTACTACACAATATGTGGAGATATCTGGAGTAATGGAGGATTCCTTTACTATGAAGCCTTTCACTTCAACTAGTCTGTCAGCAGAAGAAAGTTTTGAACCTCCA
TTGTTGGCTGAAAATCTAAATTCCGCACATACAACCTTACAGAGTCAGAGGAGAATTAAATCACCACCTGATGTTGTCCAAAAGGATACCTGCACTGAAGTGCTGGCATT
ATGCAGTGTTTATGAAGGTGAGCTGCTGTTCCCTGTTACTATGTGCATACACTTGTTCCATAAAATGAGAAGTCGAAAGAAAGTAAACACGTAA
Protein sequence
MGSEQNRFPQQERNLRVGFWILCSVHLDKSVTMEHQESTGFGMPPAVNTLMLDLWTPIIEKSSMNWICGKFLSFQKGGCLSVPQQVILSRIVRLLDNFENDPCFLVGLTF
LPSENLLVVCCDYRGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRLPAVGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSS
TMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDF
PPQWNPPVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFGFSADEIII
TTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPPLLAENLNSAHTTLQSQRRIKSPPDVVQKDTCTEVLALCSVYEGELLFPVTMCIHLFHKMRSRKKVNT