; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G000010 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G000010
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationCmo_Chr09:751..5556
RNA-Seq ExpressionCmoCh09G000010
SyntenyCmoCh09G000010
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022936805.1 uncharacterized protein At1g76660-like isoform X2 [Cucurbita moschata]6.2e-26899.79Show/hide
Query:  MGSEQNRFPQQER-GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM
        MGSEQNRFPQQER GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM
Subjt:  MGSEQNRFPQQER-GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM

Query:  SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL
        SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL
Subjt:  SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL

Query:  VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ
        VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ
Subjt:  VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ

Query:  NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA
        NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA
Subjt:  NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA

Query:  LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

XP_022936806.1 uncharacterized protein At1g76660-like isoform X3 [Cucurbita moschata]2.5e-269100Show/hide
Query:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
        MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
Subjt:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV
        ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV

Query:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN
        SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN
Subjt:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN

Query:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL
        RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL
Subjt:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL

Query:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

XP_022975616.1 uncharacterized protein At1g76660-like isoform X3 [Cucurbita maxima]1.3e-26598.52Show/hide
Query:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
        MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPP AGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
Subjt:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV
        ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSS+DLKG GKENYIASNDLQTAYSLYPGSPSSSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV

Query:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN
        SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNV QN
Subjt:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN

Query:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL
        RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEP LLAENLNSAHTTLQS RRIKSPPDVVQKDTCTEVLAL
Subjt:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL

Query:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        C+VYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

XP_023521116.1 uncharacterized protein At1g76660-like isoform X3 [Cucurbita pepo subsp. pepo]4.9e-26598.94Show/hide
Query:  MGSEQNRFPQQER-GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM
        MGSEQNRFPQQER GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM
Subjt:  MGSEQNRFPQQER-GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM

Query:  SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL
        SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL
Subjt:  SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL

Query:  VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ
        VSPISRTSGDCLSSFPERDF PQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ
Subjt:  VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ

Query:  NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA
        NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEP LLAENLNSAHTTLQS RRIKSPPDVVQKDTCTEVLA
Subjt:  NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA

Query:  LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        LC+VYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

XP_023521117.1 uncharacterized protein At1g76660-like isoform X4 [Cucurbita pepo subsp. pepo]2.0e-26699.15Show/hide
Query:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
        MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
Subjt:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV
        ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV

Query:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN
        SPISRTSGDCLSSFPERDF PQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN
Subjt:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN

Query:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL
        RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEP LLAENLNSAHTTLQS RRIKSPPDVVQKDTCTEVLAL
Subjt:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL

Query:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        C+VYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

TrEMBL top hitse value%identityAlignment
A0A6J1F8I2 uncharacterized protein At1g76660-like isoform X31.2e-269100Show/hide
Query:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
        MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
Subjt:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV
        ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV

Query:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN
        SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN
Subjt:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN

Query:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL
        RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL
Subjt:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL

Query:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

A0A6J1F9B9 uncharacterized protein At1g76660-like isoform X12.5e-26298.71Show/hide
Query:  RFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGG
        +F   ++GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGG
Subjt:  RFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGG

Query:  PSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRT
        PSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRT
Subjt:  PSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRT

Query:  SGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSP
        SGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSP
Subjt:  SGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSP

Query:  KQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLALCTVYED
        KQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLALCTVYED
Subjt:  KQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLALCTVYED

Query:  NKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        NKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  NKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

A0A6J1FER5 uncharacterized protein At1g76660-like isoform X23.0e-26899.79Show/hide
Query:  MGSEQNRFPQQER-GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM
        MGSEQNRFPQQER GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM
Subjt:  MGSEQNRFPQQER-GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM

Query:  SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL
        SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL
Subjt:  SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL

Query:  VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ
        VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ
Subjt:  VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ

Query:  NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA
        NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA
Subjt:  NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA

Query:  LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

A0A6J1IEQ0 uncharacterized protein At1g76660-like isoform X36.2e-26698.52Show/hide
Query:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
        MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPP AGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS
Subjt:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMS

Query:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV
        ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSS+DLKG GKENYIASNDLQTAYSLYPGSPSSSLV
Subjt:  ANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLV

Query:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN
        SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNV QN
Subjt:  SPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQN

Query:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL
        RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEP LLAENLNSAHTTLQS RRIKSPPDVVQKDTCTEVLAL
Subjt:  RHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLAL

Query:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        C+VYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  CTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

A0A6J1IH75 uncharacterized protein At1g76660-like isoform X21.5e-26498.31Show/hide
Query:  MGSEQNRFPQQER-GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM
        MGSEQNRFPQQER GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPP AGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM
Subjt:  MGSEQNRFPQQER-GKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSM

Query:  SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL
        SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSS+DLKG GKENYIASNDLQTAYSLYPGSPSSSL
Subjt:  SANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSL

Query:  VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ
        VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNV Q
Subjt:  VSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQ

Query:  NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA
        NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEP LLAENLNSAHTTLQS RRIKSPPDVVQKDTCTEVLA
Subjt:  NRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLA

Query:  LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
        LC+VYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD
Subjt:  LCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766602.4e-12156.42Show/hide
Query:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNRPPAAGMAIQ--ATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFL
        MGSE      Q++ K+WGGC G  SCF SQKG KRIVPASR+PE GNV  +QPN    AG+     A  I+ SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNRPPAAGMAIQ--ATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSS
        S++ANSPGGPSS+M+ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A FL+SS+DLK +GK +Y   NDLQ  YSLYPGSP+S
Subjt:  SMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSS

Query:  SLVSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSPG--
        +L SPISR SGD L               +SPQ+GK  R+ SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY     
Subjt:  SLVSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSPG--

Query:  GNVLQNRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQS-LRRIKSPPDVVQKDT
        GN  QNR N+SPKQD+EELEAYRASFGFSADEIITT+QYVEI+ VM+ SF    ++ +        E +LL++    +   L S +   +SP        
Subjt:  GNVLQNRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQS-LRRIKSPPDVVQKDT

Query:  CTEVLALCTVYEDNKLQRQPGNMSGSSTLNQVGSD---VFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAK
                  Y+D+K QR           N++ +D   + SR+G  K SR Y+  +S SDAEV+YRRGRSLRE++
Subjt:  CTEVLALCTVYEDNKLQRQPGNMSGSSTLNQVGSD---VFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAK

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)1.2e-2742.72Show/hide
Query:  KKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFAT
        +KW   W  L CF S +  KRI  +  +PE   V+   +    +    ++ +     +APPSSPASF  S  PS  QSP   LS S   P     ++FA 
Subjt:  KKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSSTMFAT

Query:  GPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDC
        GPYAHETQLVSPPVFS +TTEPS+AP+TPP + + +    TTPSSP+VPFA+  +S+      G +  ++S+     Y L PGSP   L+SP S  SG  
Subjt:  GPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHL----TTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDC

Query:  LSSFPE
         S FP+
Subjt:  LSSFPE

AT1G76660.1 FUNCTIONS IN: molecular_function unknown1.7e-12256.42Show/hide
Query:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNRPPAAGMAIQ--ATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFL
        MGSE      Q++ K+WGGC G  SCF SQKG KRIVPASR+PE GNV  +QPN    AG+     A  I+ SLLAPPSSPASFTNSALPST QSP+C+L
Subjt:  MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPE-GNVVTTQPNRPPAAGMAIQ--ATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFL

Query:  SMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSS
        S++ANSPGGPSS+M+ATGPYAHETQLVSPPVFS FTTEPSTAP TPPPELA LT PSSPDVP+A FL+SS+DLK +GK +Y   NDLQ  YSLYPGSP+S
Subjt:  SMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSS

Query:  SLVSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSPG--
        +L SPISR SGD L               +SPQ+GK  R+ SG  FG++  G S   Q+SNFFCP TFA+FYLD +P  P  GGRLSVSKDSDVY     
Subjt:  SLVSPISRTSGDCLSSFPERDFPPQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLD-NPPFPHTGGRLSVSKDSDVYSPG--

Query:  GNVLQNRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQS-LRRIKSPPDVVQKDT
        GN  QNR N+SPKQD+EELEAYRASFGFSADEIITT+QYVEI+ VM+ SF    ++ +        E +LL++    +   L S +   +SP        
Subjt:  GNVLQNRHNKSPKQDVEELEAYRASFGFSADEIITTTQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQS-LRRIKSPPDVVQKDT

Query:  CTEVLALCTVYEDNKLQRQPGNMSGSSTLNQVGSD---VFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAK
                  Y+D+K QR           N++ +D   + SR+G  K SR Y+  +S SDAEV+YRRGRSLRE++
Subjt:  CTEVLALCTVYEDNKLQRQPGNMSGSSTLNQVGSD---VFSRIGPSKNSRKYNLGLSCSDAEVDYRRGRSLREAK

AT4G25620.1 hydroxyproline-rich glycoprotein family protein3.2e-2842.73Show/hide
Query:  SEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAI--------QATVIDPSLLAPPSSPASFTNSALPSTAQS--
        S ++R       KK G  W    CF S+K  KRI  A  +PE          P A+G A+         +T I    +APPSSPASF  S  PS + +  
Subjt:  SEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAI--------QATVIDPSLLAPPSSPASFTNSALPSTAQS--

Query:  PSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLK-----GTGKENYIASNDLQTA
        P    S++ N P  PS+  F  GPYAHETQ V+PPVFSAFTTEPSTAP TPPPE     +PSSP+VPFA+ L+SS++       G   + + A++    +
Subjt:  PSCFLSMSANSPGGPSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLK-----GTGKENYIASNDLQTA

Query:  YSLYPGSPSSSLVSPISRTS
          +YPGSP  +L+SP S TS
Subjt:  YSLYPGSPSSSLVSPISRTS

AT5G52430.1 hydroxyproline-rich glycoprotein family protein1.2e-3045.79Show/hide
Query:  PQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSAN--SPGG
        P   +  +WG CW   SCF +QK  KRI  A  +PE   VT+          A   TV+ P  +APPSSPASF  S   S + SP   LS+++N  SP  
Subjt:  PQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSAN--SPGG

Query:  PSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAEFLSSSVDL----KGTGKENYIASNDLQ-TAYSLYPGSP-SSSL
        P S +F  GPYA+ETQ V+PPVFSAF TEPSTAP TPPPE + H+TTPSSP+VPFA+ L+SS++L      +G     +S+  +  +  + PGSP   +L
Subjt:  PSSTMFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELA-HLTTPSSPDVPFAEFLSSSVDL----KGTGKENYIASNDLQ-TAYSLYPGSP-SSSL

Query:  VSPISRTSGDCLSS
        +SP S  S    SS
Subjt:  VSPISRTSGDCLSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTCCGAGCAGAATAGATTCCCTCAGCAGGAACGGGGAAAGAAATGGGGTGGATGCTGGGGTGCATTATCTTGTTTTCACTCGCAGAAAGGAGAAAAGCGCATTGT
ACCTGCATCTCGTTTACCTGAGGGAAATGTTGTGACAACCCAGCCAAATAGACCTCCAGCAGCCGGAATGGCCATCCAGGCTACAGTGATAGATCCATCCCTACTAGCCC
CACCTTCTTCTCCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCGAGCTGTTTCTTGTCGATGTCTGCCAACTCACCTGGAGGTCCTTCATCGACA
ATGTTTGCTACAGGGCCATATGCACACGAAACACAGCTGGTTTCGCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCTCCAGAACT
AGCTCACCTGACCACACCTTCTTCCCCCGATGTGCCGTTTGCTGAGTTCCTATCCTCATCAGTGGATCTTAAAGGAACAGGAAAGGAAAATTACATTGCTTCAAATGATC
TTCAAACTGCATATTCTCTCTACCCTGGAAGTCCTTCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGTGATTGCTTATCATCATTTCCTGAAAGGGACTTCCCA
CCGCAGTGGAATCCTTCAGTTTCTCCCCAAGATGGAAAATATCCTAGAACTGGTTCCGGTCGGCTATTTGGACATGAGAAAGCTGGTACATCTTTGGTATCTCAGGATTC
TAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGTAGGTTAAGTGTATCAAAGGATTCAGATGTTTACTCGCCTG
GTGGGAATGTACTCCAAAATCGGCACAATAAGTCTCCAAAACAAGATGTGGAGGAACTAGAAGCATACCGAGCATCGTTTGGTTTCAGTGCAGATGAAATTATAACTACT
ACACAATATGTGGAGATATCTGGAGTAATGGAGGATTCCTTTACTATGAAGCCTTTCACTTCAACTAGTCTGTCAGCAGAAGAAAGTTTTGAACCATCATTGTTGGCTGA
AAATCTAAATTCCGCACATACAACCTTACAGAGTCTGAGGAGAATTAAATCACCACCTGATGTTGTCCAAAAGGATACCTGCACTGAAGTGCTGGCATTATGCACTGTTT
ATGAAGATAATAAATTGCAAAGACAACCTGGTAACATGTCAGGATCAAGTACCTTAAACCAAGTTGGATCAGATGTATTTTCAAGGATAGGGCCATCAAAAAATAGTCGG
AAGTATAATCTTGGTTTATCCTGCTCTGATGCGGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGAGAGGCCAAGGGAGATTTTTTATGGCATGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTCCGAGCAGAATAGATTCCCTCAGCAGGAACGGGGAAAGAAATGGGGTGGATGCTGGGGTGCATTATCTTGTTTTCACTCGCAGAAAGGAGAAAAGCGCATTGT
ACCTGCATCTCGTTTACCTGAGGGAAATGTTGTGACAACCCAGCCAAATAGACCTCCAGCAGCCGGAATGGCCATCCAGGCTACAGTGATAGATCCATCCCTACTAGCCC
CACCTTCTTCTCCAGCATCCTTTACAAATTCTGCACTCCCTTCAACAGCCCAATCACCGAGCTGTTTCTTGTCGATGTCTGCCAACTCACCTGGAGGTCCTTCATCGACA
ATGTTTGCTACAGGGCCATATGCACACGAAACACAGCTGGTTTCGCCTCCTGTTTTCTCAGCCTTCACCACTGAACCGTCAACTGCTCCACTCACTCCCCCTCCAGAACT
AGCTCACCTGACCACACCTTCTTCCCCCGATGTGCCGTTTGCTGAGTTCCTATCCTCATCAGTGGATCTTAAAGGAACAGGAAAGGAAAATTACATTGCTTCAAATGATC
TTCAAACTGCATATTCTCTCTACCCTGGAAGTCCTTCCAGTAGCCTCGTGTCACCAATTTCAAGGACCTCCGGTGATTGCTTATCATCATTTCCTGAAAGGGACTTCCCA
CCGCAGTGGAATCCTTCAGTTTCTCCCCAAGATGGAAAATATCCTAGAACTGGTTCCGGTCGGCTATTTGGACATGAGAAAGCTGGTACATCTTTGGTATCTCAGGATTC
TAATTTCTTCTGCCCTGCTACATTTGCACAATTCTATCTGGACAATCCACCATTCCCTCATACTGGTGGTAGGTTAAGTGTATCAAAGGATTCAGATGTTTACTCGCCTG
GTGGGAATGTACTCCAAAATCGGCACAATAAGTCTCCAAAACAAGATGTGGAGGAACTAGAAGCATACCGAGCATCGTTTGGTTTCAGTGCAGATGAAATTATAACTACT
ACACAATATGTGGAGATATCTGGAGTAATGGAGGATTCCTTTACTATGAAGCCTTTCACTTCAACTAGTCTGTCAGCAGAAGAAAGTTTTGAACCATCATTGTTGGCTGA
AAATCTAAATTCCGCACATACAACCTTACAGAGTCTGAGGAGAATTAAATCACCACCTGATGTTGTCCAAAAGGATACCTGCACTGAAGTGCTGGCATTATGCACTGTTT
ATGAAGATAATAAATTGCAAAGACAACCTGGTAACATGTCAGGATCAAGTACCTTAAACCAAGTTGGATCAGATGTATTTTCAAGGATAGGGCCATCAAAAAATAGTCGG
AAGTATAATCTTGGTTTATCCTGCTCTGATGCGGAAGTTGACTACAGAAGAGGAAGGAGCCTAAGAGAGGCCAAGGGAGATTTTTTATGGCATGACTAAGAGAGCCATCT
CTGCTAGTTTACAGAATGTTTATCTTGTCTCACTCGATGGTTTAACGATATAGTTTGTGTTCCTTATCGTGCCAATTTATGGAATGGATCGAATTTCTGTATTTGGTTGA
TGAAGTTTACTTTTTTTCAATTAAAAATGATACAATCTGCTTGCATGTTCTGTACAACCGTCGTCAGTGGGTGGTTGTTAAAAGTTAG
Protein sequenceShow/hide protein sequence
MGSEQNRFPQQERGKKWGGCWGALSCFHSQKGEKRIVPASRLPEGNVVTTQPNRPPAAGMAIQATVIDPSLLAPPSSPASFTNSALPSTAQSPSCFLSMSANSPGGPSST
MFATGPYAHETQLVSPPVFSAFTTEPSTAPLTPPPELAHLTTPSSPDVPFAEFLSSSVDLKGTGKENYIASNDLQTAYSLYPGSPSSSLVSPISRTSGDCLSSFPERDFP
PQWNPSVSPQDGKYPRTGSGRLFGHEKAGTSLVSQDSNFFCPATFAQFYLDNPPFPHTGGRLSVSKDSDVYSPGGNVLQNRHNKSPKQDVEELEAYRASFGFSADEIITT
TQYVEISGVMEDSFTMKPFTSTSLSAEESFEPSLLAENLNSAHTTLQSLRRIKSPPDVVQKDTCTEVLALCTVYEDNKLQRQPGNMSGSSTLNQVGSDVFSRIGPSKNSR
KYNLGLSCSDAEVDYRRGRSLREAKGDFLWHD