; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G30520 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G30520
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionSUN domain-containing protein
Genome locationChr5:28404786..28412441
RNA-Seq ExpressionCSPI05G30520
SyntenyCSPI05G30520
Gene Ontology termsGO:0006486 - protein glycosylation (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0043621 - protein self-association (molecular function)
InterPro domainsIPR004263 - Exostosin-like
IPR008979 - Galactose-binding-like domain superfamily
IPR012919 - SUN domain
IPR040911 - Exostosin, GT47 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039335.1 putative glycosyltransferase [Cucumis melo var. makuwa]0.0e+0095.76Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS
        MRKPVGALLHDRRAV+VPISGRNHLYKVSISLVFILWGL+FLFSLW S G GCQE SILLPDGVSTTNESKLENNKDSDVL EPPNGESHCTIHLNNSCS
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS

Query:  INASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGA
        INAS+PGSDNE+LSSEESSSHIQATTRLPED SSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGA
Subjt:  INASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGA

Query:  KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW
        KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW
Subjt:  KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW

Query:  VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVDEVSHRRELQSLANEEGDDGVDIELSKSNTPEPVE
        VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEAT DKRVIPSQPGP+DEVSH RELQSLANEEG DGVD+ELSKSNTP+PVE
Subjt:  VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVDEVSHRRELQSLANEEGDDGVDIELSKSNTPEPVE

Query:  ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGL
        ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQ DIRNILKIQD TDKDLRDLISWKSMVSLQLDGL
Subjt:  ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGL

Query:  QRHNSILRSEIERVQKNQISLENKGIVVFLEESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIY
        QRHNSILRSEIERVQKNQ SLENKGIV   EE QKTVAKDKEANGKSA PGISKT+ YSKLKKLEEKLGRARAAIR+ASQ+HNLTSIHHDPDYVPTGPIY
Subjt:  QRHNSILRSEIERVQKNQISLENKGIVVFLEESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIY

Query:  RNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINV
        RNPNAFHRSY+EME+LLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIG AITDYINV
Subjt:  RNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINV

Query:  ISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLL
        IS KHPFWDRSLGADHFMLSCHDWGPRTTS+VPLLFNNSIRVLCNANVSEGF PSKDASFPEIHLRTGEIDGL+GGLSPSRRSVLAFFAGRLHGHIRYLL
Subjt:  ISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLL

Query:  LQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQY
        LQEWKEKDEDVLVY+ELPSGISY+SMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNW SF+VQIQVKDIPNIKKIL GISQTQY
Subjt:  LQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQY

Query:  LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
        LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
Subjt:  LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD

KAE8648979.1 hypothetical protein Csa_009042 [Cucumis sativus]0.0e+0099.26Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS
        MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS

Query:  INASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGA
        INASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVS+GKSETGQAGNTIHRLEPGGAEYNYASASKGA
Subjt:  INASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGA

Query:  KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW
        KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW
Subjt:  KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW

Query:  VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVDEVSHRRELQSLANEEGDDGVDIELSKSNTPEPVE
        VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGP+DEVSHRRELQS+ANEEGDDGVDIELSKSNTPEPVE
Subjt:  VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVDEVSHRRELQSLANEEGDDGVDIELSKSNTPEPVE

Query:  ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGL
        ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGL
Subjt:  ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGL

Query:  QRHNSILRSEIERVQKNQISLENKGIVVFLEESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIY
        QRHNSILRSEIERVQKNQISLENKGI    EESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIY
Subjt:  QRHNSILRSEIERVQKNQISLENKGIVVFLEESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIY

Query:  RNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINV
        RNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINV
Subjt:  RNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINV

Query:  ISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLL
        ISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLL
Subjt:  ISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLL

Query:  LQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQY
        LQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQY
Subjt:  LQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQY

Query:  LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
        LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
Subjt:  LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD

KAF3440963.1 hypothetical protein FNV43_RR19249 [Rhamnella rubrinervis]0.0e+0062.34Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS
        M++   ALL  RRA++  I+GRN    VS+SL F+LWGLVFLFSLW S G G  +  + L  G+ST NE+KL++ K SD     P  E+   +H ++  S
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS

Query:  INASTPGS----------DNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAE
         N  TP S          +N+  S+E S +++    + PE  SSS+  K E+  PK D  S  V +GL+EFKSR + ++ KS  G AG   HR+EPGGAE
Subjt:  INASTPGS----------DNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAE

Query:  YNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHA
        YNYAS SKGAKVLAFNKE+KGASNILG+D+DKYLRNPCS E KFV+IELSEETLV TIEIANFEH+SSNLK+FE+ GSLVYPTD W KLGNFTAPN K A
Subjt:  YNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHA

Query:  HRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQP----GPVDEVSHRRELQSLANEEGDDGV
         RFVL++PKWVRYLKLN L+HYGSEFYCTLS VEV+G+DAVE MLEDLIS Q    +S   T D++ + SQP    G     +  +E+ S A   G+  V
Subjt:  HRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQP----GPVDEVSHRRELQSLANEEGDDGV

Query:  DIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDL
        + E+ KS+ P+PVEE+ HQQ GRMPGDTV+KIL QKVR+LD++LSVLERYLE+LTS+YGNIFKE DKDIG+ ++L+EK +AD+RN+L  Q +  K++ DL
Subjt:  DIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDL

Query:  ISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFLEES--------------------------------------------------QKT
        +SWKS+VS QLD L R N+ILR E+E+V++ Q S+E K +V+FL +S                                                     
Subjt:  ISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFLEES--------------------------------------------------QKT

Query:  VAKDKEANGKSATPGISKT-ERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPC
        V +DK+   ++     +K  +RYSKL+KLE  L RAR +I+EA+Q+ NLTSIH D DYVP GPIYRN NAFH SY+EMEKL KIYVY+EG+PP+FH GPC
Subjt:  VAKDKEANGKSATPGISKT-ERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPC

Query:  KSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLL
        KSIYSTEGRFIHEMEKGN + T DPD+AL+YFLPFSVV +V+YLY P+SH+  AI  AITDYINVIS+KHPFW+RSLGADHFMLSCHDWGP T+S+VP L
Subjt:  KSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLL

Query:  FNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLC
        F+ SIRVLCNAN SEGF PSKD SFPEIHLRTGEI GL+GG SPSRRS+LAFFAGRLHGHIRYLLL++WKEKD+DV VYD+LPSG+SY+SMLKKS+FCLC
Subjt:  FNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLC

Query:  PSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWL
        PSGYEVASPRVVEAIYAECVPVLIS+ YVPPFSDVLNW SF+VQ+QVKDIPNIKKIL GISQ+QYLRM RRVKQVQRHFV NG PKRFD FHMI+HSIWL
Subjt:  PSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWL

Query:  RRLNIHIQD
        RRLN+ I++
Subjt:  RRLNIHIQD

KAG6592335.1 putative glycosyltransferase, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0078.88Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIH------
        MR+ VGALL DRRAV+V ISGRNHL KVS+SLVF+LWGL+FLFSLWF  G GCQE S+LLPDG S +NES LE+NKDSDVL EP  GE+ CT H      
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIH------

Query:  ----------------------LNNSCSINASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKS
                              LN+SCSI+A++  SDNE+LSSEESSSH+ A T LPE  SSST VK ESKP K DISSDTVLLGLEEFKSR F SR K 
Subjt:  ----------------------LNNSCSINASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKS

Query:  ETGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYP
        ETGQAGNTIHR+EPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFE+HGSLVYP
Subjt:  ETGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYP

Query:  TDVWFKLGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVD-EVSHR
        TDVWFKLGNFTAPNAKHAHRFVLKDPKWVRYLKLN LTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEAT DKRV PSQPGP D    HR
Subjt:  TDVWFKLGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVD-EVSHR

Query:  RELQSLANEEG-DDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIR
        RE QSLANEE  DD V +ELSKSN P+PVEESHHQQPGRMPGDTVLKILTQKVRSLD SLSVLERYLED TSKYGNIFKEFDKDIGNN LLIEKT+ DIR
Subjt:  RELQSLANEEG-DDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIR

Query:  NILKIQDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVF-----------------------------------------
        NILK+QD+TDKDL DLISWKS VSLQLDGLQRHN+ILRSEIERVQKNQ  LENKGIVVF                                         
Subjt:  NILKIQDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVF-----------------------------------------

Query:  ----------------------LEESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFH
                              +E SQ TV ++KE  GK ATPGIS+ ERYSKL+K+EEKLGRARAAIREA ++ NLTS+H DPDYVP GPIYRNPNAFH
Subjt:  ----------------------LEESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFH

Query:  RSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPF
        RSY+EME+LLKIY+YKEGEPPMFH GPCKSIYSTEGRFIHEMEKGN YTTNDPDQALLYFLPFSVVNLVQYLY PNSH+VNAIG A+ DYI+VISNKH F
Subjt:  RSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPF

Query:  WDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEK
        W+RSLGADHFMLSCHDWGPRTTS+VP LFNNSIRVLCNANVSEGF PSKDASFPEIHLRTGEIDGLLGGLSPSRR +LAFFAGRLHGHIRYLLLQ+WKEK
Subjt:  WDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEK

Query:  DEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRV
        D+DV+VYDELPSG+SY+SMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSF VQI+VKDI NIK+IL GISQ+QYLRMQRRV
Subjt:  DEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRV

Query:  KQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
        KQVQRHFV+NGTPKR+DAFHMILHSIWLRRLN+HIQD
Subjt:  KQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD

TQD89737.1 hypothetical protein C1H46_024731 [Malus baccata]0.0e+0059.96Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGES---HCT-----
        M++   ALL +RRA  + ISGR+ LYKVS+SLVF+LWGLVFLFSLWFS G G ++ S + P G+ST +E+KL+ ++  D+ +E   G S    CT     
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGES---HCT-----

Query:  ------IHLNNSCSINASTPGSDNEVL--------SSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAG
                       +AST GS  + L        S+E S  H  A    PE  ++ + VK E+  PK       V LGL+EFKS+   S+ KS  GQAG
Subjt:  ------IHLNNSCSINASTPGSDNEVL--------SSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAG

Query:  NTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFK
           HR+EPGGAEYNYASA+KGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAE KFV IELSEETLV TIEIAN EH+SSNLK+FEV GSL YPT+ W  
Subjt:  NTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFK

Query:  LGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPG--PVDEVSHRRELQS
        LGN TA N K   RFVL+ PKWVRY+KL  L+HYGSEFYCTLS +E+YG+DAVE MLEDLIS +    +S+ AT D++ +PS P    VDE  H    +S
Subjt:  LGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPG--PVDEVSHRRELQS

Query:  LAN-EEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKI
              G   V+ ++  S  P+ V+E HHQQ  RMPGDTVLKIL QKVRSLD SLSVLERYLE+ TSKYG+IF EFDKD+G  +  ++K + DIRN+++ 
Subjt:  LAN-EEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKI

Query:  QDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFL---------------------------------------------
        Q+   KD+ +LISW+S+V++QL+ L R N+ILRSE+E+V++ QIS++NK +V  L                                             
Subjt:  QDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFL---------------------------------------------

Query:  --------EESQKTV-----------------AKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNA
                 ++ +TV                  +  E NG SA    +  +RYS+L+KLE  L   RA+IREA+++ NLTS H DPDYVP GPIYRN NA
Subjt:  --------EESQKTV-----------------AKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNA

Query:  FHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKH
        FHRSY++MEK  KIYVY+EGEPP+FH GPCKSIYSTEGRFIHEME  N+Y T DPDQAL+YFLPFSVV LVQYLYV +SH+   IG A+ DY+NVIS+KH
Subjt:  FHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKH

Query:  PFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWK
        PFW+RSLGADHFMLSCHDWGP T+++VP L+ NSIRVLCNAN SEGF PSKD SFPEIHLRTGE  GLLGGLSPSRRS+LAFFAGRLHGHIRYLLL EWK
Subjt:  PFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWK

Query:  EKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQR
        EKD+DV VYD+LP+G+SY+SMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVL+S+SYVPPFSDVL W SF+VQ+QVKDIPNIK+IL GISQ+QYLRMQR
Subjt:  EKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQR

Query:  RVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
        RVKQVQRHFV+NG  KRFD FHMI+HSIWLRRLNI I+D
Subjt:  RVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD

TrEMBL top hitse value%identityAlignment
A0A314YQN6 Putative glycosyltransferase0.0e+0063.35Show/hide
Query:  SSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEE
        SS + VK E+  PK       V LGL+EFKS+ F S+ KS  GQAG+  HR+EPGGAEYNYASA+KGAKVLAFNKEAKGASNILG+DKDKYLRNPCSAE 
Subjt:  SSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEE

Query:  KFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVE
        KFV IELSEETLV TI+IAN EH+SSNLK FE+ GSLVYPTD W  LGNFTA N K A R+ L++PKWVRY+KLN L+H+GSEFYCTLS +E+YG+DAVE
Subjt:  KFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVE

Query:  MMLEDLISAQHKPSISDEATHDKRVIPSQPG--PVDEVSHR--RELQSLANEEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDL
         MLEDLIS +  P +S+ AT D++   S P    VDE  H   +EL+   +  G   +  E+ KS  P+ ++E  H Q  RMPGDTVLKIL QKVRSLD 
Subjt:  MMLEDLISAQHKPSISDEATHDKRVIPSQPG--PVDEVSHR--RELQSLANEEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDL

Query:  SLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVV
        SLSVLERYLE+  SKYG+IF+EFDKD+G  +L ++K + DIRN+L+ Q+   KD+ +LISW+S+VS+QL  L R N+ILRSE+E+V++ Q S++NK +V 
Subjt:  SLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVV

Query:  F--------------------------------------------------------------LEESQKTVAKDK-----EANGKSATPGISKTERYSKL
                                                                       + E Q+ V  D+     E N  SA+    K  R+S+L
Subjt:  F--------------------------------------------------------------LEESQKTVAKDK-----EANGKSATPGISKTERYSKL

Query:  KKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEME-KGNLYTTNDP
        +KLE  L   RA+IREA+++ NLTS H DPDYVP GPIYRN NAFHRSY+EME+L KIYVY+EG+PP+FH GPCKSIYSTEGRFIHEME   N+Y T DP
Subjt:  KKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEME-KGNLYTTNDP

Query:  DQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASF
        D+AL+YFLPFSVV LVQYLY  +SH  ++IG A+ DY+NVIS+KHPFW+RSLGADHFMLSCHDWGPRT+S+VP L++ SIRVLCNAN SEGF PSKDASF
Subjt:  DQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASF

Query:  PEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLIS
        PEIHLRTGE  GL+GGLSPSRRS+LAFFAGRLHGHIRYLLL EWKEKD+DV VYD+LP G+SY+SMLKKSRFCLCPSGYEVASPRVVEAIYAEC+PVLIS
Subjt:  PEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLIS

Query:  ESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
        +SYVPPFSDVL+W SF+VQ+QVKDIPNIK IL GISQ+QYLRM RRVKQVQRHFV+NG  KRFD F+MI+HSIWLRRLNI I+D
Subjt:  ESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD

A0A498IR71 SUN domain-containing protein0.0e+0056.31Show/hide
Query:  LHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGES---HCT-----------IH
        L +RRA  + ISGRN LYKVS+SLVF+LWGLVFLFSLWFS G G ++ S + P G+ST +E+KL+ ++  D+ +E   G S    CT             
Subjt:  LHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGES---HCT-----------IH

Query:  LNNSCSINASTPGSDNEVL--------SSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEP
               +AS  GS  + L        S+E S  H  A    PE  ++ + VK E+  PK       V LGL+EFKS+ F S+ KS  GQAG   HR+EP
Subjt:  LNNSCSINASTPGSDNEVL--------SSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEP

Query:  GGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPN
        GGAEYNYASA+KGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAE KFV IELSEETLV TIEIAN EH+SSNLK+FEV GSL YPT+ W  LGN TA N
Subjt:  GGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPN

Query:  AKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQP--GPVDEVSHRRELQSLAN-EEGD
         K   RFVL+ PKWVRY+KL  L+HYGSEFYCTLS +E+YG+DAVE MLEDLIS +    +S+ AT D++ +PS P    VDE  H    +S      G 
Subjt:  AKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQP--GPVDEVSHRRELQSLAN-EEGD

Query:  DGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDL
          V+ ++  S  P+PV+E  HQQ  RMPGDTVLKIL QKVRSLD SLSVLERYLE+ TSKYG+IF EFDKD+G     ++K + DIRN+++ Q+   KD+
Subjt:  DGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDL

Query:  RDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFL-----------------------------------------------------
         +LISW+S+V++QL+ L R N+ILRSE+E+V++ QIS++NKGI++FL                                                     
Subjt:  RDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFL-----------------------------------------------------

Query:  ---------------------------------------------------------------------------------EESQKTV------------
                                                                                          +S +TV            
Subjt:  ---------------------------------------------------------------------------------EESQKTV------------

Query:  ---AKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGP
            +  E NG SA    +  +RYS+L+KLE  L   RA+IREA+++ NLTS H DPDYVP GPIYRN NAFHRSY++MEK  KIYVY+EGEPP+FH GP
Subjt:  ---AKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGP

Query:  CKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPL
        CKSIYSTEGRFIHEME  N+Y T DPDQAL+YFLPFSVV LVQYLYV +SH+   IG A+ DY+NVIS+KHPFW+RSLGADHFMLSCHDWGP T+++VP 
Subjt:  CKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPL

Query:  LFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCL
        L+ NSIRVLCNAN SEGF PSKD SFPEIHLRTGE  GLLGGLSPSRRS+LAFFAGRLHGHIRYLLL EWKEKD+DV VYD+LP+G+SY+SMLKKSRFCL
Subjt:  LFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCL

Query:  CPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIW
        CPSGYEVASPRVVEAIYAECVPVLIS+SYVPPFSDVL W SF+VQ+QVKDIPNIK+IL GISQ+QYLRMQRRVKQVQRHFV+NG  KRFD FHMI+HSIW
Subjt:  CPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIW

Query:  LRRLNIHIQD
        LRRLNI I+D
Subjt:  LRRLNIHIQD

A0A540LTE3 SUN domain-containing protein0.0e+0059.96Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGES---HCT-----
        M++   ALL +RRA  + ISGR+ LYKVS+SLVF+LWGLVFLFSLWFS G G ++ S + P G+ST +E+KL+ ++  D+ +E   G S    CT     
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGES---HCT-----

Query:  ------IHLNNSCSINASTPGSDNEVL--------SSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAG
                       +AST GS  + L        S+E S  H  A    PE  ++ + VK E+  PK       V LGL+EFKS+   S+ KS  GQAG
Subjt:  ------IHLNNSCSINASTPGSDNEVL--------SSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAG

Query:  NTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFK
           HR+EPGGAEYNYASA+KGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAE KFV IELSEETLV TIEIAN EH+SSNLK+FEV GSL YPT+ W  
Subjt:  NTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFK

Query:  LGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPG--PVDEVSHRRELQS
        LGN TA N K   RFVL+ PKWVRY+KL  L+HYGSEFYCTLS +E+YG+DAVE MLEDLIS +    +S+ AT D++ +PS P    VDE  H    +S
Subjt:  LGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPG--PVDEVSHRRELQS

Query:  LAN-EEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKI
              G   V+ ++  S  P+ V+E HHQQ  RMPGDTVLKIL QKVRSLD SLSVLERYLE+ TSKYG+IF EFDKD+G  +  ++K + DIRN+++ 
Subjt:  LAN-EEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKI

Query:  QDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFL---------------------------------------------
        Q+   KD+ +LISW+S+V++QL+ L R N+ILRSE+E+V++ QIS++NK +V  L                                             
Subjt:  QDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFL---------------------------------------------

Query:  --------EESQKTV-----------------AKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNA
                 ++ +TV                  +  E NG SA    +  +RYS+L+KLE  L   RA+IREA+++ NLTS H DPDYVP GPIYRN NA
Subjt:  --------EESQKTV-----------------AKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNA

Query:  FHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKH
        FHRSY++MEK  KIYVY+EGEPP+FH GPCKSIYSTEGRFIHEME  N+Y T DPDQAL+YFLPFSVV LVQYLYV +SH+   IG A+ DY+NVIS+KH
Subjt:  FHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKH

Query:  PFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWK
        PFW+RSLGADHFMLSCHDWGP T+++VP L+ NSIRVLCNAN SEGF PSKD SFPEIHLRTGE  GLLGGLSPSRRS+LAFFAGRLHGHIRYLLL EWK
Subjt:  PFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWK

Query:  EKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQR
        EKD+DV VYD+LP+G+SY+SMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVL+S+SYVPPFSDVL W SF+VQ+QVKDIPNIK+IL GISQ+QYLRMQR
Subjt:  EKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQR

Query:  RVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
        RVKQVQRHFV+NG  KRFD FHMI+HSIWLRRLNI I+D
Subjt:  RVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD

A0A5A7TD36 Putative glycosyltransferase0.0e+0095.76Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS
        MRKPVGALLHDRRAV+VPISGRNHLYKVSISLVFILWGL+FLFSLW S G GCQE SILLPDGVSTTNESKLENNKDSDVL EPPNGESHCTIHLNNSCS
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS

Query:  INASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGA
        INAS+PGSDNE+LSSEESSSHIQATTRLPED SSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGA
Subjt:  INASTPGSDNEVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGA

Query:  KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW
        KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW
Subjt:  KVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKW

Query:  VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVDEVSHRRELQSLANEEGDDGVDIELSKSNTPEPVE
        VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEAT DKRVIPSQPGP+DEVSH RELQSLANEEG DGVD+ELSKSNTP+PVE
Subjt:  VRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVDEVSHRRELQSLANEEGDDGVDIELSKSNTPEPVE

Query:  ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGL
        ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQ DIRNILKIQD TDKDLRDLISWKSMVSLQLDGL
Subjt:  ESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGL

Query:  QRHNSILRSEIERVQKNQISLENKGIVVFLEESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIY
        QRHNSILRSEIERVQKNQ SLENKGIV   EE QKTVAKDKEANGKSA PGISKT+ YSKLKKLEEKLGRARAAIR+ASQ+HNLTSIHHDPDYVPTGPIY
Subjt:  QRHNSILRSEIERVQKNQISLENKGIVVFLEESQKTVAKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIY

Query:  RNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINV
        RNPNAFHRSY+EME+LLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIG AITDYINV
Subjt:  RNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINV

Query:  ISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLL
        IS KHPFWDRSLGADHFMLSCHDWGPRTTS+VPLLFNNSIRVLCNANVSEGF PSKDASFPEIHLRTGEIDGL+GGLSPSRRSVLAFFAGRLHGHIRYLL
Subjt:  ISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLL

Query:  LQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQY
        LQEWKEKDEDVLVY+ELPSGISY+SMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNW SF+VQIQVKDIPNIKKIL GISQTQY
Subjt:  LQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQY

Query:  LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
        LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
Subjt:  LRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD

A0A5N5FL53 SUN domain-containing protein1.3e-30254.63Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS
        M++   ALL +RRA  + ISGR+ LYKVS+SLVF+LWGLVFLFSLWFS G G ++ S + P G+ST +E+KL+ ++  D+ +E   G S         C+
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS

Query:  INASTPGSDNEVL---------------------------SSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSE
            T G + E                             S+E S  H  A    PE  ++ + VK E+  PK       V LGL+EFKS+ F S+ KS 
Subjt:  INASTPGSDNEVL---------------------------SSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSE

Query:  TGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPT
         GQAG   HR+EPGGAEYNYASA+KGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFV IELSEETLV TIEIAN EH+SSNLK+F V GSL YPT
Subjt:  TGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPT

Query:  DVWFKLGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPG--PVDEVSHR
        + W  LGN TA N K   RFVL+ PKWVRY+KL  L+HYGSEFYCTLST+E+YG+DAVE MLEDLIS +    +S+ AT D++ +PS P    VDE  H 
Subjt:  DVWFKLGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSISDEATHDKRVIPSQPG--PVDEVSHR

Query:  RELQSLAN-EEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIR
           +S      G   V+ ++  S   +PV+E  HQQ  RMPGDTVLKIL QKVRSLD SLSVLERYLE+ TSKYG+IF EFDKD+G  +  ++K + DIR
Subjt:  RELQSLAN-EEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIR

Query:  NILKIQDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFL----------------------------------------
        N+++ Q+    D+ +L SW+S+V++QL+ L R N+ILRSE+ERV++ QIS++NKG+++FL                                        
Subjt:  NILKIQDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFL----------------------------------------

Query:  --------------------------------------EESQKTV-----------------AKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAI
                                               +  +TV                  +  E NG+SA    +  +RYS+L+KLE  L   RA+I
Subjt:  --------------------------------------EESQKTV-----------------AKDKEANGKSATPGISKTERYSKLKKLEEKLGRARAAI

Query:  REASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNL
        REA++I NLTS H DPDYVP GPIYRN NAFHR                                                T DPDQAL+YFLPFSVV L
Subjt:  REASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNL

Query:  VQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLG
        VQYLYV +SH+   IG A+ DY+NVIS+KHPFW+RSLGADHFMLSCHDWGP T+++VP L+ NSIRVLCNAN SEGF PSKD SFPEIHLRTGE  GLLG
Subjt:  VQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLG

Query:  GLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNS
        GLSPSRR +LAFFAGRLHGHIRYLLL EWKEKD+DV VYD+LP+G+SY+SMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLIS+SYVPPFSDVL W S
Subjt:  GLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNS

Query:  FAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD
        F+VQ+QVKDIPNIK+IL GISQ+QYLRMQRRVKQVQRHFV+NG  KRFD FHMI+HSIWLRRLNI I+D
Subjt:  FAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD

SwissProt top hitse value%identityAlignment
F4I8I0 SUN domain-containing protein 41.0e-11545.36Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS
        M++   ALL  RR  +   +GRN  YKVS+SLVF++WGLVFL +LW SH  G +  S++        ++ + +   +S       +   H    L++   
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS

Query:  INASTPGSDNE-VLSSEESSSHIQATTRLPEDGSS--------STRVKP----ESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPG
        I A+     +E +L   E  + I     + E   +        +    P    E+   K D  S  V LGL+EFKSRA  SR KS +GQ    IHR+EPG
Subjt:  INASTPGSDNE-VLSSEESSSHIQATTRLPEDGSS--------STRVKP----ESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPG

Query:  GAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNA
        G EYNYA+ASKGAKVL+ NKEAKGAS+I+ +DKDKYLRNPCS E KFVVIELSEETLV TI+IANFEH+SSNLK+FE+ G+LVYPTD W  LGNFTA N 
Subjt:  GAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNA

Query:  KHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSI----SDEATHDKRVIPSQ---PGPVDEVSHRRELQSLANE
        KH   F   DPKWVRYLKLN L+HYGSEFYCTLS +EVYG+DAVE MLEDLIS Q K  +     D    +K+ + ++       D+   + + Q  + E
Subjt:  KHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSI----SDEATHDKRVIPSQ---PGPVDEVSHRRELQSLANE

Query:  EGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTD
              ++ L K   P+PVEE  HQ   RMPGDTVLKIL QK+RSLD+SLSVLE YLE+ + KYG IFKE D +       +E  + ++  + + ++ T 
Subjt:  EGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTD

Query:  KDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVF
        K+  ++  W+  V  +L+  +     ++  +E+V +    +E KG+VVF
Subjt:  KDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVF

Q3E7Q9 Probable glycosyltransferase At5g253102.8e-12153.55Show/hide
Query:  SKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKG
        SK E+ ++   +E+ L +ARA+I EAS   N T    D   +P   IYRNP+A +RSY+EMEK  K+YVY+EGEPP+ H GPCKS+Y+ EGRFI EMEK 
Subjt:  SKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKG

Query:  NL-YTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEG
           + T DP+QA +YFLPFSV  LV+YLY  NS +   + T ++DYI ++S  HPFW+R+ GADHFML+CHDWGP T+     LFN SIRV+CNAN SEG
Subjt:  NL-YTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEG

Query:  FRPSKDASFPEIHLRTGEID---GLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVE
        F P+KD + PEI L  GE+D    L   LS S R  L FFAG +HG +R +LL+ WK++D D+ VY+ LP  ++Y   ++ S+FC CPSGYEVASPRV+E
Subjt:  FRPSKDASFPEIHLRTGEID---GLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVE

Query:  AIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHI
        AIY+EC+PV++S ++V PF+DVL W +F+V + V +IP +K+IL  IS  +Y  ++  ++ V+RHF LN  P+RFDAFH+ LHSIWLRRLN+ +
Subjt:  AIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHI

Q3EAR7 Probable glycosyltransferase At3g421804.3e-11451.01Show/hide
Query:  ERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEME-----
        +R S L+K EE+L +ARAAIR A +  N TS      Y+PTG IYRN  AFH+S+IEM K  K++ YKEGE P+ H GP   IY  EG+FI E+      
Subjt:  ERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEME-----

Query:  KGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVN----AIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNA
            +  + P++A  +FLPFSV N+V Y+Y P +   +     +     DY++V+++KHPFW++S GADHFM+SCHDW P      P  F N +R LCNA
Subjt:  KGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVN----AIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNA

Query:  NVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRV
        N SEGFR + D S PEI++   ++     G +P  R++LAFFAGR HG+IR +L   WK KD+DV VYD L  G +Y  ++  S+FCLCPSGYEVASPR 
Subjt:  NVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRV

Query:  VEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHI
        VEAIY+ CVPV+IS++Y  PF+DVL+W+ F+V+I V  IP+IKKIL  I   +YLRM R V +V+RHFV+N   + FD  HMILHS+WLRRLNI +
Subjt:  VEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHI

Q9FFN2 Probable glycosyltransferase At5g037951.4e-14158.42Show/hide
Query:  ANGKSATPGISKTER--YSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYS
        +N  S+T  +   +R   S L+K+E KL +ARA+I+ AS    +     DPDYVP GP+Y N   FHRSY+EMEK  KIYVYKEGEPP+FH GPCKSIYS
Subjt:  ANGKSATPGISKTER--YSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYS

Query:  TEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSI
         EG FI+E+E    + TN+PD+A +++LPFSVV +V+Y+Y  NS + + I   + DYIN++ +K+P+W+RS+GADHF+LSCHDWGP  +   P L +NSI
Subjt:  TEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSI

Query:  RVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYE
        R LCNAN SE F+P KD S PEI+LRTG + GL+GG SPS R +LAFFAG +HG +R +LLQ W+ KD D+ V+  LP G SY  M++ S+FC+CPSGYE
Subjt:  RVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYE

Query:  VASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNI
        VASPR+VEA+Y+ CVPVLI+  YVPPFSDVLNW SF+V + V+DIPN+K IL  IS  QYLRM RRV +V+RHF +N   KRFD FHMILHSIW+RRLN+
Subjt:  VASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNI

Query:  HIQD
         I++
Subjt:  HIQD

Q9SSE8 Probable glycosyltransferase At3g076203.0e-13156.54Show/hide
Query:  KEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLT--SIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSI
        ++ NG +   G  K +      K+E +L  AR  IREA   ++ T  S   D DYVP G IYRNP AFHRSY+ MEK+ KIYVY+EG+PP+FH G CK I
Subjt:  KEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLT--SIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSI

Query:  YSTEGRFIHEMEKGNL-YTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFN
        YS EG F++ ME   L Y T DPD+A +YFLPFSVV ++ +L+ P   +   +   I DY+ +IS K+P+W+ S G DHFMLSCHDWG R T +V  LF 
Subjt:  YSTEGRFIHEMEKGNL-YTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFN

Query:  NSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPS
        NSIRVLCNAN+SE F P KDA FPEI+L TG+I+ L GGL P  R+ LAFFAG+ HG IR +LL  WKEKD+D+LVY+ LP G+ Y  M++KSRFC+CPS
Subjt:  NSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPS

Query:  GYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRR
        G+EVASPRV EAIY+ CVPVLISE+YV PFSDVLNW  F+V + VK+IP +K+IL  I + +Y+R+   VK+V+RH ++N  PKR+D F+MI+HSIWLRR
Subjt:  GYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRR

Query:  LNIHI
        LN+ +
Subjt:  LNIHI

Arabidopsis top hitse value%identityAlignment
AT1G71360.1 Galactose-binding protein7.3e-11745.36Show/hide
Query:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS
        M++   ALL  RR  +   +GRN  YKVS+SLVF++WGLVFL +LW SH  G +  S++        ++ + +   +S       +   H    L++   
Subjt:  MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCS

Query:  INASTPGSDNE-VLSSEESSSHIQATTRLPEDGSS--------STRVKP----ESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPG
        I A+     +E +L   E  + I     + E   +        +    P    E+   K D  S  V LGL+EFKSRA  SR KS +GQ    IHR+EPG
Subjt:  INASTPGSDNE-VLSSEESSSHIQATTRLPEDGSS--------STRVKP----ESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPG

Query:  GAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNA
        G EYNYA+ASKGAKVL+ NKEAKGAS+I+ +DKDKYLRNPCS E KFVVIELSEETLV TI+IANFEH+SSNLK+FE+ G+LVYPTD W  LGNFTA N 
Subjt:  GAEYNYASASKGAKVLAFNKEAKGASNILGKDKDKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNA

Query:  KHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSI----SDEATHDKRVIPSQ---PGPVDEVSHRRELQSLANE
        KH   F   DPKWVRYLKLN L+HYGSEFYCTLS +EVYG+DAVE MLEDLIS Q K  +     D    +K+ + ++       D+   + + Q  + E
Subjt:  KHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDAVEMMLEDLISAQHKPSI----SDEATHDKRVIPSQ---PGPVDEVSHRRELQSLANE

Query:  EGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTD
              ++ L K   P+PVEE  HQ   RMPGDTVLKIL QK+RSLD+SLSVLE YLE+ + KYG IFKE D +       +E  + ++  + + ++ T 
Subjt:  EGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDLTSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTD

Query:  KDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVF
        K+  ++  W+  V  +L+  +     ++  +E+V +    +E KG+VVF
Subjt:  KDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVF

AT3G07620.1 Exostosin family protein2.1e-13256.54Show/hide
Query:  KEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLT--SIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSI
        ++ NG +   G  K +      K+E +L  AR  IREA   ++ T  S   D DYVP G IYRNP AFHRSY+ MEK+ KIYVY+EG+PP+FH G CK I
Subjt:  KEANGKSATPGISKTERYSKLKKLEEKLGRARAAIREASQIHNLT--SIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSI

Query:  YSTEGRFIHEMEKGNL-YTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFN
        YS EG F++ ME   L Y T DPD+A +YFLPFSVV ++ +L+ P   +   +   I DY+ +IS K+P+W+ S G DHFMLSCHDWG R T +V  LF 
Subjt:  YSTEGRFIHEMEKGNL-YTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFN

Query:  NSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPS
        NSIRVLCNAN+SE F P KDA FPEI+L TG+I+ L GGL P  R+ LAFFAG+ HG IR +LL  WKEKD+D+LVY+ LP G+ Y  M++KSRFC+CPS
Subjt:  NSIRVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPS

Query:  GYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRR
        G+EVASPRV EAIY+ CVPVLISE+YV PFSDVLNW  F+V + VK+IP +K+IL  I + +Y+R+   VK+V+RH ++N  PKR+D F+MI+HSIWLRR
Subjt:  GYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRR

Query:  LNIHI
        LN+ +
Subjt:  LNIHI

AT3G42180.1 Exostosin family protein3.1e-11551.01Show/hide
Query:  ERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEME-----
        +R S L+K EE+L +ARAAIR A +  N TS      Y+PTG IYRN  AFH+S+IEM K  K++ YKEGE P+ H GP   IY  EG+FI E+      
Subjt:  ERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEME-----

Query:  KGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVN----AIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNA
            +  + P++A  +FLPFSV N+V Y+Y P +   +     +     DY++V+++KHPFW++S GADHFM+SCHDW P      P  F N +R LCNA
Subjt:  KGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVN----AIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNA

Query:  NVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRV
        N SEGFR + D S PEI++   ++     G +P  R++LAFFAGR HG+IR +L   WK KD+DV VYD L  G +Y  ++  S+FCLCPSGYEVASPR 
Subjt:  NVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRV

Query:  VEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHI
        VEAIY+ CVPV+IS++Y  PF+DVL+W+ F+V+I V  IP+IKKIL  I   +YLRM R V +V+RHFV+N   + FD  HMILHS+WLRRLNI +
Subjt:  VEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHI

AT5G03795.1 Exostosin family protein1.0e-14258.42Show/hide
Query:  ANGKSATPGISKTER--YSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYS
        +N  S+T  +   +R   S L+K+E KL +ARA+I+ AS    +     DPDYVP GP+Y N   FHRSY+EMEK  KIYVYKEGEPP+FH GPCKSIYS
Subjt:  ANGKSATPGISKTER--YSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYS

Query:  TEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSI
         EG FI+E+E    + TN+PD+A +++LPFSVV +V+Y+Y  NS + + I   + DYIN++ +K+P+W+RS+GADHF+LSCHDWGP  +   P L +NSI
Subjt:  TEGRFIHEMEKGNLYTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSI

Query:  RVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYE
        R LCNAN SE F+P KD S PEI+LRTG + GL+GG SPS R +LAFFAG +HG +R +LLQ W+ KD D+ V+  LP G SY  M++ S+FC+CPSGYE
Subjt:  RVLCNANVSEGFRPSKDASFPEIHLRTGEIDGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYE

Query:  VASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNI
        VASPR+VEA+Y+ CVPVLI+  YVPPFSDVLNW SF+V + V+DIPN+K IL  IS  QYLRM RRV +V+RHF +N   KRFD FHMILHSIW+RRLN+
Subjt:  VASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNI

Query:  HIQD
         I++
Subjt:  HIQD

AT5G25310.1 Exostosin family protein2.0e-12253.55Show/hide
Query:  SKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKG
        SK E+ ++   +E+ L +ARA+I EAS   N T    D   +P   IYRNP+A +RSY+EMEK  K+YVY+EGEPP+ H GPCKS+Y+ EGRFI EMEK 
Subjt:  SKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKG

Query:  NL-YTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEG
           + T DP+QA +YFLPFSV  LV+YLY  NS +   + T ++DYI ++S  HPFW+R+ GADHFML+CHDWGP T+     LFN SIRV+CNAN SEG
Subjt:  NL-YTTNDPDQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEG

Query:  FRPSKDASFPEIHLRTGEID---GLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVE
        F P+KD + PEI L  GE+D    L   LS S R  L FFAG +HG +R +LL+ WK++D D+ VY+ LP  ++Y   ++ S+FC CPSGYEVASPRV+E
Subjt:  FRPSKDASFPEIHLRTGEID---GLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVE

Query:  AIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHI
        AIY+EC+PV++S ++V PF+DVL W +F+V + V +IP +K+IL  IS  +Y  ++  ++ V+RHF LN  P+RFDAFH+ LHSIWLRRLN+ +
Subjt:  AIYAECVPVLISESYVPPFSDVLNWNSFAVQIQVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGAAGCCTGTTGGAGCTCTTCTGCATGATAGAAGAGCTGTTCAAGTGCCTATTAGTGGAAGAAATCATTTGTATAAAGTTTCTATTTCTTTGGTTTTTATTCTTTG
GGGACTTGTCTTCCTCTTCAGCTTATGGTTTAGCCATGGAGTTGGCTGCCAAGAAGAATCAATTTTACTTCCTGATGGTGTATCTACTACAAATGAATCTAAATTGGAAA
ATAACAAGGACTCTGACGTTTTACGTGAGCCTCCAAATGGAGAATCTCATTGTACCATTCATTTAAACAATTCATGTTCAATCAATGCTTCGACCCCTGGTTCTGATAAT
GAAGTACTTTCAAGCGAAGAAAGTAGCAGTCATATTCAAGCTACTACAAGGTTGCCTGAGGATGGGAGCTCTAGCACTAGAGTAAAACCTGAAAGCAAACCTCCCAAGGG
AGATATCTCGTCAGACACTGTTCTACTCGGCCTTGAAGAATTCAAAAGCAGAGCCTTTGTGTCCCGGGGTAAGTCTGAAACTGGCCAGGCTGGGAATACTATCCATAGAT
TAGAACCTGGAGGTGCAGAGTACAATTACGCCTCAGCTTCAAAGGGAGCTAAGGTTTTGGCTTTCAACAAGGAAGCAAAGGGAGCTTCAAACATTTTAGGCAAAGACAAA
GATAAGTACCTCAGAAATCCTTGTTCTGCTGAAGAGAAATTTGTCGTCATAGAACTTTCAGAAGAAACCTTAGTAGTAACAATTGAAATTGCTAATTTCGAGCACCATTC
TTCTAACTTAAAAGAATTTGAGGTACATGGGAGCTTGGTTTATCCAACAGATGTTTGGTTCAAGCTCGGTAATTTCACTGCTCCAAATGCAAAACATGCACATAGATTTG
TTCTCAAGGACCCAAAATGGGTGAGATATCTAAAGTTGAATTTTCTCACCCATTATGGTTCAGAATTCTATTGCACACTCAGCACTGTGGAAGTTTATGGAATGGATGCT
GTTGAGATGATGCTGGAGGATTTAATATCTGCTCAACATAAACCTTCTATATCAGATGAAGCTACGCATGATAAGAGAGTAATTCCTTCCCAGCCTGGCCCCGTTGATGA
AGTGTCACATCGTAGAGAGTTGCAATCTCTTGCTAACGAGGAAGGTGATGATGGTGTTGATATAGAACTTTCAAAGAGTAATACACCTGAACCGGTTGAAGAATCACACC
ATCAACAACCTGGCAGAATGCCTGGTGACACTGTTCTCAAAATTTTGACACAGAAAGTTCGCTCACTAGACCTAAGTTTATCTGTTTTGGAGCGGTATCTGGAGGACTTA
ACGTCCAAATATGGCAATATATTCAAAGAATTTGACAAAGATATCGGAAATAATAATCTACTCATTGAGAAGACCCAAGCGGATATAAGAAATATTCTTAAAATCCAGGA
CACCACAGATAAAGATCTTCGTGATCTCATTTCTTGGAAGTCCATGGTTTCCTTGCAATTGGATGGTCTGCAAAGGCATAATTCTATACTCAGATCTGAGATCGAAAGGG
TACAGAAGAATCAGATTTCTCTTGAAAACAAAGGAATAGTAGTTTTTCTTGAAGAATCACAAAAAACAGTAGCTAAAGACAAAGAAGCCAATGGAAAAAGTGCAACTCCA
GGGATCAGCAAAACTGAGAGATATAGTAAGTTGAAGAAACTAGAGGAGAAATTGGGAAGAGCAAGAGCAGCCATTAGAGAAGCTTCTCAAATTCATAATCTTACATCTAT
ACATCATGATCCTGACTATGTTCCTACTGGCCCAATATATAGGAACCCAAATGCTTTCCACAGGAGCTATATAGAAATGGAAAAGCTTTTGAAGATATATGTATACAAAG
AAGGAGAGCCTCCAATGTTCCATGGAGGTCCATGTAAAAGCATATATTCCACAGAAGGAAGGTTTATTCATGAAATGGAAAAGGGAAATTTGTATACAACCAATGACCCA
GATCAGGCTCTTCTTTATTTCCTCCCATTCAGTGTTGTCAATTTGGTTCAATATCTCTATGTACCAAACTCTCATGAAGTTAATGCCATTGGAACTGCAATCACTGACTA
CATCAACGTCATCTCTAACAAACATCCTTTCTGGGATCGCAGCCTTGGTGCTGATCATTTTATGCTTTCTTGTCATGATTGGGGGCCACGTACGACTTCATTCGTTCCGC
TTTTGTTCAATAACTCCATCAGAGTATTGTGTAATGCGAATGTTTCTGAAGGTTTCCGTCCTTCTAAAGACGCATCATTTCCTGAAATCCATCTTAGAACAGGAGAAATC
GATGGGCTTCTTGGAGGTCTCTCGCCTTCTCGTCGATCCGTTCTTGCATTCTTTGCAGGTAGGCTTCATGGTCATATAAGGTACCTACTCCTGCAGGAATGGAAGGAAAA
AGATGAGGACGTGCTTGTTTACGACGAGCTTCCAAGTGGAATATCATACGATTCGATGTTGAAGAAGAGTAGGTTTTGTTTATGCCCTAGTGGGTATGAGGTAGCCAGTC
CAAGGGTAGTGGAAGCCATTTATGCTGAATGTGTTCCTGTGTTGATATCAGAAAGCTACGTTCCTCCTTTCAGTGATGTTTTGAATTGGAATTCATTTGCTGTGCAAATA
CAAGTGAAGGATATACCAAACATAAAAAAGATACTAAATGGGATATCTCAAACTCAATACTTGAGAATGCAAAGGAGAGTGAAGCAAGTACAAAGACATTTTGTACTCAA
TGGAACTCCCAAGAGATTTGATGCTTTTCATATGATACTTCATTCTATCTGGCTCAGAAGGTTGAATATACACATTCAAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGGAAGCCTGTTGGAGCTCTTCTGCATGATAGAAGAGCTGTTCAAGTGCCTATTAGTGGAAGAAATCATTTGTATAAAGTTTCTATTTCTTTGGTTTTTATTCTTTG
GGGACTTGTCTTCCTCTTCAGCTTATGGTTTAGCCATGGAGTTGGCTGCCAAGAAGAATCAATTTTACTTCCTGATGGTGTATCTACTACAAATGAATCTAAATTGGAAA
ATAACAAGGACTCTGACGTTTTACGTGAGCCTCCAAATGGAGAATCTCATTGTACCATTCATTTAAACAATTCATGTTCAATCAATGCTTCGACCCCTGGTTCTGATAAT
GAAGTACTTTCAAGCGAAGAAAGTAGCAGTCATATTCAAGCTACTACAAGGTTGCCTGAGGATGGGAGCTCTAGCACTAGAGTAAAACCTGAAAGCAAACCTCCCAAGGG
AGATATCTCGTCAGACACTGTTCTACTCGGCCTTGAAGAATTCAAAAGCAGAGCCTTTGTGTCCCGGGGTAAGTCTGAAACTGGCCAGGCTGGGAATACTATCCATAGAT
TAGAACCTGGAGGTGCAGAGTACAATTACGCCTCAGCTTCAAAGGGAGCTAAGGTTTTGGCTTTCAACAAGGAAGCAAAGGGAGCTTCAAACATTTTAGGCAAAGACAAA
GATAAGTACCTCAGAAATCCTTGTTCTGCTGAAGAGAAATTTGTCGTCATAGAACTTTCAGAAGAAACCTTAGTAGTAACAATTGAAATTGCTAATTTCGAGCACCATTC
TTCTAACTTAAAAGAATTTGAGGTACATGGGAGCTTGGTTTATCCAACAGATGTTTGGTTCAAGCTCGGTAATTTCACTGCTCCAAATGCAAAACATGCACATAGATTTG
TTCTCAAGGACCCAAAATGGGTGAGATATCTAAAGTTGAATTTTCTCACCCATTATGGTTCAGAATTCTATTGCACACTCAGCACTGTGGAAGTTTATGGAATGGATGCT
GTTGAGATGATGCTGGAGGATTTAATATCTGCTCAACATAAACCTTCTATATCAGATGAAGCTACGCATGATAAGAGAGTAATTCCTTCCCAGCCTGGCCCCGTTGATGA
AGTGTCACATCGTAGAGAGTTGCAATCTCTTGCTAACGAGGAAGGTGATGATGGTGTTGATATAGAACTTTCAAAGAGTAATACACCTGAACCGGTTGAAGAATCACACC
ATCAACAACCTGGCAGAATGCCTGGTGACACTGTTCTCAAAATTTTGACACAGAAAGTTCGCTCACTAGACCTAAGTTTATCTGTTTTGGAGCGGTATCTGGAGGACTTA
ACGTCCAAATATGGCAATATATTCAAAGAATTTGACAAAGATATCGGAAATAATAATCTACTCATTGAGAAGACCCAAGCGGATATAAGAAATATTCTTAAAATCCAGGA
CACCACAGATAAAGATCTTCGTGATCTCATTTCTTGGAAGTCCATGGTTTCCTTGCAATTGGATGGTCTGCAAAGGCATAATTCTATACTCAGATCTGAGATCGAAAGGG
TACAGAAGAATCAGATTTCTCTTGAAAACAAAGGAATAGTAGTTTTTCTTGAAGAATCACAAAAAACAGTAGCTAAAGACAAAGAAGCCAATGGAAAAAGTGCAACTCCA
GGGATCAGCAAAACTGAGAGATATAGTAAGTTGAAGAAACTAGAGGAGAAATTGGGAAGAGCAAGAGCAGCCATTAGAGAAGCTTCTCAAATTCATAATCTTACATCTAT
ACATCATGATCCTGACTATGTTCCTACTGGCCCAATATATAGGAACCCAAATGCTTTCCACAGGAGCTATATAGAAATGGAAAAGCTTTTGAAGATATATGTATACAAAG
AAGGAGAGCCTCCAATGTTCCATGGAGGTCCATGTAAAAGCATATATTCCACAGAAGGAAGGTTTATTCATGAAATGGAAAAGGGAAATTTGTATACAACCAATGACCCA
GATCAGGCTCTTCTTTATTTCCTCCCATTCAGTGTTGTCAATTTGGTTCAATATCTCTATGTACCAAACTCTCATGAAGTTAATGCCATTGGAACTGCAATCACTGACTA
CATCAACGTCATCTCTAACAAACATCCTTTCTGGGATCGCAGCCTTGGTGCTGATCATTTTATGCTTTCTTGTCATGATTGGGGGCCACGTACGACTTCATTCGTTCCGC
TTTTGTTCAATAACTCCATCAGAGTATTGTGTAATGCGAATGTTTCTGAAGGTTTCCGTCCTTCTAAAGACGCATCATTTCCTGAAATCCATCTTAGAACAGGAGAAATC
GATGGGCTTCTTGGAGGTCTCTCGCCTTCTCGTCGATCCGTTCTTGCATTCTTTGCAGGTAGGCTTCATGGTCATATAAGGTACCTACTCCTGCAGGAATGGAAGGAAAA
AGATGAGGACGTGCTTGTTTACGACGAGCTTCCAAGTGGAATATCATACGATTCGATGTTGAAGAAGAGTAGGTTTTGTTTATGCCCTAGTGGGTATGAGGTAGCCAGTC
CAAGGGTAGTGGAAGCCATTTATGCTGAATGTGTTCCTGTGTTGATATCAGAAAGCTACGTTCCTCCTTTCAGTGATGTTTTGAATTGGAATTCATTTGCTGTGCAAATA
CAAGTGAAGGATATACCAAACATAAAAAAGATACTAAATGGGATATCTCAAACTCAATACTTGAGAATGCAAAGGAGAGTGAAGCAAGTACAAAGACATTTTGTACTCAA
TGGAACTCCCAAGAGATTTGATGCTTTTCATATGATACTTCATTCTATCTGGCTCAGAAGGTTGAATATACACATTCAAGATTAA
Protein sequenceShow/hide protein sequence
MRKPVGALLHDRRAVQVPISGRNHLYKVSISLVFILWGLVFLFSLWFSHGVGCQEESILLPDGVSTTNESKLENNKDSDVLREPPNGESHCTIHLNNSCSINASTPGSDN
EVLSSEESSSHIQATTRLPEDGSSSTRVKPESKPPKGDISSDTVLLGLEEFKSRAFVSRGKSETGQAGNTIHRLEPGGAEYNYASASKGAKVLAFNKEAKGASNILGKDK
DKYLRNPCSAEEKFVVIELSEETLVVTIEIANFEHHSSNLKEFEVHGSLVYPTDVWFKLGNFTAPNAKHAHRFVLKDPKWVRYLKLNFLTHYGSEFYCTLSTVEVYGMDA
VEMMLEDLISAQHKPSISDEATHDKRVIPSQPGPVDEVSHRRELQSLANEEGDDGVDIELSKSNTPEPVEESHHQQPGRMPGDTVLKILTQKVRSLDLSLSVLERYLEDL
TSKYGNIFKEFDKDIGNNNLLIEKTQADIRNILKIQDTTDKDLRDLISWKSMVSLQLDGLQRHNSILRSEIERVQKNQISLENKGIVVFLEESQKTVAKDKEANGKSATP
GISKTERYSKLKKLEEKLGRARAAIREASQIHNLTSIHHDPDYVPTGPIYRNPNAFHRSYIEMEKLLKIYVYKEGEPPMFHGGPCKSIYSTEGRFIHEMEKGNLYTTNDP
DQALLYFLPFSVVNLVQYLYVPNSHEVNAIGTAITDYINVISNKHPFWDRSLGADHFMLSCHDWGPRTTSFVPLLFNNSIRVLCNANVSEGFRPSKDASFPEIHLRTGEI
DGLLGGLSPSRRSVLAFFAGRLHGHIRYLLLQEWKEKDEDVLVYDELPSGISYDSMLKKSRFCLCPSGYEVASPRVVEAIYAECVPVLISESYVPPFSDVLNWNSFAVQI
QVKDIPNIKKILNGISQTQYLRMQRRVKQVQRHFVLNGTPKRFDAFHMILHSIWLRRLNIHIQD