CuGenDBv2

CmoCh16G006450 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh16G006450
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Mucin-2
Genome location: Cmo_Chr16:3181069..3181945
RNA-Seq Expression: CmoCh16G006450
Synteny: CmoCh16G006450
Gene Ontology terms: NA
InterPro domains: IPR040420 - Uncharacterized protein At1g76660-like
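
The genome location above can be read as a sequence name followed by start and end coordinates. A minimal Python sketch of that reading, assuming the `seqid:start..end` layout shown and assuming the coordinates are 1-based and inclusive (the page does not say so); `parse_location` is a hypothetical helper name, not part of any database API:

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end, span length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return seqid, start, end, end - start + 1

print(parse_location("Cmo_Chr16:3181069..3181945"))
```

Under the inclusive-coordinate assumption, the span length is simply `end - start + 1`; with half-open coordinates it would be one less.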


Homology
GenBank top hits | e value | % identity
KAG6577192.1 hypothetical protein SDJN03_24766, partial [Cucurbita argyrosperma subsp. sororia] | 7.1e-133 | 84.41
Query:  MCSPDGPSSIFAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSS
        MCSPDGPSSIFAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSS
Subjt:  MCSPDGPSSIFAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSS

Query:  SRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST------------------------------------ERRKPVAA
        SRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCS YSWQQRRSTDSYSQDSIGFKS+                                    ERRKPVAA
Subjt:  SRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST------------------------------------ERRKPVAA

Query:  NHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGCDTL
        NHRFSFELSDADALLRSVGSK LESNEL      LHEPFETAKENSPAVCHTS GTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGCDTL
Subjt:  NHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGCDTL

KAG7015191.1 hypothetical protein SDJN02_22824, partial [Cucurbita argyrosperma subsp. argyrosperma] | 9.0e-128 | 86.38
Query:  MCSPDGPSSIFAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSS
        MCSPDGPSSIFAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSS
Subjt:  MCSPDGPSSIFAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSS

Query:  SRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST------------------------------------ERRKPVAA
        SRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCS YSWQQRRSTDSYSQDSIGFKS+                                    ERRKPVAA
Subjt:  SRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST------------------------------------ERRKPVAA

Query:  NHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLG
        NHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLG
Subjt:  NHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLG

KGN53337.1 hypothetical protein Csa_015172 [Cucumis sativus] | 5.7e-90 | 63.46
Query:  MCSPDGPSSIFAIGPFAHETQLVS--------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPR
        M SPDGPSSIFAIGPFAHE QLVS        T E + PFT PES HL  PSSPEVPFAQ + P+L K ESDNQ +FPND FQSYQFYP SP+SHLISPR
Subjt:  MCSPDGPSSIFAIGPFAHETQLVS--------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPR

Query:  PVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------E
         VISRSG+SS LPD DFAS GSQF NFPLEVPPTLL+LDK S ++W+QR+STDS +QDSI FKS+                                  +
Subjt:  PVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------E

Query:  RRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGCD
          +P A NHRFSFELSD D LL+SVGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQ QEHHS+TLG VKEFNFD+GNG D
Subjt:  RRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGCD

Query:  T
        T
Subjt:  T

XP_038884072.1 uncharacterized protein LOC120075005 isoform X1 [Benincasa hispida] | 8.8e-91 | 65.02
Query:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP
        M SPDGPSSIFAIGPFAHETQLVS         T  STAPFT PES HL  PSSPEVPFAQ L P+LQK+ESD+Q  FPND FQSYQFYP SP+SHLISP
Subjt:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP

Query:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------
        R VISRSG+SS LPD DFAS GSQF NFPLEVPPTLL+LDK S ++W+QR+STDS +QDSI  KS+                                  
Subjt:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------

Query:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPA-VCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNG
        E   P A NHRFSFELSD DALL+SVGSKPL+SNE+ VASS +HEPFETAKENSP    HTSN TE   K   E AHQHQEHHS+TLG VKEFNFD+GNG
Subjt:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPA-VCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNG

Query:  CDT
         DT
Subjt:  CDT

XP_038884079.1 uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida] | 8.8e-91 | 65.02
Query:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP
        M SPDGPSSIFAIGPFAHETQLVS         T  STAPFT PES HL  PSSPEVPFAQ L P+LQK+ESD+Q  FPND FQSYQFYP SP+SHLISP
Subjt:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP

Query:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------
        R VISRSG+SS LPD DFAS GSQF NFPLEVPPTLL+LDK S ++W+QR+STDS +QDSI  KS+                                  
Subjt:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------

Query:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPA-VCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNG
        E   P A NHRFSFELSD DALL+SVGSKPL+SNE+ VASS +HEPFETAKENSP    HTSN TE   K   E AHQHQEHHS+TLG VKEFNFD+GNG
Subjt:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPA-VCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNG

Query:  CDT
         DT
Subjt:  CDT

TrEMBL top hits | e value | % identity
A0A0A0KY57 Uncharacterized protein | 2.8e-90 | 63.46
Query:  MCSPDGPSSIFAIGPFAHETQLVS--------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPR
        M SPDGPSSIFAIGPFAHE QLVS        T E + PFT PES HL  PSSPEVPFAQ + P+L K ESDNQ +FPND FQSYQFYP SP+SHLISPR
Subjt:  MCSPDGPSSIFAIGPFAHETQLVS--------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPR

Query:  PVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------E
         VISRSG+SS LPD DFAS GSQF NFPLEVPPTLL+LDK S ++W+QR+STDS +QDSI FKS+                                  +
Subjt:  PVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------E

Query:  RRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGCD
          +P A NHRFSFELSD D LL+SVGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQ QEHHS+TLG VKEFNFD+GNG D
Subjt:  RRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGCD

Query:  T
        T
Subjt:  T

A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X1 | 4.0e-89 | 63.25
Query:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP
        M SPDGPSSIFAIGPFAHE QLVS         T  ST PFT PES HL  PSSPEVPFAQ + PSLQK ESDNQ +FPND FQSYQFYP SP+SHLISP
Subjt:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP

Query:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------
        R VISRSG+SS LPD DFAS GSQF NFPLEVPPTL +LDK S ++W+QR+STDS +QDSI FKS+                                  
Subjt:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------

Query:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGC
           +P A NHRFSFELSD D L +SVGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG VKEFNFD+ NG 
Subjt:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGC

Query:  DT
        DT
Subjt:  DT

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X2 | 4.0e-89 | 63.25
Query:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP
        M SPDGPSSIFAIGPFAHE QLVS         T  ST PFT PES HL  PSSPEVPFAQ + PSLQK ESDNQ +FPND FQSYQFYP SP+SHLISP
Subjt:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP

Query:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------
        R VISRSG+SS LPD DFAS GSQF NFPLEVPPTL +LDK S ++W+QR+STDS +QDSI FKS+                                  
Subjt:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------

Query:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGC
           +P A NHRFSFELSD D L +SVGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG VKEFNFD+ NG 
Subjt:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGC

Query:  DT
        DT
Subjt:  DT

A0A5A7TUB1 Mucin-2 | 1.3e-87 | 62.25
Query:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP
        M SPDGPSSIFAIGPFAHE QLVS         T  ST PFT PES HL  PSSPEVPFAQ + PS QK ESDNQ +FPND FQSYQFYP SP+SHLISP
Subjt:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP

Query:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------
        R VISRSG+SS LPD DFAS GSQF NFPL+VPPTL ++DK S ++W+QR+STDS +QDSI FKS+                                  
Subjt:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------

Query:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGC
           +P A NHRFSFELSD D L +SVGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG VKEFNFD+ NG 
Subjt:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGC

Query:  DT
        DT
Subjt:  DT

A0A5D3CYQ2 Mucin-2 | 4.0e-89 | 63.25
Query:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP
        M SPDGPSSIFAIGPFAHE QLVS         T  ST PFT PES HL  PSSPEVPFAQ + PSLQK ESDNQ +FPND FQSYQFYP SP+SHLISP
Subjt:  MCSPDGPSSIFAIGPFAHETQLVS---------TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISP

Query:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------
        R VISRSG+SS LPD DFAS GSQF NFPLEVPPTL +LDK S ++W+QR+STDS +QDSI FKS+                                  
Subjt:  RPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKST----------------------------------

Query:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGC
           +P A NHRFSFELSD D L +SVGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG VKEFNFD+ NG 
Subjt:  ERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGC

Query:  DT
        DT
Subjt:  DT

SwissProt top hits | e value | % identity
Q9SRE5 Uncharacterized protein At1g76660 | 8.5e-12 | 43.75
Query:  SPDGP-SSIFAIGPFAHETQLVS--------TFESTAPFT--PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPR
        SP GP SS++A GP+AHETQLVS        T  STAPFT  PE   L  PSSP+VP+A+ L  S+    S       ND   +Y  YP SP S L SP 
Subjt:  SPDGP-SSIFAIGPFAHETQLVS--------TFESTAPFT--PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPR

Query:  PVISRSGSSSRLP----DCDFASSGSQF
          ISR+     L      C  + SG+ F
Subjt:  PVISRSGSSSRLP----DCDFASSGSQF

Arabidopsis top hits | e value | % identity
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) | 4.6e-13 | 32.11
Query:  SIFAIGPFAHETQLVS--------TFESTAPFTP----ESTHL--IRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDY-FQSYQFYPCSPISHLISPRPV
        SIFAIGP+AHETQLVS        T  S+AP TP     S +L    PSSPEVPFAQ+   + Q      +    + Y FQ YQ  P SP+  LISP P 
Subjt:  SIFAIGPFAHETQLVS--------TFESTAPFTP----ESTHL--IRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDY-FQSYQFYPCSPISHLISPRPV

Query:  ISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKSTERRKPVAANHR-FSFELSDADALLRSVGSKPLESNEL
           SG +S  PD       S F +F +  PP LL                   S  + G  +  + + +   H+  SF+L DAD ++R V  K       
Subjt:  ISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKSTERRKPVAANHR-FSFELSDADALLRSVGSKPLESNEL

Query:  EVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHH
              L   F  A  +  ++ H+S G+ +      +  H   + H
Subjt:  EVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHH

AT1G76660.1 FUNCTIONS IN: molecular_function unknown | 6.0e-13 | 43.75
Query:  SPDGP-SSIFAIGPFAHETQLVS--------TFESTAPFT--PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPR
        SP GP SS++A GP+AHETQLVS        T  STAPFT  PE   L  PSSP+VP+A+ L  S+    S       ND   +Y  YP SP S L SP 
Subjt:  SPDGP-SSIFAIGPFAHETQLVS--------TFESTAPFT--PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPR

Query:  PVISRSGSSSRLP----DCDFASSGSQF
          ISR+     L      C  + SG+ F
Subjt:  PVISRSGSSSRLP----DCDFASSGSQF

AT4G25620.1 hydroxyproline-rich glycoprotein family protein | 7.3e-11 | 30.46
Query:  PSSIFAIGPFAHETQLV--------STFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDN------QCSFPNDYFQSYQFYPCSPISHLISPRP
        P S F IGP+AHETQ V        +T  STAPFTP       PSSPEVPFAQ+L  SL++A  ++      + S  +  F+S Q YP SP  +LISP  
Subjt:  PSSIFAIGPFAHETQLV--------STFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDN------QCSFPNDYFQSYQFYPCSPISHLISPRP

Query:  VISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKSTERRKPVAANHRFSFEL--SDADALLR-SVGS-KPLE
            SG+SS  P             F +  PP  L  +  +   W  R  + S +    G +          +   S  +  + A+ ++R S G+  PLE
Subjt:  VISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKSTERRKPVAANHRFSFEL--SDADALLR-SVGS-KPLE

Query:  SNEL-----EVAS-------SQLHE----------PFE-TAKENSPAVCHTSNGTEEYAKTNGEH-------------AHQHQEHHSLTLGFVKEFNFDH
         + L     EVAS       S  H            FE T ++ +  +    N +  + K +GEH             + Q Q+  S + G  KEF FD 
Subjt:  SNEL-----EVAS-------SQLHE----------PFE-TAKENSPAVCHTSNGTEEYAKTNGEH-------------AHQHQEHHSLTLGFVKEFNFDH

Query:  GN
         N
Subjt:  GN

AT5G52430.1 hydroxyproline-rich glycoprotein family protein | 9.9e-16 | 39.75
Query:  SPDGPSSIFAIGPFAHETQLVS--------TFESTAPFTP---ESTHLIRPSSPEVPFAQVLLPSLQKAESD-----NQCSFPNDY-FQSYQFYPCSP-I
        SP  P S+F +GP+A+ETQ V+        T  STAP+TP    S H+  PSSPEVPFAQ+L  SL+    D     NQ    + Y F+S Q  P SP  
Subjt:  SPDGPSSIFAIGPFAHETQLVS--------TFESTAPFTP---ESTHLIRPSSPEVPFAQVLLPSLQKAESD-----NQCSFPNDY-FQSYQFYPCSP-I

Query:  SHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDS
         +LISP  VIS SG+SS  P        S    F +  PP  L  +  +   W  R  + S
Subjt:  SHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDS


Sequences
CDS sequence
ATGTGTTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACACAGCTTGTCTCCACCTTTGAATCAACTGCTCCCTTCACTCCTGAGTCTAC
CCACTTGATTAGGCCTTCTTCCCCTGAAGTTCCTTTTGCTCAGGTTCTTCTACCTAGCCTACAGAAAGCTGAGTCTGATAATCAATGTTCATTTCCTAATGATTACTTCC
AATCTTACCAATTCTATCCTTGCAGCCCGATTAGTCACCTCATATCGCCACGGCCAGTCATTTCTCGTTCTGGGTCGTCATCGCGTTTGCCTGATTGTGATTTTGCTTCC
TCTGGCTCGCAGTTTTCGAATTTCCCATTAGAAGTTCCACCTACATTATTGGACCTTGACAAATGTTCCACTTATAGCTGGCAACAACGGCGAAGCACTGATTCTTACTC
TCAAGATTCTATAGGATTCAAATCAACCGAAAGGAGGAAGCCTGTTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGCAGATGCTTTATTAAGAAGCGTAGGAA
GTAAGCCGCTGGAATCAAATGAACTGGAAGTTGCATCATCTCAATTACATGAACCATTTGAAACGGCTAAAGAAAATTCTCCTGCTGTCTGTCATACCTCAAATGGTACA
GAAGAATATGCAAAAACAAACGGTGAACATGCACATCAGCATCAAGAACACCACTCCCTTACCCTTGGGTTTGTGAAGGAATTCAATTTTGATCATGGCAATGGATGTGA
TACTCTTTAA
mRNA sequence
ATGTGTTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACACAGCTTGTCTCCACCTTTGAATCAACTGCTCCCTTCACTCCTGAGTCTAC
CCACTTGATTAGGCCTTCTTCCCCTGAAGTTCCTTTTGCTCAGGTTCTTCTACCTAGCCTACAGAAAGCTGAGTCTGATAATCAATGTTCATTTCCTAATGATTACTTCC
AATCTTACCAATTCTATCCTTGCAGCCCGATTAGTCACCTCATATCGCCACGGCCAGTCATTTCTCGTTCTGGGTCGTCATCGCGTTTGCCTGATTGTGATTTTGCTTCC
TCTGGCTCGCAGTTTTCGAATTTCCCATTAGAAGTTCCACCTACATTATTGGACCTTGACAAATGTTCCACTTATAGCTGGCAACAACGGCGAAGCACTGATTCTTACTC
TCAAGATTCTATAGGATTCAAATCAACCGAAAGGAGGAAGCCTGTTGCTGCTAATCATAGATTCTCATTTGAGTTATCTGATGCAGATGCTTTATTAAGAAGCGTAGGAA
GTAAGCCGCTGGAATCAAATGAACTGGAAGTTGCATCATCTCAATTACATGAACCATTTGAAACGGCTAAAGAAAATTCTCCTGCTGTCTGTCATACCTCAAATGGTACA
GAAGAATATGCAAAAACAAACGGTGAACATGCACATCAGCATCAAGAACACCACTCCCTTACCCTTGGGTTTGTGAAGGAATTCAATTTTGATCATGGCAATGGATGTGA
TACTCTTTAA
Protein sequence
MCSPDGPSSIFAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFAS
SGSQFSNFPLEVPPTLLDLDKCSTYSWQQRRSTDSYSQDSIGFKSTERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGT
EEYAKTNGEHAHQHQEHHSLTLGFVKEFNFDHGNGCDTL
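
The CDS and protein sequences listed above can be cross-checked by translating the CDS. A minimal Python sketch, assuming the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, and the wrapped sequence lines would need to be joined into one string before use:

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing non-ACGT characters are reported as 'X'.
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First 18 and last 15 nucleotides of the CDS listed above.
print(translate("ATGTGTTCTCCTGATGGG"))
print(translate("TGTGATACTCTTTAA"))
```

The two fragments correspond to the start (`MCSPDG`) and end (`CDTL`) of the listed protein sequence; passing the full joined CDS should reproduce the whole protein if the record is internally consistent.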