; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg22200 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg22200
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionMucin-2
Genome locationCarg_Chr16:3391183..3392474
RNA-Seq ExpressionCarg22200
SyntenyCarg22200
Gene Ontology termsNA
InterPro domainsIPR040420 - Uncharacterized protein At1g76660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577192.1 hypothetical protein SDJN03_24766, partial [Cucurbita argyrosperma subsp. sororia]1.3e-18487.89Show/hide
Query:  LCLLVLRKGDGVAVGVFIEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCSPDGPSSIFAIGPFAHE
        L LL+ RKGDGVAVGVFI                   AEAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCSPDGPSSIFAIGPFAHE
Subjt:  LCLLVLRKGDGVAVGVFIEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCSPDGPSSIFAIGPFAHE

Query:  TQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFP
        TQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFP
Subjt:  TQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFP

Query:  LEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRSVG
        LEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQN RILIDGSLMEAERRKPVAANHRFSFELSDADALLRSVG
Subjt:  LEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRSVG

Query:  SKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLG-----NSILIMAMDVILFKPYVNSDWWT
        SK LESNEL      LHEPFETAKENSPAVCHTS GTEEYAKTNGEHAHQHQEHHSLTLG     N       D +  KPYVNSDWWT
Subjt:  SKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLG-----NSILIMAMDVILFKPYVNSDWWT

KAG7015191.1 hypothetical protein SDJN02_22824, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-240100Show/hide
Query:  MSLSLSSFSLCLLVLRKGDGVAVGVFIEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCSPDGPSSI
        MSLSLSSFSLCLLVLRKGDGVAVGVFIEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCSPDGPSSI
Subjt:  MSLSLSSFSLCLLVLRKGDGVAVGVFIEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCSPDGPSSI

Query:  FAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFAS
        FAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFAS
Subjt:  FAIGPFAHETQLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFAS

Query:  SGSQFSNFPLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSD
        SGSQFSNFPLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSD
Subjt:  SGSQFSNFPLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSD

Query:  ADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMDVILFKPYVNSDWWTDAKDIETE
        ADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMDVILFKPYVNSDWWTDAKDIETE
Subjt:  ADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMDVILFKPYVNSDWWTDAKDIETE

Query:  GTTTRGMMGIMCFSSKLSKL
        GTTTRGMMGIMCFSSKLSKL
Subjt:  GTTTRGMMGIMCFSSKLSKL

XP_008452033.1 PREDICTED: uncharacterized protein LOC103493162 isoform X2 [Cucumis melo]6.6e-12064.84Show/hide
Query:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----
        ++++      VLVPEPS PS+E H E++L SPDI LP AAP   P+ L  +       S T   SFT   +NM SPDGPSSIFAIGPFAHE QLVS    
Subjt:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----

Query:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF
             T  ST PFT PES HL  PSSPEVPFAQ + PSLQK ESDNQ +FPND FQSYQFYP SP+SHLISPR VISRSG+SS LPD DFAS GSQF NF
Subjt:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF

Query:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS
        PLEVPPTL +LDK SI++W+QR+STDS +QDSI FKSSNDFVLNP  SESM DHHATNESQNI+ILI DGS  E E   P A NHRFSFELSD D L +S
Subjt:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS

Query:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT
        VGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG S+     D         P +NSDWWT+AKD  TEGTT
Subjt:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT

Query:  T
        T
Subjt:  T

XP_038884072.1 uncharacterized protein LOC120075005 isoform X1 [Benincasa hispida]2.7e-12165.25Show/hide
Query:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLF------FNLSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----
        ++++      VLVPE S PS+E+H E+SL SPDI LP AAP   P+            S T   SFT   +NM SPDGPSSIFAIGPFAHETQLVS    
Subjt:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLF------FNLSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----

Query:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF
             T  STAPFT PES HL  PSSPEVPFAQ L P+LQK+ESD+Q  FPND FQSYQFYP SP+SHLISPR VISRSG+SS LPD DFAS GSQF NF
Subjt:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF

Query:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRSV
        PLEVPPTLL+LDK SI++W+QR+STDS +QDSI  KSSNDFVLNPQ SESMSDHHATNESQNI+ILIDG+  + E   P A NHRFSFELSD DALL+SV
Subjt:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRSV

Query:  GSKPLESNELEVASSQLHEPFETAKENSPA-VCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT
        GSKPL+SNE+ VASS +HEPFETAKENSP    HTSN TE   K   E AHQHQEHHS+TLG S+     D        K  +NS+WWT+AKD++TEGTT
Subjt:  GSKPLESNELEVASSQLHEPFETAKENSPA-VCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT

XP_038884079.1 uncharacterized protein LOC120075005 isoform X2 [Benincasa hispida]2.7e-12165.25Show/hide
Query:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLF------FNLSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----
        ++++      VLVPE S PS+E+H E+SL SPDI LP AAP   P+            S T   SFT   +NM SPDGPSSIFAIGPFAHETQLVS    
Subjt:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLF------FNLSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----

Query:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF
             T  STAPFT PES HL  PSSPEVPFAQ L P+LQK+ESD+Q  FPND FQSYQFYP SP+SHLISPR VISRSG+SS LPD DFAS GSQF NF
Subjt:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF

Query:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRSV
        PLEVPPTLL+LDK SI++W+QR+STDS +QDSI  KSSNDFVLNPQ SESMSDHHATNESQNI+ILIDG+  + E   P A NHRFSFELSD DALL+SV
Subjt:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRSV

Query:  GSKPLESNELEVASSQLHEPFETAKENSPA-VCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT
        GSKPL+SNE+ VASS +HEPFETAKENSP    HTSN TE   K   E AHQHQEHHS+TLG S+     D        K  +NS+WWT+AKD++TEGTT
Subjt:  GSKPLESNELEVASSQLHEPFETAKENSPA-VCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT

TrEMBL top hitse value%identityAlignment
A0A1S3BSB0 uncharacterized protein LOC103493162 isoform X13.2e-12064.84Show/hide
Query:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----
        ++++      VLVPEPS PS+E H E++L SPDI LP AAP   P+ L  +       S T   SFT   +NM SPDGPSSIFAIGPFAHE QLVS    
Subjt:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----

Query:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF
             T  ST PFT PES HL  PSSPEVPFAQ + PSLQK ESDNQ +FPND FQSYQFYP SP+SHLISPR VISRSG+SS LPD DFAS GSQF NF
Subjt:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF

Query:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS
        PLEVPPTL +LDK SI++W+QR+STDS +QDSI FKSSNDFVLNP  SESM DHHATNESQNI+ILI DGS  E E   P A NHRFSFELSD D L +S
Subjt:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS

Query:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT
        VGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG S+     D         P +NSDWWT+AKD  TEGTT
Subjt:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT

Query:  T
        T
Subjt:  T

A0A1S3BSY8 uncharacterized protein LOC103493162 isoform X23.2e-12064.84Show/hide
Query:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----
        ++++      VLVPEPS PS+E H E++L SPDI LP AAP   P+ L  +       S T   SFT   +NM SPDGPSSIFAIGPFAHE QLVS    
Subjt:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----

Query:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF
             T  ST PFT PES HL  PSSPEVPFAQ + PSLQK ESDNQ +FPND FQSYQFYP SP+SHLISPR VISRSG+SS LPD DFAS GSQF NF
Subjt:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF

Query:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS
        PLEVPPTL +LDK SI++W+QR+STDS +QDSI FKSSNDFVLNP  SESM DHHATNESQNI+ILI DGS  E E   P A NHRFSFELSD D L +S
Subjt:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS

Query:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT
        VGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG S+     D         P +NSDWWT+AKD  TEGTT
Subjt:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT

Query:  T
        T
Subjt:  T

A0A5A7TUB1 Mucin-21.0e-11864.09Show/hide
Query:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----
        ++++      VLVPEPS PS+E H E++L SPDI LP AAP   P+ L  +       S T   SFT   +NM SPDGPSSIFAIGPFAHE QLVS    
Subjt:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----

Query:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF
             T  ST PFT PES HL  PSSPEVPFAQ + PS QK ESDNQ +FPND FQSYQFYP SP+SHLISPR VISRSG+SS LPD DFAS GSQF NF
Subjt:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF

Query:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS
        PL+VPPTL ++DK SI++W+QR+STDS +QDSI FKSSNDFVLNP  SESM DHHATNESQNI+ILI DGS  E E   P A NHRFSFELSD D L +S
Subjt:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS

Query:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT
        VGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG S+     D         P +NSDWWT+AKD  TEGTT
Subjt:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT

Query:  T
        T
Subjt:  T

A0A5D3CYQ2 Mucin-23.2e-12064.84Show/hide
Query:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----
        ++++      VLVPEPS PS+E H E++L SPDI LP AAP   P+ L  +       S T   SFT   +NM SPDGPSSIFAIGPFAHE QLVS    
Subjt:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFN------LSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----

Query:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF
             T  ST PFT PES HL  PSSPEVPFAQ + PSLQK ESDNQ +FPND FQSYQFYP SP+SHLISPR VISRSG+SS LPD DFAS GSQF NF
Subjt:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNF

Query:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS
        PLEVPPTL +LDK SI++W+QR+STDS +QDSI FKSSNDFVLNP  SESM DHHATNESQNI+ILI DGS  E E   P A NHRFSFELSD D L +S
Subjt:  PLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILI-DGSLMEAERRKPVAANHRFSFELSDADALLRS

Query:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT
        VGSKPLESNEL V SS +HEPFET KENSP   HTSN  EE  K +G+ AHQHQEHHS+ LG S+     D         P +NSDWWT+AKD  TEGTT
Subjt:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEGTT

Query:  T
        T
Subjt:  T

A0A6J1C828 uncharacterized protein At1g76660-like4.2e-11260.3Show/hide
Query:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLF------FNLSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----
        ++++      VLVPEPSP +     E++L SPDI LP AAP   P+            S T   SFT   +NM SPDGPSSIFA+GPFAHETQLVS    
Subjt:  IEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLF------FNLSHTFCYSFT--YSNMCSPDGPSSIFAIGPFAHETQLVS----

Query:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQC-SFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSN
             T  STAPFT PES HL  PSSPEVPFAQ L PS QK ESD+Q   FPND FQSYQFYP SP+SHLISPR VISRSG+SS LPDCDF  SGS FSN
Subjt:  -----TFESTAPFT-PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQC-SFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSN

Query:  FPLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRS
        FP+EVPPTLL+LD+ SI  W+ ++S+DS +Q+S+G+KSSNDFVLNPQ SES+SD+HA+NE  NI+IL DGS    +R +  AANHRFSFELSD DALL+S
Subjt:  FPLEVPPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRS

Query:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGE--HAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEG
        V +KPLESNEL VASS +HEP ETAKE S    HTSN TEE  K +GE  H HQ  EHHS+TLG ++     D        KP +NS WW + KD ETEG
Subjt:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGE--HAHQHQEHHSLTLGNSILIMAMD----VILFKPYVNSDWWTDAKDIETEG

Query:  TTT
        TTT
Subjt:  TTT

SwissProt top hitse value%identityAlignment
Q9SRE5 Uncharacterized protein At1g766601.0e-1142.75Show/hide
Query:  CYSFTYSNMCSPDGP-SSIFAIGPFAHETQLVS--------TFESTAPFT--PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPC
        CY    +N  SP GP SS++A GP+AHETQLVS        T  STAPFT  PE   L  PSSP+VP+A+ L  S+    S       ND   +Y  YP 
Subjt:  CYSFTYSNMCSPDGP-SSIFAIGPFAHETQLVS--------TFESTAPFT--PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPC

Query:  SPISHLISPRPVISRSGSSSRLP----DCDFASSGSQF
        SP S L SP   ISR+     L      C  + SG+ F
Subjt:  SPISHLISPRPVISRSGSSSRLP----DCDFASSGSQF

Arabidopsis top hitse value%identityAlignment
AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1)8.8e-1437.62Show/hide
Query:  VLVPEPSPPSA--EAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYS----FTYSNMCSPDGPSSIFAIGPFAHETQLVS--------TFESTAPF
        VLVPEP   S+           S    LP  AP   P   F +   +   S     ++S +   + P SIFAIGP+AHETQLVS        T  S+AP 
Subjt:  VLVPEPSPPSA--EAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYS----FTYSNMCSPDGPSSIFAIGPFAHETQLVS--------TFESTAPF

Query:  TP----ESTHL--IRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDY-FQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPT
        TP     S +L    PSSPEVPFAQ+   + Q      +    + Y FQ YQ  P SP+  LISP P    SG +S  PD       S F +F +  PP 
Subjt:  TP----ESTHL--IRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDY-FQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPT

Query:  LL
        LL
Subjt:  LL

AT1G76660.1 FUNCTIONS IN: molecular_function unknown7.5e-1342.75Show/hide
Query:  CYSFTYSNMCSPDGP-SSIFAIGPFAHETQLVS--------TFESTAPFT--PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPC
        CY    +N  SP GP SS++A GP+AHETQLVS        T  STAPFT  PE   L  PSSP+VP+A+ L  S+    S       ND   +Y  YP 
Subjt:  CYSFTYSNMCSPDGP-SSIFAIGPFAHETQLVS--------TFESTAPFT--PESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPC

Query:  SPISHLISPRPVISRSGSSSRLP----DCDFASSGSQF
        SP S L SP   ISR+     L      C  + SG+ F
Subjt:  SPISHLISPRPVISRSGSSSRLP----DCDFASSGSQF

AT4G25620.1 hydroxyproline-rich glycoprotein family protein5.5e-1627.72Show/hide
Query:  VLVPEPSPPSAEAH--EEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCS--PDGPSSIFAIGPFAHETQLV--------STFESTAPFTP
        VLVPEP+   A     +  S +S  I +P  AP   P     +   +  ++     +CS   + P S F IGP+AHETQ V        +T  STAPFTP
Subjt:  VLVPEPSPPSAEAH--EEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCS--PDGPSSIFAIGPFAHETQLV--------STFESTAPFTP

Query:  ESTHLIRPSSPEVPFAQVLLPSLQKAESDN------QCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLD
               PSSPEVPFAQ+L  SL++A  ++      + S  +  F+S Q YP SP  +LISP      SG+SS  P             F +  PP  L 
Subjt:  ESTHLIRPSSPEVPFAQVLLPSLQKAESDN------QCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLD

Query:  LDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRIL-------IDGSLMEAE-----------------RRKPVAANHRF
         +  +   W  R  + S +    G +  +   L P  S+  S     N ++ +  +       ++GSL++++                   + +   HR 
Subjt:  LDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRIL-------IDGSLMEAE-----------------RRKPVAANHRF

Query:  SFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNS---ILIMAMDVILFKPYVNSDWW
        SFEL+  D + R + SK   S   E AS +            P  C TS  TE         + Q Q+  S + G++         + ++ K  + S+WW
Subjt:  SFELSDADALLRSVGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNS---ILIMAMDVILFKPYVNSDWW

Query:  TDAK
         + K
Subjt:  TDAK

AT5G52430.1 hydroxyproline-rich glycoprotein family protein4.5e-1831.04Show/hide
Query:  VLVPEPSPPSAE-AHEEDSLHSPDIELPLAAPLPLPLYLF----FNLSHTFCYSFTY-SNMCSPDGPSSIFAIGPFAHETQLVS--------TFESTAPF
        VLVPEP          ++S  S  + LP  AP   P         ++SH+     +  SN  SP  P S+F +GP+A+ETQ V+        T  STAP+
Subjt:  VLVPEPSPPSAE-AHEEDSLHSPDIELPLAAPLPLPLYLF----FNLSHTFCYSFTY-SNMCSPDGPSSIFAIGPFAHETQLVS--------TFESTAPF

Query:  TP---ESTHLIRPSSPEVPFAQVLLPSLQKAESD-----NQCSFPNDY-FQSYQFYPCSP-ISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEV
        TP    S H+  PSSPEVPFAQ+L  SL+    D     NQ    + Y F+S Q  P SP   +LISP  VIS SG+SS  P        S    F +  
Subjt:  TP---ESTHLIRPSSPEVPFAQVLLPSLQKAESD-----NQCSFPNDY-FQSYQFYPCSP-ISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEV

Query:  PPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQ----ISESMSDHHATNESQNIRILIDGSLMEAERRKPV-AANHRFSFELSDADALLRS
        PP  L  +  +   W  R  + S +    G   ++   L P     +S +++ ++ T   QN +I    SL  ++    V  A+HR SFEL+  D + R 
Subjt:  PPTLLDLDKCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQ----ISESMSDHHATNESQNIRILIDGSLMEAERRKPV-AANHRFSFELSDADALLRS

Query:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNS
        + SK   S++        ++  ET + +S  +       E+ +       H+ Q+  S ++G+S
Subjt:  VGSKPLESNELEVASSQLHEPFETAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTATCACTTTCCTCGTTTTCTCTTTGTTTGCTTGTCCTCAGAAAAGGAGATGGGGTAGCTGTTGGAGTATTTATTGAGGAAAAGAGTTGTTGTGCACGCTGTGT
GTTGGTACCCGAACCTAGTCCTCCTTCAGCTGAGGCTCATGAAGAAGATTCATTGCACTCACCCGATATTGAGCTTCCACTTGCTGCACCCCTCCCTCTTCCCTTGTATC
TTTTCTTCAATCTGAGCCACACCTTCTGCTACTCATTCACCTACAGCAACATGTGTTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACA
CAGCTTGTCTCCACCTTTGAATCAACTGCTCCCTTCACTCCTGAGTCTACCCACTTGATTAGGCCTTCTTCCCCTGAAGTTCCTTTTGCTCAGGTTCTTCTACCTAGCCT
ACAGAAAGCTGAGTCTGATAATCAATGTTCATTTCCTAATGATTACTTCCAATCTTACCAATTCTACCCTTGCAGCCCGATTAGTCACCTCATATCGCCACGGCCAGTCA
TTTCTCGTTCTGGGTCGTCATCGCGTTTGCCTGATTGTGATTTTGCTTCCTCTGGCTCTCAGTTTTCGAATTTCCCATTAGAAGTTCCACCTACATTATTGGACCTTGAC
AAATGTTCCATTTATAGCTGGCAACAACGGCGAAGCACTGATTCTTACTCTCAAGATTCTATAGGATTCAAATCAAGTAATGATTTTGTTTTGAATCCCCAAATTTCAGA
ATCTATGTCAGATCACCACGCAACAAATGAATCCCAAAATATTCGAATTCTCATTGATGGAAGCCTGATGGAAGCCGAAAGGAGGAAGCCTGTTGCTGCTAATCATAGAT
TCTCATTTGAGTTATCTGATGCAGATGCTTTATTAAGAAGCGTAGGAAGTAAGCCGCTGGAATCAAATGAACTGGAAGTTGCATCATCTCAATTACATGAACCATTTGAA
ACGGCTAAAGAAAATTCTCCTGCTGTCTGTCATACCTCAAATGGTACAGAAGAATATGCAAAAACAAACGGTGAACATGCACATCAGCATCAAGAACACCACTCCCTTAC
CCTTGGGAATTCAATTTTGATCATGGCAATGGATGTGATACTCTTTAAGCCATATGTAAATTCAGACTGGTGGACGGATGCAAAGGATATAGAGACAGAAGGCACGACCA
CTAGAGGAATGATGGGCATCATGTGCTTTTCCTCGAAGTTGAGCAAACTG
mRNA sequenceShow/hide mRNA sequence
ATGTCATTATCACTTTCCTCGTTTTCTCTTTGTTTGCTTGTCCTCAGAAAAGGAGATGGGGTAGCTGTTGGAGTATTTATTGAGGAAAAGAGTTGTTGTGCACGCTGTGT
GTTGGTACCCGAACCTAGTCCTCCTTCAGCTGAGGCTCATGAAGAAGATTCATTGCACTCACCCGATATTGAGCTTCCACTTGCTGCACCCCTCCCTCTTCCCTTGTATC
TTTTCTTCAATCTGAGCCACACCTTCTGCTACTCATTCACCTACAGCAACATGTGTTCTCCTGATGGGCCTTCCTCCATTTTTGCCATTGGCCCATTTGCTCATGAAACA
CAGCTTGTCTCCACCTTTGAATCAACTGCTCCCTTCACTCCTGAGTCTACCCACTTGATTAGGCCTTCTTCCCCTGAAGTTCCTTTTGCTCAGGTTCTTCTACCTAGCCT
ACAGAAAGCTGAGTCTGATAATCAATGTTCATTTCCTAATGATTACTTCCAATCTTACCAATTCTACCCTTGCAGCCCGATTAGTCACCTCATATCGCCACGGCCAGTCA
TTTCTCGTTCTGGGTCGTCATCGCGTTTGCCTGATTGTGATTTTGCTTCCTCTGGCTCTCAGTTTTCGAATTTCCCATTAGAAGTTCCACCTACATTATTGGACCTTGAC
AAATGTTCCATTTATAGCTGGCAACAACGGCGAAGCACTGATTCTTACTCTCAAGATTCTATAGGATTCAAATCAAGTAATGATTTTGTTTTGAATCCCCAAATTTCAGA
ATCTATGTCAGATCACCACGCAACAAATGAATCCCAAAATATTCGAATTCTCATTGATGGAAGCCTGATGGAAGCCGAAAGGAGGAAGCCTGTTGCTGCTAATCATAGAT
TCTCATTTGAGTTATCTGATGCAGATGCTTTATTAAGAAGCGTAGGAAGTAAGCCGCTGGAATCAAATGAACTGGAAGTTGCATCATCTCAATTACATGAACCATTTGAA
ACGGCTAAAGAAAATTCTCCTGCTGTCTGTCATACCTCAAATGGTACAGAAGAATATGCAAAAACAAACGGTGAACATGCACATCAGCATCAAGAACACCACTCCCTTAC
CCTTGGGAATTCAATTTTGATCATGGCAATGGATGTGATACTCTTTAAGCCATATGTAAATTCAGACTGGTGGACGGATGCAAAGGATATAGAGACAGAAGGCACGACCA
CTAGAGGAATGATGGGCATCATGTGCTTTTCCTCGAAGTTGAGCAAACTG
Protein sequenceShow/hide protein sequence
MSLSLSSFSLCLLVLRKGDGVAVGVFIEEKSCCARCVLVPEPSPPSAEAHEEDSLHSPDIELPLAAPLPLPLYLFFNLSHTFCYSFTYSNMCSPDGPSSIFAIGPFAHET
QLVSTFESTAPFTPESTHLIRPSSPEVPFAQVLLPSLQKAESDNQCSFPNDYFQSYQFYPCSPISHLISPRPVISRSGSSSRLPDCDFASSGSQFSNFPLEVPPTLLDLD
KCSIYSWQQRRSTDSYSQDSIGFKSSNDFVLNPQISESMSDHHATNESQNIRILIDGSLMEAERRKPVAANHRFSFELSDADALLRSVGSKPLESNELEVASSQLHEPFE
TAKENSPAVCHTSNGTEEYAKTNGEHAHQHQEHHSLTLGNSILIMAMDVILFKPYVNSDWWTDAKDIETEGTTTRGMMGIMCFSSKLSKL