; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh06G005190 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh06G005190
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationCma_Chr06:2429356..2430198
RNA-Seq ExpressionCmaCh06G005190
SyntenyCmaCh06G005190
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596613.1 hypothetical protein SDJN03_09793, partial [Cucurbita argyrosperma subsp. sororia]1.5e-11292.77Show/hide
Query:  MASSSDNQQ--SKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALII
        MASSSDNQQ  SKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPP AYYHNSPQNY VEPFHAA IRGIVTALII
Subjt:  MASSSDNQQ--SKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALII

Query:  LVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSS
        LVVLMML+SIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFL VERTNLMRVRWTSS
Subjt:  LVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSS

Query:  SPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTT
        SPDDPG+WEETEEKLGKEKATRK   N R   W +
Subjt:  SPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTT

KAG7028149.1 NDR1/HIN1-like protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma]4.8e-15197.12Show/hide
Query:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV
        MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPP AYYHNSPQNY VEPFHAA IRGIVTALIILV
Subjt:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV

Query:  VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP
        VLMML+SIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFL VERTNLMRVRWTSSSP
Subjt:  VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP

Query:  DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTV
        DDPG+WEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSA+GHHMHC V
Subjt:  DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTV

XP_022941877.1 uncharacterized protein LOC111447106 [Cucurbita moschata]2.6e-14995.74Show/hide
Query:  MASSSDNQQ--SKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALII
        MASSSDNQQ  SKSKSTDSQPLPPPSAAHNP PIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPP AYYHNSPQNY VEPFHA+ IRGIVTALII
Subjt:  MASSSDNQQ--SKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALII

Query:  LVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSS
        LVVLMML+SIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFL VERTNLMRVRWTSS
Subjt:  LVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSS

Query:  SPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
        SPDDPG+WEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSA+GHHMHC VLM
Subjt:  SPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM

XP_023005718.1 uncharacterized protein LOC111498631 [Cucurbita maxima]7.6e-157100Show/hide
Query:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV
        MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV
Subjt:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV

Query:  VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP
        VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP
Subjt:  VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP

Query:  DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
        DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
Subjt:  DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM

XP_023539989.1 uncharacterized protein LOC111800503 [Cucurbita pepo subsp. pepo]5.7e-15297.14Show/hide
Query:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV
        MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPP AYYHNSPQNY VEPFHAA IRGIVTALIILV
Subjt:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV

Query:  VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP
        VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFL VERTNLMRVRWTSSSP
Subjt:  VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP

Query:  DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
        DDPG+WEETEEKLGKEKATRKV FNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSA+GHHMHC VLM
Subjt:  DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM

TrEMBL top hitse value%identityAlignment
A0A0A0LGS8 Uncharacterized protein1.3e-9362.12Show/hide
Query:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT--------------MGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHA
        MASSS++QQS+SK+TD  P  P SA +NPPP+YPPPT              MGYPP P PGYPPAPG YPPYN Y YAQAPP AYY N+PQNY  +   A
Subjt:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT--------------MGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHA

Query:  ALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVE
          +RGIVTALI+LV +M LSSIITWI+LRP+IP F+VD+  V+NFNISK NYSGNWN +L V+NPN KL +  +RIQ FV YK+NTLAMS+ADPFF+ VE
Subjt:  ALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVE

Query:  RTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVL
        +++ MRV+ TSSSPDDPG+W ETEEK+G+EKA+  VSFNLRFF WT F+SGSWWTR ++++VFC+DLK+ F  P + +G + A  H   C+VL
Subjt:  RTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVL

A0A1S3B6W4 uncharacterized protein LOC1034866745.4e-9262.03Show/hide
Query:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT---------------MGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFH
        MASSS++QQS+SK+TD  P  P SA +NPPP+YPPPT               MGYPPAPHP YPPA G YPPYN Y YAQAPP AYY N+PQNY      
Subjt:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT---------------MGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFH

Query:  AALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGV
        A  +RGIV ALI+LV +M LSSIITWIILRPE+P F+VD+  V+NFNISK NYSGNW+A++ VQNPN KLN+  +RIQ FV YK NTLAMS+ADPFFL V
Subjt:  AALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGV

Query:  ERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
        E++  M+V+ TSSSPDDPG+W ETEEKLG+E+AT  VSFNLRFF WTTF++GSWWTR V++RV C+D+K+ F  P + +  + A  H   C+VL+
Subjt:  ERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM

A0A5A7TLT1 Protein YLS95.4e-9262.03Show/hide
Query:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT---------------MGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFH
        MASSS++QQS+SK+TD  P  P SA +NPPP+YPPPT               MGYPPAPHP YPPA G YPPYN Y YAQAPP AYY N+PQNY      
Subjt:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPT---------------MGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFH

Query:  AALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGV
        A  +RGIV ALI+LV +M LSSIITWIILRPE+P F+VD+  V+NFNISK NYSGNW+A++ VQNPN KLN+  +RIQ FV YK NTLAMS+ADPFFL V
Subjt:  AALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGV

Query:  ERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
        E++  M+V+ TSSSPDDPG+W ETEEKLG+E+AT  VSFNLRFF WTTF++GSWWTR V++RV C+D+K+ F  P + +  + A  H   C+VL+
Subjt:  ERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM

A0A6J1FNP1 uncharacterized protein LOC1114471061.3e-14995.74Show/hide
Query:  MASSSDNQQ--SKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALII
        MASSSDNQQ  SKSKSTDSQPLPPPSAAHNP PIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPP AYYHNSPQNY VEPFHA+ IRGIVTALII
Subjt:  MASSSDNQQ--SKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALII

Query:  LVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSS
        LVVLMML+SIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFL VERTNLMRVRWTSS
Subjt:  LVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSS

Query:  SPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
        SPDDPG+WEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSA+GHHMHC VLM
Subjt:  SPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM

A0A6J1L2Y7 uncharacterized protein LOC1114986313.7e-157100Show/hide
Query:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV
        MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV
Subjt:  MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILV

Query:  VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP
        VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP
Subjt:  VLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSP

Query:  DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
        DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM
Subjt:  DDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM

SwissProt top hitse value%identityAlignment
Q8LD98 NDR1/HIN1-like protein 67.0e-0426.05Show/hide
Query:  LIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISK-SNYSGNWNATLVVQNPNKKLNLTFKRIQGF-VGYKDNTLAMSFADPFFLGVERTNLMRV
        L++LVV +  S  I +++ +P++P + +D L +T F +++ S+ +  +N T+  +NPN+K+ + ++      V Y ++ L+      F+ G E T ++ V
Subjt:  LIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISK-SNYSGNWNATLVVQNPNKKLNLTFKRIQGF-VGYKDNTLAMSFADPFFLGVERTNLMRV

Query:  RWTSSSPDDPGHWEETEEK
          T  + +  G     EE+
Subjt:  RWTSSSPDDPGHWEETEEK

Q9SJ52 NDR1/HIN1-like protein 102.3e-0723.18Show/hide
Query:  AYPPYNGYAYA-QAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWN--ATLVVQN
        A  P NG  Y    PP A      + +G       L+   V  +I L+V++ ++++I W+I+RP    F V    +T F+ +  +    +N   T+ V+N
Subjt:  AYPPYNGYAYA-QAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWN--ATLVVQN

Query:  PNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFC
        PNK++ L + RI+    Y+    +     PF+ G + T ++   +   +       +     L  E+ +   +  ++F +   F+ G    R +  +V C
Subjt:  PNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFC

Query:  DDLKIDFGTPNSVNGSFSAY
        DDL++   T N    + + +
Subjt:  DDLKIDFGTPNSVNGSFSAY

Q9SRN1 NDR1/HIN1-like protein 22.4e-0424.23Show/hide
Query:  PGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSN---YSGNW
        P  PP P A+  YN      +P       S     +      ++  I   LI + V++ ++++I W+I RP    F V    +  F+   +N   YS + 
Subjt:  PGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSN---YSGNW

Query:  NATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEET----EEKLGKEKATRKVSFNLRFFVWTTFQSGS
        N T  ++NPN+++ + +        Y D     +    F+ G + T ++  +    +    G    T    +EK G  +   K+  ++RF  W      S
Subjt:  NATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEET----EEKLGKEKATRKVSFNLRFFVWTTFQSGS

Query:  WWTRHVILRVFCDDLKIDFGTPNSVNG
        W  +    ++ CDDLKI  G+ NS  G
Subjt:  WWTRHVILRVFCDDLKIDFGTPNSVNG

Arabidopsis top hitse value%identityAlignment
AT2G27260.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family2.8e-1629.22Show/hide
Query:  PTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPF-HAALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNIS
        P  GY P P+P YP      PP NGY    A     Y N    Y  +P   A +IR +       ++L+ L   I ++I+RP++P   +++L V+NFN+S
Subjt:  PTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPF-HAALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNIS

Query:  KSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATR-KVSFNLRFFVWTT
         +  SG W+  L  +NPN K++L ++     + Y   +L+ +   PF  G +   ++    + S     G      + +GKE++ +  V F+LR   + T
Subjt:  KSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATR-KVSFNLRFFVWTT

Query:  FQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHC
        F+ G++  R  +  V+CDD+ +  G P S +G     G    C
Subjt:  FQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHC

AT2G35980.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.7e-0823.18Show/hide
Query:  AYPPYNGYAYA-QAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWN--ATLVVQN
        A  P NG  Y    PP A      + +G       L+   V  +I L+V++ ++++I W+I+RP    F V    +T F+ +  +    +N   T+ V+N
Subjt:  AYPPYNGYAYA-QAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWN--ATLVVQN

Query:  PNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFC
        PNK++ L + RI+    Y+    +     PF+ G + T ++   +   +       +     L  E+ +   +  ++F +   F+ G    R +  +V C
Subjt:  PNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFC

Query:  DDLKIDFGTPNSVNGSFSAY
        DDL++   T N    + + +
Subjt:  DDLKIDFGTPNSVNGSFSAY

AT3G52460.1 hydroxyproline-rich glycoprotein family protein9.9e-3839.38Show/hide
Query:  SQPLPPPSAAHNPPPIYP----PPTMGY-----PPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNS---PQNYGVE-PFHAALIRGIVTALIILVVLM
        +QP PPP  +  PPP       PP MGY     PP P+P YP A     PY  Y YAQAPP +YY +S    QN   + P  +  +RGI T LI+LVVL+
Subjt:  SQPLPPPSAAHNPPPIYP----PPTMGY-----PPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNS---PQNYGVE-PFHAALIRGIVTALIILVVLM

Query:  MLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGY-----KDNTLAMSFADPFFLGVERTNLMRVRWTSS
         +S+ ITW++LRP+IP F V+   V+NFN++   +S  W A L ++N N KL   F RIQG V +     +D  LA +F  P F+  +++ ++    T+ 
Subjt:  MLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGY-----KDNTLAMSFADPFFLGVERTNLMRVRWTSS

Query:  SPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDF
          + P       +++ KE+ T  V+F+LR  VW TF++  W  R   L+VFC  LK+ F
Subjt:  SPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDF

AT3G52470.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.2e-0627.66Show/hide
Query:  LIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSN-YSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP
        ++R +  A+I  +V+++++  + W+ILRP  P F +    V  FN+S+ N  + N+  T+  +NPN K+ + + R+  +  Y +  + +  A P
Subjt:  LIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSN-YSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADP

AT5G22870.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family7.9e-1123.2Show/hide
Query:  ALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNY-SGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGV
        +LI  I   ++ L+ +  +  +ITW+  +P+   + V+   V NFN++  N+ S  +  T+   NPN ++++ +  ++ FV +KD TLA    +PF    
Subjt:  ALIRGIVTALIILVVLMMLSSIITWIILRPEIPTFRVDTLGVTNFNISKSNY-SGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGV

Query:  ERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGS
         R N+ ++  T  + ++    +   + L  + +  K+ F +       F+ G W + H   ++ C  + +    PN    S
Subjt:  ERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATRKVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCTTCATCGGACAATCAACAATCCAAATCCAAATCCACCGACTCCCAACCTCTTCCGCCGCCCTCCGCCGCACACAACCCACCTCCGATCTACCCTCCTCCCAC
CATGGGGTACCCTCCAGCCCCACATCCGGGGTACCCTCCAGCGCCAGGGGCTTACCCACCATACAATGGCTACGCCTACGCCCAAGCCCCTCCCACAGCCTATTACCACA
ACAGCCCCCAAAATTACGGGGTGGAGCCGTTTCACGCCGCCTTAATCCGCGGCATTGTCACCGCCTTAATAATTCTGGTGGTTCTAATGATGCTCTCCAGCATAATCACC
TGGATCATCCTCCGACCAGAAATCCCAACGTTCAGAGTTGATACCTTGGGCGTCACCAATTTTAACATCTCCAAATCGAATTACTCCGGAAACTGGAACGCGACCTTGGT
GGTGCAGAATCCCAACAAGAAACTGAACCTGACTTTCAAGCGGATCCAGGGGTTCGTGGGCTATAAGGACAACACGCTGGCGATGTCGTTTGCGGACCCATTTTTTCTTG
GTGTGGAGAGGACTAACCTAATGCGGGTAAGATGGACGTCGAGTAGCCCTGATGATCCGGGGCATTGGGAGGAGACGGAGGAGAAATTGGGGAAGGAGAAGGCGACGAGG
AAAGTGAGTTTCAATTTGAGATTCTTCGTATGGACCACTTTCCAATCTGGGTCTTGGTGGACCAGGCACGTTATTTTGAGAGTCTTTTGTGACGATTTGAAGATCGACTT
CGGCACCCCCAACTCCGTTAATGGCTCCTTCTCCGCCTATGGCCACCACATGCATTGCACGGTTCTCATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCTTCATCGGACAATCAACAATCCAAATCCAAATCCACCGACTCCCAACCTCTTCCGCCGCCCTCCGCCGCACACAACCCACCTCCGATCTACCCTCCTCCCAC
CATGGGGTACCCTCCAGCCCCACATCCGGGGTACCCTCCAGCGCCAGGGGCTTACCCACCATACAATGGCTACGCCTACGCCCAAGCCCCTCCCACAGCCTATTACCACA
ACAGCCCCCAAAATTACGGGGTGGAGCCGTTTCACGCCGCCTTAATCCGCGGCATTGTCACCGCCTTAATAATTCTGGTGGTTCTAATGATGCTCTCCAGCATAATCACC
TGGATCATCCTCCGACCAGAAATCCCAACGTTCAGAGTTGATACCTTGGGCGTCACCAATTTTAACATCTCCAAATCGAATTACTCCGGAAACTGGAACGCGACCTTGGT
GGTGCAGAATCCCAACAAGAAACTGAACCTGACTTTCAAGCGGATCCAGGGGTTCGTGGGCTATAAGGACAACACGCTGGCGATGTCGTTTGCGGACCCATTTTTTCTTG
GTGTGGAGAGGACTAACCTAATGCGGGTAAGATGGACGTCGAGTAGCCCTGATGATCCGGGGCATTGGGAGGAGACGGAGGAGAAATTGGGGAAGGAGAAGGCGACGAGG
AAAGTGAGTTTCAATTTGAGATTCTTCGTATGGACCACTTTCCAATCTGGGTCTTGGTGGACCAGGCACGTTATTTTGAGAGTCTTTTGTGACGATTTGAAGATCGACTT
CGGCACCCCCAACTCCGTTAATGGCTCCTTCTCCGCCTATGGCCACCACATGCATTGCACGGTTCTCATGTAG
Protein sequenceShow/hide protein sequence
MASSSDNQQSKSKSTDSQPLPPPSAAHNPPPIYPPPTMGYPPAPHPGYPPAPGAYPPYNGYAYAQAPPTAYYHNSPQNYGVEPFHAALIRGIVTALIILVVLMMLSSIIT
WIILRPEIPTFRVDTLGVTNFNISKSNYSGNWNATLVVQNPNKKLNLTFKRIQGFVGYKDNTLAMSFADPFFLGVERTNLMRVRWTSSSPDDPGHWEETEEKLGKEKATR
KVSFNLRFFVWTTFQSGSWWTRHVILRVFCDDLKIDFGTPNSVNGSFSAYGHHMHCTVLM