; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017941 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017941
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionLate embryogenesis abundant hydroxyproline-rich glycoprotein family, putative
Genome locationtig00153057:759568..764636
RNA-Seq ExpressionSgr017941
SyntenySgr017941
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607728.1 NDR1/HIN1-like protein 3, partial [Cucurbita argyrosperma subsp. sororia]7.7e-11781.88Show/hide
Query:  PPLSPPPP-PFSAPSARSPSN-AAAPPATTPSPAS-AVAHTSPAL-TSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQ
        PPL PPPP P   PSA SPS+ AAA P T PSPAS AV   S AL  +ASPS QA+ T L+QIV+Q++AT+PHTPLLN N IDSS +RK  +LQQ RPRQ
Subjt:  PPLSPPPP-PFSAPSARSPSN-AAAPPATTPSPAS-AVAHTSPAL-TSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQ

Query:  TNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQP
        TNPIIWCFAILCL+FSLLLIF GIATLIIFLVVRPRNP+FDIPNASLSTIYFD+PEYLNGDFTVL NFTNPNHR+DVRYE+ADIELFFGDRLIATQAIQP
Subjt:  TNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQP

Query:  FSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        FSQR++EVRL+PVHLISSLVYLPQNSG +LRRQV NNKVIYNIRGTFRVRASLG IHYSYWLHSRCQLVMTSPPTGVLVARSC T+R
Subjt:  FSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

XP_004139769.1 uncharacterized protein LOC101218998 [Cucumis sativus]4.5e-11778.55Show/hide
Query:  KLAQMPPLSPPPPPFS-APSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAE-PTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRP
        +L   PP  PP PP S AP A SPS+ AA P T PSP SA    +   T AS S +AE  +L QI+VQNRATKPHTPLLN+NH DSSK+RKP +LQQ RP
Subjt:  KLAQMPPLSPPPPPFS-APSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAE-PTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRP

Query:  RQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAI
        RQTNPIIWCFA LCL FSLLLIF GIATLIIFLV+RPRNPLFDIPNASLSTIYFD+PEYLNGDFT+L NFTNPNHRVDVRYE+ADIELFFGDRLIATQAI
Subjt:  RQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAI

Query:  QPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        QPFSQR+ E+RL+PVHL SSLV+LPQN GLVLRRQV NNKV+YNIRGTFRV+AS+GIIHYS+WLHSRCQLVMTSPPTG+LVARSC+TKR
Subjt:  QPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

XP_022139590.1 uncharacterized protein LOC111010452 [Momordica charantia]2.0e-12889.08Show/hide
Query:  PPLSPPPPPFSAPSARSPSNAAAPPATTPSPASAV-AHTSPALTSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQTNP
        PPL PPPPPF+APSA SPSNAAAPP   PSPA A  A TSP LTSA+PS QAE T L+QIV+QNRATKPHTPLLN +HIDSS +RK PILQQ RPRQTNP
Subjt:  PPLSPPPPPFSAPSARSPSNAAAPPATTPSPASAV-AHTSPALTSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQTNP

Query:  IIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQ
        IIWCFA LC+LFS+LLIFFGIATLI+FLVVRPRNPLFDIPNASLSTIYFDS EYLNGDFTVLTNFTNPNHRVDVRYE+ADIELFFGDRLIATQAIQPFSQ
Subjt:  IIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQ

Query:  RRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        RRKEVRLEPVHLISSLVYLPQNSGLVLRRQV NNKVIYNIRGTF+VRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
Subjt:  RRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

XP_022941198.1 uncharacterized protein LOC111446574 [Cucurbita moschata]3.5e-11782.23Show/hide
Query:  PPLSPPPP-PFSAPSARSPSN-AAAPPATTPSPAS-AVAHTSPAL-TSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQ
        PPL PPPP P   PSA SPS+ AAA P T PSPAS AV   S AL  +ASPS QA+ T L+QIV+Q++AT+PHTPLLN N IDSS +RK  +LQQ RPRQ
Subjt:  PPLSPPPP-PFSAPSARSPSN-AAAPPATTPSPAS-AVAHTSPAL-TSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQ

Query:  TNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQP
        TNPIIWCFAILCL+FSLLLIF GIATLIIFLVVRPRNP+FDIPNASLSTIYFD+PEYLNGDFTVL NFTNPNHR+DVRYE+ADIELFFGDRLIATQAIQP
Subjt:  TNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQP

Query:  FSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        FSQR++EVRL+PVHLISSLVYLPQNSG +LRRQV NNKVIYNIRGTFRVRASLG IHYSYWLHSRCQLVMTSPPTGVLVARSC TKR
Subjt:  FSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

XP_038897659.1 uncharacterized protein LOC120085632 [Benincasa hispida]6.1e-12282.76Show/hide
Query:  MPPLSPPPPP------FSAPSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAE-PTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTR
        MPPL PPPPP      F  PSA SPSN AA P TTPSPASAV     + T AS S  AE  +L QI+VQNRATKPHTPLLN+NH DSSK+RKP +LQQ R
Subjt:  MPPLSPPPPP------FSAPSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAE-PTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTR

Query:  PRQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQA
        PRQTNPIIWCFA LCLLFSLLLIF GIATLIIFLVVRPRNPLFDIPNASLSTIYFD+PEYLNGDFTVL NFTNPNHRVD+RYE+ADIELFFGDRLIATQA
Subjt:  PRQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQA

Query:  IQPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        IQPFSQR+KEVRL+PVHLISSLV+LPQN GLVLRRQV NNKV+YNIRGTFRV+ASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSC+ KR
Subjt:  IQPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

TrEMBL top hitse value%identityAlignment
A0A0A0K372 LEA_2 domain-containing protein2.2e-11778.55Show/hide
Query:  KLAQMPPLSPPPPPFS-APSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAE-PTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRP
        +L   PP  PP PP S AP A SPS+ AA P T PSP SA    +   T AS S +AE  +L QI+VQNRATKPHTPLLN+NH DSSK+RKP +LQQ RP
Subjt:  KLAQMPPLSPPPPPFS-APSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAE-PTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRP

Query:  RQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAI
        RQTNPIIWCFA LCL FSLLLIF GIATLIIFLV+RPRNPLFDIPNASLSTIYFD+PEYLNGDFT+L NFTNPNHRVDVRYE+ADIELFFGDRLIATQAI
Subjt:  RQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAI

Query:  QPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        QPFSQR+ E+RL+PVHL SSLV+LPQN GLVLRRQV NNKV+YNIRGTFRV+AS+GIIHYS+WLHSRCQLVMTSPPTG+LVARSC+TKR
Subjt:  QPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

A0A1S3BHM7 uncharacterized protein LOC1034901778.3e-11778.2Show/hide
Query:  KLAQMPPLSPPPPPFS-APSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAE-PTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRP
        +L   PP  PP PP S AP   SPS+ A+ P   PSPASA    +   T AS S +AE  +L QI+VQNRATKPHTPLLN+NH DSSK+RKP +LQQ RP
Subjt:  KLAQMPPLSPPPPPFS-APSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAE-PTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRP

Query:  RQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAI
        RQTNPIIWCFA LCL FSLLLIF GIATL+IFLVVRPRNPLFDIPNASLSTIYFD+PEYLNGDFT+L NFTNPNHRVDVRYE+ADIELFFGDRLIATQAI
Subjt:  RQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAI

Query:  QPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        QPFSQR+ EVRL+PVHLISSLV+LPQN GLVLRRQV NNK++YNIRGTFRV+AS+GIIHYS+WLHSRCQLVMTSPPTG+LVARSC+TKR
Subjt:  QPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

A0A6J1CCQ5 uncharacterized protein LOC1110104529.5e-12989.08Show/hide
Query:  PPLSPPPPPFSAPSARSPSNAAAPPATTPSPASAV-AHTSPALTSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQTNP
        PPL PPPPPF+APSA SPSNAAAPP   PSPA A  A TSP LTSA+PS QAE T L+QIV+QNRATKPHTPLLN +HIDSS +RK PILQQ RPRQTNP
Subjt:  PPLSPPPPPFSAPSARSPSNAAAPPATTPSPASAV-AHTSPALTSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQTNP

Query:  IIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQ
        IIWCFA LC+LFS+LLIFFGIATLI+FLVVRPRNPLFDIPNASLSTIYFDS EYLNGDFTVLTNFTNPNHRVDVRYE+ADIELFFGDRLIATQAIQPFSQ
Subjt:  IIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQ

Query:  RRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        RRKEVRLEPVHLISSLVYLPQNSGLVLRRQV NNKVIYNIRGTF+VRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
Subjt:  RRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

A0A6J1FRG3 uncharacterized protein LOC1114465741.7e-11782.23Show/hide
Query:  PPLSPPPP-PFSAPSARSPSN-AAAPPATTPSPAS-AVAHTSPAL-TSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQ
        PPL PPPP P   PSA SPS+ AAA P T PSPAS AV   S AL  +ASPS QA+ T L+QIV+Q++AT+PHTPLLN N IDSS +RK  +LQQ RPRQ
Subjt:  PPLSPPPP-PFSAPSARSPSN-AAAPPATTPSPAS-AVAHTSPAL-TSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQ

Query:  TNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQP
        TNPIIWCFAILCL+FSLLLIF GIATLIIFLVVRPRNP+FDIPNASLSTIYFD+PEYLNGDFTVL NFTNPNHR+DVRYE+ADIELFFGDRLIATQAIQP
Subjt:  TNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQP

Query:  FSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        FSQR++EVRL+PVHLISSLVYLPQNSG +LRRQV NNKVIYNIRGTFRVRASLG IHYSYWLHSRCQLVMTSPPTGVLVARSC TKR
Subjt:  FSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

A0A6J1J284 NDR1/HIN1-like protein 61.4e-11680.49Show/hide
Query:  PPLSPPPP-PFSAPSARSPSN-AAAPPATTPSPASAVAHTSPA--LTSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQ
        PPL PPPP P   PSA SPS+ AAA P T PSPASA    + A  + +ASPS QA+ T L+QIV+QN+AT+P TPLLN N IDSS +RK  +LQQ RPRQ
Subjt:  PPLSPPPP-PFSAPSARSPSN-AAAPPATTPSPASAVAHTSPA--LTSASPSSQAEPT-LQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQ

Query:  TNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQP
        TNPIIWCFAILCL+FSLLLIF GIATLIIFLVV+PRNP+FDIPNASLSTIYFD+PEYLNGDFTVL NFTNPNHR+DVRYE+ADIELFFGDRLIATQAIQP
Subjt:  TNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQP

Query:  FSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        FSQR++EVRL+PVHLISSLVYLPQNSG +LRRQV NNKVIYNIRGTFRVRASLG +HYSYWLHSRCQLVMTSPPTGVLVARSC+TKR
Subjt:  FSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

SwissProt top hitse value%identityAlignment
F4I1X0 Heavy metal-associated isoprenylated plant protein 412.5e-0930.82Show/hide
Query:  DGETEFCRMHKRLVYEFFKNASQMLRA--------------------------------------DDYPGYNNKRGQGHRCDDPFHLGECSTFKFSINHR
        + ++   R H+ LV+ FF  AS++LRA                                      ++YPGY NKRG G RCD PF LGECSTFKF  +  
Subjt:  DGETEFCRMHKRLVYEFFKNASQMLRA--------------------------------------DDYPGYNNKRGQGHRCDDPFHLGECSTFKFSINHR

Query:  A-------IKTSGVVQMNAIEMQRILPFQEIPISY-HCHPYPTAFE
        A       +++  V +  ++  + IL  Q  P+S+ H +   T FE
Subjt:  A-------IKTSGVVQMNAIEMQRILPFQEIPISY-HCHPYPTAFE

Q8VZ13 Uncharacterized protein At1g081602.6e-0625.5Show/hide
Query:  SKHRKPPILQQTRPR-QTNPIIWCFAILCLLFSLLL--IFFGIATLIIFLVVRPRNPLFDIPNASLSTIYF-DSPEYLNGDFTVLTNFTNPNHRVDVRYE
        ++  +P +  Q++PR Q  P      +LC++ +L+L  +  G+A LI +L +RP+  ++ +  AS+      ++ +++N  F+ +    NP   V VRY 
Subjt:  SKHRKPPILQQTRPR-QTNPIIWCFAILCLLFSLLL--IFFGIATLIIFLVVRPRNPLFDIPNASLSTIYF-DSPEYLNGDFTVLTNFTNPNHRVDVRYE

Query:  HADIELFFGDRLIATQAIQPFSQRRK-EVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLV
           I     ++ +A + I PF QR K E R+E   L+S  V L + +   LR +     +   +  T RV        Y  W+    +  + +  T V++
Subjt:  HADIELFFGDRLIATQAIQPFSQRRK-EVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLV

Q9FNH6 NDR1/HIN1-like protein 31.1e-0429.22Show/hide
Query:  ILCLLFSLLL---IFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGD-RLIATQAIQPFSQRR
        IL ++F++L+   +  GIA LII+L+ RP    F + +A L+    D    L  +  +     NPN R+ V Y+  ++  ++GD R   +  I  F Q  
Subjt:  ILCLLFSLLL---IFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGD-RLIATQAIQPFSQRR

Query:  KEVRLEPVHLI-SSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFR--VRASLGII
        K   +    L+   LV L       L   V  N  IY I    R  +R   G+I
Subjt:  KEVRLEPVHLI-SSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFR--VRASLGII

Q9SJ52 NDR1/HIN1-like protein 105.7e-0628.38Show/hide
Query:  CLLFSL-------LLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPE-YLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQ
        C L SL       L++  G+A LI +L+VRPR   F + +ASL+     SP+  L  +  +     NPN R+ + Y+  +   ++  +  +T  + PF Q
Subjt:  CLLFSL-------LLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPE-YLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQ

Query:  RRKEVR-LEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVR
          K    L P     +LV         L  + ++   +YNI   FR+R
Subjt:  RRKEVR-LEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVR

Q9ZVD2 NDR1/HIN1-like protein 138.3e-0522.53Show/hide
Query:  CFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYE-HADIELFFGDRLIATQAIQPFSQRR
        CF        +L++  GI+  +++L+ RP  P + I   S+S I  +S   ++  F V     N N ++ V YE  + +++++ D  I+   +  F Q  
Subjt:  CFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYE-HADIELFFGDRLIATQAIQPFSQRR

Query:  KEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIH-YSYWLHSRCQLV---MTSPPTGVLVARSC
        K V +  + L  S + L       +R +V    V + ++    V+   G +  ++  ++  C +    +T+P    +V+R C
Subjt:  KEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIH-YSYWLHSRCQLV---MTSPPTGVLVARSC

Arabidopsis top hitse value%identityAlignment
AT1G13050.1 unknown protein8.7e-3438.66Show/hide
Query:  QTRPRQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIA
        Q  P++T P+     I C +  ++LI  G+  L+++L  RPR+P FDI  A+L+T   D    LNGD  V+ NFTNP+ +  V + +   EL+F + LIA
Subjt:  QTRPRQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIA

Query:  TQAIQPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLG-IIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        T+ I+PF   +        HL+SS V +       L+ Q+    V+ N+RGTF  R++LG ++ YSYWLH++C + + +PP G + AR C TKR
Subjt:  TQAIQPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLG-IIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

AT1G13050.2 unknown protein1.1e-3139.66Show/hide
Query:  ILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQRRKEVR
        I C +  ++LI  G+  L+++L  RPR+P FDI  A+L+T   D    LNGD  V+ NFTNP+ +  V + +   EL+F + LIAT+ I+PF   +    
Subjt:  ILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQRRKEVR

Query:  LEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLG-IIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
            HL+SS V +       L+ Q+    V+ N+RGTF  R++LG ++ YSYWLH++C + + +PP G + AR C TKR
Subjt:  LEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLG-IIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

AT3G26350.1 LOCATED IN: chloroplast4.0e-3133.33Show/hide
Query:  KLAQMPPLSPP-----------PPPFSAPSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAEPTLQQIVVQN----RATKPHTPL---LNSNHI
        K  Q  P++PP             P   P         +P    P   +  +   P L S   + Q  P   Q   +N     +T P  P      +   
Subjt:  KLAQMPPLSPP-----------PPPFSAPSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAEPTLQQIVVQN----RATKPHTPL---LNSNHI

Query:  DSSKHRKPPILQQTRPRQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHA
         S  HR+ P L     R+TN + W  A  C +F ++LI  G+  LI++LV RPR+P  DI  A+L+  Y D    LNGD T+L N TNP+ +  V + + 
Subjt:  DSSKHRKPPILQQTRPRQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHA

Query:  DIELFFGDRLIATQAIQPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLG-IIHYSYWLHSRCQLVMTSPPTGVLVAR
          EL++ + LIATQ I+PF   +K      VHL+SS V L       L+RQ+    V+ N+RG F  R+ +G +  YSY LH+ C + +  PP G + AR
Subjt:  DIELFFGDRLIATQAIQPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLG-IIHYSYWLHSRCQLVMTSPPTGVLVAR

Query:  SCRTKR
         C TKR
Subjt:  SCRTKR

AT4G26490.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family9.2e-6850.35Show/hide
Query:  LAQMPPLSPPP--PPFSAPSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAEPTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPR
        +  +P L PPP   P   PS  +PS+    P TTP                  S+Q+ P  Q ++     TKP T  +  N +D+   +   IL+Q R  
Subjt:  LAQMPPLSPPP--PPFSAPSARSPSNAAAPPATTPSPASAVAHTSPALTSASPSSQAEPTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPR

Query:  QTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQ
        +T+  IWC A  C +FSLLLIFF IATLI+FL +RPR P+FDIPNA+L TIYFD+PE+ NGD ++L NFTNPN +++V++E   IELFF +RLIA Q +Q
Subjt:  QTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQ

Query:  PFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        PF Q++ E RLEP+ LISSLV LP N  + LRRQ+ NNK+ Y IRGTF+V+A  G+IHYSY LH RCQL MT PPTG+L++R+C TK+
Subjt:  PFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR

AT5G56050.1 FUNCTIONS IN: molecular_function unknown7.1e-6048.77Show/hide
Query:  QMPPLSPPPPPFSAPSARSPSNAAAPPATTPSPASAVAHTSP-ALTSASPSSQAEPTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQTN
        Q  P  P  PP+  PS++  S    P  TTP    +   T+P ALT    S    P   Q         P TP L+S  +++    +  +L Q R  +TN
Subjt:  QMPPLSPPPPPFSAPSARSPSNAAAPPATTPSPASAVAHTSP-ALTSASPSSQAEPTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQTN

Query:  PIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFS
        P IWC A LC +FS+LLI FGIATLI++L V+PR P+FDI NA L+TI F+SP Y NGD  +  NFTNPN +++VR+E+  +EL+F D  IATQ + PFS
Subjt:  PIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFDIPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFS

Query:  QRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR
        QR  + RLEP+ LIS+LV+LP N  L LRRQV +N++ Y IR  FRV+A  G+IHYSY LH  CQL ++SPP G LV R+C TKR
Subjt:  QRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRASLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAATTCTGCTTATTGTTTGGATGGGGAAACTGAGTTTTGCAGGATGCATAAGAGACTTGTGTATGAGTTCTTCAAGAATGCAAGCCAAATGCTGCGAGCCGATGA
CTACCCTGGATATAACAACAAGAGAGGGCAGGGCCATAGATGCGACGATCCTTTCCATTTGGGTGAGTGCTCTACTTTCAAATTTAGTATTAACCATAGAGCCATAAAGA
CATCAGGAGTAGTGCAGATGAATGCCATAGAAATGCAAAGAATTCTGCCATTTCAGGAGATCCCAATCTCATATCACTGCCATCCATATCCAACTGCATTTGAACCAAGT
TGTTCTCACATTTCTACACAGTTCCATCTGACCCGGTTCGAACCCGATTATTCTCAAAACATATCATACAAGCTCACTAACAATACTGCAGAAGCATATGCTAGTAGACA
TCCTACTGTTGAGGGATCTCTTATGAATGCAGTGGGTGTATTGCATAGTAGATTCATGGCAGGAATGCCCAAGAGAACGTCGAACGGCAACGTCTATCGGTCGAGCATTT
TGAGAGGATTCACATTCAAACTTGCTCAGATGCCTCCTCTATCTCCACCACCACCACCTTTCTCAGCTCCATCAGCAAGGTCACCGTCCAACGCTGCTGCTCCACCGGCA
ACCACACCGTCCCCTGCCTCAGCAGTCGCTCATACATCCCCTGCTCTGACCTCTGCTTCACCCTCAAGTCAAGCAGAACCGACTCTGCAGCAAATAGTTGTACAAAATCG
AGCTACCAAACCACACACTCCACTGCTGAATTCAAATCATATAGATTCCAGTAAGCACAGAAAACCGCCAATACTACAGCAAACTCGGCCACGCCAAACGAATCCAATTA
TATGGTGTTTTGCAATCCTCTGCCTGCTTTTTAGTCTTCTCCTCATCTTCTTTGGAATTGCAACTTTAATCATTTTCCTTGTGGTTAGACCAAGAAACCCTCTGTTTGAC
ATACCAAATGCAAGCCTCAGCACCATCTACTTCGATTCACCTGAATATCTCAACGGCGACTTCACCGTTCTCACAAATTTCACCAACCCAAATCACAGAGTCGATGTAAG
GTACGAGCATGCAGACATAGAACTCTTTTTCGGGGATAGACTCATAGCAACTCAAGCAATCCAGCCTTTCAGTCAAAGAAGAAAAGAAGTGAGGTTAGAACCTGTTCACT
TGATATCCAGTCTGGTTTACTTGCCACAGAATTCCGGGTTGGTACTTCGAAGGCAGGTACTGAACAACAAGGTTATCTATAACATCAGAGGAACTTTCAGAGTTAGAGCT
TCACTGGGTATCATCCATTACTCCTACTGGCTGCATAGCCGATGCCAGTTGGTGATGACTAGTCCACCAACTGGTGTTTTAGTTGCTCGGAGTTGCAGAACCAAGAGGTG
A
mRNA sequenceShow/hide mRNA sequence
ATGGCTAATTCTGCTTATTGTTTGGATGGGGAAACTGAGTTTTGCAGGATGCATAAGAGACTTGTGTATGAGTTCTTCAAGAATGCAAGCCAAATGCTGCGAGCCGATGA
CTACCCTGGATATAACAACAAGAGAGGGCAGGGCCATAGATGCGACGATCCTTTCCATTTGGGTGAGTGCTCTACTTTCAAATTTAGTATTAACCATAGAGCCATAAAGA
CATCAGGAGTAGTGCAGATGAATGCCATAGAAATGCAAAGAATTCTGCCATTTCAGGAGATCCCAATCTCATATCACTGCCATCCATATCCAACTGCATTTGAACCAAGT
TGTTCTCACATTTCTACACAGTTCCATCTGACCCGGTTCGAACCCGATTATTCTCAAAACATATCATACAAGCTCACTAACAATACTGCAGAAGCATATGCTAGTAGACA
TCCTACTGTTGAGGGATCTCTTATGAATGCAGTGGGTGTATTGCATAGTAGATTCATGGCAGGAATGCCCAAGAGAACGTCGAACGGCAACGTCTATCGGTCGAGCATTT
TGAGAGGATTCACATTCAAACTTGCTCAGATGCCTCCTCTATCTCCACCACCACCACCTTTCTCAGCTCCATCAGCAAGGTCACCGTCCAACGCTGCTGCTCCACCGGCA
ACCACACCGTCCCCTGCCTCAGCAGTCGCTCATACATCCCCTGCTCTGACCTCTGCTTCACCCTCAAGTCAAGCAGAACCGACTCTGCAGCAAATAGTTGTACAAAATCG
AGCTACCAAACCACACACTCCACTGCTGAATTCAAATCATATAGATTCCAGTAAGCACAGAAAACCGCCAATACTACAGCAAACTCGGCCACGCCAAACGAATCCAATTA
TATGGTGTTTTGCAATCCTCTGCCTGCTTTTTAGTCTTCTCCTCATCTTCTTTGGAATTGCAACTTTAATCATTTTCCTTGTGGTTAGACCAAGAAACCCTCTGTTTGAC
ATACCAAATGCAAGCCTCAGCACCATCTACTTCGATTCACCTGAATATCTCAACGGCGACTTCACCGTTCTCACAAATTTCACCAACCCAAATCACAGAGTCGATGTAAG
GTACGAGCATGCAGACATAGAACTCTTTTTCGGGGATAGACTCATAGCAACTCAAGCAATCCAGCCTTTCAGTCAAAGAAGAAAAGAAGTGAGGTTAGAACCTGTTCACT
TGATATCCAGTCTGGTTTACTTGCCACAGAATTCCGGGTTGGTACTTCGAAGGCAGGTACTGAACAACAAGGTTATCTATAACATCAGAGGAACTTTCAGAGTTAGAGCT
TCACTGGGTATCATCCATTACTCCTACTGGCTGCATAGCCGATGCCAGTTGGTGATGACTAGTCCACCAACTGGTGTTTTAGTTGCTCGGAGTTGCAGAACCAAGAGGTG
A
Protein sequenceShow/hide protein sequence
MANSAYCLDGETEFCRMHKRLVYEFFKNASQMLRADDYPGYNNKRGQGHRCDDPFHLGECSTFKFSINHRAIKTSGVVQMNAIEMQRILPFQEIPISYHCHPYPTAFEPS
CSHISTQFHLTRFEPDYSQNISYKLTNNTAEAYASRHPTVEGSLMNAVGVLHSRFMAGMPKRTSNGNVYRSSILRGFTFKLAQMPPLSPPPPPFSAPSARSPSNAAAPPA
TTPSPASAVAHTSPALTSASPSSQAEPTLQQIVVQNRATKPHTPLLNSNHIDSSKHRKPPILQQTRPRQTNPIIWCFAILCLLFSLLLIFFGIATLIIFLVVRPRNPLFD
IPNASLSTIYFDSPEYLNGDFTVLTNFTNPNHRVDVRYEHADIELFFGDRLIATQAIQPFSQRRKEVRLEPVHLISSLVYLPQNSGLVLRRQVLNNKVIYNIRGTFRVRA
SLGIIHYSYWLHSRCQLVMTSPPTGVLVARSCRTKR