; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011814 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011814
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein MIZU-KUSSEI 1
Genome locationchr1:33259083..33267072
RNA-Seq ExpressionLag0011814
SyntenyLag0011814
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0010274 - hydrotropism (biological process)
GO:0016020 - membrane (cellular component)
GO:0098827 - endoplasmic reticulum subcompartment (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR006460 - Protein MIZU-KUSSEI 1-like, plant
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027634.1 Protein MIZU-KUSSEI 1, partial [Cucurbita argyrosperma subsp. argyrosperma]7.7e-10981.06Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT
        MTGLQRT+SCSAAIKTTT IIPSSISA HHLRSSS         DADNFS +FL++KN  H  HR  H      GKFS +LHSLLKLLS+PTCKWLSIPT
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT

Query:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR
        HLSVSPSLGRRVTGTLFGRRRGHVTFA+QLDPRS+PVLLLE ATSTSSL+KEMSSGLVRIALES+AP T DG    R+L EEPRWTMYCNGRKCG A SR
Subjt:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR

Query:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
        TCGEFEW+VLNTVQSVSVGAGVIP+VDDGRKG GSEGELLYMRAKFERV+GSRDSEAFYMMNPD
Subjt:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

XP_022924911.1 protein MIZU-KUSSEI 1 [Cucurbita moschata]4.5e-10981.06Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT
        MTGLQRT+SCSAAI+TTT IIPSSISA HHLRSSS         DADNFS +FL++KN  H  HR  H      GKFSS+LHSLLKLLS+PTCKWLSIPT
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT

Query:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR
        HLSVSPSLGRRVTGTLFGRRRGHVTFA+QLDPRS+PVLLLE ATSTSSL+KEMSSGLVRIALES+AP T DG    R+L EEPRWTMYCNGRKCG A SR
Subjt:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR

Query:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
        TCGEFEW+VLNTVQSVSVGAGVIP+VDDGRKG GSEGELLYMRAKFERV+GSRDSEAFYMMNPD
Subjt:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

XP_022966431.1 protein MIZU-KUSSEI 1-like [Cucurbita maxima]3.6e-10680.3Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT
        MTGLQRT+SCSAAIKTTT IIPSSIS  HHLRSSS         DADNFS++FL++KN  H  HR  H      GKFSS+LHSLLKL+S+PTCKWLSIPT
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT

Query:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR
        HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLE ATSTSSL+KEMSSGLVRIALES+A  T DG    R+L EEPRWTMYCNGRKCG A SR
Subjt:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR

Query:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
        TCGEFEW+VLNTVQSVSVGAGVIP+VDDG  G  SEGELLYMRAKFERVVGSRDSEAFYMMNPD
Subjt:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

XP_023518638.1 protein MIZU-KUSSEI 1-like [Cucurbita pepo subsp. pepo]8.6e-10881.06Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT
        MTGLQRTRSCSAAIKTTT IIPSSISA HHLRSSS         DADNFS +FL++KN  H  HR  H      GKFSS+LHSLLKLLS+PTCKWL IPT
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT

Query:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR
        HLSVSPSLGRRVTGTLFGRRRGHVTFA+QLDPRS+PVLLLE ATSTSSL+KEMSSGLVRIALES++P T DG    R+L EEPRWTMY NGRKCG A SR
Subjt:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR

Query:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
        TCGEFEW+VLNTVQSVSVGAGVIP+VDDGRKG GSEGELLYMRAKFERVVGSRDSEAFYMMNPD
Subjt:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

XP_038877466.1 protein MIZU-KUSSEI 1-like [Benincasa hispida]5.2e-11381.65Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSIS----AHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWL
        MTGLQRTRSC AAIKT+T +IPSSIS     HHHLRS SSSN+PPEY++ DNFS RFL+ KN GHH    HHH   A  KFS++LHSLLKLLSVPTCKW 
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSIS----AHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWL

Query:  SIPTHLSVSPSLGRR-VTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHA
        SIPTHLSVS SLGRR VTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTS LVKEMSSGLVRIALESEAPA ++   R+LFEEPRWTMYCNGRKCGH+
Subjt:  SIPTHLSVSPSLGRR-VTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHA

Query:  TSRTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
        T R CGEFEW+VLNTVQ VSVGAGVIPVVDDGRKG  SEGELLYMRA +ERVVGSRDSEAFYMMNPD
Subjt:  TSRTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

TrEMBL top hitse value%identityAlignment
A0A1S3CNY4 protein MIZU-KUSSEI 1-like5.4e-10076.32Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHH--LRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSI
        MT LQRTRS SAAIK TT +IPSSIS H H   RS SSSN+PPEY++ +NFS         GHH HR   H      KFS +LHSLLKLLSVPTCKWLSI
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHH--LRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSI

Query:  PTHLSVSPSLGRR-VTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATS
        PTHLSV PSLGRR VTGTLFGRRRGHVTFAVQ+ P+SEPVLLLELATSTSSLVKEMSSGLVRIALESEAPA +DG  ++LFEEPRWTMYCNGRKCGH+  
Subjt:  PTHLSVSPSLGRR-VTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATS

Query:  RTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPDA
        RTCGEF+ +VLN VQSVSVGAGVIPVV++G K   SEGELLYMRAKFERVVGSRDSEAFYM+NPD+
Subjt:  RTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPDA

A0A5A7SPQ9 Protein MIZU-KUSSEI 1-like5.4e-10076.32Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHH--LRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSI
        MT LQRTRS SAAIK TT +IPSSIS H H   RS SSSN+PPEY++ +NFS         GHH HR   H      KFS +LHSLLKLLSVPTCKWLSI
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHH--LRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSI

Query:  PTHLSVSPSLGRR-VTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATS
        PTHLSV PSLGRR VTGTLFGRRRGHVTFAVQ+ P+SEPVLLLELATSTSSLVKEMSSGLVRIALESEAPA +DG  ++LFEEPRWTMYCNGRKCGH+  
Subjt:  PTHLSVSPSLGRR-VTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATS

Query:  RTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPDA
        RTCGEF+ +VLN VQSVSVGAGVIPVV++G K   SEGELLYMRAKFERVVGSRDSEAFYM+NPD+
Subjt:  RTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPDA

A0A6J1DFH1 protein MIZU-KUSSEI 16.8e-10376.67Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIP-SSISAHH----HLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKW
        M+ LQRTRSCSAAIKTTT I P +S+S HH    H+RSSSSS         D FS   L++K  G       HHHHP AGKFSS+ HSLL+LLS+PTCKW
Subjt:  MTGLQRTRSCSAAIKTTTNIIP-SSISAHH----HLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKW

Query:  LSIPTHLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHA
        LSIP HLS +PSL R+VTGTLFG RRGHVTFAVQL+PRSEPV LLELA ST+SLVKEMSSGLVRIALESEAPA +D G RRLFEEPRWTMYCNGRKCGHA
Subjt:  LSIPTHLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHA

Query:  TSRTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGG---GSEGELLYMRAKFERVVGSRDSEAFYMMNPD
        TSRTCG+FEWHVLNTVQSVSVGAGVIPVVDDGRKGG   GS+GELLYMRA+FERVVGS DSEAFYMMNPD
Subjt:  TSRTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGG---GSEGELLYMRAKFERVVGSRDSEAFYMMNPD

A0A6J1EAS0 protein MIZU-KUSSEI 12.2e-10981.06Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT
        MTGLQRT+SCSAAI+TTT IIPSSISA HHLRSSS         DADNFS +FL++KN  H  HR  H      GKFSS+LHSLLKLLS+PTCKWLSIPT
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT

Query:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR
        HLSVSPSLGRRVTGTLFGRRRGHVTFA+QLDPRS+PVLLLE ATSTSSL+KEMSSGLVRIALES+AP T DG    R+L EEPRWTMYCNGRKCG A SR
Subjt:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR

Query:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
        TCGEFEW+VLNTVQSVSVGAGVIP+VDDGRKG GSEGELLYMRAKFERV+GSRDSEAFYMMNPD
Subjt:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

A0A6J1HPB2 protein MIZU-KUSSEI 1-like1.7e-10680.3Show/hide
Query:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT
        MTGLQRT+SCSAAIKTTT IIPSSIS  HHLRSSS         DADNFS++FL++KN  H  HR  H      GKFSS+LHSLLKL+S+PTCKWLSIPT
Subjt:  MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPT

Query:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR
        HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLE ATSTSSL+KEMSSGLVRIALES+A  T DG    R+L EEPRWTMYCNGRKCG A SR
Subjt:  HLSVSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGS--RRLFEEPRWTMYCNGRKCGHATSR

Query:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
        TCGEFEW+VLNTVQSVSVGAGVIP+VDDG  G  SEGELLYMRAKFERVVGSRDSEAFYMMNPD
Subjt:  TCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

SwissProt top hitse value%identityAlignment
O22227 Protein MIZU-KUSSEI 11.9e-5451.54Show/hide
Query:  HLRSSSSSNIPPEYTDADNF-----SSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPTHLSV---------------SPSLG
        H+RS SSS + P   + + F        ++   +     H     H     KFS +L S + ++++P CK LS+P+  S                S SLG
Subjt:  HLRSSSSSNIPPEYTDADNF-----SSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPTHLSV---------------SPSLG

Query:  RRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSR--TCGEFEWHV
        RRVTGTL+G +RGHVTF+VQ + RS+PVLLL+LA ST++LVKEMSSGLVRIALE E          +LF+EP+WTMYCNGRKCG+A SR   C + +W V
Subjt:  RRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSR--TCGEFEWHV

Query:  LNTVQSVSVGAGVIP---VVDD-GRKGGGSE-GELLYMRAKFERVVGSRDSEAFYMMNPD
        LNTV  V+VGAGVIP    +DD    G G+E GELLYMR KFERVVGSRDSEAFYMMNPD
Subjt:  LNTVQSVSVGAGVIP---VVDD-GRKGGGSE-GELLYMRAKFERVVGSRDSEAFYMMNPD

Arabidopsis top hitse value%identityAlignment
AT2G41660.1 Protein of unknown function, DUF6171.4e-5551.54Show/hide
Query:  HLRSSSSSNIPPEYTDADNF-----SSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPTHLSV---------------SPSLG
        H+RS SSS + P   + + F        ++   +     H     H     KFS +L S + ++++P CK LS+P+  S                S SLG
Subjt:  HLRSSSSSNIPPEYTDADNF-----SSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPTHLSV---------------SPSLG

Query:  RRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSR--TCGEFEWHV
        RRVTGTL+G +RGHVTF+VQ + RS+PVLLL+LA ST++LVKEMSSGLVRIALE E          +LF+EP+WTMYCNGRKCG+A SR   C + +W V
Subjt:  RRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSR--TCGEFEWHV

Query:  LNTVQSVSVGAGVIP---VVDD-GRKGGGSE-GELLYMRAKFERVVGSRDSEAFYMMNPD
        LNTV  V+VGAGVIP    +DD    G G+E GELLYMR KFERVVGSRDSEAFYMMNPD
Subjt:  LNTVQSVSVGAGVIP---VVDD-GRKGGGSE-GELLYMRAKFERVVGSRDSEAFYMMNPD

AT3G25640.1 Protein of unknown function, DUF6171.2e-4658.68Show/hide
Query:  LGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSRTCGEFEWHV
        LG RV GTLFG RRGHV FAVQ DP   P +L++L T TS LV+EM+SGLVRIALE+ A  TD    ++L EE  W  YCNG+KCG+A  + CGE EW V
Subjt:  LGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSRTCGEFEWHV

Query:  LNTVQSVSVGAGVIP----VVDDGRKG--GGSEGELLYMRAKFERVVGSRDSEAFYMMNPDAAPTTP
        L  V  +++GAGV+P     VD+   G  G  +GEL+YMRA+FERVVGSRDSEAFYMMNPD +   P
Subjt:  LNTVQSVSVGAGVIP----VVDDGRKG--GGSEGELLYMRAKFERVVGSRDSEAFYMMNPDAAPTTP

AT4G39610.1 Protein of unknown function, DUF6176.7e-3438Show/hide
Query:  SSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLL-----SVPTCKWLSIPTHLSV-SPSLG--------RRVTGTLFG
        SSSSS+IPP        S+    R++P            P + K      ++ ++L     S P     S+   + V  P LG         R+TGTLFG
Subjt:  SSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLL-----SVPTCKWLSIPTHLSV-SPSLG--------RRVTGTLFG

Query:  RRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESE-APATDDGGSRR-----LFEEPRWTMYCNGRKCGHATSRTCGEFEWHVLNTVQ
         R+G V+ ++Q +P+  P L++ELA  T++L KE+S+G+VRIALE+E  P  D+  S+      + EEP WTMYC G K G+   R   E + +V+  ++
Subjt:  RRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESE-APATDDGGSRR-----LFEEPRWTMYCNGRKCGHATSRTCGEFEWHVLNTVQ

Query:  SVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
         VS+GAGV+P      +  G +GE+ YMRA FERV+GS+DSE FYM++P+
Subjt:  SVSVGAGVIPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

AT5G06990.1 Protein of unknown function, DUF6171.9e-3650.31Show/hide
Query:  GRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSRTCGEFEWHVL
        G RVTGTLFG R+  V  AVQ +PRS P+LLLELA  T  L++++  GLVRIALE E   ++     ++ +EP W +YCNG+K G+   R   E +  V+
Subjt:  GRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSRTCGEFEWHVL

Query:  NTVQSVSVGAGVIPV-------VDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD
          + +VS+GAGV+PV          G  GG  EG+L YMRA FERV+GSRDSE +YMMNPD
Subjt:  NTVQSVSVGAGVIPV-------VDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPD

AT5G23100.1 Protein of unknown function, DUF6174.8e-4847.83Show/hide
Query:  HHHHHHPAAGKFSS---ILHSLLKLLSVPTCKWLSIPTHLS------VSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSG
        HH++  P++   SS   ++    KL S    +  S+   LS       +  LG RV GTLFG RRGHV F++Q DP S P  L+ELAT  S LVKEM+SG
Subjt:  HHHHHHPAAGKFSS---ILHSLLKLLSVPTCKWLSIPTHLS------VSPSLGRRVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSG

Query:  LVRIALESEAPATDDGG-----------------------SRRLFEEPRWTMYCNGRKCGHATSRTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGS
        LVRIALE +    ++ G                       SRRL EEP W  YCNG+KCG AT R CGE E  VL  ++ VS+GAGV+P  ++   GGG 
Subjt:  LVRIALESEAPATDDGG-----------------------SRRLFEEPRWTMYCNGRKCGHATSRTCGEFEWHVLNTVQSVSVGAGVIPVVDDGRKGGGS

Query:  EGELLYMRAKFERVVGSRDSEAFYMMNPDA
         G+++YMRAKFER+VGSRDSEAFYMMNPD+
Subjt:  EGELLYMRAKFERVVGSRDSEAFYMMNPDA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGGCCTGCAAAGAACCAGAAGTTGCAGCGCCGCCATTAAAACCACCACCAACATCATCCCATCTTCCATCTCCGCCCACCACCACCTCAGATCCTCTTCTTCCTC
CAACATCCCGCCGGAATATACCGACGCCGATAATTTTTCCAGCAGATTCCTCATCCGGAAGAACCCCGGCCACCACCGCCACCGCCACCACCACCACCACCACCCGGCTG
CCGGAAAATTCTCCTCCATCCTCCATTCCCTGCTCAAGCTTCTCTCCGTTCCCACCTGCAAATGGCTCTCCATCCCCACCCACCTCTCCGTCAGCCCCTCTCTTGGGCGG
AGAGTCACCGGAACTTTGTTTGGCCGCCGCCGAGGCCATGTCACCTTCGCCGTCCAACTTGACCCCCGGTCCGAGCCGGTTTTGTTGCTGGAGTTGGCCACGTCGACCTC
GTCCCTCGTCAAGGAAATGTCCTCCGGTCTGGTTCGTATCGCCCTCGAGTCTGAGGCTCCGGCCACCGATGATGGTGGTTCAAGGAGGTTGTTTGAGGAGCCGAGATGGA
CGATGTACTGCAACGGGAGAAAGTGTGGACACGCGACGTCGCGCACGTGTGGAGAGTTTGAGTGGCACGTGCTGAACACCGTGCAGAGCGTGTCGGTCGGCGCCGGAGTG
ATTCCGGTGGTCGACGACGGCCGGAAAGGCGGCGGGTCGGAGGGCGAGTTACTGTACATGAGAGCAAAGTTTGAGCGAGTCGTAGGTAGCCGTGACTCGGAAGCTTTTTA
CATGATGAATCCAGATGCAGCTCCGACGACCCCAACGGCTCCGGCGTTTCCTTCTCTCTCGTTCGACTCCGGCGTGTCCAGCAACGTAACTCCGCGGCGTCTTCATCCCC
GTTCTAGCAGCTCCAACTCCACGCCGTCTTCATCCCACGGATTTTTCAGGCCGTTTCCAGCAGCTCCGAAGACTCCACCAGTCAAGAAGAGATTGACTGAAACCCAATTA
GATATGTTTAGGCAAACTATATTTGGCCCTATTTTAGACAGCAACATATTGTTTAATGGTCAGTTAATCCACCATCTACTACTTAGGGAGGTTGAGGATCCCAGGAAGGA
TGTAATTAGTTTCGATATATTTGGAAATAAGGTGTCGTTTGGCAAGGAAGAATTCGATCTAATCACCGGATTTAGACACAATAGGAGGATAGTTGATAGACATGAGTCGG
GGGTTAGATTGAGGCGTCTGTACTTTAATGACAGTGTCAAAGATACCGTAATGGATGCTGAAAAAAGATTCTTAGACATACAGTTTCAGTCAGATGAAGATGCGGTGAAG
GTAGCGCTCGCATATTTTATCGAGCTAGCAATGTTTGGGCGGGAGAGGAAACAAAAATTCAATTGGTCTTTATTGGGTATCGTGGACGATTGGGAGATATTCTGCAATTA
TGACTGGAGCAAAGTAATTTTTGAGATGACGATAAGGAGTTTGAAGAAAGCACTCAGTCATGCCACCCAAAGAGACGTTGTGGCCGGAGAGGCTAGTCGATTGGAAAGAT
ATAGTCTTTACGGCTTTCCACATGCTTTTCAGGTATGGGCGTATGAGACTATTTCGTCTCTAACGAACCGTGTTGCGAACCGGATGAACCAGGATGCGATCCCACGGTTT
TCTCGGTGGTCATGCTCTCATTCTCCTACGTACACCCAACTTAGCAGTGAGATATTTGGCTTGACGGAGGCAAGGGTGACAGTGCAATTGGTTCCAAGCGAAGCAGAGCT
CGAACATATGCGTCGTATTGTTTTGCCGCCACAACTACAGGCCCCTGTTTTGCCGCCACAACTAGAGGCCCCTGTTTCGCCGCCACATCCAGAGGCCCCTGTTTTGTCGC
CACAACCAGATGCAAACCTAGATGATCCTGTGGGGAGTGATAGAGGGTCAGAGGAGGCTGGTTTGGATATGAGTTCACCGAAAAAGGATGTAGAAATGGTTAGGCTCGAT
GAACAATCGACACACGACGGTCTACCTGAAGGCGTGGGCAAGACCTGCCAATGTGACTGCAAGCAAGCATACGAGTCACTAGACCGACGGATGAAGGTGGTGGAGTCCGA
TGTAAAAGAGATGAAATCTGACTTAAAGTCGATCAAGAAGTATTTGCGCCGGTTATCTAAGGGTCAAATGGTGGTTGATCCTACCAAGTATTTGGGTCCCGACCGTAGTG
CAGCTGCATCAGGTGATGAACCATCCGATAAAGGAAAGAACCATGTCGTGGAGGAGGGGGGTGGTGGGGTTTCAATAGATGCGATGGTAGAGCACCATGATATGGACAAG
GGTGTTGAATCAGACTCCCATGAGGTTGAAGAGATCCCGAAACCTGGAGAAATGGTGAAGCGTCGGGGAGATCGGAAAAGAACTCTTTCTTGGAAACTTCGAACTCCGTG
GAAGGATACGAGGGAAGGGGCCAAAAAACAAAAGGTCATGCCATACAACCCCTTAGTTGAGATTCCTGGGAAGCTTGATAGACGTTTCCAAAAGTGGTTGGACGACACGG
AGGTGGACAATGCTCCAAGGAAGACGACATATGCTTTTAGGGACAAAGTGTGGTTTCAAAACCTTTTGAAACCCTGCTATTGGATGAGCGATGAGGTCATTGACTCACTT
TTTATGTTCGTCCGGAAGAAAATGCAACAGCGGGCAGACTTATGTCGTTGGAAGTTTGTCACTGCAGATATTGTTGTTACCGATTTTCTGAGGCGTAGCGACGACATAGC
TGAAGAGTTGAAGAAGGTGCAAGATCCTTCGTTGATTACGTACGACTGGAGTACGGCCAATACTGTGATAGACTACGTTTTGGGTCGACACTCGGACCACGATACACATT
GGATGGGAAAATTGACTGTCCTCGATTCATTCATAGCGTTGACATCAGATGCAACCTTGAAGAAAGAGTTGAGCACTCTAGCCACAGTATTGCCAGTGCTACTGTTCAAG
TGCGATGTCATGAAAGCGAAGCCACATCTCCCAGTTCACGAATGGGAAATACATAGAGATAGTTCAGTGCCTCAACAAACGAACGGTGGGGATTGTGGTATGTTCGCGGT
AAAGTTTTTTGAATATGATGTTACTGGAAGTGAAATAAACACTCTGAATCAAGATAGGATTAATTTTTGTAGACGTCAATTTGCTGTTCAAATTTGGGCCAACAGGCCGA
TATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAGGCCTGCAAAGAACCAGAAGTTGCAGCGCCGCCATTAAAACCACCACCAACATCATCCCATCTTCCATCTCCGCCCACCACCACCTCAGATCCTCTTCTTCCTC
CAACATCCCGCCGGAATATACCGACGCCGATAATTTTTCCAGCAGATTCCTCATCCGGAAGAACCCCGGCCACCACCGCCACCGCCACCACCACCACCACCACCCGGCTG
CCGGAAAATTCTCCTCCATCCTCCATTCCCTGCTCAAGCTTCTCTCCGTTCCCACCTGCAAATGGCTCTCCATCCCCACCCACCTCTCCGTCAGCCCCTCTCTTGGGCGG
AGAGTCACCGGAACTTTGTTTGGCCGCCGCCGAGGCCATGTCACCTTCGCCGTCCAACTTGACCCCCGGTCCGAGCCGGTTTTGTTGCTGGAGTTGGCCACGTCGACCTC
GTCCCTCGTCAAGGAAATGTCCTCCGGTCTGGTTCGTATCGCCCTCGAGTCTGAGGCTCCGGCCACCGATGATGGTGGTTCAAGGAGGTTGTTTGAGGAGCCGAGATGGA
CGATGTACTGCAACGGGAGAAAGTGTGGACACGCGACGTCGCGCACGTGTGGAGAGTTTGAGTGGCACGTGCTGAACACCGTGCAGAGCGTGTCGGTCGGCGCCGGAGTG
ATTCCGGTGGTCGACGACGGCCGGAAAGGCGGCGGGTCGGAGGGCGAGTTACTGTACATGAGAGCAAAGTTTGAGCGAGTCGTAGGTAGCCGTGACTCGGAAGCTTTTTA
CATGATGAATCCAGATGCAGCTCCGACGACCCCAACGGCTCCGGCGTTTCCTTCTCTCTCGTTCGACTCCGGCGTGTCCAGCAACGTAACTCCGCGGCGTCTTCATCCCC
GTTCTAGCAGCTCCAACTCCACGCCGTCTTCATCCCACGGATTTTTCAGGCCGTTTCCAGCAGCTCCGAAGACTCCACCAGTCAAGAAGAGATTGACTGAAACCCAATTA
GATATGTTTAGGCAAACTATATTTGGCCCTATTTTAGACAGCAACATATTGTTTAATGGTCAGTTAATCCACCATCTACTACTTAGGGAGGTTGAGGATCCCAGGAAGGA
TGTAATTAGTTTCGATATATTTGGAAATAAGGTGTCGTTTGGCAAGGAAGAATTCGATCTAATCACCGGATTTAGACACAATAGGAGGATAGTTGATAGACATGAGTCGG
GGGTTAGATTGAGGCGTCTGTACTTTAATGACAGTGTCAAAGATACCGTAATGGATGCTGAAAAAAGATTCTTAGACATACAGTTTCAGTCAGATGAAGATGCGGTGAAG
GTAGCGCTCGCATATTTTATCGAGCTAGCAATGTTTGGGCGGGAGAGGAAACAAAAATTCAATTGGTCTTTATTGGGTATCGTGGACGATTGGGAGATATTCTGCAATTA
TGACTGGAGCAAAGTAATTTTTGAGATGACGATAAGGAGTTTGAAGAAAGCACTCAGTCATGCCACCCAAAGAGACGTTGTGGCCGGAGAGGCTAGTCGATTGGAAAGAT
ATAGTCTTTACGGCTTTCCACATGCTTTTCAGGTATGGGCGTATGAGACTATTTCGTCTCTAACGAACCGTGTTGCGAACCGGATGAACCAGGATGCGATCCCACGGTTT
TCTCGGTGGTCATGCTCTCATTCTCCTACGTACACCCAACTTAGCAGTGAGATATTTGGCTTGACGGAGGCAAGGGTGACAGTGCAATTGGTTCCAAGCGAAGCAGAGCT
CGAACATATGCGTCGTATTGTTTTGCCGCCACAACTACAGGCCCCTGTTTTGCCGCCACAACTAGAGGCCCCTGTTTCGCCGCCACATCCAGAGGCCCCTGTTTTGTCGC
CACAACCAGATGCAAACCTAGATGATCCTGTGGGGAGTGATAGAGGGTCAGAGGAGGCTGGTTTGGATATGAGTTCACCGAAAAAGGATGTAGAAATGGTTAGGCTCGAT
GAACAATCGACACACGACGGTCTACCTGAAGGCGTGGGCAAGACCTGCCAATGTGACTGCAAGCAAGCATACGAGTCACTAGACCGACGGATGAAGGTGGTGGAGTCCGA
TGTAAAAGAGATGAAATCTGACTTAAAGTCGATCAAGAAGTATTTGCGCCGGTTATCTAAGGGTCAAATGGTGGTTGATCCTACCAAGTATTTGGGTCCCGACCGTAGTG
CAGCTGCATCAGGTGATGAACCATCCGATAAAGGAAAGAACCATGTCGTGGAGGAGGGGGGTGGTGGGGTTTCAATAGATGCGATGGTAGAGCACCATGATATGGACAAG
GGTGTTGAATCAGACTCCCATGAGGTTGAAGAGATCCCGAAACCTGGAGAAATGGTGAAGCGTCGGGGAGATCGGAAAAGAACTCTTTCTTGGAAACTTCGAACTCCGTG
GAAGGATACGAGGGAAGGGGCCAAAAAACAAAAGGTCATGCCATACAACCCCTTAGTTGAGATTCCTGGGAAGCTTGATAGACGTTTCCAAAAGTGGTTGGACGACACGG
AGGTGGACAATGCTCCAAGGAAGACGACATATGCTTTTAGGGACAAAGTGTGGTTTCAAAACCTTTTGAAACCCTGCTATTGGATGAGCGATGAGGTCATTGACTCACTT
TTTATGTTCGTCCGGAAGAAAATGCAACAGCGGGCAGACTTATGTCGTTGGAAGTTTGTCACTGCAGATATTGTTGTTACCGATTTTCTGAGGCGTAGCGACGACATAGC
TGAAGAGTTGAAGAAGGTGCAAGATCCTTCGTTGATTACGTACGACTGGAGTACGGCCAATACTGTGATAGACTACGTTTTGGGTCGACACTCGGACCACGATACACATT
GGATGGGAAAATTGACTGTCCTCGATTCATTCATAGCGTTGACATCAGATGCAACCTTGAAGAAAGAGTTGAGCACTCTAGCCACAGTATTGCCAGTGCTACTGTTCAAG
TGCGATGTCATGAAAGCGAAGCCACATCTCCCAGTTCACGAATGGGAAATACATAGAGATAGTTCAGTGCCTCAACAAACGAACGGTGGGGATTGTGGTATGTTCGCGGT
AAAGTTTTTTGAATATGATGTTACTGGAAGTGAAATAAACACTCTGAATCAAGATAGGATTAATTTTTGTAGACGTCAATTTGCTGTTCAAATTTGGGCCAACAGGCCGA
TATTTTAG
Protein sequenceShow/hide protein sequence
MTGLQRTRSCSAAIKTTTNIIPSSISAHHHLRSSSSSNIPPEYTDADNFSSRFLIRKNPGHHRHRHHHHHHPAAGKFSSILHSLLKLLSVPTCKWLSIPTHLSVSPSLGR
RVTGTLFGRRRGHVTFAVQLDPRSEPVLLLELATSTSSLVKEMSSGLVRIALESEAPATDDGGSRRLFEEPRWTMYCNGRKCGHATSRTCGEFEWHVLNTVQSVSVGAGV
IPVVDDGRKGGGSEGELLYMRAKFERVVGSRDSEAFYMMNPDAAPTTPTAPAFPSLSFDSGVSSNVTPRRLHPRSSSSNSTPSSSHGFFRPFPAAPKTPPVKKRLTETQL
DMFRQTIFGPILDSNILFNGQLIHHLLLREVEDPRKDVISFDIFGNKVSFGKEEFDLITGFRHNRRIVDRHESGVRLRRLYFNDSVKDTVMDAEKRFLDIQFQSDEDAVK
VALAYFIELAMFGRERKQKFNWSLLGIVDDWEIFCNYDWSKVIFEMTIRSLKKALSHATQRDVVAGEASRLERYSLYGFPHAFQVWAYETISSLTNRVANRMNQDAIPRF
SRWSCSHSPTYTQLSSEIFGLTEARVTVQLVPSEAELEHMRRIVLPPQLQAPVLPPQLEAPVSPPHPEAPVLSPQPDANLDDPVGSDRGSEEAGLDMSSPKKDVEMVRLD
EQSTHDGLPEGVGKTCQCDCKQAYESLDRRMKVVESDVKEMKSDLKSIKKYLRRLSKGQMVVDPTKYLGPDRSAAASGDEPSDKGKNHVVEEGGGGVSIDAMVEHHDMDK
GVESDSHEVEEIPKPGEMVKRRGDRKRTLSWKLRTPWKDTREGAKKQKVMPYNPLVEIPGKLDRRFQKWLDDTEVDNAPRKTTYAFRDKVWFQNLLKPCYWMSDEVIDSL
FMFVRKKMQQRADLCRWKFVTADIVVTDFLRRSDDIAEELKKVQDPSLITYDWSTANTVIDYVLGRHSDHDTHWMGKLTVLDSFIALTSDATLKKELSTLATVLPVLLFK
CDVMKAKPHLPVHEWEIHRDSSVPQQTNGGDCGMFAVKFFEYDVTGSEINTLNQDRINFCRRQFAVQIWANRPIF