; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019103 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019103
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHaloacid dehalogenase-like hydrolase domain-containing protein
Genome locationtig00153285:494049..505356
RNA-Seq ExpressionSgr019103
SyntenySgr019103
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR023198 - Phosphoglycolate phosphatase-like, domain 2
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily
IPR044999 - Protein CbbY-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581504.1 CBBY-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.6e-16185.75Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA
        MEITSCS LYT P++RTTSCN  + +   N P +R + SSPHLSV SRS + +GKSLR+R   A S++S SN  SSQELAVLLEVEGVLVDAYRSTNRQA
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA

Query:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL
        FNEAF+KLGLDCANWTDP+YSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNAS ELM SQ LPLRPGVEDFIDNAY+EGIPVIIL
Subjt:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL

Query:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL
        TAYSKSGEEIARSI+NKLGPERISKVKIVGNEE R SLYS+VV GQAKHSGLDEQLAKEAMKAASAEKQ+IAEKVA+ LKLSVEINTTSSESLD II AL
Subjt:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL

Query:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        RAGAELA IPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANA+MDGFG
Subjt:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

XP_022146591.1 CBBY-like protein isoform X1 [Momordica charantia]1.2e-16387.15Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA
        MEITSCS LY RPL+RT +CN S SY+I NQPRSRL+ SSPHLSV SRS NF GKSLRL RL AFSS S S++DSSQELAVLLEVEGVLVDAYRSTNRQA
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA

Query:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL
        FNEAF+KLGLDCANWT+P+YSDLVRKNA+NEERMLI YFNRIGWPTSLPTNEKESFIK VL+EKKNAS ELM SQ LPLRPGVEDFIDNAYNEGIPVIIL
Subjt:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL

Query:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL
        TAYSK+GEEIARSI+NKLGPERISKVKIVGNEEAR SLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQ+IAEKVA+ LKLSVEIN TSSESL+ I  AL
Subjt:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL

Query:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        RAGAELAGIPVSNCILIAGTQ GVDGA +IGMP IVLRSSLTSRAEFPSANA+MDGFG
Subjt:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

XP_022925818.1 uncharacterized protein LOC111433112 isoform X1 [Cucurbita moschata]4.4e-16186.03Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA
        MEITSCS LYT P++RTTSCN  +S+   N P +R + SSPHLSV SRS + IGKSLR+R   A S++S SN  SSQELAVLLEVEGVLVDAYRSTNRQA
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA

Query:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL
        FNEAF+KLGLDCANWTDP+YSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNAS ELM SQ LPLRPGVEDFIDNAY+EGIPVIIL
Subjt:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL

Query:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL
        TAYSKSGEEIARSI+NKLGPERISKVKIVGNEE R SLYS+VV GQAKHSGLDEQLAKEAMKAASAEKQ+IAEKVA+ LKLSVEINTTSSESLD II AL
Subjt:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL

Query:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        RAGAELA IPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSA+A+MDGFG
Subjt:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

XP_022977624.1 uncharacterized protein LOC111477887 isoform X1 [Cucurbita maxima]4.0e-16286.31Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA
        MEITSCS LYT P++RTTSCN  +S+   N P +R + SSPHLSV SRS+   GKSLR+R   A S++S SN  SSQELAVLLEVEGVLVDAYRSTNRQA
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA

Query:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL
        FNEAF+KLGLDCANWTDP+YSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNAS ELM SQ LPLRPGVEDFIDNAYNEGIPVIIL
Subjt:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL

Query:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL
        TAYSKSGEEIARSI+NKLGPERISKVKIVGNEE R SLYS+VV GQAKHSGLDEQLAKEAMKAASAEKQ+IAEKVA+ LKLSVEINTTSSESLD II AL
Subjt:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL

Query:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        RAGAELA IPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANA+MDGFG
Subjt:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

XP_038881961.1 CBBY-like protein isoform X1 [Benincasa hispida]6.4e-16085.24Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLY-SSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQ
        ME+TS S+L+T P++RTT+CN S+S VI  QP SR Y SSSP LSV S++YNF GKSLR+ RLTAFSSSS SN DS+QELAVLLEVEGVLVDAYRSTNRQ
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLY-SSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQ

Query:  AFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVII
        AFNEAF+KLGLDCANWT+P+YSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKK AS ELM SQ LPLRPGVEDFIDNAYNEGIPVII
Subjt:  AFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVII

Query:  LTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRA
        LTAYSKSGEEIARSI+ KLGPERISKVKIVGNEE R SLYS+ V GQAK SGL+E+LAKEAMKAASAEKQ+IA+KVA+ LKLSVEINTTSSESLD II A
Subjt:  LTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRA

Query:  LRAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        LRAGAELA  PVSNCILIAGTQSG+DGAERIGMPRIVLRSSLTSRAEFPSANA+MDGFG
Subjt:  LRAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

TrEMBL top hitse value%identityAlignment
A0A0A0KJ23 Uncharacterized protein9.3e-15784.12Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFI-GKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQ
        MEIT  S+LYT P++RTT+CN S+S+ I     SR Y SSP LSV SRSYNFI   SLR+RRLTAFSSSS SN DS QELAVLLEVEGVLVDAYRSTNRQ
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFI-GKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQ

Query:  AFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVII
        AFNEAF+KLGLDCANWT+P+YSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREK  AS ELM SQ LPLRPGVEDFIDNA+NEGIPVII
Subjt:  AFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVII

Query:  LTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRA
        LTAYSKSGEEIARSI+NKLGPERISKVKIVGNEE R SLYS+ V GQAK SGL+E+LAKEAMKAASAEKQ+IA+KVA+ LKLSVEINTTSSESLD II A
Subjt:  LTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRA

Query:  LRAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        LRAG+ELAG PVSNCIL+AGTQSG+DGAERIGMPRIV+RSSLTSRAEFPSANA+MDGFG
Subjt:  LRAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

A0A5A7UG24 Haloacid dehalogenase-like hydrolase domain-containing protein1.3e-15582.73Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFI-GKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQ
        MEIT  S+LYT P++RTT+CN S+S+ I     S  Y SSP LSV  RSYNFI   SLR+RRLTAFSSSS SN DS QELAVLLEVEGVLVDAYRSTNRQ
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFI-GKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQ

Query:  AFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVII
        AFNEAF+KLGLDCANWT+P+YSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKK AS ELM SQ LPLRPGVEDFID+A+NEGIPV+I
Subjt:  AFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVII

Query:  LTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRA
        LTAYSKSGEEIARSI+NKLGPERISKVKIVGNEE R SLYS+ V  QA  SGL+E+LAKEAMKAASAEKQ+IA+KVA+ LKLSVEINTTSSESLD II A
Subjt:  LTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRA

Query:  LRAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        LRAG+ELAG PVSNCIL+AGTQSG+DGAERIGMPR+VLRSSLTSRAEFPSANA+MDGFG
Subjt:  LRAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

A0A6J1CZU3 CBBY-like protein isoform X16.0e-16487.15Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA
        MEITSCS LY RPL+RT +CN S SY+I NQPRSRL+ SSPHLSV SRS NF GKSLRL RL AFSS S S++DSSQELAVLLEVEGVLVDAYRSTNRQA
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA

Query:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL
        FNEAF+KLGLDCANWT+P+YSDLVRKNA+NEERMLI YFNRIGWPTSLPTNEKESFIK VL+EKKNAS ELM SQ LPLRPGVEDFIDNAYNEGIPVIIL
Subjt:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL

Query:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL
        TAYSK+GEEIARSI+NKLGPERISKVKIVGNEEAR SLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQ+IAEKVA+ LKLSVEIN TSSESL+ I  AL
Subjt:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL

Query:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        RAGAELAGIPVSNCILIAGTQ GVDGA +IGMP IVLRSSLTSRAEFPSANA+MDGFG
Subjt:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

A0A6J1EGB6 uncharacterized protein LOC111433112 isoform X12.1e-16186.03Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA
        MEITSCS LYT P++RTTSCN  +S+   N P +R + SSPHLSV SRS + IGKSLR+R   A S++S SN  SSQELAVLLEVEGVLVDAYRSTNRQA
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA

Query:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL
        FNEAF+KLGLDCANWTDP+YSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNAS ELM SQ LPLRPGVEDFIDNAY+EGIPVIIL
Subjt:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL

Query:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL
        TAYSKSGEEIARSI+NKLGPERISKVKIVGNEE R SLYS+VV GQAKHSGLDEQLAKEAMKAASAEKQ+IAEKVA+ LKLSVEINTTSSESLD II AL
Subjt:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL

Query:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        RAGAELA IPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSA+A+MDGFG
Subjt:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

A0A6J1IRX3 uncharacterized protein LOC111477887 isoform X11.9e-16286.31Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA
        MEITSCS LYT P++RTTSCN  +S+   N P +R + SSPHLSV SRS+   GKSLR+R   A S++S SN  SSQELAVLLEVEGVLVDAYRSTNRQA
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQA

Query:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL
        FNEAF+KLGLDCANWTDP+YSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNAS ELM SQ LPLRPGVEDFIDNAYNEGIPVIIL
Subjt:  FNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIIL

Query:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL
        TAYSKSGEEIARSI+NKLGPERISKVKIVGNEE R SLYS+VV GQAKHSGLDEQLAKEAMKAASAEKQ+IAEKVA+ LKLSVEINTTSSESLD II AL
Subjt:  TAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRAL

Query:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
        RAGAELA IPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANA+MDGFG
Subjt:  RAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

SwissProt top hitse value%identityAlignment
O33513 Protein CbbY1.1e-0527.17Show/hide
Query:  AVLLEVEGVLVDAYRSTNRQAFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPL
        A++ +V+G L +     +RQAFNE F   GLD   W+   Y  L+R     +ERM     N       L +   ++ I  + + K    VE++ S ++ L
Subjt:  AVLLEVEGVLVDAYRSTNRQAFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPL

Query:  RPGVEDFIDNAYNEGIPVIILTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGL
         PGV + ID A   G+ + I T  +++  +   +         I +V   G+E A+      V L   +  GL
Subjt:  RPGVEDFIDNAYNEGIPVIILTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGL

P40119 Protein CbbY, chromosomal1.7e-0936.07Show/hide
Query:  AVLLEVEGVLVDAYRSTNRQAFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPL
        A++ +V+G L D   S + QAFN AF ++GLD   W  P+Y+ L+ K A  +ER  +M++ R+  P      + +  I +V   K     E +G+  LPL
Subjt:  AVLLEVEGVLVDAYRSTNRQAFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPL

Query:  RPGVEDFIDNAYNEGIPVIILT
        RPG+   ID A   G+P+ I T
Subjt:  RPGVEDFIDNAYNEGIPVIILT

Q94K71 CBBY-like protein9.4e-2128.37Show/hide
Query:  SCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSS-GSNIDSSQEL-----AVLLEVEGVLVDAYRSTNRQAFNEAFQKLGLDC
        S +SS S +      S + +++P  +V  R   F GKSLR + +   SS S G    +S  L     A+L + +GVLVD  +  +R +FN+ F++  L+ 
Subjt:  SCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSS-GSNIDSSQEL-----AVLLEVEGVLVDAYRSTNRQAFNEAFQKLGLDC

Query:  ANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKE--SFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIILTAYSKSGEEI
          W   +Y +L+ K    +ERM   YFN++GWP   P +E E   FI  + ++K    + L+  + LPLRPGV   +D A   G+ V +    S S E+ 
Subjt:  ANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKE--SFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIILTAYSKSGEEI

Query:  ARSIVN-KLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRALRAGAELAGI
          +IV+  LGPER  K+KI   +          V+ + K       LA                                              AE  G+
Subjt:  ARSIVN-KLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRALRAGAELAGI

Query:  PVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
          S C+++  +  G+  A+  GM  IV +S  T+  +F +A+AV D  G
Subjt:  PVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

Arabidopsis top hitse value%identityAlignment
AT2G01640.1 unknown protein9.9e-2651.45Show/hide
Query:  RSVDRSFRFEPQSWRTIAALNAQKTISKKKKLRSRKKQLKAFDLSTLSEFLPGLEA-PKQKSSAAELKVNCKSRLKFILKERKQLDTVLTHPAFQEDPLR
        +  ++  +F  +   T+ +L+ QK I KKKK+RSR+K+LKA+DL+ LSEFLP   A  K    A ELK+NCK R K +L E ++L+ VL HPAFQ DP+ 
Subjt:  RSVDRSFRFEPQSWRTIAALNAQKTISKKKKLRSRKKQLKAFDLSTLSEFLPGLEA-PKQKSSAAELKVNCKSRLKFILKERKQLDTVLTHPAFQEDPLR

Query:  AIHRHLESTQ-PVEEPKKKKTNKNGSKKRKEKKSKASA
        +I +HL S Q PVEE  KKKTN NGSKKR +KK K  +
Subjt:  AIHRHLESTQ-PVEEPKKKKTNKNGSKKRKEKKSKASA

AT2G01640.2 unknown protein9.9e-2651.45Show/hide
Query:  RSVDRSFRFEPQSWRTIAALNAQKTISKKKKLRSRKKQLKAFDLSTLSEFLPGLEA-PKQKSSAAELKVNCKSRLKFILKERKQLDTVLTHPAFQEDPLR
        +  ++  +F  +   T+ +L+ QK I KKKK+RSR+K+LKA+DL+ LSEFLP   A  K    A ELK+NCK R K +L E ++L+ VL HPAFQ DP+ 
Subjt:  RSVDRSFRFEPQSWRTIAALNAQKTISKKKKLRSRKKQLKAFDLSTLSEFLPGLEA-PKQKSSAAELKVNCKSRLKFILKERKQLDTVLTHPAFQEDPLR

Query:  AIHRHLESTQ-PVEEPKKKKTNKNGSKKRKEKKSKASA
        +I +HL S Q PVEE  KKKTN NGSKKR +KK K  +
Subjt:  AIHRHLESTQ-PVEEPKKKKTNKNGSKKRKEKKSKASA

AT3G48420.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein6.6e-2228.37Show/hide
Query:  SCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSS-GSNIDSSQEL-----AVLLEVEGVLVDAYRSTNRQAFNEAFQKLGLDC
        S +SS S +      S + +++P  +V  R   F GKSLR + +   SS S G    +S  L     A+L + +GVLVD  +  +R +FN+ F++  L+ 
Subjt:  SCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSS-GSNIDSSQEL-----AVLLEVEGVLVDAYRSTNRQAFNEAFQKLGLDC

Query:  ANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKE--SFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIILTAYSKSGEEI
          W   +Y +L+ K    +ERM   YFN++GWP   P +E E   FI  + ++K    + L+  + LPLRPGV   +D A   G+ V +    S S E+ 
Subjt:  ANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKE--SFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIILTAYSKSGEEI

Query:  ARSIVN-KLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRALRAGAELAGI
          +IV+  LGPER  K+KI   +          V+ + K       LA                                              AE  G+
Subjt:  ARSIVN-KLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRALRAGAELAGI

Query:  PVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG
          S C+++  +  G+  A+  GM  IV +S  T+  +F +A+AV D  G
Subjt:  PVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFG

AT5G45170.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein5.3e-10454.26Show/hide
Query:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSS----PHLSVSSRSYNFIGKSLRLRRLTAFS-SSSGSNIDSSQELAVLLEVEGVLVDAYRS
        MEI SCS+L    +    SC  +  +  +   RS   +      P  +   +S   +GK LRL+R ++   S+S  +++ S+E AV+LEV+ V++D + S
Subjt:  MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSS----PHLSVSSRSYNFIGKSLRLRRLTAFS-SSSGSNIDSSQELAVLLEVEGVLVDAYRS

Query:  TNRQAFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGI
        +NRQAFN AFQKLGLDCANW +P+YSDL+RK AA+EE+ML++YFN+IGWP+SLPT+EK SF+KSVLREKKNA  E + S+ LPLR GV++FIDNAY E +
Subjt:  TNRQAFNEAFQKLGLDCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGI

Query:  PVIILTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDN
        PV I+TAY KSG+++A SIV  LG ER+  VK++G+ E   S+Y Q+VLG+   S L+EQL KE  KAASAEKQ+IAE+VA++LKLSV+I+TTSSE L+ 
Subjt:  PVIILTAYSKSGEEIARSIVNKLGPERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDN

Query:  IIRALRAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFGFSQTLRAKAGNQV
        I+ ALRA AE  G+PV+NC+L+AG+Q GV  A+ IGMP +V+RSSLT+R EFPSA  VMDGFG +     K  N++
Subjt:  IIRALRAGAELAGIPVSNCILIAGTQSGVDGAERIGMPRIVLRSSLTSRAEFPSANAVMDGFGFSQTLRAKAGNQV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATAACTTCCTGCTCAATGCTATATACTCGTCCTTTGCAAAGAACCACAAGTTGCAACTCCTCCCACTCCTACGTGATATTTAATCAGCCAAGAAGCAGACTCTA
TTCGTCTTCTCCACATCTTTCTGTATCATCGAGAAGCTATAATTTCATTGGAAAGAGTTTACGCCTCAGAAGATTGACTGCTTTCAGCAGTTCCAGCGGTTCCAACATCG
ACTCATCCCAAGAACTCGCAGTTCTTCTTGAAGTTGAAGGAGTCCTCGTGGATGCATATCGCTCAACTAATCGACAAGCTTTCAATGAGGCATTTCAAAAGCTTGGACTT
GACTGTGCAAATTGGACTGATCCTATATATTCAGACCTTGTCAGGAAGAATGCTGCTAATGAGGAACGGATGCTAATTATGTATTTCAACCGTATTGGTTGGCCAACTTC
ACTGCCAACAAATGAGAAGGAATCATTTATTAAAAGTGTTCTGCGAGAAAAGAAAAATGCATCAGTTGAATTGATGGGCTCACAAAGGTTACCTTTACGGCCTGGAGTTG
AAGATTTCATTGACAATGCATATAATGAAGGAATACCTGTGATTATTCTCACAGCCTACAGCAAAAGTGGAGAAGAAATTGCTAGATCTATCGTTAATAAGCTTGGACCT
GAGAGAATATCCAAAGTAAAGATTGTTGGGAATGAGGAGGCAAGACTGAGTTTATATAGCCAAGTTGTGCTTGGTCAAGCAAAGCATTCAGGTTTGGATGAGCAACTAGC
TAAGGAAGCAATGAAAGCAGCCTCTGCCGAGAAACAAAAGATAGCTGAAAAGGTTGCAGCAGTGCTGAAGTTGAGTGTGGAAATTAATACTACCTCATCTGAAAGTTTGG
ACAATATCATACGTGCATTGCGTGCTGGAGCAGAGCTTGCAGGCATACCTGTTTCCAATTGCATCCTTATTGCAGGAACCCAATCTGGGGTTGATGGAGCTGAGCGAATA
GGGATGCCACGTATTGTACTACGTAGTAGTTTGACATCAAGAGCTGAGTTCCCTTCAGCAAATGCTGTCATGGATGGCTTTGGATTTTCTCAGACTCTGAGAGCGAAGGC
TGGGAACCAGGTATGCCATTGGTCTGCGAACAACCGAGTAAAACTGGAACTGGAGGTTTTCGTAAACCTTATTGTTTTTTTTTTCTTTTCTTTTTTTCGATCTGTGGACA
GAAGTTTCCGATTTGAGCCACAGAGTTGGAGAACTATTGCTGCCTTAAATGCTCAAAAGACTATTTCTAAGAAGAAAAAACTGCGAAGTCGAAAAAAGCAGTTAAAAGCT
TTCGATCTTTCTACTCTATCAGAGTTTCTTCCTGGATTGGAGGCTCCTAAACAAAAATCTTCCGCAGCCGAGTTAAAAGTAAATTGCAAGTCTAGGCTGAAGTTTATATT
GAAGGAAAGAAAGCAACTGGATACAGTTCTTACTCACCCTGCATTCCAGGAAGACCCCTTGAGAGCTATTCATCGACATTTAGAGAGCACCCAACCAGTTGAGGAACCGA
AGAAAAAGAAGACGAACAAAAATGGGAGCAAGAAGAGGAAAGAGAAAAAGTCGAAAGCCTCTGCAAGACCTTCATCTATGGAGACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAATAACTTCCTGCTCAATGCTATATACTCGTCCTTTGCAAAGAACCACAAGTTGCAACTCCTCCCACTCCTACGTGATATTTAATCAGCCAAGAAGCAGACTCTA
TTCGTCTTCTCCACATCTTTCTGTATCATCGAGAAGCTATAATTTCATTGGAAAGAGTTTACGCCTCAGAAGATTGACTGCTTTCAGCAGTTCCAGCGGTTCCAACATCG
ACTCATCCCAAGAACTCGCAGTTCTTCTTGAAGTTGAAGGAGTCCTCGTGGATGCATATCGCTCAACTAATCGACAAGCTTTCAATGAGGCATTTCAAAAGCTTGGACTT
GACTGTGCAAATTGGACTGATCCTATATATTCAGACCTTGTCAGGAAGAATGCTGCTAATGAGGAACGGATGCTAATTATGTATTTCAACCGTATTGGTTGGCCAACTTC
ACTGCCAACAAATGAGAAGGAATCATTTATTAAAAGTGTTCTGCGAGAAAAGAAAAATGCATCAGTTGAATTGATGGGCTCACAAAGGTTACCTTTACGGCCTGGAGTTG
AAGATTTCATTGACAATGCATATAATGAAGGAATACCTGTGATTATTCTCACAGCCTACAGCAAAAGTGGAGAAGAAATTGCTAGATCTATCGTTAATAAGCTTGGACCT
GAGAGAATATCCAAAGTAAAGATTGTTGGGAATGAGGAGGCAAGACTGAGTTTATATAGCCAAGTTGTGCTTGGTCAAGCAAAGCATTCAGGTTTGGATGAGCAACTAGC
TAAGGAAGCAATGAAAGCAGCCTCTGCCGAGAAACAAAAGATAGCTGAAAAGGTTGCAGCAGTGCTGAAGTTGAGTGTGGAAATTAATACTACCTCATCTGAAAGTTTGG
ACAATATCATACGTGCATTGCGTGCTGGAGCAGAGCTTGCAGGCATACCTGTTTCCAATTGCATCCTTATTGCAGGAACCCAATCTGGGGTTGATGGAGCTGAGCGAATA
GGGATGCCACGTATTGTACTACGTAGTAGTTTGACATCAAGAGCTGAGTTCCCTTCAGCAAATGCTGTCATGGATGGCTTTGGATTTTCTCAGACTCTGAGAGCGAAGGC
TGGGAACCAGGTATGCCATTGGTCTGCGAACAACCGAGTAAAACTGGAACTGGAGGTTTTCGTAAACCTTATTGTTTTTTTTTTCTTTTCTTTTTTTCGATCTGTGGACA
GAAGTTTCCGATTTGAGCCACAGAGTTGGAGAACTATTGCTGCCTTAAATGCTCAAAAGACTATTTCTAAGAAGAAAAAACTGCGAAGTCGAAAAAAGCAGTTAAAAGCT
TTCGATCTTTCTACTCTATCAGAGTTTCTTCCTGGATTGGAGGCTCCTAAACAAAAATCTTCCGCAGCCGAGTTAAAAGTAAATTGCAAGTCTAGGCTGAAGTTTATATT
GAAGGAAAGAAAGCAACTGGATACAGTTCTTACTCACCCTGCATTCCAGGAAGACCCCTTGAGAGCTATTCATCGACATTTAGAGAGCACCCAACCAGTTGAGGAACCGA
AGAAAAAGAAGACGAACAAAAATGGGAGCAAGAAGAGGAAAGAGAAAAAGTCGAAAGCCTCTGCAAGACCTTCATCTATGGAGACGTGA
Protein sequenceShow/hide protein sequence
MEITSCSMLYTRPLQRTTSCNSSHSYVIFNQPRSRLYSSSPHLSVSSRSYNFIGKSLRLRRLTAFSSSSGSNIDSSQELAVLLEVEGVLVDAYRSTNRQAFNEAFQKLGL
DCANWTDPIYSDLVRKNAANEERMLIMYFNRIGWPTSLPTNEKESFIKSVLREKKNASVELMGSQRLPLRPGVEDFIDNAYNEGIPVIILTAYSKSGEEIARSIVNKLGP
ERISKVKIVGNEEARLSLYSQVVLGQAKHSGLDEQLAKEAMKAASAEKQKIAEKVAAVLKLSVEINTTSSESLDNIIRALRAGAELAGIPVSNCILIAGTQSGVDGAERI
GMPRIVLRSSLTSRAEFPSANAVMDGFGFSQTLRAKAGNQVCHWSANNRVKLELEVFVNLIVFFFFSFFRSVDRSFRFEPQSWRTIAALNAQKTISKKKKLRSRKKQLKA
FDLSTLSEFLPGLEAPKQKSSAAELKVNCKSRLKFILKERKQLDTVLTHPAFQEDPLRAIHRHLESTQPVEEPKKKKTNKNGSKKRKEKKSKASARPSSMET