; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014704 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014704
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUPF0301 protein
Genome locationtig00001047:207640..212450
RNA-Seq ExpressionSgr014704
SyntenySgr014704
Gene Ontology termsNA
InterPro domainsIPR003774 - Protein of unknown function UPF0301


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578309.1 hypothetical protein SDJN03_22757, partial [Cucurbita argyrosperma subsp. sororia]5.8e-15677.75Show/hide
Query:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD
        MDL AVNVKNTAT P PFSLKHSFPDRPIS SL K +RR S +HPFGA+LL LLE+RV RP++CS  S  RSF+VRAIAKKN DNSPSP NGD S+PGDD
Subjt:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD

Query:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        AKSNNISDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D+++QS NAHES+ LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG
Subjt:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV
        +RHPQEGPFGV                                    ASMFLLKTGEKSKLHGFEEVIPG+CFGARNSLDEAA LVKKGILKPQDF+FFV
Subjt:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV

Query:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        GYAGWQLDQLREEIESDYWYVAACSSNL+CGGASDSSSEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

XP_022152489.1 uncharacterized protein LOC111020207 [Momordica charantia]1.8e-15780.27Show/hide
Query:  MDLLAVNVKNTATPTPFSLKHSFPDRPISSSLVKTTRRLSSTHP--FGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGD
        MDLLAV+VKNTATP+P  L    P+RPISSS +K+TRR SS HP  FGAELLGLLE+RV RPK+CSTAS YRSF+V AIAKKNHDNSPSPGNGD SIPG+
Subjt:  MDLLAVNVKNTATPTPFSLKHSFPDRPISSSLVKTTRRLSSTHP--FGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGD

Query:  DAKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRS
        DAKSNN SDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSAN HESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRS
Subjt:  DAKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRS

Query:  GTRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFF
        GTRHPQEGPFGV                                    ASMFLLKTGEK KLHGFEEVIPG+C+GARNSLD AAGLVKKGILKPQDF+FF
Subjt:  GTRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFF

Query:  VGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        VGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGGHYSELSRKPKQDM
Subjt:  VGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

XP_022939198.1 uncharacterized protein LOC111445188 isoform X2 [Cucurbita moschata]9.9e-15678.02Show/hide
Query:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD
        MDL AVNVKNTAT P PFSLKHSFPDRPIS SL K +RR S +HPFGA+LL LLE+RV RP++CS  S  RSF+VRAIAKKN DNSPSP NGD S+PGDD
Subjt:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD

Query:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        AKSNNISD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS NAHESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG
Subjt:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV
        +RHPQEGPFGV                                    ASMFLLKTGEKSKLHGFEEVIPG+CFGARNSLDEAA LVKKGILKPQDF+FFV
Subjt:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV

Query:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        GYAGWQLDQLREEIESDYWYVAACSSNL+CGGASDSSSEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

XP_022993090.1 uncharacterized protein LOC111489210 isoform X2 [Cucurbita maxima]2.0e-15678.3Show/hide
Query:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD
        MDL AVNVKNTAT P PFSLKHSFPDRPIS SL K +RR S +HPFGA+LL LLE+RV RP++CS  S  RSF+VRAIAKKN DNSPSP NGD S+PGDD
Subjt:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD

Query:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        AKSNNISDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS NAHESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG
Subjt:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV
        +RHPQEGPFGV                                    ASMFLLKTGEKSKLHGFEEVIPG+CFGARNSLDEAA LVKKGILKPQDF+FFV
Subjt:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV

Query:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        GYAGWQLDQLREEIESDYWYVAACSSNL+CGGASDSSSEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

XP_023550640.1 uncharacterized protein LOC111808722 isoform X2 [Cucurbita pepo subsp. pepo]2.6e-15678.02Show/hide
Query:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD
        MDL AVNVKNTAT P PFSLKHSFPDRPIS SL K +RR S +HPFGA+LL LLE+RV RP++CS  S  RSF+VRAIAKKN DNSPSP NGD S+PGDD
Subjt:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD

Query:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        AKSNNISDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D+++QS NAHESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG
Subjt:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV
        +RHPQEGPFGV                                    ASMFLLKTGEKSKLHGFEEVIPG+CFGARNSLDEAA LVKKGILKPQDF+FFV
Subjt:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV

Query:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        GYAGWQLDQLREEIESDYWYVAACSSNL+CGGASDSSSEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

TrEMBL top hitse value%identityAlignment
A0A6J1DG55 uncharacterized protein LOC1110202078.7e-15880.27Show/hide
Query:  MDLLAVNVKNTATPTPFSLKHSFPDRPISSSLVKTTRRLSSTHP--FGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGD
        MDLLAV+VKNTATP+P  L    P+RPISSS +K+TRR SS HP  FGAELLGLLE+RV RPK+CSTAS YRSF+V AIAKKNHDNSPSPGNGD SIPG+
Subjt:  MDLLAVNVKNTATPTPFSLKHSFPDRPISSSLVKTTRRLSSTHP--FGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGD

Query:  DAKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRS
        DAKSNN SDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSAN HESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRS
Subjt:  DAKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRS

Query:  GTRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFF
        GTRHPQEGPFGV                                    ASMFLLKTGEK KLHGFEEVIPG+C+GARNSLD AAGLVKKGILKPQDF+FF
Subjt:  GTRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFF

Query:  VGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        VGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGGHYSELSRKPKQDM
Subjt:  VGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1FG48 uncharacterized protein LOC111445188 isoform X17.9e-15170.2Show/hide
Query:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPG----------
        MDL AVNVKNTAT P PFSLKHSFPDRPIS SL K +RR S +HPFGA+LL LLE+RV RP++CS  S  RSF+VRAIAKKN DNSPSPG          
Subjt:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPG----------

Query:  --------------------------------NGDRSIPGDDAKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESK
                                        NGD S+PGDDAKSNNISD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS NAHESK
Subjt:  --------------------------------NGDRSIPGDDAKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESK

Query:  PLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGV------------------------------------ASMFLLKTGEK
         LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG+RHPQEGPFGV                                    ASMFLLKTGEK
Subjt:  PLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGV------------------------------------ASMFLLKTGEK

Query:  SKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSR
        SKLHGFEEVIPG+CFGARNSLDEAA LVKKGILKPQDF+FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGASDSSSEGLWEEILQLMGG YSELSR
Subjt:  SKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSR

Query:  KPKQDM
        KPKQDM
Subjt:  KPKQDM

A0A6J1FL02 uncharacterized protein LOC111445188 isoform X24.8e-15678.02Show/hide
Query:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD
        MDL AVNVKNTAT P PFSLKHSFPDRPIS SL K +RR S +HPFGA+LL LLE+RV RP++CS  S  RSF+VRAIAKKN DNSPSP NGD S+PGDD
Subjt:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD

Query:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        AKSNNISD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS NAHESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG
Subjt:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV
        +RHPQEGPFGV                                    ASMFLLKTGEKSKLHGFEEVIPG+CFGARNSLDEAA LVKKGILKPQDF+FFV
Subjt:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV

Query:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        GYAGWQLDQLREEIESDYWYVAACSSNL+CGGASDSSSEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1JXJ8 uncharacterized protein LOC111489210 isoform X29.6e-15778.3Show/hide
Query:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD
        MDL AVNVKNTAT P PFSLKHSFPDRPIS SL K +RR S +HPFGA+LL LLE+RV RP++CS  S  RSF+VRAIAKKN DNSPSP NGD S+PGDD
Subjt:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDD

Query:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        AKSNNISDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS NAHESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG
Subjt:  AKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV
        +RHPQEGPFGV                                    ASMFLLKTGEKSKLHGFEEVIPG+CFGARNSLDEAA LVKKGILKPQDF+FFV
Subjt:  TRHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFV

Query:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        GYAGWQLDQLREEIESDYWYVAACSSNL+CGGASDSSSEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  GYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1JZ91 uncharacterized protein LOC111489210 isoform X11.6e-15170.44Show/hide
Query:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPG----------
        MDL AVNVKNTAT P PFSLKHSFPDRPIS SL K +RR S +HPFGA+LL LLE+RV RP++CS  S  RSF+VRAIAKKN DNSPSPG          
Subjt:  MDLLAVNVKNTAT-PTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPG----------

Query:  --------------------------------NGDRSIPGDDAKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESK
                                        NGD S+PGDDAKSNNISDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS NAHESK
Subjt:  --------------------------------NGDRSIPGDDAKSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESK

Query:  PLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGV------------------------------------ASMFLLKTGEK
         LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG+RHPQEGPFGV                                    ASMFLLKTGEK
Subjt:  PLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGV------------------------------------ASMFLLKTGEK

Query:  SKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSR
        SKLHGFEEVIPG+CFGARNSLDEAA LVKKGILKPQDF+FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGASDSSSEGLWEEILQLMGG YSELSR
Subjt:  SKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSR

Query:  KPKQDM
        KPKQDM
Subjt:  KPKQDM

SwissProt top hitse value%identityAlignment
B3QMC9 UPF0301 protein Cpar_06625.3e-1129.41Show/hide
Query:  GPFGVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLW
        GP  V ++  L T     +   +EV+PG+ +G     ++ + L+  G+++P + +FF+GYAGW   QL++E E   WY A  S+  V         E +W
Subjt:  GPFGVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLW

Query:  EEILQLMGGHYSELSRKPK
           ++  GG Y  ++  P+
Subjt:  EEILQLMGGHYSELSRKPK

B4SD86 UPF0301 protein Ppha_21426.9e-1129.41Show/hide
Query:  GPFGVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLW
        GP  V ++  L +     + G  E++PG+ +G     +E + L+  G++ P + +FF+GYAGW   QL  E E   WY A  S +++   A     E +W
Subjt:  GPFGVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLW

Query:  EEILQLMGGHYSELSRKPK
           ++  GG Y  ++  P+
Subjt:  EEILQLMGGHYSELSRKPK

Q3AQ69 UPF0301 protein Cag_16019.7e-1330.58Show/hide
Query:  QEGPFGVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEG
        Q GP  V S+  L +     +H  +EV+PG+ +G     DE + L+  G++ P + +F++GYAGW   QL  E E   WY A  + +++   A     E 
Subjt:  QEGPFGVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEG

Query:  LWEEILQLMGGHYSELSRKPK
        +W   ++  GG Y  ++  P+
Subjt:  LWEEILQLMGGHYSELSRKPK

Q3B561 UPF0301 protein Plut_06371.4e-1132.67Show/hide
Query:  LHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKP
        + G E+++PG+ +G     +E   L+  G+LKP + +FF+GYAGW   QL  E E   WY A  +  +V  G      E +W   ++  GG Y  ++  P
Subjt:  LHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKP

Query:  K
        +
Subjt:  K

Q8KEM4 UPF0301 protein CT06635.3e-1131.93Show/hide
Query:  GPFGVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLW
        GP  V ++ +L T     + G  EVIPG+ +G     ++ + L+  G++K  + +FF+GYAGW   QL  E E   WY A  SS  V         E +W
Subjt:  GPFGVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLW

Query:  EEILQLMGGHYSELSRKPK
           ++  GG Y  ++  P+
Subjt:  EEILQLMGGHYSELSRKPK

Arabidopsis top hitse value%identityAlignment
AT1G33780.1 Protein of unknown function (DUF179)6.5e-9755.37Show/hide
Query:  MDLLAVNVKNTATPTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDDA
        MDL  + +K+     P     S P++  S S     R+L             LE R +  KV  +AS YRS VVRA +KK++D+S S        PGD +
Subjt:  MDLLAVNVKNTATPTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDDA

Query:  KSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGT
        + N  S+GNKS ++++ KS  +N DWREFRANLF +EQ EK +       A  HES+P+GLKWAHPIP PETGCVLVATEKLDG RTF RTVVLLLR+GT
Subjt:  KSNNISDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGT

Query:  RHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVG
        RHPQEGPFGV                                    ASMFLLKTG+K+K+ GFEEV+PG+ FG RNSLDEAA LVKKG+LKPQ+F+FFVG
Subjt:  RHPQEGPFGV------------------------------------ASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVG

Query:  YAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM
        YAGWQLDQLREEIESDYW+VAACSS+L+CG    +SSE LWEEILQLMGG YSELSRKPK D+
Subjt:  YAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM

AT3G19780.1 LOCATED IN: endomembrane system2.5e-0826.73Show/hide
Query:  NNISDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG--
        N   + NK +++SS   ++   D  +     L  RE AE+   +VN    N+       L  A   P  +TG VLVATEKL    TF ++ +L++++G  
Subjt:  NNISDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG--

Query:  -----------------------------TRHPQEGPF---GVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAG
                                     T     GP    G+  + L +  + S  H   E+ PGV F    S+      +K   L P ++ FF+GY+ 
Subjt:  -----------------------------TRHPQEGPF---GVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAG

Query:  WQLDQLREEIESDYWYV
        W  +QL +EI    W V
Subjt:  WQLDQLREEIESDYWYV

AT3G19780.2 LOCATED IN: endomembrane system2.5e-0826.73Show/hide
Query:  NNISDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG--
        N   + NK +++SS   ++   D  +     L  RE AE+   +VN    N+       L  A   P  +TG VLVATEKL    TF ++ +L++++G  
Subjt:  NNISDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG--

Query:  -----------------------------TRHPQEGPF---GVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAG
                                     T     GP    G+  + L +  + S  H   E+ PGV F    S+      +K   L P ++ FF+GY+ 
Subjt:  -----------------------------TRHPQEGPF---GVASMFLLKTGEKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAG

Query:  WQLDQLREEIESDYWYV
        W  +QL +EI    W V
Subjt:  WQLDQLREEIESDYWYV

AT3G29240.1 Protein of unknown function (DUF179)8.6e-3338.14Show/hide
Query:  DWREFRANLFAREQAEKVDTD----------VNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHP-----------
        DWREFRA L A EQA   + D          V+ Q +++     +G KWAH I  PETGC+L+ATEKLDGV  FE+TV+LLL  G   P           
Subjt:  DWREFRANLFAREQAEKVDTD----------VNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHP-----------

Query:  -----------QEGPFGVASMFL---LKTG------------EKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEI
                     G F    +F    L+ G            E  K   F +V+ G+ +G R S+  AA +VK+ ++   + +FF GY GW+ +QL+ EI
Subjt:  -----------QEGPFGVASMFL---LKTG------------EKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEI

Query:  ESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMG
           YW VAACSS +V  G S   S GLW+E+L L+G
Subjt:  ESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMG

AT3G29240.2 Protein of unknown function (DUF179)8.6e-3338.14Show/hide
Query:  DWREFRANLFAREQAEKVDTD----------VNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHP-----------
        DWREFRA L A EQA   + D          V+ Q +++     +G KWAH I  PETGC+L+ATEKLDGV  FE+TV+LLL  G   P           
Subjt:  DWREFRANLFAREQAEKVDTD----------VNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHP-----------

Query:  -----------QEGPFGVASMFL---LKTG------------EKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEI
                     G F    +F    L+ G            E  K   F +V+ G+ +G R S+  AA +VK+ ++   + +FF GY GW+ +QL+ EI
Subjt:  -----------QEGPFGVASMFL---LKTG------------EKSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEI

Query:  ESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMG
           YW VAACSS +V  G S   S GLW+E+L L+G
Subjt:  ESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTCTTAGCTGTAAATGTCAAAAACACCGCCACTCCCACCCCCTTCTCACTCAAACACTCCTTTCCCGATAGACCCATTTCTTCCAGTTTGGTGAAGACTACGAG
GAGACTCTCTTCTACACATCCCTTTGGCGCTGAACTTCTGGGGCTGCTTGAGGTCCGAGTAGTCAGGCCCAAGGTTTGCTCTACCGCTTCTGGTTATCGTTCTTTTGTGG
TGAGAGCCATAGCGAAGAAGAATCACGATAATTCTCCATCTCCTGGAAATGGAGATCGCTCAATTCCTGGAGATGACGCTAAATCAAACAATATTTCTGATGGCAACAAA
TCTAATGAAACTTCTTCCCAGAAATCACATCATATTAATTTGGACTGGCGAGAATTCAGAGCAAACCTATTTGCTCGTGAGCAGGCAGAGAAAGTGGACACTGACGTGAA
TGTCCAAAGTGCAAATGCCCACGAGTCTAAGCCTCTTGGCCTGAAGTGGGCACATCCCATTCCTATACCTGAAACTGGCTGTGTCCTTGTTGCCACAGAGAAGTTGGATG
GGGTTCGCACTTTTGAGCGAACAGTTGTCCTTCTCCTCAGATCTGGAACCAGACATCCTCAGGAGGGGCCATTTGGAGTTGCGAGCATGTTTTTGCTGAAAACAGGAGAA
AAATCAAAACTCCATGGCTTTGAAGAAGTGATCCCGGGCGTCTGCTTTGGCGCTAGAAACAGCCTTGATGAAGCTGCAGGGCTGGTGAAGAAGGGAATCCTTAAACCTCA
GGATTTCAAATTTTTTGTGGGTTATGCTGGGTGGCAACTGGACCAATTGAGGGAGGAGATTGAATCAGATTACTGGTATGTGGCTGCTTGTAGCTCCAATCTAGTTTGTG
GAGGCGCATCAGATTCCTCATCGGAGGGACTGTGGGAGGAGATTTTGCAGCTAATGGGTGGTCACTATTCAGAGTTGAGCAGAAAGCCTAAGCAAGATATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATCTCTTAGCTGTAAATGTCAAAAACACCGCCACTCCCACCCCCTTCTCACTCAAACACTCCTTTCCCGATAGACCCATTTCTTCCAGTTTGGTGAAGACTACGAG
GAGACTCTCTTCTACACATCCCTTTGGCGCTGAACTTCTGGGGCTGCTTGAGGTCCGAGTAGTCAGGCCCAAGGTTTGCTCTACCGCTTCTGGTTATCGTTCTTTTGTGG
TGAGAGCCATAGCGAAGAAGAATCACGATAATTCTCCATCTCCTGGAAATGGAGATCGCTCAATTCCTGGAGATGACGCTAAATCAAACAATATTTCTGATGGCAACAAA
TCTAATGAAACTTCTTCCCAGAAATCACATCATATTAATTTGGACTGGCGAGAATTCAGAGCAAACCTATTTGCTCGTGAGCAGGCAGAGAAAGTGGACACTGACGTGAA
TGTCCAAAGTGCAAATGCCCACGAGTCTAAGCCTCTTGGCCTGAAGTGGGCACATCCCATTCCTATACCTGAAACTGGCTGTGTCCTTGTTGCCACAGAGAAGTTGGATG
GGGTTCGCACTTTTGAGCGAACAGTTGTCCTTCTCCTCAGATCTGGAACCAGACATCCTCAGGAGGGGCCATTTGGAGTTGCGAGCATGTTTTTGCTGAAAACAGGAGAA
AAATCAAAACTCCATGGCTTTGAAGAAGTGATCCCGGGCGTCTGCTTTGGCGCTAGAAACAGCCTTGATGAAGCTGCAGGGCTGGTGAAGAAGGGAATCCTTAAACCTCA
GGATTTCAAATTTTTTGTGGGTTATGCTGGGTGGCAACTGGACCAATTGAGGGAGGAGATTGAATCAGATTACTGGTATGTGGCTGCTTGTAGCTCCAATCTAGTTTGTG
GAGGCGCATCAGATTCCTCATCGGAGGGACTGTGGGAGGAGATTTTGCAGCTAATGGGTGGTCACTATTCAGAGTTGAGCAGAAAGCCTAAGCAAGATATGTAG
Protein sequenceShow/hide protein sequence
MDLLAVNVKNTATPTPFSLKHSFPDRPISSSLVKTTRRLSSTHPFGAELLGLLEVRVVRPKVCSTASGYRSFVVRAIAKKNHDNSPSPGNGDRSIPGDDAKSNNISDGNK
SNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANAHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVASMFLLKTGE
KSKLHGFEEVIPGVCFGARNSLDEAAGLVKKGILKPQDFKFFVGYAGWQLDQLREEIESDYWYVAACSSNLVCGGASDSSSEGLWEEILQLMGGHYSELSRKPKQDM