CuGenDBv2

MS003567 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS003567
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: UPF0301 protein
Genome location: scaffold234:3613589..3618663
RNA-Seq Expression: MS003567
Synteny: MS003567
Gene Ontology terms: NA
InterPro domains: IPR003774 - Protein of unknown function UPF0301
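The genome location above is written as `scaffold:start..end`. The following is a minimal sketch of how such a string could be split into its parts. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("scaffold234:3613589..3618663")
# Span length under the ASSUMED 1-based inclusive convention.
span = end - start + 1
print(seqid, start, end, span)  # scaffold234 3613589 3618663 5075
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` in the span calculation would need adjusting.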


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578309.1 hypothetical protein SDJN03_22757, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.4e-171, % identity: 83.38)
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN D+SPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP

Query:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        GDDAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D+++QS N HES+ LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPLHKKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM

XP_022152489.1 uncharacterized protein LOC111020207 [Momordica charantia] (e value: 1.9e-205, % identity: 99.16)
Query:  MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIPGDDAK
        MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHD+SPSPGNGDHSIPG+DAK
Subjt:  MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIPGDDAK

Query:  SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR
        SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR
Subjt:  SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR

Query:  HPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY
        HPQEGPFGVVINRPL KKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY
Subjt:  HPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY

Query:  AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM
        AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM
Subjt:  AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM

XP_022939198.1 uncharacterized protein LOC111445188 isoform X2 [Cucurbita moschata] (e value: 2.4e-171, % identity: 83.65)
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN D+SPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP

Query:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        GDDAKSNN SD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPLHKKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM

XP_022993090.1 uncharacterized protein LOC111489210 isoform X2 [Cucurbita maxima] (e value: 4.8e-172, % identity: 83.92)
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN D+SPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP

Query:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        GDDAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPLHKKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM

XP_023550640.1 uncharacterized protein LOC111808722 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 6.3e-172, % identity: 83.65)
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN D+SPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP

Query:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        GDDAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D+++QS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPLHKKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DG55 uncharacterized protein LOC111020207 (e value: 9.4e-206, % identity: 99.16)
Query:  MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIPGDDAK
        MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHD+SPSPGNGDHSIPG+DAK
Subjt:  MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIPGDDAK

Query:  SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR
        SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR
Subjt:  SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR

Query:  HPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY
        HPQEGPFGVVINRPL KKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY
Subjt:  HPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY

Query:  AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM
        AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM
Subjt:  AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1FG48 uncharacterized protein LOC111445188 isoform X1 (e value: 1.9e-166, % identity: 75.31)
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPG-------
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN D+SPSPG       
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPG-------

Query:  -----------------------------------NGDHSIPGDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVH
                                           NGDHS+PGDDAKSNN SD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N H
Subjt:  -----------------------------------NGDHSIPGDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVH

Query:  ESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKT
        ESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG+RHPQEGPFGVVINRPLHKKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKT
Subjt:  ESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKT

Query:  GEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSE
        GEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGG YSE
Subjt:  GEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSE

Query:  LSRKPKQDM
        LSRKPKQDM
Subjt:  LSRKPKQDM

A0A6J1FL02 uncharacterized protein LOC111445188 isoform X2 (e value: 1.2e-171, % identity: 83.65)
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN D+SPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP

Query:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        GDDAKSNN SD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPLHKKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1JXJ8 uncharacterized protein LOC111489210 isoform X2 (e value: 2.3e-172, % identity: 83.92)
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN D+SPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIP

Query:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        GDDAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPLHKKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1JZ91 uncharacterized protein LOC111489210 isoform X1 (e value: 3.9e-167, % identity: 75.55)
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPG-------
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN D+SPSPG       
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPG-------

Query:  -----------------------------------NGDHSIPGDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVH
                                           NGDHS+PGDDAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N H
Subjt:  -----------------------------------NGDHSIPGDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVH

Query:  ESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKT
        ESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG+RHPQEGPFGVVINRPLHKKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKT
Subjt:  ESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKT

Query:  GEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSE
        GEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGA    SEGLWEEILQLMGG YSE
Subjt:  GEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGA----SEGLWEEILQLMGGHYSE

Query:  LSRKPKQDM
        LSRKPKQDM
Subjt:  LSRKPKQDM

SwissProt top hits (e value, % identity, alignment)
A1BEV6 UPF0301 protein Cpha266_0885 (e value: 1.2e-19, % identity: 28.19)
Query:  ETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCS--LHFGGPLEASMFLLKTGEKPKLHGFEEVIP
        ++G +L+A+  L     F+RTV+++      H + G  G ++NRP+  K+        +    F +    LH GGP++             + G  E+ P
Subjt:  ETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCS--LHFGGPLEASMFLLKTGEKPKLHGFEEVIP

Query:  GLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
        GL +G     +  + L+  G+++P + RFF+GY+GW   QL EE E   WY+A  S +++   A E +W   ++  GG Y  ++  P+
Subjt:  GLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

B3QMC9 UPF0301 protein Cpar_0662 (e value: 9.9e-19, % identity: 28.72)
Query:  ETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFS--DCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIP
        + G +L+A+  L     F+RTV+L+      H  EG  G ++N+P+  K+        +  + F   D  LH GGP++             +   +EV+P
Subjt:  ETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFS--DCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIP

Query:  GLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
        GL +G     +  + L+  G+++P + RFF+GYAGW   QL++E E   WY A  S+  +     E +W   ++  GG Y  ++  P+
Subjt:  GLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

B4SD86 UPF0301 protein Ppha_2142 (e value: 9.9e-19, % identity: 28.9)
Query:  TFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCS--LHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGA-RNSLDGAAG
        +F+RTV+++      H + G    ++NRP+  K+        +  + F +    LH GGP+E             + G  E++PG+ +G  +N L   + 
Subjt:  TFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCS--LHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGA-RNSLDGAAG

Query:  LVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
        L+  G++ P + RFF+GYAGW   QL  E E   WY A  S +++   A E +W   ++  GG Y  ++  P+
Subjt:  LVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

Q3AQ69 UPF0301 protein Cag_1601 (e value: 1.6e-21, % identity: 30.77)
Query:  FERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKK
        F+RTV+L+      H +EG  G ++NRPL  K++       D+     D  LH GGP++ +           +H  +EV+PG+ +G     D  + L+  
Subjt:  FERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKK

Query:  GILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
        G++ P + RF++GYAGW   QL  E E   WY A  + +++   A E +W   ++  GG Y  ++  P+
Subjt:  GILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

Q3B561 UPF0301 protein Plut_0637 (e value: 5.2e-20, % identity: 30.99)
Query:  FERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASM--FLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLV
        F+RTV+++      H  +G  G ++NRP+  +++       ++     D  LH GGP++++   FL   G+   + G E+++PGL +G      G   L+
Subjt:  FERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASM--FLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLV

Query:  KKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
          G+LKP + RFF+GYAGW   QL  E E   WY A  +  ++  G  E +W   ++  GG Y  ++  P+
Subjt:  KKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

Arabidopsis top hits (e value, % identity, alignment)
AT1G33780.1 Protein of unknown function (DUF179) (e value: 9.2e-121, % identity: 68.01)
Query:  SSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIPGDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFARE
        S P  S+  +  L  LE R    K+  +AS YRS +V A +KK++DDS S        PGD ++ N  S+GNKS ++++ KS  +N DWREFRANLF +E
Subjt:  SSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIPGDDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFARE

Query:  QAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDC
        Q EK +       A  HES+P+GLKWAHPIP PETGCVLVATEKLDG RTF RTVVLLLR+GTRHPQEGPFGVVINRPLHK IKHMK    +LATTFS+C
Subjt:  QAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDC

Query:  SLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLW
        SL+FGGPLEASMFLLKTG+K K+ GFEEV+PGL +G RNSLD AA LVKKG+LKPQ+FRFFVGYAGWQLDQLREEIESDYW+VAACSS+L+CG +SE LW
Subjt:  SLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLW

Query:  EEILQLMGGHYSELSRKPKQDM
        EEILQLMGG YSELSRKPK D+
Subjt:  EEILQLMGGHYSELSRKPKQDM

AT3G19780.1 LOCATED IN: endomembrane system (e value: 1.7e-13, % identity: 28.89)
Query:  NNFSDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEK-VDTD-VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        N   + NK +++SS   ++   D  +     L  RE AE+ V+ D VN QS  +H             P  +TG VLVATEKL    TF ++ +L++++G
Subjt:  NNFSDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEK-VDTD-VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLE----ASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDF
           P+ G  G++ N+ +  K     P+  + A    +  L FGGP+       + L +  +    H   E+ PG+ +    S+      +K   L P ++
Subjt:  TRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLE----ASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDF

Query:  RFFVGYAGWQLDQLREEIESDYWYV
         FF+GY+ W  +QL +EI    W V
Subjt:  RFFVGYAGWQLDQLREEIESDYWYV

AT3G19780.2 LOCATED IN: endomembrane system (e value: 1.7e-13, % identity: 28.89)
Query:  NNFSDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEK-VDTD-VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        N   + NK +++SS   ++   D  +     L  RE AE+ V+ D VN QS  +H             P  +TG VLVATEKL    TF ++ +L++++G
Subjt:  NNFSDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEK-VDTD-VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLE----ASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDF
           P+ G  G++ N+ +  K     P+  + A    +  L FGGP+       + L +  +    H   E+ PG+ +    S+      +K   L P ++
Subjt:  TRHPQEGPFGVVINRPLHKKIKHMKPNNLDLATTFSDCSLHFGGPLE----ASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDF

Query:  RFFVGYAGWQLDQLREEIESDYWYV
         FF+GY+ W  +QL +EI    W V
Subjt:  RFFVGYAGWQLDQLREEIESDYWYV

AT3G29240.1 Protein of unknown function (DUF179) (e value: 8.5e-50, % identity: 46.44)
Query:  DWREFRANLFAREQAEKVDTD----------VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINR
        DWREFRA L A EQA   + D          V+ Q ++      +G KWAH I  PETGC+L+ATEKLDGV  FE+TV+LLL  G      GP GV++NR
Subjt:  DWREFRANLFAREQAEKVDTD----------VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINR

Query:  PLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLK-----TGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQL
        P    IK  K   LD+A TFSD  L FGGPLE  +FL+        E  K   F +V+ GL YG R S+  AA +VK+ ++   + RFF GY GW+ +QL
Subjt:  PLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLK-----TGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQL

Query:  REEIESDYWYVAACSSNLLCGGA---SEGLWEEILQLMG
        + EI   YW VAACSS ++  G+   S GLW+E+L L+G
Subjt:  REEIESDYWYVAACSSNLLCGGA---SEGLWEEILQLMG

AT3G29240.2 Protein of unknown function (DUF179) (e value: 8.5e-50, % identity: 46.44)
Query:  DWREFRANLFAREQAEKVDTD----------VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINR
        DWREFRA L A EQA   + D          V+ Q ++      +G KWAH I  PETGC+L+ATEKLDGV  FE+TV+LLL  G      GP GV++NR
Subjt:  DWREFRANLFAREQAEKVDTD----------VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINR

Query:  PLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLK-----TGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQL
        P    IK  K   LD+A TFSD  L FGGPLE  +FL+        E  K   F +V+ GL YG R S+  AA +VK+ ++   + RFF GY GW+ +QL
Subjt:  PLHKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLK-----TGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQL

Query:  REEIESDYWYVAACSSNLLCGGA---SEGLWEEILQLMG
        + EI   YW VAACSS ++  G+   S GLW+E+L L+G
Subjt:  REEIESDYWYVAACSSNLLCGGA---SEGLWEEILQLMG


Sequences
CDS sequence
ATGGATCTCCTTGCTGTACATGTTAAGAACACCGCCACTCCCAGCCCCAGCCCCCTCTCACCCAATAGACCTATTTCTTCCAGTTTCCTCAAGTCTACCAGAAGATTTTC
TTCTCCACATCCCTCTGCCTTTGGCGCTGAACTTCTCGGCCTCCTTGAGATCCGAGTATTCAGGCCCAAGCTCTGCTCTACCGCTTCTGCCTATCGCTCTTTTCTGGTCA
CAGCGATTGCCAAGAAGAATCACGACGACTCTCCATCTCCTGGAAATGGAGATCACTCAATTCCTGGAGATGATGCTAAATCAAACAATTTTTCTGATGGCAACAAAAGT
AATGAAACTTCTTCCCAGAAATCACATCATATTAATTTGGACTGGCGAGAATTCAGGGCAAACTTATTTGCTCGTGAGCAGGCAGAAAAGGTGGACACCGATGTGAATGT
CCAAAGTGCAAATGTCCACGAGTCTAAACCTCTTGGCCTGAAGTGGGCACATCCTATTCCTATACCAGAGACTGGCTGTGTCCTTGTGGCTACGGAGAAGTTGGATGGTG
TTCGCACTTTTGAGCGAACAGTCGTCCTTCTCCTCAGATCTGGAACCAGACATCCTCAAGAGGGGCCGTTCGGAGTAGTTATTAATCGCCCACTTCACAAAAAGATCAAG
CATATGAAACCAAATAATCTTGACTTGGCAACTACATTTTCTGATTGCTCTCTGCATTTTGGAGGGCCTCTTGAGGCGAGCATGTTTTTGCTGAAAACCGGAGAAAAACC
AAAACTCCATGGCTTTGAAGAAGTGATCCCTGGCCTCTGCTATGGCGCTCGAAACAGCCTCGATGGAGCTGCAGGGCTGGTGAAGAAGGGAATCCTTAAACCTCAGGACT
TCAGATTCTTTGTGGGTTATGCTGGGTGGCAACTGGATCAGTTGAGGGAGGAGATTGAATCAGATTACTGGTATGTGGCTGCTTGTAGCTCAAATCTACTTTGTGGAGGC
GCATCAGAGGGACTGTGGGAGGAGATTTTGCAGTTAATGGGTGGCCACTATTCAGAGTTGAGCAGAAAGCCTAAGCAAGACATG
mRNA sequence
ATGGATCTCCTTGCTGTACATGTTAAGAACACCGCCACTCCCAGCCCCAGCCCCCTCTCACCCAATAGACCTATTTCTTCCAGTTTCCTCAAGTCTACCAGAAGATTTTC
TTCTCCACATCCCTCTGCCTTTGGCGCTGAACTTCTCGGCCTCCTTGAGATCCGAGTATTCAGGCCCAAGCTCTGCTCTACCGCTTCTGCCTATCGCTCTTTTCTGGTCA
CAGCGATTGCCAAGAAGAATCACGACGACTCTCCATCTCCTGGAAATGGAGATCACTCAATTCCTGGAGATGATGCTAAATCAAACAATTTTTCTGATGGCAACAAAAGT
AATGAAACTTCTTCCCAGAAATCACATCATATTAATTTGGACTGGCGAGAATTCAGGGCAAACTTATTTGCTCGTGAGCAGGCAGAAAAGGTGGACACCGATGTGAATGT
CCAAAGTGCAAATGTCCACGAGTCTAAACCTCTTGGCCTGAAGTGGGCACATCCTATTCCTATACCAGAGACTGGCTGTGTCCTTGTGGCTACGGAGAAGTTGGATGGTG
TTCGCACTTTTGAGCGAACAGTCGTCCTTCTCCTCAGATCTGGAACCAGACATCCTCAAGAGGGGCCGTTCGGAGTAGTTATTAATCGCCCACTTCACAAAAAGATCAAG
CATATGAAACCAAATAATCTTGACTTGGCAACTACATTTTCTGATTGCTCTCTGCATTTTGGAGGGCCTCTTGAGGCGAGCATGTTTTTGCTGAAAACCGGAGAAAAACC
AAAACTCCATGGCTTTGAAGAAGTGATCCCTGGCCTCTGCTATGGCGCTCGAAACAGCCTCGATGGAGCTGCAGGGCTGGTGAAGAAGGGAATCCTTAAACCTCAGGACT
TCAGATTCTTTGTGGGTTATGCTGGGTGGCAACTGGATCAGTTGAGGGAGGAGATTGAATCAGATTACTGGTATGTGGCTGCTTGTAGCTCAAATCTACTTTGTGGAGGC
GCATCAGAGGGACTGTGGGAGGAGATTTTGCAGTTAATGGGTGGCCACTATTCAGAGTTGAGCAGAAAGCCTAAGCAAGACATG
Protein sequence
MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDDSPSPGNGDHSIPGDDAKSNNFSDGNKS
NETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLHKKIK
HMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGG
ASEGLWEEILQLMGGHYSELSRKPKQDM
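
The protein sequence above is expected to be the conceptual translation of the CDS sequence. The sketch below shows one way that relationship could be checked, assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used. The `translate` helper is hypothetical, and only the first 30 nucleotides of the CDS are used for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; unrecognised codons become 'X'."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nucleotides of the CDS sequence listed on this page.
cds_start = "ATGGATCTCCTTGCTGTACATGTTAAGAAC"
print(translate(cds_start))  # MDLLAVHVKN
```

Passing the full CDS (with line breaks, which the helper strips) and comparing the result against the full protein sequence would extend the same check to the whole record.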