; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g2009 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g2009
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUPF0301 protein
Genome locationMC04:26806176..26811803
RNA-Seq ExpressionMC04g2009
SyntenyMC04g2009
Gene Ontology termsNA
InterPro domainsIPR003774 - Protein of unknown function UPF0301


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578309.1 hypothetical protein SDJN03_22757, partial [Cucurbita argyrosperma subsp. sororia]1.18e-21483.11Show/hide
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN DNSPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP

Query:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        G+DAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D+++QS N HES+ LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPL KKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGAS    EGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM

XP_022152489.1 uncharacterized protein LOC111020207 [Momordica charantia]2.70e-262100Show/hide
Query:  MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAK
        MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAK
Subjt:  MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAK

Query:  SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR
        SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR
Subjt:  SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR

Query:  HPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY
        HPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY
Subjt:  HPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY

Query:  AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM
        AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM
Subjt:  AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM

XP_022939198.1 uncharacterized protein LOC111445188 isoform X2 [Cucurbita moschata]2.38e-21483.38Show/hide
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN DNSPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP

Query:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        G+DAKSNN SD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPL KKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGAS    EGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM

XP_022993090.1 uncharacterized protein LOC111489210 isoform X2 [Cucurbita maxima]2.90e-21583.65Show/hide
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN DNSPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP

Query:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        G+DAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPL KKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGAS    EGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM

XP_023550640.1 uncharacterized protein LOC111808722 isoform X2 [Cucurbita pepo subsp. pepo]4.12e-21583.38Show/hide
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN DNSPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP

Query:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        G+DAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D+++QS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPL KKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGAS    EGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM

TrEMBL top hitse value%identityAlignment
A0A6J1DG55 uncharacterized protein LOC1110202071.31e-262100Show/hide
Query:  MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAK
        MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAK
Subjt:  MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAK

Query:  SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR
        SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR
Subjt:  SNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTR

Query:  HPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY
        HPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY
Subjt:  HPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGY

Query:  AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM
        AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM
Subjt:  AGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1FG48 uncharacterized protein LOC111445188 isoform X13.81e-20775.06Show/hide
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPG-------
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN DNSPSPG       
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPG-------

Query:  -----------------------------------NGDHSIPGEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVH
                                           NGDHS+PG+DAKSNN SD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N H
Subjt:  -----------------------------------NGDHSIPGEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVH

Query:  ESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKT
        ESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG+RHPQEGPFGVVINRPL KKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKT
Subjt:  ESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKT

Query:  GEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSE
        GEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGAS    EGLWEEILQLMGG YSE
Subjt:  GEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSE

Query:  LSRKPKQDM
        LSRKPKQDM
Subjt:  LSRKPKQDM

A0A6J1FL02 uncharacterized protein LOC111445188 isoform X21.15e-21483.38Show/hide
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN DNSPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP

Query:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        G+DAKSNN SD +KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPL KKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGAS    EGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1JXJ8 uncharacterized protein LOC111489210 isoform X21.40e-21583.65Show/hide
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN DNSPSP NGDHS+P
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIP

Query:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL
        G+DAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N HESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLL
Subjt:  GEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLL

Query:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR
        RSG+RHPQEGPFGVVINRPL KKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKTGEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFR
Subjt:  RSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFR

Query:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM
        FFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGAS    EGLWEEILQLMGG YSELSRKPKQDM
Subjt:  FFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSELSRKPKQDM

A0A6J1JZ91 uncharacterized protein LOC111489210 isoform X14.65e-20875.31Show/hide
Query:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPG-------
        MDL AV+VKNTATP P P S     P+RPIS S  K +RRFS  HP  FGA+LL LLEIRVFRP++CS  S  RSFLV AIAKKN DNSPSPG       
Subjt:  MDLLAVHVKNTATPSPSPLS-----PNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPG-------

Query:  -----------------------------------NGDHSIPGEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVH
                                           NGDHS+PG+DAKSNN SDG+KSNETSS+K+HHINLDWREFRANLF+REQAEKV+ D++VQS N H
Subjt:  -----------------------------------NGDHSIPGEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVH

Query:  ESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKT
        ESK LGLKWAHPIP+PETGCVLVATEKLDGVRTFERTV+LLLRSG+RHPQEGPFGVVINRPL KKIKHMKP NLDLATTFSDCSLHFGGPLEASMFLLKT
Subjt:  ESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKT

Query:  GEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSE
        GEK KLHGFEEVIPGLC+GARNSLD AA LVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNL+CGGAS    EGLWEEILQLMGG YSE
Subjt:  GEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGAS----EGLWEEILQLMGGHYSE

Query:  LSRKPKQDM
        LSRKPKQDM
Subjt:  LSRKPKQDM

SwissProt top hitse value%identityAlignment
A1BEV6 UPF0301 protein Cpha266_08856.8e-2028.19Show/hide
Query:  ETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCS--LHFGGPLEASMFLLKTGEKPKLHGFEEVIP
        ++G +L+A+  L     F+RTV+++      H + G  G ++NRP++ K+        +    F +    LH GGP++             + G  E+ P
Subjt:  ETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCS--LHFGGPLEASMFLLKTGEKPKLHGFEEVIP

Query:  GLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
        GL +G     +  + L+  G+++P + RFF+GY+GW   QL EE E   WY+A  S +++   A E +W   ++  GG Y  ++  P+
Subjt:  GLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

B3QMC9 UPF0301 protein Cpar_06625.8e-1928.72Show/hide
Query:  ETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFS--DCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIP
        + G +L+A+  L     F+RTV+L+      H  EG  G ++N+P++ K+        +  + F   D  LH GGP++             +   +EV+P
Subjt:  ETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFS--DCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIP

Query:  GLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
        GL +G     +  + L+  G+++P + RFF+GYAGW   QL++E E   WY A  S+  +     E +W   ++  GG Y  ++  P+
Subjt:  GLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

B4SD86 UPF0301 protein Ppha_21425.8e-1928.9Show/hide
Query:  TFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCS--LHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGA-RNSLDGAAG
        +F+RTV+++      H + G    ++NRP++ K+        +  + F +    LH GGP+E             + G  E++PG+ +G  +N L   + 
Subjt:  TFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCS--LHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGA-RNSLDGAAG

Query:  LVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
        L+  G++ P + RFF+GYAGW   QL  E E   WY A  S +++   A E +W   ++  GG Y  ++  P+
Subjt:  LVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

Q3AQ69 UPF0301 protein Cag_16019.5e-2230.77Show/hide
Query:  FERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKK
        F+RTV+L+      H +EG  G ++NRPL+ K++       D+     D  LH GGP++ +           +H  +EV+PG+ +G     D  + L+  
Subjt:  FERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKK

Query:  GILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
        G++ P + RF++GYAGW   QL  E E   WY A  + +++   A E +W   ++  GG Y  ++  P+
Subjt:  GILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

Q3B561 UPF0301 protein Plut_06373.1e-2030.99Show/hide
Query:  FERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASM--FLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLV
        F+RTV+++      H  +G  G ++NRP++ +++       ++     D  LH GGP++++   FL   G+   + G E+++PGL +G      G   L+
Subjt:  FERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASM--FLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLV

Query:  KKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK
          G+LKP + RFF+GYAGW   QL  E E   WY A  +  ++  G  E +W   ++  GG Y  ++  P+
Subjt:  KKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLWEEILQLMGGHYSELSRKPK

Arabidopsis top hitse value%identityAlignment
AT1G33780.1 Protein of unknown function (DUF179)6.6e-11967.08Show/hide
Query:  SSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFARE
        S P  S+  +  L  LE R    K+  +AS YRS +V A +KK++D+S SPG+         ++ N  S+GNKS ++++ KS  +N DWREFRANLF +E
Subjt:  SSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAKSNNFSDGNKSNETSSQKSHHINLDWREFRANLFARE

Query:  QAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDC
        Q EK +       A  HES+P+GLKWAHPIP PETGCVLVATEKLDG RTF RTVVLLLR+GTRHPQEGPFGVVINRPL K IKHMK    +LATTFS+C
Subjt:  QAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDC

Query:  SLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLW
        SL+FGGPLEASMFLLKTG+K K+ GFEEV+PGL +G RNSLD AA LVKKG+LKPQ+FRFFVGYAGWQLDQLREEIESDYW+VAACSS+L+CG +SE LW
Subjt:  SLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGGASEGLW

Query:  EEILQLMGGHYSELSRKPKQDM
        EEILQLMGG YSELSRKPK D+
Subjt:  EEILQLMGGHYSELSRKPKQDM

AT3G19780.1 LOCATED IN: endomembrane system1.3e-1328.89Show/hide
Query:  NNFSDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEK-VDTD-VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        N   + NK +++SS   ++   D  +     L  RE AE+ V+ D VN QS  +H             P  +TG VLVATEKL    TF ++ +L++++G
Subjt:  NNFSDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEK-VDTD-VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLE----ASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDF
           P+ G  G++ N+ ++ K     P+  + A    +  L FGGP+       + L +  +    H   E+ PG+ +    S+      +K   L P ++
Subjt:  TRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLE----ASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDF

Query:  RFFVGYAGWQLDQLREEIESDYWYV
         FF+GY+ W  +QL +EI    W V
Subjt:  RFFVGYAGWQLDQLREEIESDYWYV

AT3G19780.2 LOCATED IN: endomembrane system1.3e-1328.89Show/hide
Query:  NNFSDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEK-VDTD-VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG
        N   + NK +++SS   ++   D  +     L  RE AE+ V+ D VN QS  +H             P  +TG VLVATEKL    TF ++ +L++++G
Subjt:  NNFSDGNKSNETSSQKSHHINLDWREF-RANLFAREQAEK-VDTD-VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSG

Query:  TRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLE----ASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDF
           P+ G  G++ N+ ++ K     P+  + A    +  L FGGP+       + L +  +    H   E+ PG+ +    S+      +K   L P ++
Subjt:  TRHPQEGPFGVVINRPLQKKIKHMKPNNLDLATTFSDCSLHFGGPLE----ASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDF

Query:  RFFVGYAGWQLDQLREEIESDYWYV
         FF+GY+ W  +QL +EI    W V
Subjt:  RFFVGYAGWQLDQLREEIESDYWYV

AT3G29240.1 Protein of unknown function (DUF179)6.5e-5046.44Show/hide
Query:  DWREFRANLFAREQAEKVDTD----------VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINR
        DWREFRA L A EQA   + D          V+ Q ++      +G KWAH I  PETGC+L+ATEKLDGV  FE+TV+LLL  G      GP GV++NR
Subjt:  DWREFRANLFAREQAEKVDTD----------VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINR

Query:  PLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLK-----TGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQL
        P    IK  K   LD+A TFSD  L FGGPLE  +FL+        E  K   F +V+ GL YG R S+  AA +VK+ ++   + RFF GY GW+ +QL
Subjt:  PLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLK-----TGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQL

Query:  REEIESDYWYVAACSSNLLCGGA---SEGLWEEILQLMG
        + EI   YW VAACSS ++  G+   S GLW+E+L L+G
Subjt:  REEIESDYWYVAACSSNLLCGGA---SEGLWEEILQLMG

AT3G29240.2 Protein of unknown function (DUF179)6.5e-5046.44Show/hide
Query:  DWREFRANLFAREQAEKVDTD----------VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINR
        DWREFRA L A EQA   + D          V+ Q ++      +G KWAH I  PETGC+L+ATEKLDGV  FE+TV+LLL  G      GP GV++NR
Subjt:  DWREFRANLFAREQAEKVDTD----------VNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINR

Query:  PLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLK-----TGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQL
        P    IK  K   LD+A TFSD  L FGGPLE  +FL+        E  K   F +V+ GL YG R S+  AA +VK+ ++   + RFF GY GW+ +QL
Subjt:  PLQKKIKHMKPNNLDLATTFSDCSLHFGGPLEASMFLLK-----TGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQL

Query:  REEIESDYWYVAACSSNLLCGGA---SEGLWEEILQLMG
        + EI   YW VAACSS ++  G+   S GLW+E+L L+G
Subjt:  REEIESDYWYVAACSSNLLCGGA---SEGLWEEILQLMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTCCTTGCTGTACATGTTAAGAACACCGCCACTCCCAGCCCCAGCCCCCTCTCACCCAATAGACCTATTTCTTCCAGTTTCCTCAAGTCTACCAGAAGATTTTC
TTCTCCACATCCCTCTGCCTTTGGCGCTGAACTTCTCGGCCTCCTTGAGATCCGAGTATTCAGGCCCAAGCTCTGCTCTACCGCTTCTGCCTATCGCTCTTTTCTGGTCA
CAGCGATTGCCAAGAAGAATCACGACAACTCTCCATCTCCTGGAAATGGAGATCACTCAATTCCTGGAGAAGACGCTAAATCAAACAATTTTTCTGATGGCAACAAAAGT
AATGAAACTTCTTCCCAGAAATCACATCATATTAATTTGGACTGGCGAGAATTCAGGGCAAACTTATTTGCTCGTGAGCAGGCAGAAAAGGTGGACACCGATGTGAATGT
CCAAAGTGCAAATGTCCACGAGTCTAAACCTCTTGGCCTGAAGTGGGCACATCCTATTCCTATACCAGAGACTGGCTGTGTCCTTGTTGCTACGGAGAAGTTGGATGGTG
TTCGCACTTTTGAGCGAACAGTCGTCCTTCTCCTCAGATCTGGAACCAGACATCCTCAAGAGGGGCCGTTCGGAGTAGTTATTAATCGCCCACTTCAAAAAAAGATCAAG
CATATGAAACCAAATAATCTTGACTTGGCAACTACATTTTCTGATTGCTCTCTGCATTTTGGAGGGCCTCTTGAGGCGAGCATGTTTTTGCTGAAAACCGGAGAAAAACC
AAAACTCCATGGCTTTGAAGAAGTGATCCCTGGCCTCTGCTATGGCGCTCGAAACAGCCTCGATGGAGCTGCAGGGCTGGTGAAGAAGGGAATCCTTAAACCTCAGGACT
TCAGATTCTTTGTGGGTTATGCTGGGTGGCAACTGGATCAGTTGAGGGAGGAGATTGAATCAGATTACTGGTATGTGGCTGCTTGTAGCTCAAATCTACTTTGTGGAGGC
GCATCAGAGGGACTGTGGGAGGAGATTTTGCAGTTAATGGGTGGCCACTATTCAGAGTTGAGCAGAAAGCCTAAGCAAGACATGTAG
mRNA sequenceShow/hide mRNA sequence
GACGGCTTAACCTCACTCGGTAACTCCCCCACTCGCAGTCTCACTGTATGACTTCGCCCACACCTCACCTCACCAAACCAACATAATTCCGTTAAATTATGGCATATTCT
CTCTCTCCCTCTGCGTCGTTTTCATCTCTCTCTACAAGTCTGTTAGCTGCTGATTGTTTGGCTTTTAATCTTCAATATTCTCTCGTTTCCCTTTTTCTATCTTCTTCTCT
CGGGTGGATCATCGTTTCCGACGCCCACGAAATTGATAATCGGATATGGATCTCCTTGCTGTACATGTTAAGAACACCGCCACTCCCAGCCCCAGCCCCCTCTCACCCAA
TAGACCTATTTCTTCCAGTTTCCTCAAGTCTACCAGAAGATTTTCTTCTCCACATCCCTCTGCCTTTGGCGCTGAACTTCTCGGCCTCCTTGAGATCCGAGTATTCAGGC
CCAAGCTCTGCTCTACCGCTTCTGCCTATCGCTCTTTTCTGGTCACAGCGATTGCCAAGAAGAATCACGACAACTCTCCATCTCCTGGAAATGGAGATCACTCAATTCCT
GGAGAAGACGCTAAATCAAACAATTTTTCTGATGGCAACAAAAGTAATGAAACTTCTTCCCAGAAATCACATCATATTAATTTGGACTGGCGAGAATTCAGGGCAAACTT
ATTTGCTCGTGAGCAGGCAGAAAAGGTGGACACCGATGTGAATGTCCAAAGTGCAAATGTCCACGAGTCTAAACCTCTTGGCCTGAAGTGGGCACATCCTATTCCTATAC
CAGAGACTGGCTGTGTCCTTGTTGCTACGGAGAAGTTGGATGGTGTTCGCACTTTTGAGCGAACAGTCGTCCTTCTCCTCAGATCTGGAACCAGACATCCTCAAGAGGGG
CCGTTCGGAGTAGTTATTAATCGCCCACTTCAAAAAAAGATCAAGCATATGAAACCAAATAATCTTGACTTGGCAACTACATTTTCTGATTGCTCTCTGCATTTTGGAGG
GCCTCTTGAGGCGAGCATGTTTTTGCTGAAAACCGGAGAAAAACCAAAACTCCATGGCTTTGAAGAAGTGATCCCTGGCCTCTGCTATGGCGCTCGAAACAGCCTCGATG
GAGCTGCAGGGCTGGTGAAGAAGGGAATCCTTAAACCTCAGGACTTCAGATTCTTTGTGGGTTATGCTGGGTGGCAACTGGATCAGTTGAGGGAGGAGATTGAATCAGAT
TACTGGTATGTGGCTGCTTGTAGCTCAAATCTACTTTGTGGAGGCGCATCAGAGGGACTGTGGGAGGAGATTTTGCAGTTAATGGGTGGCCACTATTCAGAGTTGAGCAG
AAAGCCTAAGCAAGACATGTAGCTATTTATTTAAACTTTAAAGGAATATTGAGAAATCTGAAGTGAGATGTGTAATCTAAGTCAAGTTAAAGCCATAAAAGCTGAAAGCC
AAATGTACAGAACTTAGCTGGCTTTCGAAATTTAGCAGTAAGGGACAACAACCACCTCTTAACTATTTATAAATGTAATTTATGTAGAGCATATTCTCATTTCCCTGATT
CAATAGCTTTAGACTCTGTCTACTGTCTACCTCTAATAATAACTTCAATTTTTTTTGTTATTTTTCCATTTAAAATTAAACTTCTTT
Protein sequenceShow/hide protein sequence
MDLLAVHVKNTATPSPSPLSPNRPISSSFLKSTRRFSSPHPSAFGAELLGLLEIRVFRPKLCSTASAYRSFLVTAIAKKNHDNSPSPGNGDHSIPGEDAKSNNFSDGNKS
NETSSQKSHHINLDWREFRANLFAREQAEKVDTDVNVQSANVHESKPLGLKWAHPIPIPETGCVLVATEKLDGVRTFERTVVLLLRSGTRHPQEGPFGVVINRPLQKKIK
HMKPNNLDLATTFSDCSLHFGGPLEASMFLLKTGEKPKLHGFEEVIPGLCYGARNSLDGAAGLVKKGILKPQDFRFFVGYAGWQLDQLREEIESDYWYVAACSSNLLCGG
ASEGLWEEILQLMGGHYSELSRKPKQDM