CuGenDBv2

MC09g0015 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g0015
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Vacuolar sorting-associated protein 62
Genome location: MC09:155473..159187
RNA-Seq Expression: MC09g0015
Synteny: MC09g0015
Gene Ontology terms: NA
InterPro domains: IPR014807 - Cytochrome c oxidase assembly factor 1


Homology
GenBank top hits (e value, % identity, alignment)
KAG7014597.1 hypothetical protein SDJN02_24776 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 8.58e-100; identity: 87.04%)
Query:  CSSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTG
        CSSS++P+EEGVNK WGRKAVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+ DAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTG
Subjt:  CSSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTG

Query:  ILQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAA--CVAC
        I+QLKAVR+GE+SWISF+RPR+WDILIMDALLHVP N+GK+QT+RINLTEKF PAA  CVAC
Subjt:  ILQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAA--CVAC

XP_022157492.1 uncharacterized protein LOC111024183 isoform X1 [Momordica charantia] (e value: 3.30e-116; identity: 100%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
        LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD

XP_022157493.1 uncharacterized protein LOC111024183 isoform X2 [Momordica charantia] (e value: 3.18e-116; identity: 100%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
        LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD

XP_038876218.1 uncharacterized protein LOC120068500 isoform X1 [Benincasa hispida] (e value: 4.40e-103; identity: 87.5%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKP+E+GVNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVAC---QPPDTD
        LQLKAVR+GE+SWISF+RPR+WDILIMDALLHVPAN+GK++T+RINLTEKF PAACV+C   QPP+T+
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVAC---QPPDTD

XP_038876219.1 uncharacterized protein LOC120068500 isoform X2 [Benincasa hispida] (e value: 4.25e-103; identity: 87.5%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKP+E+GVNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVAC---QPPDTD
        LQLKAVR+GE+SWISF+RPR+WDILIMDALLHVPAN+GK++T+RINLTEKF PAACV+C   QPP+T+
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVAC---QPPDTD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CBF0 uncharacterized protein LOC103498925 isoform X1 (e value: 1.31e-99; identity: 83.04%)
Query:  FTCSSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQG
        +  S+SIKP+E  VNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA++DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQG
Subjt:  FTCSSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQG

Query:  TGILQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVAC---QPPDTD
         GILQLKAVR+GE+SWISF+RPR+WDIL+MDALL+VP N+GK++TLRINLTEKF PAACV+C   QPP+T+
Subjt:  TGILQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVAC---QPPDTD

A0A1S3CBW1 uncharacterized protein LOC103498925 isoform X2 (e value: 3.70e-99; identity: 84.43%)
Query:  SSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIL
        +SIKP+E  VNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA++DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQG GIL
Subjt:  SSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGIL

Query:  QLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVAC---QPPDTD
        QLKAVR+GE+SWISF+RPR+WDIL+MDALL+VP N+GK++TLRINLTEKF PAACV+C   QPP+T+
Subjt:  QLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVAC---QPPDTD

A0A6J1DTH6 uncharacterized protein LOC111024183 isoform X2 (e value: 1.54e-116; identity: 100%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
        LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD

A0A6J1DUL5 uncharacterized protein LOC111024183 isoform X1 (e value: 1.60e-116; identity: 100%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
        LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD

A0A6J1JLR1 uncharacterized protein LOC111488050 isoform X2 (e value: 4.08e-99; identity: 87.58%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSS++P+EEGVNK WGRKAVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+ DAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAA--CVAC
        +QLKAVR+GE+SWISF+RPR+WDILIMDALLHVP N+GK+QTLRINLTEKF PAA  CVAC
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAA--CVAC

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G20390.1 unknown protein (e value: 1.3e-56; identity: 66.88%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        S+S      G    +GRKAVSFVLITVTGGVALSALDDL+IY  CSSKA+EK  N++ + +AIGEPI KGPWYNASLAV+H+RHS+SC+FPV GPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPA
        L LKAVR+GE+S   F++ R+WDILIMDAL+HVP+N+G +QTLRIN+T+   P+
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPA

AT2G20390.2 unknown protein (e value: 3.6e-51; identity: 54.21%)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        S+S      G    +GRKAVSFVLITVTGGVALSALDDL+IY  CSSKA+EK  N++ + +AIGEPI KGPWYNASLAV+H+RHS+SC+FPV GPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSG------------------------------------EESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPA
        L LKAVR+G                                    E+S   F++ R+WDILIMDAL+HVP+N+G +QTLRIN+T+   P+
Subjt:  LQLKAVRSG------------------------------------EESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPA


Sequences
CDS sequence
TTTACTTGCAGCAGTTCCATAAAGCCAGCGGAAGAAGGGGTGAATAAATTCTGGGGTCGTAAGGCAGTTTCGTTTGTACTTATTACTGTTACTGGTGGTGTTGCTTTGAG
TGCTTTAGATGATCTTGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAACCAAGCACTTAGAGATGCTATTGGGGAACCCATTGTTAAAGGTC
CATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTATCCTGCACGTTTCCTGTGTCAGGACCACAGGGCACTGGGATCCTCCAATTGAAGGCAGTTCGT
AGTGGAGAGGAGTCCTGGATCTCTTTTGTCCGGCCACGTGAGTGGGACATTCTAATCATGGATGCACTTCTACATGTTCCTGCAAACCAAGGGAAGGAGCAGACATTGCG
CATCAATCTCACTGAGAAATTTCCTCCCGCTGCCTGCGTCGCTTGTCAGCCTCCAGACACAGAC
mRNA sequence
TTTACTTGCAGCAGTTCCATAAAGCCAGCGGAAGAAGGGGTGAATAAATTCTGGGGTCGTAAGGCAGTTTCGTTTGTACTTATTACTGTTACTGGTGGTGTTGCTTTGAG
TGCTTTAGATGATCTTGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAACCAAGCACTTAGAGATGCTATTGGGGAACCCATTGTTAAAGGTC
CATGGTACAATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTATCCTGCACGTTTCCTGTGTCAGGACCACAGGGCACTGGGATCCTCCAATTGAAGGCAGTTCGT
AGTGGAGAGGAGTCCTGGATCTCTTTTGTCCGGCCACGTGAGTGGGACATTCTAATCATGGATGCACTTCTACATGTTCCTGCAAACCAAGGGAAGGAGCAGACATTGCG
CATCAATCTCACTGAGAAATTTCCTCCCGCTGCCTGCGTCGCTTGTCAGCCTCCAGACACAGAC
Protein sequence
FTCSSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVR
SGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTD
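
The CDS and protein sequences listed above can be cross-checked by translating the CDS in frame. The following is a minimal Python sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene model; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. Only the first 18 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; unknown codons become 'X', stops become '*'."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First six codons of the CDS sequence listed for MC09g0015.
print(translate("TTTACTTGCAGCAGTTCC"))  # FTCSSS
```

Passing the full CDS, with its line breaks, to `translate` should give a string that can be compared directly against the protein sequence once the line breaks in the latter are removed.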