CuGenDBv2

MS002520 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS002520
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Cytochrome oxidase assembly protein
Genome location: scaffold318:772690..776409
RNA-Seq Expression: MS002520
Synteny: MS002520
Gene Ontology terms: NA
InterPro domains: IPR014807 - Cytochrome c oxidase assembly factor 1


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039892.1 Vacuolar sorting-associated protein 62 [Cucumis melo var. makuwa] (e value: 4.7e-77, % identity: 84.52)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        S+SIKP+E  VNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA++DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQG GI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTD
        LQLKAVR+GE+SWISF+RPR+WDIL+MDALL+VP N+GK++TLRINLTEKF PAACV+   CQPP+T+
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTD

XP_022157492.1 uncharacterized protein LOC111024183 isoform X1 [Momordica charantia] (e value: 1.9e-91, % identity: 100)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
        LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH

XP_022157493.1 uncharacterized protein LOC111024183 isoform X2 [Momordica charantia] (e value: 1.9e-91, % identity: 100)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
        LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH

XP_038876218.1 uncharacterized protein LOC120068500 isoform X1 [Benincasa hispida] (e value: 2.0e-80, % identity: 87.57)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKP+E+GVNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTDT
        LQLKAVR+GE+SWISF+RPR+WDILIMDALLHVPAN+GK++T+RINLTEKF PAACV+   CQPP+T+T
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTDT

XP_038876219.1 uncharacterized protein LOC120068500 isoform X2 [Benincasa hispida] (e value: 2.0e-80, % identity: 87.57)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKP+E+GVNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA+ DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTDT
        LQLKAVR+GE+SWISF+RPR+WDILIMDALLHVPAN+GK++T+RINLTEKF PAACV+   CQPP+T+T
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTDT

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CBF0 uncharacterized protein LOC103498925 isoform X1 (e value: 2.3e-77, % identity: 84.52)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        S+SIKP+E  VNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA++DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQG GI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTD
        LQLKAVR+GE+SWISF+RPR+WDIL+MDALL+VP N+GK++TLRINLTEKF PAACV+   CQPP+T+
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTD

A0A5D3DMW4 Vacuolar sorting-associated protein 62 (e value: 2.3e-77, % identity: 84.52)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        S+SIKP+E  VNK WGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQA++DAIGEPI KGPWYNASLAVAHKRHSLSCTFPVSGPQG GI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTD
        LQLKAVR+GE+SWISF+RPR+WDIL+MDALL+VP N+GK++TLRINLTEKF PAACV+   CQPP+T+
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVA---CQPPDTD

A0A6J1DTH6 uncharacterized protein LOC111024183 isoform X2 (e value: 9.4e-92, % identity: 100)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
        LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH

A0A6J1DUL5 uncharacterized protein LOC111024183 isoform X1 (e value: 9.4e-92, % identity: 100)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
        LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH

A0A6J1JLR1 uncharacterized protein LOC111488050 isoform X2 (e value: 6.6e-77, % identity: 87.58)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        SSS++P+EEGVNK WGRKAVSFVL+TVTGGVALSALDDLAIYHSCSSKAIEKA+NN+A+ DAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAA--CVAC
        +QLKAVR+GE+SWISF+RPR+WDILIMDALLHVP N+GK+QTLRINLTEKF PAA  CVAC
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAA--CVAC

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G20390.1 unknown protein (e value: 1.7e-56, % identity: 66.88)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        S+S      G    +GRKAVSFVLITVTGGVALSALDDL+IY  CSSKA+EK  N++ + +AIGEPI KGPWYNASLAV+H+RHS+SC+FPV GPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPA
        L LKAVR+GE+S   F++ R+WDILIMDAL+HVP+N+G +QTLRIN+T+   P+
Subjt:  LQLKAVRSGEESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPA

AT2G20390.2 unknown protein (e value: 4.7e-51, % identity: 54.21)
Query:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI
        S+S      G    +GRKAVSFVLITVTGGVALSALDDL+IY  CSSKA+EK  N++ + +AIGEPI KGPWYNASLAV+H+RHS+SC+FPV GPQGTGI
Subjt:  SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGI

Query:  LQLKAVRSG------------------------------------EESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPA
        L LKAVR+G                                    E+S   F++ R+WDILIMDAL+HVP+N+G +QTLRIN+T+   P+
Subjt:  LQLKAVRSG------------------------------------EESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPA


Sequences
CDS sequence
AGCAGTTCCATAAAGCCAGCGGAAGAAGGGGTGAATAAATTCTGGGGTCGTAAGGCAGTTTCGTTTGTACTTATTACTGTTACTGGTGGTGTTGCTTTGAGTGCTTTAGA
TGATCTTGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAACCAAGCACTTAGAGATGCTATTGGGGAACCCATTGTTAAAGGTCCATGGTACA
ATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTATCCTGCACGTTTCCTGTGTCAGGACCACAGGGCACTGGGATCCTCCAATTGAAGGCAGTTCGTAGTGGAGAG
GAGTCCTGGATCTCTTTTGTCCGGCCACGTGAGTGGGACATTCTAATCATGGATGCACTTCTACATGTTCCTGCAAACCAAGGGAAGGAGCAGACATTGCGCATCAATCT
CACTGAGAAATTTCCTCCCGCTGCCTGCGTCGCTTGTCAGCCTCCAGACACAGACACACAC
mRNA sequence
AGCAGTTCCATAAAGCCAGCGGAAGAAGGGGTGAATAAATTCTGGGGTCGTAAGGCAGTTTCGTTTGTACTTATTACTGTTACTGGTGGTGTTGCTTTGAGTGCTTTAGA
TGATCTTGCCATTTATCATAGCTGTAGCAGCAAAGCCATAGAGAAAGCCAGAAACAACCAAGCACTTAGAGATGCTATTGGGGAACCCATTGTTAAAGGTCCATGGTACA
ATGCATCACTTGCAGTAGCTCATAAGAGACATTCTCTATCCTGCACGTTTCCTGTGTCAGGACCACAGGGCACTGGGATCCTCCAATTGAAGGCAGTTCGTAGTGGAGAG
GAGTCCTGGATCTCTTTTGTCCGGCCACGTGAGTGGGACATTCTAATCATGGATGCACTTCTACATGTTCCTGCAAACCAAGGGAAGGAGCAGACATTGCGCATCAATCT
CACTGAGAAATTTCCTCCCGCTGCCTGCGTCGCTTGTCAGCCTCCAGACACAGACACACAC
Protein sequence
SSSIKPAEEGVNKFWGRKAVSFVLITVTGGVALSALDDLAIYHSCSSKAIEKARNNQALRDAIGEPIVKGPWYNASLAVAHKRHSLSCTFPVSGPQGTGILQLKAVRSGE
ESWISFVRPREWDILIMDALLHVPANQGKEQTLRINLTEKFPPAACVACQPPDTDTH
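
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the CuGenDB record: the function name `translate` and the constants are hypothetical, and it assumes the standard nuclear code and that the reading frame starts at the first base of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0 (hypothetical helper).

    Whitespace and line breaks are removed, so the wrapped CDS lines can be
    pasted as-is. Codons with unknown bases become 'X'; stop codons become
    '*'. Trailing bases that do not form a full codon are ignored.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases of the CDS above.
print(translate("AGCAGTTCCATAAAGCCAGCGGAAGAAGGG"))  # SSSIKPAEEG
```

Passing the full CDS to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two fields of the record agree.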