; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC00g0827 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC00g0827
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionCytochrome c biogenesis FN
Genome locationscaffold247:2089..2657
RNA-Seq ExpressionMC00g0827
SyntenyMC00g0827
Gene Ontology termsGO:0015886 - heme transport (biological process)
GO:0017004 - cytochrome complex assembly (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0015232 - heme transporter activity (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR002541 - Cytochrome c assembly protein
IPR003567 - Cytochrome c-type biogenesis protein
IPR003569 - Probable cytochrome c biosynthesis protein, plants


Homology Show/hide homology
GenBank top hitse value%identityAlignment
NP_001154502.1 Cytochrome C assembly protein [Arabidopsis thaliana]3.25e-10097.4Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERKRPLLR FVGPPARTQWSGWLVVSGSR NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLW FF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        LLMTGISMILF QMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

XP_015584163.1 LOW QUALITY PROTEIN: uncharacterized protein LOC8267610 [Ricinus communis]6.70e-10097.4Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERK PLLR FVGPPARTQWSGWLVVSGSR NASFMPRVLATARIHSVILPLLHSWTSFLNIVT PCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

XP_021601353.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110606713 [Manihot esculenta]1.32e-9996.75Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERK PLLR FVGPPARTQWSGWLVVSGSR NASFMPRVLATARIHSVILPLLHSWTSFLNIVT PCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        LLMTGISMILFSQMKQQASVRRTYKKEMVVARST VHLRHSARAQPRPVMLWKN
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

XP_021666154.1 LOW QUALITY PROTEIN: uncharacterized protein LOC110654457 [Hevea brasiliensis]3.25e-10097.4Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERK PLLR FVGPPARTQWSGWLVVSGSR NASFMPRVLATARIHSVILPLLHSWTSFLNIVT PCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

XP_038899898.1 LOW QUALITY PROTEIN: probable cytochrome c biosynthesis protein [Benincasa hispida]5.39e-9996.1Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERKRPL RFFVGP  RTQWSGWLVVSGSR NASFMPRVLATARIHSVILP LHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        LLMTGISMILFSQMKQQASVRRTYKK MVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

TrEMBL top hitse value%identityAlignment
A0A2U8JDE4 Cytochrome c biogenesis (Fragment)5.92e-7993.18Show/hide
Query:  GWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVRR
        GW       NASFMPRVLATARIHSVILPLLHSWTSFLNIVT PCCVSGTFSIRSGLLAPVHSFATDDTRG FLWRFFLLMTGISMILFSQMKQQASVRR
Subjt:  GWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVRR

Query:  TYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        TYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  TYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

A0A3P6DYH1 Uncharacterized protein9.27e-8484.71Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSRNASFMPRVLATARIHSVI----LPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLW
        MERKRPLLR FVGPPARTQWSGWLVVSGSR   F     A+   HS        LLHSWTSFLNIVTFPCCV GTFSIRSGLLAPVHSFATDDTRGIFLW
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSRNASFMPRVLATARIHSVI----LPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLW

Query:  RFFLLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
         FFLLMTGISMILF QMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  RFFLLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

A0A6P5EER8 LOW QUALITY PROTEIN: uncharacterized protein LOC1097055912.15e-8988.96Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERKRP LR FVGPPARTQWSGWLVVSGSR NASFMPRVLATARIHSVILPL HSWTS LNI+T PCCVSGTFSIRSGLLAPVHS ATDDTRG FLW FF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        LL+TGIS+ LF QMKQQASV RTYKKE+VVARSTLVHLRHSARAQPRP+MLWKN
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

A0A6P8C2J0 LOW QUALITY PROTEIN: uncharacterized protein LOC1161900625.31e-8187.84Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERK  LLR F+GPPARTQWSGWLVVSGSR NASFMPRVLATARIHSVILPLLHS TSFLNIVT  CCV GTFSIRSGLLA VHSFATDDTRGIFLWRFF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRP
        L+MTGISMILFSQMKQQASV RTY+KEMVVARSTLVHLRHS    P P
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRP

F4IL90 Cytochrome C assembly protein1.57e-10097.4Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERKRPLLR FVGPPARTQWSGWLVVSGSR NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLW FF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGSR-NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        LLMTGISMILF QMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

SwissProt top hitse value%identityAlignment
P36180 Probable cytochrome c biosynthesis protein5.7e-3267.33Show/hide
Query:  WSGWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV
        W GW       NASFMP VLATA IHSVILP L+ WT FLN+VTF CC+ GTF +RSGLLA VHSFATD TRGIFLW FFLL+T IS + F +MKQQ+S 
Subjt:  WSGWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV

Query:  R
        +
Subjt:  R

Q04647 Probable cytochrome c biosynthesis protein3.6e-6392.42Show/hide
Query:  GWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVRR
        GW       NASFMPRVLATARIHSVILPLLHSWTSFLNIVT PCCVSGT SIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVRR
Subjt:  GWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVRR

Query:  TYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        TYKKEMV+ARSTLVHLRHSARAQPRPVMLWKN
Subjt:  TYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

Q04648 Probable cytochrome c biosynthesis protein3.0e-5785.07Show/hide
Query:  WSGWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV
        W GW       NASFMP VLATA IHSVILPLLHS T  +NIVTF CCV GTFSIRSGLLA VHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV
Subjt:  WSGWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV

Query:  RRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        RRTY+KEMVVARSTLVHLRHSARAQPRP +LWKN
Subjt:  RRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

Q33884 Cytochrome c biogenesis CcmF N-terminal-like mitochondrial protein 2 (Fragment)2.1e-5886.57Show/hide
Query:  WSGWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV
        W GW       NASFMP VLATA IHSVILP LHSWT FLNIVTF CCV GTFSIRSGLLA VHSFATDDTRGIFLW FFLLMTGI MILF QMKQQASV
Subjt:  WSGWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV

Query:  RRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        RRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  RRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

Q9I3N2 Cytochrome c-type biogenesis protein CcmF1.8e-1443.75Show/hide
Query:  WSGWLVVSGSRNASFMPRVLATARIHSVILP----LLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFS
        W GW       NASFMP ++ TA IHS+ +     +  SWT  L I  F   + GTF +RSG+L  VH+FA+D  RG+F+  F LL+ G S+ LF+
Subjt:  WSGWLVVSGSRNASFMPRVLATARIHSVILP----LLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFS

Arabidopsis top hitse value%identityAlignment
AT2G07768.1 Cytochrome C assembly protein5.2e-8197.4Show/hide
Query:  MERKRPLLRFFVGPPARTQWSGWLVVSGS-RNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF
        MERKRPLLR FVGPPARTQWSGWLVVSGS RNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLW FF
Subjt:  MERKRPLLRFFVGPPARTQWSGWLVVSGS-RNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFF

Query:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        LLMTGISMILF QMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  LLMTGISMILFSQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN

ATMG00960.1 Cytochrome C assembly protein1.2e-6493.18Show/hide
Query:  GWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVRR
        GW       NASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLW FFLLMTGISMILF QMKQQASVRR
Subjt:  GWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVRR

Query:  TYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
        TYKKEMVVARSTLVHLRHSARAQPRPVMLWKN
Subjt:  TYKKEMVVARSTLVHLRHSARAQPRPVMLWKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGAAAGAGACCACTACTTCGCTTCTTTGTTGGACCGCCGGCGCGAACACAGTGGTCGGGGTGGCTGGTGGTTTCGGGATCCCGTAATGCTTCTTTTATGCCTCG
GGTATTAGCCACAGCTCGTATTCATTCAGTCATTTTACCCCTTCTTCATTCTTGGACCTCGTTTCTTAATATTGTGACTTTTCCATGCTGTGTCTCAGGAACCTTTTCAA
TACGGTCCGGATTGCTAGCTCCCGTTCATAGTTTTGCTACAGATGATACACGAGGAATCTTTTTATGGCGGTTCTTCCTTCTAATGACCGGCATATCAATGATTCTTTTC
TCCCAGATGAAGCAGCAGGCATCGGTCCGTAGAACCTATAAAAAAGAGATGGTTGTGGCGCGAAGTACTCTTGTGCACCTACGTCACTCGGCTCGCGCGCAACCCCGCCC
CGTTATGTTATGGAAGAAT
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGAAAGAGACCACTACTTCGCTTCTTTGTTGGACCGCCGGCGCGAACACAGTGGTCGGGGTGGCTGGTGGTTTCGGGATCCCGTAATGCTTCTTTTATGCCTCG
GGTATTAGCCACAGCTCGTATTCATTCAGTCATTTTACCCCTTCTTCATTCTTGGACCTCGTTTCTTAATATTGTGACTTTTCCATGCTGTGTCTCAGGAACCTTTTCAA
TACGGTCCGGATTGCTAGCTCCCGTTCATAGTTTTGCTACAGATGATACACGAGGAATCTTTTTATGGCGGTTCTTCCTTCTAATGACCGGCATATCAATGATTCTTTTC
TCCCAGATGAAGCAGCAGGCATCGGTCCGTAGAACCTATAAAAAAGAGATGGTTGTGGCGCGAAGTACTCTTGTGCACCTACGTCACTCGGCTCGCGCGCAACCCCGCCC
CGTTATGTTATGGAAGAAT
Protein sequenceShow/hide protein sequence
MERKRPLLRFFVGPPARTQWSGWLVVSGSRNASFMPRVLATARIHSVILPLLHSWTSFLNIVTFPCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILF
SQMKQQASVRRTYKKEMVVARSTLVHLRHSARAQPRPVMLWKN