; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C09G177251 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C09G177251
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionCytochrome c biogenesis FN
Genome locationCla97Chr09:26969533..26973962
RNA-Seq ExpressionCla97C09G177251
SyntenyCla97C09G177251
Gene Ontology termsGO:0015886 - heme transport (biological process)
GO:0017004 - cytochrome complex assembly (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0015232 - heme transporter activity (molecular function)
GO:0020037 - heme binding (molecular function)
InterPro domainsIPR003567 - Cytochrome c-type biogenesis protein
IPR003569 - Probable cytochrome c biosynthesis protein, plants


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ATW72861.1 ccmFN [Solanum commersonii]5.3e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

QEM01536.1 cytochrome C biogenesis factor N [Solanum tuberosum]5.3e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

YP_009049706.1 cytochrome c maturation protein CcmFN [Capsicum annuum]5.3e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

YP_009430473.1 cytochrome c maturation protein CcmFN [Solanum lycopersicum]5.3e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

YP_009682049.1 cytochrome c biogenesis FN [Physochlaina orientalis]5.3e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

TrEMBL top hitse value%identityAlignment
A0A0C5B9X0 Cytochrome c biogenesis FN2.6e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

A0A290WKZ9 CcmFn2.6e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

A0A5C1H778 Cytochrome C biogenesis factor N2.6e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

A0A7S9H4F1 Cytochrome c biogenesis factor N2.6e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

A0A7S9H578 Cytochrome c biogenesis factor N2.6e-5085.27Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG
        ARSTLVH  L    +  K   +VF+ C G
Subjt:  ARSTLVHLCLHDYLKNVKPHEVVFEKCHG

SwissProt top hitse value%identityAlignment
P36180 Probable cytochrome c biosynthesis protein1.8e-2973.03Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQAS
        ENASFMP VLAT  IHSVILP L+ WT FLN+VTF CC+ GTF +RSGLLA VHSFATD TRGIFLW FFLL+T IS + F +MKQQ+S
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQAS

Q04647 Probable cytochrome c biosynthesis protein4.2e-5094.44Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVT  CCVSGT SIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTYKKEMV+
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHL
        ARSTLVHL
Subjt:  ARSTLVHL

Q04648 Probable cytochrome c biosynthesis protein7.0e-4588.89Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMP VLAT  IHSVILPLLHS T  +NIVTF CCV GTFSIRSGLLA VHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASV RTY+KEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHL
        ARSTLVHL
Subjt:  ARSTLVHL

Q33884 Cytochrome c biogenesis CcmF N-terminal-like mitochondrial protein 2 (Fragment)5.4e-4588.89Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMP VLAT  IHSVILP LHSWT FLNIVTF CCV GTFSIRSGLLA VHSFATDDTRGIFLW FFLLMTGI MILF QMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHL
        ARSTLVHL
Subjt:  ARSTLVHL

Q9I3N2 Cytochrome c-type biogenesis protein CcmF1.6e-1246.51Show/hide
Query:  ENASFMPRVLATTRIHSVILP----LLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFS
        ENASFMP ++ T  IHS+ +     +  SWT  L I  FS  + GTF +RSG+L  VH+FA+D  RG+F+  F LL+ G S+ LF+
Subjt:  ENASFMPRVLATTRIHSVILP----LLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFS

Arabidopsis top hitse value%identityAlignment
AT2G07768.1 Cytochrome C assembly protein1.8e-5193.64Show/hide
Query:  SEENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEM
        S  NASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLW FFLLMTGISMILF QMKQQASV RTYKKEM
Subjt:  SEENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEM

Query:  VVARSTLVHL
        VVARSTLVHL
Subjt:  VVARSTLVHL

ATMG00960.1 Cytochrome C assembly protein1.3e-5195.37Show/hide
Query:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV
        ENASFMPRVLAT RIHSVILPLLHSWTSFLNIVTF CCVSGTFSIRSGLLAPVHSFATDDTRGIFLW FFLLMTGISMILF QMKQQASV RTYKKEMVV
Subjt:  ENASFMPRVLATTRIHSVILPLLHSWTSFLNIVTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVV

Query:  ARSTLVHL
        ARSTLVHL
Subjt:  ARSTLVHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTGCGAAGAATACATATTAAAACAGAAAGTAAAATACAGGATGAAAAAGCAGGCAAGAACAACTGGTCTGCAAAAATAAGAGGCAAAGAACAACTCATGAGCAA
GGACATCTCTATAGGTGCGACCGTGGTGACAGACGACTTTGTAGGTGTAGTGAACGATTCTGTAAGTGCGACGATGGCGATGGACGAGCAAGATGATGGTGTGGTTGGTG
ACTTTAAGAGTGAGGAAAATGCTTCTTTTATGCCTCGGGTATTAGCCACAACTCGTATTCATTCAGTCATTTTACCCCTTCTTCATTCTTGGACCTCATTTCTGAATATT
GTGACTTTTTCATGCTGTGTCTCAGGAACCTTTTCAATACGGTCCGGATTGCTAGCTCCCGTTCATAGTTTTGCTACAGATGATACACGAGGAATCTTTTTATGGCGGTT
CTTCCTTCTAATGACCGGCATATCAATGATTCTTTTCTCCCAAATGAAGCAGCAGGCATCGGTCTGTAGAACCTATAAGAAAGAGATGGTTGTGGCACGAAGTACTCTTG
TGCACCTATGTCTACATGACTATTTGAAAAATGTAAAACCGCACGAGGTAGTATTTGAAAAATGTCATGGTAATCCTCAGCGAACTCATCATCATCTACATGACTATTTG
AAAAATGTAAAACCATACGAGGGGATCTCTGGTTCGAGTTTGCTAAGGCTTCATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTGCGAAGAATACATATTAAAACAGAAAGTAAAATACAGGATGAAAAAGCAGGCAAGAACAACTGGTCTGCAAAAATAAGAGGCAAAGAACAACTCATGAGCAA
GGACATCTCTATAGGTGCGACCGTGGTGACAGACGACTTTGTAGGTGTAGTGAACGATTCTGTAAGTGCGACGATGGCGATGGACGAGCAAGATGATGGTGTGGTTGGTG
ACTTTAAGAGTGAGGAAAATGCTTCTTTTATGCCTCGGGTATTAGCCACAACTCGTATTCATTCAGTCATTTTACCCCTTCTTCATTCTTGGACCTCATTTCTGAATATT
GTGACTTTTTCATGCTGTGTCTCAGGAACCTTTTCAATACGGTCCGGATTGCTAGCTCCCGTTCATAGTTTTGCTACAGATGATACACGAGGAATCTTTTTATGGCGGTT
CTTCCTTCTAATGACCGGCATATCAATGATTCTTTTCTCCCAAATGAAGCAGCAGGCATCGGTCTGTAGAACCTATAAGAAAGAGATGGTTGTGGCACGAAGTACTCTTG
TGCACCTATGTCTACATGACTATTTGAAAAATGTAAAACCGCACGAGGTAGTATTTGAAAAATGTCATGGTAATCCTCAGCGAACTCATCATCATCTACATGACTATTTG
AAAAATGTAAAACCATACGAGGGGATCTCTGGTTCGAGTTTGCTAAGGCTTCATGAATAG
Protein sequenceShow/hide protein sequence
MKLRRIHIKTESKIQDEKAGKNNWSAKIRGKEQLMSKDISIGATVVTDDFVGVVNDSVSATMAMDEQDDGVVGDFKSEENASFMPRVLATTRIHSVILPLLHSWTSFLNI
VTFSCCVSGTFSIRSGLLAPVHSFATDDTRGIFLWRFFLLMTGISMILFSQMKQQASVCRTYKKEMVVARSTLVHLCLHDYLKNVKPHEVVFEKCHGNPQRTHHHLHDYL
KNVKPYEGISGSSLLRLHE