; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004754 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004754
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionComplex 1 LYR protein
Genome locationChr08:20143059..20143529
RNA-Seq ExpressionHG10004754
SyntenyHG10004754
Gene Ontology termsGO:0032981 - mitochondrial respiratory chain complex I assembly (biological process)
GO:0005759 - mitochondrial matrix (cellular component)
InterPro domainsIPR008011 - Complex 1 LYR protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144304.1 uncharacterized protein LOC111014019 [Momordica charantia]3.2e-7290.38Show/hide
Query:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDE-DELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV
        MT+RLVGKCLNLSIHK+FL CNKIG HPRVLHNGPDTIDELLDRHVVKKEKSF+DE DELITQR LT+TRREALSLYRDI+RATRFFMWPDSRG+LWRDV
Subjt:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDE-DELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV

Query:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ
        LR+NARKEFE+ARYESDPE+VTRLLIGGRDAVQSALDKLAEKQMQEIDKKQ G DQ
Subjt:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ

XP_022961619.1 uncharacterized protein LOC111462339 [Cucurbita moschata]6.1e-7188.46Show/hide
Query:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSF-DDEDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV
        M+MRLV KC+ LS HKNFL CNK+G HPR+LHNGPDTIDELLDRHVVKKEKSF DDE+ELITQRRLT+TRREALSLYRDILRATRFFMWPDSRG+LWRDV
Subjt:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSF-DDEDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV

Query:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ
        LRQNARKEFEDAR+E+DPE+VTRLLIGGRDAVQSALDKLAEKQ QEIDKKQSGRD+
Subjt:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ

XP_023545212.1 uncharacterized protein LOC111804691 [Cucurbita pepo subsp. pepo]6.7e-7088.46Show/hide
Query:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSF-DDEDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV
        M+MRLV KC+ LS HKNFL CNKIG  PR LHNGPDTIDELLDRHVVKKEKSF DDE+ELITQRRLT+TRREALSLYRDILRATRFFMWPDSRG+LWRDV
Subjt:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSF-DDEDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV

Query:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ
        LRQNARKEFEDAR+E+DPE+VTRLLIGGRDAVQSALDKLAEKQ QEIDKKQSGRD+
Subjt:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ

XP_031736624.1 uncharacterized protein LOC101214249 [Cucumis sativus]2.7e-7190.51Show/hide
Query:  MTMRLVGKCLNL-SIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDEDE-LITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRD
        MTMRLVGK LNL SIHKNFLTC+KIG HPRVLHNGPDT++ELLDRHVVKKEK FDDED+ LITQRRLTTTRREALSLYRDILRATRFFMWPDSRG+LWRD
Subjt:  MTMRLVGKCLNL-SIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDEDE-LITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRD

Query:  VLRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL
        VLR+NARKEFEDAR ESDPEMVTRLLIGGRDAVQSALDKLAEKQ +EIDKKQ GRDQL
Subjt:  VLRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL

XP_038884246.1 uncharacterized protein LOC120075144 [Benincasa hispida]4.8e-7695.48Show/hide
Query:  MRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDD-EDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDVLR
        MRLVGKCLNLSIHKNFLTCNKIG HPRVLHNGPDT+ ELLDRHVVKKEKSFDD EDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRG+LWRDVLR
Subjt:  MRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDD-EDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDVLR

Query:  QNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL
        QNARKEFEDARYESDPE+VTRLLIGGRDAVQSALDKLAEKQMQEIDKKQ+GRDQL
Subjt:  QNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL

TrEMBL top hitse value%identityAlignment
A0A0A0LMK5 Uncharacterized protein1.3e-7190.51Show/hide
Query:  MTMRLVGKCLNL-SIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDEDE-LITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRD
        MTMRLVGK LNL SIHKNFLTC+KIG HPRVLHNGPDT++ELLDRHVVKKEK FDDED+ LITQRRLTTTRREALSLYRDILRATRFFMWPDSRG+LWRD
Subjt:  MTMRLVGKCLNL-SIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDEDE-LITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRD

Query:  VLRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL
        VLR+NARKEFEDAR ESDPEMVTRLLIGGRDAVQSALDKLAEKQ +EIDKKQ GRDQL
Subjt:  VLRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL

A0A1S4DWJ1 uncharacterized protein LOC1034885692.1e-6987.97Show/hide
Query:  MTMRLVGKCLNL-SIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDE-DELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRD
        MTMRLVGK LNL SI KNFL C+KIG HPRVLHNGPDT++ELLDRHVVKKEKS DD+ D+LITQRRLT+TRREALSLYRDILRATRFFMWPDS+G+LWRD
Subjt:  MTMRLVGKCLNL-SIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDE-DELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRD

Query:  VLRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL
        VLRQNARKEFEDAR+ESDPE+VTRLLIGGRDAVQSALDKLAEKQ +EIDKKQSGRDQL
Subjt:  VLRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL

A0A5A7VCD6 Complex 1 LYR protein2.1e-6987.97Show/hide
Query:  MTMRLVGKCLNL-SIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDE-DELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRD
        MTMRLVGK LNL SI KNFL C+KIG HPRVLHNGPDT++ELLDRHVVKKEKS DD+ D+LITQRRLT+TRREALSLYRDILRATRFFMWPDS+G+LWRD
Subjt:  MTMRLVGKCLNL-SIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDE-DELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRD

Query:  VLRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL
        VLRQNARKEFEDAR+ESDPE+VTRLLIGGRDAVQSALDKLAEKQ +EIDKKQSGRDQL
Subjt:  VLRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL

A0A6J1CRY1 uncharacterized protein LOC1110140191.6e-7290.38Show/hide
Query:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDE-DELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV
        MT+RLVGKCLNLSIHK+FL CNKIG HPRVLHNGPDTIDELLDRHVVKKEKSF+DE DELITQR LT+TRREALSLYRDI+RATRFFMWPDSRG+LWRDV
Subjt:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDE-DELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV

Query:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ
        LR+NARKEFE+ARYESDPE+VTRLLIGGRDAVQSALDKLAEKQMQEIDKKQ G DQ
Subjt:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ

A0A6J1HEL0 uncharacterized protein LOC1114623392.9e-7188.46Show/hide
Query:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSF-DDEDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV
        M+MRLV KC+ LS HKNFL CNK+G HPR+LHNGPDTIDELLDRHVVKKEKSF DDE+ELITQRRLT+TRREALSLYRDILRATRFFMWPDSRG+LWRDV
Subjt:  MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSF-DDEDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDV

Query:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ
        LRQNARKEFEDAR+E+DPE+VTRLLIGGRDAVQSALDKLAEKQ QEIDKKQSGRD+
Subjt:  LRQNARKEFEDARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76060.1 LYR family of Fe/S cluster biogenesis protein6.1e-4571.43Show/hide
Query:  RVLHNGPDTIDELLDRHVVKKEKSFDDED--ELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDVLRQNARKEFEDARYESDPEMVTRLLIG
        R LH GPDT++ELL+RH+ KKEK   D D  E + +RRLT+TRREALSLYRDILRATRFF W DSRG LWRDVLR+NARKEFE AR+E+DPE++TRLLIG
Subjt:  RVLHNGPDTIDELLDRHVVKKEKSFDDED--ELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDVLRQNARKEFEDARYESDPEMVTRLLIG

Query:  GRDAVQSALDKLAEKQMQEIDKKQSG
        G DAV SALDKLAEKQ + I+K++ G
Subjt:  GRDAVQSALDKLAEKQMQEIDKKQSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATGAGGTTAGTAGGGAAATGTCTTAACCTGTCAATTCATAAGAATTTTCTGACTTGCAATAAAATTGGTCTCCACCCTCGAGTTCTGCATAATGGTCCAGACAC
AATAGATGAACTCTTAGACAGACATGTGGTTAAGAAAGAAAAGTCCTTTGATGATGAAGATGAATTGATAACTCAGCGACGCCTCACCACTACTCGGCGTGAGGCTCTAA
GTCTGTATAGAGACATTCTCCGAGCAACTCGATTCTTTATGTGGCCAGATTCTCGAGGTATCTTGTGGCGAGATGTTCTCAGGCAGAATGCACGGAAGGAGTTTGAGGAT
GCTCGCTATGAGTCTGATCCAGAAATGGTGACCCGACTTCTCATTGGTGGGCGTGATGCAGTGCAGTCGGCTCTTGACAAGCTTGCTGAGAAGCAGATGCAGGAGATTGA
TAAGAAACAAAGTGGTCGAGATCAACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCATGAGGTTAGTAGGGAAATGTCTTAACCTGTCAATTCATAAGAATTTTCTGACTTGCAATAAAATTGGTCTCCACCCTCGAGTTCTGCATAATGGTCCAGACAC
AATAGATGAACTCTTAGACAGACATGTGGTTAAGAAAGAAAAGTCCTTTGATGATGAAGATGAATTGATAACTCAGCGACGCCTCACCACTACTCGGCGTGAGGCTCTAA
GTCTGTATAGAGACATTCTCCGAGCAACTCGATTCTTTATGTGGCCAGATTCTCGAGGTATCTTGTGGCGAGATGTTCTCAGGCAGAATGCACGGAAGGAGTTTGAGGAT
GCTCGCTATGAGTCTGATCCAGAAATGGTGACCCGACTTCTCATTGGTGGGCGTGATGCAGTGCAGTCGGCTCTTGACAAGCTTGCTGAGAAGCAGATGCAGGAGATTGA
TAAGAAACAAAGTGGTCGAGATCAACTCTGA
Protein sequenceShow/hide protein sequence
MTMRLVGKCLNLSIHKNFLTCNKIGLHPRVLHNGPDTIDELLDRHVVKKEKSFDDEDELITQRRLTTTRREALSLYRDILRATRFFMWPDSRGILWRDVLRQNARKEFED
ARYESDPEMVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGRDQL