; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018521 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018521
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionComplex 1 LYR protein
Genome locationtig00153204:974983..975450
RNA-Seq ExpressionSgr018521
SyntenySgr018521
Gene Ontology termsGO:0032981 - mitochondrial respiratory chain complex I assembly (biological process)
GO:0005759 - mitochondrial matrix (cellular component)
InterPro domainsIPR008011 - Complex 1 LYR protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029455.1 hypothetical protein SDJN02_07794, partial [Cucurbita argyrosperma subsp. argyrosperma]3.1e-6784.21Show/hide
Query:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR
        MRLV KC+ LS HK+F+ACNKIG+HPR LHNGPDTIDELLDRHVVK+E SFDD+++ELIT+RRLTSTRREALSLYRDILRATRFFMWPD RGVLWRDVLR
Subjt:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR

Query:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGG
        +NARKEFE+AR+E+DPEIVTRLLI GRDAV S LDKLAEKQ QEIDKKQ GG
Subjt:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGG

XP_022144304.1 uncharacterized protein LOC111014019 [Momordica charantia]7.6e-7490.91Show/hide
Query:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR
        +RLVGKCLNLSIHKHF+ACNKIGFHPRVLHNGPDTIDELLDRHVVK+E SF+DEDDELIT+R LTSTRREALSLYRDI+RATRFFMWPD RGVLWRDVLR
Subjt:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR

Query:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ
        ENARKEFEEARYESDPE+VTRLLI GRDAV S LDKLAEKQMQEIDKKQ GGDQ
Subjt:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ

XP_022961619.1 uncharacterized protein LOC111462339 [Cucurbita moschata]2.4e-6783.12Show/hide
Query:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR
        MRLV KC+ LS HK+F+ACNK+GFHPR+LHNGPDTIDELLDRHVVK+E SFDD+++ELIT+RRLTSTRREALSLYRDILRATRFFMWPD RGVLWRDVLR
Subjt:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR

Query:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ
        +NARKEFE+AR+E+DPEIVTRLLI GRDAV S LDKLAEKQ QEIDKKQ G D+
Subjt:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ

XP_031736624.1 uncharacterized protein LOC101214249 [Cucumis sativus]4.1e-6784.62Show/hide
Query:  MRLVGKCLNL-SIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVL
        MRLVGK LNL SIHK+F+ C+KIGFHPRVLHNGPDT++ELLDRHVVK+E  FDDEDD+LIT+RRLT+TRREALSLYRDILRATRFFMWPD RGVLWRDVL
Subjt:  MRLVGKCLNL-SIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVL

Query:  RENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL
        RENARKEFE+AR ESDPE+VTRLLI GRDAV S LDKLAEKQ +EIDKKQGG DQL
Subjt:  RENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL

XP_038884246.1 uncharacterized protein LOC120075144 [Benincasa hispida]7.1e-7288.39Show/hide
Query:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR
        MRLVGKCLNLSIHK+F+ CNKIGFHPRVLHNGPDT+ ELLDRHVVK+E SFDDE+DELIT+RRLT+TRREALSLYRDILRATRFFMWPD RGVLWRDVLR
Subjt:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR

Query:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL
        +NARKEFE+ARYESDPEIVTRLLI GRDAV S LDKLAEKQMQEIDKKQ G DQL
Subjt:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL

TrEMBL top hitse value%identityAlignment
A0A0A0LMK5 Uncharacterized protein2.0e-6784.62Show/hide
Query:  MRLVGKCLNL-SIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVL
        MRLVGK LNL SIHK+F+ C+KIGFHPRVLHNGPDT++ELLDRHVVK+E  FDDEDD+LIT+RRLT+TRREALSLYRDILRATRFFMWPD RGVLWRDVL
Subjt:  MRLVGKCLNL-SIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVL

Query:  RENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL
        RENARKEFE+AR ESDPE+VTRLLI GRDAV S LDKLAEKQ +EIDKKQGG DQL
Subjt:  RENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL

A0A5A7VCD6 Complex 1 LYR protein1.6e-6482.05Show/hide
Query:  MRLVGKCLNL-SIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVL
        MRLVGK LNL SI K+F+ C+KIGFHPRVLHNGPDT++ELLDRHVVK+E S DD+ D+LIT+RRLTSTRREALSLYRDILRATRFFMWPD +GVLWRDVL
Subjt:  MRLVGKCLNL-SIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVL

Query:  RENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL
        R+NARKEFE+AR+ESDPEIVTRLLI GRDAV S LDKLAEKQ +EIDKKQ G DQL
Subjt:  RENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL

A0A6J1CRY1 uncharacterized protein LOC1110140193.7e-7490.91Show/hide
Query:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR
        +RLVGKCLNLSIHKHF+ACNKIGFHPRVLHNGPDTIDELLDRHVVK+E SF+DEDDELIT+R LTSTRREALSLYRDI+RATRFFMWPD RGVLWRDVLR
Subjt:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR

Query:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ
        ENARKEFEEARYESDPE+VTRLLI GRDAV S LDKLAEKQMQEIDKKQ GGDQ
Subjt:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ

A0A6J1HEL0 uncharacterized protein LOC1114623391.2e-6783.12Show/hide
Query:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR
        MRLV KC+ LS HK+F+ACNK+GFHPR+LHNGPDTIDELLDRHVVK+E SFDD+++ELIT+RRLTSTRREALSLYRDILRATRFFMWPD RGVLWRDVLR
Subjt:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR

Query:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ
        +NARKEFE+AR+E+DPEIVTRLLI GRDAV S LDKLAEKQ QEIDKKQ G D+
Subjt:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ

A0A6J1K786 uncharacterized protein LOC1114922891.7e-6681.82Show/hide
Query:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR
        MRLV KC+ LS HK+F+ACNK G+HPR LHNGPDTIDELLD HVVK+E SFDD+++ELIT+RRLTSTRRE+LSLYRDILRATRFFMWPD RGVLWRDVLR
Subjt:  MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLR

Query:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ
        +NARKEFE+AR+E+DPEIVTRLLI GRDAV S LDKLAEKQ QEIDKKQ GGD+
Subjt:  ENARKEFEEARYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G76060.1 LYR family of Fe/S cluster biogenesis protein2.6e-4369.05Show/hide
Query:  RVLHNGPDTIDELLDRHVVKRENSFDDEDD-ELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLRENARKEFEEARYESDPEIVTRLLIA
        R LH GPDT++ELL+RH+ K+E    D D+ E +  RRLTSTRREALSLYRDILRATRFF W D RG LWRDVLRENARKEFE AR+E+DPE++TRLLI 
Subjt:  RVLHNGPDTIDELLDRHVVKRENSFDDEDD-ELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLRENARKEFEEARYESDPEIVTRLLIA

Query:  GRDAVHSTLDKLAEKQMQEIDKKQGG
        G DAV S LDKLAEKQ + I+K++ G
Subjt:  GRDAVHSTLDKLAEKQMQEIDKKQGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGTTGGTGGGAAAATGTCTAAACCTGTCAATTCATAAACATTTTATGGCTTGCAACAAAATTGGTTTCCACCCTCGAGTTTTGCATAATGGTCCAGACACAATAGA
TGAACTTTTAGACAGACATGTGGTTAAGAGAGAAAATTCCTTTGATGATGAAGATGATGAATTGATAACTGAGCGACGACTCACCAGCACTCGACGCGAGGCTTTAAGTC
TCTATCGAGACATCCTCCGTGCAACTCGATTCTTTATGTGGCCAGATTTTCGAGGTGTGTTGTGGCGAGACGTTCTCAGAGAGAATGCCCGAAAGGAATTTGAGGAGGCC
CGCTATGAGTCTGATCCAGAAATTGTGACCCGACTCCTTATTGCCGGGCGCGATGCAGTGCATTCTACTCTTGACAAGCTTGCTGAGAAGCAGATGCAGGAGATTGATAA
GAAACAAGGTGGTGGAGATCAACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGTTGGTGGGAAAATGTCTAAACCTGTCAATTCATAAACATTTTATGGCTTGCAACAAAATTGGTTTCCACCCTCGAGTTTTGCATAATGGTCCAGACACAATAGA
TGAACTTTTAGACAGACATGTGGTTAAGAGAGAAAATTCCTTTGATGATGAAGATGATGAATTGATAACTGAGCGACGACTCACCAGCACTCGACGCGAGGCTTTAAGTC
TCTATCGAGACATCCTCCGTGCAACTCGATTCTTTATGTGGCCAGATTTTCGAGGTGTGTTGTGGCGAGACGTTCTCAGAGAGAATGCCCGAAAGGAATTTGAGGAGGCC
CGCTATGAGTCTGATCCAGAAATTGTGACCCGACTCCTTATTGCCGGGCGCGATGCAGTGCATTCTACTCTTGACAAGCTTGCTGAGAAGCAGATGCAGGAGATTGATAA
GAAACAAGGTGGTGGAGATCAACTTTGA
Protein sequenceShow/hide protein sequence
MRLVGKCLNLSIHKHFMACNKIGFHPRVLHNGPDTIDELLDRHVVKRENSFDDEDDELITERRLTSTRREALSLYRDILRATRFFMWPDFRGVLWRDVLRENARKEFEEA
RYESDPEIVTRLLIAGRDAVHSTLDKLAEKQMQEIDKKQGGGDQL