CuGenDBv2

Tan0000341 (gene) of Snake gourd v1 genome

Gene ID: Tan0000341
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Complex 1 LYR protein
Genome location: LG02:96449948..96452805
RNA-Seq Expression: Tan0000341
Synteny: Tan0000341
Gene Ontology terms:
  GO:0032981 - mitochondrial respiratory chain complex I assembly (biological process)
  GO:0005759 - mitochondrial matrix (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG7029455.1 hypothetical protein SDJN02_07794, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.3e-70; % identity: 90.07)
Query:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
        MRL+ KC+ LS HKNFLACNKI +HPR LHNGPDTIDELLDRHVVKKEKS+DD+++ELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
Subjt:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR

Query:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSG
        +NARKEFEDARFE+DPEIVTRLLIGGRDAVQSALDKLAEKQ QEIDKKQSG
Subjt:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSG

XP_022144304.1 uncharacterized protein LOC111014019 [Momordica charantia] (e value: 4.2e-72; % identity: 90.91)
Query:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
        +RL+GKCLNLS HK+FLACNKI FHPRVLHNGPDTIDELLDRHVVKKEKS++DEDDELITQR LTSTRREALSLYRDI+RATRFFMWPDSRGVLWRDVLR
Subjt:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR

Query:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ
        ENARKEFE+AR+ESDPE+VTRLLIGGRDAVQSALDKLAEKQMQEIDKKQ G DQ
Subjt:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ

XP_022961619.1 uncharacterized protein LOC111462339 [Cucurbita moschata] (e value: 3.5e-71; % identity: 88.96)
Query:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
        MRL+ KC+ LS HKNFLACNK+ FHPR+LHNGPDTIDELLDRHVVKKEKS+DD+++ELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
Subjt:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR

Query:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ
        +NARKEFEDARFE+DPEIVTRLLIGGRDAVQSALDKLAEKQ QEIDKKQSG+D+
Subjt:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ

XP_023545212.1 uncharacterized protein LOC111804691 [Cucurbita pepo subsp. pepo] (e value: 6.7e-70; % identity: 88.31)
Query:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
        MRL+ KC+ LS HKNFLACNKI + PR LHNGPDTIDELLDRHVVKKEKS+DD+++ELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
Subjt:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR

Query:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ
        +NARKEFEDARFE+DPEIVTRLLIGGRDAVQSALDKLAEKQ QEIDKKQSG+D+
Subjt:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ

XP_038884246.1 uncharacterized protein LOC120075144 [Benincasa hispida] (e value: 1.3e-73; % identity: 91.61)
Query:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
        MRL+GKCLNLS HKNFL CNKI FHPRVLHNGPDT+ ELLDRHVVKKEKS+DDE+DELITQRRLT+TRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
Subjt:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR

Query:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQL
        +NARKEFEDAR+ESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQ+G+DQL
Subjt:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LMK5 Uncharacterized protein (e value: 2.1e-69; % identity: 87.82)
Query:  MRLLGKCLNLSN-HKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVL
        MRL+GK LNLS+ HKNFL C+KI FHPRVLHNGPDT++ELLDRHVVKKEK +DDEDD+LITQRRLT+TRREALSLYRDILRATRFFMWPDSRGVLWRDVL
Subjt:  MRLLGKCLNLSN-HKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVL

Query:  RENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQL
        RENARKEFEDAR ESDPE+VTRLLIGGRDAVQSALDKLAEKQ +EIDKKQ G+DQL
Subjt:  RENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQL

A0A5A7VCD6 Complex 1 LYR protein (e value: 4.0e-68; % identity: 87.82)
Query:  MRLLGKCLNLSN-HKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVL
        MRL+GK LNLS+  KNFL C+KI FHPRVLHNGPDT++ELLDRHVVKKEKS DD+ D+LITQRRLTSTRREALSLYRDILRATRFFMWPDS+GVLWRDVL
Subjt:  MRLLGKCLNLSN-HKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVL

Query:  RENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQL
        R+NARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQ +EIDKKQSG+DQL
Subjt:  RENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQL

A0A6J1CRY1 uncharacterized protein LOC111014019 (e value: 2.0e-72; % identity: 90.91)
Query:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
        +RL+GKCLNLS HK+FLACNKI FHPRVLHNGPDTIDELLDRHVVKKEKS++DEDDELITQR LTSTRREALSLYRDI+RATRFFMWPDSRGVLWRDVLR
Subjt:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR

Query:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ
        ENARKEFE+AR+ESDPE+VTRLLIGGRDAVQSALDKLAEKQMQEIDKKQ G DQ
Subjt:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ

A0A6J1HEL0 uncharacterized protein LOC111462339 (e value: 1.7e-71; % identity: 88.96)
Query:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
        MRL+ KC+ LS HKNFLACNK+ FHPR+LHNGPDTIDELLDRHVVKKEKS+DD+++ELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
Subjt:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR

Query:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ
        +NARKEFEDARFE+DPEIVTRLLIGGRDAVQSALDKLAEKQ QEIDKKQSG+D+
Subjt:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ

A0A6J1K786 uncharacterized protein LOC111492289 (e value: 4.7e-69; % identity: 87.01)
Query:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR
        MRL+ KC+ LS HKNFLACNK  +HPR LHNGPDTIDELLD HVVKKEKS+DD+++ELITQRRLTSTRRE+LSLYRDILRATRFFMWPDSRGVLWRDVLR
Subjt:  MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLR

Query:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ
        +NARKEFEDARFE+DPEIVTRLLIGGRDAVQSALDKLAEKQ QEIDKKQSG D+
Subjt:  ENARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G76060.1 LYR family of Fe/S cluster biogenesis protein (e value: 8.5e-47; % identity: 64.47)
Query:  LGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDD-ELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLREN
        L K  NL+   + +  +  V   R LH GPDT++ELL+RH+ KKEK   D D+ E + +RRLTSTRREALSLYRDILRATRFF W DSRG LWRDVLREN
Subjt:  LGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDD-ELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLREN

Query:  ARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ
        ARKEFE ARFE+DPE++TRLLIGG DAV SALDKLAEKQ + I+K++ G  +
Subjt:  ARKEFEDARFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQ


Sequences
CDS sequence
ATGAGGCTGCTGGGAAAATGTTTAAACCTATCAAATCATAAGAATTTTCTGGCTTGCAATAAAATTGTATTCCACCCTCGAGTTTTGCACAATGGTCCAGACACAATAGA
TGAACTTTTAGATAGACATGTGGTTAAGAAAGAAAAGTCCTATGATGATGAAGATGATGAATTGATAACTCAGCGACGCCTCACCAGCACTCGACGAGAGGCTTTAAGTC
TCTACCGAGACATTCTCCGTGCAACTCGATTCTTTATGTGGCCGGATTCTCGAGGTGTCCTGTGGCGAGATGTTCTTAGGGAGAATGCTCGGAAGGAGTTTGAGGATGCC
CGCTTCGAGTCAGATCCAGAAATTGTGACCCGACTTCTCATTGGTGGGCGCGATGCAGTGCAGTCAGCTCTCGACAAGCTTGCTGAGAAGCAGATGCAGGAGATTGATAA
GAAACAAAGTGGTCAAGATCAACTCTGA
mRNA sequence
CTTTCATCTCTTTTCTCGTCTCCTTCCTCAGTTCTCGCGAACTCGCCCATTTGTCAATCTCCCGCTCTCTCTCTCTCTGCATCAGCAGAATCATCAATTGCACTAAAAAA
TCGCTTCTCTCTTTTCTAGCTTATCTCGATCAATCGTCTTCCCCCTCAATCTCACTCGCCTGGATCCTTCTAGGAAAATTACTGTACCCAATCCCAAGCACTTTGAGAGG
CCCAGACTCTCCCATCGAAGCACTTTAAGAGGCAGTTGTAACGTCCGAGAATTTCAAGTGCAGACAATTTAGAGTTTTCGATGAAAGTACACGGGTAGTGTCTTGAGGCA
TTGGTAAGGGCTATAAGCAAAGCTGTCCACGATCAAGATAGGGAAGATTGAGATTGCTGCCCACACGCAGACCAGATTCCAAGTAAATCATTCGGCATTGTCTCATTGAG
ATCAAAGGGAATTATTTAAATTGTCGAGTTGACCAATGCAGTAGAGCAGGGGTGACCATGAGGCTGCTGGGAAAATGTTTAAACCTATCAAATCATAAGAATTTTCTGGC
TTGCAATAAAATTGTATTCCACCCTCGAGTTTTGCACAATGGTCCAGACACAATAGATGAACTTTTAGATAGACATGTGGTTAAGAAAGAAAAGTCCTATGATGATGAAG
ATGATGAATTGATAACTCAGCGACGCCTCACCAGCACTCGACGAGAGGCTTTAAGTCTCTACCGAGACATTCTCCGTGCAACTCGATTCTTTATGTGGCCGGATTCTCGA
GGTGTCCTGTGGCGAGATGTTCTTAGGGAGAATGCTCGGAAGGAGTTTGAGGATGCCCGCTTCGAGTCAGATCCAGAAATTGTGACCCGACTTCTCATTGGTGGGCGCGA
TGCAGTGCAGTCAGCTCTCGACAAGCTTGCTGAGAAGCAGATGCAGGAGATTGATAAGAAACAAAGTGGTCAAGATCAACTCTGATCTGTCATATGGTTACTCTGATCTC
TTCTACTATTGCAATTGAAGTTGAAGTCTGGATGACATTTGGGCTACAAAAAGTAATAGGGCTTAGTTGCTGATAAAATGAAGAAAGATTCGTCAATATGTCAGTAACAA
GCCATCAACTTTTGCCTTGATGCAATATTTATCGCTGTACACCACAATGCATCACTTGTTTCGGCAATGTAAGTTAAAAGGTTGGCATTCACCAGTTTTGAATTTTTTGT
ACTCTGAAGAAACCGAGTTGTACTTTGCTGTTTGCAATTTTGTTCCACCCGTTCACTTGTTTTGTCCTATTTACTCACTTTAGCCGTAGTTCAGTTCGGCTTCTTGGGAA
AAAGAGTTTGCATCAGAAAATACATATTGTCGAACTTCTACTTGTCTTGGCACTTCTTATTTTTATACTTCAATAAATGTAAGTTTTTGGAATAGATGTCTTTTAATATT
ACAGATTGGTATATTTTGTTGACTTTCTCTTTTGCCCTTATAATAGCTGCGGAAAAATGAGAAGCAGAACATTGTGTCTCTAGTCTAAAGGGTGAATGCAATGTCATTGA
CTACTGGTGGTTTCTTAGAGAATGTTTTAGGGGCAGAACTATGTTGATATTGGGAAGAATTTGATTTGGAGGCGAAATTGGTTGCTGGTCTC
Protein sequence
MRLLGKCLNLSNHKNFLACNKIVFHPRVLHNGPDTIDELLDRHVVKKEKSYDDEDDELITQRRLTSTRREALSLYRDILRATRFFMWPDSRGVLWRDVLRENARKEFEDA
RFESDPEIVTRLLIGGRDAVQSALDKLAEKQMQEIDKKQSGQDQL
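
The CDS and protein sequences above are related by translation: the 468-nucleotide CDS encodes the 155 residues plus a terminal stop codon. The following minimal sketch shows how the two could be compared. It assumes the standard nuclear genetic code and that the wrapped sequence lines are joined before translation. The `translate` function and table names are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from wrapped sequence blocks.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)
```

For example, `translate("ATGAGGCTGCTGGGAAAATGTTTAAACCTA")` gives `"MRLLGKCLNL"`, the first ten residues of the protein sequence listed above.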