; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006828 (gene) of Snake gourd v1 genome

Gene IDTan0006828
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCytochrome c oxidase assembly protein cox16
Genome locationLG04:5141580..5147315
RNA-Seq ExpressionTan0006828
SyntenyTan0006828
Gene Ontology termsGO:0005743 - mitochondrial inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446874.1 PREDICTED: uncharacterized protein LOC103489460 isoform X1 [Cucumis melo]3.7e-6091.54Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TSTEAGM +STE KKI+SA+NAARASPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMR+ALSRTGPIDAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQE+VDINNYEYKRIP+PTDR+S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

XP_022956367.1 uncharacterized protein LOC111458129 [Cucurbita moschata]2.5e-6193.85Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TSTEAGM QSTE KK  SANNAAR+SPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQEKVDINNYEYKRIPKPTDR+S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

XP_022969704.1 uncharacterized protein LOC111468648 [Cucurbita maxima]7.4e-6193.08Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TSTEAG+ QSTE KK  SANNAAR+SPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQEKVDINNYEYKRIPKPTDR+S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

XP_023533207.1 uncharacterized protein LOC111795161 [Cucurbita pepo subsp. pepo]5.7e-6193.08Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TSTEAGM QSTE KK  SANNAAR+SPSMF+RWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQEKVDINNYEYKRIPKPTDR+S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

XP_038891357.1 uncharacterized protein LOC120080793 isoform X2 [Benincasa hispida]1.7e-6092.31Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TSTEAG+ QSTE KKIAS NNA RASPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKV+DDQEWEITEMRKALSRTGPIDAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQEKVDINNYEYKRIPKPTD++S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

TrEMBL top hitse value%identityAlignment
A0A1S4DWC3 uncharacterized protein LOC103489460 isoform X11.8e-6091.54Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TSTEAGM +STE KKI+SA+NAARASPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMR+ALSRTGPIDAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQE+VDINNYEYKRIP+PTDR+S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

A0A5A7SXX4 Cytochrome c oxidase assembly protein cox161.5e-5181.06Show/hide
Query:  TTTSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN
        T   TEAGM +STE KKI+SA+NAARASPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMR+ALSRTGPIDA+   +
Subjt:  TTTSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN

Query:  LSLEEELRALQEKVDINNYEYKRIPKPTDRSS
        L + ++ +ALQE+VDINNYEYKRIP+PTDR+S
Subjt:  LSLEEELRALQEKVDINNYEYKRIPKPTDRSS

A0A6J1CBH5 uncharacterized protein LOC1110100642.4e-5787.69Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TS E GMVQS E  KI+SA N ARASPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGP+DAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQEKV+IN+YEYKRIPKP+ R+S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

A0A6J1GWL5 uncharacterized protein LOC1114581291.2e-6193.85Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TSTEAGM QSTE KK  SANNAAR+SPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQEKVDINNYEYKRIPKPTDR+S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

A0A6J1HX89 uncharacterized protein LOC1114686483.6e-6193.08Show/hide
Query:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS
        TSTEAG+ QSTE KK  SANNAAR+SPSMFRRWGRRHPF+RYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN+S
Subjt:  TSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKNLS

Query:  LEEELRALQEKVDINNYEYKRIPKPTDRSS
        LEEELRALQEKVDINNYEYKRIPKPTDR+S
Subjt:  LEEELRALQEKVDINNYEYKRIPKPTDRSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G14145.1 unknown protein1.8e-4466.67Show/hide
Query:  TTTSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN
        TT  T     +S+ S    +     + S + F+RWGRRHPFVRYGLPMISLTV GA+GLG LLQGSKDIAKVKDDQEWEI E RKALSRTGP+DAYKPKN
Subjt:  TTTSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKALSRTGPIDAYKPKN

Query:  LSLEEELRALQEKVDINNYEYKRIPKPTDRSS
         S+E+EL+A+QEKVDIN YEYK+IPK  +  S
Subjt:  LSLEEELRALQEKVDINNYEYKRIPKPTDRSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATATCTTTGGTTGGGTTGGCAGTCCCTCGATCTTGGTTTCATAGCCAAAGTTTTTCCACGGTTTGCACAACGACGAGCACTGAGGCAGGAATGGTTCAGTCTACAGA
ATCAAAGAAGATTGCCTCTGCCAACAATGCAGCTCGGGCATCTCCATCCATGTTTAGAAGATGGGGTAGAAGACACCCATTTGTCAGATATGGACTTCCAATGATCTCTC
TCACTGTGCTTGGGGCAGTTGGTCTTGGCCATCTCTTGCAAGGAAGTAAAGATATTGCAAAGGTAAAAGATGATCAAGAATGGGAGATCACTGAGATGAGAAAAGCGCTT
TCAAGAACCGGACCTATCGATGCATATAAGCCGAAGAACTTATCTTTGGAGGAGGAGTTGAGGGCTTTACAAGAAAAGGTAGACATCAATAACTATGAGTACAAGAGAAT
TCCAAAGCCTACCGATCGATCGTCGTAA
mRNA sequenceShow/hide mRNA sequence
CGTGCGCAGCCTCCTCCTCATTCTTCTTCTTCTTCACGCCGTCTCCTCCTCCGTTTCAGCAGTGCCACCGGTTCTCTCCTGTTCAGCCGCCGCCCTCGTCGTTGCGCCGC
CGAACGCCACTCGCGAACCAACCGCACGCCGCTCGCGAAAACAGCCGCGCCGCCCAGATCCGAGAAGAAGCTCCGCCACGAGTCGCAAGCCACTGCCTCGCCTGCTCGTC
TCCGTCGCGCCGCCCATCACCCACTGAGGGCAACACCACCAAGCCTACCTCAATTTACTCTCTCGCGCGATCTGCTCAGACAACAAGGTTTCTTATGATATCTTTGGTTG
GGTTGGCAGTCCCTCGATCTTGGTTTCATAGCCAAAGTTTTTCCACGGTTTGCACAACGACGAGCACTGAGGCAGGAATGGTTCAGTCTACAGAATCAAAGAAGATTGCC
TCTGCCAACAATGCAGCTCGGGCATCTCCATCCATGTTTAGAAGATGGGGTAGAAGACACCCATTTGTCAGATATGGACTTCCAATGATCTCTCTCACTGTGCTTGGGGC
AGTTGGTCTTGGCCATCTCTTGCAAGGAAGTAAAGATATTGCAAAGGTAAAAGATGATCAAGAATGGGAGATCACTGAGATGAGAAAAGCGCTTTCAAGAACCGGACCTA
TCGATGCATATAAGCCGAAGAACTTATCTTTGGAGGAGGAGTTGAGGGCTTTACAAGAAAAGGTAGACATCAATAACTATGAGTACAAGAGAATTCCAAAGCCTACCGAT
CGATCGTCGTAAAGTTTTGCCCGCGTTGCATACGAGATCTGCTCAGGTATTTTTGTGTTTCTACCTGTGATGCAAGTTAGATTCAATTTCCAAAGAGAACAATTGGAATG
AATGATCTAGCTAAATCCTTGTTGAAGATTATGCTAATAAGCGTCATTTTCAAAGGAGCAGAATGTGCCAAGAAAAAAAAAGAATTAAGCTTGTTTAGCGTGATAAACAC
CATTGATAAAATATAGATAGATATTGGAGCCTTTATAATTACGAGTTCCTCTCTTCTTTGAAAGTTAATACGAGTATTAGTTTTTACTTGATTAAATTAATAACATGAGT
TATAATAGTCAA
Protein sequenceShow/hide protein sequence
MISLVGLAVPRSWFHSQSFSTVCTTTSTEAGMVQSTESKKIASANNAARASPSMFRRWGRRHPFVRYGLPMISLTVLGAVGLGHLLQGSKDIAKVKDDQEWEITEMRKAL
SRTGPIDAYKPKNLSLEEELRALQEKVDINNYEYKRIPKPTDRSS