; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004579 (gene) of Snake gourd v1 genome

Gene IDTan0004579
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUPF0235 protein C15orf40 homolog
Genome locationLG07:15250827..15253220
RNA-Seq ExpressionTan0004579
SyntenyTan0004579
Gene Ontology termsNA
InterPro domainsIPR003746 - Protein of unknown function DUF167
IPR036591 - YggU-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151587.1 uncharacterized protein LOC101218498 [Cucumis sativus]5.9e-5692.19Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        MPP KKGKTKAPKATESIQSSVK+NN+PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAP KDGEANAALLDYMS+VLGVK+RQVS+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALTCE
        KSR KVVIVE+VS+QSVFDALNKALTCE
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALTCE

XP_008460904.1 PREDICTED: UPF0235 protein LHK_03181 [Cucumis melo]5.9e-5692.97Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        MPP KKGKTKAPKATESIQSSVKTNN+PSCLRSVSPSSVAITIHAKPGSKIASITDF DDALGVQIDAP KDGEANAALLDYMS+VLGVK+RQVS+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALTCE
        KSR KVVIVEEVS+QSVFDALNKALTCE
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALTCE

XP_022143592.1 uncharacterized protein LOC111013451 [Momordica charantia]6.1e-5389.84Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        MPP KKGKTKAPKATESIQS    N FPSCLRSV+PSSVAITIHAKPGSKIASITDFGDDALGVQIDAP KDGEANAALLDY+STVLGVK+RQVS+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALTCE
        KSRDKVVIVEEVS+Q+VFDALNKALTCE
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALTCE

XP_022971730.1 UPF0235 protein C15orf40 homolog [Cucurbita maxima]1.4e-5289.68Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        M P KKGKTKAPKATESIQSS+K NN+PSCLRSVS SS+AITIHAKPGSKIASITDFGDDALGVQIDAP KDGEANAALLDYMS+VLGVK+RQ+S+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALT
        KSRDKVVIVEEVS+QSVFDALNKALT
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALT

XP_038902101.1 UPF0235 protein C15orf40 homolog isoform X2 [Benincasa hispida]2.7e-5692.97Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        MPP KKGKTKA KATESIQSSVKTNN+PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAP KDGEANAALLDYMS+VLGVK+RQ+S+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALTCE
        KSRDKVVIVEEVS+QSVFDALNKALTCE
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALTCE

TrEMBL top hitse value%identityAlignment
A0A0A0LMY2 Uncharacterized protein2.9e-5692.19Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        MPP KKGKTKAPKATESIQSSVK+NN+PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAP KDGEANAALLDYMS+VLGVK+RQVS+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALTCE
        KSR KVVIVE+VS+QSVFDALNKALTCE
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALTCE

A0A1S3CDI8 UPF0235 protein LHK_031812.9e-5692.97Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        MPP KKGKTKAPKATESIQSSVKTNN+PSCLRSVSPSSVAITIHAKPGSKIASITDF DDALGVQIDAP KDGEANAALLDYMS+VLGVK+RQVS+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALTCE
        KSR KVVIVEEVS+QSVFDALNKALTCE
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALTCE

A0A6J1CR98 uncharacterized protein LOC1110134513.0e-5389.84Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        MPP KKGKTKAPKATESIQS    N FPSCLRSV+PSSVAITIHAKPGSKIASITDFGDDALGVQIDAP KDGEANAALLDY+STVLGVK+RQVS+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALTCE
        KSRDKVVIVEEVS+Q+VFDALNKALTCE
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALTCE

A0A6J1EMK4 uncharacterized protein LOC1114348898.1e-5187.3Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        M P KKGKTKA K TES +S +KTNN+PSCLRSVS SSVAITIHAKPGSKIASITDFGDDALGVQIDAP KDGEANAALLDYMS+VLGVK+RQ+S+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALT
        KSRDKVVIVEEVS+QSVFDALNKALT
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALT

A0A6J1I6J5 UPF0235 protein C15orf40 homolog6.6e-5389.68Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        M P KKGKTKAPKATESIQSS+K NN+PSCLRSVS SS+AITIHAKPGSKIASITDFGDDALGVQIDAP KDGEANAALLDYMS+VLGVK+RQ+S+GSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKALT
        KSRDKVVIVEEVS+QSVFDALNKALT
Subjt:  KSRDKVVIVEEVSMQSVFDALNKALT

SwissProt top hitse value%identityAlignment
Q3ZBP8 UPF0235 protein C15orf40 homolog1.9e-1246.51Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVV-IVEEVSMQSVFDALNK
        V+I IHAKPGSK  ++TD   +A+ V I APP +GEANA L  Y+S VL ++K  V L  G KSR+KVV ++     + + + L K
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVV-IVEEVSMQSVFDALNK

Q505I4 UPF0235 protein C15orf40 homolog4.9e-1345.1Show/hide
Query:  KGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDK
        K +TK P+        V T+         S   V I IHAKPGSK  ++TD   +A+GV I APP +GEANA L  Y+S VL ++K  V L  G KSR+K
Subjt:  KGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDK

Query:  VV
        VV
Subjt:  VV

Q54UW1 UPF0235 protein5.1e-1030.4Show/hide
Query:  KKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRD
        KKG + + K  +  Q  +  NN       V    + I ++  P SK +SI  F D  L ++I  PP DG+AN  +++++S  L ++K  + +G GSKSR+
Subjt:  KKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRD

Query:  KVVIV----EEVSMQSVFDALNKAL
        K V +    E ++   +F+ +   L
Subjt:  KVVIV----EEVSMQSVFDALNKAL

Q8WUR7 UPF0235 protein C15orf401.1e-1246.51Show/hide
Query:  VAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVV-IVEEVSMQSVFDALNK
        V I IHAKPGSK  ++TD   +A+ V I APP +GEANA L  Y+S VL ++K  V L  G KSR+KVV ++   + + + + L K
Subjt:  VAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVV-IVEEVSMQSVFDALNK

Q9CRC3 UPF0235 protein C15orf40 homolog1.9e-1243.14Show/hide
Query:  KGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDK
        K +TK P+        V T+             V I IHAKPGS+  ++TD   +A+GV I APP +GEANA L  Y+S VL ++K  V L  G KSR+K
Subjt:  KGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDK

Query:  VV
        VV
Subjt:  VV

Arabidopsis top hitse value%identityAlignment
AT1G49170.1 Protein of unknown function (DUF167)1.4e-3970.16Show/hide
Query:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS
        M P KKGK K   A ES  +  ++++FP+CLR ++PSSVAITIHAKPGSK ASITD  D+A+GVQIDAP +DGEANAALL+YMS+VLGVK+RQVSLGSGS
Subjt:  MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGS

Query:  KSRDKVVIVEEVSMQSVFDALNKA
        KSRDKVVIVE+++ QSVF AL++A
Subjt:  KSRDKVVIVEEVSMQSVFDALNKA

AT5G63440.2 Protein of unknown function (DUF167)3.4e-0931.63Show/hide
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVVIVEEVSMQSVFDALNKAL
        P C+  +    V + I  +  ++ ++IT    D + V + AP   GEAN  LL++M  VLG++  Q++L  G  S+ K+++VE++S + V++ L +A+
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVVIVEEVSMQSVFDALNKAL

AT5G63440.3 Protein of unknown function (DUF167)3.4e-0931.63Show/hide
Query:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVVIVEEVSMQSVFDALNKAL
        P C+  +    V + I  +  ++ ++IT    D + V + AP   GEAN  LL++M  VLG++  Q++L  G  S+ K+++VE++S + V++ L +A+
Subjt:  PSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVVIVEEVSMQSVFDALNKAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCGGTGAAGAAGGGGAAAACGAAAGCTCCGAAAGCTACTGAATCAATCCAATCCTCCGTCAAAACCAACAATTTCCCTTCTTGTCTTCGCTCTGTTTCTCCTTC
TTCCGTCGCCATCACCATCCACGCAAAGCCTGGTTCCAAGATCGCCTCTATCACAGACTTTGGCGATGATGCGTTGGGAGTGCAAATCGACGCACCGCCCAAAGATGGAG
AAGCCAATGCTGCACTTCTTGATTACATGAGCACTGTTCTAGGTGTCAAAAAAAGACAAGTATCATTAGGTTCTGGCTCTAAATCAAGAGACAAGGTTGTGATCGTGGAG
GAGGTAAGTATGCAAAGTGTTTTTGATGCTTTGAATAAAGCTTTAACATGCGAGTGA
mRNA sequenceShow/hide mRNA sequence
CGTGGATCAAAACTGTGACCGCTCGCGAAACCCCGTTTTGATTTGGAAAATATTCAAAATTGGGAAAAGAAATGCCTCCGGTGAAGAAGGGGAAAACGAAAGCTCCGAAA
GCTACTGAATCAATCCAATCCTCCGTCAAAACCAACAATTTCCCTTCTTGTCTTCGCTCTGTTTCTCCTTCTTCCGTCGCCATCACCATCCACGCAAAGCCTGGTTCCAA
GATCGCCTCTATCACAGACTTTGGCGATGATGCGTTGGGAGTGCAAATCGACGCACCGCCCAAAGATGGAGAAGCCAATGCTGCACTTCTTGATTACATGAGCACTGTTC
TAGGTGTCAAAAAAAGACAAGTATCATTAGGTTCTGGCTCTAAATCAAGAGACAAGGTTGTGATCGTGGAGGAGGTAAGTATGCAAAGTGTTTTTGATGCTTTGAATAAA
GCTTTAACATGCGAGTGATTCTATGGGATCTTGTGTAGACCCATTAACGAAATATCTGCATACTGTAATAAGTACAGGTGGATTAACTACTATTTTGCTCCTGAAACATT
TTTAGTCGTGTCACTGGAAAGTTATTCAGCTAGATGTAACACTTATATGTTTATATTTGATTTCATCCTAAACAATATTTTATTTGTTGTATATTTGTAGAGATTTTCAA
CTTATGTTACTGAACTGAAATAACTTAACCTGACTGCTAGCCTTATATGCTTCTGCAAAGATGATATTTCAGTGAAATTGAAA
Protein sequenceShow/hide protein sequence
MPPVKKGKTKAPKATESIQSSVKTNNFPSCLRSVSPSSVAITIHAKPGSKIASITDFGDDALGVQIDAPPKDGEANAALLDYMSTVLGVKKRQVSLGSGSKSRDKVVIVE
EVSMQSVFDALNKALTCE