CuGenDBv2

Tan0015904 (gene) of Snake gourd v1 genome

Gene ID: Tan0015904
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function, DUF538
Genome location: LG09:56705265..56706597
RNA-Seq Expression: Tan0015904
Synteny: Tan0015904
Gene Ontology terms: NA
InterPro domains: IPR007493 - Protein of unknown function DUF538
                  IPR036758 - At5g01610-like superfamily
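
The genome location string in this record can be split into a linkage group and a coordinate pair. The sketch below is illustrative only: the `Location` type and `parse_location` helper are hypothetical names, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG09:56705265..56706597'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("LG09:56705265..56706597")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 1333 bp on LG09.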


Homology
GenBank top hits (e-value, % identity, alignment)
TYK28128.1 uncharacterized protein E5676_scaffold289G00340 [Cucumis melo var. makuwa] | e-value: 8.6e-49 | identity: 66.01%
Query:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV
        MASFSR    FSL LLIL IS + HLSF   ++ LKS+DIHD+L LYG P GLLP NVKSYTLSDDGSF I+L+S+CYV F  LV+Y K IKGKLSYGS+
Subjt:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV

Query:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        +D+SGIQ KK FAW P+TG++   D ++IEFQVGFLSE LP  MF  IP+CRK
Subjt:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

XP_008450299.1 PREDICTED: uncharacterized protein LOC103491951 [Cucumis melo] | e-value: 8.6e-49 | identity: 66.01%
Query:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV
        MASFSR    FSL LLIL IS + HLSF   ++ LKS+DIHD+L LYG P GLLP NVKSYTLSDDGSF I+L+S+CYV F  LV+Y K IKGKLSYGS+
Subjt:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV

Query:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        +D+SGIQ KK FAW P+TG++   D ++IEFQVGFLSE LP  MF  IP+CRK
Subjt:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

XP_022933750.1 uncharacterized protein LOC111441073 [Cucurbita moschata] | e-value: 5.5e-48 | identity: 67.79%
Query:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS
        MAS S FSLLLLILTI  + HL+  V +IAL STDIH++L LYG PKGLLP+NVKSYTLS DGSFEI+LES CYV F  LV+YDKI+KGKL YGSVAD+S
Subjt:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS

Query:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        GIQ+KK F W  +TGIKAN    TI+F VG LSETLP + F  IP+C +
Subjt:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

XP_023006181.1 uncharacterized protein LOC111498989 [Cucurbita maxima] | e-value: 1.5e-48 | identity: 67.11%
Query:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS
        MAS S FSLLLLILTI+ + H++  V +IAL STDIH++L LYG PKGLLP+NVKSYTLSDDGSFEI+LES CYV F  LV+Y+KI+KGKLSYGSVAD+S
Subjt:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS

Query:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        GIQ+KK F W  +TGI+AN    TI+F VG LSETLP + F  IP+C +
Subjt:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

XP_023531629.1 uncharacterized protein LOC111793813 [Cucurbita pepo subsp. pepo] | e-value: 2.5e-48 | identity: 67.11%
Query:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS
        MAS SRFSLLLLILTI+ + HL+  V +IAL STDIH++L LYG PKGLLP+NVKSYTLS DGSFEI+LES CYV F  LV+YDKI+KGKL YGSVAD+S
Subjt:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS

Query:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        GI++KK F W  +TGI+AN    TI+F VG LSETLP + F  IP+C +
Subjt:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
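
The % identity column can be cross-checked against the middle line of each alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for the XP_022933750.1 hit. The function name `midline_stats` is hypothetical, and the calculation assumes each midline block is padded with spaces to the full width of its alignment block.

```python
def midline_stats(midline_blocks):
    """Count identities, positives and columns from BLAST-style midlines.

    Assumption: each block has exactly one character per alignment column
    (spaces included), so the summed length equals the alignment length.
    """
    midline = "".join(midline_blocks)
    columns = len(midline)
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return identities, positives, columns


# Midlines of the XP_022933750.1 alignment, copied without the 8-column label prefix.
blocks = [
    "MAS S FSLLLLILTI  + HL+  V +IAL STDIH++L LYG PKGLLP+NVKSYTLS DGSFEI+LES CYV F  LV+YDKI+KGKL YGSVAD+S",
    "GIQ+KK F W  +TGIKAN    TI+F VG LSETLP + F  IP+C +",
]

identities, positives, columns = midline_stats(blocks)
percent_identity = round(100 * identities / columns, 2)
print(identities, columns, percent_identity)
```

For this hit the count gives 101 identical residues over 149 columns, which agrees with the 67.79% reported in the table.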

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BPX6 uncharacterized protein LOC103491951 | e-value: 4.1e-49 | identity: 66.01%
Query:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV
        MASFSR    FSL LLIL IS + HLSF   ++ LKS+DIHD+L LYG P GLLP NVKSYTLSDDGSF I+L+S+CYV F  LV+Y K IKGKLSYGS+
Subjt:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV

Query:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        +D+SGIQ KK FAW P+TG++   D ++IEFQVGFLSE LP  MF  IP+CRK
Subjt:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

A0A5A7UJ73 Uncharacterized protein | e-value: 4.1e-49 | identity: 66.01%
Query:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV
        MASFSR    FSL LLIL IS + HLSF   ++ LKS+DIHD+L LYG P GLLP NVKSYTLSDDGSF I+L+S+CYV F  LV+Y K IKGKLSYGS+
Subjt:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV

Query:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        +D+SGIQ KK FAW P+TG++   D ++IEFQVGFLSE LP  MF  IP+CRK
Subjt:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

A0A5D3DXH8 Uncharacterized protein | e-value: 4.1e-49 | identity: 66.01%
Query:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV
        MASFSR    FSL LLIL IS + HLSF   ++ LKS+DIHD+L LYG P GLLP NVKSYTLSDDGSF I+L+S+CYV F  LV+Y K IKGKLSYGS+
Subjt:  MASFSR----FSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSV

Query:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        +D+SGIQ KK FAW P+TG++   D ++IEFQVGFLSE LP  MF  IP+CRK
Subjt:  ADISGIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

A0A6J1F5P9 uncharacterized protein LOC111441073 | e-value: 2.7e-48 | identity: 67.79%
Query:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS
        MAS S FSLLLLILTI  + HL+  V +IAL STDIH++L LYG PKGLLP+NVKSYTLS DGSFEI+LES CYV F  LV+YDKI+KGKL YGSVAD+S
Subjt:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS

Query:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        GIQ+KK F W  +TGIKAN    TI+F VG LSETLP + F  IP+C +
Subjt:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

A0A6J1KX25 uncharacterized protein LOC111498989 | e-value: 7.1e-49 | identity: 67.11%
Query:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS
        MAS S FSLLLLILTI+ + H++  V +IAL STDIH++L LYG PKGLLP+NVKSYTLSDDGSFEI+LES CYV F  LV+Y+KI+KGKLSYGSVAD+S
Subjt:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADIS

Query:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK
        GIQ+KK F W  +TGI+AN    TI+F VG LSETLP + F  IP+C +
Subjt:  GIQSKKFFAWFPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRK

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G55265.1 Protein of unknown function, DUF538 | e-value: 1.5e-38 | identity: 58.06%
Query:  ALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFT-SLVFYDKIIKGKLSYGSVADISGIQSKKFFAWFPLTGIKANPDRRTIEFQ
        +L + DIHD+L  YG PKGLLP+NVKSYT+SDDG F + L SSCYV F+  LVFY K I GKLSYGSV D+ GIQ+K+ F W P+T ++++P   T+ F 
Subjt:  ALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFT-SLVFYDKIIKGKLSYGSVADISGIQSKKFFAWFPLTGIKANPDRRTIEFQ

Query:  VGFLSETLPVEMFADIPSCRKNPN
        VGF+S+TLP  MF ++PSC +N N
Subjt:  VGFLSETLPVEMFADIPSCRKNPN

AT1G61667.1 Protein of unknown function, DUF538 | e-value: 4.4e-19 | identity: 35.34%
Query:  TDIHDVLSLYGLPKGLLPHNVKSYTLSD-DGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADISGIQSKKFFAWFPLTGIKAN-PDRRTIEFQVGF
        + I ++L   GLP GL P NV+SY+L D  G  E++L++ C+  F + V++D++IK  LSYG +  + G+  ++ F W P+ GI  N P    + F +G 
Subjt:  TDIHDVLSLYGLPKGLLPHNVKSYTLSD-DGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADISGIQSKKFFAWFPLTGIKAN-PDRRTIEFQVGF

Query:  LSETLPVEMFADIPSC
          + +   +F D P C
Subjt:  LSETLPVEMFADIPSC

AT3G07460.1 Protein of unknown function, DUF538 | e-value: 1.5e-14 | identity: 28.19%
Query:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLS-DDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADI
        M    + +LL  +L   I       +  +  ++  I ++L   GLP GL P  VK +T++ + G F + L  SC   + + + YD+I+ G + Y  + D+
Subjt:  MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLS-DDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADI

Query:  SGIQSKKFFAWFPLTGIKAN-PDRRTIEFQVGFLSETLPVEMFADIPSC
        SGI +++ F W  + GI+ + P    I F VG L +   + +F     C
Subjt:  SGIQSKKFFAWFPLTGIKAN-PDRRTIEFQVGFLSETLPVEMFADIPSC

AT5G19860.1 Protein of unknown function, DUF538 | e-value: 4.9e-26 | identity: 45.69%
Query:  IHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADISGIQSKKFFAWFPLTGIKAN-PDRRTIEFQVGFLSE
        ++++L  YGLP GLLP  V  +TLSDDG F + L +SC + F  LV YDK I G++ YGS+ ++ GIQ KKFF W  +  IK + P   +I F+VGF+++
Subjt:  IHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADISGIQSKKFFAWFPLTGIKAN-PDRRTIEFQVGFLSE

Query:  TLPVEMFADIPSCRKN
         L ++ F  I SC  N
Subjt:  TLPVEMFADIPSCRKN

AT5G54530.1 Protein of unknown function, DUF538 | e-value: 2.8e-21 | identity: 35.46%
Query:  LLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADISGIQSKKFF
        ++LL+ T+ +   LS P          +HDVL   GLP GLLP  V SY L +DG  E+ L + CY  F + V ++ +++G LSYGS+  + G+  K+ F
Subjt:  LLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADISGIQSKKFF

Query:  AWFPLTGIKA-NPDRRTIEFQVGFLSETLPVEMFADIPSCR
         W  +  I   NP+   I F +G   + L + +F D P C+
Subjt:  AWFPLTGIKA-NPDRRTIEFQVGFLSETLPVEMFADIPSCR


Sequences
CDS sequence
ATGGCTTCCTTCTCCAGATTCTCACTTTTGCTTCTAATTCTCACCATTTCCATTGAAGCCCATCTCTCATTTCCCGTAGGGGAAATCGCCCTGAAGTCAACAGACATCCA
CGATGTTTTGTCCCTTTACGGTCTTCCAAAGGGTCTCTTACCCCACAATGTAAAGTCCTACACTCTCTCAGACGACGGCAGCTTTGAAATCAAACTGGAAAGCAGTTGTT
ATGTGCTTTTCACTTCTTTAGTCTTTTACGATAAGATAATCAAGGGGAAATTGAGTTATGGGTCTGTCGCTGATATTTCTGGAATTCAATCGAAGAAGTTTTTCGCGTGG
TTCCCTCTTACTGGGATTAAGGCCAATCCAGACCGCCGAACCATCGAATTTCAAGTTGGGTTTTTGTCTGAGACTTTGCCGGTCGAAATGTTCGCGGACATTCCTTCATG
TAGAAAAAACCCGAACAGGGACGTTGCCTAG
mRNA sequence
GCAAAGTCTTCTCTTCCCTCTTCTGCTGATAACTTTTTCGATTTTCCAGACCTCAAAACCTGCATCAATGGCTTCCTTCTCCAGATTCTCACTTTTGCTTCTAATTCTCA
CCATTTCCATTGAAGCCCATCTCTCATTTCCCGTAGGGGAAATCGCCCTGAAGTCAACAGACATCCACGATGTTTTGTCCCTTTACGGTCTTCCAAAGGGTCTCTTACCC
CACAATGTAAAGTCCTACACTCTCTCAGACGACGGCAGCTTTGAAATCAAACTGGAAAGCAGTTGTTATGTGCTTTTCACTTCTTTAGTCTTTTACGATAAGATAATCAA
GGGGAAATTGAGTTATGGGTCTGTCGCTGATATTTCTGGAATTCAATCGAAGAAGTTTTTCGCGTGGTTCCCTCTTACTGGGATTAAGGCCAATCCAGACCGCCGAACCA
TCGAATTTCAAGTTGGGTTTTTGTCTGAGACTTTGCCGGTCGAAATGTTCGCGGACATTCCTTCATGTAGAAAAAACCCGAACAGGGACGTTGCCTAGGACAAACAACGG
AACCCATTTCATCAGGAGGGAATAATAGCTTTTCAAGCATTTTTATCCTTTTCAT
Protein sequence
MASFSRFSLLLLILTISIEAHLSFPVGEIALKSTDIHDVLSLYGLPKGLLPHNVKSYTLSDDGSFEIKLESSCYVLFTSLVFYDKIIKGKLSYGSVADISGIQSKKFFAW
FPLTGIKANPDRRTIEFQVGFLSETLPVEMFADIPSCRKNPNRDVA
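
The CDS and protein sequences in this record should correspond under the standard genetic code. The sketch below shows one way to check that, using only the first 30 and last 24 nucleotides of the CDS to stay short. The `translate` helper is a hypothetical name, and the use of the standard nuclear codon table is an assumption that the page does not confirm.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCTTCCTTCTCCAGATTCTCACTTTTG"  # first 30 nt of the CDS above
cds_end = "AACCCGAACAGGGACGTTGCCTAG"          # last 24 nt of the CDS above

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MASFSRFSLL and NPNRDVA, which match the start and end of the listed protein sequence. Passing the full 471-nt CDS as a single string would extend the same check to the whole protein.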