; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016794 (gene) of Snake gourd v1 genome

Gene IDTan0016794
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function, DUF538
Genome locationLG09:56630621..56633358
RNA-Seq ExpressionTan0016794
SyntenyTan0016794
Gene Ontology termsNA
InterPro domainsIPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588240.1 hypothetical protein SDJN03_16805, partial [Cucurbita argyrosperma subsp. sororia]3.9e-6884.18Show/hide
Query:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI
        +LSPFSLLLLILTI+ QTH++ S RDIAL STDIHELL LYGFPKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+VYYD  +KGKL +GSV DVSGI
Subjt:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI

Query:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        QAKKLFLWVSVTGIKANQGSGTIDFYVG+LSETLPAQQFQKIP C R+ C+G RTEA+
Subjt:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

KAG7022156.1 hypothetical protein SDJN02_15885, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-6985.44Show/hide
Query:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI
        +LSPFSLLLLILTI+ QTHL+ S RDIAL STDIHELL LYGFPKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+VYYD  +KGKL +GSV DVSGI
Subjt:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI

Query:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        QAKKLFLWVSVTGIKANQGSGTIDFYVG+LSETLPAQQFQKIPAC R+ C+G RTEA+
Subjt:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

XP_022933750.1 uncharacterized protein LOC111441073 [Cucurbita moschata]4.6e-6985.44Show/hide
Query:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI
        +LSPFSLLLLILTI  QTHL+ S+RDIAL STDIHELL LYGFPKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+VYYD  +KGKL +GSV DVSGI
Subjt:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI

Query:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        QAKKLFLWVSVTGIKANQGSGTIDFYVG+LSETLPAQQFQKIPAC R+ C+G RTEA+
Subjt:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

XP_023006181.1 uncharacterized protein LOC111498989 [Cucurbita maxima]1.2e-6985.44Show/hide
Query:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI
        +LSPFSLLLLILTI+ QTH++ S+RDIAL STDIHELL LYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDL+VYY+  +KGKLS+GSV DVSGI
Subjt:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI

Query:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        QAKKLFLWVSVTGI+ANQGSGTIDFYVG+LSETLPAQQFQKIPAC RKAC G  TEA+
Subjt:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

XP_023531629.1 uncharacterized protein LOC111793813 [Cucurbita pepo subsp. pepo]7.8e-6985.44Show/hide
Query:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI
        +LS FSLLLLILTI+ QTHL+ S+RDIAL STDIHELL LYGFPKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+VYYD  +KGKL +GSV DVSGI
Subjt:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI

Query:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        +AKKLFLWVSVTGI+ANQGSGTIDFYVG+LSETLPAQQFQKIPAC RKACLG RTEA+
Subjt:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

TrEMBL top hitse value%identityAlignment
A0A5A7UJ73 Uncharacterized protein2.0e-5467.68Show/hide
Query:  MASFSITLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSV
        MASFS  LSPFSL LLIL ISTQTHLSFS RD+ LKS+DIH+LL LYGFP GLLP+NVKSYTLSDDGSF IEL+S CYV+F  +VYY   IKGKLS+GS+
Subjt:  MASFSITLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSV

Query:  TDVSGIQAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        +DVSGIQ KKLF W+ +TG++    S +I+F VG LSE LP   F+ IP CR+KACL  +TEA+
Subjt:  TDVSGIQAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

A0A6J1DIG2 uncharacterized protein LOC1110208037.2e-6074.71Show/hide
Query:  MASFSITLSPFSLLLLILTISTQTHLSFSIRD---------IALKST-DIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNK
        MASFS T+  FS  LLIL I TQTHLS S+ D           LKST D+HELL  YGFPKGLLP+NVKSYTLSDDGSFEIELESECYVKF L+VYYD K
Subjt:  MASFSITLSPFSLLLLILTISTQTHLSFSIRD---------IALKST-DIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNK

Query:  IKGKLSFGSVTDVSGIQAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        IKGKLS+GSV D SGIQAKKLFLWVSVTGIKAN   GTIDF+VG LSETL AQQFQKIP C+R  CLG RTEAI
Subjt:  IKGKLSFGSVTDVSGIQAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

A0A6J1F5P9 uncharacterized protein LOC1114410732.2e-6985.44Show/hide
Query:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI
        +LSPFSLLLLILTI  QTHL+ S+RDIAL STDIHELL LYGFPKGLLPNNVKSYTLS DGSFEIELESECYVKFDL+VYYD  +KGKL +GSV DVSGI
Subjt:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI

Query:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        QAKKLFLWVSVTGIKANQGSGTIDFYVG+LSETLPAQQFQKIPAC R+ C+G RTEA+
Subjt:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

A0A6J1H9I3 uncharacterized protein LOC1114613584.2e-6074.39Show/hide
Query:  MASFSITLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSV
        MASFS  LSPFSL LLIL ISTQTHLSFS  D  L  TDIHELL  YGFP GLLPNNVKSYTLSDDG+FEIEL+++CYV F  +VYY+ KI GKLS+GSV
Subjt:  MASFSITLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSV

Query:  TDVSGIQAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        TDVSGIQ KKLFLW+SV+G K+NQGSGTI F+VG LSET PA+ F+ IP CRRK CLG RTEA+
Subjt:  TDVSGIQAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

A0A6J1KX25 uncharacterized protein LOC1114989895.8e-7085.44Show/hide
Query:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI
        +LSPFSLLLLILTI+ QTH++ S+RDIAL STDIHELL LYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDL+VYY+  +KGKLS+GSV DVSGI
Subjt:  TLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGI

Query:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI
        QAKKLFLWVSVTGI+ANQGSGTIDFYVG+LSETLPAQQFQKIPAC RKAC G  TEA+
Subjt:  QAKKLFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55265.1 Protein of unknown function, DUF5383.8e-3757.85Show/hide
Query:  ALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKF-DLMVYYDNKIKGKLSFGSVTDVSGIQAKKLFLWVSVTGIKANQGSGTIDFY
        +L + DIH+LL  YGFPKGLLPNNVKSYT+SDDG F ++L S CYVKF D +V+Y   I GKLS+GSV DV GIQAK+ FLW+ +T ++++  S T+ F 
Subjt:  ALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKF-DLMVYYDNKIKGKLSFGSVTDVSGIQAKKLFLWVSVTGIKANQGSGTIDFY

Query:  VGILSETLPAQQFQKIPACRR
        VG +S+TLPA  F+ +P+C R
Subjt:  VGILSETLPAQQFQKIPACRR

AT1G61667.1 Protein of unknown function, DUF5383.0e-1835.46Show/hide
Query:  LLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSD-DGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGIQAKKL
        LLLL+L +   T  S          + I  LL   G P GL P+NV+SY+L D  G  E++L++ C+ +F+  VY+D  IK  LS+G +  + G+  ++L
Subjt:  LLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSD-DGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGIQAKKL

Query:  FLWVSVTGIKANQ-GSGTIDFYVGILSETLPAQQFQKIPAC
        FLW+ V GI  N   SG + F +G+  + +    F+  P C
Subjt:  FLWVSVTGIKANQ-GSGTIDFYVGILSETLPAQQFQKIPAC

AT3G07460.1 Protein of unknown function, DUF5381.6e-1633.6Show/hide
Query:  SIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLS-DDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGIQAKKLFLWVSVTGIKAN-QGS
        SI  +  ++  I E+L   G P GL P  VK +T++ + G F + L   C  K++  ++YD  + G + +  + D+SGI A++LFLW+ V GI+ +   S
Subjt:  SIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLS-DDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGIQAKKLFLWVSVTGIKAN-QGS

Query:  GTIDFYVGILSETLPAQQFQKIPAC
        G I F VG+L +      F+    C
Subjt:  GTIDFYVGILSETLPAQQFQKIPAC

AT5G19860.1 Protein of unknown function, DUF5383.3e-2540.38Show/hide
Query:  ITLSPF--SLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDV
        + L PF  S+ +  LT+ T T  S +  D    ST ++ELL  YG P GLLP+ V  +TLSDDG F + L + C ++FD +V+YD  I G++ +GS+T++
Subjt:  ITLSPF--SLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDV

Query:  SGIQAKKLFLWVSVTGIKAN-QGSGTIDFYVGILSETLPAQQFQKIPACRRKACLG
         GIQ KK F+W+ V  IK +   S +I F VG +++ L   QF+ I +C      G
Subjt:  SGIQAKKLFLWVSVTGIKAN-QGSGTIDFYVGILSETLPAQQFQKIPACRRKACLG

AT5G54530.1 Protein of unknown function, DUF5382.1e-1935.46Show/hide
Query:  LLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGIQAKKLF
        L+  ++ + T   LS S+   +  +  +H++L   G P GLLP  V SY L +DG  E+ L + CY KF+  V+++  ++G LS+GS+  V G+  K+LF
Subjt:  LLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGIQAKKLF

Query:  LWVSVTGIKA-NQGSGTIDFYVGILSETLPAQQFQKIPACR
        LW+ V  I   N  SG I F +G+  + L    F+  P C+
Subjt:  LWVSVTGIKA-NQGSGTIDFYVGILSETLPAQQFQKIPACR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTTCTCCATTACCCTCTCCCCATTCTCACTTCTCCTTCTAATTCTCACCATTTCCACTCAGACCCATCTCTCCTTTTCCATAAGGGATATCGCCCTCAAGTC
AACAGACATCCACGAACTCTTGACCCTTTACGGTTTCCCAAAGGGTCTCTTACCCAACAATGTCAAGTCCTACACTCTCTCAGACGACGGTAGCTTCGAAATCGAACTCG
AAAGCGAGTGTTATGTGAAGTTCGATTTGATGGTCTATTACGATAACAAAATCAAGGGGAAATTGAGTTTTGGGTCTGTCACGGATGTTTCTGGAATTCAAGCCAAGAAA
CTGTTCTTGTGGGTCTCTGTTACTGGAATCAAGGCTAATCAGGGCTCTGGAACCATCGATTTTTATGTTGGGATTTTGTCTGAGACTTTGCCGGCTCAACAGTTCCAGAA
GATTCCTGCATGTAGAAGGAAGGCTTGCCTAGGACATAGAACAGAGGCCATATGA
mRNA sequenceShow/hide mRNA sequence
AAGCGAACGCAGGCAAGTCTTCGTTCTTTCTTATCACTTTCCACACCTCAGCTTCCTTCAATGGCTTCCTTCTCCATTACCCTCTCCCCATTCTCACTTCTCCTTCTAAT
TCTCACCATTTCCACTCAGACCCATCTCTCCTTTTCCATAAGGGATATCGCCCTCAAGTCAACAGACATCCACGAACTCTTGACCCTTTACGGTTTCCCAAAGGGTCTCT
TACCCAACAATGTCAAGTCCTACACTCTCTCAGACGACGGTAGCTTCGAAATCGAACTCGAAAGCGAGTGTTATGTGAAGTTCGATTTGATGGTCTATTACGATAACAAA
ATCAAGGGGAAATTGAGTTTTGGGTCTGTCACGGATGTTTCTGGAATTCAAGCCAAGAAACTGTTCTTGTGGGTCTCTGTTACTGGAATCAAGGCTAATCAGGGCTCTGG
AACCATCGATTTTTATGTTGGGATTTTGTCTGAGACTTTGCCGGCTCAACAGTTCCAGAAGATTCCTGCATGTAGAAGGAAGGCTTGCCTAGGACATAGAACAGAGGCCA
TATGAGGTGGGAATAATAGCTTTAATTTCCAAGCATTTTTATCCTTTTTCATTTGGAGTGAGAAGCAAAGCATGTTACATTTGCAGGTGAAATCATGTATCTTTGTTTTT
ATGTCATGAACCTACAGTGCCCCTTGTAAGTCCTTTTTATATATGTTGTATCTATATCTATGGTATAAAATTTCTGCTTGTTTACCAAATGTTTCTTTCAATAATATGAT
ATTCTTTTACTTTA
Protein sequenceShow/hide protein sequence
MASFSITLSPFSLLLLILTISTQTHLSFSIRDIALKSTDIHELLTLYGFPKGLLPNNVKSYTLSDDGSFEIELESECYVKFDLMVYYDNKIKGKLSFGSVTDVSGIQAKK
LFLWVSVTGIKANQGSGTIDFYVGILSETLPAQQFQKIPACRRKACLGHRTEAI