; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015397 (gene) of Snake gourd v1 genome

Gene IDTan0015397
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4050 domain-containing protein
Genome locationLG02:95502037..95504689
RNA-Seq ExpressionTan0015397
SyntenyTan0015397
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR025124 - Domain of unknown function DUF4050


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020386.1 hypothetical protein SDJN02_17070 [Cucurbita argyrosperma subsp. argyrosperma]9.2e-10192.82Show/hide
Query:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW
        MYSRCCLLSRLEGCSSKPCCSFLQFSG+YLRALIVL+VDN+KLLFHRRSC G CTGPALG+AMDGPS GLRVEDQEAKKQCLPENF SSSTCEMDNSTVW
Subjt:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW

Query:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        SQRSMASAQSHDSH+N+GSST+FVNSGLLLWNETRKQWVGNK SESQK+VREPKISWNATY+SLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
Subjt:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

XP_022951409.1 uncharacterized protein LOC111454240 isoform X1 [Cucurbita moschata]9.2e-10192.82Show/hide
Query:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW
        MYSRCCLLSRLEGCSSKPCCSFLQFSG+YLRALIVL+VDN+KLLFHRRSC G CTGPALG+AMDGPS GLRVEDQEAKKQCLPENF SSSTCEMDNSTVW
Subjt:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW

Query:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        SQRSMASAQSHDSH+N+GSST+FVNSGLLLWNETRKQWVGNK SESQK+VREPKISWNATY+SLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
Subjt:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

XP_023002499.1 uncharacterized protein LOC111496323 isoform X1 [Cucurbita maxima]6.0e-10091.79Show/hide
Query:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW
        MYSRCCLLSRLEGCSSKPCCSFLQFSG+YLRALIVL+VDN+KLLFHRRSC G CTGPALG+AMDGPS GLRV+DQEAKKQCLP+NF SSSTCEMDNSTVW
Subjt:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW

Query:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        SQRSMASAQSHDSH+N+GSST+FVNSGLLLWNETRKQWVGNK SESQK+VREPKISWNATY+SLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
Subjt:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

XP_023537962.1 uncharacterized protein LOC111798846 [Cucurbita pepo subsp. pepo]6.6e-9991.79Show/hide
Query:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW
        MYSRCCLLSRLEGCSSKPCCSFLQFSG+YLRALIVL+VDN+KLLFHRRSC G CTGPALG+AMDG S GLRVEDQEAKKQCLPENF SSS CEMDNSTVW
Subjt:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW

Query:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        SQRSMASAQSHDSH+N+GSST+FVNSGLLLWNETRKQWVGNK S+SQKQVREPKISWNATY+SLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
Subjt:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

XP_038885342.1 uncharacterized protein LOC120075759 isoform X1 [Benincasa hispida]4.7e-9788.78Show/hide
Query:  MYSRCCLLSRLEGCSS-KPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL RLEGCSS KPCCSFLQFSGEYLRALI+LMVDN+KLLFHRRSCHGCCT  AL NAMDGPS+GLRV+DQEAKKQCLPEN PSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSS-KPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        WSQRSMASA SHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNK SE QKQV+EPKISW+ATY+SLL TNKPFPE +PL EMI+FLVDVWEQ+GLYD
Subjt:  WSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

TrEMBL top hitse value%identityAlignment
A0A0A0LPL3 Uncharacterized protein2.1e-9587.76Show/hide
Query:  MYSRCCLLSRLEGCSSK-PCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALI+LMVD +KLLFH+R   GCCT  ALGNAMDGPS+GLRV+++EAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSK-PCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        WSQRSMAS Q+HDSHSNIGSSTDFVNSGLLLWNETRKQWVGNK S SQKQV+EPKISWNATY++LLTTNKPFPEAIPL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

A0A1S3BC47 uncharacterized protein LOC103488310 isoform X13.3e-9688.78Show/hide
Query:  MYSRCCLLSRLEGCSSK-PCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALI+LMVD +KLLFH+R   GCC+  ALGNAMDGPS+GLRV+D+EAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSK-PCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        WSQRSMASAQSHDS SNIGSSTDFVNSGLLLWNETRKQWVGNK S+SQKQV+EPKISWNATY+SLLTTNKPFPEAIPL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

A0A5A7VGA9 Uncharacterized protein3.3e-9688.78Show/hide
Query:  MYSRCCLLSRLEGCSSK-PCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV
        MYSRCCLL+RLEGCSSK PCCSFLQFSGEY+RALI+LMVD +KLLFH+R   GCC+  ALGNAMDGPS+GLRV+D+EAKKQCLPENFPSSSTCEMDNSTV
Subjt:  MYSRCCLLSRLEGCSSK-PCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTV

Query:  WSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        WSQRSMASAQSHDS SNIGSSTDFVNSGLLLWNETRKQWVGNK S+SQKQV+EPKISWNATY+SLLTTNKPFPEAIPL EMIEFLVDVWEQEGLYD
Subjt:  WSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

A0A6J1GIP5 uncharacterized protein LOC111454240 isoform X14.5e-10192.82Show/hide
Query:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW
        MYSRCCLLSRLEGCSSKPCCSFLQFSG+YLRALIVL+VDN+KLLFHRRSC G CTGPALG+AMDGPS GLRVEDQEAKKQCLPENF SSSTCEMDNSTVW
Subjt:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW

Query:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        SQRSMASAQSHDSH+N+GSST+FVNSGLLLWNETRKQWVGNK SESQK+VREPKISWNATY+SLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
Subjt:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

A0A6J1KQM2 uncharacterized protein LOC111496323 isoform X12.9e-10091.79Show/hide
Query:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW
        MYSRCCLLSRLEGCSSKPCCSFLQFSG+YLRALIVL+VDN+KLLFHRRSC G CTGPALG+AMDGPS GLRV+DQEAKKQCLP+NF SSSTCEMDNSTVW
Subjt:  MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVW

Query:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        SQRSMASAQSHDSH+N+GSST+FVNSGLLLWNETRKQWVGNK SESQK+VREPKISWNATY+SLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
Subjt:  SQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15350.1 unknown protein1.0e-2548.55Show/hide
Query:  DGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASA-QSHDSHS---NIGSSTDFVNSGLLLWNETRKQWVG-NKRSESQKQVREPKISWN
        D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+ Q+ DS S   N  +  ++VN GLLLWN+TR++WVG +K +      +  K++WN
Subjt:  DGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASA-QSHDSHS---NIGSSTDFVNSGLLLWNETRKQWVG-NKRSESQKQVREPKISWN

Query:  -ATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
         ATY+SLL +NK FP+ IPL EM++FLVD+WEQEGLYD
Subjt:  -ATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

AT1G15350.2 unknown protein1.0e-2548.55Show/hide
Query:  DGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASA-QSHDSHS---NIGSSTDFVNSGLLLWNETRKQWVG-NKRSESQKQVREPKISWN
        D PS  +    +  KK  + E+F S+ST +MDN T  SQ S++S+ Q+ DS S   N  +  ++VN GLLLWN+TR++WVG +K +      +  K++WN
Subjt:  DGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASA-QSHDSHS---NIGSSTDFVNSGLLLWNETRKQWVG-NKRSESQKQVREPKISWN

Query:  -ATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
         ATY+SLL +NK FP+ IPL EM++FLVD+WEQEGLYD
Subjt:  -ATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

AT4G32342.1 unknown protein4.8e-3148.1Show/hide
Query:  NVKLLFHRRSCHGCCTGP-ALGNAMDGPSRGLRVEDQEAKK-QCLPENFPSSSTCEMD-NSTVWSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRK
        N K L +  +C GCC     L   +D PS+GL+++ +  KK     ++F S+STC+MD N T+ SQ   +S    D   +  +ST+FVN GL+LWN TR+
Subjt:  NVKLLFHRRSCHGCCTGP-ALGNAMDGPSRGLRVEDQEAKK-QCLPENFPSSSTCEMD-NSTVWSQRSMASAQSHDSHSNIGSSTDFVNSGLLLWNETRK

Query:  QWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLY
        QW     +  Q  V EP ISWN+TY+SLL+TNK FP+ IPL EM+ FLVDVWE+EGLY
Subjt:  QWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLY

AT5G25360.1 unknown protein6.1e-4257.72Show/hide
Query:  CHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQSHDSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKRSES
        C GCC  P L  A+D PS+GLR++ +  KK  + E+F S+STCEMDNST+ SQRSM+S    ++ S   S+   T+FVN GL LWN+TR+QW+ N  S+ 
Subjt:  CHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQSHDSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKRSES

Query:  QKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        + +VREP ISWNATYESLL  NK F   IPL EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD

AT5G25360.2 unknown protein6.1e-4257.72Show/hide
Query:  CHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQSHDSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKRSES
        C GCC  P L  A+D PS+GLR++ +  KK  + E+F S+STCEMDNST+ SQRSM+S    ++ S   S+   T+FVN GL LWN+TR+QW+ N  S+ 
Subjt:  CHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQSHDSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKRSES

Query:  QKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD
        + +VREP ISWNATYESLL  NK F   IPL EM++FLVDVWEQEGLYD
Subjt:  QKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATTCTAGGTGTTGTCTCCTCAGCCGCTTAGAGGGCTGCTCTAGCAAACCATGTTGCTCGTTTTTACAGTTTTCTGGAGAATATCTGCGCGCTCTTATAGTTTTGAT
GGTGGATAATGTCAAGCTTCTTTTCCACAGAAGAAGCTGTCATGGATGCTGCACTGGACCTGCACTAGGTAATGCAATGGATGGGCCGTCTAGAGGTCTGAGAGTTGAAG
ACCAAGAAGCAAAGAAACAATGCTTACCGGAAAATTTCCCAAGCTCTAGCACATGTGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATGGCATCAGCCCAGTCA
CATGATTCCCACAGCAATATTGGGAGCAGTACAGACTTCGTAAATTCTGGACTTCTTCTTTGGAATGAAACCAGGAAGCAATGGGTCGGAAATAAAAGGTCAGAGAGCCA
AAAGCAAGTTCGAGAACCCAAAATAAGTTGGAATGCTACTTACGAGAGCTTATTAACGACGAACAAGCCATTCCCCGAGGCCATACCTCTTGCTGAGATGATAGAGTTTC
TTGTTGATGTCTGGGAGCAGGAGGGTCTATATGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTATTCTAGGTGTTGTCTCCTCAGCCGCTTAGAGGGCTGCTCTAGCAAACCATGTTGCTCGTTTTTACAGTTTTCTGGAGAATATCTGCGCGCTCTTATAGTTTTGAT
GGTGGATAATGTCAAGCTTCTTTTCCACAGAAGAAGCTGTCATGGATGCTGCACTGGACCTGCACTAGGTAATGCAATGGATGGGCCGTCTAGAGGTCTGAGAGTTGAAG
ACCAAGAAGCAAAGAAACAATGCTTACCGGAAAATTTCCCAAGCTCTAGCACATGTGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATGGCATCAGCCCAGTCA
CATGATTCCCACAGCAATATTGGGAGCAGTACAGACTTCGTAAATTCTGGACTTCTTCTTTGGAATGAAACCAGGAAGCAATGGGTCGGAAATAAAAGGTCAGAGAGCCA
AAAGCAAGTTCGAGAACCCAAAATAAGTTGGAATGCTACTTACGAGAGCTTATTAACGACGAACAAGCCATTCCCCGAGGCCATACCTCTTGCTGAGATGATAGAGTTTC
TTGTTGATGTCTGGGAGCAGGAGGGTCTATATGACTGA
Protein sequenceShow/hide protein sequence
MYSRCCLLSRLEGCSSKPCCSFLQFSGEYLRALIVLMVDNVKLLFHRRSCHGCCTGPALGNAMDGPSRGLRVEDQEAKKQCLPENFPSSSTCEMDNSTVWSQRSMASAQS
HDSHSNIGSSTDFVNSGLLLWNETRKQWVGNKRSESQKQVREPKISWNATYESLLTTNKPFPEAIPLAEMIEFLVDVWEQEGLYD