CuGenDBv2

Tan0001453 (gene) of Snake gourd v1 genome

Gene ID: Tan0001453
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF1218)
Genome location: LG05:6633380..6634501
RNA-Seq Expression: Tan0001453
Synteny: Tan0001453
Gene Ontology terms:
GO:0009855 - determination of bilateral symmetry (biological process)
GO:0010087 - phloem or xylem histogenesis (biological process)
GO:0010305 - leaf vascular tissue pattern formation (biological process)
GO:0010588 - cotyledon vascular tissue pattern formation (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2
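
The genome location field packs a sequence name and a coordinate range into one string. A minimal Python sketch of splitting it apart follows. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG05:6633380..6634501")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # LG05 6633380 6634501 1122
```

Under that assumption the gene spans 1122 bases on LG05.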


Homology
GenBank top hits (e value, % identity, alignment)
XP_004151872.1 uncharacterized protein LOC101212188 [Cucumis sativus] (e value: 3.4e-77; identity: 93.04%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MKL+GVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQ+ELEKS PNRQISIACL+FT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        WIILAVGMS+LVIGA+ NNKSRASCGFTHHHFLSIGGILCFVHGLFCVA+YV++ AAE
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

XP_008455827.1 PREDICTED: uncharacterized protein LOC103495927 [Cucumis melo] (e value: 2.6e-77; identity: 93.67%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MKL+GVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQ+ELEKS PNRQISIACLVFT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        WIILAVGMS+LVIGA+ NNKSRASCGFTHHHFLSIGGILCFVHGLFCVA+YV++ AAE
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

XP_022140603.1 uncharacterized protein LOC111011214 isoform X2 [Momordica charantia] (e value: 3.9e-73; identity: 87.97%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MK++GVLICLLVVAMDIVAGLLGIEA+IAQNKVK LRLWIFECR+PSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEE++KS PNRQ+S+ACL+FT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        WIILAVGMS+LVIG L+NNKSRASCGFTHHHFLSIGGILCFVH LFCVA+YVS+ A E
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

XP_022971119.1 uncharacterized protein LOC111469886 [Cucurbita maxima] (e value: 1.1e-72; identity: 90.45%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MKL+GVL+CLLVVAMDIVAGLL IEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGG NCIC +EELEKS PN+QISI CLVFT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA
        WIILAVGMSLLVIGALAN+KSR SCGFTHHHFLS+GGILCFVHGLFCVA+YVSS  A
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA

XP_038902419.1 uncharacterized protein LOC120089064 [Benincasa hispida] (e value: 3.7e-76; identity: 91.14%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        M+L+GVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKS PN+Q+SIACL+FT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        WIILAVGMS+LVIGAL NNKSRA+CGFTHHHFLSIGGILCFVHGLFCVA+YV++ A+E
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LR43 Uncharacterized protein (e value: 1.6e-77; identity: 93.04%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MKL+GVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQ+ELEKS PNRQISIACL+FT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        WIILAVGMS+LVIGA+ NNKSRASCGFTHHHFLSIGGILCFVHGLFCVA+YV++ AAE
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

A0A1S3C1S7 uncharacterized protein LOC103495927 (e value: 1.3e-77; identity: 93.67%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MKL+GVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQ+ELEKS PNRQISIACLVFT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        WIILAVGMS+LVIGA+ NNKSRASCGFTHHHFLSIGGILCFVHGLFCVA+YV++ AAE
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

A0A6J1CID0 uncharacterized protein LOC111011214 isoform X2 (e value: 1.9e-73; identity: 87.97%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MK++GVLICLLVVAMDIVAGLLGIEA+IAQNKVK LRLWIFECR+PSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEE++KS PNRQ+S+ACL+FT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        WIILAVGMS+LVIG L+NNKSRASCGFTHHHFLSIGGILCFVH LFCVA+YVS+ A E
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

A0A6J1G6V2 uncharacterized protein LOC111451405 (e value: 5.4e-73; identity: 89.17%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MKL+GVL+CLL+VAMDIVAGLL IEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGG NCICS+EELEKS PN+QISI CLVFT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA
        WIILAVGMSLLVIGALAN+KSR SCGF+HHHFLS+GGILCF+HGLFCVA+YVSS  A
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA

A0A6J1I7N7 uncharacterized protein LOC111469886 (e value: 5.4e-73; identity: 90.45%)
Query:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT
        MKL+GVL+CLLVVAMDIVAGLL IEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGG NCIC +EELEKS PN+QISI CLVFT
Subjt:  MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFT

Query:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA
        WIILAVGMSLLVIGALAN+KSR SCGFTHHHFLS+GGILCFVHGLFCVA+YVSS  A
Subjt:  WIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G05291.1 Protein of unknown function (DUF1218) (e value: 4.3e-22; identity: 38.06%)
Query:  VLICL-LVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGG--CNCICSQEELEKSHPNRQISIACLVFTWI
        +++C+ L V +DIVAG +G++A  AQ  VKH +L   EC+ PS+ AF LG+ A   L  AH+ AN++G    N   +   L K+      ++ACL   W+
Subjt:  VLICL-LVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGG--CNCICSQEELEKSHPNRQISIACLVFTWI

Query:  ILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA
        +   G  +L  G  +N +SR  C FT++H  SIGG +CF+H +    +Y+SS  A
Subjt:  ILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA

AT1G11500.1 Protein of unknown function (DUF1218) (e value: 1.7e-26; identity: 40.12%)
Query:  IGVLICLLVVAMDIVAGLLGIEADIAQNKV------KHLRLWIFEC-RDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIAC
        +G L+ ++++  DI A +LGIEA+IAQ+K       +H R     C R PS+ AF  G+ A  LL + H++AN+LGGC  I S+++ +++  N+ +++A 
Subjt:  IGVLICLLVVAMDIVAGLLGIEADIAQNKV------KHLRLWIFEC-RDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIAC

Query:  LVFTWIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        LV +WI   V  S L+IG LAN+++   C   H  F  IGGI C  HG+   A+YVS+ AA+
Subjt:  LVFTWIILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

AT2G32280.1 Protein of unknown function (DUF1218) (e value: 2.3e-60; identity: 63.69%)
Query:  KLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFTW
        K+ G+L+CL++V +D+ A +LGI+A++AQN+VKH+RLW+FECR+PS+ AF+LGLGAA +L +AH++ NL+GGC CICSQ+E ++S   RQIS+ACLV TW
Subjt:  KLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFTW

Query:  IILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
        I+ AVG   +VIG ++N+KSR+SCGFTHHHFLSIGGILCF+H LFCVA+YVS+ AA+
Subjt:  IILAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE

AT4G21310.1 Protein of unknown function (DUF1218) (e value: 3.9e-55; identity: 62.34%)
Query:  IGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFTWII
        +G  IC+L++AMD+ AG+LGIEA+IAQNKVKHL++WIFECRDPS  AFK GL A  LL LAH+ AN LGGC C+ S+++LEKS  N+Q+++A L+FTWII
Subjt:  IGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFTWII

Query:  LAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA
        LA+  S+L++G +AN++SR +CG +HH  LSIGGILCFVHGLF VA+Y+S+ A+
Subjt:  LAVGMSLLVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAA


Sequences
CDS sequence
ATGAAGTTGATTGGCGTGTTGATTTGTTTGCTGGTCGTGGCTATGGATATTGTGGCTGGCTTGCTCGGCATCGAAGCTGACATAGCACAGAACAAGGTGAAGCACTTGCG
GCTATGGATATTCGAGTGCAGAGACCCAAGTGAGCAAGCTTTCAAATTGGGATTAGGAGCAGCAGGGCTGTTGGGATTAGCCCACATAATTGCTAATTTGCTTGGTGGTT
GCAATTGCATTTGCTCTCAAGAAGAGCTTGAAAAATCTCATCCAAACAGGCAAATCTCCATCGCATGCCTCGTTTTCACATGGATAATTCTGGCTGTGGGAATGTCGTTG
CTGGTGATTGGGGCATTGGCCAACAACAAATCCAGAGCTTCTTGTGGATTCACACACCATCACTTTCTGTCGATCGGAGGGATTTTGTGTTTTGTTCATGGCTTGTTTTG
TGTTGCTTTTTATGTTTCTTCCGCTGCTGCTGAGTAA
mRNA sequence
CCCCATGAAAGGAATCCAAGATGAAAAAGAAAACCAACATTCACTCTTCAAAGTTAAAGCTTTCCAGTGTTTGTGTGTGTAATCTGTAAAGAAGCTTTTTAGCTAACGAC
CCCCCATAAAATTTCTCAAACACTGCTCTGCTTTGCTTTTTGAAAGGTCATAAATAGTCCATTTCCCCTTCTCATCAGCCTTTGCTTTCATTTCCCTCTCATACAAAGAA
GAAAAGGGATGAAGTTGATTGGCGTGTTGATTTGTTTGCTGGTCGTGGCTATGGATATTGTGGCTGGCTTGCTCGGCATCGAAGCTGACATAGCACAGAACAAGGTGAAG
CACTTGCGGCTATGGATATTCGAGTGCAGAGACCCAAGTGAGCAAGCTTTCAAATTGGGATTAGGAGCAGCAGGGCTGTTGGGATTAGCCCACATAATTGCTAATTTGCT
TGGTGGTTGCAATTGCATTTGCTCTCAAGAAGAGCTTGAAAAATCTCATCCAAACAGGCAAATCTCCATCGCATGCCTCGTTTTCACATGGATAATTCTGGCTGTGGGAA
TGTCGTTGCTGGTGATTGGGGCATTGGCCAACAACAAATCCAGAGCTTCTTGTGGATTCACACACCATCACTTTCTGTCGATCGGAGGGATTTTGTGTTTTGTTCATGGC
TTGTTTTGTGTTGCTTTTTATGTTTCTTCCGCTGCTGCTGAGTAATAATAATGAAAATTAATTACGTCACTAATATTGCTTAATGCTCCAAATTCCCTTTTTTAATTTTT
TAAATTAAAGGTCCTTGTCCTGTGTGGCATATGAGACAAAAACTAATCTAATCATTGTTCTTTTCTTCCCTATTAAGCATTTCCCTGTTCTGTTCTTATAACTTAAATGA
CTAGGAAGGCACAAATATTGGAATCTGTTTAGGGCATTACGTAACAGAGAGGGGAATTAAGGAGGCTTTTTTTTATGAAACACGG
Protein sequence
MKLIGVLICLLVVAMDIVAGLLGIEADIAQNKVKHLRLWIFECRDPSEQAFKLGLGAAGLLGLAHIIANLLGGCNCICSQEELEKSHPNRQISIACLVFTWIILAVGMSL
LVIGALANNKSRASCGFTHHHFLSIGGILCFVHGLFCVAFYVSSAAAE
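
The CDS and protein sequences above can be cross-checked by translation. The Python sketch below is a minimal example that assumes the standard genetic code (NCBI translation table 1), which this record does not state. The function name `translate` is hypothetical. To stay short it translates only the first 30 and the last 12 bases of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 12 bases of the CDS listed in this record.
print(translate("ATGAAGTTGATTGGCGTGTTGATTTGTTTG"))  # MKLIGVLICL
print(translate("GCTGCTGAGTAA"))                    # AAE*
```

Both fragments agree with the start (MKLIGVLICL) and end (AAE) of the listed protein sequence, and the final codon TAA is a stop.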