CuGenDBv2

Tan0020232 (gene) of Snake gourd v1 genome

Gene ID: Tan0020232
Organism: Trichosanthes anguina (Snake gourd v1)
Description: protein CMSS1
Genome location: LG10:12851674..12854834
RNA-Seq Expression: Tan0020232
Synteny: Tan0020232
Gene Ontology terms: NA
InterPro domains: IPR032704 - Protein Cms1
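
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, it could be split into its parts and the genomic span computed as below. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG10:12851674..12854834")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers the whole gene model, so it is expected to be longer than the CDS and mRNA sequences listed further down whenever introns are present.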


Homology
GenBank top hits (e value, % identity, alignment)
TYK16952.1 protein CMSS1 [Cucumis melo var. makuwa] | e value: 1.7e-57 | % identity: 70.97
Query:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKD CILGPP+SSVQDDK+LVKHIK                   +TEPGSPAVLIISTSALRSIELLKG RS+TQECHAVKLFSKHMKVEEQV+LLKNRV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV
        NIA         LIDIEALGLSRLAVIVLD+ PDVKGYSLFSLPQVR  F          RVV+G+LR+CL+GPLQP RKRRKKEV
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV

XP_008453922.1 PREDICTED: protein CMSS1 [Cucumis melo] | e value: 1.7e-57 | % identity: 70.97
Query:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKD CILGPP+SSVQDDK+LVKHIK                   +TEPGSPAVLIISTSALRSIELLKG RS+TQECHAVKLFSKHMKVEEQV+LLKNRV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV
        NIA         LIDIEALGLSRLAVIVLD+ PDVKGYSLFSLPQVR  F          RVV+G+LR+CL+GPLQP RKRRKKEV
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV

XP_022138200.1 protein CMS1 [Momordica charantia] | e value: 1.7e-57 | % identity: 71.81
Query:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKDTCILGPPQSSVQDDKTL+K+IK                   K EPGSPAVLIISTSALRSIELLK LRSLTQECHAVKLFSKHMKVEEQVELLKN V
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEVTT
        NIA         LIDIEALGLSRLAVIVLDI PDVKGYSLFSLPQVR  F          RVV+ KLR+CL+GPLQP RK+RKKEVT+
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEVTT

XP_023527304.1 protein CMS1 [Cucurbita pepo subsp. pepo] | e value: 1.3e-57 | % identity: 70.9
Query:  MKDTCILGPPQSSVQDDKTLVKHI-------------KGR----KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MK  CILGPPQSS+QDDKTLVKHI             KG+    +T PGSPAVLIISTSA+R+IELLKG RSLTQECHAVKLFSKHMK+EEQVELLKNRV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHI-------------KGR----KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEVTTP
        NIA         LIDIEALGLSRLAVIVLDI PDVKGYSLFSLPQVR  F          RVV+G LR+CL+GPLQP RKRRKKEVT+P
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEVTTP

XP_038906109.1 protein CMS1 isoform X3 [Benincasa hispida] | e value: 1.6e-58 | % identity: 70.9
Query:  MKDTCILGPPQSSVQDDKTLVKHI-------------KGR----KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKD CILGPP+SSVQDDKTLVKHI             KG+    KTEPGSPAVLIISTSALRSIELLKG RSLT+ECHAVKLFSKHMK+EEQV+LLKNRV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHI-------------KGR----KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEVTTP
        NIA         LID EALGLSRLAVIVLDI PD+KGYSLFSLPQVR  F          RVV+G+LR+CL+GPLQP RKRRKKE+T+P
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEVTTP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KUA9 Uncharacterized protein | e value: 1.9e-57 | % identity: 69.35
Query:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKD CILGPP+SSVQDDK+LVKH+K                   +TEPGSPAVLIISTSALRSIELLKG RS+TQECHAVKLFSKHMKVEEQV+LLKNRV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV
        NIA         LIDIEALGLSRLAVIVLD+ PDVKGYSLFSLPQVR  F          R+V+G+LR+CL+GPLQP RKRRKKE+
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV

A0A1S3BYM0 protein CMSS1 | e value: 8.3e-58 | % identity: 70.97
Query:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKD CILGPP+SSVQDDK+LVKHIK                   +TEPGSPAVLIISTSALRSIELLKG RS+TQECHAVKLFSKHMKVEEQV+LLKNRV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV
        NIA         LIDIEALGLSRLAVIVLD+ PDVKGYSLFSLPQVR  F          RVV+G+LR+CL+GPLQP RKRRKKEV
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV

A0A5A7TR86 Protein CMSS1 | e value: 8.3e-58 | % identity: 70.97
Query:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKD CILGPP+SSVQDDK+LVKHIK                   +TEPGSPAVLIISTSALRSIELLKG RS+TQECHAVKLFSKHMKVEEQV+LLKNRV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV
        NIA         LIDIEALGLSRLAVIVLD+ PDVKGYSLFSLPQVR  F          RVV+G+LR+CL+GPLQP RKRRKKEV
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV

A0A5D3CZG6 Protein CMSS1 | e value: 8.3e-58 | % identity: 70.97
Query:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKD CILGPP+SSVQDDK+LVKHIK                   +TEPGSPAVLIISTSALRSIELLKG RS+TQECHAVKLFSKHMKVEEQV+LLKNRV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV
        NIA         LIDIEALGLSRLAVIVLD+ PDVKGYSLFSLPQVR  F          RVV+G+LR+CL+GPLQP RKRRKKEV
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEV

A0A6J1C8S6 protein CMS1 | e value: 8.3e-58 | % identity: 71.81
Query:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        MKDTCILGPPQSSVQDDKTL+K+IK                   K EPGSPAVLIISTSALRSIELLK LRSLTQECHAVKLFSKHMKVEEQVELLKN V
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIKGR-----------------KTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEVTT
        NIA         LIDIEALGLSRLAVIVLDI PDVKGYSLFSLPQVR  F          RVV+ KLR+CL+GPLQP RK+RKKEVT+
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKEVTT

SwissProt top hits (e value, % identity, alignment)
Q2T9Y1 Uncharacterized protein C3orf26 homolog | e value: 5.5e-06 | % identity: 32.03
Query:  KHIKGRK--TEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV----------NIALIDIEALGLSRLAVIVLDIH-PD
        K +K RK  +E  S  +LII  SA+R++EL++ + +   +   +KLF+KH+KV+EQV+LL+ RV             LI    L L+ L  ++ D +  D
Subjt:  KHIKGRK--TEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV----------NIALIDIEALGLSRLAVIVLDIH-PD

Query:  VKGYSLFSLPQVRATFTRVVD-GKLRMC
         K   +  +P++R     +++ G L +C
Subjt:  VKGYSLFSLPQVRATFTRVVD-GKLRMC

Q5FVR6 Protein CMSS1 | e value: 4.9e-07 | % identity: 32.03
Query:  KHIKGRKT--EPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNR-VNIA---------LIDIEALGLSRLAVIVLDIH-PD
        K +K RKT  E  S  +LI+ +SA+R++EL++ L +   +   +KLF+KH+KV+EQV+LL+ R +++          L+  + L L+ L  +V D +  D
Subjt:  KHIKGRKT--EPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNR-VNIA---------LIDIEALGLSRLAVIVLDIH-PD

Query:  VKGYSLFSLPQVRATFTRVVD-GKLRMC
         K   +  +P++R     ++D G   +C
Subjt:  VKGYSLFSLPQVRATFTRVVD-GKLRMC

Q5XJK9 Protein CMSS1 | e value: 9.9e-08 | % identity: 32
Query:  TEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRVN----------IALIDIEALGLSRLAVIVLD-IHPDVKGYSLFSL
        T+  S  +LI+  SALR+I+L+K L +   +   +KLF+KH+KVEEQ++ L   V            AL++ E L +  L  +VLD  + D K   +  +
Subjt:  TEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRVN----------IALIDIEALGLSRLAVIVLD-IHPDVKGYSLFSL

Query:  PQVRATFTRVVD-GKLRMCLYGPLQ
        P+V+    +++D G ++ C  G ++
Subjt:  PQVRATFTRVVD-GKLRMCLYGPLQ

Q9BQ75 Protein CMSS1 | e value: 7.1e-06 | % identity: 32.03
Query:  KHIKGRK--TEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV----------NIALIDIEALGLSRLAVIVLDIH-PD
        K +K RK  +E  S  +LII +SA+R++EL++ + +   +   +KLF+KH+KV+ QV+LL+ RV             L+    L LS L  +V D +  D
Subjt:  KHIKGRK--TEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV----------NIALIDIEALGLSRLAVIVLDIH-PD

Query:  VKGYSLFSLPQVRATFTRVVD-GKLRMC
         K   +  +P++R     +++ G L +C
Subjt:  VKGYSLFSLPQVRATFTRVVD-GKLRMC

Q9CZT6 Protein CMSS1 | e value: 4.9e-07 | % identity: 32.81
Query:  KHIKGRKT--EPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV----------NIALIDIEALGLSRLAVIVLDIH-PD
        K +K RKT  E  S  +LI+ +SA+R++EL++ L +   +   +KLF+KH+KV+EQV+LL+ RV             L+  + L L+ L  +V D +  D
Subjt:  KHIKGRKT--EPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV----------NIALIDIEALGLSRLAVIVLDIH-PD

Query:  VKGYSLFSLPQVRATFTRVVD-GKLRMC
         K   +  +P++R     ++D G   +C
Subjt:  VKGYSLFSLPQVRATFTRVVD-GKLRMC

Arabidopsis top hits (e value, % identity, alignment)
AT2G43110.1 unknown protein | e value: 3.7e-42 | % identity: 52.43
Query:  MKDTCILGPPQSSVQDDKTLVKHIK-----------------GRKTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV
        +KDTCI+   Q   QD   L +HIK                  RK EPG+P+VL+IS+SALRS+ELL+GL SLT++C AVKLFSKH+KVEEQV LLK RV
Subjt:  MKDTCILGPPQSSVQDDKTLVKHIK-----------------GRKTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRV

Query:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKE
        NI          L+DIEALGLSRL +IV+D+HPDVKG+SLF+LPQVR  F          RV++G+LR+C+YGP   P  ++K +
Subjt:  NIA---------LIDIEALGLSRLAVIVLDIHPDVKGYSLFSLPQVRATF---------TRVVDGKLRMCLYGPLQPPRKRRKKE


Sequences
CDS sequence
ATGAAAGATACATGCATTCTGGGTCCTCCTCAAAGTTCAGTTCAGGATGATAAAACTTTGGTGAAACATATCAAAGGAAGAAAAACTGAACCAGGAAGTCCTGCTGTTCT
TATAATCAGCACATCTGCATTGAGGTCGATTGAACTTTTAAAGGGTTTACGATCACTAACTCAAGAATGCCATGCTGTTAAGCTATTTTCAAAGCACATGAAGGTTGAGG
AACAGGTAGAGCTGTTGAAGAACCGTGTTAACATTGCATTGATCGATATTGAGGCATTGGGGCTATCCAGATTAGCAGTTATCGTGCTTGACATTCACCCAGATGTTAAG
GGCTATTCTTTATTTTCACTTCCACAAGTCAGAGCTACTTTCACCCGAGTAGTCGATGGAAAGCTGCGAATGTGCCTCTATGGACCTTTACAACCTCCTCGGAAACGAAG
GAAAAAAGAAGTTACCACTCCAACGGAATAA
mRNA sequence
ATGAAAGATACATGCATTCTGGGTCCTCCTCAAAGTTCAGTTCAGGATGATAAAACTTTGGTGAAACATATCAAAGGAAGAAAAACTGAACCAGGAAGTCCTGCTGTTCT
TATAATCAGCACATCTGCATTGAGGTCGATTGAACTTTTAAAGGGTTTACGATCACTAACTCAAGAATGCCATGCTGTTAAGCTATTTTCAAAGCACATGAAGGTTGAGG
AACAGGTAGAGCTGTTGAAGAACCGTGTTAACATTGCATTGATCGATATTGAGGCATTGGGGCTATCCAGATTAGCAGTTATCGTGCTTGACATTCACCCAGATGTTAAG
GGCTATTCTTTATTTTCACTTCCACAAGTCAGAGCTACTTTCACCCGAGTAGTCGATGGAAAGCTGCGAATGTGCCTCTATGGACCTTTACAACCTCCTCGGAAACGAAG
GAAAAAAGAAGTTACCACTCCAACGGAATAAATCACTGACGAAGTTGAAATTTTGATCATCCATTTTGGCTATGTATGTATCCCTCAAATATTTTGTCCCACAAACTAAT
TAACTCTCAATCATTTTTCCTTGTCCTCCCTTCCAATGTACTTGGTTCTTTAGATTGTATCAAAATTTTGCAACTGTAGGATATACCTGAAAGCCCGAGTTGA
Protein sequence
MKDTCILGPPQSSVQDDKTLVKHIKGRKTEPGSPAVLIISTSALRSIELLKGLRSLTQECHAVKLFSKHMKVEEQVELLKNRVNIALIDIEALGLSRLAVIVLDIHPDVK
GYSLFSLPQVRATFTRVVDGKLRMCLYGPLQPPRKRRKKEVTTPTE
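
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code and a CDS that starts in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, built in TCAG order (an assumption: the page does
# not state which translation table was used for this gene model).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first; a trailing partial codon
    is ignored; codons with non-ACGT characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS shown above.
print(translate("ATGAAAGATACATGCATTCTGGGTCCTCCTCAAAGT"))
```

To check the full record, the wrapped CDS lines can be joined and passed to `translate`, and the result compared with the joined protein lines.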