; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013863 (gene) of Snake gourd v1 genome

Gene IDTan0013863
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRossmann-like alpha/beta/alpha sandwich fold containing protein
Genome locationLG01:28214265..28216807
RNA-Seq ExpressionTan0013863
SyntenyTan0013863
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596257.1 Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. sororia]9.2e-9988.32Show/hide
Query:  MAFARCLLALPFQTSKHPISSIF-----TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLS
        MAFARCLL LP +T KHP SSIF     +SSST CSFTV FNS SRRRG F LPISTLCCK GHRVKAK  DSEAT+VA SFTEFKHLLLPITDRNPFLS
Subjt:  MAFARCLLALPFQTSKHPISSIF-----TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLS

Query:  EGTRQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANL
        EGTRQAIATTAALAKNNGADITV+LIDEKQKESFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSMEA+HSKHVDANL
Subjt:  EGTRQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANL

Query:  LAEFIPCPVMLLPL
        LAEFIPCPVMLLPL
Subjt:  LAEFIPCPVMLLPL

KAG7027803.1 hypothetical protein SDJN02_08980, partial [Cucurbita argyrosperma subsp. argyrosperma]9.2e-9988.32Show/hide
Query:  MAFARCLLALPFQTSKHPISSIF-----TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLS
        MAFARCLL LP +T KHP SSIF     +SSST CSFTV FNS SRRRG F LPISTLCCK GHRVKAK  DSEAT+VA SFTEFKHLLLPITDRNPFLS
Subjt:  MAFARCLLALPFQTSKHPISSIF-----TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLS

Query:  EGTRQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANL
        EGTRQAIATTAALAKNNGADITV+LIDEKQKESFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSMEA+HSKHVDANL
Subjt:  EGTRQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANL

Query:  LAEFIPCPVMLLPL
        LAEFIPCPVMLLPL
Subjt:  LAEFIPCPVMLLPL

XP_022943871.1 uncharacterized protein LOC111448467 isoform X1 [Cucurbita moschata]3.2e-9990Show/hide
Query:  MAFARCLLALPFQTSKHPISSIF-TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR
        MAFARCLL LP +T KHP SSIF +SSST CSFTV FNS SRRRG F LPISTLCCK GHRVKAK  DSEAT+VA SFTEFKHLLLPITDRNPFLSEGTR
Subjt:  MAFARCLLALPFQTSKHPISSIF-TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR

Query:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF
        QAIATTAALAKNNGADITV+LIDEKQKESFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSMEA+HSKHVDANLLAEF
Subjt:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF

Query:  IPCPVMLLPL
        IPCPVMLLPL
Subjt:  IPCPVMLLPL

XP_022943879.1 uncharacterized protein LOC111448467 isoform X2 [Cucurbita moschata]3.2e-9990Show/hide
Query:  MAFARCLLALPFQTSKHPISSIF-TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR
        MAFARCLL LP +T KHP SSIF +SSST CSFTV FNS SRRRG F LPISTLCCK GHRVKAK  DSEAT+VA SFTEFKHLLLPITDRNPFLSEGTR
Subjt:  MAFARCLLALPFQTSKHPISSIF-TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR

Query:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF
        QAIATTAALAKNNGADITV+LIDEKQKESFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSMEA+HSKHVDANLLAEF
Subjt:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF

Query:  IPCPVMLLPL
        IPCPVMLLPL
Subjt:  IPCPVMLLPL

XP_022971539.1 uncharacterized protein LOC111470227 [Cucurbita maxima]5.4e-9989.15Show/hide
Query:  MAFARCLLALPFQTSKHPISSIF---TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEG
        MAFARCLL LP +T KHP SSIF   +SSST CSFTV FNS SRRRG F LPISTLCCK GHRVKAK  DSEAT+VA SFTEFKHLLLPITDRNPFLSEG
Subjt:  MAFARCLLALPFQTSKHPISSIF---TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEG

Query:  TRQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLA
        TRQAIATTAALAKNNGADITV+LIDEKQKESFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSMEA+HSKHVDANLLA
Subjt:  TRQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLA

Query:  EFIPCPVMLLPL
        EFIPCPVMLLPL
Subjt:  EFIPCPVMLLPL

TrEMBL top hitse value%identityAlignment
A0A0A0L108 Uncharacterized protein6.0e-9688.1Show/hide
Query:  MAFARCLLALPFQTSKHPIS-SIFTSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR
        MA ARC L  P +TSKHP+S SIFTSSSTD SF + F+S SRR  GFKLPI+TLCCKM HR+KAK QDSEATLVA SFTEFKHLLLPITDRNP+LSEGTR
Subjt:  MAFARCLLALPFQTSKHPIS-SIFTSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR

Query:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF
        QAIATTAALAKNNGADITVVLID KQK+S PEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF
Subjt:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF

Query:  IPCPVMLLPL
        IPCPVMLLPL
Subjt:  IPCPVMLLPL

A0A6J1CTX9 uncharacterized protein LOC1110147501.8e-9588.1Show/hide
Query:  MAFARCLLALPFQTSKHPISSIFT-SSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR
        MA ARCLLALP +TSKHP   IFT SSSTD SF V FNS SRR G FKLPIS+LCCKM HR+KAK QDSEATLVA SFT+FKHLLLPITDRNPFLSEGTR
Subjt:  MAFARCLLALPFQTSKHPISSIFT-SSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR

Query:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF
        QAIA TAALAKNNGADITVVLIDEKQKESFPEHE QLSSIRWHLSEGG+QEF+LLERLGEGSKPTAIIGEVAD+LNLDLVV+SMEAIHSKHVDANLLAEF
Subjt:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF

Query:  IPCPVMLLPL
        IPCPVMLLPL
Subjt:  IPCPVMLLPL

A0A6J1FSW2 uncharacterized protein LOC111448467 isoform X11.5e-9990Show/hide
Query:  MAFARCLLALPFQTSKHPISSIF-TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR
        MAFARCLL LP +T KHP SSIF +SSST CSFTV FNS SRRRG F LPISTLCCK GHRVKAK  DSEAT+VA SFTEFKHLLLPITDRNPFLSEGTR
Subjt:  MAFARCLLALPFQTSKHPISSIF-TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR

Query:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF
        QAIATTAALAKNNGADITV+LIDEKQKESFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSMEA+HSKHVDANLLAEF
Subjt:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF

Query:  IPCPVMLLPL
        IPCPVMLLPL
Subjt:  IPCPVMLLPL

A0A6J1FVI3 uncharacterized protein LOC111448467 isoform X21.5e-9990Show/hide
Query:  MAFARCLLALPFQTSKHPISSIF-TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR
        MAFARCLL LP +T KHP SSIF +SSST CSFTV FNS SRRRG F LPISTLCCK GHRVKAK  DSEAT+VA SFTEFKHLLLPITDRNPFLSEGTR
Subjt:  MAFARCLLALPFQTSKHPISSIF-TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTR

Query:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF
        QAIATTAALAKNNGADITV+LIDEKQKESFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSMEA+HSKHVDANLLAEF
Subjt:  QAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEF

Query:  IPCPVMLLPL
        IPCPVMLLPL
Subjt:  IPCPVMLLPL

A0A6J1I8V3 uncharacterized protein LOC1114702272.6e-9989.15Show/hide
Query:  MAFARCLLALPFQTSKHPISSIF---TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEG
        MAFARCLL LP +T KHP SSIF   +SSST CSFTV FNS SRRRG F LPISTLCCK GHRVKAK  DSEAT+VA SFTEFKHLLLPITDRNPFLSEG
Subjt:  MAFARCLLALPFQTSKHPISSIF---TSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEG

Query:  TRQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLA
        TRQAIATTAALAKNNGADITV+LIDEKQKESFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSMEA+HSKHVDANLLA
Subjt:  TRQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLA

Query:  EFIPCPVMLLPL
        EFIPCPVMLLPL
Subjt:  EFIPCPVMLLPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G66090.1 unknown protein1.4e-5756.87Show/hide
Query:  SKHPISSIFTSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCK------------MGHRVKAKSQDSEATL----VASSFTEFKHLLLPITDRNPFLSEGT
        S H I+++    +     T+P +S+S        P+S+L  K            +  RVKA+++++EA+     V  +F+  KHLLLP+ DRNP+LSEGT
Subjt:  SKHPISSIFTSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCK------------MGHRVKAKSQDSEATL----VASSFTEFKHLLLPITDRNPFLSEGT

Query:  RQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAE
        RQA ATT +LAK  GADITVV+IDE+++ES  EHE Q+S+IRWHLSEGGF+EFKLLERLGEG K TAIIGEVAD+L ++LVV+SMEAIHSK++DANLLAE
Subjt:  RQAIATTAALAKNNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAE

Query:  FIPCPVMLLPL
        FIPCPV+LLPL
Subjt:  FIPCPVMLLPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTTGCTCGCTGCTTGCTCGCTCTTCCTTTCCAAACTTCGAAGCATCCCATTTCTTCCATTTTCACTTCTTCTTCCACTGACTGTTCTTTTACTGTCCCGTTTAA
TTCCACTTCTCGTCGGCGCGGCGGTTTTAAGCTCCCAATCTCCACTCTCTGCTGCAAAATGGGTCATCGCGTAAAAGCCAAGTCACAAGATTCGGAAGCAACATTAGTTG
CAAGCTCCTTCACTGAATTCAAACACCTGCTGCTTCCAATAACCGATCGCAATCCGTTTCTTTCAGAGGGAACGAGACAGGCTATTGCTACTACTGCTGCTTTGGCAAAG
AACAATGGGGCTGACATAACAGTAGTCTTGATTGATGAAAAGCAGAAGGAATCGTTTCCGGAGCATGAGAACCAACTCTCGAGCATTCGTTGGCATTTGTCTGAAGGTGG
ATTCCAAGAGTTTAAATTGTTAGAGCGACTGGGGGAAGGGAGCAAGCCAACAGCAATCATTGGGGAAGTGGCTGATGATCTAAACTTAGATTTGGTAGTTCTAAGCATGG
AAGCCATTCATTCTAAGCATGTGGATGCAAACCTGTTGGCTGAGTTCATTCCATGCCCTGTTATGCTCTTGCCACTATGA
mRNA sequenceShow/hide mRNA sequence
TCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCAACTATGGCGTTTGCTCGCTGCTTGCTCGCTCTTCCTTTCCAAACTTCGAAGCATCCCA
TTTCTTCCATTTTCACTTCTTCTTCCACTGACTGTTCTTTTACTGTCCCGTTTAATTCCACTTCTCGTCGGCGCGGCGGTTTTAAGCTCCCAATCTCCACTCTCTGCTGC
AAAATGGGTCATCGCGTAAAAGCCAAGTCACAAGATTCGGAAGCAACATTAGTTGCAAGCTCCTTCACTGAATTCAAACACCTGCTGCTTCCAATAACCGATCGCAATCC
GTTTCTTTCAGAGGGAACGAGACAGGCTATTGCTACTACTGCTGCTTTGGCAAAGAACAATGGGGCTGACATAACAGTAGTCTTGATTGATGAAAAGCAGAAGGAATCGT
TTCCGGAGCATGAGAACCAACTCTCGAGCATTCGTTGGCATTTGTCTGAAGGTGGATTCCAAGAGTTTAAATTGTTAGAGCGACTGGGGGAAGGGAGCAAGCCAACAGCA
ATCATTGGGGAAGTGGCTGATGATCTAAACTTAGATTTGGTAGTTCTAAGCATGGAAGCCATTCATTCTAAGCATGTGGATGCAAACCTGTTGGCTGAGTTCATTCCATG
CCCTGTTATGCTCTTGCCACTATGATTTTTGTACATTTTGTAACCAATATCAGAGTTTTATATAGTCATATGTGTTCATGTTTTTTCAGTAATAATGTACTGGGTGTACT
TGTGATTATGTAAAGTGCTTTTGGAGTTAAATTAGGTCAGGTTAATATTATACAATGGTTGGGTTAAAATGTTAATTTACAGAGGCAGCCTTTGAACTT
Protein sequenceShow/hide protein sequence
MAFARCLLALPFQTSKHPISSIFTSSSTDCSFTVPFNSTSRRRGGFKLPISTLCCKMGHRVKAKSQDSEATLVASSFTEFKHLLLPITDRNPFLSEGTRQAIATTAALAK
NNGADITVVLIDEKQKESFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMEAIHSKHVDANLLAEFIPCPVMLLPL