CuGenDBv2

Tan0009850 (gene) of Snake gourd v1 genome

Gene ID: Tan0009850
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC103494847
Genome location: LG01:5369863..5370951
RNA-Seq Expression: Tan0009850
Synteny: Tan0009850
Gene Ontology terms: NA
InterPro domains: NA
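
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The helper name `parse_location` and the `Location` type are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'LG01:5369863..5370951'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG01:5369863..5370951")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 1089 bases on LG01.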


Homology
GenBank top hits | e value | % identity | Alignment
KAG6576890.1 hypothetical protein SDJN03_24464, partial [Cucurbita argyrosperma subsp. sororia] | 1.7e-63 | 80.75
Query:  VFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKKM
        +FAINLR P  H +IHNF+FQ  PSP  S +R PCQITYCRKK SDADLASDLA EVAKINTNLIQ EEAM KSRE LFTELCGFLGLKSEETKR WKKM
Subjt:  VFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKKM

Query:  DEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL
        DEEAKL+L+ EFVSEWGFNFQPLS RS KEMVEEYV NGEN  AISSASSLISSLK+++GL
Subjt:  DEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL

KAG7014917.1 hypothetical protein SDJN02_22548, partial [Cucurbita argyrosperma subsp. argyrosperma] | 4.4e-64 | 80.86
Query:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK
        M+FAINLR P  H +IHNF+FQ  PSP  S +R PCQITYCRKK SDADLASDLA EVAKINTNLIQ EEAM KSRE LFTELCGFLGLKSEETKR WKK
Subjt:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK

Query:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL
        MDEEAKL+L+ EFVSEWGFNFQPLS RS KEMVEEYV NGEN  AISSASSLISSLK+++GL
Subjt:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL

XP_022149245.1 uncharacterized protein LOC111017712 [Momordica charantia] | 1.2e-66 | 82.72
Query:  MVFAINL-RLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWK
        M+FAINL RLPH  +EIH+FRFQ  PSP  S NR PCQI+YC KKLSDA+LASDLATEVAK++TNLIQREEAMKKS+EFLFTELCGFLGLKSEETK+SWK
Subjt:  MVFAINL-RLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWK

Query:  KMDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL
        KMDEEAKL+LV EFV+EWGFNFQPLS RSVKE+VEEYVNGEN+SAISSA SLI SLKR MGL
Subjt:  KMDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL

XP_022922671.1 uncharacterized protein LOC111430603 [Cucurbita moschata] | 5.7e-64 | 80.86
Query:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK
        M+FAINLR P  H +IHNF+FQ  PSP  S +R PCQITYCRKK SDADLASDLA EVAKINTNLIQ EEAM KSRE LFTELCGFLGLKSEETKR WKK
Subjt:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK

Query:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL
        MDEEAKL+L+ EFVSEWGFNFQPLS RS KEMVEEYV NGEN  AISSASSLISSLK+++GL
Subjt:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL

XP_023551996.1 uncharacterized protein LOC111809798 [Cucurbita pepo subsp. pepo] | 3.9e-65 | 82.72
Query:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK
        M+FAINLR P  H +IHNF+FQ  PSP  S +R PCQITYCRKK SDADLASDLA EVAKINTNLIQ EEAM KSRE LFTELCGFLGLKSEETKR WKK
Subjt:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK

Query:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL
        MDEEAKL+LV EFVSEWGFNFQPLS RS KEMVEEYV NGEN  AISSASSLISSLK+TMGL
Subjt:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KYF6 Uncharacterized protein | 1.2e-64 | 83.85
Query:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK
        M+ AINLRLP     IH+FRFQP PSP  S NR P QITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELC FLGLKSEETKR W K
Subjt:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK

Query:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL
        M+EEAKL LV EFVSEWGFNFQPLS R VKEMVEEYVNGENL  ISSASS ISSLK+TMGL
Subjt:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL

A0A1S3BZE3 LOW QUALITY PROTEIN: uncharacterized protein LOC103494847 | 1.2e-59 | 80.12
Query:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK
        M+ AINLR P    EIHNFRFQP PSP    NR P QITYCRKKLSDADLA DLATEVAKINTNLIQREEAMKKSR F++  +  FLGLKSEETKR WKK
Subjt:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK

Query:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL
        M+EEAKL LV EFVSEWGFNFQPLS R VKEMVEEYVNGENL  ISSASSLISSLK+TMGL
Subjt:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL

A0A6J1D577 uncharacterized protein LOC111017712 | 5.9e-67 | 82.72
Query:  MVFAINL-RLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWK
        M+FAINL RLPH  +EIH+FRFQ  PSP  S NR PCQI+YC KKLSDA+LASDLATEVAK++TNLIQREEAMKKS+EFLFTELCGFLGLKSEETK+SWK
Subjt:  MVFAINL-RLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWK

Query:  KMDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL
        KMDEEAKL+LV EFV+EWGFNFQPLS RSVKE+VEEYVNGEN+SAISSA SLI SLKR MGL
Subjt:  KMDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL

A0A6J1E4R4 uncharacterized protein LOC111430603 | 2.8e-64 | 80.86
Query:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK
        M+FAINLR P  H +IHNF+FQ  PSP  S +R PCQITYCRKK SDADLASDLA EVAKINTNLIQ EEAM KSRE LFTELCGFLGLKSEETKR WKK
Subjt:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK

Query:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL
        MDEEAKL+L+ EFVSEWGFNFQPLS RS KEMVEEYV NGEN  AISSASSLISSLK+++GL
Subjt:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL

A0A6J1J388 uncharacterized protein LOC111482994 | 5.2e-63 | 80.25
Query:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK
        M+FAINLR P  H +IHNF+FQ  PS   S +R PCQITYCRKK SDADLASDLA EVAKINTNLIQ EEAM KSRE LFT+LCGFL LKSEETKR WKK
Subjt:  MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKK

Query:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL
        MDEEAKL+LV EFVSEWGFNFQPLS RSVKEMVEEYV NGEN  AISSASSLISSLK+++GL
Subjt:  MDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEEYV-NGENLSAISSASSLISSLKRTMGL

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT5G64480.1 unknown protein | 1.8e-23 | 47.62
Query:  RKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKKMDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEE------
        +K ++D++LA+DLA E+ K NT   QR EAMKKS E L+ E C  + LK +E K  W K+ EE KL LV EFV EW  +FQPLS  SVKEMV++      
Subjt:  RKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKKMDEEAKLSLVMEFVSEWGFNFQPLSSRSVKEMVEE------

Query:  -YVNGENLSAISSASSLISSLKRTMG
          +   + S++SS S L   LKR +G
Subjt:  -YVNGENLSAISSASSLISSLKRTMG
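
In the alignments above, the middle row repeats a letter where query and hit are identical, shows `+` for a similar residue, and is blank for a mismatch or gap. The sketch below shows one way a percent identity could be derived from such a row, assuming identity is counted as identical columns divided by the aligned length (including gap columns). The function name `percent_identity` is hypothetical, and the page does not document how its own values are computed.

```python
def percent_identity(query_row: str, midline: str) -> float:
    """Percent identity from one aligned query row and its match row.

    Assumes letters in the match row mark identical residues, and that
    the aligned length equals the length of the query row (gaps included).
    """
    if not query_row:
        raise ValueError("query row is empty")
    # Restore trailing blanks that may have been trimmed from the match row.
    midline = midline.ljust(len(query_row))[: len(query_row)]
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(query_row), 2)


# First eight columns of the KAG7014917.1 alignment, copied from above.
print(percent_identity("MVFAINLR", "M+FAINLR"))
```

For a full hit, the rows of every alignment block would be concatenated before calling the function, taking care to preserve the original column spacing of the match row.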


Sequences
CDS sequence
ATGGTTTTCGCAATCAATCTTCGTCTTCCTCACGCCCATACCGAGATCCACAATTTCCGATTTCAACCTCTGCCGTCGCCGGTCTTTTCCGGGAACCGGCCGCCGTGCCA
AATCACCTACTGCAGAAAGAAACTCAGCGACGCGGATCTCGCCTCCGATCTCGCGACGGAAGTGGCGAAAATCAACACCAATTTGATTCAGAGAGAGGAGGCGATGAAGA
AGAGCAGAGAGTTTTTGTTCACGGAGCTCTGCGGATTTCTAGGGCTAAAATCGGAGGAGACGAAGAGAAGTTGGAAGAAGATGGACGAAGAGGCGAAATTGTCACTGGTT
ATGGAGTTTGTTTCCGAGTGGGGATTCAATTTTCAGCCATTGTCGTCTAGGTCTGTGAAGGAAATGGTGGAAGAATACGTTAATGGAGAAAATTTGTCTGCAATTTCTTC
TGCTTCATCGTTGATTTCTTCGTTGAAGAGAACAATGGGATTGTGA
mRNA sequence
AGTAAATCTCGAGAGATAGTGGAAGTAAATCGGATAAACTTAGCGAGCCTTCGAATTTTGGTTCAATCGCTCTCTCTTTCTCAAACCATATTATCGTCACTTTGTTTTCC
TGCAAATCTCTCGATTGAAGCATTGTCTAGAAGAAATATCTTCAAATGGTTTTCGCAATCAATCTTCGTCTTCCTCACGCCCATACCGAGATCCACAATTTCCGATTTCA
ACCTCTGCCGTCGCCGGTCTTTTCCGGGAACCGGCCGCCGTGCCAAATCACCTACTGCAGAAAGAAACTCAGCGACGCGGATCTCGCCTCCGATCTCGCGACGGAAGTGG
CGAAAATCAACACCAATTTGATTCAGAGAGAGGAGGCGATGAAGAAGAGCAGAGAGTTTTTGTTCACGGAGCTCTGCGGATTTCTAGGGCTAAAATCGGAGGAGACGAAG
AGAAGTTGGAAGAAGATGGACGAAGAGGCGAAATTGTCACTGGTTATGGAGTTTGTTTCCGAGTGGGGATTCAATTTTCAGCCATTGTCGTCTAGGTCTGTGAAGGAAAT
GGTGGAAGAATACGTTAATGGAGAAAATTTGTCTGCAATTTCTTCTGCTTCATCGTTGATTTCTTCGTTGAAGAGAACAATGGGATTGTGAAACACATAATTAGATGCAT
TCATGGAAGAAATCATGCATTTGGATCGTTAATTCCCAAATTCCATGTAAGATATTATCTGCATTTTTTTTTATGATTATGGATTTTAATTCTCAAATTGAGTAGAACGT
ATTTCTGTTTCTTATTTTTTCCCTTTACCAAAAGAATTTACCCTTTTTTTTTTCATTGGGATGGGGATGGAACCTTCAAAAGAAGAGTTAGCTCAAGTATTAAAATATAG
ACGTCCATATTGATAATTCAATTTTACCATGGATGTATTGAAATTCTTGTAAATAAGATCGACTTTGATAATTTTCTGATTTTTAGTTTTTAAAAATTAAGACTCAGGTA
GTTTACAAAAATTAAGACTCAGGTAGTTTTTTAAATTTAGCTAAATATTCATATAAAAATGGATGAGAAAACCACCATAATTTTTAAAAACTAAAAATA
Protein sequence
MVFAINLRLPHAHTEIHNFRFQPLPSPVFSGNRPPCQITYCRKKLSDADLASDLATEVAKINTNLIQREEAMKKSREFLFTELCGFLGLKSEETKRSWKKMDEEAKLSLV
MEFVSEWGFNFQPLSSRSVKEMVEEYVNGENLSAISSASSLISSLKRTMGL
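
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below is a minimal illustration that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used. The `translate` helper is hypothetical, and only the first and last codons of the CDS are copied in to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragments copied from the CDS sequence above (start and end, both in frame).
cds_start = "ATGGTTTTCGCAATCAATCTTCGTCTTCCTCACGCCCATACC"
cds_end = "TTGAAGAGAACAATGGGATTGTGA"

print(translate(cds_start))  # expected to match the start of the protein sequence
print(translate(cds_end))    # expected to match the end of the protein sequence
```

The two fragments translate to MVFAINLRLPHAHT and LKRTMGL, which agree with the beginning and end of the listed protein sequence. The same function can be applied to the full CDS once its lines are joined into a single string.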