CuGenDBv2

Tan0000418 (gene) of Snake gourd v1 genome

Gene ID: Tan0000418
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG01:34021359..34052513
RNA-Seq Expression: Tan0000418
Synteny: Tan0000418
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: NA
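
The genome location field above packs a sequence ID and a coordinate range into one string. As a minimal sketch, the following splits it into its parts and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


seqid, start, end = parse_location("LG01:34021359..34052513")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 31155 bp on LG01.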


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035445.1 putative polyprotein [Cucumis melo var. makuwa], e value: 5.1e-07, % identity: 83.78
Query:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECP
        LLAF KL GDNY T KSNLNTILV+DD RFVLTEECP
Subjt:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECP

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia], e value: 3.9e-07, % identity: 81.58
Query:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP
        LLA +KL GDNYG  KSNLNTILV+DD RFVLTEECPP
Subjt:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP

XP_022158791.1 uncharacterized protein LOC111025258 [Momordica charantia], e value: 3.9e-07, % identity: 81.58
Query:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP
        LLA +KL GDNYG  KSNLNTILV+DD RFVLTEECPP
Subjt:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP

XP_038876370.1 uncharacterized protein LOC120068812, partial [Benincasa hispida], e value: 1.0e-07, % identity: 84.21
Query:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP
        LLA  KL GDNYGT KSNLNTILV+DD RFVLTEECPP
Subjt:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP

XP_038904195.1 uncharacterized protein LOC120090541 [Benincasa hispida], e value: 1.7e-07, % identity: 81.58
Query:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP
        LLA  KL GDNYGT KSN+NTILV+DD RFVLTEECPP
Subjt:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T1X5 Putative polyprotein, e value: 2.5e-07, % identity: 83.78
Query:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECP
        LLAF KL GDNY T KSNLNTILV+DD RFVLTEECP
Subjt:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECP

A0A6J1DFQ7 uncharacterized protein LOC111020396 isoform X3, e value: 4.2e-07, % identity: 51.79
Query:  SLCCQLKSSSNSSVSLLSNNCNLVSVCLDTTNYVLWRYQISPLLKSHKLFKYADGS
        SLC  +++  NS + LLSN CNL+S+ LD+TN++LW++Q++ +LK+HKLF + DGS
Subjt:  SLCCQLKSSSNSSVSLLSNNCNLVSVCLDTTNYVLWRYQISPLLKSHKLFKYADGS

A0A6J1DIP4 uncharacterized protein LOC111020396 isoform X1, e value: 4.2e-07, % identity: 51.79
Query:  SLCCQLKSSSNSSVSLLSNNCNLVSVCLDTTNYVLWRYQISPLLKSHKLFKYADGS
        SLC  +++  NS + LLSN CNL+S+ LD+TN++LW++Q++ +LK+HKLF + DGS
Subjt:  SLCCQLKSSSNSSVSLLSNNCNLVSVCLDTTNYVLWRYQISPLLKSHKLFKYADGS

A0A6J1DWG6 uncharacterized protein LOC111025021, e value: 1.9e-07, % identity: 81.58
Query:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP
        LLA +KL GDNYG  KSNLNTILV+DD RFVLTEECPP
Subjt:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP

A0A6J1E205 uncharacterized protein LOC111025258, e value: 1.9e-07, % identity: 81.58
Query:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP
        LLA +KL GDNYG  KSNLNTILV+DD RFVLTEECPP
Subjt:  LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found
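
The % identity values in the hit lists can be checked against the alignments themselves. The sketch below counts the columns where the middle line of an alignment repeats the query residue and divides by the alignment length. It assumes a BLAST-style midline (a letter for an identical residue, `+` for a conservative substitution, a space for a mismatch) and an ungapped alignment. The function name `percent_identity` is hypothetical and not part of any BLAST tool.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character equals the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is an identity only when the midline repeats the query letter;
    # '+' (conservative substitution) and ' ' (mismatch) are not counted.
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return 100.0 * identical / len(query)


# Query and midline of the KAA0035445.1 alignment from the GenBank hits.
query = "LLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECP"
midline = "LLAF KL GDNY T KSNLNTILV+DD RFVLTEECP"
print(round(percent_identity(query, midline), 2))
```

For this alignment, 31 of 37 columns are identical, which gives the 83.78 listed for KAA0035445.1.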

Sequences
CDS sequence
ATGGTCGAAAAAAGTTTAGGGAACGACCGACCACTTGTTAAATTGGTCGGCATAAATGGAAAAGCCCGACCATTCTTTAAAATGGTCGGCATAAGCGAACTAAATTATCT
AATTAGGAGATTGACTAACGTCGTCATATCATTTTTCCCTTCTCTCAGTCGCCACGTCTCCCCGCATCACCACATATCCCTTTGTTGTCAATTGAAGTCATCTTCAAACT
CTTCAGTTTCTCTCCTTTCCAATAACTGCAATTTGGTTTCTGTTTGTCTGGATACAACCAATTATGTGCTTTGGCGCTATCAGATTTCACCTCTCCTCAAGTCGCACAAG
TTGTTCAAATATGCCGATGGATCGTTTAAAGCCCATGATCCGATCATTCGATTTGATGGGTCTCATAATCTGTCCTTTGATGAACTTCATGTTCTAATGAAGACTGGGGA
GAATGCGCTCGATAAACGGGCCAAGATTGATGAGGTTGCTTCTGTTTCGCATCTAGCCATGGCAGCTAATCTTGAATCTCAAGGTCGAGGGAACTGGAAACATAATGGAA
GAGTGAGAGGTCGAGTCGATAATAATAATAGATCCAGTGGGCGTGGGAGTTGTCGACGACGACCCCTCGGGGCTTGTCTTGTGGCACCGGTGGTTACAAACAATTCACTA
ATAGAATTCATTCTTTTCAAAGATGTCGAACTCTTTTATCCATTACTCGCTTTCAATAAACTCGGTGGCGACAATTATGGAACCCGGAAATCAAACTTGAATACGATTCT
TGTTCTTGATGATCCGAGGTTCGTCTTAACGGAGGAATGTCCTCCCCCCTACTCGACAAAGAAACCGAATTGTTCGGGATGCTTATGA
mRNA sequence
ATGGTCGAAAAAAGTTTAGGGAACGACCGACCACTTGTTAAATTGGTCGGCATAAATGGAAAAGCCCGACCATTCTTTAAAATGGTCGGCATAAGCGAACTAAATTATCT
AATTAGGAGATTGACTAACGTCGTCATATCATTTTTCCCTTCTCTCAGTCGCCACGTCTCCCCGCATCACCACATATCCCTTTGTTGTCAATTGAAGTCATCTTCAAACT
CTTCAGTTTCTCTCCTTTCCAATAACTGCAATTTGGTTTCTGTTTGTCTGGATACAACCAATTATGTGCTTTGGCGCTATCAGATTTCACCTCTCCTCAAGTCGCACAAG
TTGTTCAAATATGCCGATGGATCGTTTAAAGCCCATGATCCGATCATTCGATTTGATGGGTCTCATAATCTGTCCTTTGATGAACTTCATGTTCTAATGAAGACTGGGGA
GAATGCGCTCGATAAACGGGCCAAGATTGATGAGGTTGCTTCTGTTTCGCATCTAGCCATGGCAGCTAATCTTGAATCTCAAGGTCGAGGGAACTGGAAACATAATGGAA
GAGTGAGAGGTCGAGTCGATAATAATAATAGATCCAGTGGGCGTGGGAGTTGTCGACGACGACCCCTCGGGGCTTGTCTTGTGGCACCGGTGGTTACAAACAATTCACTA
ATAGAATTCATTCTTTTCAAAGATGTCGAACTCTTTTATCCATTACTCGCTTTCAATAAACTCGGTGGCGACAATTATGGAACCCGGAAATCAAACTTGAATACGATTCT
TGTTCTTGATGATCCGAGGTTCGTCTTAACGGAGGAATGTCCTCCCCCCTACTCGACAAAGAAACCGAATTGTTCGGGATGCTTATGA
Protein sequence
MVEKSLGNDRPLVKLVGINGKARPFFKMVGISELNYLIRRLTNVVISFFPSLSRHVSPHHHISLCCQLKSSSNSSVSLLSNNCNLVSVCLDTTNYVLWRYQISPLLKSHK
LFKYADGSFKAHDPIIRFDGSHNLSFDELHVLMKTGENALDKRAKIDEVASVSHLAMAANLESQGRGNWKHNGRVRGRVDNNNRSSGRGSCRRRPLGACLVAPVVTNNSL
IEFILFKDVELFYPLLAFNKLGGDNYGTRKSNLNTILVLDDPRFVLTEECPPPYSTKKPNCSGCL