; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000785 (gene) of Snake gourd v1 genome

Gene IDTan0000785
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG07:60734463..60734723
RNA-Seq ExpressionTan0000785
SyntenyTan0000785
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-2068Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        M+TA+EIM SLQ MFGQ    + +D +KY+YN+RM EGASVREH+L+MM HFNVAE+NGAVI+E SQV+FI+ESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-2068Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        M+TA+EIM SLQ MFGQ    + +D +KY+YN+RM EGASVREH+L+MM HFNVAE+NGAVI+E SQV+FI+ESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-2068Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        M+TA+EIM SLQ MFGQ    + +D +KY+YN+RM EGASVREH+L+MM HFNVAE+NGAVI+E SQV+FI+ESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

XP_038876370.1 uncharacterized protein LOC120068812, partial [Benincasa hispida]1.4e-2173.33Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        MVTAKEIM SLQA+FGQ  SS  +D +K+VYN RMKEG +VREH+LDMM HFN+ EVN AV+NEKSQV FIMESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

XP_038880476.1 uncharacterized protein LOC120072136 [Benincasa hispida]4.2e-2172Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        MV AKEIM SLQA+FGQ  SS  +D +KYVYN RMKEG +VREH+LDMM HFN+ EVNGAV+NEK+Q  FIMESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.0e-2068Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        M+TA+EIM SLQ MFGQ    + +D +KY+YN+RM EGASVREH+L+MM HFNVAE+NGAVI+E SQV+FI+ESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

A0A5A7TU93 Gag/pol protein1.0e-2068Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        M+TA+EIM SLQ MFGQ    + +D +KY+YN+RM EGASVREH+L+MM HFNVAE+NGAVI+E SQV+FI+ESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

A0A5A7TWB9 Gag/pol protein1.0e-2068Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        M+TA+EIM SLQ MFGQ    + +D +KY+YN+RM EGASVREH+L+MM HFNVAE+NGAVI+E SQV+FI+ESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

A0A5A7V4M1 Gag/pol protein1.0e-2068Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        M+TA+EIM SLQ MFGQ    + +D +KY+YN+RM EGASVREH+L+MM HFNVAE+NGAVI+E SQV+FI+ESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

A0A5D3CPJ6 Gag/pol protein1.0e-2068Show/hide
Query:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL
        M+TA+EIM SLQ MFGQ    + +D +KY+YN+RM EGASVREH+L+MM HFNVAE+NGAVI+E SQV+FI+ESL
Subjt:  MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCACCGCAAAGGAGATCATGGGATCATTACAAGCGATGTTTGGACAAACGTGCTCATCGGTCCATTATGATACTGTCAAATACGTTTACAACTCCCGTATGAAGGA
GGGAGCCTCTGTTAGGGAACATATCCTTGACATGATGACCCACTTCAACGTGGCTGAAGTAAATGGGGCAGTCATAAATGAGAAAAGTCAGGTAACCTTTATTATGGAAT
CTCTTAGAAGAGTTTCCTGCCATTCCGCACAAATGCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCACCGCAAAGGAGATCATGGGATCATTACAAGCGATGTTTGGACAAACGTGCTCATCGGTCCATTATGATACTGTCAAATACGTTTACAACTCCCGTATGAAGGA
GGGAGCCTCTGTTAGGGAACATATCCTTGACATGATGACCCACTTCAACGTGGCTGAAGTAAATGGGGCAGTCATAAATGAGAAAAGTCAGGTAACCTTTATTATGGAAT
CTCTTAGAAGAGTTTCCTGCCATTCCGCACAAATGCGATGA
Protein sequenceShow/hide protein sequence
MVTAKEIMGSLQAMFGQTCSSVHYDTVKYVYNSRMKEGASVREHILDMMTHFNVAEVNGAVINEKSQVTFIMESLRRVSCHSAQMR