; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009746 (gene) of Snake gourd v1 genome

Gene IDTan0009746
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG05:38637050..38637271
RNA-Seq ExpressionTan0009746
SyntenyTan0009746
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-2278.08Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQEMFGQ S QI+H++LKY+Y +RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-2278.08Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQEMFGQ S QI+H++LKY+Y +RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

KAA0050670.1 gag/pol protein [Cucumis melo var. makuwa]5.4e-2278.08Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQEMFGQ S QI+H++L Y+Y +RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLL+SFL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

XP_022155999.1 uncharacterized protein LOC111022974 [Momordica charantia]9.3e-2278.08Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SL++MFGQPS Q RHE+LK++Y SRM EG+S++EHVL+LMVHFNVAEMNGAVIDE SQVSFILESL KSFL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]7.1e-2280.82Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQ MFGQPS Q RHE+LK+VY SRM EGSSVREHVL+LMVHFNVAE NG VIDEQSQ SFILESL K+FL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.5e-2278.08Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQEMFGQ S QI+H++LKY+Y +RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

A0A5A7U676 Gag/pol protein2.6e-2278.08Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQEMFGQ S QI+H++L Y+Y +RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLL+SFL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

A0A5A7V4M1 Gag/pol protein4.5e-2278.08Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQEMFGQ S QI+H++LKY+Y +RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

A0A5D3CPJ6 Gag/pol protein4.5e-2278.08Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQEMFGQ S QI+H++LKY+Y +RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

A0A6J1DWL0 uncharacterized protein LOC1110247343.4e-2280.82Show/hide
Query:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL
        M SLQ MFGQPS Q RHE+LK+VY SRM EGSSVREHVL+LMVHFNVAE NG VIDEQSQ SFILESL K+FL
Subjt:  MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCGCTGCAGGAAATGTTTGGACAACCGTCTGGACAGATTCGACACGAATCCCTCAAATACGTTTATAAGTCCCGTATGATGGAGGGGTCATCGGTGAGAGAACA
CGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAGATGAATGGAGCGGTCATTGACGAGCAAAGTCAGGTATCGTTCATCTTGGAATCTCTTCTGAAGAGTTTCCTGT
AA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCGCTGCAGGAAATGTTTGGACAACCGTCTGGACAGATTCGACACGAATCCCTCAAATACGTTTATAAGTCCCGTATGATGGAGGGGTCATCGGTGAGAGAACA
CGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAGATGAATGGAGCGGTCATTGACGAGCAAAGTCAGGTATCGTTCATCTTGGAATCTCTTCTGAAGAGTTTCCTGT
AA
Protein sequenceShow/hide protein sequence
MSSLQEMFGQPSGQIRHESLKYVYKSRMMEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLLKSFL