; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022514 (gene) of Snake gourd v1 genome

Gene IDTan0022514
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG10:31175470..31175691
RNA-Seq ExpressionTan0022514
SyntenyTan0022514
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-2275.34Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ MFGQ S Q++H++LKY+YN+RMNEG+SVREHVL++MVHFNVAEMN AVID+ SQVSFILESLP+SFL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-2275.34Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ MFGQ S Q++H++LKY+YN+RMNEG+SVREHVL++MVHFNVAEMN AVID+ SQVSFILESLP+SFL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-2275.34Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ MFGQ S Q++H++LKY+YN+RMNEG+SVREHVL++MVHFNVAEMN AVID+ SQVSFILESLP+SFL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]7.1e-2275.34Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ MFGQ S Q++H++LKY+YN+RMNEG+SVREHVL++MVHFNVAEMN AVID+ SQVSFILESLP+SFL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]4.2e-2279.45Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ+MFGQPS Q RHE+LK+VYNSRM EGSSVREHVL+LMVHFNVAE N  VID+QSQ SFILESLPK+FL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.4e-2275.34Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ MFGQ S Q++H++LKY+YN+RMNEG+SVREHVL++MVHFNVAEMN AVID+ SQVSFILESLP+SFL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

A0A5A7TU93 Gag/pol protein3.4e-2275.34Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ MFGQ S Q++H++LKY+YN+RMNEG+SVREHVL++MVHFNVAEMN AVID+ SQVSFILESLP+SFL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

A0A5A7V4M1 Gag/pol protein3.4e-2275.34Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ MFGQ S Q++H++LKY+YN+RMNEG+SVREHVL++MVHFNVAEMN AVID+ SQVSFILESLP+SFL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

A0A5D3CPJ6 Gag/pol protein3.4e-2275.34Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ MFGQ S Q++H++LKY+YN+RMNEG+SVREHVL++MVHFNVAEMN AVID+ SQVSFILESLP+SFL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

A0A6J1DWL0 uncharacterized protein LOC1110247342.0e-2279.45Show/hide
Query:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL
        M  LQ+MFGQPS Q RHE+LK+VYNSRM EGSSVREHVL+LMVHFNVAE N  VID+QSQ SFILESLPK+FL
Subjt:  MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTGTTGCAGAACATGTTTGGACAACCGTCTGGACAGCTTCGACACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAATGAGGGGTCGTCGGTGAGAGAACA
CGTTCTCGATCTGATGGTCCACTTTAACGTGGCTGAAATGAACGACGCAGTCATAGATAAGCAAAGTCAGGTGTCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCTTGT
AA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTGTTGCAGAACATGTTTGGACAACCGTCTGGACAGCTTCGACACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAATGAGGGGTCGTCGGTGAGAGAACA
CGTTCTCGATCTGATGGTCCACTTTAACGTGGCTGAAATGAACGACGCAGTCATAGATAAGCAAAGTCAGGTGTCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCTTGT
AA
Protein sequenceShow/hide protein sequence
MSLLQNMFGQPSGQLRHESLKYVYNSRMNEGSSVREHVLDLMVHFNVAEMNDAVIDKQSQVSFILESLPKSFL