CuGenDBv2

Tan0009972 (gene) of Snake gourd v1 genome

Gene ID: Tan0009972
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG09:26490473..26490949
RNA-Seq Expression: Tan0009972
Synteny: Tan0009972
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008234 - cysteine-type peptidase activity (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits    e value    % identity
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]    2.8e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

TrEMBL top hits    e value    % identity
A0A5A7SMH8 Gag/pol protein    1.3e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

A0A5A7TU93 Gag/pol protein    1.3e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

A0A5A7TWB9 Gag/pol protein    1.3e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

A0A5A7V4M1 Gag/pol protein    1.3e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

A0A5D3CPJ6 Gag/pol protein    1.3e-55    73.89
Query:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI
        Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI
Subjt:  YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI

Query:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
        LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F
Subjt:  LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF

SwissProt top hits    e value    % identity
No hits found
Arabidopsis top hits    e value    % identity
No hits found
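
The % identity reported for the GenBank and TrEMBL hits can be checked against the alignment itself. The sketch below is illustrative only and is not part of the CuGenDB record: it takes the query line and the match line (the middle line of each alignment block, where a letter marks an identical residue, "+" a conservative substitution and a space a mismatch) from the first GenBank hit, and assumes an ungapped alignment. The function name percent_identity is a hypothetical helper.

```python
# Query and match (middle) lines copied from the first GenBank alignment,
# with the two alignment blocks joined end to end.
QUERY = (
    "YDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFI"
    "LESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF"
)
MATCH = (
    "Y+RW KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQ+MFGQ   Q++H++LKY+YN+RM EG+SVREHVL++MVH NVAEMN AVIDE SQVSFI"
    "LESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+SLMK KGQ  GEAN+   +R F"
)


def percent_identity(query: str, match: str) -> tuple[int, int, float]:
    """Return (identical residues, alignment length, percent identity).

    Assumption: the alignment is ungapped, so every query character is one
    alignment column and the match line has exactly the same length.
    """
    if len(query) != len(match):
        raise ValueError("query and match lines must have the same length")
    # A letter on the match line means query and subject share that residue.
    identities = sum(1 for symbol in match if symbol.isalpha())
    length = len(query)
    return identities, length, 100.0 * identities / length


identities, length, pct = percent_identity(QUERY, MATCH)
print(f"{identities}/{length} identical = {pct:.2f}%")
```

Under these assumptions the count works out to 116 identical residues over 157 aligned positions, which agrees with the 73.89 shown in the hit tables.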

Sequences
CDS sequence
ATGTATGACCGTTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTAGCCAAAAAGTACGAGGGCATGGTCTCAGCTCGTGAGAT
CATGAGTTCGTTGCAAGATATGTTTGGACAACCGTTTGGACAGCTTCGGCACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCGTCGGTGAGAGAAC
ATGTTCTCGATCTTATGGTCCACTTGAACGTGGCTGAAATGAACGACGCGGTCATTGACGAGCAAAGTCAGGTCTCGTTCATCCTGGAATCTCTTCCGAAGAGCTTCCTG
CAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACTTGACTACTCTCCTTAATGAACTACAAACTTTTCAGTCTCTTATGAAGAATAAGGGACAGACTGATGG
AGAGGCAAATCTGTTTGCCCATTCCAGAAGTTTCTAG
mRNA sequence
ATGTATGACCGTTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTAGCCAAAAAGTACGAGGGCATGGTCTCAGCTCGTGAGAT
CATGAGTTCGTTGCAAGATATGTTTGGACAACCGTTTGGACAGCTTCGGCACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCGTCGGTGAGAGAAC
ATGTTCTCGATCTTATGGTCCACTTGAACGTGGCTGAAATGAACGACGCGGTCATTGACGAGCAAAGTCAGGTCTCGTTCATCCTGGAATCTCTTCCGAAGAGCTTCCTG
CAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACTTGACTACTCTCCTTAATGAACTACAAACTTTTCAGTCTCTTATGAAGAATAAGGGACAGACTGATGG
AGAGGCAAATCTGTTTGCCCATTCCAGAAGTTTCTAG
Protein sequence
MYDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFILESLPKSFL
QFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF
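
As a consistency check on the record, the CDS can be translated and compared with the listed protein, and the span of the genome location can be compared with the CDS length. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and inclusive 1-based coordinates; the helper names translate and span_length are hypothetical and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

# Sequences and location copied from the record.
LOCATION = "LG09:26490473..26490949"
CDS = (
    "ATGTATGACCGTTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTAGCCAAAAAGTACGAGGGCATGGTCTCAGCTCGTGAGAT"
    "CATGAGTTCGTTGCAAGATATGTTTGGACAACCGTTTGGACAGCTTCGGCACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCGTCGGTGAGAGAAC"
    "ATGTTCTCGATCTTATGGTCCACTTGAACGTGGCTGAAATGAACGACGCGGTCATTGACGAGCAAAGTCAGGTCTCGTTCATCCTGGAATCTCTTCCGAAGAGCTTCCTG"
    "CAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACTTGACTACTCTCCTTAATGAACTACAAACTTTTCAGTCTCTTATGAAGAATAAGGGACAGACTGATGG"
    "AGAGGCAAATCTGTTTGCCCATTCCAGAAGTTTCTAG"
)
PROTEIN = (
    "MYDRWIKANDKAKVYILASVSEVLAKKYEGMVSAREIMSSLQDMFGQPFGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHLNVAEMNDAVIDEQSQVSFILESLPKSFL"
    "QFRSNAVMNKIEYNLTTLLNELQTFQSLMKNKGQTDGEANLFAHSRSF"
)


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def span_length(location: str) -> int:
    """Length of a 'chrom:start..end' span, assuming inclusive coordinates."""
    _, coords = location.split(":")
    start, end = coords.split("..")
    return int(end) - int(start) + 1


print(translate(CDS) == PROTEIN + "*")   # CDS encodes the listed protein plus a stop
print(span_length(LOCATION), len(CDS))   # genomic span versus CDS length
```

With these inputs the translation reproduces the 158-residue protein followed by a stop codon, and the 477 bp genomic span equals the CDS length, which is consistent with a single-exon gene model whose mRNA and CDS sequences are identical.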