; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008525 (gene) of Snake gourd v1 genome

Gene IDTan0008525
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:23697393..23697983
RNA-Seq ExpressionTan0008525
SyntenyTan0008525
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]2.5e-6966.84Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        M+ SI+ LL S  L G+N++ WKSNLNTILVVDDL+FVLTEECPQ P  N  ++V+EAYDRW+KANDKA+VYILAS+++VLAKKH+ + +A+GIM SL++
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQPS  LRH+++K++Y  RMKEG+SVREHVLD+++HFN+AE+N   IDE +QVSFIL+SLPKSF+ F++NA +NKIE+NLTTLLNELQ FQ
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]5.5e-6967.36Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        M+++ + +L ++ L G N+ +WK+ +NT+L++DDL+FVL EECPQVP  N  ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AR IM SLQ+
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQ S Q++H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMN AVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-6967.88Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        M+++ + +L ++ L G N+  WK+ +NT+L++DDL+FVL EECPQVP  N  Q+V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AR IM SLQ+
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQ S Q++H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMN AVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

KAA0063887.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-6970.47Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        MS+SIIALLK + LTGEN+ TWKS LN ILV+ DL+FVL EECP  PT+N  QSVK+AYD W KANDKA +Y+LAS+S++L+KKHE MV+AR IM SL++
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQPS Q++ +++KYVYN+RMKEG SVREHVL +IV+FNVAEMN A+ DE+SQVS+IL+SL KSFLQF SN  MNKIEYN+TTLL ELQTFQ
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

TYK15919.1 gag/pol protein [Cucumis melo var. makuwa]4.2e-6970.47Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        MS+SIIALLK + LTGEN+ TWKS LN ILV+ DL+FVL EECP  PT+N  QSVK+AYD W KANDKA +Y+LAS+S++L+KKHE MV+AR IM SL++
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQPS Q++ +++KYVYN+RMKEG SVREHVL +IV+FNVAEMN A+ DE+SQVS+IL+SL KSFLQF SN  MNKIEYN+TTLL ELQTFQ
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.7e-6967.36Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        M+++ + +L ++ L G N+ +WK+ +NT+L++DDL+FVL EECPQVP  N  ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AR IM SLQ+
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQ S Q++H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMN AVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

A0A5A7V6N0 Gag/pol protein1.2e-6967.88Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        M+++ + +L ++ L G N+  WK+ +NT+L++DDL+FVL EECPQVP  N  Q+V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AR IM SLQ+
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQ S Q++H +LKY+YN+RM EG+SVREHVL+++VHFNVAEMN AVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y LTTLLNELQTF+
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

A0A5A7VA67 Gag/pol protein2.0e-6970.47Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        MS+SIIALLK + LTGEN+ TWKS LN ILV+ DL+FVL EECP  PT+N  QSVK+AYD W KANDKA +Y+LAS+S++L+KKHE MV+AR IM SL++
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQPS Q++ +++KYVYN+RMKEG SVREHVL +IV+FNVAEMN A+ DE+SQVS+IL+SL KSFLQF SN  MNKIEYN+TTLL ELQTFQ
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

A0A5D3D0D9 Gag/pol protein2.0e-6970.47Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        MS+SIIALLK + LTGEN+ TWKS LN ILV+ DL+FVL EECP  PT+N  QSVK+AYD W KANDKA +Y+LAS+S++L+KKHE MV+AR IM SL++
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQPS Q++ +++KYVYN+RMKEG SVREHVL +IV+FNVAEMN A+ DE+SQVS+IL+SL KSFLQF SN  MNKIEYN+TTLL ELQTFQ
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

E2GK51 Gag/pol protein (Fragment)1.2e-6966.84Show/hide
Query:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD
        M+ SI+ LL S  L G+N++ WKSNLNTILVVDDL+FVLTEECPQ P  N  ++V+EAYDRW+KANDKA+VYILAS+++VLAKKH+ + +A+GIM SL++
Subjt:  MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQD

Query:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ
        MFGQPS  LRH+++K++Y  RMKEG+SVREHVLD+++HFN+AE+N   IDE +QVSFIL+SLPKSF+ F++NA +NKIE+NLTTLLNELQ FQ
Subjt:  MFGQPSGQLRHKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGCCTCAATTATAGCCTTACTGAAAAGCAATCATTTAACTGGTGAAAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTTGTTGTTGACGACCTTCAGTT
TGTACTGACTGAGGAATGTCCTCAGGTCCCTACTCGAAACCCTCCTCAATCTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTATATTT
TGGCTAGTGTTTCTGAAGTTCTAGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGGGATCATGAGTTCGTTGCAGGATATGTTTGGACAACCGTCTGGACAGCTTCGA
CACAAATCCCTCAAATATGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACATGTTCTCGATCTGATTGTCCATTTCAACGTGGCTGAGATGAACGACGC
AGTCATTGACGAGCAAAGTCAGGTCTCGTTCATCCTAGAATCTCTTCCGAAGAGTTTCCTACAATTCCGCAGTAATGCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCTCCTTAATGAACTACAAACTTTCCAAGGGACAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGGCCTCAATTATAGCCTTACTGAAAAGCAATCATTTAACTGGTGAAAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTTGTTGTTGACGACCTTCAGTT
TGTACTGACTGAGGAATGTCCTCAGGTCCCTACTCGAAACCCTCCTCAATCTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTATATTT
TGGCTAGTGTTTCTGAAGTTCTAGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGGGATCATGAGTTCGTTGCAGGATATGTTTGGACAACCGTCTGGACAGCTTCGA
CACAAATCCCTCAAATATGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACATGTTCTCGATCTGATTGTCCATTTCAACGTGGCTGAGATGAACGACGC
AGTCATTGACGAGCAAAGTCAGGTCTCGTTCATCCTAGAATCTCTTCCGAAGAGTTTCCTACAATTCCGCAGTAATGCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCTCCTTAATGAACTACAAACTTTCCAAGGGACAGGCTGA
Protein sequenceShow/hide protein sequence
MSASIIALLKSNHLTGENFTTWKSNLNTILVVDDLQFVLTEECPQVPTRNPPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSARGIMSSLQDMFGQPSGQLR
HKSLKYVYNSRMKEGSSVREHVLDLIVHFNVAEMNDAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNLTTLLNELQTFQGTG