CuGenDBv2

Tan0003185 (gene) of Snake gourd v1 genome

Gene ID: Tan0003185
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG11:33475034..33475630
RNA-Seq Expression: Tan0003185
Synteny: Tan0003185
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
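The genome location above is a `seqid:start..end` string. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (an assumption, not something this record states), the span length can be computed as follows. The helper names `parse_location` and `span_length` are hypothetical and introduced only for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming inclusive 1-based coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG11:33475034..33475630")
length = span_length(start, end)
# Report the linkage group, the span length, and whether it is a whole number of codons.
print(seqid, length, length % 3 == 0)
```

Under that assumption the span is 597 bp, which is a multiple of three.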


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.1e-72; % identity: 69.54)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY +WK+ +NT+L++DD RFVL EECPQVP  NA + V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.1e-72; % identity: 69.54)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY +WK+ +NT+L++DD RFVL EECPQVP  NA + V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.1e-72; % identity: 69.54)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY +WK+ +NT+L++DD RFVL EECPQVP  NA + V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 4.8e-73; % identity: 70.05)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY  WK+ +NT+L++DD RFVL EECPQVP  NA Q V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.1e-72; % identity: 69.54)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY +WK+ +NT+L++DD RFVL EECPQVP  NA + V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein (e value: 5.2e-73; % identity: 69.54)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY +WK+ +NT+L++DD RFVL EECPQVP  NA + V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

A0A5A7TU93 Gag/pol protein (e value: 5.2e-73; % identity: 69.54)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY +WK+ +NT+L++DD RFVL EECPQVP  NA + V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

A0A5A7TWB9 Gag/pol protein (e value: 5.2e-73; % identity: 69.54)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY +WK+ +NT+L++DD RFVL EECPQVP  NA + V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

A0A5A7V6N0 Gag/pol protein (e value: 2.3e-73; % identity: 70.05)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY  WK+ +NT+L++DD RFVL EECPQVP  NA Q V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

A0A5D3CPJ6 Gag/pol protein (e value: 5.2e-73; % identity: 69.54)
Query:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN
        M+S+ + +L   +L G NY +WK+ +NT+L++DD RFVL EECPQVP  NA + V++ Y+RW KAN KA+ YILAS+SEVLAKKHE M++AREIM SLQ 
Subjt:  MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR NA+MNKI Y LTTLLNELQTF+SLMK
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMK

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCGTCCTCAATTATAGCCTTACTCAAAAGAAAACGTTTAACTGGCGAAAACTATACTACGTGGAAGTCCAACCTAAATACGATTCTTGTTGTTGACGACTTTCGGTT
TGTACTAACTGAGGAATGTCCTCAGGTCCCTACTCGAAACGCTTTTCAACCTGTTAAGGATGCATACGACCGCTGGATCAAGGCCAATAATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTGCAGAACATGTTTGGACAACCGTCTGGACAGCTTCGA
CACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAAATGAATGGCGC
GGTCATAGACGAGCAAAGTCAGGTGTCGTTCATCCTGGAATCTCTTCCGAAGAGTTTTCTGCAATTCCGCAGGAATGCGATGATGAACAAGATAGAGTACAACCTGACTA
CTCTCCTTAATGAACTGCAGACTTTCCAGTCTCTTATGAAAAATTAG
mRNA sequence
ATGTCGTCCTCAATTATAGCCTTACTCAAAAGAAAACGTTTAACTGGCGAAAACTATACTACGTGGAAGTCCAACCTAAATACGATTCTTGTTGTTGACGACTTTCGGTT
TGTACTAACTGAGGAATGTCCTCAGGTCCCTACTCGAAACGCTTTTCAACCTGTTAAGGATGCATACGACCGCTGGATCAAGGCCAATAATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTGCAGAACATGTTTGGACAACCGTCTGGACAGCTTCGA
CACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAAATGAATGGCGC
GGTCATAGACGAGCAAAGTCAGGTGTCGTTCATCCTGGAATCTCTTCCGAAGAGTTTTCTGCAATTCCGCAGGAATGCGATGATGAACAAGATAGAGTACAACCTGACTA
CTCTCCTTAATGAACTGCAGACTTTCCAGTCTCTTATGAAAAATTAG
Protein sequence
MSSSIIALLKRKRLTGENYTTWKSNLNTILVVDDFRFVLTEECPQVPTRNAFQPVKDAYDRWIKANNKAKVYILASVSEVLAKKHEGMVSAREIMSSLQNMFGQPSGQLR
HESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRRNAMMNKIEYNLTTLLNELQTFQSLMKN