; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000740 (gene) of Snake gourd v1 genome

Gene IDTan0000740
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG04:52528708..52529277
RNA-Seq ExpressionTan0000740
SyntenyTan0000740
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-6666.67Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY +WK+ +NT+L++DDLRFVL EECPQVPA NA ++V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-6666.67Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY +WK+ +NT+L++DDLRFVL EECPQVPA NA ++V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-6666.67Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY +WK+ +NT+L++DDLRFVL EECPQVPA NA ++V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]6.5e-6767.2Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY  WK+ +NT+L++DDLRFVL EECPQVPA NA Q+V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-6666.67Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY +WK+ +NT+L++DDLRFVL EECPQVPA NA ++V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein7.0e-6766.67Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY +WK+ +NT+L++DDLRFVL EECPQVPA NA ++V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

A0A5A7TU93 Gag/pol protein7.0e-6766.67Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY +WK+ +NT+L++DDLRFVL EECPQVPA NA ++V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

A0A5A7TWB9 Gag/pol protein7.0e-6766.67Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY +WK+ +NT+L++DDLRFVL EECPQVPA NA ++V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

A0A5A7V6N0 Gag/pol protein3.1e-6767.2Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY  WK+ +NT+L++DDLRFVL EECPQVPA NA Q+V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

A0A5D3CPJ6 Gag/pol protein7.0e-6766.67Show/hide
Query:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN
        M+S+ + +L +++L   NY +WK+ +NT+L++DDLRFVL EECPQVPA NA ++V++ Y+RW KAN+KA+ YILAS+SEVL KKHE M++AREIM S Q 
Subjt:  MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQN

Query:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        MFGQ S +++H++LKY+YN+ M EG+SVREHVL++MV FNVAEMNGAVIDE SQV FIL+SLP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRKLRHESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCAATTATAGCCTTACTCAAAAGTGAACGTTTAACTGACGAAAACTATACTACGTGGAAGTCCAACCTGAATACGATTCTTGTTGTTGACGACCTTCGGTT
TGTACTGACTGAGGAATGTCCTCAGGTCCCTGCTCGAAACGCTCCTCAATCTGTTAAGGATGCGTACGACCGTTGGATCAAGGCCAATGATAAGGCTAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGACCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTTCAAAACATGTTTGGACAACCGTCTAGAAAACTTCGG
CACGAATCCCTCAAGTACGTTTATAACTCATGTATGAAGGAGGGGTCGTCGGTGAGAGAACACGTTCTCGATCTGATGGTCCCCTTCAACGTGGCTGAAATGAATGGCGC
GGTCATAGACGAGCAAAGTCAGGTCTTGTTCATCCTGAAATCTCTTCCGAAAAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTACAACCTGACTA
CTCTCCTCAATGAACTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCAATTATAGCCTTACTCAAAAGTGAACGTTTAACTGACGAAAACTATACTACGTGGAAGTCCAACCTGAATACGATTCTTGTTGTTGACGACCTTCGGTT
TGTACTGACTGAGGAATGTCCTCAGGTCCCTGCTCGAAACGCTCCTCAATCTGTTAAGGATGCGTACGACCGTTGGATCAAGGCCAATGATAAGGCTAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGACCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTTCAAAACATGTTTGGACAACCGTCTAGAAAACTTCGG
CACGAATCCCTCAAGTACGTTTATAACTCATGTATGAAGGAGGGGTCGTCGGTGAGAGAACACGTTCTCGATCTGATGGTCCCCTTCAACGTGGCTGAAATGAATGGCGC
GGTCATAGACGAGCAAAGTCAGGTCTTGTTCATCCTGAAATCTCTTCCGAAAAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTACAACCTGACTA
CTCTCCTCAATGAACTGTAG
Protein sequenceShow/hide protein sequence
MSSSIIALLKSERLTDENYTTWKSNLNTILVVDDLRFVLTEECPQVPARNAPQSVKDAYDRWIKANDKAKVYILASVSEVLTKKHEGMVSAREIMSSFQNMFGQPSRKLR
HESLKYVYNSCMKEGSSVREHVLDLMVPFNVAEMNGAVIDEQSQVLFILKSLPKSFLQFRSNAVMNKIEYNLTTLLNEL