CuGenDBv2

Tan0005394 (gene) of Snake gourd v1 genome

Gene ID: Tan0005394
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG02:45242943..45243524
RNA-Seq Expression: Tan0005394
Synteny: Tan0005394
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
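
As a rough illustration of how the genome location field can be read, the sketch below parses the `LG02:45242943..45243524` string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("LG02:45242943..45243524")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(seqid, span_length)  # prints: LG02 582
```

Under that assumption the locus spans 582 bp, which would correspond to 194 codons (193 residues plus a stop codon).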


Homology
GenBank top hits | e value | % identity | Alignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | 1.6e-68 | 68.39
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL EECPQ+PA NA + ++E ++  W KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQ S QI++++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVI+E SQVSFILESL +SFLQFR+NAVMNKI Y LTTLLNELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa] | 7.0e-69 | 68.91
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        M+S+ + +L +D+L G N+  WK+ +NT+L++DDLRFVL EECPQ+PA NA Q ++E ++  W KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQ S QI++++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVI+E SQVSFILESL +SFLQFR+NAVMNKI Y LTTLLNELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

KAA0063887.1 gag/pol protein [Cucumis melo var. makuwa] | 5.4e-69 | 70.98
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        MSSSIIALLK D+LTGEN+ TWKS LN ILV+ DLRFVL EECP  P +NA Q +K+A+DH W KANDKA +Y+LAS+S++L+KKHE MV+AR+IM SL+
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQPS QI+ E++KYVYN+RMKEG SVREHVL ++V+FNVAEMN A+ +E+SQVS+IL+SLSKSFLQF +N  MNKIEYN+TTLL ELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | 1.6e-68 | 68.39
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL EECPQ+PA NA + ++E ++  W KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQ S QI++++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVI+E SQVSFILESL +SFLQFR+NAVMNKI Y LTTLLNELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

TYK15919.1 gag/pol protein [Cucumis melo var. makuwa] | 5.4e-69 | 70.98
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        MSSSIIALLK D+LTGEN+ TWKS LN ILV+ DLRFVL EECP  P +NA Q +K+A+DH W KANDKA +Y+LAS+S++L+KKHE MV+AR+IM SL+
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQPS QI+ E++KYVYN+RMKEG SVREHVL ++V+FNVAEMN A+ +E+SQVS+IL+SLSKSFLQF +N  MNKIEYN+TTLL ELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SMH8 Gag/pol protein | 7.6e-69 | 68.39
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL EECPQ+PA NA + ++E ++  W KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQ S QI++++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVI+E SQVSFILESL +SFLQFR+NAVMNKI Y LTTLLNELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

A0A5A7V6N0 Gag/pol protein | 3.4e-69 | 68.91
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        M+S+ + +L +D+L G N+  WK+ +NT+L++DDLRFVL EECPQ+PA NA Q ++E ++  W KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQ S QI++++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVI+E SQVSFILESL +SFLQFR+NAVMNKI Y LTTLLNELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

A0A5A7VA67 Gag/pol protein | 2.6e-69 | 70.98
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        MSSSIIALLK D+LTGEN+ TWKS LN ILV+ DLRFVL EECP  P +NA Q +K+A+DH W KANDKA +Y+LAS+S++L+KKHE MV+AR+IM SL+
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQPS QI+ E++KYVYN+RMKEG SVREHVL ++V+FNVAEMN A+ +E+SQVS+IL+SLSKSFLQF +N  MNKIEYN+TTLL ELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

A0A5D3CPJ6 Gag/pol protein | 7.6e-69 | 68.39
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL EECPQ+PA NA + ++E ++  W KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQ
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQ S QI++++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVI+E SQVSFILESL +SFLQFR+NAVMNKI Y LTTLLNELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

A0A5D3D0D9 Gag/pol protein | 2.6e-69 | 70.98
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ
        MSSSIIALLK D+LTGEN+ TWKS LN ILV+ DLRFVL EECP  P +NA Q +K+A+DH W KANDKA +Y+LAS+S++L+KKHE MV+AR+IM SL+
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQ

Query:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
        EMFGQPS QI+ E++KYVYN+RMKEG SVREHVL ++V+FNVAEMN A+ +E+SQVS+IL+SLSKSFLQF +N  MNKIEYN+TTLL ELQTF
Subjt:  EMFGQPSGQIRNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences

CDS sequence
ATGTCGTCCTCTATCATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAATCCAACCTGAATACGATTCTCGTTGTTGACGACCTACGGTT
TGTACTGTCTGAGGAATGTCCACAAATCCCTGCTCGTAATGCTCCTCAATGCATTAAGGAGGCGCACGATCACATTTGGATCAAGGCCAATGATAAGGCCAAGGTCTACA
TTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGAAATGTTTGGACAACCGTCTGGACAGATT
CGAAACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCCGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGG
AGCGGTCATTGAGGAGCAAAGTCAGGTATCGTTCATCCTTGAGTCTCTTTCGAAGAGTTTTCTGCAATTCCGCAACAATGCGGTGATGAACAAGATAGAGTATAACCTGA
CTACTCTCCTTAATGAACTACAAACTTTCTAG

mRNA sequence
ATGTCGTCCTCTATCATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAATCCAACCTGAATACGATTCTCGTTGTTGACGACCTACGGTT
TGTACTGTCTGAGGAATGTCCACAAATCCCTGCTCGTAATGCTCCTCAATGCATTAAGGAGGCGCACGATCACATTTGGATCAAGGCCAATGATAAGGCCAAGGTCTACA
TTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGAAATGTTTGGACAACCGTCTGGACAGATT
CGAAACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCCGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGG
AGCGGTCATTGAGGAGCAAAGTCAGGTATCGTTCATCCTTGAGTCTCTTTCGAAGAGTTTTCTGCAATTCCGCAACAATGCGGTGATGAACAAGATAGAGTATAACCTGA
CTACTCTCCTTAATGAACTACAAACTTTCTAG

Protein sequence
MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQCIKEAHDHIWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQEMFGQPSGQI
RNESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIEEQSQVSFILESLSKSFLQFRNNAVMNKIEYNLTTLLNELQTF
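
The CDS and protein sequences above can be cross-checked by translating the nucleotides. The sketch below is a minimal translation routine, assuming the standard genetic code (NCBI translation table 1); the `translate` function and the variable names are illustrative, not a CuGenDB API. Only the first 39 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 13 codons of the CDS sequence listed above.
cds_start = "ATGTCGTCCTCTATCATAGCCTTACTTAAAAGCGACCGT"
print(translate(cds_start))  # prints: MSSSIIALLKSDR
```

The output matches the first 13 residues of the protein sequence; passing the full CDS (with line breaks removed) to `translate` would allow the same comparison over the whole record.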