CuGenDBv2

Tan0004991 (gene) of Snake gourd v1 genome

Gene ID: Tan0004991
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG07:49398133..49398669
RNA-Seq Expression: Tan0004991
Synteny: Tan0004991
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
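The genome location above can be checked against the sequence lengths listed further down. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG07:49398133..49398669")
length = span_length(start, end)

print(seqid, start, end)   # linkage group and coordinates
print(length)              # span length in base pairs
print(length % 3 == 0)     # whether the span is a whole number of codons
```

Under that assumption the span is 537 bp, a multiple of three, which is what one would expect if the gene model is a single uninterrupted coding exon.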


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 7.4e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 7.4e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 7.4e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 7.4e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 7.4e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein (e value: 3.6e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

A0A5A7TU93 Gag/pol protein (e value: 3.6e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

A0A5A7TWB9 Gag/pol protein (e value: 3.6e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

A0A5D3CPJ6 Gag/pol protein (e value: 3.6e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

A0A5D3CSZ6 Gag/pol protein (e value: 3.6e-65; % identity: 68.54)
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK+ +NT+L++DDLRFVL E+CPQ PA NA R+V+E Y+RW KAN+KA+ YILAS+S+V+AKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQE

Query:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
        MFGQ   QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI
Subjt:  MFGQPFGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
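
The % identity column in the hit lists above is conventionally the number of identical aligned positions divided by the alignment length. As a rough sketch of that arithmetic, the middle line of a BLAST-style alignment can be scanned directly: letters mark identical residues, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical, and the input below is only a short excerpt of one midline, so its result is not the value reported for the full alignment.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    ASSUMPTION: the midline is passed at full alignment width, with
    leading and trailing spaces preserved (do not strip it).
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Short excerpt from the start of the second alignment block (11 columns).
excerpt = "MFGQ   QI+H"
print(percent_identity(excerpt))
```

Computing the figure from the midline rather than from the Query and Subjt lines avoids depending on how the subject sequence was rendered on the page.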

Sequences
CDS sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGTGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCGTTGTTGACGACCTTCGGTT
CGTACTAACTGAGAAATGTCCTCAGAATCCTGCTCGTAATGCTCCTCGATCTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCTAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTAAAGTTATGGCAAAAAAGCATGAGGGCATGGTCTCAGCTCGTGAGATAATGAGTTCACTGCAGGAAATGTTTGGACAACCGTTTGGACAGATTCGA
CACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAATGTGGCTGAGATGAACGGAGC
GGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCTGCAACAATGCGGTGATGAATAAGATATAG
mRNA sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGTGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCGTTGTTGACGACCTTCGGTT
CGTACTAACTGAGAAATGTCCTCAGAATCCTGCTCGTAATGCTCCTCGATCTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCTAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTAAAGTTATGGCAAAAAAGCATGAGGGCATGGTCTCAGCTCGTGAGATAATGAGTTCACTGCAGGAAATGTTTGGACAACCGTTTGGACAGATTCGA
CACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAATGTGGCTGAGATGAACGGAGC
GGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCTGCAACAATGCGGTGATGAATAAGATATAG
Protein sequence
MSSSIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLTEKCPQNPARNAPRSVKEAYDRWIKANDKAKVYILASVSKVMAKKHEGMVSAREIMSSLQEMFGQPFGQIR
HESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFCNNAVMNKI
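
The relationship between the CDS and the protein sequence above can be sketched with a small translation routine. This is an illustrative example only, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, not part of any CuGenDB tooling, and only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' denotes a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 12 and last 9 nucleotides of the CDS listed above.
print(translate("ATGTCGTCCTCA"))  # start of the protein
print(translate("AAGATATAG"))     # end of the protein, including the stop codon
```

The two fragments translate to `MSSS` and `KI*`, matching the first and last residues of the protein sequence, with the trailing `TAG` acting as the stop codon.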