; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007461 (gene) of Snake gourd v1 genome

Gene IDTan0007461
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:52115314..52115892
RNA-Seq ExpressionTan0007461
SyntenyTan0007461
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-6870.11Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+ +WK+ +NT+LI+DD+RFVL EECPQ+PA NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-6870.11Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+ +WK+ +NT+LI+DD+RFVL EECPQ+PA NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-6870.11Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+ +WK+ +NT+LI+DD+RFVL EECPQ+PA NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]7.0e-6970.65Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+  WK+ +NT+LI+DD+RFVL EECPQ+PA NA Q+++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.6e-6870.11Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+ +WK+ +NT+LI+DD+RFVL EECPQ+PA NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein7.6e-6970.11Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+ +WK+ +NT+LI+DD+RFVL EECPQ+PA NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

A0A5A7TU93 Gag/pol protein7.6e-6970.11Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+ +WK+ +NT+LI+DD+RFVL EECPQ+PA NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

A0A5A7TWB9 Gag/pol protein7.6e-6970.11Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+ +WK+ +NT+LI+DD+RFVL EECPQ+PA NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

A0A5A7V6N0 Gag/pol protein3.4e-6970.65Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+  WK+ +NT+LI+DD+RFVL EECPQ+PA NA Q+++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

A0A5D3CPJ6 Gag/pol protein7.6e-6970.11Show/hide
Query:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L   N+ +WK+ +NT+LI+DD+RFVL EECPQ+PA NA ++++E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT
        MFGQ S QI+H++LKYIYN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQFR+NAVMNKI Y LTT
Subjt:  MFGQPSGQIRHESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGATCGTTTAACTTATGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCATTGTTGACGACGTACGATT
TGTACTAACTGAGGAATGTCCTCAGATCCCTGCTCGTAACGCTCCTCAATCTATTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTAGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCACTCCAGGAAATGTTTGGACAACCGTCTGGACAGATTCGG
CATGAATCCCTCAAATACATTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGC
AGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCAACAATGCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCACTACTACAAAAACTGTATTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGATCGTTTAACTTATGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCATTGTTGACGACGTACGATT
TGTACTAACTGAGGAATGTCCTCAGATCCCTGCTCGTAACGCTCCTCAATCTATTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTAGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCACTCCAGGAAATGTTTGGACAACCGTCTGGACAGATTCGG
CATGAATCCCTCAAATACATTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGC
AGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCAACAATGCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCACTACTACAAAAACTGTATTTCTTGA
Protein sequenceShow/hide protein sequence
MSSSIIALLKSDRLTYENFTTWKSNLNTILIVDDVRFVLTEECPQIPARNAPQSIKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQEMFGQPSGQIR
HESLKYIYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFRNNAVMNKIEYNLTTHYYKNCIS