CuGenDBv2

Tan0008858 (gene) of Snake gourd v1 genome

Gene ID: Tan0008858
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG01:77621307..77621894
RNA-Seq Expression: Tan0008858
Synteny: Tan0008858
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.3e-66; identity 67.55%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK  +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.3e-66; identity 67.55%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK  +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.3e-66; identity 67.55%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK  +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]; e value 1.5e-66; identity 68.09%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+  WK  +NT+L++D+LRFVL EECPQ+PA NA Q+V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value 3.3e-66; identity 67.55%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK  +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value 1.6e-66; identity 67.55%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK  +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

A0A5A7TU93 Gag/pol protein; e value 1.6e-66; identity 67.55%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK  +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

A0A5A7TWB9 Gag/pol protein; e value 1.6e-66; identity 67.55%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK  +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

A0A5A7V6N0 Gag/pol protein; e value 7.2e-67; identity 68.09%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+  WK  +NT+L++D+LRFVL EECPQ+PA NA Q+V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

A0A5D3CPJ6 Gag/pol protein; e value 1.6e-66; identity 67.55%
Query:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S+ + +L +D+L G N+ +WK  +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+RW KAN+K + YILAS+SEVLAKKHE M++AREIM SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN
        +FGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNV EMNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  L +L+N
Subjt:  IFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G00980.1 zinc knuckle (CCHC-type) family protein; e value 1.6e-05; identity 21.02%
Query:  SSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDR-------WIKANDKTKVYILASVSEVLAKKH-EGMVSARE
        SS   +++K  R  G+++  W + +   L    L +VL+E CP I +   P++      R       W++ +     +++ S+S+ L +++ +    A+E
Subjt:  SSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDR-------WIKANDKTKVYILASVSEVLAKKH-EGMVSARE

Query:  IMSSLQEIFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQF
        +   L+ ++     + +   ++     RM E   + E V       + +   G  +DE   VS I+   P S+  F
Subjt:  IMSSLQEIFGQPSGQIQHESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQF


Sequences
CDS sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGACCAACTTGAATACGATTCTCGTTGTTGACAACCTTCGGTT
CGTACTAACTGAGGAATGTCCTCAGATTCCTGCTCGTAACGCTCCTCAATCTGTTAAGGAGGCATACGACCGCTGGATCAAGGCCAATGATAAGACCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCACTGCAGGAAATATTTGGACAACCATCTGGACAGATTCAG
CACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGTTGAGATGAATGGAGC
AGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAAAGTATAACCGACTAC
TCTCCTTAATGAACTACAGACATTCCAGTCTCTTATGA
mRNA sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGACCAACTTGAATACGATTCTCGTTGTTGACAACCTTCGGTT
CGTACTAACTGAGGAATGTCCTCAGATTCCTGCTCGTAACGCTCCTCAATCTGTTAAGGAGGCATACGACCGCTGGATCAAGGCCAATGATAAGACCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCACTGCAGGAAATATTTGGACAACCATCTGGACAGATTCAG
CACGAATCCCTCAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGTTGAGATGAATGGAGC
AGTCATTGACGAGCAAAGTCAGGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAAAGTATAACCGACTAC
TCTCCTTAATGAACTACAGACATTCCAGTCTCTTATGA
Protein sequence
MSSSIIALLKSDRLTGENFTTWKTNLNTILVVDNLRFVLTEECPQIPARNAPQSVKEAYDRWIKANDKTKVYILASVSEVLAKKHEGMVSAREIMSSLQEIFGQPSGQIQ
HESLKYVYNSRMKEGSSVREHVLDLMVHFNVVEMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIKYNRLLSLMNYRHSSLL
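
The sequences and coordinates on this page can be cross-checked against each other. The following is a minimal Python sketch, not part of the CuGenDB record: it assumes the standard genetic code (NCBI translation table 1) and translation in frame 0, and the helper names `translate` and `span_length` are hypothetical. It translates the first and last codons of the CDS listed above and computes the length of the genome interval.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are enumerated with bases in T, C, A, G order, third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length in bp of an inclusive 'chrom:start..end' interval."""
    start, end = location.split(":")[1].split("..")
    return int(end) - int(start) + 1


# First seven and last eight codons of the CDS shown on this page.
print(translate("ATGTCGTCCTCAATAATAGCC"))       # MSSSIIA
print(translate("TACAGACATTCCAGTCTCTTATGA"))    # YRHSSLL
print(span_length("LG01:77621307..77621894"))   # 588
```

The 588 bp interval equals the length of the CDS (five lines of 110 nt plus 38 nt), which corresponds to 195 codons plus a stop codon and agrees with the 195-residue protein sequence. Pasting the full CDS into `translate` should reproduce the protein sequence listed above.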