CuGenDBv2

Tan0011282 (gene) of Snake gourd v1 genome

Gene ID: Tan0011282
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG09:55969816..55970406
RNA-Seq Expression: Tan0011282
Synteny: Tan0011282
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
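The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, which is a common convention for genome browsers but is not stated on this page; `span_length` is a hypothetical helper name introduced here for illustration.

```python
def span_length(location: str) -> int:
    """Length in bp of a 'SEQ:start..end' location.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return abs(end - start) + 1


length = span_length("LG09:55969816..55970406")
# A span that is a multiple of three is compatible with an uninterrupted coding sequence.
print(length, length % 3 == 0)
```

Under that assumption the span is 591 bp, a multiple of three, which is compatible with a single-exon gene whose CDS and mRNA sequences are the same.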


Homology
GenBank top hits
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 3.8e-70 | identity: 67.86%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+ +WK+ +N +L++DDLRFVL EECPQ+ A NA ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
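The middle line of each alignment block follows the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how the %identity column relates to that line, the following Python counts those symbols. `midline_stats` is a hypothetical helper, and the input is only the first 19 columns of the first block above, so the printed figure describes that fragment and not the whole hit.

```python
def midline_stats(midline: str) -> tuple:
    """Return (identities, positives, columns) for a BLAST-style match line.

    Letters count as identities; '+' counts as a positive (conservative) match.
    Trailing spaces must be preserved, since every column counts toward the length.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# First 19 columns of the match line for hit KAA0044955.1 (fragment only).
fragment = "M+S+ + +L +D+L G N+"
ident, pos, cols = midline_stats(fragment)
print(f"{100 * ident / cols:.2f}% identity, {100 * pos / cols:.2f}% positives over {cols} columns")
```

Applying the same count to the full match line, across both blocks, should approximate the reported identity, provided the match line is copied with its spacing intact.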

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 3.8e-70 | identity: 67.86%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+ +WK+ +N +L++DDLRFVL EECPQ+ A NA ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 3.8e-70 | identity: 67.86%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+ +WK+ +N +L++DDLRFVL EECPQ+ A NA ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 1.7e-70 | identity: 68.37%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+  WK+ +N +L++DDLRFVL EECPQ+ A NA Q+V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e-value: 3.8e-70 | identity: 67.86%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+ +WK+ +N +L++DDLRFVL EECPQ+ A NA ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

TrEMBL top hits
A0A5A7SMH8 Gag/pol protein | e-value: 1.8e-70 | identity: 67.86%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+ +WK+ +N +L++DDLRFVL EECPQ+ A NA ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

A0A5A7TU93 Gag/pol protein | e-value: 1.8e-70 | identity: 67.86%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+ +WK+ +N +L++DDLRFVL EECPQ+ A NA ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

A0A5A7TWB9 Gag/pol protein | e-value: 1.8e-70 | identity: 67.86%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+ +WK+ +N +L++DDLRFVL EECPQ+ A NA ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

A0A5A7V6N0 Gag/pol protein | e-value: 8.2e-71 | identity: 68.37%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+  WK+ +N +L++DDLRFVL EECPQ+ A NA Q+V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

A0A5D3CPJ6 Gag/pol protein | e-value: 1.8e-70 | identity: 67.86%
Query:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE
        M+S+ + +L +D+L G N+ +WK+ +N +L++DDLRFVL EECPQ+ A NA ++V+E Y+RW KAN+KA+ YILAS+SEVLAKKHE M++AREIM+SLQE
Subjt:  MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQE

Query:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
        MFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVA+MNGAVIDE SQVSFILESLP+SFLQFRSNAVMNKI Y  TTLLNEL     L+
Subjt:  MFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL

SwissProt top hits
No hits found

Arabidopsis top hits
AT4G00980.1 zinc knuckle (CCHC-type) family protein | e-value: 7.0e-06 | identity: 21.59%
Query:  SSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDR-------WIKANDKAKVYILASVSEVLAKKH-EGMVSARE
        SS   +++K  R  G+++  W S + + L    L +VL+E CP I +   P++      R       W++ +     +++ S+S+ L +++ +    A+E
Subjt:  SSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDR-------WIKANDKAKVYILASVSEVLAKKH-EGMVSARE

Query:  IMNSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQF
        + + L+ ++     + +   ++     RM E   + E V       +     G  +DE   VS I+   P S+  F
Subjt:  IMNSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQF


Sequences
CDS sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATATGATTCTAGTTGTTGACGACCTTCGATT
CGTACTAACTGAGGAATGTCCTCAGATTCTTGCTCGTAACGCTCCTCAATCTGTTAAAGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAATTCACTGCAGGAAATGTTTGGACAACCGTCTGGACAGATTCGG
CACGAATCCCTCAAATATGTTTATAACTCTCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTAAGATGAATGGAGC
GGTCATCGACGAGCAGAGTCAGGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACTCGACTA
CTCTCCTTAATGAACTACTGAGACTTTCCGGTCTCTTATGA
mRNA sequence
ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATATGATTCTAGTTGTTGACGACCTTCGATT
CGTACTAACTGAGGAATGTCCTCAGATTCTTGCTCGTAACGCTCCTCAATCTGTTAAAGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAATTCACTGCAGGAAATGTTTGGACAACCGTCTGGACAGATTCGG
CACGAATCCCTCAAATATGTTTATAACTCTCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTAAGATGAATGGAGC
GGTCATCGACGAGCAGAGTCAGGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACTCGACTA
CTCTCCTTAATGAACTACTGAGACTTTCCGGTCTCTTATGA
Protein sequence
MSSSIIALLKSDRLTGENFTTWKSNLNMILVVDDLRFVLTEECPQILARNAPQSVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMNSLQEMFGQPSGQIR
HESLKYVYNSRMKEGSSVREHVLDLMVHFNVAKMNGAVIDEQSQVSFILESLPKSFLQFRSNAVMNKIEYNSTTLLNELLRLSGLL
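
The protein sequence can be checked against the CDS by translating it with the standard genetic code. The sketch below assumes the standard nuclear code applies and uses only the Python standard library; `translate` is a hypothetical helper written for this illustration, and the input is just the first 60 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the Tan0011282 CDS (20 codons).
head = "ATGTCGTCCTCAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACT"
print(translate(head))  # MSSSIIALLKSDRLTGENFT
```

Joining the six CDS lines into one string and passing it to `translate` should reproduce the full protein sequence, provided the lines are concatenated without whitespace.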