; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012769 (gene) of Snake gourd v1 genome

Gene IDTan0012769
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:64848715..64849284
RNA-Seq ExpressionTan0012769
SyntenyTan0012769
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-6767.72Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+ +WK+ +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-6767.72Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+ +WK+ +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-6767.72Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+ +WK+ +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-6768.25Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+  WK+ +NT+L++D+LRFVL EECPQ+PA NA Q+V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-6767.72Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+ +WK+ +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.4e-6767.72Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+ +WK+ +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

A0A5A7TU93 Gag/pol protein1.4e-6767.72Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+ +WK+ +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

A0A5A7TWB9 Gag/pol protein1.4e-6767.72Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+ +WK+ +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

A0A5A7V6N0 Gag/pol protein6.3e-6868.25Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+  WK+ +NT+L++D+LRFVL EECPQ+PA NA Q+V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

A0A5D3CPJ6 Gag/pol protein1.4e-6767.72Show/hide
Query:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE
        M+ + + +L +D+L G N+ +WK+ +NT+L++D+LRFVL EECPQ+PA NA ++V+E Y+ W KAN+KA+ YILAS+SEVLAKK+E M++AREIM SLQE
Subjt:  MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQE

Query:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL
        MFGQ S QI+H++LKY+YN+RM E +S+REHVL++MVHFNVAEMNGAVIDE S VSFILESLP+SFLQFR+NAVMNKI Y LTTLLNEL
Subjt:  MFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G00980.1 zinc knuckle (CCHC-type) family protein2.0e-0521.64Show/hide
Query:  ALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQS-------VKEAYDHWIKANDKAKVYILASVSEVLAKK-NEGMVSAREIMSSL
        +++K  R  G+++  W S +   L    L +VLSE CP I +   P++              W++ +     +++ S+S+ L ++ ++    A+E+   L
Subjt:  ALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQS-------VKEAYDHWIKANDKAKVYILASVSEVLAKK-NEGMVSAREIMSSL

Query:  QEMFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQF
        + ++     + +   ++     RM E   + E V       +     G  +DE  HVS I+   P S+  F
Subjt:  QEMFGQPSRQIRHESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCTCAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCGTTGTTGACAACCTACGGTT
TGTATTGTCTGAGGAATGTCCTCAGATCCCTGCTCGTAACGCTCCTCAATCTGTTAAGGAGGCGTACGACCACTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGAACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGAAATGTTTGGACAACCGTCTAGACAGATTCGG
CACGAATCCCTCAAATATGTTTATAACTCCCGTATGAAGGAGAGGTCATCGATGAGAGAACACGTTCTTGATCTAATGGTCCACTTCAACGTGGCTGAGATGAACGGAGC
GGTCATTGACGAGCAAAGTCATGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCAACAATGCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCTCCTTAATGAACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCTCAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCGTTGTTGACAACCTACGGTT
TGTATTGTCTGAGGAATGTCCTCAGATCCCTGCTCGTAACGCTCCTCAATCTGTTAAGGAGGCGTACGACCACTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGAACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGAAATGTTTGGACAACCGTCTAGACAGATTCGG
CACGAATCCCTCAAATATGTTTATAACTCCCGTATGAAGGAGAGGTCATCGATGAGAGAACACGTTCTTGATCTAATGGTCCACTTCAACGTGGCTGAGATGAACGGAGC
GGTCATTGACGAGCAAAGTCATGTATCGTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCAACAATGCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCTCCTTAATGAACTCTGA
Protein sequenceShow/hide protein sequence
MSFSIIALLKSDRLTGENFTTWKSNLNTILVVDNLRFVLSEECPQIPARNAPQSVKEAYDHWIKANDKAKVYILASVSEVLAKKNEGMVSAREIMSSLQEMFGQPSRQIR
HESLKYVYNSRMKERSSMREHVLDLMVHFNVAEMNGAVIDEQSHVSFILESLPKSFLQFRNNAVMNKIEYNLTTLLNEL