; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001335 (gene) of Snake gourd v1 genome

Gene IDTan0001335
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG08:54387206..54387616
RNA-Seq ExpressionTan0001335
SyntenyTan0001335
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-4878.12Show/hide
Query:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES
        W KAN+KA+ YILAS+SEVLAKKHE M++AREIM  LQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAV+DE ++VSFILES
Subjt:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES

Query:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL
        LP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-4878.12Show/hide
Query:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES
        W KAN+KA+ YILAS+SEVLAKKHE M++AREIM  LQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAV+DE ++VSFILES
Subjt:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES

Query:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL
        LP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-4878.12Show/hide
Query:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES
        W KAN+KA+ YILAS+SEVLAKKHE M++AREIM  LQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAV+DE ++VSFILES
Subjt:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES

Query:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL
        LP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-4878.12Show/hide
Query:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES
        W KAN+KA+ YILAS+SEVLAKKHE M++AREIM  LQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAV+DE ++VSFILES
Subjt:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES

Query:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL
        LP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL

XP_022155999.1 uncharacterized protein LOC111022974 [Momordica charantia]7.5e-4977.61Show/hide
Query:  RGGVGHWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKV
        R     WIKANDKAKVYILAS+S+VLAKKHE MV+A+EIM  L++MFGQPS Q RHE+LK++YNSRMKEG+S++EHVL+LMVHFNVAEMNGAV+DE ++V
Subjt:  RGGVGHWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKV

Query:  SFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        SFILESL KSFLQFRSNAVMNKIEYNLT LL EL
Subjt:  SFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.0e-4878.12Show/hide
Query:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES
        W KAN+KA+ YILAS+SEVLAKKHE M++AREIM  LQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAV+DE ++VSFILES
Subjt:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES

Query:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL
        LP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL

A0A5A7TU93 Gag/pol protein4.0e-4878.12Show/hide
Query:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES
        W KAN+KA+ YILAS+SEVLAKKHE M++AREIM  LQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAV+DE ++VSFILES
Subjt:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES

Query:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL
        LP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL

A0A5A7V4M1 Gag/pol protein4.0e-4878.12Show/hide
Query:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES
        W KAN+KA+ YILAS+SEVLAKKHE M++AREIM  LQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAV+DE ++VSFILES
Subjt:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES

Query:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL
        LP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL

A0A5D3CPJ6 Gag/pol protein4.0e-4878.12Show/hide
Query:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES
        W KAN+KA+ YILAS+SEVLAKKHE M++AREIM  LQEMFGQ S QI+H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAV+DE ++VSFILES
Subjt:  WIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILES

Query:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL
        LP+SFLQFRSNAVMNKI Y LTTLLNEL
Subjt:  LPKSFLQFRSNAVMNKIEYNLTTLLNEL

A0A6J1DRZ2 uncharacterized protein LOC1110229743.6e-4977.61Show/hide
Query:  RGGVGHWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKV
        R     WIKANDKAKVYILAS+S+VLAKKHE MV+A+EIM  L++MFGQPS Q RHE+LK++YNSRMKEG+S++EHVL+LMVHFNVAEMNGAV+DE ++V
Subjt:  RGGVGHWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKV

Query:  SFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEL
        SFILESL KSFLQFRSNAVMNKIEYNLT LL EL
Subjt:  SFILESLPKSFLQFRSNAVMNKIEYNLTTLLNEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAGGGGAGGCGTCGGCCATTGGATCAAGGCTAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTC
AGCTCGTGAGATCATGAGTTTGCTGCAGGAAATGTTTGGACAACCGTCTAGACAGATTCGACACGAATCCCTAAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCAT
CGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATGGACGAGCAAAATAAGGTATCGTTCATCCTGGAGTCTCTTCCG
AAGAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTATAG
mRNA sequenceShow/hide mRNA sequence
ATGCATAGGGGAGGCGTCGGCCATTGGATCAAGGCTAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTC
AGCTCGTGAGATCATGAGTTTGCTGCAGGAAATGTTTGGACAACCGTCTAGACAGATTCGACACGAATCCCTAAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCAT
CGGTGAGAGAACACGTTCTTGATCTGATGGTCCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATGGACGAGCAAAATAAGGTATCGTTCATCCTGGAGTCTCTTCCG
AAGAGTTTCCTGCAATTCCGCAGCAATGCGGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTATAG
Protein sequenceShow/hide protein sequence
MHRGGVGHWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSLLQEMFGQPSRQIRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVMDEQNKVSFILESLP
KSFLQFRSNAVMNKIEYNLTTLLNEL