; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003213 (gene) of Snake gourd v1 genome

Gene IDTan0003213
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG03:15061921..15062574
RNA-Seq ExpressionTan0003213
SyntenyTan0003213
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]7.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]7.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]7.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]7.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]7.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein3.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

A0A5A7TU93 Gag/pol protein3.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

A0A5A7TWB9 Gag/pol protein3.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

A0A5D3CPJ6 Gag/pol protein3.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

A0A5D3CSZ6 Gag/pol protein3.7e-7266.33Show/hide
Query:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK
        M+SA + +LA DKL G+NY SWKN INT+L+ DD++ VL EECPQ+P + A+R+VR+ ++RW +ANEKA+ YI+ S+S+VLAKKHE M++A+EIM+SLQ+
Subjt:  MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQK

Query:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR
        MFGQ S+Q++HD+LKY++NA M EG+SVREHVL+MM HFN++EMNGA IDE+SQVSFILE+LP+SFLQFRSN VMNKI+YTLTTLLNELQ F+SL +++
Subjt:  MFGQLSFQVRHDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCGCAATAATAGCACTACTCGCGTATGATAAGTTAGTGGGAGATAATTACCAAAGTTGGAAAAACAACATTAACACGATTTTGGTAACTGACGACATAAAGCT
CGTGTTGTCTGAGGAGTGTCCTCAGATGCCGGGCTCGACCGCATCGCGAAGTGTTCGCGATGCGCATGATCGGTGGATCAGGGCAAATGAAAAGGCCAAGGTCTACATAA
TTGTCAGCATGTCTGATGTCTTGGCAAAGAAGCATGAGCTGATGGTCTCTGCCAAGGAGATCATGGAGTCCTTGCAGAAAATGTTTGGACAACTATCCTTTCAGGTCCGG
CATGACTCCCTCAAATACGTTTTCAACGCATGGATGAAGGAGGGGTCGTCTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTGTCCGAGATGAACGGGGC
TTCGATCGACGAGTCGAGCCAGGTTAGTTTTATTTTGGAGACTCTTCCAAAGAGTTTCCTTCAGTTTCGTAGCAATGTTGTTATGAACAAAATTAGCTACACTCTGACTA
CCCTCCTCAATGAGCTACAAAATTTCCAATCCTTGAAAAGGGTAAGGAATCTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCGCGGATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCGCAATAATAGCACTACTCGCGTATGATAAGTTAGTGGGAGATAATTACCAAAGTTGGAAAAACAACATTAACACGATTTTGGTAACTGACGACATAAAGCT
CGTGTTGTCTGAGGAGTGTCCTCAGATGCCGGGCTCGACCGCATCGCGAAGTGTTCGCGATGCGCATGATCGGTGGATCAGGGCAAATGAAAAGGCCAAGGTCTACATAA
TTGTCAGCATGTCTGATGTCTTGGCAAAGAAGCATGAGCTGATGGTCTCTGCCAAGGAGATCATGGAGTCCTTGCAGAAAATGTTTGGACAACTATCCTTTCAGGTCCGG
CATGACTCCCTCAAATACGTTTTCAACGCATGGATGAAGGAGGGGTCGTCTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTGTCCGAGATGAACGGGGC
TTCGATCGACGAGTCGAGCCAGGTTAGTTTTATTTTGGAGACTCTTCCAAAGAGTTTCCTTCAGTTTCGTAGCAATGTTGTTATGAACAAAATTAGCTACACTCTGACTA
CCCTCCTCAATGAGCTACAAAATTTCCAATCCTTGAAAAGGGTAAGGAATCTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCGCGGATTTAA
Protein sequenceShow/hide protein sequence
MSSAIIALLAYDKLVGDNYQSWKNNINTILVTDDIKLVLSEECPQMPGSTASRSVRDAHDRWIRANEKAKVYIIVSMSDVLAKKHELMVSAKEIMESLQKMFGQLSFQVR
HDSLKYVFNAWMKEGSSVREHVLDMMTHFNLSEMNGASIDESSQVSFILETLPKSFLQFRSNVVMNKISYTLTTLLNELQNFQSLKRVRNLRQMLPTGLITGVRPRI