; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009137 (gene) of Snake gourd v1 genome

Gene IDTan0009137
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG08:16641886..16642527
RNA-Seq ExpressionTan0009137
SyntenyTan0009137
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]8.6e-7670.79Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        M+S+ + +L  DKLNG+NY +WK+ +N  L++DDLRF+L+EECP  P++ A R VR+ Y+RW  ANEKAR YILA++S+VL+KKHESM TA+EIM SLQ 
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        MFGQ S  ++HDALKY+YN RM EGASVREHVLNMMVHFNVAE+NG VIDE SQVSFI+ESLP+SFLQFR+NAVMNKI Y LTTLLNELQTFESLMK KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

Query:  KK
        +K
Subjt:  KK

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]8.6e-7670.79Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        M+S+ + +L  DKLNG+NY +WK+ +N  L++DDLRF+L+EECP  P++ A R VR+ Y+RW  ANEKAR YILA++S+VL+KKHESM TA+EIM SLQ 
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        MFGQ S  ++HDALKY+YN RM EGASVREHVLNMMVHFNVAE+NG VIDE SQVSFI+ESLP+SFLQFR+NAVMNKI Y LTTLLNELQTFESLMK KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

Query:  KK
        +K
Subjt:  KK

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]8.6e-7670.79Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        M+S+ + +L  DKLNG+NY +WK+ +N  L++DDLRF+L+EECP  P++ A R VR+ Y+RW  ANEKAR YILA++S+VL+KKHESM TA+EIM SLQ 
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        MFGQ S  ++HDALKY+YN RM EGASVREHVLNMMVHFNVAE+NG VIDE SQVSFI+ESLP+SFLQFR+NAVMNKI Y LTTLLNELQTFESLMK KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

Query:  KK
        +K
Subjt:  KK

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]9.8e-8075.5Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        MS+S IQLL  DKLNGDNY  WKSNLN  LV+DDLRF+L EECPP P+  ANR VRDAYDRW+ ANEKARVYILA+IS+VLSKKHE +AT +EIM SLQA
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        +FGQPS++L HDA+KYVYN RMKEG+SVREHVLNMMVHFNVAEVN  V++E SQV FIM+SLPKS+ QF+ NA+MNKIEY+LTTLLNELQ +ESL+K+KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

XP_038885834.1 uncharacterized protein LOC120076130 [Benincasa hispida]2.9e-7677.25Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        M+SS IQLL FDKL G+NY TWK+NLN  LV+DDL+FIL EECPP PSS ANR VRDAY+RWI  N+K   YILANISDVL+KKHESM T K+IM  L+ 
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNEL
        MFGQPS SLRHD++KY+YN  MKEGASVREHVLNMMVHFNVAEVN VV+DEKSQ+ FI+ESLPKSFLQF TNA+MNKIEYNLTTLLNEL
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNEL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.2e-7670.79Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        M+S+ + +L  DKLNG+NY +WK+ +N  L++DDLRF+L+EECP  P++ A R VR+ Y+RW  ANEKAR YILA++S+VL+KKHESM TA+EIM SLQ 
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        MFGQ S  ++HDALKY+YN RM EGASVREHVLNMMVHFNVAE+NG VIDE SQVSFI+ESLP+SFLQFR+NAVMNKI Y LTTLLNELQTFESLMK KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

Query:  KK
        +K
Subjt:  KK

A0A5A7TU93 Gag/pol protein4.2e-7670.79Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        M+S+ + +L  DKLNG+NY +WK+ +N  L++DDLRF+L+EECP  P++ A R VR+ Y+RW  ANEKAR YILA++S+VL+KKHESM TA+EIM SLQ 
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        MFGQ S  ++HDALKY+YN RM EGASVREHVLNMMVHFNVAE+NG VIDE SQVSFI+ESLP+SFLQFR+NAVMNKI Y LTTLLNELQTFESLMK KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

Query:  KK
        +K
Subjt:  KK

A0A5A7TWB9 Gag/pol protein4.2e-7670.79Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        M+S+ + +L  DKLNG+NY +WK+ +N  L++DDLRF+L+EECP  P++ A R VR+ Y+RW  ANEKAR YILA++S+VL+KKHESM TA+EIM SLQ 
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        MFGQ S  ++HDALKY+YN RM EGASVREHVLNMMVHFNVAE+NG VIDE SQVSFI+ESLP+SFLQFR+NAVMNKI Y LTTLLNELQTFESLMK KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

Query:  KK
        +K
Subjt:  KK

A0A5D3CPJ6 Gag/pol protein4.2e-7670.79Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        M+S+ + +L  DKLNG+NY +WK+ +N  L++DDLRF+L+EECP  P++ A R VR+ Y+RW  ANEKAR YILA++S+VL+KKHESM TA+EIM SLQ 
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        MFGQ S  ++HDALKY+YN RM EGASVREHVLNMMVHFNVAE+NG VIDE SQVSFI+ESLP+SFLQFR+NAVMNKI Y LTTLLNELQTFESLMK KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

Query:  KK
        +K
Subjt:  KK

A0A6J1DWG6 uncharacterized protein LOC1110250214.7e-8075.5Show/hide
Query:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA
        MS+S IQLL  DKLNGDNY  WKSNLN  LV+DDLRF+L EECPP P+  ANR VRDAYDRW+ ANEKARVYILA+IS+VLSKKHE +AT +EIM SLQA
Subjt:  MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQA

Query:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG
        +FGQPS++L HDA+KYVYN RMKEG+SVREHVLNMMVHFNVAEVN  V++E SQV FIM+SLPKS+ QF+ NA+MNKIEY+LTTLLNELQ +ESL+K+KG
Subjt:  MFGQPSSSLRHDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAGCTCTTTTATCCAGTTACTTGTCTTCGATAAACTTAACGGTGATAACTACAGAACCTGGAAATCAAACTTGAATATGTTTCTTGTTGTTGATGATCTGAGGTT
CATCTTAATGGAGGAATGTCCTCCCCCTCCCAGCTCGACTGCAAACCGAATTGTTCGGGATGCATATGACAGATGGATTACGGCTAATGAGAAGGCCAGAGTCTACATCT
TAGCCAATATATCTGATGTGTTGTCTAAGAAGCATGAGAGCATGGCCACCGCGAAGGAGATCATGGGGTCATTACAGGCGATGTTTGGACAACCGTCCTCATCGCTCCGC
CATGATGCTCTCAAATACGTTTACAACTTTCGTATGAAGGAGGGAGCTTCTGTTAGGGAACATGTCCTCAACATGATGGTCCACTTCAACGTGGCAGAAGTAAATGGTGT
TGTCATAGATGAAAAGAGTCAGGTTAGCTTTATTATGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCACCAATGCGGTGATGAATAAAATAGAATATAACCTGACTA
CTCTCCTCAATGAGCTTCAGACTTTTGAATCCCTGATGAAATCAAAGGGAAAAAAAAGGAGGCAAATGTTGTCACTTCAAAGAAGTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAGCTCTTTTATCCAGTTACTTGTCTTCGATAAACTTAACGGTGATAACTACAGAACCTGGAAATCAAACTTGAATATGTTTCTTGTTGTTGATGATCTGAGGTT
CATCTTAATGGAGGAATGTCCTCCCCCTCCCAGCTCGACTGCAAACCGAATTGTTCGGGATGCATATGACAGATGGATTACGGCTAATGAGAAGGCCAGAGTCTACATCT
TAGCCAATATATCTGATGTGTTGTCTAAGAAGCATGAGAGCATGGCCACCGCGAAGGAGATCATGGGGTCATTACAGGCGATGTTTGGACAACCGTCCTCATCGCTCCGC
CATGATGCTCTCAAATACGTTTACAACTTTCGTATGAAGGAGGGAGCTTCTGTTAGGGAACATGTCCTCAACATGATGGTCCACTTCAACGTGGCAGAAGTAAATGGTGT
TGTCATAGATGAAAAGAGTCAGGTTAGCTTTATTATGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCGCACCAATGCGGTGATGAATAAAATAGAATATAACCTGACTA
CTCTCCTCAATGAGCTTCAGACTTTTGAATCCCTGATGAAATCAAAGGGAAAAAAAAGGAGGCAAATGTTGTCACTTCAAAGAAGTTCTTAA
Protein sequenceShow/hide protein sequence
MSSSFIQLLVFDKLNGDNYRTWKSNLNMFLVVDDLRFILMEECPPPPSSTANRIVRDAYDRWITANEKARVYILANISDVLSKKHESMATAKEIMGSLQAMFGQPSSSLR
HDALKYVYNFRMKEGASVREHVLNMMVHFNVAEVNGVVIDEKSQVSFIMESLPKSFLQFRTNAVMNKIEYNLTTLLNELQTFESLMKSKGKKRRQMLSLQRSS