; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017097 (gene) of Snake gourd v1 genome

Gene IDTan0017097
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG09:35793784..35794353
RNA-Seq ExpressionTan0017097
SyntenyTan0017097
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]6.0e-6568.25Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        MS+S IQLL SDKLN DN+G WKSNLNT+LVIDDLRF   EEC P P+ +A RTVR  +D+W +AN+KA VYILAS S+VLSKKHE + T +EIM SL A
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        +FGQPS+++ +DA+KYVYN RMK G+SVRE VLNMMVHFNVA VN  V++E SQV FIM+SLPK + QF+ N++MNKIEY+LTTLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

XP_022158791.1 uncharacterized protein LOC111025258 [Momordica charantia]2.7e-6569.84Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        MS+S IQLL SDKLN DN+G WKSNLNT+LVIDDLRF   EEC P  + ++ +TVR   D+W +AN+KA VYILAS SDVLSKKHEG+ TA+EIM SL A
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        +FGQPS+SI +DA+KYVYN RMK G+SVRE VLNMMVHFNVA VN  V++E SQV FIM+SLPK + QF+TN++MNKIEY+LTTLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

XP_038876370.1 uncharacterized protein LOC120068812, partial [Benincasa hispida]2.5e-6367.2Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        MS+S  +LL S+KLN DN+GTWKSNLNT+LVIDDLRF   EEC P P+ +A RTV    D+WT+A +KA VYIL S SD+LSKKHE M+TAKEIM SL A
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        +FGQPSSS  +DA+K+VYN RMK G +VRE VL+MMVHFN+  VN  V++E SQV FIMESLPK F QFR N++MNKI+YNLTT+LNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

XP_038880476.1 uncharacterized protein LOC120072136 [Benincasa hispida]6.0e-6567.2Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        MS+S +Q L S+KLNDDN+GTWKSNLNT+LVIDDL+F   EEC P P+ +  RT+    D+WT+AN+KA VYILAS SD+LSKKHE M+ AKEIM SL A
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        +FGQPSSS  +DA+KYVYN RMK GT+VRE VL+MMVHFN+  VNG V++E +Q  FIMESLPK F QFRTN+++NKI+YNL TLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

XP_038885834.1 uncharacterized protein LOC120076130 [Benincasa hispida]9.7e-6367.2Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        M+SS IQLL  DKL  +N+ TWK+NLNT+LVIDDL+F   EEC P PSS+A RTVR  +++W R NDK   YILA+ SDVL+KKHE M T K+IM  L  
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        MFGQPS S+R+D++KY+YN  MK G SVRE VLNMMVHFNVA VN VV+DE SQ+ FI+ESLPK FLQF TN++MNKIEYNLTTLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.8e-6264.55Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        M+S+ + +L +DKLN +N+ +WK+ +NTVL+IDDLRF  +EEC   P+++ATRTVR  +++W +AN+KA  YILAS S+VL+KKHE M+TA+EIM SL  
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        MFGQ S  I++DALKY+YN+RM  G SVRE VLNMMVHFNVA +NG VIDE SQVSFI+ESLP+ FLQFR+N+VMNKI Y LTTLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

A0A5A7TU93 Gag/pol protein1.8e-6264.55Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        M+S+ + +L +DKLN +N+ +WK+ +NTVL+IDDLRF  +EEC   P+++ATRTVR  +++W +AN+KA  YILAS S+VL+KKHE M+TA+EIM SL  
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        MFGQ S  I++DALKY+YN+RM  G SVRE VLNMMVHFNVA +NG VIDE SQVSFI+ESLP+ FLQFR+N+VMNKI Y LTTLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

A0A5D3CPJ6 Gag/pol protein1.8e-6264.55Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        M+S+ + +L +DKLN +N+ +WK+ +NTVL+IDDLRF  +EEC   P+++ATRTVR  +++W +AN+KA  YILAS S+VL+KKHE M+TA+EIM SL  
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        MFGQ S  I++DALKY+YN+RM  G SVRE VLNMMVHFNVA +NG VIDE SQVSFI+ESLP+ FLQFR+N+VMNKI Y LTTLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

A0A6J1DWG6 uncharacterized protein LOC1110250212.9e-6568.25Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        MS+S IQLL SDKLN DN+G WKSNLNT+LVIDDLRF   EEC P P+ +A RTVR  +D+W +AN+KA VYILAS S+VLSKKHE + T +EIM SL A
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        +FGQPS+++ +DA+KYVYN RMK G+SVRE VLNMMVHFNVA VN  V++E SQV FIM+SLPK + QF+ N++MNKIEY+LTTLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

A0A6J1E205 uncharacterized protein LOC1110252581.3e-6569.84Show/hide
Query:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA
        MS+S IQLL SDKLN DN+G WKSNLNT+LVIDDLRF   EEC P  + ++ +TVR   D+W +AN+KA VYILAS SDVLSKKHEG+ TA+EIM SL A
Subjt:  MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPA

Query:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL
        +FGQPS+SI +DA+KYVYN RMK G+SVRE VLNMMVHFNVA VN  V++E SQV FIM+SLPK + QF+TN++MNKIEY+LTTLLNEL
Subjt:  MFGQPSSSIRYDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAGCTCTTTTATTCAATTACTCACCTCAGACAAACTTAACGACGATAACTTTGGAACTTGGAAATCAAACTTGAATACGGTTCTTGTAATTGATGATCTAAGGTT
CGCCTCGATGGAGGAATGTTGTCCCCCTCCCAGCTCGTCTGCAACCCGAACAGTTCGACATACATTTGACAAATGGACTAGGGCTAATGATAAAGCCTGGGTCTACATCT
TAGCCAGCACATCTGATGTGTTGTCTAAGAAACATGAGGGCATGATCACCGCAAAGGAGATCATGGGATCACTACCGGCCATGTTTGGACAACCGTCATCGTCGATCCGT
TATGATGCTCTCAAGTACGTTTACAACTCTCGAATGAAGGTGGGAACTTCTGTTAGGGAGCCTGTCCTTAATATGATGGTCCATTTCAACGTGGCAGTGGTAAACGGGGT
TGTCATAGATGAGAACAGTCAGGTTAGCTTTATAATGGAATCTCTTCCGAAGATTTTTCTGCAGTTCCGCACCAATTCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCTACTTAACGAGCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAGCTCTTTTATTCAATTACTCACCTCAGACAAACTTAACGACGATAACTTTGGAACTTGGAAATCAAACTTGAATACGGTTCTTGTAATTGATGATCTAAGGTT
CGCCTCGATGGAGGAATGTTGTCCCCCTCCCAGCTCGTCTGCAACCCGAACAGTTCGACATACATTTGACAAATGGACTAGGGCTAATGATAAAGCCTGGGTCTACATCT
TAGCCAGCACATCTGATGTGTTGTCTAAGAAACATGAGGGCATGATCACCGCAAAGGAGATCATGGGATCACTACCGGCCATGTTTGGACAACCGTCATCGTCGATCCGT
TATGATGCTCTCAAGTACGTTTACAACTCTCGAATGAAGGTGGGAACTTCTGTTAGGGAGCCTGTCCTTAATATGATGGTCCATTTCAACGTGGCAGTGGTAAACGGGGT
TGTCATAGATGAGAACAGTCAGGTTAGCTTTATAATGGAATCTCTTCCGAAGATTTTTCTGCAGTTCCGCACCAATTCGGTGATGAACAAGATAGAGTATAACCTGACTA
CTCTACTTAACGAGCTATAG
Protein sequenceShow/hide protein sequence
MSSSFIQLLTSDKLNDDNFGTWKSNLNTVLVIDDLRFASMEECCPPPSSSATRTVRHTFDKWTRANDKAWVYILASTSDVLSKKHEGMITAKEIMGSLPAMFGQPSSSIR
YDALKYVYNSRMKVGTSVREPVLNMMVHFNVAVVNGVVIDENSQVSFIMESLPKIFLQFRTNSVMNKIEYNLTTLLNEL