; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020591 (gene) of Snake gourd v1 genome

Gene IDTan0020591
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG11:24181672..24183876
RNA-Seq ExpressionTan0020591
SyntenyTan0020591
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]8.4e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]8.4e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]8.4e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]8.4e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]8.4e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein4.1e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

A0A5A7TU93 Gag/pol protein4.1e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

A0A5A7TWB9 Gag/pol protein4.1e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

A0A5A7V4M1 Gag/pol protein4.1e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

A0A5D3CPJ6 Gag/pol protein4.1e-4168.29Show/hide
Query:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES
        +VP++NATR VR+ Y+RW KAN+KA+ Y+LAS+S++LAKKHE M+TA+EIMDS++ MFGQ S Q  H+ LKYI+N+RM EG+SVR+HVL+MMVHFN+AE 
Subjt:  KVPSSNATRNVRDAYDRWIKANDKAKVYMLASMSDILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAES

Query:  NGASIDESSQVSFILETLPDSFL
        NGA IDE+SQVSFILE+LP+SFL
Subjt:  NGASIDESSQVSFILETLPDSFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGCGTTGCAACGCCAGCGCAGGCGTCGCAACGCTGAAGCATCTGAGACGCGCGACATGAAGGAGTCAGCGTTGCAACGCGATCCTCAACGTTGCAACGCTACTGG
AGTAACGCTTCAATGGGCAAAAACGGACAAGGTGCAGCGTTGCAACACCGAAGTCCTCCCTTCGGCGGATTTTATTGGATGGGGACTTTGGGGTCTAAAGATAGAGGGTT
ACACTCATAAGGTGCCAAGCTCAAATGCTACACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCTATATGCTGGCAAGTATGTCT
GACATATTAGCCAAGAAGCATGAGGGCATGATGACCGCTAAGGAAATCATGGATTCAATGAAGGGTATGTTTGGACAACAGTCTACACAGGCCTGTCACAATGACCTCAA
GTATATATTCAACTCGAGGATGCCAGAGGGTTCATCTGTTCGAGATCATGTCCTAGATATGATGGTACACTTTAACATCGCGGAGTCGAATGGTGCTTCCATCGATGAAT
CGAGTCAGGTCAGCTTCATTCTGGAAACTCTTCCAGATAGTTTCCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGATGGCGTTGCAACGCCAGCGCAGGCGTCGCAACGCTGAAGCATCTGAGACGCGCGACATGAAGGAGTCAGCGTTGCAACGCGATCCTCAACGTTGCAACGCTACTGG
AGTAACGCTTCAATGGGCAAAAACGGACAAGGTGCAGCGTTGCAACACCGAAGTCCTCCCTTCGGCGGATTTTATTGGATGGGGACTTTGGGGTCTAAAGATAGAGGGTT
ACACTCATAAGGTGCCAAGCTCAAATGCTACACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCTATATGCTGGCAAGTATGTCT
GACATATTAGCCAAGAAGCATGAGGGCATGATGACCGCTAAGGAAATCATGGATTCAATGAAGGGTATGTTTGGACAACAGTCTACACAGGCCTGTCACAATGACCTCAA
GTATATATTCAACTCGAGGATGCCAGAGGGTTCATCTGTTCGAGATCATGTCCTAGATATGATGGTACACTTTAACATCGCGGAGTCGAATGGTGCTTCCATCGATGAAT
CGAGTCAGGTCAGCTTCATTCTGGAAACTCTTCCAGATAGTTTCCTATAG
Protein sequenceShow/hide protein sequence
MMALQRQRRRRNAEASETRDMKESALQRDPQRCNATGVTLQWAKTDKVQRCNTEVLPSADFIGWGLWGLKIEGYTHKVPSSNATRNVRDAYDRWIKANDKAKVYMLASMS
DILAKKHEGMMTAKEIMDSMKGMFGQQSTQACHNDLKYIFNSRMPEGSSVRDHVLDMMVHFNIAESNGASIDESSQVSFILETLPDSFL