; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004697 (gene) of Snake gourd v1 genome

Gene IDTan0004697
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG01:32637477..32637872
RNA-Seq ExpressionTan0004697
SyntenyTan0004697
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-4373.23Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKAR YILAS+S+VL+KKHE+M+TA+EIM SLQ MFGQ S  + +DA+KY+YN+RM + ASVREHVL+MM HFNVAE+NGAVIDE SQV+FI+ESLP+S
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNELLDF
        FL FR+NAVMNKI Y LTTLLNEL  F
Subjt:  FLPFRTNAVMNKIEYNLTTLLNELLDF

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-4373.23Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKAR YILAS+S+VL+KKHE+M+TA+EIM SLQ MFGQ S  + +DA+KY+YN+RM + ASVREHVL+MM HFNVAE+NGAVIDE SQV+FI+ESLP+S
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNELLDF
        FL FR+NAVMNKI Y LTTLLNEL  F
Subjt:  FLPFRTNAVMNKIEYNLTTLLNELLDF

XP_022158791.1 uncharacterized protein LOC111025258 [Momordica charantia]5.3e-4476.61Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKARVYILASISDVLSKKHE + TA+EIM SLQA+FGQPS+S+ +DA+KYVYN RMK+ +SVREHVL+MM HFNVAEVN AV++E SQV FIM+SLPKS
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNEL
        +  F+TNA+MNKIEY+LTTLLNEL
Subjt:  FLPFRTNAVMNKIEYNLTTLLNEL

XP_038876370.1 uncharacterized protein LOC120068812, partial [Benincasa hispida]7.0e-4476.42Show/hide
Query:  EKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKSF
        EKA+VYIL SISD+LSKKHE MVTAKEIM SLQA+FGQPSSS  +DA+K+VYN RMK+  +VREHVLDMM HFN+ EVN AV++EKSQV FIMESLPKSF
Subjt:  EKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKSF

Query:  LPFRTNAVMNKIEYNLTTLLNEL
          FR NA+MNKI+YNLTT+LNEL
Subjt:  LPFRTNAVMNKIEYNLTTLLNEL

XP_038880476.1 uncharacterized protein LOC120072136 [Benincasa hispida]2.4e-4476.61Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKA+VYILASISD+LSKKHE MV AKEIM SLQA+FGQPSSS  +DA+KYVYN RMK+  +VREHVLDMM HFN+ EVNGAV++EK+Q  FIMESLPKS
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNEL
        F  FRTNA++NKI+YNL TLLNEL
Subjt:  FLPFRTNAVMNKIEYNLTTLLNEL

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.3e-4373.23Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKAR YILAS+S+VL+KKHE+M+TA+EIM SLQ MFGQ S  + +DA+KY+YN+RM + ASVREHVL+MM HFNVAE+NGAVIDE SQV+FI+ESLP+S
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNELLDF
        FL FR+NAVMNKI Y LTTLLNEL  F
Subjt:  FLPFRTNAVMNKIEYNLTTLLNELLDF

A0A5A7V4M1 Gag/pol protein1.3e-4373.23Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKAR YILAS+S+VL+KKHE+M+TA+EIM SLQ MFGQ S  + +DA+KY+YN+RM + ASVREHVL+MM HFNVAE+NGAVIDE SQV+FI+ESLP+S
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNELLDF
        FL FR+NAVMNKI Y LTTLLNEL  F
Subjt:  FLPFRTNAVMNKIEYNLTTLLNELLDF

A0A5D3CPJ6 Gag/pol protein1.3e-4373.23Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKAR YILAS+S+VL+KKHE+M+TA+EIM SLQ MFGQ S  + +DA+KY+YN+RM + ASVREHVL+MM HFNVAE+NGAVIDE SQV+FI+ESLP+S
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNELLDF
        FL FR+NAVMNKI Y LTTLLNEL  F
Subjt:  FLPFRTNAVMNKIEYNLTTLLNELLDF

A0A5D3DII7 Gag/pol protein1.3e-4370.87Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKARVYILAS+SDVL+KKHE++ T KEIM SL+ MFGQP  S+ + A+KY+Y  +MK+ ASVREHVLDMM HFN+AEVNG  IDE +QV+FI+ESLPKS
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNELLDF
        F+PF+TNA +NKIEYN+TTLLNEL  F
Subjt:  FLPFRTNAVMNKIEYNLTTLLNELLDF

A0A6J1E205 uncharacterized protein LOC1110252582.6e-4476.61Show/hide
Query:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS
        NEKARVYILASISDVLSKKHE + TA+EIM SLQA+FGQPS+S+ +DA+KYVYN RMK+ +SVREHVL+MM HFNVAEVN AV++E SQV FIM+SLPKS
Subjt:  NEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKS

Query:  FLPFRTNAVMNKIEYNLTTLLNEL
        +  F+TNA+MNKIEY+LTTLLNEL
Subjt:  FLPFRTNAVMNKIEYNLTTLLNEL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTGGCCTAATGAGAAGGCCCGAGTCTATATCTTAGCCAGCATATCTGATGTGTTATCTAAGAAACATGAGACCATGGTCACCGCAAAGGAGATCATGGGATCATT
ACAGGCGATGTTTGGACAACCATCCTCATCGGTCCATTATGATGCTGTCAAATACGTTTACAACTCCCGTATGAAGAAGAGAGCCTCTGTTAGGGAACATGTCCTTGACA
TGATGACCCACTTCAACGTGGCTGAAGTAAATGGGGCAGTCATAGATGAGAAAAGTCAGGTAACTTTTATTATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACA
AATGCGGTGATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTACTGGACTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTGGCCTAATGAGAAGGCCCGAGTCTATATCTTAGCCAGCATATCTGATGTGTTATCTAAGAAACATGAGACCATGGTCACCGCAAAGGAGATCATGGGATCATT
ACAGGCGATGTTTGGACAACCATCCTCATCGGTCCATTATGATGCTGTCAAATACGTTTACAACTCCCGTATGAAGAAGAGAGCCTCTGTTAGGGAACATGTCCTTGACA
TGATGACCCACTTCAACGTGGCTGAAGTAAATGGGGCAGTCATAGATGAGAAAAGTCAGGTAACTTTTATTATGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCACA
AATGCGGTGATGAATAAAATAGAGTATAACCTGACTACTCTCCTCAACGAGCTACTGGACTTTTGA
Protein sequenceShow/hide protein sequence
MDWPNEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVHYDAVKYVYNSRMKKRASVREHVLDMMTHFNVAEVNGAVIDEKSQVTFIMESLPKSFLPFRT
NAVMNKIEYNLTTLLNELLDF