CuGenDBv2

PI0013045 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0013045
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Gag protease polyprotein
Genome location: chr05:16702276..16705286
RNA-Seq Expression: PI0013045
Synteny: PI0013045
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0006885 - regulation of pH (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:1902600 - proton transmembrane transport (biological process)
GO:0012505 - endomembrane system (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0015299 - solute:proton antiporter activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains: NA
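
The genome location listed above follows a seqid:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span can be parsed and measured as follows. The helper name parse_location is hypothetical and not part of CuGenDB.

```python
import re

# Hypothetical helper: split a "seqid:start..end" string into its parts.
def parse_location(location):
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end

seqid, start, end = parse_location("chr05:16702276..16705286")
# Assumption: 1-based inclusive coordinates, so the length is end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene spans more bases than the CDS listed further down, which would be consistent with introns or untranslated regions, although the page itself does not describe the gene structure.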


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033252.1 gag-protease polyprotein [Cucumis melo var. makuwa] | e value: 3.1e-45 | % identity: 53.5
Query:  CVCCPRTKEVCKAQKS---FCEFL-LPSVVFASARVPPGPCSRAGLSRLSL---LRKSLDREPTVRRGVSVGHLLA---LANSYRVVLVDLASEAKHLRD
        C  C  T+ +C +  S    C  L +  ++  S  +    C+  G +RL     +++  DR    R  +  GH+ A   L  S  VV   L+ EAKHLRD
Subjt:  CVCCPRTKEVCKAQKS---FCEFL-LPSVVFASARVPPGPCSRAGLSRLSL---LRKSLDREPTVRRGVSVGHLLA---LANSYRVVLVDLASEAKHLRD

Query:  FRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKDTKRQEFLDLK
        FRKY+P TFDGSL+DP RA++WLSS+ETIFRYM+C EDQKVQCAVF+L  RG  WW +TERMLGGDVSQITW QFKESFY KFFSA+L+D KRQEFL+L+
Subjt:  FRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKDTKRQEFLDLK

KAA0049906.1 gag-protease polyprotein [Cucumis melo var. makuwa] | e value: 9.0e-45 | % identity: 74.14
Query:  RVVLVDLASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFF
        +VVL  L++EAKHLRDFRKY+P TFDGSL+DP RA++WLSS+ETIFRYM+CHEDQKVQC VF+L  RG  WW +TERMLGGDV QITW QFKESFY KFF
Subjt:  RVVLVDLASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFF

Query:  SANLKDTKRQEFLDLK
        S +L+D KRQEFL+L+
Subjt:  SANLKDTKRQEFLDLK

KAA0066483.1 gag protease polyprotein [Cucumis melo var. makuwa] | e value: 2.4e-45 | % identity: 79.09
Query:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD
        L++EAKHLRDFRKY+P TFDGSL+DP RA+MWLSS+ETIFRYM+CHEDQKVQCAVF+L  RG  WW +TERMLGGDVSQITW QFKESFY KFFSA+L+D
Subjt:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD

Query:  TKRQEFLDLK
         KRQEFL+L+
Subjt:  TKRQEFLDLK

KGN57866.2 hypothetical protein Csa_011500 [Cucumis sativus] | e value: 1.2e-44 | % identity: 78.18
Query:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD
        L++EAKHLRDFRKYDPQTFDGSL+DP +AEMWLSSVETIF YMRC E+ +VQCA FLLR RG+IWW +T RMLGGDV QITWDQFK+ FY KFFSANL+D
Subjt:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD

Query:  TKRQEFLDLK
         K QEFL+LK
Subjt:  TKRQEFLDLK

XP_031744116.1 uncharacterized protein LOC116404788 [Cucumis sativus] | e value: 1.2e-44 | % identity: 78.18
Query:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD
        L++EAKHLRDFRKYDPQTFDGSL+DP +AEMWLSSVETIF YMRC E+ +VQCA FLLR RG+IWW +T RMLGGDV QITWDQFK+ FY KFFSANL+D
Subjt:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD

Query:  TKRQEFLDLK
         K QEFL+LK
Subjt:  TKRQEFLDLK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SPY6 Gag-protease polyprotein | e value: 1.5e-45 | % identity: 53.5
Query:  CVCCPRTKEVCKAQKS---FCEFL-LPSVVFASARVPPGPCSRAGLSRLSL---LRKSLDREPTVRRGVSVGHLLA---LANSYRVVLVDLASEAKHLRD
        C  C  T+ +C +  S    C  L +  ++  S  +    C+  G +RL     +++  DR    R  +  GH+ A   L  S  VV   L+ EAKHLRD
Subjt:  CVCCPRTKEVCKAQKS---FCEFL-LPSVVFASARVPPGPCSRAGLSRLSL---LRKSLDREPTVRRGVSVGHLLA---LANSYRVVLVDLASEAKHLRD

Query:  FRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKDTKRQEFLDLK
        FRKY+P TFDGSL+DP RA++WLSS+ETIFRYM+C EDQKVQCAVF+L  RG  WW +TERMLGGDVSQITW QFKESFY KFFSA+L+D KRQEFL+L+
Subjt:  FRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKDTKRQEFLDLK

A0A5A7TD13 Reverse transcriptase | e value: 5.7e-45 | % identity: 75
Query:  RVVLVDLASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFF
        +VV   L++EAKHLRDFRKY+P TFDGSL+DP RA++WLSS+ETIFRYM+C EDQKVQCAVF+L  RG +WW +TERMLGGDVSQITW QFKESFY KFF
Subjt:  RVVLVDLASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFF

Query:  SANLKDTKRQEFLDLK
        SA+L+D KRQEFL+L+
Subjt:  SANLKDTKRQEFLDLK

A0A5A7THE6 Reverse transcriptase | e value: 5.7e-45 | % identity: 79.09
Query:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD
        L++EAKHLRDFRKY+P TFDGSL+DP RA+MWLSS+ETIFRYM+C EDQKVQCAVF+L  RG  WW +TERMLGGDVSQITW QFKESFY KFFSA+L+D
Subjt:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD

Query:  TKRQEFLDLK
         KRQEFL+LK
Subjt:  TKRQEFLDLK

A0A5A7U6V1 Gag-protease polyprotein | e value: 4.3e-45 | % identity: 74.14
Query:  RVVLVDLASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFF
        +VVL  L++EAKHLRDFRKY+P TFDGSL+DP RA++WLSS+ETIFRYM+CHEDQKVQC VF+L  RG  WW +TERMLGGDV QITW QFKESFY KFF
Subjt:  RVVLVDLASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFF

Query:  SANLKDTKRQEFLDLK
        S +L+D KRQEFL+L+
Subjt:  SANLKDTKRQEFLDLK

A0A5A7VEL1 Gag protease polyprotein | e value: 1.1e-45 | % identity: 79.09
Query:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD
        L++EAKHLRDFRKY+P TFDGSL+DP RA+MWLSS+ETIFRYM+CHEDQKVQCAVF+L  RG  WW +TERMLGGDVSQITW QFKESFY KFFSA+L+D
Subjt:  LASEAKHLRDFRKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKD

Query:  TKRQEFLDLK
         KRQEFL+L+
Subjt:  TKRQEFLDLK

SwissProt top hits
No hits found
Arabidopsis top hits
No hits found
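
In the alignments above, the line between Query and Subjt is the match line: a letter marks an identical residue, a plus sign marks a conservative substitution, and a space marks a mismatch. As an illustrative sketch only, the identities in a single segment can be counted from that line. The function name midline_stats is hypothetical, and the example uses just the short final segment of the KAA0049906.1 alignment, so its result is not expected to equal the % identity reported for the full alignment.

```python
# Hypothetical helper: count identities and positives in one alignment segment.
def midline_stats(query, midline):
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    # Letters in the match line mark identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(query)

query = "SANLKDTKRQEFLDLK"
midline = "S +L+D KRQEFL+L+"
identities, positives, length = midline_stats(query, midline)
percent_identity = 100.0 * identities / length
print(identities, positives, length, percent_identity)
```

To approximate the value for a whole hit, the counts from every segment of that hit would be summed before dividing by the total alignment length.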

Sequences
CDS sequence
ATGTCAATCGCCGGTCTCCCTCTGCCTCTACCTCTATCGGAAATATTTAACATGAAAGAGAGTGATAACATCAAGCCTAGCCATCAACTACCACCACTTATTACTTCACA
TAATACTGTATGGTCAGCAAGCTGTAACGACCGCAAACTCGGGATCATTGAGTTCTTCACTTCTCATTCTGGCACTCAGCCGCCGAGTAGCACTCTTTCTCCCAGTTCCC
TTGCGCAAGGCGTTGTCTTCGCCGTTCGAAAGCTCTCCAGTCATCGTTTCACTGGAAAGGAAGCTTTGGTCTTTGGTTTGGGCGTTCTGGGCCGTTTTCGTGAAGTGTGG
AGGTGCTTCGTGGCTTTCTCGTCTTGTTGGTTTTCTTCCGTCTTTTCTTCGCGAGGTTGTGTGTGTTGTCCGCGAACTAAGGAAGTTTGCAAAGCTCAGAAGTCGTTCTG
CGAGTTTCTCTTGCCGTCAGTGGTGTTCGCGTCAGCTCGCGTCCCACCTGGTCCTTGCTCGCGAGCTGGATTGTCTCGCCTCTCCCTTCTTCGCAAGTCACTTGATCGCG
AGCCTACTGTTCGCCGAGGAGTCAGCGTTGGTCATCTTCTCGCGTTAGCCAACTCTTACCGCGTGGTGCTTGTGGACTTGGCATCCGAGGCAAAACATCTAAGAGATTTC
AGGAAGTACGACCCCCAGACATTTGATGGGTCATTGGATGATCCATGCAGGGCGGAGATGTGGTTGTCCTCTGTGGAGACCATCTTCCGCTATATGAGGTGCCATGAGGA
CCAGAAGGTCCAGTGTGCAGTTTTCCTTCTGAGGGGCAGAGGGGTGATTTGGTGGTGCTCAACAGAGAGGATGCTGGGTGGTGACGTGAGCCAGATCACGTGGGATCAGT
TTAAGGAGAGCTTTTATGACAAGTTCTTCTCCGCGAATCTGAAGGACACCAAGCGTCAGGAGTTCTTGGACCTAAAGTAG
mRNA sequence
ATGTCAATCGCCGGTCTCCCTCTGCCTCTACCTCTATCGGAAATATTTAACATGAAAGAGAGTGATAACATCAAGCCTAGCCATCAACTACCACCACTTATTACTTCACA
TAATACTGTATGGTCAGCAAGCTGTAACGACCGCAAACTCGGGATCATTGAGTTCTTCACTTCTCATTCTGGCACTCAGCCGCCGAGTAGCACTCTTTCTCCCAGTTCCC
TTGCGCAAGGCGTTGTCTTCGCCGTTCGAAAGCTCTCCAGTCATCGTTTCACTGGAAAGGAAGCTTTGGTCTTTGGTTTGGGCGTTCTGGGCCGTTTTCGTGAAGTGTGG
AGGTGCTTCGTGGCTTTCTCGTCTTGTTGGTTTTCTTCCGTCTTTTCTTCGCGAGGTTGTGTGTGTTGTCCGCGAACTAAGGAAGTTTGCAAAGCTCAGAAGTCGTTCTG
CGAGTTTCTCTTGCCGTCAGTGGTGTTCGCGTCAGCTCGCGTCCCACCTGGTCCTTGCTCGCGAGCTGGATTGTCTCGCCTCTCCCTTCTTCGCAAGTCACTTGATCGCG
AGCCTACTGTTCGCCGAGGAGTCAGCGTTGGTCATCTTCTCGCGTTAGCCAACTCTTACCGCGTGGTGCTTGTGGACTTGGCATCCGAGGCAAAACATCTAAGAGATTTC
AGGAAGTACGACCCCCAGACATTTGATGGGTCATTGGATGATCCATGCAGGGCGGAGATGTGGTTGTCCTCTGTGGAGACCATCTTCCGCTATATGAGGTGCCATGAGGA
CCAGAAGGTCCAGTGTGCAGTTTTCCTTCTGAGGGGCAGAGGGGTGATTTGGTGGTGCTCAACAGAGAGGATGCTGGGTGGTGACGTGAGCCAGATCACGTGGGATCAGT
TTAAGGAGAGCTTTTATGACAAGTTCTTCTCCGCGAATCTGAAGGACACCAAGCGTCAGGAGTTCTTGGACCTAAAGTAG
Protein sequence
MSIAGLPLPLPLSEIFNMKESDNIKPSHQLPPLITSHNTVWSASCNDRKLGIIEFFTSHSGTQPPSSTLSPSSLAQGVVFAVRKLSSHRFTGKEALVFGLGVLGRFREVW
RCFVAFSSCWFSSVFSSRGCVCCPRTKEVCKAQKSFCEFLLPSVVFASARVPPGPCSRAGLSRLSLLRKSLDREPTVRRGVSVGHLLALANSYRVVLVDLASEAKHLRDF
RKYDPQTFDGSLDDPCRAEMWLSSVETIFRYMRCHEDQKVQCAVFLLRGRGVIWWCSTERMLGGDVSQITWDQFKESFYDKFFSANLKDTKRQEFLDLK
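
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below assumes the standard genetic code (the page does not name the translation table) and uses only the first and last few codons of the CDS shown above. The names CODON_TABLE and translate are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

first_codons = "ATGTCAATCGCCGGTCTCCCTCTGCCT"  # start of the CDS above
last_codons = "GACCTAAAGTAG"                  # end of the CDS above, including the stop codon
print(translate(first_codons))
print(translate(last_codons))
```

The two fragments translate to the first nine and last three residues of the protein sequence listed above.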