CuGenDBv2

CmaCh04G020260 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh04G020260
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Gag/pol protein
Genome location: Cma_Chr04:12854914..12856412
RNA-Seq Expression: CmaCh04G020260
Synteny: CmaCh04G020260
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
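As a minimal sketch, the genome location string can be split into sequence name, start, and end. The function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


seqid, start, end = parse_location("Cma_Chr04:12854914..12856412")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # Cma_Chr04 12854914 12856412 1499
```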


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.2e-65; identity: 88.19%)
Query:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLN GA++WRSIKQGCI+DSTMEAEYVAACEAAKE+VWLRKFL DLEV+PNM+L +TLYCDNSGAV NSKEPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        EIVQRGDVIVT+I  EHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.4e-10; identity: 89.19%)
Query:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD
        M+LEMESMYFNSVWELVD P+GVKPI CKWIYKRKRD
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD
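
The reported % identity can be cross-checked against the alignment itself. The sketch below assumes the page follows the usual BLAST convention for the middle line (a letter marks an identical residue, "+" a conservative substitution, a blank a mismatch); the function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of aligned columns whose midline character is a residue letter."""
    # Assumption: letters in the midline mark identities; '+' and blanks do not.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


query = "MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD"
midline = "M+LEMESMYFNSVWELVD P+GVKPI CKWIYKRKRD"

# 33 identical columns out of 37 aligned columns.
print(percent_identity(midline, len(query)))  # 89.19
```

This agrees with the 89.19 shown for this hit.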

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.2e-65; identity: 88.19%)
Query:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLN GA++WRSIKQGCI+DSTMEAEYVAACEAAKE+VWLRKFL DLEV+PNM+L +TLYCDNSGAV NSKEPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        EIVQRGDVIVT+I  EHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.1e-66; identity: 88.89%)
Query:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLNEGA++WRSIKQGCI+DSTMEAEYVAACEAAKE+VWLRKFL DLEV+PNM+L +TLYCDNSGAV NSKEPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        EIVQRGDVIVT+I  EHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

KAA0062799.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.9e-71; identity: 55.43%)
Query:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR----------------------------------------------------------------
        MNLE+ESMYFNSVW+LVDQ DGVKPI CKW YKRKR                                                                
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR----------------------------------------------------------------

Query:  -----------------------------DQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVT
                                        DSRKSTSGSVFTLN GA++WRSIKQGCI+DSTMEAEY+AACEAAKE+VWLRKFL DLEV+PNM   +T
Subjt:  -----------------------------DQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVT

Query:  LYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        LYCDNSGAV NS+EPRSHKRGKHIERKYHLIREI  RGDVIVTQI L HN+ DPFTKPLTAKVFEGHL SLGLR M
Subjt:  LYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

TYJ96755.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.9e-71; identity: 55.43%)
Query:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR----------------------------------------------------------------
        MNLE+ESMYFNSVW+LVDQ DGVKPI CKW YKRKR                                                                
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR----------------------------------------------------------------

Query:  -----------------------------DQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVT
                                        DSRKSTSGSVFTLN GA++WRSIKQGCI+DSTMEAEY+AACEAAKE+VWLRKFL DLEV+PNM   +T
Subjt:  -----------------------------DQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVT

Query:  LYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        LYCDNSGAV NS+EPRSHKRGKHIERKYHLIREI  RGDVIVTQI L HN+ DPFTKPLTAKVFEGHL SLGLR M
Subjt:  LYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TZD0 Gag/pol protein (e value: 5.7e-66; identity: 88.19%)
Query:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLN GA++WRSIKQGCI+DSTMEAEYVAACEAAKE+VWLRKFL DLEV+PNM+L +TLYCDNSGAV NSKEPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        EIVQRGDVIVT+I  EHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5A7TZD0 Gag/pol protein (e value: 1.2e-10; identity: 89.19%)
Query:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD
        M+LEMESMYFNSVWELVD P+GVKPI CKWIYKRKRD
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD

A0A5A7UYE8 Gag/pol protein (e value: 1.2e-10; identity: 89.19%)
Query:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD
        M+LEMESMYFNSVWELVD P+GVKPI CKWIYKRKRD
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD

A0A5A7V1F5 Gag/pol protein (e value: 1.5e-66; identity: 88.89%)
Query:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR
        DSRKSTSGSVFTLNEGA++WRSIKQGCI+DSTMEAEYVAACEAAKE+VWLRKFL DLEV+PNM+L +TLYCDNSGAV NSKEPRSHKRGKHIERKYHLIR
Subjt:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        EIVQRGDVIVT+I  EHNIADPFTK LTAKVFEGHL SLGLR M
Subjt:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5A7VB15 Gag/pol protein (e value: 9.1e-72; identity: 55.43%)
Query:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR----------------------------------------------------------------
        MNLE+ESMYFNSVW+LVDQ DGVKPI CKW YKRKR                                                                
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR----------------------------------------------------------------

Query:  -----------------------------DQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVT
                                        DSRKSTSGSVFTLN GA++WRSIKQGCI+DSTMEAEY+AACEAAKE+VWLRKFL DLEV+PNM   +T
Subjt:  -----------------------------DQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVT

Query:  LYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        LYCDNSGAV NS+EPRSHKRGKHIERKYHLIREI  RGDVIVTQI L HN+ DPFTKPLTAKVFEGHL SLGLR M
Subjt:  LYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

A0A5D3BE74 Gag/pol protein (e value: 9.1e-72; identity: 55.43%)
Query:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR----------------------------------------------------------------
        MNLE+ESMYFNSVW+LVDQ DGVKPI CKW YKRKR                                                                
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR----------------------------------------------------------------

Query:  -----------------------------DQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVT
                                        DSRKSTSGSVFTLN GA++WRSIKQGCI+DSTMEAEY+AACEAAKE+VWLRKFL DLEV+PNM   +T
Subjt:  -----------------------------DQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVT

Query:  LYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
        LYCDNSGAV NS+EPRSHKRGKHIERKYHLIREI  RGDVIVTQI L HN+ DPFTKPLTAKVFEGHL SLGLR M
Subjt:  LYCDNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 7.8e-20; identity: 38.57%)
Query:  RKSTSGSVFTLNE-GAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIRE
        RKST+G +F + +   I W + +Q  ++ S+ EAEY+A  EA +E++WL+  LT + +   +   + +Y DN G ++ +  P  HKR KHI+ KYH  RE
Subjt:  RKSTSGSVFTLNE-GAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIRE

Query:  IVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGL
         VQ   + +  I  E+ +AD FTKPL A  F      LGL
Subjt:  IVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGL

P0CV72 Secreted RxLR effector protein 161 (e value: 7.6e-07; identity: 52%)
Query:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWL
        +SR+STSG +F LN G + WRS KQ  ++ S+ E EY+A  EA +E+VWL
Subjt:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.7e-22; identity: 42.11%)
Query:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR
        D+RKS++G +FT + GAI W+S  Q C++ ST EAEY+AA E  KE +WL++FL +L +    ++   +YCD+  A+  SK    H R KHI+ +YH IR
Subjt:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR

Query:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFE
        E+V    + V +I    N AD  TK +    FE
Subjt:  EIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.0e-03; identity: 45.95%)
Query:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD
        M  EMES+  N  ++LV+ P G +P+ CKW++K K+D
Subjt:  MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRD

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 2.1e-12; identity: 38.61%)
Query:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR
        D+R+ST+G    L    I W+S KQ  +S S+ EAEY A   A  E +WL +F  +L++  +   L  L+CDN+ A+  +     H+R KHIE   H +R
Subjt:  DSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYCDNSGAVTNSKEPRSHKRGKHIERKYHLIR

Query:  E
        E
Subjt:  E


Sequences
CDS sequence
ATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCAGATGGGGTAAAACCTATTAGTTGCAAATGGATCTATAAGAGGAAACGAGA
CCAAACCGATTCGAGGAAATCGACATCAGGATCTGTCTTCACTTTGAACGAAGGAGCAATAATATGGCGAAGCATAAAGCAAGGTTGCATTTCTGATTCCACCATGGAGG
CTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGAAAGTTCTTAACTGATTTGGAAGTCATTCCAAATATGCATCTTCTCGTCACTCTTTATTGT
GATAACAGTGGAGCAGTTACAAATTCGAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAGCGAAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGT
GATCGTCACACAGATAGTTTTGGAGCACAACATTGCTGATCCATTTACAAAGCCTCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGT
AA
mRNA sequence
ATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCAGATGGGGTAAAACCTATTAGTTGCAAATGGATCTATAAGAGGAAACGAGA
CCAAACCGATTCGAGGAAATCGACATCAGGATCTGTCTTCACTTTGAACGAAGGAGCAATAATATGGCGAAGCATAAAGCAAGGTTGCATTTCTGATTCCACCATGGAGG
CTGAGTATGTTGCTGCTTGTGAAGCAGCGAAAGAATCTGTATGGCTTAGAAAGTTCTTAACTGATTTGGAAGTCATTCCAAATATGCATCTTCTCGTCACTCTTTATTGT
GATAACAGTGGAGCAGTTACAAATTCGAAAGAACCAAGAAGCCATAAGCGAGGAAAACATATTGAGCGAAAATATCATCTCATAAGGGAGATTGTGCAACGAGGAGATGT
GATCGTCACACAGATAGTTTTGGAGCACAACATTGCTGATCCATTTACAAAGCCTCTCACGGCTAAAGTGTTTGAAGGGCACCTAGTGAGTCTAGGACTACGAGTTATGT
AA
Protein sequence
MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKRDQTDSRKSTSGSVFTLNEGAIIWRSIKQGCISDSTMEAEYVAACEAAKESVWLRKFLTDLEVIPNMHLLVTLYC
DNSGAVTNSKEPRSHKRGKHIERKYHLIREIVQRGDVIVTQIVLEHNIADPFTKPLTAKVFEGHLVSLGLRVM
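
The relationship between the CDS and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code and uses only the first 36 codons of the CDS; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 108 nucleotides (36 codons) of the CDS listed above.
cds_prefix = (
    "ATGAATCTTGAAATGGAGTCAATGTACTTCAATTCAGTCTGGGAACTTGTAGATCAACCAGATGGG"
    "GTAAAACCTATTAGTTGCAAATGGATCTATAAGAGGAAACGA"
)
print(translate(cds_prefix))  # MNLEMESMYFNSVWELVDQPDGVKPISCKWIYKRKR
```

The output matches the first 36 residues of the protein sequence.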