; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G12210 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G12210
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag-pol polyprotein
Genome locationChr4:10534800..10536190
RNA-Seq ExpressionCSPI04G12210
SyntenyCSPI04G12210
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032462.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.0e-7361.67Show/hide
Query:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS
        M++I+ G S ++SP+LDG NY++WKP M  F+K++D ++W+A++ G+EPPM+ V G S PKPE DWTDAE+QAS GNA A+NAI+N VD N FKLI++C+
Subjt:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS

Query:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK
        +AKEAWK LEVAYEGTSKVK +RLQL++SKFE+  M+EDES++++N RVL+IAN+  +LGE+I ES++VRKVLRSLPR+FD+KVT +EEA DI ++KLD+
Subjt:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK

Query:  LFDSLHAFEILISDIEDKKCKGIGFQS
        LF SL  FE+ +SD E KK KGI F+S
Subjt:  LFDSLHAFEILISDIEDKKCKGIGFQS

KAA0040963.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.2e-7361.67Show/hide
Query:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS
        M++I+ G S ++SP+LDG NY++WKP M  F+K +D ++W+A+++G++PPMI V G   PKPE DWTDAE+QAS GNA ALNAI+N VD NVFKLI++CS
Subjt:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS

Query:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK
        ++KEAWK+LEVAYEGTSKVK +RLQL++SKFE+L M EDES++++N RVL+IAN+  +L E+I +SE+VRKVLRSLPR+F +KVT +EEAHDI +LKLD+
Subjt:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK

Query:  LFDSLHAFEILISDIEDKKCKGIGFQS
        LF SL  FE+  +D E KK K I F+S
Subjt:  LFDSLHAFEILISDIEDKKCKGIGFQS

KAA0054435.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.7e-7260.99Show/hide
Query:  KGRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCSSAKEA
        +G S  + P+LDG NY++WKP M  F+K++D ++W+A++ G+EP MI+V G S PKPE DWTDAE+QAS G A A+NAI+NGVD NVFKLI+ C++AKEA
Subjt:  KGRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCSSAKEA

Query:  WKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDKLFDSL
        WK LEVAYEGTSKVK ++L+L++SKFE+L M EDE+++E+N RVL+I N+  +LGE+I ES++V KVLRSLPR+FDIKVT ++EA DI +L LD+LF SL
Subjt:  WKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDKLFDSL

Query:  HAFEILISDIEDKKCKGIGFQSI
          FE+ ISD E KK KGI F+S+
Subjt:  HAFEILISDIEDKKCKGIGFQSI

KAA0067564.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.4e-7261.23Show/hide
Query:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS
        M++I+ G S +  P+LDG NY++WKP M  F+K++D ++W+ ++ G+EPPM+ V   S PKPE DWTDAE+QAS GNA A+NAI+ GVD NVFKLI++C+
Subjt:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS

Query:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK
        +AKEA K L+VAYEGTSKVK +RLQL++SKFE+L M EDES++E+N RVL+IAN+  +LGE+ISES++V KVLRSLPR+FD+KV  +EEA DI +LKLD+
Subjt:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK

Query:  LFDSLHAFEILISDIEDKKCKGIGFQS
        LF SL  FE+ +SDIE KK KGI F+S
Subjt:  LFDSLHAFEILISDIEDKKCKGIGFQS

TYK02457.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]4.0e-7359.58Show/hide
Query:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS
        M++I+ G S ++ P+LDG NY++WKP M  F+K++D ++W+A+++G++PPMI V G S PKPE DWTDAE+QAS GNA ALN I+NGVD NVFKLI++CS
Subjt:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS

Query:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK
        +AKEAWK+LEVA EGTSKVK +RLQL++ KFE+L M EDES++++N RVL+IANE  +LGE+I +S+LVRKVLRS P++ D+KVT  EEAHDI +LKLD+
Subjt:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK

Query:  LFDSLHAFEILISDIEDKKCKGIGFQS--IPRAFVPNTDS
        LF SL  FE+  ++ E KK KGI F+S  +    V NT+S
Subjt:  LFDSLHAFEILISDIEDKKCKGIGFQS--IPRAFVPNTDS

TrEMBL top hitse value%identityAlignment
A0A5A7TCV9 Gag-pol polyprotein2.5e-7361.67Show/hide
Query:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS
        M++I+ G S ++SP+LDG NY++WKP M  F+K +D ++W+A+++G++PPMI V G   PKPE DWTDAE+QAS GNA ALNAI+N VD NVFKLI++CS
Subjt:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS

Query:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK
        ++KEAWK+LEVAYEGTSKVK +RLQL++SKFE+L M EDES++++N RVL+IAN+  +L E+I +SE+VRKVLRSLPR+F +KVT +EEAHDI +LKLD+
Subjt:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK

Query:  LFDSLHAFEILISDIEDKKCKGIGFQS
        LF SL  FE+  +D E KK K I F+S
Subjt:  LFDSLHAFEILISDIEDKKCKGIGFQS

A0A5D3BBI3 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-7261.23Show/hide
Query:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS
        M++I+ G S +  P+LDG NY++WKP M  F+K++D ++W+ ++ G+EPPM+ V   S PKPE DWTDAE+QAS GNA A+NAI+ GVD NVFKLI++C+
Subjt:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS

Query:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK
        +AKEA K L+VAYEGTSKVK +RLQL++SKFE+L M EDES++E+N RVL+IAN+  +LGE+ISES++V KVLRSLPR+FD+KV  +EEA DI +LKLD+
Subjt:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK

Query:  LFDSLHAFEILISDIEDKKCKGIGFQS
        LF SL  FE+ +SDIE KK KGI F+S
Subjt:  LFDSLHAFEILISDIEDKKCKGIGFQS

A0A5D3BSP9 Gag-proteinase polyprotein1.9e-7359.58Show/hide
Query:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS
        M++I+ G S ++ P+LDG NY++WKP M  F+K++D ++W+A+++G++PPMI V G S PKPE DWTDAE+QAS GNA ALN I+NGVD NVFKLI++CS
Subjt:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS

Query:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK
        +AKEAWK+LEVA EGTSKVK +RLQL++ KFE+L M EDES++++N RVL+IANE  +LGE+I +S+LVRKVLRS P++ D+KVT  EEAHDI +LKLD+
Subjt:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK

Query:  LFDSLHAFEILISDIEDKKCKGIGFQS--IPRAFVPNTDS
        LF SL  FE+  ++ E KK KGI F+S  +    V NT+S
Subjt:  LFDSLHAFEILISDIEDKKCKGIGFQS--IPRAFVPNTDS

A0A5D3CS19 Gag-pol polyprotein2.8e-7260.99Show/hide
Query:  KGRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCSSAKEA
        +G S  + P+LDG NY++WKP M  F+K++D ++W+A++ G+EP MI+V G S PKPE DWTDAE+QAS G A A+NAI+NGVD NVFKLI+ C++AKEA
Subjt:  KGRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCSSAKEA

Query:  WKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDKLFDSL
        WK LEVAYEGTSKVK ++L+L++SKFE+L M EDE+++E+N RVL+I N+  +LGE+I ES++V KVLRSLPR+FDIKVT ++EA DI +L LD+LF SL
Subjt:  WKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDKLFDSL

Query:  HAFEILISDIEDKKCKGIGFQSI
          FE+ ISD E KK KGI F+S+
Subjt:  HAFEILISDIEDKKCKGIGFQSI

A0A5D3D1H0 Gag-proteinase polyprotein5.1e-7461.67Show/hide
Query:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS
        M++I+ G S ++SP+LDG NY++WKP M  F+K++D ++W+A++ G+EPPM+ V G S PKPE DWTDAE+QAS GNA A+NAI+N VD N FKLI++C+
Subjt:  MDMIK-GRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCS

Query:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK
        +AKEAWK LEVAYEGTSKVK +RLQL++SKFE+  M+EDES++++N RVL+IAN+  +LGE+I ES++VRKVLRSLPR+FD+KVT +EEA DI ++KLD+
Subjt:  SAKEAWKSLEVAYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDK

Query:  LFDSLHAFEILISDIEDKKCKGIGFQS
        LF SL  FE+ +SD E KK KGI F+S
Subjt:  LFDSLHAFEILISDIEDKKCKGIGFQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATGATCAAAGGCAGATCTACTACTCAATCACCCCTTCTCGATGGAACAAATTATGCTCATTGGAAGCCTTGTATGACAAGTTTCCTCAAATCAATTGACAGTGA
AAGCTGGAAAGCTGTTATCTCCGGTTGGGAACCACCTATGATCCTCGTAGAGGGGAAATCTACACCTAAACCAGAATCAGACTGGACCGATGCTGAAGATCAAGCTTCTC
GTGGCAACGCTTGTGCTCTCAATGCTATTTATAATGGAGTAGATCAAAATGTATTTAAATTGATACATACTTGTTCTTCAGCAAAAGAAGCCTGGAAAAGTCTTGAAGTT
GCATATGAAGGTACATCTAAAGTGAAAACAGCACGTTTACAATTGTTGAGTTCGAAATTTGAATCCTTAATGATGATGGAAGATGAATCAATTGCTGAATTTAATGTCAG
AGTCCTAGACATTGCAAATGAATTCTTTGTTCTAGGTGAAAGGATTTCTGAATCCGAACTGGTAAGAAAAGTCTTGAGATCTTTACCTAGAAGATTTGATATAAAAGTAA
CAATAATGGAAGAAGCTCATGATATTGAATCTTTAAAGTTAGATAAATTATTTGACTCTTTACATGCCTTTGAAATTTTGATATCGGACATAGAAGACAAAAAATGTAAA
GGTATTGGGTTTCAATCTATACCTAGAGCATTTGTCCCTAATACTGATTCCCCTTGTCCCGAGGAGCCTGTTATTAATCCTTCTCAACCATCTGCTTCTACTGCACCTTG
TCCTAAACATGAGGATCCTGTTCATGTTCCTAAGAATGCGAAGAAGTTCTCCAAAGGGCCTAACATAATTACCACACAAGCTGGAAGAAGAAAGCTACCTCCAAATATTC
CATCTGATCAAGATGCTCCAGGGCCTGCTCCTAAAACTCTGTCTCACAGCTACACATTATTTCAAGTCTCTCATGCTCCAGATACTGAGCACTCAGTTCGACCTCCTCGA
GGACCTCAATCTTCTGATACAGAGCATCTTAATATATCTTCCAACAGCCTTATTACCTCATTCCTTGGCCTCTCGGATCTTCAGCATCCTGTCTGCTGA
mRNA sequenceShow/hide mRNA sequence
GTCGATTCAAACACACTTTGTGAGTTTTGAGTTTTTATACTATAAGGGATCTCTATTGGATCATGTACTTTAGGGGTCGATTCAAACACACTTTGTGAGTTTTGAGTTTT
TATACTATAAGGGATCTCTTCTAGATTTTTAAAAAGTTGTTTACTCACTTCAATCTTAATAATATTCTCTTATGTGACCTAAAAGCTCTCTCCCGTCTATTTGTGTTATT
TGCTACTTATACCAGCTAAAAGCTCTCTGATATTTGTTCTTACTGTTCCTCTTTTTCAATTAGTATCAGAGGTTTTAACCAAATGGATATGATCAAAGGCAGATCTACTA
CTCAATCACCCCTTCTCGATGGAACAAATTATGCTCATTGGAAGCCTTGTATGACAAGTTTCCTCAAATCAATTGACAGTGAAAGCTGGAAAGCTGTTATCTCCGGTTGG
GAACCACCTATGATCCTCGTAGAGGGGAAATCTACACCTAAACCAGAATCAGACTGGACCGATGCTGAAGATCAAGCTTCTCGTGGCAACGCTTGTGCTCTCAATGCTAT
TTATAATGGAGTAGATCAAAATGTATTTAAATTGATACATACTTGTTCTTCAGCAAAAGAAGCCTGGAAAAGTCTTGAAGTTGCATATGAAGGTACATCTAAAGTGAAAA
CAGCACGTTTACAATTGTTGAGTTCGAAATTTGAATCCTTAATGATGATGGAAGATGAATCAATTGCTGAATTTAATGTCAGAGTCCTAGACATTGCAAATGAATTCTTT
GTTCTAGGTGAAAGGATTTCTGAATCCGAACTGGTAAGAAAAGTCTTGAGATCTTTACCTAGAAGATTTGATATAAAAGTAACAATAATGGAAGAAGCTCATGATATTGA
ATCTTTAAAGTTAGATAAATTATTTGACTCTTTACATGCCTTTGAAATTTTGATATCGGACATAGAAGACAAAAAATGTAAAGGTATTGGGTTTCAATCTATACCTAGAG
CATTTGTCCCTAATACTGATTCCCCTTGTCCCGAGGAGCCTGTTATTAATCCTTCTCAACCATCTGCTTCTACTGCACCTTGTCCTAAACATGAGGATCCTGTTCATGTT
CCTAAGAATGCGAAGAAGTTCTCCAAAGGGCCTAACATAATTACCACACAAGCTGGAAGAAGAAAGCTACCTCCAAATATTCCATCTGATCAAGATGCTCCAGGGCCTGC
TCCTAAAACTCTGTCTCACAGCTACACATTATTTCAAGTCTCTCATGCTCCAGATACTGAGCACTCAGTTCGACCTCCTCGAGGACCTCAATCTTCTGATACAGAGCATC
TTAATATATCTTCCAACAGCCTTATTACCTCATTCCTTGGCCTCTCGGATCTTCAGCATCCTGTCTGCTGA
Protein sequenceShow/hide protein sequence
MDMIKGRSTTQSPLLDGTNYAHWKPCMTSFLKSIDSESWKAVISGWEPPMILVEGKSTPKPESDWTDAEDQASRGNACALNAIYNGVDQNVFKLIHTCSSAKEAWKSLEV
AYEGTSKVKTARLQLLSSKFESLMMMEDESIAEFNVRVLDIANEFFVLGERISESELVRKVLRSLPRRFDIKVTIMEEAHDIESLKLDKLFDSLHAFEILISDIEDKKCK
GIGFQSIPRAFVPNTDSPCPEEPVINPSQPSASTAPCPKHEDPVHVPKNAKKFSKGPNIITTQAGRRKLPPNIPSDQDAPGPAPKTLSHSYTLFQVSHAPDTEHSVRPPR
GPQSSDTEHLNISSNSLITSFLGLSDLQHPVC