; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G30780 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G30780
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag-pol polyprotein
Genome locationChr1:25285386..25286417
RNA-Seq ExpressionCSPI01G30780
SyntenyCSPI01G30780
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032462.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]3.8e-4961.21Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+GN+RA+NAI+N VDLN FKLI+ C TAKEAW+ +E+ +EGTSKVK SR+ L+TSKFE+ +M+E+E+++K N R LEIAND+  LGE I E+
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE
        ++VRKVLRSLP++FDMKV AIE A D+TT+ LDE+ GSLLTFE++++++   K KG+AF S  E+
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE

KAA0039138.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.3e-4961.21Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+ N+RA+NAI+NGVDLN+FKLI+ C TAKEAW+ +E+ +EGTSKVK SR+ L+TSKFE+L+M E+E+++K N R L+I ND+  LGE I E+
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE
        ++VRKVLRSLP++FDMKV  IE A D+TTL LDE+ GSL+TFE+++T +   KGKG+AF S  E+
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE

KAA0051798.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.9e-4961.21Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+GN+RA+NAI+NGV+L++FKLI+ C TAKEAW+ +E+ FE TSKVK SR+ L+TSKFE+L+M E+ET+++ N R LEIAND+  LGE I E+
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE
        ++V KVLRSLP++FDMKV AIE A D+ TL LDE+ GSLLTFE++++++   KGKG+AF SI ++
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE

KAA0064037.1 gag-proteinase polyprotein [Cucumis melo var. makuwa]1.7e-4961.9Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+GN+RALNAI+N VDLN+FKLI+ C+TAKEAW+ +E+T+EGTSKVK SR+ L+TSKFE+LRM E+E+++  N R LEIAN++  LGE I ++
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPS--IQEEA
        ++V K+LRSLP++FDMKV  IE AHD+TTL LDE+ GSLLTFE++  ++   KGKG+AF S  + EEA
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPS--IQEEA

TYK21443.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.9e-4961.21Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+GN+RA+NAI+NGV+L++FKLI+ C TAKEAW+ +E+ FE TSKVK SR+ L+TSKFE+L+M E+ET+++ N R LEIAND+  LGE I E+
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE
        ++V KVLRSLP++FDMKV AIE A D+ TL LDE+ GSLLTFE++++++   KGKG+AF SI ++
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE

TrEMBL top hitse value%identityAlignment
A0A5A7U931 Gag-pol polyprotein1.4e-4961.21Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+GN+RA+NAI+NGV+L++FKLI+ C TAKEAW+ +E+ FE TSKVK SR+ L+TSKFE+L+M E+ET+++ N R LEIAND+  LGE I E+
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE
        ++V KVLRSLP++FDMKV AIE A D+ TL LDE+ GSLLTFE++++++   KGKG+AF SI ++
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE

A0A5D3D1H0 Gag-proteinase polyprotein1.9e-4961.21Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+GN+RA+NAI+N VDLN FKLI+ C TAKEAW+ +E+ +EGTSKVK SR+ L+TSKFE+ +M+E+E+++K N R LEIAND+  LGE I E+
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE
        ++VRKVLRSLP++FDMKV AIE A D+TT+ LDE+ GSLLTFE++++++   K KG+AF S  E+
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE

A0A5D3DCZ8 Gag-pol polyprotein1.4e-4961.21Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+GN+RA+NAI+NGV+L++FKLI+ C TAKEAW+ +E+ FE TSKVK SR+ L+TSKFE+L+M E+ET+++ N R LEIAND+  LGE I E+
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE
        ++V KVLRSLP++FDMKV AIE A D+ TL LDE+ GSLLTFE++++++   KGKG+AF SI ++
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE

A0A5D3DRL5 Gag-proteinase polyprotein8.4e-5061.9Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+GN+RALNAI+N VDLN+FKLI+ C+TAKEAW+ +E+T+EGTSKVK SR+ L+TSKFE+LRM E+E+++  N R LEIAN++  LGE I ++
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPS--IQEEA
        ++V K+LRSLP++FDMKV  IE AHD+TTL LDE+ GSLLTFE++  ++   KGKG+AF S  + EEA
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPS--IQEEA

A0A5D3DTY9 Gag-pol polyprotein1.1e-4961.21Show/hide
Query:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA
        TDAEEQ S+ N+RA+NAI+NGVDLN+FKLI+ C TAKEAW+ +E+ +EGTSKVK SR+ L+TSKFE+L+M E+E+++K N R L+I ND+  LGE I E+
Subjt:  TDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNNVRELEIANDAFNLGEGISEA

Query:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE
        ++VRKVLRSLP++FDMKV  IE A D+TTL LDE+ GSL+TFE+++T +   KGKG+AF S  E+
Subjt:  RLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGTTGAGAGAGCAGCTCTGGAGTTCAGGAGGAGCCTGATCCATTCTTCAAAGGAAAGTCAGTACCTAAGCCTGAGGAAAATTGGGACTGATGCAGAAGAACAGGG
TTCTTTGGGAAATTCCCGAGCTCTCAATGCTATATACAATGGGGTTGATCTGAATATGTTCAAGCTAATCAGCTTGTGTGCTACTGCGAAAGAAGCATGGAGAAAAATGG
AGATAACATTTGAAGGAACTTCCAAAGTCAAAACATCTCGTGTACTGTTGTTAACTTCTAAATTCGAATCACTAAGAATGATGGAAGAAGAAACTATCACAAAGAATAAT
GTTCGAGAACTGGAAATAGCCAATGATGCATTTAATCTTGGTGAAGGAATCTCAGAAGCAAGATTGGTAAGGAAAGTGTTACGCTCTTTACCAAAGAGATTCGATATGAA
AGTCATCGCGATTGAAGGAGCACATGATGTCACTACTCTCTCGCTTGATGAAATGATTGGTTCATTGCTCACATTTGAAATTTCACTTACCGAAAAAGCAGAAAATAAAG
GCAAAGGGGTGGCGTTTCCATCCATCCAAGAAGAAGCATATCTAGAAAAGGGAAAGGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGTTGAGAGAGCAGCTCTGGAGTTCAGGAGGAGCCTGATCCATTCTTCAAAGGAAAGTCAGTACCTAAGCCTGAGGAAAATTGGGACTGATGCAGAAGAACAGGG
TTCTTTGGGAAATTCCCGAGCTCTCAATGCTATATACAATGGGGTTGATCTGAATATGTTCAAGCTAATCAGCTTGTGTGCTACTGCGAAAGAAGCATGGAGAAAAATGG
AGATAACATTTGAAGGAACTTCCAAAGTCAAAACATCTCGTGTACTGTTGTTAACTTCTAAATTCGAATCACTAAGAATGATGGAAGAAGAAACTATCACAAAGAATAAT
GTTCGAGAACTGGAAATAGCCAATGATGCATTTAATCTTGGTGAAGGAATCTCAGAAGCAAGATTGGTAAGGAAAGTGTTACGCTCTTTACCAAAGAGATTCGATATGAA
AGTCATCGCGATTGAAGGAGCACATGATGTCACTACTCTCTCGCTTGATGAAATGATTGGTTCATTGCTCACATTTGAAATTTCACTTACCGAAAAAGCAGAAAATAAAG
GCAAAGGGGTGGCGTTTCCATCCATCCAAGAAGAAGCATATCTAGAAAAGGGAAAGGCTTAA
Protein sequenceShow/hide protein sequence
MGVERAALEFRRSLIHSSKESQYLSLRKIGTDAEEQGSLGNSRALNAIYNGVDLNMFKLISLCATAKEAWRKMEITFEGTSKVKTSRVLLLTSKFESLRMMEEETITKNN
VRELEIANDAFNLGEGISEARLVRKVLRSLPKRFDMKVIAIEGAHDVTTLSLDEMIGSLLTFEISLTEKAENKGKGVAFPSIQEEAYLEKGKA