; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G22880 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G22880
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPol polyprotein
Genome locationChr1:18515042..18515501
RNA-Seq ExpressionCSPI01G22880
SyntenyCSPI01G22880
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042920.1 pol polyprotein [Cucumis melo var. makuwa]8.6e-5373.83Show/hide
Query:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------
        MGHQSKD  KPKN +KKH QAHITEVDEVS+GV DIDLC VI ECN V NSKEWWVDTGATRHICANK M TSYV VS GEQLFMGN++TS         
Subjt:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------

Query:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN
              KELT N VLHVPDIR NL+SGSLLS N FKLVFVSDKFV SKN
Subjt:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN

KAA0048117.1 hypothetical protein E6C27_scaffold385G001960 [Cucumis melo var. makuwa]1.5e-4971.53Show/hide
Query:  KDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS--------------
        +DCRKPKN +KKHAQAHI +VDEVS+GVADIDLC VILECN + NSKEWWVDTGAT HICANK + T YV +S GEQLFMG++ TS              
Subjt:  KDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS--------------

Query:  -KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN
         KELT N VLHVPDIR NL+SGSLLS NDFKLVFVSDKFVLSKN
Subjt:  -KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN

KAA0060612.1 pol polyprotein [Cucumis melo var. makuwa]7.3e-5272.48Show/hide
Query:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------
        MGHQSKDCRKPKN +KKHAQAHITEVDEVS+GVADIDL  VIL+CN V N KEWWVDT ATRHICANK M  SYV +S GEQLFMG+++TS         
Subjt:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------

Query:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN
              KELT N VLHVPDIR NL+SGSLLS N FKLVFVSDKFVLS+N
Subjt:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN

KAA0062659.1 pol polyprotein [Cucumis melo var. makuwa]1.2e-4672.59Show/hide
Query:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------
        MGHQSKDCRKPK  +KKHAQAHITEVDEVS+GVADIDLC VILECN +DNSKEWWVDTGATRHIC NK +  SYV VS GEQLFMGN+ TS         
Subjt:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------

Query:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDF
              KELT N VLHVPDIR NL+SGSLLS N F
Subjt:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDF

TYK01391.1 pol polyprotein [Cucumis melo var. makuwa]4.3e-5271.81Show/hide
Query:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTSK--------
        MGHQSKDCRKPKN +KKHAQAHITEVDEVS+GVADID+C V+LECN + NSKEWWVDTGAT HICANK + TSYV V+ GEQLFMGN+  SK        
Subjt:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTSK--------

Query:  -------ELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN
               ELT N VLHVPDIR NL+ GSLLS N F+LVFVSDKFVLSKN
Subjt:  -------ELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN

TrEMBL top hitse value%identityAlignment
A0A5A7THY2 Pol polyprotein4.2e-5373.83Show/hide
Query:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------
        MGHQSKD  KPKN +KKH QAHITEVDEVS+GV DIDLC VI ECN V NSKEWWVDTGATRHICANK M TSYV VS GEQLFMGN++TS         
Subjt:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------

Query:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN
              KELT N VLHVPDIR NL+SGSLLS N FKLVFVSDKFV SKN
Subjt:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN

A0A5A7U1K6 Uncharacterized protein7.3e-5071.53Show/hide
Query:  KDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS--------------
        +DCRKPKN +KKHAQAHI +VDEVS+GVADIDLC VILECN + NSKEWWVDTGAT HICANK + T YV +S GEQLFMG++ TS              
Subjt:  KDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS--------------

Query:  -KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN
         KELT N VLHVPDIR NL+SGSLLS NDFKLVFVSDKFVLSKN
Subjt:  -KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN

A0A5D3BN56 Pol polyprotein5.8e-4772.59Show/hide
Query:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------
        MGHQSKDCRKPK  +KKHAQAHITEVDEVS+GVADIDLC VILECN +DNSKEWWVDTGATRHIC NK +  SYV VS GEQLFMGN+ TS         
Subjt:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------

Query:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDF
              KELT N VLHVPDIR NL+SGSLLS N F
Subjt:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDF

A0A5D3BNM4 Pol polyprotein2.1e-5271.81Show/hide
Query:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTSK--------
        MGHQSKDCRKPKN +KKHAQAHITEVDEVS+GVADID+C V+LECN + NSKEWWVDTGAT HICANK + TSYV V+ GEQLFMGN+  SK        
Subjt:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTSK--------

Query:  -------ELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN
               ELT N VLHVPDIR NL+ GSLLS N F+LVFVSDKFVLSKN
Subjt:  -------ELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN

A0A5D3BT19 Pol polyprotein3.5e-5272.48Show/hide
Query:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------
        MGHQSKDCRKPKN +KKHAQAHITEVDEVS+GVADIDL  VIL+CN V N KEWWVDT ATRHICANK M  SYV +S GEQLFMG+++TS         
Subjt:  MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTS---------

Query:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN
              KELT N VLHVPDIR NL+SGSLLS N FKLVFVSDKFVLS+N
Subjt:  ------KELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSKN

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-0528.95Show/hide
Query:  GHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVIL----ECNKVDN-SKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTSK----
        GH  +DC  P+  K K   +     D  +  V + D  V+ +    EC  +     EW VDT A+ H    + +   YV    G  + MGN   SK    
Subjt:  GHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVIL----ECNKVDN-SKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTSK----

Query:  -----------ELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSK
                    L    V HVPD+R NL+SG  L  + ++  F + K+ L+K
Subjt:  -----------ELTFNKVLHVPDIRNNLLSGSLLSNNDFKLVFVSDKFVLSK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.3e-0631.87Show/hide
Query:  LECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYT--------------SKELTFNKVLHVPDIRNNLLSGSLLSNND
        L  N   N+  W +D+GAT HI ++   L+ + P +GG+ + + +  T              S+ L  NKVL+VP+I  NL+S   L N +
Subjt:  LECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYT--------------SKELTFNKVLHVPDIRNNLLSGSLLSNND

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACATCAATCAAAAGATTGTCGTAAGCCAAAGAACCTTAAGAAAAAACATGCTCAAGCTCATATCACAGAAGTTGATGAAGTATCAAATGGTGTTGCAGATATTGA
CCTTTGTGTAGTTATTTTAGAATGCAACAAAGTGGATAACTCCAAGGAATGGTGGGTAGACACTGGGGCTACTCGTCATATTTGTGCCAACAAGTATATGTTGACATCAT
ATGTGCCAGTCTCTGGTGGAGAACAACTATTTATGGGTAACGCATATACTTCAAAGGAACTCACTTTCAACAAGGTGCTTCATGTTCCTGACATTCGGAATAACTTACTT
TCTGGTTCATTGCTTAGTAATAATGATTTTAAGTTGGTGTTTGTATCTGATAAGTTTGTACTTTCCAAGAATGGGACGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGACATCAATCAAAAGATTGTCGTAAGCCAAAGAACCTTAAGAAAAAACATGCTCAAGCTCATATCACAGAAGTTGATGAAGTATCAAATGGTGTTGCAGATATTGA
CCTTTGTGTAGTTATTTTAGAATGCAACAAAGTGGATAACTCCAAGGAATGGTGGGTAGACACTGGGGCTACTCGTCATATTTGTGCCAACAAGTATATGTTGACATCAT
ATGTGCCAGTCTCTGGTGGAGAACAACTATTTATGGGTAACGCATATACTTCAAAGGAACTCACTTTCAACAAGGTGCTTCATGTTCCTGACATTCGGAATAACTTACTT
TCTGGTTCATTGCTTAGTAATAATGATTTTAAGTTGGTGTTTGTATCTGATAAGTTTGTACTTTCCAAGAATGGGACGTAG
Protein sequenceShow/hide protein sequence
MGHQSKDCRKPKNLKKKHAQAHITEVDEVSNGVADIDLCVVILECNKVDNSKEWWVDTGATRHICANKYMLTSYVPVSGGEQLFMGNAYTSKELTFNKVLHVPDIRNNLL
SGSLLSNNDFKLVFVSDKFVLSKNGT