; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G19830 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G19830
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr3:15506122..15506751
RNA-Seq ExpressionCSPI03G19830
SyntenyCSPI03G19830
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034938.1 putative Polyprotein [Cucumis melo var. makuwa]1.4e-7570.32Show/hide
Query:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK
        MTDD SVEAQSHEIQKI HEII+EG+ L DQFQ AVIIDKLP LWKDFKNTL          S+ITRL+I+EE R+ D+K+++NAI RKK   +LK N+K
Subjt:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK

Query:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIND
         K NKM   SN QNN    QSRSTVQI CYNCNKPGHLA+NCRN++ PA QANLIEDE VAMIS+VNV G SEGWWLDTGAS HVCH+L LFRKYNE+ D
Subjt:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIND

Query:  KKILLGDHHTTKVACVREV
        K ILLGDHHTTKV  + EV
Subjt:  KKILLGDHHTTKVACVREV

KAA0055815.1 putative Polyprotein [Cucumis melo var. makuwa]1.6e-6671.35Show/hide
Query:  LDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQSRSTVQI
        LDDQFQ AVIIDKLPPLWKDFKNTL          SLITRLRI+EE R+ D+K++ N I RKK   +LKL++K K NKM    N QNN   PQSRSTVQI
Subjt:  LDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQSRSTVQI

Query:  VCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV
        VCYNCNKPGHLARNCRN++ PA QANLIEDE VAMI EVNV G SEGWWLDTGASRHV HDL LFRKYNE+ DK ILLGDHH TKV  + EV
Subjt:  VCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV

KAA0058199.1 putative Polyprotein [Cucumis melo var. makuwa]2.5e-6770.83Show/hide
Query:  LDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQSRSTVQI
        LDDQFQ AVIIDKLPPLWKDFKNTL          SLITRLRI+EE R+ D+K+++NAI RKK   +LK ++KPK N+M  ESN QNN   PQS+S VQI
Subjt:  LDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQSRSTVQI

Query:  VCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV
        VCYNCNKPGHLARNCRN++ P  QANLIEDE VAMISEVNV G  +GWWLDTGASRHVCHDL LFRKYNE  D  ILLGDHHTT VA + EV
Subjt:  VCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV

KAA0067915.1 putative Polyprotein [Cucumis melo var. makuwa]2.4e-7871.23Show/hide
Query:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK
        MTD+  VEAQSHEIQKI HEIISEG+ LDDQFQ AVIIDKLPPLWKDFKNTL          SLITRL I+EEVR+ D+K+++NAI +KK   +LK ++K
Subjt:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK

Query:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIND
        PK NKM   SN QNN   PQS+STVQIVCYNCNK GHLARNCRN++HP  QANLIE+E VAMI EVNV G SEGWWLDTGA  HVCHDL LFRKYNE+ D
Subjt:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIND

Query:  KKILLGDHHTTKVACVREV
        K ILLGDHHTTKV  + EV
Subjt:  KKILLGDHHTTKVACVREV

TYK21961.1 putative Polyprotein [Cucumis melo var. makuwa]2.5e-7266.21Show/hide
Query:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK
        M DD+ +EAQSHEIQKI H+II+EG+ L+DQ Q AVIIDKLPPLWKDFKNTL          SLI RLRI++E R+ DQK+++N I RKKS  +LKL++K
Subjt:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK

Query:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYN-CNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIN
         KENKM H SN +NN QKPQS+ TVQ+VCYN CNK  H+AR  RN+N P   ANL+E++ VAMI+EVNV G S GWWLDTG SRH+CHDLKLFRKYNE  
Subjt:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYN-CNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIN

Query:  DKKILLGDHHTTKVACVRE
        DKKILLG+HHTT VA V E
Subjt:  DKKILLGDHHTTKVACVRE

TrEMBL top hitse value%identityAlignment
A0A5A7UQC7 Putative Polyprotein7.7e-6771.35Show/hide
Query:  LDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQSRSTVQI
        LDDQFQ AVIIDKLPPLWKDFKNTL          SLITRLRI+EE R+ D+K++ N I RKK   +LKL++K K NKM    N QNN   PQSRSTVQI
Subjt:  LDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQSRSTVQI

Query:  VCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV
        VCYNCNKPGHLARNCRN++ PA QANLIEDE VAMI EVNV G SEGWWLDTGASRHV HDL LFRKYNE+ DK ILLGDHH TKV  + EV
Subjt:  VCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV

A0A5A7UXE9 Putative Polyprotein1.2e-6770.83Show/hide
Query:  LDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQSRSTVQI
        LDDQFQ AVIIDKLPPLWKDFKNTL          SLITRLRI+EE R+ D+K+++NAI RKK   +LK ++KPK N+M  ESN QNN   PQS+S VQI
Subjt:  LDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQSRSTVQI

Query:  VCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV
        VCYNCNKPGHLARNCRN++ P  QANLIEDE VAMISEVNV G  +GWWLDTGASRHVCHDL LFRKYNE  D  ILLGDHHTT VA + EV
Subjt:  VCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV

A0A5A7VQD4 Putative Polyprotein1.2e-7871.23Show/hide
Query:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK
        MTD+  VEAQSHEIQKI HEIISEG+ LDDQFQ AVIIDKLPPLWKDFKNTL          SLITRL I+EEVR+ D+K+++NAI +KK   +LK ++K
Subjt:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK

Query:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIND
        PK NKM   SN QNN   PQS+STVQIVCYNCNK GHLARNCRN++HP  QANLIE+E VAMI EVNV G SEGWWLDTGA  HVCHDL LFRKYNE+ D
Subjt:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIND

Query:  KKILLGDHHTTKVACVREV
        K ILLGDHHTTKV  + EV
Subjt:  KKILLGDHHTTKVACVREV

A0A5D3DCJ1 Putative Polyprotein7.0e-7670.32Show/hide
Query:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK
        MTDD SVEAQSHEIQKI HEII+EG+ L DQFQ AVIIDKLP LWKDFKNTL          S+ITRL+I+EE R+ D+K+++NAI RKK   +LK N+K
Subjt:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK

Query:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIND
         K NKM   SN QNN    QSRSTVQI CYNCNKPGHLA+NCRN++ PA QANLIEDE VAMIS+VNV G SEGWWLDTGAS HVCH+L LFRKYNE+ D
Subjt:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIND

Query:  KKILLGDHHTTKVACVREV
        K ILLGDHHTTKV  + EV
Subjt:  KKILLGDHHTTKVACVREV

A0A5D3DE53 Putative Polyprotein1.2e-7266.21Show/hide
Query:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK
        M DD+ +EAQSHEIQKI H+II+EG+ L+DQ Q AVIIDKLPPLWKDFKNTL          SLI RLRI++E R+ DQK+++N I RKKS  +LKL++K
Subjt:  MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTL----------SLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMK

Query:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYN-CNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIN
         KENKM H SN +NN QKPQS+ TVQ+VCYN CNK  H+AR  RN+N P   ANL+E++ VAMI+EVNV G S GWWLDTG SRH+CHDLKLFRKYNE  
Subjt:  PKENKMVHESNNQNNSQKPQSRSTVQIVCYN-CNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEIN

Query:  DKKILLGDHHTTKVACVRE
        DKKILLG+HHTT VA V E
Subjt:  DKKILLGDHHTTKVACVRE

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-0624.19Show/hide
Query:  IVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTLSLITRLRIDEEVRRRDQKKDLNAISRKK-SNTLLKLNMKPKENKMVHESNNQNNS-----QKPQ
        ++ ++ + GV ++++ +A ++++ LP     + N  + I   +   E++       LN   RKK  N    L  + +       SNN   S      K +
Subjt:  IVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTLSLITRLRIDEEVRRRDQKKDLNAISRKK-SNTLLKLNMKPKENKMVHESNNQNNS-----QKPQ

Query:  SRSTVQIVCYNCNKPGHLARNCRN--KNHPATQANLIEDEYVAMISE--------------VNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKIL
        S+S V+  CYNCN+PGH  R+C N  K    T     +D   AM+                ++++G    W +DT AS H      LF +Y   +   + 
Subjt:  SRSTVQIVCYNCNKPGHLARNCRN--KNHPATQANLIEDEYVAMISE--------------VNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKIL

Query:  LGDHHTTKVACVREV
        +G+   +K+A + ++
Subjt:  LGDHHTTKVACVREV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGATGACTTATCAGTGGAGGCTCAATCACATGAGATTCAGAAGATAGTACATGAGATCATAAGTGAAGGTGTGTCACTTGATGATCAATTTCAAGCTGCGGTTAT
TATTGATAAATTACCTCCTCTGTGGAAGGATTTCAAAAATACTCTAAGTCTCATCACACGATTAAGGATTGATGAAGAGGTGAGAAGACGTGACCAAAAGAAAGATTTGA
ACGCAATTTCCAGAAAGAAGTCAAACACATTGCTGAAATTGAATATGAAGCCTAAAGAAAATAAAATGGTACACGAATCCAACAACCAAAACAACTCGCAAAAACCTCAG
TCCAGAAGTACGGTACAAATTGTTTGCTACAATTGTAATAAGCCTGGACACTTAGCTAGGAATTGTAGAAACAAGAATCATCCTGCTACGCAGGCAAACCTGATAGAAGA
TGAATATGTAGCTATGATTTCTGAAGTTAATGTCAATGGAGAGTCTGAAGGTTGGTGGCTAGACACAGGTGCATCTCGCCATGTTTGTCATGACCTTAAGTTATTTAGAA
AGTATAATGAGATAAACGATAAGAAAATCCTTCTAGGAGATCATCACACGACTAAGGTGGCATGCGTTAGAGAAGTATAA
mRNA sequenceShow/hide mRNA sequence
ATGACTGATGACTTATCAGTGGAGGCTCAATCACATGAGATTCAGAAGATAGTACATGAGATCATAAGTGAAGGTGTGTCACTTGATGATCAATTTCAAGCTGCGGTTAT
TATTGATAAATTACCTCCTCTGTGGAAGGATTTCAAAAATACTCTAAGTCTCATCACACGATTAAGGATTGATGAAGAGGTGAGAAGACGTGACCAAAAGAAAGATTTGA
ACGCAATTTCCAGAAAGAAGTCAAACACATTGCTGAAATTGAATATGAAGCCTAAAGAAAATAAAATGGTACACGAATCCAACAACCAAAACAACTCGCAAAAACCTCAG
TCCAGAAGTACGGTACAAATTGTTTGCTACAATTGTAATAAGCCTGGACACTTAGCTAGGAATTGTAGAAACAAGAATCATCCTGCTACGCAGGCAAACCTGATAGAAGA
TGAATATGTAGCTATGATTTCTGAAGTTAATGTCAATGGAGAGTCTGAAGGTTGGTGGCTAGACACAGGTGCATCTCGCCATGTTTGTCATGACCTTAAGTTATTTAGAA
AGTATAATGAGATAAACGATAAGAAAATCCTTCTAGGAGATCATCACACGACTAAGGTGGCATGCGTTAGAGAAGTATAA
Protein sequenceShow/hide protein sequence
MTDDLSVEAQSHEIQKIVHEIISEGVSLDDQFQAAVIIDKLPPLWKDFKNTLSLITRLRIDEEVRRRDQKKDLNAISRKKSNTLLKLNMKPKENKMVHESNNQNNSQKPQ
SRSTVQIVCYNCNKPGHLARNCRNKNHPATQANLIEDEYVAMISEVNVNGESEGWWLDTGASRHVCHDLKLFRKYNEINDKKILLGDHHTTKVACVREV