; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G09970 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G09970
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationChr4:8002378..8002821
RNA-Seq ExpressionCSPI04G09970
SyntenyCSPI04G09970
Gene Ontology termsGO:0006633 - fatty acid biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0031408 - oxylipin biosynthetic process (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016702 - oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PSR86206.1 Endonuclease [Actinidia chinensis var. chinensis]6.4e-5470.95Show/hide
Query:  IANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISS
        +A TPID +LHL  N G+  +QLEYSRIIGSLMY+MSCTRPDIAY VS+LSRYTSNP  D WKAI+R+L YL++T+++GL+YTR+P VLEGY D NWIS 
Subjt:  IANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISS

Query:  TKDSKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
         KDSKSTSGY+F LGG AV WK  KQTCIARSTM+ EFIALDK  E A
Subjt:  TKDSKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

PSS35063.1 Endonuclease [Actinidia chinensis var. chinensis]6.4e-5470.95Show/hide
Query:  IANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISS
        +A TPID +LHL  N G+  +QLEYSRIIGSLMY+MSCTRPDIAY VS+LSRYTSNP  D WKAI+R+L YL++T+++GL+YTR+P VLEGY D NWIS 
Subjt:  IANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISS

Query:  TKDSKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
         KDSKSTSGY+F LGG AV WK  KQTCIARSTM+ EFIALDK  E A
Subjt:  TKDSKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

RVW57504.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.1e-5370.27Show/hide
Query:  IANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISS
        +A TP+D +LHL  N G+S++Q+EYSR+IGSLMY+MSCTRPDIAY VSKLSRYTSNP    W+ I+R+L YL+ T++YGL+YTRYPVVLEGYSD NWIS+
Subjt:  IANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISS

Query:  TKDSKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
         KDSKS SGY+FTLGG AV WK  KQT IARSTM+ EFIALDK  E A
Subjt:  TKDSKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

TYK23174.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]8.1e-5782.27Show/hide
Query:  ASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKDSKST
        A + LG NNGDSIAQLEY RIIGSLMYIMS TRPDIAY VSKLSRYTSNP  D WKAILR+LGYLKHTKNY L+YTRYPVVLEGYSD NW SSTK+SKST
Subjt:  ASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKDSKST

Query:  SGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
        SGYIFTLGGG V WK  KQTCIARSTM+ +FIALDKI E A
Subjt:  SGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

WP_140189331.1 DDE-type integrase/transposase/recombinase [Xylella fastidiosa]3.1e-5672.97Show/hide
Query:  IANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISS
        IA TP+D SLHL  N G+ ++QLEYSRIIGSLMY+MSCTRPDIAY VS+LSRYTSNP HD WKAI+R+L YL++T++ GL+Y RYP VLEGY D NWIS 
Subjt:  IANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISS

Query:  TKDSKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
         KDSKSTSGY+FTLGG AV WK  KQTCIARSTM+ EFIALDK  E A
Subjt:  TKDSKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

TrEMBL top hitse value%identityAlignment
A0A2N9F5X3 Integrase catalytic domain-containing protein5.7e-5675.17Show/hide
Query:  TPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKD
        TPID +LHL  N G+ I+QLEYS+IIGSLMYIM+CTRPDIAY VSKLSRYTSNP  D WKAI+R+L YLK+T NYG++YTRYP VLEGYSD NWIS T D
Subjt:  TPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKD

Query:  SKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
        +KSTSGY+FTLGG AV WK  KQTCIARSTM+ EFIALDK  E A
Subjt:  SKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

A0A2N9F8T0 Integrase catalytic domain-containing protein3.3e-5675.17Show/hide
Query:  TPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKD
        TPID +LHL  N G+ I+QLEYS+IIGSLMYIM+CTRPDIAY VSKLSRYTSNP  D WKAI+R+L YLK+T NYG++YTRYP VLEGYSD NWIS T D
Subjt:  TPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKD

Query:  SKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
        +KSTSGY+FTLGG AV WK  KQTCIARSTM+ EFIALDK  E A
Subjt:  SKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

A0A2N9GUH4 Integrase catalytic domain-containing protein7.4e-5674.48Show/hide
Query:  TPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKD
        TP+D +LHL  N G+ I+QLEYS+IIGSLMYIM+CTRPDIAY VSKLSRYTSNP  D WKAI+R+L YLK+T NYG++YTRYP VLEGYSD NWIS T D
Subjt:  TPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKD

Query:  SKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
        +KSTSGY+FTLGG AV WK  KQTCIARSTM+ EFIALDK  E A
Subjt:  SKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

A0A2N9GZ77 Integrase catalytic domain-containing protein2.5e-5675.86Show/hide
Query:  TPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKD
        TPID +LHL  N G+ I+QLEYS+IIGSLMYIM+CTRPDIAY VSKLSRYTSNP  D WKAI+R+L YLK+T NYG++YTRYPVVLEGYSD NWIS T D
Subjt:  TPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKD

Query:  SKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
        +KSTSGY+FTLGG AV WK  KQTCIARSTM+ EFIALDK  E A
Subjt:  SKSTSGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

A0A5D3DHS0 Retrotransposon protein, putative, Ty1-copia subclass3.9e-5782.27Show/hide
Query:  ASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKDSKST
        A + LG NNGDSIAQLEY RIIGSLMYIMS TRPDIAY VSKLSRYTSNP  D WKAILR+LGYLKHTKNY L+YTRYPVVLEGYSD NW SSTK+SKST
Subjt:  ASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKDSKST

Query:  SGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA
        SGYIFTLGGG V WK  KQTCIARSTM+ +FIALDKI E A
Subjt:  SGYIFTLGGGAVFWK--KQTCIARSTMKYEFIALDKIEEGA

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.5e-1734.97Show/hide
Query:  NTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTR---YPVVLEGYSDVNWIS
        +TP+ + ++    N D         +IG LMYIM CTRPD+   V+ LSRY+S  + + W+ + R+L YLK T +  L + +   +   + GY D +W  
Subjt:  NTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTR---YPVVLEGYSDVNWIS

Query:  STKDSKSTSGYIFTL-GGGAVFW--KKQTCIARSTMKYEFIAL
        S  D KST+GY+F +     + W  K+Q  +A S+ + E++AL
Subjt:  STKDSKSTSGYIFTL-GGGAVFW--KKQTCIARSTMKYEFIAL

P0CV72 Secreted RxLR effector protein 1612.3e-2242.4Show/hide
Query:  YSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVV-LEGYSDVNWISSTKDSKSTSGYIFTLGGGAVFW--
        Y   +G++MY+M  TRPD+A  V  LS++ S+P    W+A+ R+L YL+ T+ YGL +TR     L GYSD +W    +  +STSGY+F L GG V W  
Subjt:  YSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVV-LEGYSDVNWISSTKDSKSTSGYIFTLGGGAVFW--

Query:  KKQTCIARSTMKYEFIALDKIEEGA
        KKQ  +A S+ + E++AL +  + A
Subjt:  KKQTCIARSTMKYEFIALDKIEEGA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-2242.62Show/hide
Query:  SIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKDSKSTSGYIFTLGGGA
        ++A++ YS  +GSLMY M CTRPDIA+ V  +SR+  NP  + W+A+  IL YL+ T    L +     +L+GY+D +      + KS++GY+FT  GGA
Subjt:  SIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKDSKSTSGYIFTLGGGA

Query:  VFW--KKQTCIARSTMKYEFIA
        + W  K Q C+A ST + E+IA
Subjt:  VFW--KKQTCIARSTMKYEFIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-1637.23Show/hide
Query:  TPIDASLHLGTNNGDSIAQ-LEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTR-YPVVLEGYSDVNWISST
        TP+  S  L   +G  +    EY  I+GSL Y ++ TRPDI+Y V++LS++   P+ +  +A+ RIL YL  T N+G++  +   + L  YSD +W    
Subjt:  TPIDASLHLGTNNGDSIAQ-LEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTR-YPVVLEGYSDVNWISST

Query:  KDSKSTSGYIFTLGGGAVFW--KKQTCIARSTMKYEF
         D  ST+GYI  LG   + W  KKQ  + RS+ + E+
Subjt:  KDSKSTSGYIFTLGGGAVFW--KKQTCIARSTMKYEF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-1839.69Show/hide
Query:  SLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTR-YPVVLEGYSDVNWISSTKDSKST
        +LH GT   D     EY  I+GSL Y ++ TRPD++Y V++LS+Y   P+ D W A+ R+L YL  T ++G++  +   + L  YSD +W   T D  ST
Subjt:  SLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTR-YPVVLEGYSDVNWISSTKDSKST

Query:  SGYIFTLGGGAVFW--KKQTCIARSTMKYEF
        +GYI  LG   + W  KKQ  + RS+ + E+
Subjt:  SGYIFTLGGGAVFW--KKQTCIARSTMKYEF

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.3e-1633.8Show/hide
Query:  ANTPIDASLHLGTNN-GDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYY-TRYPVVLEGYSDVNWIS
        ++ P+D S+    ++ GD +    Y R+IG LMY +  TR DI++ V+KLS+++  P     +A+++IL Y+K T   GL+Y ++  + L+ +SD ++ S
Subjt:  ANTPIDASLHLGTNN-GDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYY-TRYPVVLEGYSDVNWIS

Query:  STKDSKSTSGYIFTLGGGAVFW--KKQTCIARSTMKYEFIAL
             +ST+GY   LG   + W  KKQ  +++S+ + E+ AL
Subjt:  STKDSKSTSGYIFTLGGGAVFW--KKQTCIARSTMKYEFIAL

ATMG00240.1 Gag-Pol-related retrotransposon family protein4.1e-0628.95Show/hide
Query:  MSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYT-RYPVVLEGYSDVNWISSTKDSKSTSGY
        ++ TRPD+ + V++LS+++S     + +A+ ++L Y+K T   GL+Y+    + L+ ++D +W S     +S +G+
Subjt:  MSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYT-RYPVVLEGYSDVNWISSTKDSKSTSGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGCAAACACCCCAATTGATGCTAGTCTCCATTTAGGTACAAATAATGGAGATAGTATAGCACAATTAGAATATTCTCGCATCATTGGTAGTTTGATGTACATCAT
GAGTTGTACACGTCCTGATATAGCGTATTTTGTAAGCAAGTTAAGTCGCTACACAAGTAATCCAAGTCATGATCGTTGGAAAGCTATATTGAGAATTTTGGGATACTTAA
AGCATACTAAAAATTATGGATTATACTATACTCGATATCCTGTTGTACTTGAAGGTTATAGTGATGTCAATTGGATATCAAGCACTAAAGACTCCAAATCTACAAGTGGT
TACATTTTTACCCTTGGAGGCGGTGCTGTTTTTTGGAAGAAACAAACATGTATAGCACGATCCACAATGAAATATGAATTTATAGCTTTAGATAAGATTGAAGAAGGAGC
ATAA
mRNA sequenceShow/hide mRNA sequence
ATGATTGCAAACACCCCAATTGATGCTAGTCTCCATTTAGGTACAAATAATGGAGATAGTATAGCACAATTAGAATATTCTCGCATCATTGGTAGTTTGATGTACATCAT
GAGTTGTACACGTCCTGATATAGCGTATTTTGTAAGCAAGTTAAGTCGCTACACAAGTAATCCAAGTCATGATCGTTGGAAAGCTATATTGAGAATTTTGGGATACTTAA
AGCATACTAAAAATTATGGATTATACTATACTCGATATCCTGTTGTACTTGAAGGTTATAGTGATGTCAATTGGATATCAAGCACTAAAGACTCCAAATCTACAAGTGGT
TACATTTTTACCCTTGGAGGCGGTGCTGTTTTTTGGAAGAAACAAACATGTATAGCACGATCCACAATGAAATATGAATTTATAGCTTTAGATAAGATTGAAGAAGGAGC
ATAA
Protein sequenceShow/hide protein sequence
MIANTPIDASLHLGTNNGDSIAQLEYSRIIGSLMYIMSCTRPDIAYFVSKLSRYTSNPSHDRWKAILRILGYLKHTKNYGLYYTRYPVVLEGYSDVNWISSTKDSKSTSG
YIFTLGGGAVFWKKQTCIARSTMKYEFIALDKIEEGA