CuGenDBv2

CSPI01G18640 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G18640
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr1:14038514..14039200
RNA-Seq Expression: CSPI01G18640
Synteny: CSPI01G18640
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004812 - aminoacyl-tRNA ligase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
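
The genome location above (Chr1:14038514..14039200) can be split into a chromosome name and two coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re

def parse_location(loc):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("Chr1:14038514..14039200")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the span is 687 bp, which appears to equal the length of the CDS sequence listed on this page, stop codon included.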


Homology
GenBank top hits (e value, % identity, alignment)
KAA0052097.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 1.1e-63; % identity: 71.79)
Query:  GPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPHTSQQNGRTERKH
        GPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELSRTYIEFANMIRTQFSCP+KT RTDNALEYKDS LLSFLSQQGTLVQRSCPHTSQQNGR ERKH
Subjt:  GPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPHTSQQNGRTERKH

Query:  RHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVFLAMTQNIK
        RHIL               F  +  L+  I  I+F    F+TSL  K+YMVLL TI TLK  VV  LF CI M+TLNLN VPAS VFLAM QNIK
Subjt:  RHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVFLAMTQNIK

KAA0060225.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 1.3e-56; % identity: 58.67)
Query:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH
        +KPFDL+H DIWGPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELS TYIEFANMIRTQF CP+KTLRTDNALEYKDS LLSFLSQQGTLVQRSCP+
Subjt:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH

Query:  TSQQNGRTERKHRHILTQ---SVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASV
        TSQQNGR ERKHRHIL      + S S P   + +G+  L  ++  + +   +    S   K Y    P    LK F         P     L       
Subjt:  TSQQNGRTERKHRHILTQ---SVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASV

Query:  VFLAMTQNIKAFVVGIPFPKDFVYL
         FL      K FVVGIPFP DFVYL
Subjt:  VFLAMTQNIKAFVVGIPFPKDFVYL

KAA0062044.1 seryl-tRNA synthetase isoform X1 [Cucumis melo var. makuwa] (e value: 9.5e-81; % identity: 79.33)
Query:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH
        +KPFDL+HSDIWGPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELSRTYIEFANMI TQFSCP KTLRTDNALEYKD  LLSFLSQQGTLVQRS PH
Subjt:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH

Query:  TSQQNGRTERKHRHI-LTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVF
        TSQQNGR E KH HI LTQ VPSF LPH LRNFG K LLH  IPS +F LLFFRTSL  K+YMVLLPT  TLK  VV ALF  I M+TLN NHVP SVV 
Subjt:  TSQQNGRTERKHRHI-LTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVF

Query:  LAMTQNIK
        LAM  NIK
Subjt:  LAMTQNIK

KAA0063009.1 Integrase, catalytic core [Cucumis melo var. makuwa] (e value: 8.3e-77; % identity: 70.93)
Query:  MSEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSC
        +S+KPFDL+HSDIWGPAPT+ VH Y YY+LFIDD+SRFTWIYFLK+RS LSRTYIEFANMI TQFSCP+KTLRTDNALEYKDS LLSFLS +  L     
Subjt:  MSEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSC

Query:  PHTSQQNGRTERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVV
        P    +           LTQ  P F LPH LRNF  K LLH  I SI F   FFRTSL LKNYMVLLPTI  LK  VV ALFFCI M+TLNLNHVPASVV
Subjt:  PHTSQQNGRTERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVV

Query:  FLAMTQNIKAFVVGIPFPKDFVYLDMS
        FLAM QNIK FVVGIPFP DFVYL MS
Subjt:  FLAMTQNIKAFVVGIPFPKDFVYLDMS

TYK15021.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 6.2e-56; % identity: 91.38)
Query:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH
        +KPFDL+HSDIWGPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELSRTYIEFANMIRTQFSCP+KTLRTDNALEYKDS LLSFLSQQGTLVQRSCPH
Subjt:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH

Query:  TSQQNGRTERKHRHIL
        TSQQNGR ERKHRHIL
Subjt:  TSQQNGRTERKHRHIL
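
The % identity reported for each hit can be cross-checked against the alignment's middle line, where a letter marks an identical residue and `+` or a space marks a non-identical position. The sketch below does this for the TYK15021.1 alignment above. It assumes the usual BLAST midline convention applies to this page, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midlines, query_lines):
    """Percent identity = identical positions / aligned positions * 100.

    midlines    : middle lines of the alignment (letters mark identities)
    query_lines : matching Query lines; their lengths give the aligned length
    """
    identical = sum(ch.isalpha() for line in midlines for ch in line)
    aligned = sum(len(q) for q in query_lines)
    return 100.0 * identical / aligned

# Query and middle lines copied from the TYK15021.1 alignment on this page.
QUERY = [
    "EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH",
    "TSQQNGRTERKHRHIL",
]
MIDLINES = [
    "+KPFDL+HSDIWGPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELSRTYIEFANMIRTQFSCP+KTLRTDNALEYKDS LLSFLSQQGTLVQRSCPH",
    "TSQQNGR ERKHRHIL",
]

print(round(percent_identity(MIDLINES, QUERY), 2))
```

Counting from the middle line rather than from the Subjt lines is deliberate: on this page the Subjt lines repeat the Query text, so they cannot be used to recover mismatches.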

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UCC7 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.1e-64; % identity: 71.79)
Query:  GPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPHTSQQNGRTERKH
        GPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELSRTYIEFANMIRTQFSCP+KT RTDNALEYKDS LLSFLSQQGTLVQRSCPHTSQQNGR ERKH
Subjt:  GPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPHTSQQNGRTERKH

Query:  RHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVFLAMTQNIK
        RHIL               F  +  L+  I  I+F    F+TSL  K+YMVLL TI TLK  VV  LF CI M+TLNLN VPAS VFLAM QNIK
Subjt:  RHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVFLAMTQNIK

A0A5A7V2X4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 6.1e-57; % identity: 58.67)
Query:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH
        +KPFDL+H DIWGPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELS TYIEFANMIRTQF CP+KTLRTDNALEYKDS LLSFLSQQGTLVQRSCP+
Subjt:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH

Query:  TSQQNGRTERKHRHILTQ---SVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASV
        TSQQNGR ERKHRHIL      + S S P   + +G+  L  ++  + +   +    S   K Y    P    LK F         P     L       
Subjt:  TSQQNGRTERKHRHILTQ---SVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASV

Query:  VFLAMTQNIKAFVVGIPFPKDFVYL
         FL      K FVVGIPFP DFVYL
Subjt:  VFLAMTQNIKAFVVGIPFPKDFVYL

A0A5A7V413 Seryl-tRNA synthetase isoform X1 (e value: 4.6e-81; % identity: 79.33)
Query:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH
        +KPFDL+HSDIWGPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELSRTYIEFANMI TQFSCP KTLRTDNALEYKD  LLSFLSQQGTLVQRS PH
Subjt:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH

Query:  TSQQNGRTERKHRHI-LTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVF
        TSQQNGR E KH HI LTQ VPSF LPH LRNFG K LLH  IPS +F LLFFRTSL  K+YMVLLPT  TLK  VV ALF  I M+TLN NHVP SVV 
Subjt:  TSQQNGRTERKHRHI-LTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVF

Query:  LAMTQNIK
        LAM  NIK
Subjt:  LAMTQNIK

A0A5A7VBM6 Integrase, catalytic core (e value: 4.0e-77; % identity: 70.93)
Query:  MSEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSC
        +S+KPFDL+HSDIWGPAPT+ VH Y YY+LFIDD+SRFTWIYFLK+RS LSRTYIEFANMI TQFSCP+KTLRTDNALEYKDS LLSFLS +  L     
Subjt:  MSEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSC

Query:  PHTSQQNGRTERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVV
        P    +           LTQ  P F LPH LRNF  K LLH  I SI F   FFRTSL LKNYMVLLPTI  LK  VV ALFFCI M+TLNLNHVPASVV
Subjt:  PHTSQQNGRTERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVV

Query:  FLAMTQNIKAFVVGIPFPKDFVYLDMS
        FLAM QNIK FVVGIPFP DFVYL MS
Subjt:  FLAMTQNIKAFVVGIPFPKDFVYLDMS

A0A5D3CT53 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.0e-56; % identity: 91.38)
Query:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH
        +KPFDL+HSDIWGPAPTTTVH Y YY+LFIDD+SRFTWIYFLK+RSELSRTYIEFANMIRTQFSCP+KTLRTDNALEYKDS LLSFLSQQGTLVQRSCPH
Subjt:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH

Query:  TSQQNGRTERKHRHIL
        TSQQNGR ERKHRHIL
Subjt:  TSQQNGRTERKHRHIL

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 3.7e-11; % identity: 30.43)
Query:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH
        ++P  +VHSD+ GP    T+ D  Y+++F+D ++ +   Y +K +S++   + +F       F+  V  L  DN  EY  + +  F  ++G     + PH
Subjt:  EKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPH

Query:  TSQQNGRTERKHRHI
        T Q NG +ER  R I
Subjt:  TSQQNGRTERKHRHI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.4e-12; % identity: 33.04)
Query:  DLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPHTSQQ
        DLV+SD+ GP    ++    Y++ FIDD SR  W+Y LK + ++ + + +F  ++  +    +K LR+DN  EY       + S  G   +++ P T Q 
Subjt:  DLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPHTSQQ

Query:  NGRTERKHRHIL
        NG  ER +R I+
Subjt:  NGRTERKHRHIL

Q12491 Transposon Ty2-B Gag-Pol polyprotein (e value: 1.7e-08; % identity: 30.25)
Query:  SEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELS--RTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRS
        S +PF  +H+DI+GP          Y+I F D+ +RF W+Y L +R E S    +      I+ QF+  V  ++ D   EY +  L  F + +G     +
Subjt:  SEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELS--RTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRS

Query:  CPHTSQQNGRTERKHRHIL
            S+ +G  ER +R +L
Subjt:  CPHTSQQNGRTERKHRHIL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.2e-19; % identity: 34.87)
Query:  SEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCP
        S +P + ++SD+W  +P  +  +Y YY++F+D ++R+TW+Y LK +S++  T+I F N++  +F   + T  +DN  E+   AL  + SQ G     S P
Subjt:  SEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCP

Query:  HTSQQNGRTERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLF
        HT + NG +ERKHRHI+                G  LL H  IP   +P  F
Subjt:  HTSQQNGRTERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 4.4e-20; % identity: 34.87)
Query:  SEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCP
        S KP + ++SD+W  +P  ++ +Y YY++F+D ++R+TW+Y LK +S++  T+I F +++  +F   + TL +DN  E+    L  +LSQ G     S P
Subjt:  SEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCP

Query:  HTSQQNGRTERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLF
        HT + NG +ERKHRHI+                G  LL H  +P   +P  F
Subjt:  HTSQQNGRTERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLF

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCTGAGAAACCTTTTGATTTAGTACATTCTGATATTTGGGGTCCTGCACCTACTACTACTGTTCATGACTATTGCTATTATATTCTGTTCATTGATGATTACTCCCG
ATTTACATGGATTTATTTTCTTAAAAATCGCTCTGAATTATCTCGCACATATATTGAATTTGCTAATATGATTCGCACTCAATTCTCTTGTCCTGTCAAAACCCTTCGCA
CTGATAATGCCTTGGAGTACAAAGACTCCGCTCTCCTTTCTTTTCTCTCCCAGCAGGGCACTCTTGTTCAGCGCTCCTGCCCCCATACCTCTCAGCAAAATGGGCGTACT
GAGCGCAAACATCGCCACATTTTGACTCAGTCCGTGCCCTCCTTCTCTCTACCTCATGTCCTGAGAAATTTTGGGGATAAGCTGCTCTTACATCTGTTTATACCATCAAT
CATCTTCCCTCTTTTGTTCTTCAGAACATCTCTCCTTTTGAAAAATTATATGGTACTCCTCCCAACTATTCAAACCTTAAAGTTTTTTGTTGTGCCTGCTTTGTTCTTCT
GCATCCCCATGAGCACACTAAACTTGAACCACGTGCCCGCCTCTGTTGTTTTCTTGGCTATGACACAAAACATAAAGGCTTTCGTTGTTGGGATCCCCTTTCCAAAAGAC
TTCGTATATCTCGACATGTCAACTTGA
mRNA sequence
ATGTCTGAGAAACCTTTTGATTTAGTACATTCTGATATTTGGGGTCCTGCACCTACTACTACTGTTCATGACTATTGCTATTATATTCTGTTCATTGATGATTACTCCCG
ATTTACATGGATTTATTTTCTTAAAAATCGCTCTGAATTATCTCGCACATATATTGAATTTGCTAATATGATTCGCACTCAATTCTCTTGTCCTGTCAAAACCCTTCGCA
CTGATAATGCCTTGGAGTACAAAGACTCCGCTCTCCTTTCTTTTCTCTCCCAGCAGGGCACTCTTGTTCAGCGCTCCTGCCCCCATACCTCTCAGCAAAATGGGCGTACT
GAGCGCAAACATCGCCACATTTTGACTCAGTCCGTGCCCTCCTTCTCTCTACCTCATGTCCTGAGAAATTTTGGGGATAAGCTGCTCTTACATCTGTTTATACCATCAAT
CATCTTCCCTCTTTTGTTCTTCAGAACATCTCTCCTTTTGAAAAATTATATGGTACTCCTCCCAACTATTCAAACCTTAAAGTTTTTTGTTGTGCCTGCTTTGTTCTTCT
GCATCCCCATGAGCACACTAAACTTGAACCACGTGCCCGCCTCTGTTGTTTTCTTGGCTATGACACAAAACATAAAGGCTTTCGTTGTTGGGATCCCCTTTCCAAAAGAC
TTCGTATATCTCGACATGTCAACTTGA
Protein sequence
MSEKPFDLVHSDIWGPAPTTTVHDYCYYILFIDDYSRFTWIYFLKNRSELSRTYIEFANMIRTQFSCPVKTLRTDNALEYKDSALLSFLSQQGTLVQRSCPHTSQQNGRT
ERKHRHILTQSVPSFSLPHVLRNFGDKLLLHLFIPSIIFPLLFFRTSLLLKNYMVLLPTIQTLKFFVVPALFFCIPMSTLNLNHVPASVVFLAMTQNIKAFVVGIPFPKD
FVYLDMST
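
The protein sequence above should be the translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard nuclear genetic code. `translate` is a hypothetical helper written for illustration, and only the first and last stretches of the CDS are shown, not the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 21 nt and last 27 nt of the CDS listed on this page.
cds_start = "ATGTCTGAGAAACCTTTTGAT"
cds_end = "TTCGTATATCTCGACATGTCAACTTGA"

print(translate(cds_start))  # start of the protein
print(translate(cds_end))    # end of the protein, stop codon dropped
```

The two fragments are in frame because the six full CDS lines above the final one hold a multiple of three nucleotides. To check the whole record, pass the full CDS with its line breaks removed.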