; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G20700 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G20700
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationChr6:18783888..18785267
RNA-Seq ExpressionCSPI06G20700
SyntenyCSPI06G20700
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0016020 - membrane (cellular component)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647279.1 hypothetical protein Csa_002929 [Cucumis sativus]2.5e-10798.91Show/hide
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGG
        MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGG
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGG

Query:  DLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNRL
        DLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVT DHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNR+
Subjt:  DLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNRL

KAE8648339.1 hypothetical protein Csa_023126 [Cucumis sativus]2.3e-4454.27Show/hide
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGG
        MKIIS N HGLNS KKRALVK  LQQQNPSIVLL+ETKLDDTDS+IIK IWS P I WTTLD+IDTLG                     G G +  IN  
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGG

Query:  DLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVT----------------FDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNRL
                        + NNM LFNQ IANYHLRD+                   DHFP AM   DIDWGPCPF+ E SWLSTP F P VETWWTNNR+
Subjt:  DLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVT----------------FDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNRL

RVW83303.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]9.7e-2729.17Show/hide
Query:  KIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLE------------------
        KI+SWN  GL S KKR  V+  L  QNP +V+L+ETK +  D  ++  IW    +DW  L      GG++I+W S  F   E                  
Subjt:  KIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLE------------------

Query:  ---------------------ELDDLVGLGGDSWINGGDLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEK
                             EL DL GL    W  GGD N+ R   EK  D  +  NM+ F+++I         T DH P  +      WGP PFR E 
Subjt:  ---------------------ELDDLVGLGGDSWINGGDLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEK

Query:  SWLSTPLFLPLVETWWT--------------------------NNR----LRE-----------LDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQW
         WL  P F      WW                           N R    LRE           +D I  +  L +E +S     R+++E L  +E +QW
Subjt:  SWLSTPLFLPLVETWWT--------------------------NNR----LRE-----------LDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQW

Query:  QQRCKLKWFTEG
        +Q+ ++KW  EG
Subjt:  QQRCKLKWFTEG

TYK17414.1 hypothetical protein E5676_scaffold434G002760 [Cucumis melo var. makuwa]5.0e-3159.35Show/hide
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLE----------ELDDLVG
        MKI SWN HGLN  KKR +VK L+QQ NPSIVLL+ETKL DTDS ++K I S+  I W+ +D IDT  G LI+  +PDFT+ +          +LDDL G
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLE----------ELDDLVG

Query:  LGGDSWINGGDLNITRWSWEKSH
        LGGDSWI GG+ N+TRWSWEKSH
Subjt:  LGGDSWINGGDLNITRWSWEKSH

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]8.5e-3127.12Show/hide
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPD----------------------
        MK ++WN  GL+S KK AL+K  + + NP++V+L+ETKL   D  I+K +WS   I+W+ LD      G+LI+W  PD                      
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPD----------------------

Query:  -----------------FTLLEELDDLVGLGGDSWINGGDLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDI-------------------------
                         +   +EL DL  L  + WI  GD N+TRWSWEKS+ + +  +M LFN +I +  L D+                         
Subjt:  -----------------FTLLEELDDLVGLGGDSWINGGDLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDI-------------------------

Query:  ----------------VTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTN----------------------------------------
                         T DHFP  +     +WG  PFR E  WLS   F P +ETWW N                                        
Subjt:  ----------------VTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTN----------------------------------------

Query:  -NRLRELDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQWQQRCKLKWFTEG
         N +  LD++     +  +Q      ++E +  +  +E   W+QRCK KW  EG
Subjt:  -NRLRELDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQWQQRCKLKWFTEG

TrEMBL top hitse value%identityAlignment
A0A0A0KDG4 Uncharacterized protein2.0e-16699.64Show/hide
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGG
        MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGG
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGG

Query:  DLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNRLRELDNIGNKTQLFVEQL
        DLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVT DHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNRLRELDNIGNKTQLFVEQL
Subjt:  DLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNRLRELDNIGNKTQLFVEQL

Query:  STSRSSREQIEQLTTQEHIQWQQRCKLKWFTEGRSILTMDDIESEFCDFYKNLFTKKTDEVFLAISSVGANKSPILDGFTA
        STSRSSREQIEQLTTQEHIQWQQRCKLKWFTEGRSILTMDDIESEFCDFYKNLFTKKTDEVFLAISSVGANKSPILDGFTA
Subjt:  STSRSSREQIEQLTTQEHIQWQQRCKLKWFTEGRSILTMDDIESEFCDFYKNLFTKKTDEVFLAISSVGANKSPILDGFTA

A0A438HFR2 Transposon TX1 uncharacterized 149 kDa protein4.7e-2729.17Show/hide
Query:  KIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLE------------------
        KI+SWN  GL S KKR  V+  L  QNP +V+L+ETK +  D  ++  IW    +DW  L      GG++I+W S  F   E                  
Subjt:  KIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLE------------------

Query:  ---------------------ELDDLVGLGGDSWINGGDLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEK
                             EL DL GL    W  GGD N+ R   EK  D  +  NM+ F+++I         T DH P  +      WGP PFR E 
Subjt:  ---------------------ELDDLVGLGGDSWINGGDLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEK

Query:  SWLSTPLFLPLVETWWT--------------------------NNR----LRE-----------LDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQW
         WL  P F      WW                           N R    LRE           +D I  +  L +E +S     R+++E L  +E +QW
Subjt:  SWLSTPLFLPLVETWWT--------------------------NNR----LRE-----------LDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQW

Query:  QQRCKLKWFTEG
        +Q+ ++KW  EG
Subjt:  QQRCKLKWFTEG

A0A5D3D2A8 Uncharacterized protein2.4e-3159.35Show/hide
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLE----------ELDDLVG
        MKI SWN HGLN  KKR +VK L+QQ NPSIVLL+ETKL DTDS ++K I S+  I W+ +D IDT  G LI+  +PDFT+ +          +LDDL G
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLE----------ELDDLVG

Query:  LGGDSWINGGDLNITRWSWEKSH
        LGGDSWI GG+ N+TRWSWEKSH
Subjt:  LGGDSWINGGDLNITRWSWEKSH

A0A5D3D3Q8 Reverse transcriptase2.2e-2454.33Show/hide
Query:  MIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNR---------------------------------------LRELDNIGNKTQLFVEQLSTSRSS
        M A DID GPCPFR E SWLSTP F PLVETWWTNNR                                       LRELDNIGN+ QL VEQLSTS   
Subjt:  MIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNR---------------------------------------LRELDNIGNKTQLFVEQLSTSRSS

Query:  REQIEQLTTQEHIQWQQRCKLKWFTEG
        REQIEQLT QEHIQW+ R K KWF EG
Subjt:  REQIEQLTTQEHIQWQQRCKLKWFTEG

A0A6J1E2G6 uncharacterized protein LOC1110254054.1e-3127.12Show/hide
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPD----------------------
        MK ++WN  GL+S KK AL+K  + + NP++V+L+ETKL   D  I+K +WS   I+W+ LD      G+LI+W  PD                      
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPD----------------------

Query:  -----------------FTLLEELDDLVGLGGDSWINGGDLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDI-------------------------
                         +   +EL DL  L  + WI  GD N+TRWSWEKS+ + +  +M LFN +I +  L D+                         
Subjt:  -----------------FTLLEELDDLVGLGGDSWINGGDLNITRWSWEKSHDQFIPNNMQLFNQWIANYHLRDI-------------------------

Query:  ----------------VTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTN----------------------------------------
                         T DHFP  +     +WG  PFR E  WLS   F P +ETWW N                                        
Subjt:  ----------------VTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTN----------------------------------------

Query:  -NRLRELDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQWQQRCKLKWFTEG
         N +  LD++     +  +Q      ++E +  +  +E   W+QRCK KW  EG
Subjt:  -NRLRELDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQWQQRCKLKWFTEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATCATCTCATGGAATGGTCATGGCTTGAATTCCTGTAAGAAACGTGCCCTGGTTAAGGGGTTGTTGCAGCAACAGAACCCAAGTATTGTTCTTCTCCGGGAAAC
CAAACTTGATGATACCGATTCCCATATCATTAAATACATTTGGAGCTACCCATTTATCGACTGGACGACGCTTGATGTGATTGATACTTTGGGTGGTCTCCTTATTATAT
GGCGTTCACCCGATTTCACTCTTCTTGAGGAACTTGATGATTTGGTGGGATTGGGAGGCGATTCTTGGATTAATGGTGGTGATCTTAACATAACCAGATGGTCCTGGGAG
AAATCACATGATCAGTTTATTCCAAATAACATGCAGCTTTTCAACCAATGGATTGCAAATTATCATTTGAGGGACATTGTCACTTTTGATCATTTTCCTCCTGCTATGAT
TGCATGTGATATTGATTGGGGTCCATGCCCTTTCAGAATTGAGAAGTCCTGGCTCTCTACTCCATTATTTTTGCCACTTGTGGAAACTTGGTGGACAAACAATAGGCTTC
GTGAGTTGGATAATATAGGCAACAAGACTCAACTCTTTGTAGAACAATTATCAACAAGTCGTTCATCGAGGGAACAAATTGAACAATTAACTACCCAGGAGCACATCCAA
TGGCAGCAACGTTGTAAACTTAAATGGTTCACGGAGGGTCGAAGCATATTGACTATGGACGATATTGAATCTGAATTTTGCGATTTTTACAAAAATCTCTTTACTAAGAA
GACTGATGAGGTTTTCTTGGCTATCTCTTCTGTTGGTGCTAATAAGTCTCCCATACTTGATGGATTTACTGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGATCATCTCATGGAATGGTCATGGCTTGAATTCCTGTAAGAAACGTGCCCTGGTTAAGGGGTTGTTGCAGCAACAGAACCCAAGTATTGTTCTTCTCCGGGAAAC
CAAACTTGATGATACCGATTCCCATATCATTAAATACATTTGGAGCTACCCATTTATCGACTGGACGACGCTTGATGTGATTGATACTTTGGGTGGTCTCCTTATTATAT
GGCGTTCACCCGATTTCACTCTTCTTGAGGAACTTGATGATTTGGTGGGATTGGGAGGCGATTCTTGGATTAATGGTGGTGATCTTAACATAACCAGATGGTCCTGGGAG
AAATCACATGATCAGTTTATTCCAAATAACATGCAGCTTTTCAACCAATGGATTGCAAATTATCATTTGAGGGACATTGTCACTTTTGATCATTTTCCTCCTGCTATGAT
TGCATGTGATATTGATTGGGGTCCATGCCCTTTCAGAATTGAGAAGTCCTGGCTCTCTACTCCATTATTTTTGCCACTTGTGGAAACTTGGTGGACAAACAATAGGCTTC
GTGAGTTGGATAATATAGGCAACAAGACTCAACTCTTTGTAGAACAATTATCAACAAGTCGTTCATCGAGGGAACAAATTGAACAATTAACTACCCAGGAGCACATCCAA
TGGCAGCAACGTTGTAAACTTAAATGGTTCACGGAGGGTCGAAGCATATTGACTATGGACGATATTGAATCTGAATTTTGCGATTTTTACAAAAATCTCTTTACTAAGAA
GACTGATGAGGTTTTCTTGGCTATCTCTTCTGTTGGTGCTAATAAGTCTCCCATACTTGATGGATTTACTGCATAA
Protein sequenceShow/hide protein sequence
MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSHIIKYIWSYPFIDWTTLDVIDTLGGLLIIWRSPDFTLLEELDDLVGLGGDSWINGGDLNITRWSWE
KSHDQFIPNNMQLFNQWIANYHLRDIVTFDHFPPAMIACDIDWGPCPFRIEKSWLSTPLFLPLVETWWTNNRLRELDNIGNKTQLFVEQLSTSRSSREQIEQLTTQEHIQ
WQQRCKLKWFTEGRSILTMDDIESEFCDFYKNLFTKKTDEVFLAISSVGANKSPILDGFTA