CuGenDBv2

Chy3G058260 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy3G058260
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chrH03:11652638..11653528
RNA-Seq Expression: Chy3G058260
Synteny: Chy3G058260
Gene Ontology terms:
    GO:0006281 - DNA repair (biological process)
    GO:0016020 - membrane (cellular component)
    GO:0004518 - nuclease activity (molecular function)
InterPro domains:
    IPR004808 - AP endonuclease 1
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
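The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("chrH03:11652638..11653528")
print(seqid, start, end, span_length(start, end))
```

If the database actually uses 0-based or half-open coordinates, the `+ 1` in `span_length` would need to be dropped.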


Homology
GenBank top hits    e value    % identity    Alignment
KAE8647279.1 hypothetical protein Csa_002929 [Cucumis sativus]    2.88e-110    82.7
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEELDDLVGLGGDSWINGG
        MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDS+IIK IWS+ FID TTLDVIDT GGLLI+W +PD TLLEELDDLVGLGGDSWINGG
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEELDDLVGLGGDSWINGG

Query:  DFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFENSWLSTPLFRHL---WKLGGQTIG
        D NITRWS EKS+DQF+PNNMQLFNQWIANYH RDIVTLDH P AM ACDIDWGPCPFR E SWLSTPLF  L   W    +  G
Subjt:  DFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFENSWLSTPLFRHL---WKLGGQTIG

KAE8648339.1 hypothetical protein Csa_023126 [Cucumis sativus]    2.47e-48    46.25
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEELDDLVGLGGDSWINGG
        MKIIS N HGLNS KKRALVK  LQQQNPSIVLL+ETKLDDTDS IIKSIWS   I  TTLD+IDT G                     G G +  IN  
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEELDDLVGLGGDSWINGG

Query:  DFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVT----------------LDHLPLAMTACDIDWGPCPFRFENSWLSTPLFR---HLWKLGGQTI
                        + NNM LFNQ IANYH RD+                  +DH PLAMT  DIDWGPCPF+FENSWLSTP FR     W    +  
Subjt:  DFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVT----------------LDHLPLAMTACDIDWGPCPFRFENSWLSTPLFR---HLWKLGGQTI

Query:  GL-LVGLDMD***SSKH*KCSF------VHGITTSVERLL
        G    G+ M      K  KCSF       H   T +  L+
Subjt:  GL-LVGLDMD***SSKH*KCSF------VHGITTSVERLL

RVX17353.1 hypothetical protein CK203_003781 [Vitis vinifera]    1.28e-27    33.03
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEE-----LDDLVGLGGDS
        MKIISWN  GL S KKR +VK  L+ + P +V+ +ETK ++ D   + S+W+    D   L     SGG+LI+W    L+  EE     L D+ GL    
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEE-----LDDLVGLGGDS

Query:  WINGGDFNITRWSLEK-----------SYDQFVPN----------------NMQL------------FNQWIANY--H*RDIV---TLDHLPLAMTACDI
        W  GGDFN+ R S EK            +D F+ +                NMQ+             N+W   +    + ++   T DH P+ +     
Subjt:  WINGGDFNITRWSLEK-----------SYDQFVPN----------------NMQL------------FNQWIANY--H*RDIV---TLDHLPLAMTACDI

Query:  DWGPCPFRFENSWLSTPLFRH
         WGP PFRFEN WL  P F+ 
Subjt:  DWGPCPFRFENSWLSTPLFRH

TYK17414.1 hypothetical protein E5676_scaffold434G002760 [Cucumis melo var. makuwa]    5.08e-36    60.98
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE----------ELDDLVG
        MKI SWN HGLN  KKR +VK L+QQ NPSIVLL+ETKL DTDS+++KSI SF+ I  + +D IDTS GL IL  APD T+ +          +LDDL G
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE----------ELDDLVG

Query:  LGGDSWINGGDFNITRWSLEKSY
        LGGDSWI GG+FN+TRWS EKS+
Subjt:  LGGDSWINGGDFNITRWSLEKSY

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]    2.21e-27    31.08
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE-----------------
        MK ++WN  GL+S KK AL+K  + + NP++V+L+ETKL   D  I+KS+WS   I+ + LD    + G+LILW  PDL   E                 
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE-----------------

Query:  ----------------------ELDDLVGLGGDSWINGGDFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDI-------------------------
                              EL DL  L  + WI  GDFN+TRWS EKS  + +  +M LFN +I +    D+                         
Subjt:  ----------------------ELDDLVGLGGDSWINGGDFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDI-------------------------

Query:  ----------------VTLDHLPLAMTACDIDWGPCPFRFENSWLSTPLFR
                         T DH P+ +     +WG  PFRFEN WLS   F+
Subjt:  ----------------VTLDHLPLAMTACDIDWGPCPFRFENSWLSTPLFR

TrEMBL top hits    e value    % identity    Alignment
A0A0A0KDG4 Uncharacterized protein    4.9e-85    88.24
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEELDDLVGLGGDSWINGG
        MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDS+IIK IWS+ FID TTLDVIDT GGLLI+W +PD TLLEELDDLVGLGGDSWINGG
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEELDDLVGLGGDSWINGG

Query:  DFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFENSWLSTPLF
        D NITRWS EKS+DQF+PNNMQLFNQWIANYH RDIVTLDH P AM ACDIDWGPCPFR E SWLSTPLF
Subjt:  DFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFENSWLSTPLF

A0A438F756 LINE-1 retrotransposable element ORF2 protein    6.4e-24    33.33
Query:  KIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE------------------
        KI+SWN  GL S KKR  V+  L  QNP +V+L+ETK +  D  ++ SIW    +D   L     SGG++ILW +      E                  
Subjt:  KIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE------------------

Query:  ---------------------ELDDLVGLGGDSWINGGDFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFEN
                             EL DL GL    W  GGDFN+ R   EK  D  +  NM+ F+++I         T DH P+ +      WGP PFRFEN
Subjt:  ---------------------ELDDLVGLGGDSWINGGDFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFEN

Query:  SWLSTPLFRHLWK
         WL  P F+  ++
Subjt:  SWLSTPLFRHLWK

A0A438HFR2 Transposon TX1 uncharacterized 149 kDa protein    6.4e-24    33.33
Query:  KIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE------------------
        KI+SWN  GL S KKR  V+  L  QNP +V+L+ETK +  D  ++ SIW    +D   L     SGG++ILW +      E                  
Subjt:  KIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE------------------

Query:  ---------------------ELDDLVGLGGDSWINGGDFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFEN
                             EL DL GL    W  GGDFN+ R   EK  D  +  NM+ F+++I         T DH P+ +      WGP PFRFEN
Subjt:  ---------------------ELDDLVGLGGDSWINGGDFNITRWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFEN

Query:  SWLSTPLFRHLWK
         WL  P F+  ++
Subjt:  SWLSTPLFRHLWK

A0A438K826 Endo/exonuclease/phosphatase domain-containing protein    1.4e-23    33.18
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLT-----LLEELDDLVGLGGDS
        MKIISWN  GL S KKR +VK  L+ + P +V+ +ETK ++ D   + S+W+    D   L     SGG+LI+W    L+     L  EL D+ GL    
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLT-----LLEELDDLVGLGGDS

Query:  WINGGDFNITRWSLE-----------KSYDQFVP----------------NNMQL------------FNQWIANY--H*RDIV---TLDHLPLAMTACDI
        W  GGDFN+ R S E           K +D F+                 +NMQ+             N+W   +    + ++   T DH P+ +     
Subjt:  WINGGDFNITRWSLE-----------KSYDQFVP----------------NNMQL------------FNQWIANY--H*RDIV---TLDHLPLAMTACDI

Query:  DWGPCPFRFENSWLSTPLFR
         WGP PFRFEN WL  P F+
Subjt:  DWGPCPFRFENSWLSTPLFR

A0A5D3D2A8 Uncharacterized protein    2.0e-30    60.98
Query:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE----------ELDDLVG
        MKI SWN HGLN  KKR +VK L+QQ NPSIVLL+ETKL DTDS+++KSI SF+ I  + +D IDTS G LIL  APD T+ +          +LDDL G
Subjt:  MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLE----------ELDDLVG

Query:  LGGDSWINGGDFNITRWSLEKSY
        LGGDSWI GG+FN+TRWS EKS+
Subjt:  LGGDSWINGGDFNITRWSLEKSY

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found

Sequences
CDS sequence
ATGAAGATCATCTCATGGAATGGTCATGGCTTGAATTCCTGTAAGAAACGTGCATTGGTTAAGGGGTTGTTGCAACAACAGAACCCGAGTATTGTTCTTCTCCGG
GAAACCAAACTTGATGACACCGATTCCTATATCATTAAATCCATTTGGAGCTTCTCGTTTATCGACCGGACAACGCTTGATGTGATTGATACTTCGGGTGGTCTC
CTTATTTTATGGCCTGCACCCGATCTCACTCTTCTTGAGGAACTTGATGATTTGGTGGGATTGGGAGGCGATTCTTGGATTAATGGTGGTGATTTTAACATAACC
AGATGGTCCTTGGAGAAATCATATGATCAATTTGTTCCAAATAACATGCAGCTTTTCAACCAATGGATTGCAAATTATCATTAGAGGGACATTGTCACTTTGGAT
CATCTTCCTCTTGCTATGACTGCATGTGATATTGATTGGGGTCCGTGCCCTTTCAGATTTGAGAACTCCTGGCTCTCTACTCCATTATTTCGCCACTTGTGGAAA
CTTGGTGGACAAACAATAGGGTTGCTGGTTGGCCTGGACATGGATTGATGATGAAGCTCAAAGCATTAGAAATGTTCTTTCGTTCATGGAATAACAACCAGCGTG
GAGAGGCTACTAAA
mRNA sequence
ATGAAGATCATCTCATGGAATGGTCATGGCTTGAATTCCTGTAAGAAACGTGCATTGGTTAAGGGGTTGTTGCAACAACAGAACCCGAGTATTGTTCTTCTCCGG
GAAACCAAACTTGATGACACCGATTCCTATATCATTAAATCCATTTGGAGCTTCTCGTTTATCGACCGGACAACGCTTGATGTGATTGATACTTCGGGTGGTCTC
CTTATTTTATGGCCTGCACCCGATCTCACTCTTCTTGAGGAACTTGATGATTTGGTGGGATTGGGAGGCGATTCTTGGATTAATGGTGGTGATTTTAACATAACC
AGATGGTCCTTGGAGAAATCATATGATCAATTTGTTCCAAATAACATGCAGCTTTTCAACCAATGGATTGCAAATTATCATTAGAGGGACATTGTCACTTTGGAT
CATCTTCCTCTTGCTATGACTGCATGTGATATTGATTGGGGTCCGTGCCCTTTCAGATTTGAGAACTCCTGGCTCTCTACTCCATTATTTCGCCACTTGTGGAAA
CTTGGTGGACAAACAATAGGGTTGCTGGTTGGCCTGGACATGGATTGATGATGAAGCTCAAAGCATTAGAAATGTTCTTTCGTTCATGGAATAACAACCAGCGTG
GAGAGGCTACTAAA
Protein sequence
MKIISWNGHGLNSCKKRALVKGLLQQQNPSIVLLRETKLDDTDSYIIKSIWSFSFIDRTTLDVIDTSGGLLILWPAPDLTLLEELDDLVGLGGDSWINGGDFNIT
RWSLEKSYDQFVPNNMQLFNQWIANYH*RDIVTLDHLPLAMTACDIDWGPCPFRFENSWLSTPLFRHLWKLGGQTIGLLVGLDMD***SSKH*KCSFVHGITTSVER
LLX
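
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code, writes stop codons as `*`, and writes an incomplete trailing codon as `X` (which appears to be the convention behind the final `X` in this record, though the page does not say so). The function name `translate` and the short test fragments are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a DNA string in frame 1.

    Stop codons become '*'; unknown or incomplete codons become 'X'.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        # A trailing fragment shorter than 3 nt is never in the table.
        protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)


# Fragments copied from the CDS above.
print(translate("ATGAAGATCATCTCATGGAAT"))     # start of the CDS
print(translate("GCAAATTATCATTAGAGGGACATT"))  # region with an in-frame TAG
print(translate("GAGAGGCTACTAAA"))            # last 14 nt, ends in a partial codon
```

Joining the seven CDS lines into one string and passing it to `translate` gives the full protein, including the in-frame stop codons that show up as `*` in the alignments.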