; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G02550 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G02550
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionHNH domain-containing protein
Genome locationChr1:1618309..1620671
RNA-Seq ExpressionCSPI01G02550
SyntenyCSPI01G02550
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR002711 - HNH endonuclease
IPR003615 - HNH nuclease


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138181.1 uncharacterized protein LOC101204599 [Cucumis sativus]5.3e-10299.45Show/hide
Query:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
        MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
Subjt:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ

Query:  TRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGSL
        TRVNRFKSNKDDVDTSELK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGSL
Subjt:  TRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGSL

XP_008453223.1 PREDICTED: uncharacterized protein LOC103494011 isoform X1 [Cucumis melo]7.9e-9896.15Show/hide
Query:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
        MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
Subjt:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ

Query:  TRVNRFKSNKDDVDTSELKSYSCDVKFT----DKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEE
        TRVNRFKSNKDDVDTSELK YSCDVKFT    DKELDIIEMAVYGDVIRPGNQCRCRTVSE+LGQYKSKDRLAPCKLPYNE+
Subjt:  TRVNRFKSNKDDVDTSELKSYSCDVKFT----DKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEE

XP_008453224.1 PREDICTED: uncharacterized protein LOC103494011 isoform X2 [Cucumis melo]1.4e-9998.31Show/hide
Query:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
        MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
Subjt:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ

Query:  TRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEE
        TRVNRFKSNKDDVDTSELK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSE+LGQYKSKDRLAPCKLPYNE+
Subjt:  TRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEE

XP_022933741.1 uncharacterized protein LOC111441065 [Cucurbita moschata]6.0e-9891.94Show/hide
Query:  DGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTAD
        DGV TPMEATSSSPSPTRSRGG R SGSG+G+ERPRFFD KAKA CWAKAD VPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVP+SKGGESTAD
Subjt:  DGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTAD

Query:  NCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGS
        NCQILQTRVNRFKS+KDDVDTS+LK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSE+LGQYKSKDRLAPCKLPYNE+ S
Subjt:  NCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGS

XP_023531696.1 uncharacterized protein LOC111793864 [Cucurbita pepo subsp. pepo]6.0e-9891.94Show/hide
Query:  DGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTAD
        DGV TPMEATSSSPSPTRSRGG R SGSG+G+ERPRFFD KAKA CWAKAD VPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVP+SKGGESTAD
Subjt:  DGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTAD

Query:  NCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGS
        NCQILQTRVNRFKS+KDDVDTS+LK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSE+LGQYKSKDRLAPCKLPYNE+ S
Subjt:  NCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGS

TrEMBL top hitse value%identityAlignment
A0A0A0LUK3 HNHc domain-containing protein5.0e-12299.52Show/hide
Query:  MGLLQFPRSAHSDCGRFFHFCLLDGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNC
        MGLLQFPRSAHSDCGRFFHFCLLDGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNC
Subjt:  MGLLQFPRSAHSDCGRFFHFCLLDGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNC

Query:  QGCLCFEYDHIVPYSKGGESTADNCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPC
        QGCLCFEYDHIVPYSKGGESTADNCQILQTRVNRFKSNKDDVDTSELK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPC
Subjt:  QGCLCFEYDHIVPYSKGGESTADNCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPC

Query:  KLPYNEEGSL
        KLPYNEEGSL
Subjt:  KLPYNEEGSL

A0A1S3BVP6 uncharacterized protein LOC103494011 isoform X27.0e-10098.31Show/hide
Query:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
        MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
Subjt:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ

Query:  TRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEE
        TRVNRFKSNKDDVDTSELK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSE+LGQYKSKDRLAPCKLPYNE+
Subjt:  TRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEE

A0A5A7UWI1 HNH endonuclease7.0e-10098.31Show/hide
Query:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
        MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ
Subjt:  MEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQILQ

Query:  TRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEE
        TRVNRFKSNKDDVDTSELK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSE+LGQYKSKDRLAPCKLPYNE+
Subjt:  TRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEE

A0A6J1EZW3 uncharacterized protein LOC1114410652.9e-9891.94Show/hide
Query:  DGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTAD
        DGV TPMEATSSSPSPTRSRGG R SGSG+G+ERPRFFD KAKA CWAKAD VPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVP+SKGGESTAD
Subjt:  DGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTAD

Query:  NCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGS
        NCQILQTRVNRFKS+KDDVDTS+LK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSE+LGQYKSKDRLAPCKLPYNE+ S
Subjt:  NCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGS

A0A6J1I7G1 uncharacterized protein LOC1114715092.9e-9891.94Show/hide
Query:  DGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTAD
        DGV TPMEATSSSPSPTRSRGG R SGSG+G+ERPRFFD KAKA CWAKAD VPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVP+SKGGESTAD
Subjt:  DGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTAD

Query:  NCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGS
        NCQILQTRVNRFKS+KDDVDTS+LK YSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSE+LGQYKSKDRLAPCKLPYNE+ S
Subjt:  NCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G18680.1 HNH endonuclease domain-containing protein5.1e-7168.18Show/hide
Query:  TPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQI
        T     ++ P P+      R +   +G+ERPRFFD KAK  CWA AD+VPGRHPERWRKD AGN+VCKRF NC GCLCFEYDHIVPYSKGGES A+NCQI
Subjt:  TPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDHIVPYSKGGESTADNCQI

Query:  LQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLP
        LQTRVNRFKS +++VD + LKSYSC ++FTDKELD+IEMAVYGDV+RPG +CRC+TV+E+LGQ KSKD  A C+LP
Subjt:  LQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLP

AT3G47490.1 HNH endonuclease1.3e-3442.61Show/hide
Query:  PTPMEATSSSPSPTRSRGGQRDS------------------GSGSGD--ERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCF
        P+      SS SP R+ G  R                    GSG GD    PR F    K  CW KA+ + GR PERWR+D  GN+V ++   C GCLC 
Subjt:  PTPMEATSSSPSPTRSRGGQRDS------------------GSGSGD--ERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCF

Query:  EYDHIVPYSKGGESTADNCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCR
        +YDHIVPYSKGG+ST +NCQ+LQ +VNR K NK D+  SEL   S   +   +++D+IE+  YG+V R      CR
Subjt:  EYDHIVPYSKGGESTADNCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCR

AT3G47490.2 HNH endonuclease5.6e-1741.82Show/hide
Query:  PTPMEATSSSPSPTRSRGGQRDS------------------GSGSGD--ERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCF
        P+      SS SP R+ G  R                    GSG GD    PR F    K  CW KA+ + GR PERWR+D  GN+V ++   C GCLC 
Subjt:  PTPMEATSSSPSPTRSRGGQRDS------------------GSGSGD--ERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCF

Query:  EYDHIVPYSK
        +YDHIVPYSK
Subjt:  EYDHIVPYSK

AT3G47490.3 HNH endonuclease5.2e-3138.27Show/hide
Query:  PTPMEATSSSPSPTRSRGGQRDS------------------GSGSGD--ERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCF
        P+      SS SP R+ G  R                    GSG GD    PR F    K  CW KA+ + GR PERWR+D  GN+V ++   C GCLC 
Subjt:  PTPMEATSSSPSPTRSRGGQRDS------------------GSGSGD--ERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCF

Query:  EYDHIVPYS--------------------KGGESTADNCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCR
        +YDHIVPYS                    KGG+ST +NCQ+LQ +VNR K NK D+  SEL   S   +   +++D+IE+  YG+V R      CR
Subjt:  EYDHIVPYS--------------------KGGESTADNCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCTTCTCCAATTTCCCCGTTCGGCCCATTCGGATTGTGGGCGTTTCTTCCATTTTTGTCTGCTGGACGGCGTCCCGACTCCGATGGAAGCTACTTCCAGTTCGCC
GTCACCGACTCGTTCCCGCGGCGGGCAAAGAGATAGTGGCTCCGGAAGTGGGGACGAACGGCCAAGATTTTTCGACGCAAAAGCGAAAGCTAGTTGTTGGGCTAAGGCGG
ATGTTGTCCCTGGGCGGCATCCCGAGCGTTGGCGTAAGGACGCCGCTGGTAATGTCGTCTGTAAGCGCTTCTGTAACTGCCAGGGCTGCCTCTGTTTTGAGTATGATCAC
ATTGTCCCTTACTCTAAAGGAGGCGAATCCACGGCGGATAATTGCCAGATTCTTCAGACGAGAGTAAACAGATTCAAATCGAATAAAGACGATGTTGATACGTCTGAGTT
GAAAAGCTATTCGTGTGATGTCAAGTTTACTGATAAGGAGCTTGACATAATTGAAATGGCTGTTTACGGAGATGTAATCCGGCCAGGGAACCAATGTCGTTGTAGAACAG
TTTCTGAAGTGCTTGGGCAATATAAATCTAAAGACCGTTTGGCCCCCTGCAAATTGCCATATAATGAAGAAGGTTCTTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCTTCTCCAATTTCCCCGTTCGGCCCATTCGGATTGTGGGCGTTTCTTCCATTTTTGTCTGCTGGACGGCGTCCCGACTCCGATGGAAGCTACTTCCAGTTCGCC
GTCACCGACTCGTTCCCGCGGCGGGCAAAGAGATAGTGGCTCCGGAAGTGGGGACGAACGGCCAAGATTTTTCGACGCAAAAGCGAAAGCTAGTTGTTGGGCTAAGGCGG
ATGTTGTCCCTGGGCGGCATCCCGAGCGTTGGCGTAAGGACGCCGCTGGTAATGTCGTCTGTAAGCGCTTCTGTAACTGCCAGGGCTGCCTCTGTTTTGAGTATGATCAC
ATTGTCCCTTACTCTAAAGGAGGCGAATCCACGGCGGATAATTGCCAGATTCTTCAGACGAGAGTAAACAGATTCAAATCGAATAAAGACGATGTTGATACGTCTGAGTT
GAAAAGCTATTCGTGTGATGTCAAGTTTACTGATAAGGAGCTTGACATAATTGAAATGGCTGTTTACGGAGATGTAATCCGGCCAGGGAACCAATGTCGTTGTAGAACAG
TTTCTGAAGTGCTTGGGCAATATAAATCTAAAGACCGTTTGGCCCCCTGCAAATTGCCATATAATGAAGAAGGTTCTTTGTAACAACAAAGAAGACTTGAAACTAAAGCA
TGTTTGCCCCATCTATAGTTTAATACGATTGATAAACATCATTTTAAGATTACATCAATTAGAATTGTGTGGACCTCCATTACTTGATTAGATCAATATGAATCAACATC
GACCATCATTTTTTTTTTATTATTAGGGTTTGCTTTCAAATATACTAAAATATTTATGCC
Protein sequenceShow/hide protein sequence
MGLLQFPRSAHSDCGRFFHFCLLDGVPTPMEATSSSPSPTRSRGGQRDSGSGSGDERPRFFDAKAKASCWAKADVVPGRHPERWRKDAAGNVVCKRFCNCQGCLCFEYDH
IVPYSKGGESTADNCQILQTRVNRFKSNKDDVDTSELKSYSCDVKFTDKELDIIEMAVYGDVIRPGNQCRCRTVSEVLGQYKSKDRLAPCKLPYNEEGSL