CuGenDBv2

CSPI01G15200 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G15200
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: protein DOUBLE-STRAND BREAK FORMATION
Genome location: Chr1:10796811..10800831
RNA-Seq Expression: CSPI01G15200
Synteny: CSPI01G15200
Gene Ontology terms: GO:0042138 - meiotic DNA double-strand break formation (biological process)
InterPro domains: IPR044969 - Protein DOUBLE-STRAND BREAK FORMATION
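
The genome location string follows a `chromosome:start..end` pattern. A minimal Python sketch for parsing it and computing the span of the locus is given here; the helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption, since the page does not state it.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a string such as 'Chr1:10796811..10800831' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    return end - start + 1


chrom, start, end = parse_location("Chr1:10796811..10800831")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 4021 bp on Chr1; with a 0-based, half-open convention the figure would be 4020.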


Homology
GenBank top hits | e value | % identity (alignment shown below each hit)
XP_004139510.2 protein DOUBLE-STRAND BREAK FORMATION isoform X1 [Cucumis sativus] | 8.8e-116 | 98.64
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        MDVNSSFKEVLRFESLSIIRETSEKTDD KLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSI+IKAYEQ
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
        ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ

Query:  GRTKSSFVIRPTCSLSSYPSD
        GRT+SSFVIRPTCSLSSYPSD
Subjt:  GRTKSSFVIRPTCSLSSYPSD

XP_008464290.1 PREDICTED: uncharacterized protein LOC103502216 isoform X1 [Cucumis melo] | 1.9e-97 | 94.47
Query:  DVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQA
        DV SSF E+LRFESLSIIRET+EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQA
Subjt:  DVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQA

Query:  LSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
        LSSLQQSDTANYTSHGSFK TEV+EKINRLKDHALNL+GSHSVQALTSDYLKKKVTER+RKISSSCTRKFTASTLF NGIRNYNARKLHEYRS  GVNQ
Subjt:  LSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ

XP_011654706.1 protein DOUBLE-STRAND BREAK FORMATION isoform X2 [Cucumis sativus] | 8.8e-116 | 98.64
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        MDVNSSFKEVLRFESLSIIRETSEKTDD KLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSI+IKAYEQ
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
        ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ

Query:  GRTKSSFVIRPTCSLSSYPSD
        GRT+SSFVIRPTCSLSSYPSD
Subjt:  GRTKSSFVIRPTCSLSSYPSD

XP_022970619.1 uncharacterized protein LOC111469552 isoform X1 [Cucurbita maxima] | 9.2e-89 | 88.89
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        MDV S  KE+LRFESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQ
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGV
        ALSSLQQSDTANYTSHGS KR EV+EKI RLKDHAL  AGSHSVQALTS+YLKKKVTERNRKISSSCTRKFTASTLF NGIRN+NA+KLHEY++ EG+
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGV

XP_022970621.1 uncharacterized protein LOC111469552 isoform X2 [Cucurbita maxima] | 3.2e-89 | 87.62
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        MDV S  KE+LRFESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQ
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
        ALSSLQQSDTANYTSHGS KR EV+EKI RLKDHAL  AGSHSVQALTS+YLKKKVTERNRKISSSCTRKFTASTLF NGIRN+NA+KLHEY++ EG+  
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ

Query:  GR
         R
Subjt:  GR

TrEMBL top hits | e value | % identity (alignment shown below each hit)
A0A0A0LSV1 Uncharacterized protein | 4.3e-116 | 98.64
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        MDVNSSFKEVLRFESLSIIRETSEKTDD KLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSI+IKAYEQ
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
        ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ

Query:  GRTKSSFVIRPTCSLSSYPSD
        GRT+SSFVIRPTCSLSSYPSD
Subjt:  GRTKSSFVIRPTCSLSSYPSD

A0A1S3CL48 uncharacterized protein LOC103502216 isoform X1 | 9.0e-98 | 94.47
Query:  DVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQA
        DV SSF E+LRFESLSIIRET+EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQA
Subjt:  DVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQA

Query:  LSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
        LSSLQQSDTANYTSHGSFK TEV+EKINRLKDHALNL+GSHSVQALTSDYLKKKVTER+RKISSSCTRKFTASTLF NGIRNYNARKLHEYRS  GVNQ
Subjt:  LSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ

A0A6J1HMC3 uncharacterized protein LOC111464906 isoform X2 | 8.4e-88 | 83.65
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        MDV S  KE+L FESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQ
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
        ALSSLQQSDTANYTSHGS K  EV+EKI RLKDHAL  AGSHSVQALTS+YLKK+VTERNRKISSSCTRKFTASTLF NGIRN+NA++LHEY++ EG+  
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ

Query:  GRTKSSFV
         R    F+
Subjt:  GRTKSSFV

A0A6J1I136 uncharacterized protein LOC111469552 isoform X2 | 1.5e-89 | 87.62
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        MDV S  KE+LRFESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQ
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ
        ALSSLQQSDTANYTSHGS KR EV+EKI RLKDHAL  AGSHSVQALTS+YLKKKVTERNRKISSSCTRKFTASTLF NGIRN+NA+KLHEY++ EG+  
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQ

Query:  GR
         R
Subjt:  GR

A0A6J1I645 uncharacterized protein LOC111469552 isoform X1 | 4.5e-89 | 88.89
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        MDV S  KE+LRFESLSIIRET EKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFR LKSFNQP LQVSHAEWLNFAEHSL+AGFFSIAIKAYEQ
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGV
        ALSSLQQSDTANYTSHGS KR EV+EKI RLKDHAL  AGSHSVQALTS+YLKKKVTERNRKISSSCTRKFTASTLF NGIRN+NA+KLHEY++ EG+
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGV

SwissProt top hits | e value | % identity (alignment shown below each hit)
Q8RX33 Protein DOUBLE-STRAND BREAK FORMATION | 2.0e-25 | 46.85
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        ++V S  ++ +R ES+ I  E + ++   KL V+EF  RAFAL+GD+ESCLA+RYEALN R LKS +  WL VSH+EW  FA  S+  GF SIA KA E 
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHS
        AL SL++       S  +    +  EK+ RL+D A +L  SHS
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHS

Arabidopsis top hits | e value | % identity (alignment shown below each hit)
AT1G07060.1 unknown protein | 1.4e-26 | 46.85
Query:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ
        ++V S  ++ +R ES+ I  E + ++   KL V+EF  RAFAL+GD+ESCLA+RYEALN R LKS +  WL VSH+EW  FA  S+  GF SIA KA E 
Subjt:  MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQ

Query:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHS
        AL SL++       S  +    +  EK+ RL+D A +L  SHS
Subjt:  ALSSLQQSDTANYTSHGSFKRTEVMEKINRLKDHALNLAGSHS
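
The % identity values in the hit tables can be related to the middle line of each alignment block. The sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap); the function names are hypothetical, and this is not necessarily how the database itself derives its figures. The aligned length is taken from the query line because trailing spaces in a middle line may have been lost.

```python
def count_identities(midline: str) -> int:
    """Count identical positions in a BLAST-style middle line.

    Assumes letters mark identities, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps.
    """
    return sum(1 for ch in midline if ch.isalpha())


def percent_identity(midlines: list[str], aligned_length: int) -> float:
    """Percent identity over all blocks of one alignment, rounded to 2 decimals."""
    identities = sum(count_identities(m) for m in midlines)
    return round(100.0 * identities / aligned_length, 2)


# Last alignment block of the first GenBank hit (21 aligned positions).
query = "GRTKSSFVIRPTCSLSSYPSD"
midline = "GRT+SSFVIRPTCSLSSYPSD"
print(count_identities(midline), percent_identity([midline], len(query)))
```

For a full hit, pass the middle lines of all its blocks together with the summed length of the corresponding query lines.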


Sequences
CDS sequence
ATGGATGTTAATTCCAGCTTTAAAGAAGTTCTCAGATTTGAATCTCTATCTATCATTCGTGAAACCTCTGAGAAAACTGATGATCAAAAGCTTCTAGTCATCGAATTTCT
TGTTCGAGCTTTTGCCCTTGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATACGAGGCCTTGAATTTTCGGGTACTGAAGTCTTTCAATCAACCATGGCTTCAAGTAT
CACACGCAGAATGGTTAAACTTCGCTGAGCATTCATTGCATGCTGGCTTTTTTTCAATTGCCATAAAGGCATATGAGCAAGCGCTGTCAAGCCTTCAGCAGAGTGATACT
GCAAACTACACATCACATGGTTCCTTTAAACGCACAGAAGTTATGGAGAAGATAAATAGACTCAAAGATCATGCTCTGAATTTAGCTGGTTCCCATTCTGTTCAAGCTCT
CACATCTGATTATTTGAAAAAGAAAGTAACTGAAAGGAACAGAAAGATTTCTTCATCCTGCACAAGAAAATTTACAGCGAGCACTCTATTCATAAATGGTATCAGAAACT
ACAATGCAAGAAAGCTGCATGAATATCGAAGTTTTGAAGGGGTTAACCAGGGTCGCACAAAATCCAGTTTTGTGATCAGACCTACATGTAGTCTCTCTTCATATCCATCT
GACTAG
mRNA sequence
AGCTTTCCCACCAATTATTAATTGATGTTCTGTTCATACTCTCTCTTTCTTTCGCGCCTCAGAAGCCGAAGGTTTGGTTTGGCTATTCACTCTCAACTCTTATCAAACTC
ATTTGATCCAATTTTCCATCCTTAATCTCCATTTATTATCACTTCTTACATTCTCTCTGATTCTTTACCGCTCAAGACTTCCACAGATTTGATGATTCTACTTTGCGAAT
TCTGGAATCATTTCCCGCTTCCAAAGACGCGACGTCGTTGATGGATGTTAATTCCAGCTTTAAAGAAGTTCTCAGATTTGAATCTCTATCTATCATTCGTGAAACCTCTG
AGAAAACTGATGATCAAAAGCTTCTAGTCATCGAATTTCTTGTTCGAGCTTTTGCCCTTGTTGGAGACATTGAGAGTTGCTTAGCTTTGAGATACGAGGCCTTGAATTTT
CGGGTACTGAAGTCTTTCAATCAACCATGGCTTCAAGTATCACACGCAGAATGGTTAAACTTCGCTGAGCATTCATTGCATGCTGGCTTTTTTTCAATTGCCATAAAGGC
ATATGAGCAAGCGCTGTCAAGCCTTCAGCAGAGTGATACTGCAAACTACACATCACATGGTTCCTTTAAACGCACAGAAGTTATGGAGAAGATAAATAGACTCAAAGATC
ATGCTCTGAATTTAGCTGGTTCCCATTCTGTTCAAGCTCTCACATCTGATTATTTGAAAAAGAAAGTAACTGAAAGGAACAGAAAGATTTCTTCATCCTGCACAAGAAAA
TTTACAGCGAGCACTCTATTCATAAATGGTATCAGAAACTACAATGCAAGAAAGCTGCATGAATATCGAAGTTTTGAAGGGGTTAACCAGGGTCGCACAAAATCCAGTTT
TGTGATCAGACCTACATGTAGTCTCTCTTCATATCCATCTGACTAGATAAATGTTCAGGGAGTCGATCAGGGTGACTCGGGCGGCAAAAGAAAAAGTGTTTCTATATAAT
TCCAGTTGATCTCAGTTCTGTGCAAGTCCAAGTTGCTTCTCCCCGTCTGACAGGTTTGTATTATCAATACCCCACCCCACCCCAAATTCTTCATTTCTTCCTTTTTGTAA
GAGTTCAATTTTTCAATACCACAAAATCTTTCATGTGGTAA
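
The CDS listed above appears as a contiguous stretch inside the mRNA sequence, flanked by untranslated regions. A minimal sketch for locating it is shown here, assuming the mRNA is the spliced transcript and the line breaks have been removed from both sequences; `locate_cds` is a hypothetical helper, and the toy sequences are for illustration only.

```python
def locate_cds(mrna: str, cds: str) -> tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) for a CDS found inside an mRNA.

    Uses the first exact occurrence of the CDS; raises ValueError if absent.
    """
    mrna, cds = mrna.upper(), cds.upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)


# Toy sequences for illustration only (not the sequences of this record).
toy_mrna = "GGCC" + "ATGGATTAG" + "AAATTT"
print(locate_cds(toy_mrna, "ATGGATTAG"))  # (4, 6)
```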
Protein sequence
MDVNSSFKEVLRFESLSIIRETSEKTDDQKLLVIEFLVRAFALVGDIESCLALRYEALNFRVLKSFNQPWLQVSHAEWLNFAEHSLHAGFFSIAIKAYEQALSSLQQSDT
ANYTSHGSFKRTEVMEKINRLKDHALNLAGSHSVQALTSDYLKKKVTERNRKISSSCTRKFTASTLFINGIRNYNARKLHEYRSFEGVNQGRTKSSFVIRPTCSLSSYPS
D
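
The protein sequence can be cross-checked against the CDS by conceptual translation. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and an unambiguous A/C/G/T sequence with line breaks removed; `translate` is a hypothetical helper, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, amino acids listed in TCAG codon order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 nucleotides of the CDS listed above.
print(translate("ATGGATGTTAATTCCAGC"))  # MDVNSS
```

The same function applied to the last twelve nucleotides of the CDS (`CCATCTGACTAG`) yields `PSD`, which matches the end of the protein sequence.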