; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G6838 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G6838
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionYqaJ domain-containing protein
Genome locationctg1522:578872..581693
RNA-Seq ExpressionCucsat.G6838
SyntenyCucsat.G6838
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR011335 - Restriction endonuclease type II-like
IPR011604 - Exonuclease, phage-type/RecB, C-terminal
IPR017482 - Putative phage-type endonuclease
IPR019080 - YqaJ viral recombinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142946.1 uncharacterized protein LOC101223120 isoform X1 [Cucumis sativus]3.14e-28399.74Show/hide
Query:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
        MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQS+SLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
Subjt:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR

Query:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
        PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
Subjt:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN

Query:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
        AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
Subjt:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR

Query:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
        VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
Subjt:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR

XP_004142947.1 uncharacterized protein LOC101223120 isoform X2 [Cucumis sativus]4.68e-237100Show/hide
Query:  MSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNR
        MSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNR
Subjt:  MSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNR

Query:  RIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTI
        RIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTI
Subjt:  RIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTI

Query:  PFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKL
        PFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKL
Subjt:  PFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKL

Query:  FCREIAGHVEFYR
        FCREIAGHVEFYR
Subjt:  FCREIAGHVEFYR

XP_008444417.1 PREDICTED: uncharacterized protein LOC103487752 isoform X1 [Cucumis melo]1.80e-25290.5Show/hide
Query:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
        MKFAAVSFSQSGASRSL HGGSSFNQL PVAS SAR+F   NS+SLLVCGLCRTL QSNS VE AIMSTMNNISIARICCR SRKNA+LYLKRN  IASR
Subjt:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR

Query:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
         FSTC +PSS T NP VIWLPSPL+LASQ NQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRR ELWHEKVFPSE QKT+APQQNAMEWGVLNE N
Subjt:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN

Query:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
        AIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKGKPEKGLPWST+PFYYMPQVQGQMEIMGREW+DLYCWTPNGSTIFR
Subjt:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR

Query:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
        VCRERGYWDLIREIL+EFWWENVVPAKEAL LG EE+AKSYKPTSTHKQTGLAIAKSIKLASEAKL CREIAGHVEFYR
Subjt:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR

XP_022131520.1 uncharacterized protein LOC111004694 isoform X1 [Momordica charantia]5.47e-22280.74Show/hide
Query:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
        MK AAVSFSQSGASR  LHGG SFN+L  VAS+SA +  +F S SLLVCG CRTL Q NS + T  MSTM+  SI+RICCRH   NARL  KR H   SR
Subjt:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR

Query:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
         FSTC S SSST NPLV   PS L+LASQ   S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRR ELWHEKVFP EIQKTEA Q+ AMEWGVLNE  
Subjt:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN

Query:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
        AIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKG+PEKGLPWST+PFYYMPQVQGQMEIM REW DLYCWTPNGSTIFR
Subjt:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR

Query:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
        VCRERGYW+L+RE+LREFWWENVVPA+EAL LG EE+ KSYKPTSTHKQTGLAIAKSIKLASEAKL  REIAGHVEFYR
Subjt:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR

XP_038884042.1 uncharacterized protein LOC120074988 isoform X1 [Benincasa hispida]9.62e-23985.49Show/hide
Query:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
        MK AAVSFS++GAS+ LLHGGSSFN+   VAS +ARQ  +F+S SLLVCGLCRTL  SNS VETAIMSTMNN SI+RICCRHSR NARL L+R H   SR
Subjt:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR

Query:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
         FSTC SPSSS  NPLVI LPS L+LASQ   S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRR ELWHEKVFP EIQK EAPQQNAMEWGVLNE N
Subjt:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN

Query:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
        AIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKGKPEKGLPWST+PFYYMPQVQGQMEIM REWADLYCWTPNGSTIFR
Subjt:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR

Query:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
        VCRERGYWDLIRE+LREFWWENVVPA+EALLLG EE+ KSYKPTSTHKQTGLAIAKSIKLASEAKL CREIAGH+EFYR
Subjt:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR

TrEMBL top hitse value%identityAlignment
A0A0A0LNG4 YqaJ domain-containing protein1.52e-28399.74Show/hide
Query:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
        MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQS+SLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
Subjt:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR

Query:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
        PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
Subjt:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN

Query:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
        AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
Subjt:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR

Query:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
        VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
Subjt:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR

A0A1S3B9S9 uncharacterized protein LOC103487752 isoform X18.69e-25390.5Show/hide
Query:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
        MKFAAVSFSQSGASRSL HGGSSFNQL PVAS SAR+F   NS+SLLVCGLCRTL QSNS VE AIMSTMNNISIARICCR SRKNA+LYLKRN  IASR
Subjt:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR

Query:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
         FSTC +PSS T NP VIWLPSPL+LASQ NQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRR ELWHEKVFPSE QKT+APQQNAMEWGVLNE N
Subjt:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN

Query:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
        AIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKGKPEKGLPWST+PFYYMPQVQGQMEIMGREW+DLYCWTPNGSTIFR
Subjt:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR

Query:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
        VCRERGYWDLIREIL+EFWWENVVPAKEAL LG EE+AKSYKPTSTHKQTGLAIAKSIKLASEAKL CREIAGHVEFYR
Subjt:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR

A0A5A7V2D1 Restriction endonuclease2.53e-21892.01Show/hide
Query:  MSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNR
        MSTMNNISIARICCR SRKNA+LYLKRN  IASR FSTC +PSS T NP VIWLPSPL+LASQ NQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNR
Subjt:  MSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNR

Query:  RIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTI
        R ELWHEKVFPSE QKT+APQQNAMEWGVLNE NAIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKGKPEKGLPWST+
Subjt:  RIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTI

Query:  PFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKL
        PFYYMPQVQGQMEIMGREW+DLYCWTPNGSTIFRVCRERGYWDLIREIL+EFWWENVVPAKEAL LG EE+AKSYKPTSTHKQTGLAIAKSIKLASEAKL
Subjt:  PFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKL

Query:  FCREIAGHVEFYR
         CREIAGHVEFYR
Subjt:  FCREIAGHVEFYR

A0A6J1BPX6 uncharacterized protein LOC111004694 isoform X12.65e-22280.74Show/hide
Query:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
        MK AAVSFSQSGASR  LHGG SFN+L  VAS+SA +  +F S SLLVCG CRTL Q NS + T  MSTM+  SI+RICCRH   NARL  KR H   SR
Subjt:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR

Query:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
         FSTC S SSST NPLV   PS L+LASQ   S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRR ELWHEKVFP EIQKTEA Q+ AMEWGVLNE  
Subjt:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN

Query:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
        AIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKG+PEKGLPWST+PFYYMPQVQGQMEIM REW DLYCWTPNGSTIFR
Subjt:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR

Query:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
        VCRERGYW+L+RE+LREFWWENVVPA+EAL LG EE+ KSYKPTSTHKQTGLAIAKSIKLASEAKL  REIAGHVEFYR
Subjt:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR

A0A6J1GHX8 uncharacterized protein LOC111454092 isoform X12.15e-21979.16Show/hide
Query:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR
        MK AAVSFSQSGASRS LH  SSFN+L  VAS SARQ  +F+S S  VCG CRT  QSNS + TA++STM+N SIARICCR  R NARL+ KR     SR
Subjt:  MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASR

Query:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN
         FST +SP  S  NPL+I LPS L++ASQ   S APQRSEEWFALRRDKLTTSTFSTALGFWKGNRR ELWHEKVFPSEI+K EA QQ AMEWGVLNE N
Subjt:  PFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVN

Query:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR
        AI RYK ITGRDVS LGFATHSEQQ DWLGASPDGLL CFQGGGILEVKCPYNKGKPEKGLPWST+PFYYMPQVQGQ+EIM REWADLYCWTPNGSTIFR
Subjt:  AIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFR

Query:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR
        VCRERGYW+LI E+LREFWWENVVPA+EAL  G E++ +SYKPTSTHKQTG+AIAKSIKLAS+AKL CREIAGHVEFYR
Subjt:  VCRERGYWDLIREILREFWWENVVPAKEALLLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13810.1 Restriction endonuclease, type II-like superfamily protein1.2e-4134.01Show/hide
Query:  EEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATH---SEQQFDWLGASPDGL
        + W  LR+++LT S F+ A+GF    RR  LW EK+  ++          A  W + NEV A++RY  +TG ++ +  F  +      + +WLGASPDG+
Subjt:  EEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATH---SEQQFDWLGASPDGL

Query:  LECFQGG----GILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEAL--
        +   + G    G+LEVKCP++     K  PW  +P+  +PQ+QG MEI+  +W DLYCWT NGS++FRV R+  +W+ ++  L +FW  +V+PA+E    
Subjt:  LECFQGG----GILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEAL--

Query:  --LLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHV
          +   + K + +KP   H+     +  + ++++ A     EI G++
Subjt:  --LLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHV

AT1G67660.1 Restriction endonuclease, type II-like superfamily protein5.7e-10553.63Show/hide
Query:  SSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLP
        SS   +  +++ SAR  G        VC +CR L+ +   + + I+S M   SI+       + +  +  ++      R  ST +S  + T +P      
Subjt:  SSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLP

Query:  SPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATH
        S ++++S  + S  PQ+SEEWFALR+DKLTTSTFSTALGFWKGNRR ELWHEKV+ S+ +  E   + AM WGV  E +AI+RYK I G +V  +GFA H
Subjt:  SPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATH

Query:  SEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWE
        S ++F WLGASPDG+L+CF   GILEVKCPYNKGK E  LPW  +P+YYMPQ+QGQMEIM REW +LYCWT NGST+FRV R+R YW +I ++LREFWWE
Subjt:  SEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWE

Query:  NVVPAKEALLLGSE-EKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFY
        +V+PA+EALLLG E E+ K Y+PTSTHK+T LAIAKS+ LA+E+KL CREIA HVEF+
Subjt:  NVVPAKEALLLGSE-EKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFY

AT1G67660.2 Restriction endonuclease, type II-like superfamily protein7.0e-10363.18Show/hide
Query:  STCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAI
        ST +S  + T +P      S ++++S  + S  PQ+SEEWFALR+DKLTTSTFSTALGFWKGNRR ELWHEKV+ S+ +  E   + AM WGV  E +AI
Subjt:  STCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAI

Query:  DRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVC
        +RYK I G +V  +GFA HS ++F WLGASPDG+L+CF   GILEVKCPYNKGK E  LPW  +P+YYMPQ+QGQMEIM REW +LYCWT NGST+FRV 
Subjt:  DRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVC

Query:  RERGYWDLIREILREFWWENVVPAKEALLLGSE-EKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFY
        R+R YW +I ++LREFWWE+V+PA+EALLLG E E+ K Y+PTSTHK+T LAIAKS+ LA+E+KL CREIA HVEF+
Subjt:  RERGYWDLIREILREFWWENVVPAKEALLLGSE-EKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFY

AT1G67660.3 Restriction endonuclease, type II-like superfamily protein3.7e-10456.02Show/hide
Query:  VCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRR
        VC +CR L+ +   + + I+S M   SI+       + +  +  ++      R  ST +S  + T +P      S ++++S  + S  PQ+SEEWFALR+
Subjt:  VCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSSSTKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRR

Query:  DKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILE
        DKLTTSTFSTALGFWKGNRR ELWHEKV+ S+ +  E   + AM WGV  E +AI+RYK I G +V  +GFA HS ++F WLGASPDG+L+CF   GILE
Subjt:  DKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFATHSEQQFDWLGASPDGLLECFQGGGILE

Query:  VKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEALLLGSE-EKAKSYKPTST
        VKCPYNKGK E  LPW  +P+YYMPQ+QGQMEIM REW +LYCWT NGST+FRV R+R YW +I ++LREFWWE+V+PA+EALLLG E E+ K Y+PTST
Subjt:  VKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEALLLGSE-EKAKSYKPTST

Query:  HKQTGLAIAKSIKLASEAKLFCREIAGHVEFY
        HK+T LAIAKS+ LA+E+KL CREIA HVEF+
Subjt:  HKQTGLAIAKSIKLASEAKLFCREIAGHVEFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTCGCTGCGGTCTCTTTCTCTCAGTCCGGAGCGTCTCGAAGTCTTTTGCACGGAGGTTCATCTTTCAATCAATTGCTGCCCGTTGCTTCAATTTCAGCTCGCCA
ATTTGGTTCATTCAACTCCAATTCTCTTTTGGTGTGTGGGTTGTGCAGAACGCTTCGTCAGAGTAATTCTTTGGTTGAAACTGCCATTATGTCAACAATGAACAACATCT
CCATTGCTAGAATATGCTGCAGACACTCTAGAAAAAATGCAAGACTGTACTTAAAACGAAATCATGAGATTGCTTCAAGACCCTTTTCAACTTGTGTCTCGCCATCTAGT
TCCACAAAAAATCCTCTGGTCATCTGGTTACCCTCACCTTTGGTTTTGGCTTCCCAGGCCAACCAATCAGTTGCCCCTCAGCGTTCAGAAGAATGGTTTGCACTAAGGAG
AGACAAGCTGACAACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGACGCATTGAGCTATGGCATGAGAAAGTATTCCCTTCAGAGATTCAAAAAACAG
AAGCACCACAACAAAATGCCATGGAGTGGGGTGTGCTCAATGAAGTAAACGCCATTGATCGATATAAAGGCATAACAGGTCGAGATGTAAGCTTGTTAGGGTTTGCAACT
CACTCAGAACAGCAATTTGACTGGCTCGGAGCCTCCCCCGACGGCCTATTGGAATGCTTTCAAGGTGGTGGAATCTTGGAAGTAAAATGTCCATACAACAAGGGAAAGCC
TGAGAAGGGACTGCCCTGGTCGACCATACCTTTCTATTACATGCCACAAGTACAGGGTCAAATGGAGATAATGGGTAGAGAATGGGCGGATCTATATTGCTGGACACCAA
ATGGAAGCACCATATTTCGTGTGTGTAGGGAACGTGGTTATTGGGATTTGATACGTGAAATATTGAGGGAATTTTGGTGGGAAAACGTTGTTCCTGCAAAAGAGGCTTTA
TTGTTGGGAAGTGAGGAAAAGGCGAAGTCATATAAGCCAACGTCCACGCACAAGCAAACTGGACTAGCAATTGCTAAGAGCATTAAATTAGCAAGCGAAGCCAAATTGTT
CTGTAGGGAAATTGCTGGGCATGTTGAATTTTACAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTCGCTGCGGTCTCTTTCTCTCAGTCCGGAGCGTCTCGAAGTCTTTTGCACGGAGGTTCATCTTTCAATCAATTGCTGCCCGTTGCTTCAATTTCAGCTCGCCA
ATTTGGTTCATTCAACTCCAATTCTCTTTTGGTGTGTGGGTTGTGCAGAACGCTTCGTCAGAGTAATTCTTTGGTTGAAACTGCCATTATGTCAACAATGAACAACATCT
CCATTGCTAGAATATGCTGCAGACACTCTAGAAAAAATGCAAGACTGTACTTAAAACGAAATCATGAGATTGCTTCAAGACCCTTTTCAACTTGTGTCTCGCCATCTAGT
TCCACAAAAAATCCTCTGGTCATCTGGTTACCCTCACCTTTGGTTTTGGCTTCCCAGGCCAACCAATCAGTTGCCCCTCAGCGTTCAGAAGAATGGTTTGCACTAAGGAG
AGACAAGCTGACAACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGACGCATTGAGCTATGGCATGAGAAAGTATTCCCTTCAGAGATTCAAAAAACAG
AAGCACCACAACAAAATGCCATGGAGTGGGGTGTGCTCAATGAAGTAAACGCCATTGATCGATATAAAGGCATAACAGGTCGAGATGTAAGCTTGTTAGGGTTTGCAACT
CACTCAGAACAGCAATTTGACTGGCTCGGAGCCTCCCCCGACGGCCTATTGGAATGCTTTCAAGGTGGTGGAATCTTGGAAGTAAAATGTCCATACAACAAGGGAAAGCC
TGAGAAGGGACTGCCCTGGTCGACCATACCTTTCTATTACATGCCACAAGTACAGGGTCAAATGGAGATAATGGGTAGAGAATGGGCGGATCTATATTGCTGGACACCAA
ATGGAAGCACCATATTTCGTGTGTGTAGGGAACGTGGTTATTGGGATTTGATACGTGAAATATTGAGGGAATTTTGGTGGGAAAACGTTGTTCCTGCAAAAGAGGCTTTA
TTGTTGGGAAGTGAGGAAAAGGCGAAGTCATATAAGCCAACGTCCACGCACAAGCAAACTGGACTAGCAATTGCTAAGAGCATTAAATTAGCAAGCGAAGCCAAATTGTT
CTGTAGGGAAATTGCTGGGCATGTTGAATTTTACAGATAA
Protein sequenceShow/hide protein sequence
MKFAAVSFSQSGASRSLLHGGSSFNQLLPVASISARQFGSFNSNSLLVCGLCRTLRQSNSLVETAIMSTMNNISIARICCRHSRKNARLYLKRNHEIASRPFSTCVSPSS
STKNPLVIWLPSPLVLASQANQSVAPQRSEEWFALRRDKLTTSTFSTALGFWKGNRRIELWHEKVFPSEIQKTEAPQQNAMEWGVLNEVNAIDRYKGITGRDVSLLGFAT
HSEQQFDWLGASPDGLLECFQGGGILEVKCPYNKGKPEKGLPWSTIPFYYMPQVQGQMEIMGREWADLYCWTPNGSTIFRVCRERGYWDLIREILREFWWENVVPAKEAL
LLGSEEKAKSYKPTSTHKQTGLAIAKSIKLASEAKLFCREIAGHVEFYR