; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g00800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g00800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRestriction endonuclease type II-like protein
Genome locationchr8:474382..476418
RNA-Seq ExpressionMoc08g00800
SyntenyMoc08g00800
Gene Ontology termsNA
InterPro domainsIPR011335 - Restriction endonuclease type II-like
IPR011604 - Exonuclease, phage-type/RecB, C-terminal
IPR017482 - Putative phage-type endonuclease
IPR019080 - YqaJ viral recombinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020121.1 hypothetical protein SDJN02_16803 [Cucurbita argyrosperma subsp. argyrosperma]7.1e-18284.7Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
        MKLAAVSFSQSGASR FLH   SFNRLP VAS SA +VDAF STSL VCGFCRT HQ NSSINT  +STMS TSI+RICCR P  NARL SKRK  +GSR
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR

Query:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
        TFST  S   S TNPL+ R PSALI+ASQVTPSDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELWHEKVFP EI+K EA Q+ AMEWGVLNE  
Subjt:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR

Query:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        VCRERGYWEL+ EMLREFWWENVVPAREALS GRE+EV+SYKPTSTHKQTG+AIAKSIKLAS+AKL+ REIAGHVEFYR
Subjt:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

XP_022131520.1 uncharacterized protein LOC111004694 isoform X1 [Momordica charantia]8.6e-220100Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRT
        MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRT
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRT

Query:  FSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATA
        FSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATA
Subjt:  FSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATA

Query:  IDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRV
        IDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRV
Subjt:  IDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRV

Query:  CRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        CRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
Subjt:  CRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

XP_022131521.1 uncharacterized protein LOC111004694 isoform X2 [Momordica charantia]5.4e-182100Show/hide
Query:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
        MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
Subjt:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR

Query:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
        RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
Subjt:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM

Query:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL
        PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL
Subjt:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL

Query:  MFREIAGHVEFYR
        MFREIAGHVEFYR
Subjt:  MFREIAGHVEFYR

XP_023537251.1 uncharacterized protein LOC111798383 isoform X1 [Cucurbita pepo subsp. pepo]5.4e-18284.43Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
        MKLAAVSFSQSGASR FLH   SFNRLPRVAS SA +VDAF STSL VCGFCRT HQ NSSI+T  +STMS TSI+RICCRHP  NARL SKRK  +GSR
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR

Query:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
        TFST  S   S TNPL+ R PSALI+ASQVTPSDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELWHEKVFP EI+K EA Q+ AMEWGVLNE  
Subjt:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR

Query:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        VCRERGYWEL+ EMLREFWWENVVPAREA S GRE+EV+SYKPTSTHK TG+AIAKSIKLAS+AKL+ REIAGHVEFYR
Subjt:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

XP_038884042.1 uncharacterized protein LOC120074988 isoform X1 [Benincasa hispida]9.2e-19087.6Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
        MKLAAVSFS++GAS+  LHGG SFNR PRVAS +A +VDAF STSLLVCG CRTLH  NSS+ T  MSTM+ TSISRICCRH  +NARL  +RKHGSGSR
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR

Query:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
        TFSTC S SSS TNPLV R PSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFP EIQK EA Q+ AMEWGVLNEA 
Subjt:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
        AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQMEIMDREW DLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR

Query:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        VCRERGYW+L+REMLREFWWENVVPAREAL LGREE+VKSYKPTSTHKQTGLAIAKSIKLASEAKL+ REIAGH+EFYR
Subjt:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

TrEMBL top hitse value%identityAlignment
A0A0A0LNG4 YqaJ domain-containing protein5.5e-17280.47Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
        MK AAVSFSQSGASR  LHGG SFN+L  VAS+SA +  +F S SLLVCG CRTL Q +S + T  MSTM+  SI+RICCRH   NARL  KR H   SR
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR

Query:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
         FSTC S SSST NPLV   PS L+LASQ   S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRR ELWHEKVFP EIQKTEA Q+ AMEWGVLNE  
Subjt:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
        AIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKG+PEKGLPWST+PFYYMPQVQGQMEIM REW DLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR

Query:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        VCRERGYW+L+RE+LREFWWENVVPA+EAL LG EE+ KSYKPTSTHKQTGLAIAKSIKLASEAKL  REIAGHVEFYR
Subjt:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

A0A6J1BPQ6 uncharacterized protein LOC111004694 isoform X22.6e-182100Show/hide
Query:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
        MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
Subjt:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR

Query:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
        RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
Subjt:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM

Query:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL
        PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL
Subjt:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL

Query:  MFREIAGHVEFYR
        MFREIAGHVEFYR
Subjt:  MFREIAGHVEFYR

A0A6J1BPX6 uncharacterized protein LOC111004694 isoform X14.1e-220100Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRT
        MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRT
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRT

Query:  FSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATA
        FSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATA
Subjt:  FSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATA

Query:  IDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRV
        IDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRV
Subjt:  IDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRV

Query:  CRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        CRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
Subjt:  CRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

A0A6J1GHX8 uncharacterized protein LOC111454092 isoform X11.7e-18184.43Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
        MKLAAVSFSQSGASR FLH   SFNRLP VAS SA +VDAF STS  VCGFCRT HQ NSSINT  +STMS TSI+RICCR P  NARL SKRK  +GSR
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR

Query:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
        TFST  S   S TNPL+ R PSALI+ASQVTPSDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELWHEKVFP EI+K EA Q+ AMEWGVLNE  
Subjt:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR

Query:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        VCRERGYWEL+ EMLREFWWENVVPAREALS GRE+EV+SYKPTSTHKQTG+AIAKSIKLAS+AKL+ REIAGHVEFYR
Subjt:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

A0A6J1KS64 uncharacterized protein LOC111496007 isoform X12.2e-18183.91Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
        MKLAAVSFSQS ASR FLH   SFNRLPRVAS S P+VDAF STSL VC FCRT HQ NSSI+T  +STMS TSI+RICC HP  NARL SKRK  +GSR
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR

Query:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
        TFST  S   S TNPL+ R PSALI+ASQVTPSDAPQRSEEWFALRRD+LTTSTFSTALGFWKG+RRFELWHEKVFP EI+K EA Q+ AMEWGVLNE  
Subjt:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ +WLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR

Query:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        VCRERGYWEL+ EMLREFWWENVVPAREALSLGREEEV+SYKPTSTHKQTG+AI+KSIKLAS+AKL+ REIAGHVEFYR
Subjt:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13810.1 Restriction endonuclease, type II-like superfamily protein7.4e-4436.84Show/hide
Query:  EEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGL
        + W  LR++RLT S F+ A+GF    RR  LW EK+      K  A  R A  W + NE  A++RY  +TG ++ +  F  +      + +WLGASPDG+
Subjt:  EEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGL

Query:  LGCFQGG----GILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALS-
        +   + G    G+LEVKCP++     K  PW  +P+  +PQ+QG MEI+D +W+DLYCWT NGS++FRV R+  +WE M+  L +FW  +V+PARE  + 
Subjt:  LGCFQGG----GILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALS-

Query:  ---LGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHV
              + +++ +KP   H+     +  + ++++ A  +F EI G++
Subjt:  ---LGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHV

AT1G67660.1 Restriction endonuclease, type II-like superfamily protein1.2e-10558.23Show/hide
Query:  CRTLHQRNSSINT-TMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLT
        CR L     ++N+  +S M   SIS      P  +  +SS+++    S   S  T + S   +P      S++I++S ++PSD PQ+SEEWFALR+D+LT
Subjt:  CRTLHQRNSSINT-TMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLT

Query:  TSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCP
        TSTFSTALGFWKGNRR ELWHEKV+  + +  E S R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GILEVKCP
Subjt:  TSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCP

Query:  YNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYKPTSTHKQT
        YNKG+ E  LPW  +P+YYMPQ+QGQMEIMDREWV+LYCWT NGST+FRV R+R YW ++ ++LREFWWE+V+PAREAL LG+E EEVK Y+PTSTHK+T
Subjt:  YNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYKPTSTHKQT

Query:  GLAIAKSIKLASEAKLMFREIAGHVEFY
         LAIAKS+ LA+E+KL+ REIA HVEF+
Subjt:  GLAIAKSIKLASEAKLMFREIAGHVEFY

AT1G67660.2 Restriction endonuclease, type II-like superfamily protein9.8e-10559.74Show/hide
Query:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
        +S M   SIS      P  +  +SS+++    S   S  T + S   +P      S++I++S ++PSD PQ+SEEWFALR+D+LTTSTFSTALGFWKGNR
Subjt:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR

Query:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
        R ELWHEKV+  + +  E S R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GILEVKCPYNKG+ E  LPW  +
Subjt:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM

Query:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYKPTSTHKQTGLAIAKSIKLASEAK
        P+YYMPQ+QGQMEIMDREWV+LYCWT NGST+FRV R+R YW ++ ++LREFWWE+V+PAREAL LG+E EEVK Y+PTSTHK+T LAIAKS+ LA+E+K
Subjt:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYKPTSTHKQTGLAIAKSIKLASEAK

Query:  LMFREIAGHVEFY
        L+ REIA HVEF+
Subjt:  LMFREIAGHVEFY

AT1G67660.3 Restriction endonuclease, type II-like superfamily protein6.8e-10657.74Show/hide
Query:  TSLLVCGFCRTLHQRNSSINT-TMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWF
        T + VC  CR L     ++N+  +S M   SIS      P  +  +SS+++    S   S  T + S   +P      S++I++S ++PSD PQ+SEEWF
Subjt:  TSLLVCGFCRTLHQRNSSINT-TMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWF

Query:  ALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGG
        ALR+D+LTTSTFSTALGFWKGNRR ELWHEKV+  + +  E S R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   
Subjt:  ALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGG

Query:  GILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYK
        GILEVKCPYNKG+ E  LPW  +P+YYMPQ+QGQMEIMDREWV+LYCWT NGST+FRV R+R YW ++ ++LREFWWE+V+PAREAL LG+E EEVK Y+
Subjt:  GILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYK

Query:  PTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFY
        PTSTHK+T LAIAKS+ LA+E+KL+ REIA HVEF+
Subjt:  PTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTCGCTGCTGTCTCTTTCTCTCAATCTGGAGCATCCCGAGGTTTTCTTCACGGAGGTCCCTCTTTCAATCGTTTGCCGCGCGTCGCTTCATTATCAGCTCCCCG
AGTTGATGCGTTCCGCTCAACTTCTCTTTTGGTCTGTGGGTTTTGCAGGACGCTTCATCAACGAAACTCTTCAATCAATACTACGATGTCAACGATGAGCAAAACCTCCA
TTTCTAGAATCTGCTGTAGACACCCTGGACTGAATGCGAGACTCTCCTCAAAACGAAAGCATGGGAGTGGTTCAAGAACTTTTTCAACATGCACCTCATCGTCGAGCTCC
ACAACGAACCCTCTGGTCACCCGTTTCCCCTCAGCTTTGATTTTGGCCTCCCAGGTCACCCCATCTGATGCACCTCAGCGTTCAGAAGAATGGTTTGCACTACGGAGAGA
CAGGTTGACTACAAGCACCTTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGTCGCTTCGAGCTATGGCATGAGAAAGTGTTTCCTTATGAGATTCAAAAAACGGAAG
CATCGCAACGGTGTGCCATGGAGTGGGGTGTGCTCAATGAAGCAACTGCAATTGATCGGTACAAAAGCATCACAGGCCGAGACGTAAGCTTGTTAGGGTTTGCAACTCAC
TCAGAACAGCAATTCGATTGGCTCGGCGCCTCCCCCGATGGCCTATTGGGATGCTTTCAAGGAGGAGGGATCCTGGAAGTAAAATGTCCATACAACAAGGGAAGGCCAGA
GAAGGGACTACCCTGGTCAACAATGCCATTCTACTACATGCCACAGGTTCAGGGTCAAATGGAGATAATGGACAGAGAGTGGGTGGATCTATATTGCTGGACACCAAATG
GAAGCACAATATTTCGTGTGTGTCGGGAGCGGGGTTATTGGGAACTGATGCGTGAAATGTTAAGGGAATTTTGGTGGGAAAACGTCGTTCCTGCGAGAGAGGCTCTATCT
TTGGGAAGAGAGGAAGAGGTAAAGTCATATAAGCCAACATCCACGCATAAGCAGACTGGACTAGCAATAGCCAAGAGCATCAAATTAGCAAGCGAGGCCAAGTTGATGTT
TAGGGAGATAGCTGGACATGTTGAATTTTACCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTCGCTGCTGTCTCTTTCTCTCAATCTGGAGCATCCCGAGGTTTTCTTCACGGAGGTCCCTCTTTCAATCGTTTGCCGCGCGTCGCTTCATTATCAGCTCCCCG
AGTTGATGCGTTCCGCTCAACTTCTCTTTTGGTCTGTGGGTTTTGCAGGACGCTTCATCAACGAAACTCTTCAATCAATACTACGATGTCAACGATGAGCAAAACCTCCA
TTTCTAGAATCTGCTGTAGACACCCTGGACTGAATGCGAGACTCTCCTCAAAACGAAAGCATGGGAGTGGTTCAAGAACTTTTTCAACATGCACCTCATCGTCGAGCTCC
ACAACGAACCCTCTGGTCACCCGTTTCCCCTCAGCTTTGATTTTGGCCTCCCAGGTCACCCCATCTGATGCACCTCAGCGTTCAGAAGAATGGTTTGCACTACGGAGAGA
CAGGTTGACTACAAGCACCTTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGTCGCTTCGAGCTATGGCATGAGAAAGTGTTTCCTTATGAGATTCAAAAAACGGAAG
CATCGCAACGGTGTGCCATGGAGTGGGGTGTGCTCAATGAAGCAACTGCAATTGATCGGTACAAAAGCATCACAGGCCGAGACGTAAGCTTGTTAGGGTTTGCAACTCAC
TCAGAACAGCAATTCGATTGGCTCGGCGCCTCCCCCGATGGCCTATTGGGATGCTTTCAAGGAGGAGGGATCCTGGAAGTAAAATGTCCATACAACAAGGGAAGGCCAGA
GAAGGGACTACCCTGGTCAACAATGCCATTCTACTACATGCCACAGGTTCAGGGTCAAATGGAGATAATGGACAGAGAGTGGGTGGATCTATATTGCTGGACACCAAATG
GAAGCACAATATTTCGTGTGTGTCGGGAGCGGGGTTATTGGGAACTGATGCGTGAAATGTTAAGGGAATTTTGGTGGGAAAACGTCGTTCCTGCGAGAGAGGCTCTATCT
TTGGGAAGAGAGGAAGAGGTAAAGTCATATAAGCCAACATCCACGCATAAGCAGACTGGACTAGCAATAGCCAAGAGCATCAAATTAGCAAGCGAGGCCAAGTTGATGTT
TAGGGAGATAGCTGGACATGTTGAATTTTACCGATGA
Protein sequenceShow/hide protein sequence
MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSS
TTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATH
SEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALS
LGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR