; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0078 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0078
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionRestriction endonuclease type II-like protein
Genome locationMC08:474725..476968
RNA-Seq ExpressionMC08g0078
SyntenyMC08g0078
Gene Ontology termsNA
InterPro domainsIPR011335 - Restriction endonuclease type II-like
IPR011604 - Exonuclease, phage-type/RecB, C-terminal
IPR017482 - Putative phage-type endonuclease
IPR019080 - YqaJ viral recombinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020121.1 hypothetical protein SDJN02_16803 [Cucurbita argyrosperma subsp. argyrosperma]1.38e-22984.47Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTM-STMSKTSISRICCRHPGLNARLSSKRKHGSGS
        MKLAAVSFSQSGASR FLH   SFNRLP VAS SA +VDAF STSL +VCGFCRT HQ NSSINT + STMS TSI+RICCR P  NARL SKRK  +GS
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTM-STMSKTSISRICCRHPGLNARLSSKRKHGSGS

Query:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA
        RTFST  S   S TNPL+ R PSALI+ASQVTPSDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELWHEKVFP EI+K EA Q+ AMEWGVLNE 
Subjt:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA

Query:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF
         AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIF
Subjt:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF

Query:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        RVCRERGYWEL+ EMLREFWWENVVPAREALS GRE+EV+SYKPTSTHKQTG+AIAKSIKLAS+AKL+ REIAGHVEFYR
Subjt:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

XP_022131520.1 uncharacterized protein LOC111004694 isoform X1 [Momordica charantia]5.62e-27999.74Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
        MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLL VCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR

Query:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
        TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
Subjt:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
        AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR

Query:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
Subjt:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

XP_022131521.1 uncharacterized protein LOC111004694 isoform X2 [Momordica charantia]7.11e-232100Show/hide
Query:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
        MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
Subjt:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR

Query:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
        RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
Subjt:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM

Query:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL
        PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL
Subjt:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL

Query:  MFREIAGHVEFYR
        MFREIAGHVEFYR
Subjt:  MFREIAGHVEFYR

XP_023537251.1 uncharacterized protein LOC111798383 isoform X1 [Cucurbita pepo subsp. pepo]9.70e-23084.21Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTM-STMSKTSISRICCRHPGLNARLSSKRKHGSGS
        MKLAAVSFSQSGASR FLH   SFNRLPRVAS SA +VDAF STSL +VCGFCRT HQ NSSI+T + STMS TSI+RICCRHP  NARL SKRK  +GS
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTM-STMSKTSISRICCRHPGLNARLSSKRKHGSGS

Query:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA
        RTFST  S   S TNPL+ R PSALI+ASQVTPSDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELWHEKVFP EI+K EA Q+ AMEWGVLNE 
Subjt:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA

Query:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF
         AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIF
Subjt:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF

Query:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        RVCRERGYWEL+ EMLREFWWENVVPAREA S GRE+EV+SYKPTSTHK TG+AIAKSIKLAS+AKL+ REIAGHVEFYR
Subjt:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

XP_038884042.1 uncharacterized protein LOC120074988 isoform X1 [Benincasa hispida]1.67e-23987.37Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGS
        MKLAAVSFS++GAS+  LHGG SFNR PRVAS +A +VDAF STSLL VCG CRTLH  NSS+ T  MSTM+ TSISRICCRH  +NARL  +RKHGSGS
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGS

Query:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA
        RTFSTC S SSS TNPLV R PSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFP EIQK EA Q+ AMEWGVLNEA
Subjt:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA

Query:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF
         AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQMEIMDREW DLYCWTPNGSTIF
Subjt:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF

Query:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        RVCRERGYW+L+REMLREFWWENVVPAREAL LGREE+VKSYKPTSTHKQTGLAIAKSIKLASEAKL+ REIAGH+EFYR
Subjt:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

TrEMBL top hitse value%identityAlignment
A0A0A0LNG4 YqaJ domain-containing protein4.81e-21680.26Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGS
        MK AAVSFSQSGASR  LHGG SFN+L  VAS+SA +  +F S SLL VCG CRTL Q +S + T  MSTM+  SI+RICCRH   NARL  KR H   S
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTT-MSTMSKTSISRICCRHPGLNARLSSKRKHGSGS

Query:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA
        R FSTC S SSST NPLV   PS L+LASQ   S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRR ELWHEKVFP EIQKTEA Q+ AMEWGVLNE 
Subjt:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA

Query:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF
         AIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKG+PEKGLPWST+PFYYMPQVQGQMEIM REW DLYCWTPNGSTIF
Subjt:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF

Query:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        RVCRERGYW+L+RE+LREFWWENVVPA+EAL LG EE+ KSYKPTSTHKQTGLAIAKSIKLASEAKL  REIAGHVEFYR
Subjt:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

A0A6J1BPQ6 uncharacterized protein LOC111004694 isoform X23.44e-232100Show/hide
Query:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
        MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
Subjt:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR

Query:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
        RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
Subjt:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM

Query:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL
        PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL
Subjt:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL

Query:  MFREIAGHVEFYR
        MFREIAGHVEFYR
Subjt:  MFREIAGHVEFYR

A0A6J1BPX6 uncharacterized protein LOC111004694 isoform X12.72e-27999.74Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
        MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLL VCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSR

Query:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
        TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT
Subjt:  TFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEAT

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
        AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFR

Query:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
Subjt:  VCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

A0A6J1GHX8 uncharacterized protein LOC111454092 isoform X15.47e-22984.21Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTM-STMSKTSISRICCRHPGLNARLSSKRKHGSGS
        MKLAAVSFSQSGASR FLH   SFNRLP VAS SA +VDAF STS  +VCGFCRT HQ NSSINT + STMS TSI+RICCR P  NARL SKRK  +GS
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTM-STMSKTSISRICCRHPGLNARLSSKRKHGSGS

Query:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA
        RTFST  S   S TNPL+ R PSALI+ASQVTPSDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELWHEKVFP EI+K EA Q+ AMEWGVLNE 
Subjt:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA

Query:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF
         AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIF
Subjt:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF

Query:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        RVCRERGYWEL+ EMLREFWWENVVPAREALS GRE+EV+SYKPTSTHKQTG+AIAKSIKLAS+AKL+ REIAGHVEFYR
Subjt:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

A0A6J1KS64 uncharacterized protein LOC111496007 isoform X17.77e-22983.68Show/hide
Query:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTM-STMSKTSISRICCRHPGLNARLSSKRKHGSGS
        MKLAAVSFSQS ASR FLH   SFNRLPRVAS S P+VDAF STSL +VC FCRT HQ NSSI+T + STMS TSI+RICC HP  NARL SKRK  +GS
Subjt:  MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTM-STMSKTSISRICCRHPGLNARLSSKRKHGSGS

Query:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA
        RTFST  S   S TNPL+ R PSALI+ASQVTPSDAPQRSEEWFALRRD+LTTSTFSTALGFWKG+RRFELWHEKVFP EI+K EA Q+ AMEWGVLNE 
Subjt:  RTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEA

Query:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF
         AI RYKSITGRDVS LGFATHSEQQ +WLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIF
Subjt:  TAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIF

Query:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR
        RVCRERGYWEL+ EMLREFWWENVVPAREALSLGREEEV+SYKPTSTHKQTG+AI+KSIKLAS+AKL+ REIAGHVEFYR
Subjt:  RVCRERGYWELMREMLREFWWENVVPAREALSLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13810.1 Restriction endonuclease, type II-like superfamily protein7.4e-4436.84Show/hide
Query:  EEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGL
        + W  LR++RLT S F+ A+GF    RR  LW EK+      K  A  R A  W + NE  A++RY  +TG ++ +  F  +      + +WLGASPDG+
Subjt:  EEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGL

Query:  LGCFQGG----GILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALS-
        +   + G    G+LEVKCP++     K  PW  +P+  +PQ+QG MEI+D +W+DLYCWT NGS++FRV R+  +WE M+  L +FW  +V+PARE  + 
Subjt:  LGCFQGG----GILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALS-

Query:  ---LGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHV
              + +++ +KP   H+     +  + ++++ A  +F EI G++
Subjt:  ---LGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHV

AT1G67660.1 Restriction endonuclease, type II-like superfamily protein8.8e-10657.96Show/hide
Query:  AVCGFCRTLHQRNSSINT-TMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALR
        +VC  CR L     ++N+  +S M   SIS      P  +  +SS+++    S   S  T + S   +P      S++I++S ++PSD PQ+SEEWFALR
Subjt:  AVCGFCRTLHQRNSSINT-TMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALR

Query:  RDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGIL
        +D+LTTSTFSTALGFWKGNRR ELWHEKV+  + +  E S R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GIL
Subjt:  RDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGIL

Query:  EVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYKPTS
        EVKCPYNKG+ E  LPW  +P+YYMPQ+QGQMEIMDREWV+LYCWT NGST+FRV R+R YW ++ ++LREFWWE+V+PAREAL LG+E EEVK Y+PTS
Subjt:  EVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYKPTS

Query:  THKQTGLAIAKSIKLASEAKLMFREIAGHVEFY
        THK+T LAIAKS+ LA+E+KL+ REIA HVEF+
Subjt:  THKQTGLAIAKSIKLASEAKLMFREIAGHVEFY

AT1G67660.2 Restriction endonuclease, type II-like superfamily protein9.8e-10559.74Show/hide
Query:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR
        +S M   SIS      P  +  +SS+++    S   S  T + S   +P      S++I++S ++PSD PQ+SEEWFALR+D+LTTSTFSTALGFWKGNR
Subjt:  MSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNR

Query:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM
        R ELWHEKV+  + +  E S R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GILEVKCPYNKG+ E  LPW  +
Subjt:  RFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTM

Query:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYKPTSTHKQTGLAIAKSIKLASEAK
        P+YYMPQ+QGQMEIMDREWV+LYCWT NGST+FRV R+R YW ++ ++LREFWWE+V+PAREAL LG+E EEVK Y+PTSTHK+T LAIAKS+ LA+E+K
Subjt:  PFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYKPTSTHKQTGLAIAKSIKLASEAK

Query:  LMFREIAGHVEFY
        L+ REIA HVEF+
Subjt:  LMFREIAGHVEFY

AT1G67660.3 Restriction endonuclease, type II-like superfamily protein8.8e-10657.44Show/hide
Query:  SLLAVCGFCRTLHQRNSSINT-TMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWF
        + + VC  CR L     ++N+  +S M   SIS      P  +  +SS+++    S   S  T + S   +P      S++I++S ++PSD PQ+SEEWF
Subjt:  SLLAVCGFCRTLHQRNSSINT-TMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSSSTTNPLVTRFPSALILASQVTPSDAPQRSEEWF

Query:  ALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGG
        ALR+D+LTTSTFSTALGFWKGNRR ELWHEKV+  + +  E S R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   
Subjt:  ALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGG

Query:  GILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYK
        GILEVKCPYNKG+ E  LPW  +P+YYMPQ+QGQMEIMDREWV+LYCWT NGST+FRV R+R YW ++ ++LREFWWE+V+PAREAL LG+E EEVK Y+
Subjt:  GILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREALSLGRE-EEVKSYK

Query:  PTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFY
        PTSTHK+T LAIAKS+ LA+E+KL+ REIA HVEF+
Subjt:  PTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTCGCTGCTGTCTCTTTCTCTCAATCTGGAGCATCCCGAGGTTTTCTTCACGGAGGTCCCTCTTTCAATCGTTTGCCGCGCGTCGCTTCATTATCAGCTCCCCG
AGTTGATGCGTTCCGCTCAACTTCTCTTTTGGCAGTCTGTGGGTTTTGCAGGACGCTTCATCAACGAAACTCTTCAATCAATACTACGATGTCAACGATGAGCAAAACCT
CCATTTCTAGAATCTGCTGTAGACACCCTGGACTGAATGCGAGACTCTCCTCAAAACGAAAGCATGGGAGTGGTTCAAGAACTTTTTCAACATGCACCTCATCGTCGAGC
TCCACAACGAACCCTCTGGTCACCCGTTTCCCCTCAGCTTTGATTTTGGCCTCCCAGGTCACCCCATCTGATGCACCTCAGCGTTCAGAAGAATGGTTTGCACTACGGAG
AGACAGGTTGACTACAAGCACCTTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGTCGCTTCGAGCTATGGCATGAGAAAGTGTTTCCTTATGAGATTCAAAAAACGG
AAGCATCGCAACGGTGTGCCATGGAGTGGGGTGTGCTCAATGAAGCAACTGCAATTGATCGGTACAAAAGCATCACAGGCCGAGACGTAAGCTTGTTAGGGTTTGCAACT
CACTCAGAACAGCAATTCGATTGGCTCGGCGCCTCCCCCGATGGCCTATTGGGATGCTTTCAAGGAGGAGGGATCCTGGAAGTAAAATGTCCATACAACAAGGGAAGGCC
AGAGAAGGGACTACCCTGGTCAACAATGCCATTCTACTACATGCCACAGGTTCAGGGTCAAATGGAGATAATGGACAGAGAGTGGGTGGATCTATATTGCTGGACACCAA
ATGGAAGCACAATATTTCGTGTGTGTCGGGAGCGGGGTTATTGGGAACTGATGCGTGAAATGTTAAGGGAATTTTGGTGGGAAAACGTCGTTCCTGCGAGAGAGGCTCTA
TCTTTGGGAAGAGAGGAAGAGGTAAAGTCATATAAGCCAACATCCACGCATAAGCAGACTGGACTAGCAATAGCCAAGAGCATCAAATTAGCAAGCGAGGCCAAGTTGAT
GTTTAGGGAGATAGCTGGACATGTTGAATTTTACCGA
mRNA sequenceShow/hide mRNA sequence
AGGCTGTGTAAAATGTCAATACGAACATTCCTTCAAACAATATATCGAGCAAGTTTTGCAGACGGCCCAAACTTTTGACGGAATCCCTCCCCCTCTACCCTGGACCTGTA
CACCAAATTCCATTGCCCCGCATCGGCGCAATTAGCAGCTTGAACTTGAAGATTCATTTCTATGGCGCCCGTCTCCTCTCTTGTTTGACCTGCTGAAACGATGAAGCTCG
CTGCTGTCTCTTTCTCTCAATCTGGAGCATCCCGAGGTTTTCTTCACGGAGGTCCCTCTTTCAATCGTTTGCCGCGCGTCGCTTCATTATCAGCTCCCCGAGTTGATGCG
TTCCGCTCAACTTCTCTTTTGGCAGTCTGTGGGTTTTGCAGGACGCTTCATCAACGAAACTCTTCAATCAATACTACGATGTCAACGATGAGCAAAACCTCCATTTCTAG
AATCTGCTGTAGACACCCTGGACTGAATGCGAGACTCTCCTCAAAACGAAAGCATGGGAGTGGTTCAAGAACTTTTTCAACATGCACCTCATCGTCGAGCTCCACAACGA
ACCCTCTGGTCACCCGTTTCCCCTCAGCTTTGATTTTGGCCTCCCAGGTCACCCCATCTGATGCACCTCAGCGTTCAGAAGAATGGTTTGCACTACGGAGAGACAGGTTG
ACTACAAGCACCTTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGTCGCTTCGAGCTATGGCATGAGAAAGTGTTTCCTTATGAGATTCAAAAAACGGAAGCATCGCA
ACGGTGTGCCATGGAGTGGGGTGTGCTCAATGAAGCAACTGCAATTGATCGGTACAAAAGCATCACAGGCCGAGACGTAAGCTTGTTAGGGTTTGCAACTCACTCAGAAC
AGCAATTCGATTGGCTCGGCGCCTCCCCCGATGGCCTATTGGGATGCTTTCAAGGAGGAGGGATCCTGGAAGTAAAATGTCCATACAACAAGGGAAGGCCAGAGAAGGGA
CTACCCTGGTCAACAATGCCATTCTACTACATGCCACAGGTTCAGGGTCAAATGGAGATAATGGACAGAGAGTGGGTGGATCTATATTGCTGGACACCAAATGGAAGCAC
AATATTTCGTGTGTGTCGGGAGCGGGGTTATTGGGAACTGATGCGTGAAATGTTAAGGGAATTTTGGTGGGAAAACGTCGTTCCTGCGAGAGAGGCTCTATCTTTGGGAA
GAGAGGAAGAGGTAAAGTCATATAAGCCAACATCCACGCATAAGCAGACTGGACTAGCAATAGCCAAGAGCATCAAATTAGCAAGCGAGGCCAAGTTGATGTTTAGGGAG
ATAGCTGGACATGTTGAATTTTACCGA
Protein sequenceShow/hide protein sequence
MKLAAVSFSQSGASRGFLHGGPSFNRLPRVASLSAPRVDAFRSTSLLAVCGFCRTLHQRNSSINTTMSTMSKTSISRICCRHPGLNARLSSKRKHGSGSRTFSTCTSSSS
STTNPLVTRFPSALILASQVTPSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWHEKVFPYEIQKTEASQRCAMEWGVLNEATAIDRYKSITGRDVSLLGFAT
HSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGRPEKGLPWSTMPFYYMPQVQGQMEIMDREWVDLYCWTPNGSTIFRVCRERGYWELMREMLREFWWENVVPAREAL
SLGREEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLMFREIAGHVEFYR