; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004697 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004697
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionYqaJ domain-containing protein
Genome locationchr6:6188496..6190701
RNA-Seq ExpressionLag0004697
SyntenyLag0004697
Gene Ontology termsNA
InterPro domainsIPR011335 - Restriction endonuclease type II-like
IPR011604 - Exonuclease, phage-type/RecB, C-terminal
IPR017482 - Putative phage-type endonuclease
IPR019080 - YqaJ viral recombinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020121.1 hypothetical protein SDJN02_16803 [Cucurbita argyrosperma subsp. argyrosperma]4.1e-18884.05Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MKLAAVSFSQ+GASRSFLH  SSFNRLP VASFSARQVDAFSSTSL V            CG      +T HQSNSS++TA +STMSNTS+ARICCR PR
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
        SNA LFSKRK  +GSRT ST  SP  S+TNPL+IRLPSALI+ASQVTPSDAPQRSEEWFALRR++LTTSTFSTALGFWKGNRRFELWHEKVFPSEI+KPE
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        A Q++AMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCF GGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        WADLYCWTPNGSTIFRVCRERGYWELI EML+EFWW+NVVPAREALS GRE+EV+SYKPTSTHKQTG+AIA+SIKLAS+AKLLCREIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

XP_022131520.1 uncharacterized protein LOC111004694 isoform X1 [Momordica charantia]1.4e-18884.81Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MKLAAVSFSQ+GASR FLHGG SFNRLPRVAS SA +VDAF STSLLV            CG      +TLHQ NSS++T TMSTMS TS++RICCRHP 
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
         NA L SKRKHGSGSRT STC+S  SS TNPLV R PSALILASQVTPSDAPQRSEEWFALRR+RLTTSTFSTALGFWKGNRRFELWHEKVFP EIQK E
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        A QR AMEWGVLNEA AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF GGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDRE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        W DLYCWTPNGSTIFRVCRERGYWEL+REML+EFWW+NVVPAREALSLGREEEVKSYKPTSTHKQTGLAIA+SIKLASEAKL+ REIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

XP_022951164.1 uncharacterized protein LOC111454092 isoform X1 [Cucurbita moschata]2.0e-18783.8Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MKLAAVSFSQ+GASRSFLH  SSFNRLP VASFSARQVDAFSSTS  V            CG      +T HQSNSS++TA +STMSNTS+ARICCR PR
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
        SNA LFSKRK  +GSRT ST  SP  S+TNPL+IRLPSALI+ASQVTPSDAPQRSEEWFALRR++LTTSTFSTALGFWKGNRRFELWHEKVFPSEI+KPE
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        A Q++AMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCF GGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        WADLYCWTPNGSTIFRVCRERGYWELI EML+EFWW+NVVPAREALS GRE+EV+SYKPTSTHKQTG+AIA+SIKLAS+AKLLCREIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

XP_023537251.1 uncharacterized protein LOC111798383 isoform X1 [Cucurbita pepo subsp. pepo]2.1e-18984.3Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MKLAAVSFSQ+GASRSFLH  SSFNRLPRVASFSARQVDAFSSTSL V            CG      +T HQSNSS+DTA +STMSNTS+ARICCRHPR
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
        SNA LFSKRK  +GSRT ST  SP  S+TNPL+IRLPSALI+ASQVTPSDAPQRSEEWFALRR++LTTSTFSTALGFWKGNRRFELWHEKVFPSEI+KPE
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        A Q++AMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCF GGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        WADLYCWTPNGSTIFRVCRERGYWELI EML+EFWW+NVVPAREA S GRE+EV+SYKPTSTHK TG+AIA+SIKLAS+AKLLCREIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

XP_038884042.1 uncharacterized protein LOC120074988 isoform X1 [Benincasa hispida]3.1e-19686.84Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MKLAAVSFS+AGAS+  LHGGSSFNR PRVASF+ARQVDAFSSTSLLV            CGL     +TLH SNSSV+TA MSTM+NTS++RICCRH R
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
         NA L  +RKHGSGSRT STC+SP SSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRR+RLTTSTFSTALGFWKGNRRFELWHEKVFP EIQKPE
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        APQ++AMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF GGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQ+EIMDRE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        WADLYCWTPNGSTIFRVCRERGYW+LIREML+EFWW+NVVPAREAL LGREE+VKSYKPTSTHKQTGLAIA+SIKLASEAKLLCREIAGH+EFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

TrEMBL top hitse value%identityAlignment
A0A0A0LNG4 YqaJ domain-containing protein2.5e-17578.73Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MK AAVSFSQ+GASRS LHGGSSFN+L  VAS SARQ  +F+S SLLV            CGL     +TL QS+S V+TA MSTM+N S+ARICCRH R
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
         NA L+ KR H   SR  STC SP SS  NPLVI LPS L+LASQ   S APQRSEEWFALRR++LTTSTFSTALGFWKGNRR ELWHEKVFPSEIQK E
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        APQ++AMEWGVLNE NAIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CF GGGILEVKCPYNKGKPEKGLPWST+PFYYMPQVQGQ+EIM RE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        WADLYCWTPNGSTIFRVCRERGYW+LIRE+L+EFWW+NVVPA+EAL LG EE+ KSYKPTSTHKQTGLAIA+SIKLASEAKL CREIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

A0A1S3B9S9 uncharacterized protein LOC103487752 isoform X12.5e-17579.75Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MK AAVSFSQ+GASRS  HGGSSFN+LP VASFSAR+    +S SLLV            CGL     +TL QSN SV+ A MSTM+N S+ARICCR  R
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
         NA L+ KR  G  SR+ STC++P S  TNP VI LPS LILASQV  S APQRSEEWFALRR++LTTSTFSTALGFWKGNRRFELWHEKVFPSE QK +
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        APQ++AMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF GGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQ+EIM RE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        W+DLYCWTPNGSTIFRVCRERGYW+LIRE+LKEFWW+NVVPA+EALSLGREE+ KSYKPTSTHKQTGLAIA+SIKLASEAKLLCREIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

A0A6J1BPX6 uncharacterized protein LOC111004694 isoform X16.7e-18984.81Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MKLAAVSFSQ+GASR FLHGG SFNRLPRVAS SA +VDAF STSLLV            CG      +TLHQ NSS++T TMSTMS TS++RICCRHP 
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
         NA L SKRKHGSGSRT STC+S  SS TNPLV R PSALILASQVTPSDAPQRSEEWFALRR+RLTTSTFSTALGFWKGNRRFELWHEKVFP EIQK E
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        A QR AMEWGVLNEA AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCF GGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDRE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        W DLYCWTPNGSTIFRVCRERGYWEL+REML+EFWW+NVVPAREALSLGREEEVKSYKPTSTHKQTGLAIA+SIKLASEAKL+ REIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

A0A6J1GHX8 uncharacterized protein LOC111454092 isoform X19.7e-18883.8Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MKLAAVSFSQ+GASRSFLH  SSFNRLP VASFSARQVDAFSSTS  V            CG      +T HQSNSS++TA +STMSNTS+ARICCR PR
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
        SNA LFSKRK  +GSRT ST  SP  S+TNPL+IRLPSALI+ASQVTPSDAPQRSEEWFALRR++LTTSTFSTALGFWKGNRRFELWHEKVFPSEI+KPE
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        A Q++AMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCF GGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        WADLYCWTPNGSTIFRVCRERGYWELI EML+EFWW+NVVPAREALS GRE+EV+SYKPTSTHKQTG+AIA+SIKLAS+AKLLCREIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

A0A6J1KS64 uncharacterized protein LOC111496007 isoform X16.3e-18783.29Show/hide
Query:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR
        MKLAAVSFSQ+ ASRSFLH  SSFNRLPRVASFS  QVDAFSSTSL V            C       +T HQSNSS+DTA +STMSNTS+ARICC HPR
Subjt:  MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPR

Query:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE
        SNA LFSKRK  +GSRT ST  SP  S+TNPL+IRLPSALI+ASQVTPSDAPQRSEEWFALRR++LTTSTFSTALGFWKG+RRFELWHEKVFPSEI+KPE
Subjt:  SNAGLFSKRKHGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPE

Query:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
        A Q++AMEWGVLNE NAI RYKSITGRDVS LGFATHSEQQ +WLGASPDGLLGCF GGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE
Subjt:  APQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDRE

Query:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR
        WADLYCWTPNGSTIFRVCRERGYWELI EML+EFWW+NVVPAREALSLGREEEV+SYKPTSTHKQTG+AI++SIKLAS+AKLLCREIAGHVEFYR
Subjt:  WADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13810.1 Restriction endonuclease, type II-like superfamily protein1.7e-4336.44Show/hide
Query:  EEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGL
        + W  LR+ RLT S F+ A+GF    RR  LW EK+  +   KP A  R A  W + NE  A++RY  +TG ++ +  F  +      + +WLGASPDG+
Subjt:  EEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGL

Query:  LGCFHGG----GILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALS-
        +     G    G+LEVKCP++     K  PW  +P+  +PQ+QG +EI+D +W DLYCWT NGS++FRV R+  +WE ++  L +FW ++V+PARE  + 
Subjt:  LGCFHGG----GILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALS-

Query:  ---LGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHV
              + +++ +KP   H+     +  + ++++ A  L  EI G++
Subjt:  ---LGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHV

AT1G67660.1 Restriction endonuclease, type II-like superfamily protein1.9e-10656.1Show/hide
Query:  QTLHQSNSSVDTATMSTMSNTSVARICCRHPRSNAGLFSKRKHGSGSRTLSTCS-SPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLT
        + L  +  ++++  +S M   S++      P+S+  + S+++  S + +L T + SPF+           S++I++S ++PSD PQ+SEEWFALR+++LT
Subjt:  QTLHQSNSSVDTATMSTMSNTSVARICCRHPRSNAGLFSKRKHGSGSRTLSTCS-SPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLT

Query:  TSTFSTALGFWKGNRRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCP
        TSTFSTALGFWKGNRR ELWHEKV+ S+ +  E   R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GILEVKCP
Subjt:  TSTFSTALGFWKGNRRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCP

Query:  YNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGRE-EEVKSYKPTSTHKQT
        YNKGK E  LPW  +P+YYMPQ+QGQ+EIMDREW +LYCWT NGST+FRV R+R YW +I ++L+EFWW++V+PAREAL LG+E EEVK Y+PTSTHK+T
Subjt:  YNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGRE-EEVKSYKPTSTHKQT

Query:  GLAIARSIKLASEAKLLCREIAGHVEFY
         LAIA+S+ LA+E+KL+CREIA HVEF+
Subjt:  GLAIARSIKLASEAKLLCREIAGHVEFY

AT1G67660.2 Restriction endonuclease, type II-like superfamily protein1.9e-10658.28Show/hide
Query:  MSTMSNTSVARICCRHPRSNAGLFSKRKHGSGSRTLSTCS-SPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGN
        +S M   S++      P+S+  + S+++  S + +L T + SPF+           S++I++S ++PSD PQ+SEEWFALR+++LTTSTFSTALGFWKGN
Subjt:  MSTMSNTSVARICCRHPRSNAGLFSKRKHGSGSRTLSTCS-SPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGN

Query:  RRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWST
        RR ELWHEKV+ S+ +  E   R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GILEVKCPYNKGK E  LPW  
Subjt:  RRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWST

Query:  MPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGRE-EEVKSYKPTSTHKQTGLAIARSIKLASEA
        +P+YYMPQ+QGQ+EIMDREW +LYCWT NGST+FRV R+R YW +I ++L+EFWW++V+PAREAL LG+E EEVK Y+PTSTHK+T LAIA+S+ LA+E+
Subjt:  MPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGRE-EEVKSYKPTSTHKQTGLAIARSIKLASEA

Query:  KLLCREIAGHVEFY
        KL+CREIA HVEF+
Subjt:  KLLCREIAGHVEFY

AT1G67660.3 Restriction endonuclease, type II-like superfamily protein1.9e-10656.1Show/hide
Query:  QTLHQSNSSVDTATMSTMSNTSVARICCRHPRSNAGLFSKRKHGSGSRTLSTCS-SPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLT
        + L  +  ++++  +S M   S++      P+S+  + S+++  S + +L T + SPF+           S++I++S ++PSD PQ+SEEWFALR+++LT
Subjt:  QTLHQSNSSVDTATMSTMSNTSVARICCRHPRSNAGLFSKRKHGSGSRTLSTCS-SPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLT

Query:  TSTFSTALGFWKGNRRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCP
        TSTFSTALGFWKGNRR ELWHEKV+ S+ +  E   R AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GILEVKCP
Subjt:  TSTFSTALGFWKGNRRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCP

Query:  YNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGRE-EEVKSYKPTSTHKQT
        YNKGK E  LPW  +P+YYMPQ+QGQ+EIMDREW +LYCWT NGST+FRV R+R YW +I ++L+EFWW++V+PAREAL LG+E EEVK Y+PTSTHK+T
Subjt:  YNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWDNVVPAREALSLGRE-EEVKSYKPTSTHKQT

Query:  GLAIARSIKLASEAKLLCREIAGHVEFY
         LAIA+S+ LA+E+KL+CREIA HVEF+
Subjt:  GLAIARSIKLASEAKLLCREIAGHVEFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTCGCTGCGGTCTCTTTCTCTCAAGCTGGAGCGTCTCGGAGTTTTCTTCATGGGGGTTCCTCTTTCAATCGATTGCCGCGCGTCGCTTCATTTTCAGCTCGCCA
AGTTGATGCATTCAGCTCAACTTCTCTTTTGGTTGAGGGGCATTCATTGTTTGATGTTGTTGAGGTTCCATGTGGATTAACAAGCTCTGAACCTCAGACGCTTCATCAAA
GTAACTCTTCAGTCGACACTGCCACTATGTCAACAATGAGCAACACCTCCGTTGCTAGAATCTGCTGCAGACACCCTAGATCAAATGCTGGGCTGTTCTCAAAACGAAAA
CATGGGAGTGGTTCAAGAACCCTATCAACTTGCAGCTCACCATTTAGCTCCATAACCAACCCCCTGGTCATCCGTTTACCCTCAGCTTTGATTTTGGCTTCCCAGGTCAC
CCCATCGGATGCCCCTCAGCGTTCAGAAGAATGGTTTGCGCTACGGAGAGAAAGACTGACTACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGCCGCT
TTGAGCTATGGCATGAGAAAGTGTTTCCTTCAGAGATTCAAAAACCAGAAGCACCACAGCGGCATGCCATGGAGTGGGGTGTGCTCAATGAAGCAAATGCCATCGATCGG
TATAAAAGCATAACAGGCCGAGATGTAAGCTTGTTAGGATTTGCAACTCACTCAGAACAGCAATTCGACTGGCTCGGTGCCTCCCCCGACGGCCTATTGGGATGCTTTCA
TGGAGGTGGGATCCTTGAAGTAAAATGTCCATACAACAAGGGAAAGCCCGAGAAGGGACTGCCCTGGTCGACCATGCCTTTCTATTACATGCCACAGGTACAAGGTCAAT
TAGAGATAATGGACAGAGAGTGGGCGGATTTGTATTGCTGGACACCAAATGGAAGCACAATATTTCGCGTGTGTAGGGAACGTGGTTATTGGGAATTGATACGTGAAATG
TTAAAGGAATTCTGGTGGGACAATGTTGTTCCTGCAAGGGAGGCTTTATCGTTGGGAAGAGAGGAAGAGGTCAAGTCATACAAGCCGACATCCACACACAAGCAGACTGG
ACTAGCGATTGCTAGGAGCATTAAATTAGCAAGCGAGGCCAAATTGTTGTGTAGGGAGATAGCTGGCCATGTTGAATTTTACCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTCGCTGCGGTCTCTTTCTCTCAAGCTGGAGCGTCTCGGAGTTTTCTTCATGGGGGTTCCTCTTTCAATCGATTGCCGCGCGTCGCTTCATTTTCAGCTCGCCA
AGTTGATGCATTCAGCTCAACTTCTCTTTTGGTTGAGGGGCATTCATTGTTTGATGTTGTTGAGGTTCCATGTGGATTAACAAGCTCTGAACCTCAGACGCTTCATCAAA
GTAACTCTTCAGTCGACACTGCCACTATGTCAACAATGAGCAACACCTCCGTTGCTAGAATCTGCTGCAGACACCCTAGATCAAATGCTGGGCTGTTCTCAAAACGAAAA
CATGGGAGTGGTTCAAGAACCCTATCAACTTGCAGCTCACCATTTAGCTCCATAACCAACCCCCTGGTCATCCGTTTACCCTCAGCTTTGATTTTGGCTTCCCAGGTCAC
CCCATCGGATGCCCCTCAGCGTTCAGAAGAATGGTTTGCGCTACGGAGAGAAAGACTGACTACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGCCGCT
TTGAGCTATGGCATGAGAAAGTGTTTCCTTCAGAGATTCAAAAACCAGAAGCACCACAGCGGCATGCCATGGAGTGGGGTGTGCTCAATGAAGCAAATGCCATCGATCGG
TATAAAAGCATAACAGGCCGAGATGTAAGCTTGTTAGGATTTGCAACTCACTCAGAACAGCAATTCGACTGGCTCGGTGCCTCCCCCGACGGCCTATTGGGATGCTTTCA
TGGAGGTGGGATCCTTGAAGTAAAATGTCCATACAACAAGGGAAAGCCCGAGAAGGGACTGCCCTGGTCGACCATGCCTTTCTATTACATGCCACAGGTACAAGGTCAAT
TAGAGATAATGGACAGAGAGTGGGCGGATTTGTATTGCTGGACACCAAATGGAAGCACAATATTTCGCGTGTGTAGGGAACGTGGTTATTGGGAATTGATACGTGAAATG
TTAAAGGAATTCTGGTGGGACAATGTTGTTCCTGCAAGGGAGGCTTTATCGTTGGGAAGAGAGGAAGAGGTCAAGTCATACAAGCCGACATCCACACACAAGCAGACTGG
ACTAGCGATTGCTAGGAGCATTAAATTAGCAAGCGAGGCCAAATTGTTGTGTAGGGAGATAGCTGGCCATGTTGAATTTTACCGATGA
Protein sequenceShow/hide protein sequence
MKLAAVSFSQAGASRSFLHGGSSFNRLPRVASFSARQVDAFSSTSLLVEGHSLFDVVEVPCGLTSSEPQTLHQSNSSVDTATMSTMSNTSVARICCRHPRSNAGLFSKRK
HGSGSRTLSTCSSPFSSITNPLVIRLPSALILASQVTPSDAPQRSEEWFALRRERLTTSTFSTALGFWKGNRRFELWHEKVFPSEIQKPEAPQRHAMEWGVLNEANAIDR
YKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFHGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREM
LKEFWWDNVVPAREALSLGREEEVKSYKPTSTHKQTGLAIARSIKLASEAKLLCREIAGHVEFYR