; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004231 (gene) of Snake gourd v1 genome

Gene IDTan0004231
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionYqaJ domain-containing protein
Genome locationLG02:91009375..91011900
RNA-Seq ExpressionTan0004231
SyntenyTan0004231
Gene Ontology termsNA
InterPro domainsIPR011335 - Restriction endonuclease type II-like
IPR011604 - Exonuclease, phage-type/RecB, C-terminal
IPR017482 - Putative phage-type endonuclease
IPR019080 - YqaJ viral recombinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020121.1 hypothetical protein SDJN02_16803 [Cucurbita argyrosperma subsp. argyrosperma]7.6e-19287.6Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MKLAAVSFS +GASRSFLH  S FNRLP VASFS R+VDAFSSTSL VCGFCRT HQSNS I+ A++ TMSNTSIARI C  PRSNARLFSKRK  +GSR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        T ST  SP  S+TNPL+IRLPSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSEI+KPEARQQYAMEWGVLNE N
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYWELI EML+EFWWENVVPA+EALS G+E+EV+SYKPTSTHKQTG+AIAKSIKLAS+AKLLCREIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

XP_022951164.1 uncharacterized protein LOC111454092 isoform X1 [Cucurbita moschata]3.8e-19187.34Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MKLAAVSFS +GASRSFLH  S FNRLP VASFS R+VDAFSSTS  VCGFCRT HQSNS I+ A++ TMSNTSIARI C  PRSNARLFSKRK  +GSR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        T ST  SP  S+TNPL+IRLPSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSEI+KPEARQQYAMEWGVLNE N
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYWELI EML+EFWWENVVPA+EALS G+E+EV+SYKPTSTHKQTG+AIAKSIKLAS+AKLLCREIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

XP_023001988.1 uncharacterized protein LOC111496007 isoform X1 [Cucurbita maxima]9.9e-19287.34Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MKLAAVSFS + ASRSFLH  S FNRLPRVASFS  +VDAFSSTSL VC FCRT HQSNS ID A++ TMSNTSIARI C+HPRSNARLFSKRK  +GSR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        T ST  SP  S+TNPL+IRLPSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKG+RRFELW EKVFPSEI+KPEARQQYAMEWGVLNE N
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ +WLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYWELI EML+EFWWENVVPA+EALSLG+EEEV+SYKPTSTHKQTG+AI+KSIKLAS+AKLLCREIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

XP_023537251.1 uncharacterized protein LOC111798383 isoform X1 [Cucurbita pepo subsp. pepo]4.0e-19387.86Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MKLAAVSFS +GASRSFLH  S FNRLPRVASFS R+VDAFSSTSL VCGFCRT HQSNS ID A++ TMSNTSIARI C HPRSNARLFSKRK  +GSR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        T ST  SP  S+TNPL+IRLPSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSEI+KPEARQQYAMEWGVLNE N
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYWELI EML+EFWWENVVPA+EA S G+E+EV+SYKPTSTHK TG+AIAKSIKLAS+AKLLCREIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

XP_038884042.1 uncharacterized protein LOC120074988 isoform X1 [Benincasa hispida]1.7e-19689.45Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MKLAAVSFS AGAS+  LHGGS FNR PRVASF+ R+VDAFSSTSLLVCG CRTLH SNS ++ AIM TM+NTSI+RI C H R NARL  +RKHGSGSR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        T STC SPSSSITNPLVIRLPSALILASQVT SDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELW EKVFP EIQKPEA QQ AMEWGVLNEAN
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQ+EIMDREWADLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYW+LIREML+EFWWENVVPA+EAL LG+EE+VKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGH+EFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

TrEMBL top hitse value%identityAlignment
A0A0A0LNG4 YqaJ domain-containing protein3.0e-17882.06Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MK AAVSFS +GASRS LHGGS FN+L  VAS S R+  +F+S SLLVCG CRTL QS+S ++ AIM TM+N SIARI C H R NARL+ KR H   SR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
          STC SPSSS  NPLVI LPS L+LASQ   S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRR ELW EKVFPSEIQK EA QQ AMEWGVLNE N
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AIDRYK ITGRDVSLLGFATHSEQQFDWLGASPDGLL CFQGGGILEVKCPYNKGKPEKGLPWST+PFYYMPQVQGQ+EIM REWADLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYW+LIRE+L+EFWWENVVPAKEAL LG EE+ KSYKPTSTHKQTGLAIAKSIKLASEAKL CREIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

A0A1S3B9S9 uncharacterized protein LOC103487752 isoform X11.0e-17883.38Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MK AAVSFS +GASRS  HGGS FN+LP VASFS R+    +S SLLVCG CRTL QSNS ++IAIM TM+N SIARI C   R NA+L+ KR  G  SR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        + STC +PSS  TNP VI LPS LILASQV  S APQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSE QK +A QQ AMEWGVLNEAN
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQ+EIM REW+DLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYW+LIRE+LKEFWWENVVPAKEALSLG+EE+ KSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

A0A6J1BPX6 uncharacterized protein LOC111004694 isoform X12.0e-19087.86Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MKLAAVSFS +GASR FLHGG  FNRLPRVAS S   VDAF STSLLVCGFCRTLHQ NS I+   M TMS TSI+RI C HP  NARL SKRKHGSGSR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        T STCTS SSS TNPLV R PSALILASQVT SDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELW EKVFP EIQK EA Q+ AMEWGVLNEA 
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKG+PEKGLPWSTMPFYYMPQVQGQ+EIMDREW DLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYWEL+REML+EFWWENVVPA+EALSLG+EEEVKSYKPTSTHKQTGLAIAKSIKLASEAKL+ REIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

A0A6J1GHX8 uncharacterized protein LOC111454092 isoform X11.8e-19187.34Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MKLAAVSFS +GASRSFLH  S FNRLP VASFS R+VDAFSSTS  VCGFCRT HQSNS I+ A++ TMSNTSIARI C  PRSNARLFSKRK  +GSR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        T ST  SP  S+TNPL+IRLPSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKGNRRFELW EKVFPSEI+KPEARQQYAMEWGVLNE N
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ DWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYWELI EML+EFWWENVVPA+EALS G+E+EV+SYKPTSTHKQTG+AIAKSIKLAS+AKLLCREIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

A0A6J1KS64 uncharacterized protein LOC111496007 isoform X14.8e-19287.34Show/hide
Query:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR
        MKLAAVSFS + ASRSFLH  S FNRLPRVASFS  +VDAFSSTSL VC FCRT HQSNS ID A++ TMSNTSIARI C+HPRSNARLFSKRK  +GSR
Subjt:  MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSR

Query:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN
        T ST  SP  S+TNPL+IRLPSALI+ASQVT SDAPQRSEEWFALRRD+LTTSTFSTALGFWKG+RRFELW EKVFPSEI+KPEARQQYAMEWGVLNE N
Subjt:  TLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEAN

Query:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
        AI RYKSITGRDVS LGFATHSEQQ +WLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR
Subjt:  AIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFR

Query:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR
        VCRERGYWELI EML+EFWWENVVPA+EALSLG+EEEV+SYKPTSTHKQTG+AI+KSIKLAS+AKLLCREIAGHVEFYR
Subjt:  VCRERGYWELIREMLKEFWWENVVPAKEALSLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13810.1 Restriction endonuclease, type II-like superfamily protein3.1e-4235.63Show/hide
Query:  EEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGL
        + W  LR++RLT S F+ A+GF    RR  LW EK+  +   KP A  + A  W + NE  A++RY  +TG ++ +  F  +      + +WLGASPDG+
Subjt:  EEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATH---SEQQFDWLGASPDGL

Query:  LGCFQGG----GILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALS-
        +   + G    G+LEVKCP++     K  PW  +P+  +PQ+QG +EI+D +W DLYCWT NGS++FRV R+  +WE ++  L +FW  +V+PA+E  + 
Subjt:  LGCFQGG----GILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALS-

Query:  ---LGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHV
              + +++ +KP   H+     +  + ++++ A  L  EI G++
Subjt:  ---LGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHV

AT1G67660.1 Restriction endonuclease, type II-like superfamily protein8.0e-10757.01Show/hide
Query:  CRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLT
        CR L  +   ++  I+  M   SI+      P+S+  + S+++    S  LS  T   S   +P      S++I++S ++ SD PQ+SEEWFALR+D+LT
Subjt:  CRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLT

Query:  TSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCP
        TSTFSTALGFWKGNRR ELW EKV+ S+ +  E   ++AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GILEVKCP
Subjt:  TSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCP

Query:  YNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYKPTSTHKQT
        YNKGK E  LPW  +P+YYMPQ+QGQ+EIMDREW +LYCWT NGST+FRV R+R YW +I ++L+EFWWE+V+PA+EAL LGKE EEVK Y+PTSTHK+T
Subjt:  YNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYKPTSTHKQT

Query:  GLAIAKSIKLASEAKLLCREIAGHVEFY
         LAIAKS+ LA+E+KL+CREIA HVEF+
Subjt:  GLAIAKSIKLASEAKLLCREIAGHVEFY

AT1G67660.2 Restriction endonuclease, type II-like superfamily protein4.4e-10558.6Show/hide
Query:  IMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGN
        I+  M   SI+      P+S+  + S+++    S  LS  T   S   +P      S++I++S ++ SD PQ+SEEWFALR+D+LTTSTFSTALGFWKGN
Subjt:  IMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGN

Query:  RRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWST
        RR ELW EKV+ S+ +  E   ++AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   GILEVKCPYNKGK E  LPW  
Subjt:  RRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWST

Query:  MPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYKPTSTHKQTGLAIAKSIKLASEA
        +P+YYMPQ+QGQ+EIMDREW +LYCWT NGST+FRV R+R YW +I ++L+EFWWE+V+PA+EAL LGKE EEVK Y+PTSTHK+T LAIAKS+ LA+E+
Subjt:  MPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYKPTSTHKQTGLAIAKSIKLASEA

Query:  KLLCREIAGHVEFY
        KL+CREIA HVEF+
Subjt:  KLLCREIAGHVEFY

AT1G67660.3 Restriction endonuclease, type II-like superfamily protein6.1e-10756.55Show/hide
Query:  TSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWF
        T + VC  CR L  +   ++  I+  M   SI+      P+S+  + S+++    S  LS  T   S   +P      S++I++S ++ SD PQ+SEEWF
Subjt:  TSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSSSITNPLVIRLPSALILASQVTSSDAPQRSEEWF

Query:  ALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGG
        ALR+D+LTTSTFSTALGFWKGNRR ELW EKV+ S+ +  E   ++AM WGV  E++AI+RYK I G +V  +GFA HS ++F WLGASPDG+L CF   
Subjt:  ALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFATHSEQQFDWLGASPDGLLGCFQGG

Query:  GILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYK
        GILEVKCPYNKGK E  LPW  +P+YYMPQ+QGQ+EIMDREW +LYCWT NGST+FRV R+R YW +I ++L+EFWWE+V+PA+EAL LGKE EEVK Y+
Subjt:  GILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEALSLGKE-EEVKSYK

Query:  PTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFY
        PTSTHK+T LAIAKS+ LA+E+KL+CREIA HVEF+
Subjt:  PTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTCGCTGCGGTCTCCTTTTCTCCAGCTGGAGCGTCTCGAAGTTTTCTTCATGGAGGTTCCCCTTTCAATCGATTGCCGCGCGTCGCTTCATTTTCAACTCGTGA
AGTTGATGCATTCAGCTCAACTTCTCTTTTGGTCTGTGGGTTTTGCAGGACACTTCATCAAAGTAACTCTCCAATCGACATTGCCATTATGCCAACAATGAGCAACACCT
CCATTGCTAGAATCTTCTGTAGCCACCCTAGATCAAATGCAAGGCTGTTCTCAAAACGAAAACATGGGAGTGGTTCAAGAACCCTTTCAACATGCACCTCACCATCTAGC
TCCATAACCAACCCCCTGGTCATCCGTTTACCCTCAGCCTTGATTTTGGCTTCCCAGGTCACCTCTTCAGACGCCCCTCAACGTTCAGAAGAATGGTTTGCGCTACGGAG
GGACAGGCTGACTACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGCCGCTTCGAGCTATGGCAAGAGAAAGTGTTTCCTTCAGAGATTCAAAAACCAG
AAGCACGACAGCAGTATGCCATGGAGTGGGGTGTGCTCAACGAAGCAAATGCCATCGATCGGTATAAAAGCATCACAGGCCGAGATGTAAGCTTGTTAGGATTTGCAACT
CACTCGGAGCAGCAATTCGACTGGCTAGGTGCCTCCCCCGACGGCCTATTGGGATGCTTTCAAGGAGGTGGGATCCTTGAAGTAAAATGTCCATACAACAAGGGAAAGCC
TGAGAAGGGACTACCCTGGTCGACTATGCCTTTCTATTACATGCCACAGGTACAGGGCCAATTGGAGATAATGGACAGAGAGTGGGCGGATTTGTATTGCTGGACACCAA
ATGGAAGCACAATATTTCGCGTTTGTAGGGAACGTGGTTATTGGGAATTGATACGTGAAATGTTAAAGGAATTTTGGTGGGAAAATGTTGTTCCTGCAAAGGAGGCTCTA
TCATTGGGAAAAGAGGAAGAGGTCAAGTCATATAAGCCAACATCCACACACAAACAGACTGGACTAGCAATTGCTAAGAGCATCAAGTTAGCAAGCGAGGCCAAATTGTT
GTGTAGGGAAATTGCTGGGCATGTTGAATTCTACCGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTCGCTGCGGTCTCCTTTTCTCCAGCTGGAGCGTCTCGAAGTTTTCTTCATGGAGGTTCCCCTTTCAATCGATTGCCGCGCGTCGCTTCATTTTCAACTCGTGA
AGTTGATGCATTCAGCTCAACTTCTCTTTTGGTCTGTGGGTTTTGCAGGACACTTCATCAAAGTAACTCTCCAATCGACATTGCCATTATGCCAACAATGAGCAACACCT
CCATTGCTAGAATCTTCTGTAGCCACCCTAGATCAAATGCAAGGCTGTTCTCAAAACGAAAACATGGGAGTGGTTCAAGAACCCTTTCAACATGCACCTCACCATCTAGC
TCCATAACCAACCCCCTGGTCATCCGTTTACCCTCAGCCTTGATTTTGGCTTCCCAGGTCACCTCTTCAGACGCCCCTCAACGTTCAGAAGAATGGTTTGCGCTACGGAG
GGACAGGCTGACTACAAGCACATTCAGCACAGCCTTAGGCTTCTGGAAAGGAAACCGCCGCTTCGAGCTATGGCAAGAGAAAGTGTTTCCTTCAGAGATTCAAAAACCAG
AAGCACGACAGCAGTATGCCATGGAGTGGGGTGTGCTCAACGAAGCAAATGCCATCGATCGGTATAAAAGCATCACAGGCCGAGATGTAAGCTTGTTAGGATTTGCAACT
CACTCGGAGCAGCAATTCGACTGGCTAGGTGCCTCCCCCGACGGCCTATTGGGATGCTTTCAAGGAGGTGGGATCCTTGAAGTAAAATGTCCATACAACAAGGGAAAGCC
TGAGAAGGGACTACCCTGGTCGACTATGCCTTTCTATTACATGCCACAGGTACAGGGCCAATTGGAGATAATGGACAGAGAGTGGGCGGATTTGTATTGCTGGACACCAA
ATGGAAGCACAATATTTCGCGTTTGTAGGGAACGTGGTTATTGGGAATTGATACGTGAAATGTTAAAGGAATTTTGGTGGGAAAATGTTGTTCCTGCAAAGGAGGCTCTA
TCATTGGGAAAAGAGGAAGAGGTCAAGTCATATAAGCCAACATCCACACACAAACAGACTGGACTAGCAATTGCTAAGAGCATCAAGTTAGCAAGCGAGGCCAAATTGTT
GTGTAGGGAAATTGCTGGGCATGTTGAATTCTACCGATGA
Protein sequenceShow/hide protein sequence
MKLAAVSFSPAGASRSFLHGGSPFNRLPRVASFSTREVDAFSSTSLLVCGFCRTLHQSNSPIDIAIMPTMSNTSIARIFCSHPRSNARLFSKRKHGSGSRTLSTCTSPSS
SITNPLVIRLPSALILASQVTSSDAPQRSEEWFALRRDRLTTSTFSTALGFWKGNRRFELWQEKVFPSEIQKPEARQQYAMEWGVLNEANAIDRYKSITGRDVSLLGFAT
HSEQQFDWLGASPDGLLGCFQGGGILEVKCPYNKGKPEKGLPWSTMPFYYMPQVQGQLEIMDREWADLYCWTPNGSTIFRVCRERGYWELIREMLKEFWWENVVPAKEAL
SLGKEEEVKSYKPTSTHKQTGLAIAKSIKLASEAKLLCREIAGHVEFYR