| GenBank top hits | e value | %identity | Alignment |
|---|
| KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia] | 4.5e-124 | 70.85 | Show/hide |
Query: MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
+SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH S +L P+D+ ILDQV RMLRLTEKDEDE+R+FQ+LHP
Subjt: MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
Query: KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
AKQ+GFGR+FRSP++FED +KSIL+CNT+W+RTL MA +LCE+QA+M +++KRKRK GNFPNA EVCRMGVE LK H LGYRA +++ FAQ V+
Subjt: KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
Query: NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
+ I+LQ +P+ FPKIKGFGPFATAN+ MCLGFY QLPIDTETIRH+KQVHG Q+C KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Subjt: NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
Query: LSELCSLDYHKISGATLNL
LSEL S DYHKISG+TL+L
Subjt: LSELCSLDYHKISGATLNL
|
|
| XP_021905122.1 uncharacterized protein LOC110820055 isoform X2 [Carica papaya] | 3.2e-85 | 53.25 | Show/hide |
Query: FDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTS-SFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKA
F+LE+AVCNHG FMMPPN W PS KTL+RPLRLSN +SSV+ SI+ S S L IQ+H +S D+ IL+QV RMLR+++KDE+ +R+FQ +H A
Subjt: FDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTS-SFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHPKA
Query: KQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---ARMSSQNRKRKRKLI----------------GNFPNAEEVCRMGVELLKKH-NL
K GFGR+FRSP++FED +KS+LLCN TW RTL MA LCELQ R S +++KRK GNFPNAEE+ + +LL++ L
Subjt: KQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---ARMSSQNRKRKRKLI----------------GNFPNAEEVCRMGVELLKKH-NL
Query: GYRAGFIINFAQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVE
GYRA ++IN AQ V++ +DL N + KIKGFG F AN+ MC+GFY+ +P DTET+RH+KQVHG + C+ T+ +DVK IYDKY+PFQ LAY EL+
Subjt: GYRAGFIINFAQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVE
Query: YYESKFGKLSELCSLDYHKISGA
YYESK GKLSEL Y ++G+
Subjt: YYESKFGKLSELCSLDYHKISGA
|
|
| XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia] | 4.1e-117 | 69.25 | Show/hide |
Query: KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSL
+T S FDLERAVCNHG FMMPPN+WIPSSKTLQRPLRL++S +SV VSI+Q SS LL IQIH SS + SP D+Q ILDQV RMLR+TE+DE+ +R FQ+L
Subjt: KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSL
Query: HPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAEEVCRMGVELLKKHNLGYRAG
H KAK++GFGRLFRSPT+FEDA+KSILLCN TW+RTLAMAGQLCELQA++ + +KRKRK GNFP A E+CRM V LL+KH +GYRA
Subjt: HPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAEEVCRMGVELLKKHNLGYRAG
Query: FIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY
+II+ AQRVQN IDLQ +FPKIKGFGPF TAN+ MCLG Y +LPIDTETIRH+KQVHGRQ CN KT E VK +YDKYAPFQCLAY +ELVEYY
Subjt: FIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY
Query: ESKFGKLSELCSLDYHKISGAT
ES+FGKLSEL DY KISG T
Subjt: ESKFGKLSELCSLDYHKISGAT
|
|
| XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata] | 4.5e-124 | 70.85 | Show/hide |
Query: MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
+SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH S +L P+D+ ILDQV RMLRLTEKDEDE+R+FQ+LHP
Subjt: MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
Query: KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
AKQ+GFGR+FRSP++FED +KSIL+CNT+W+RTL MA +LCE+QA+M +++KRKRK GNFPNA EVCRMGVE LK H LGYRA +++ FAQ V+
Subjt: KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
Query: NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
+ I+LQ +P+ FPKIKGFGPFATAN+ MCLGFY QLPIDTETIRH+KQVHG Q+C KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Subjt: NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
Query: LSELCSLDYHKISGATLNL
LSEL S DYHKISG+TL+L
Subjt: LSELCSLDYHKISGATLNL
|
|
| XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida] | 1.0e-136 | 87.46 | Show/hide |
Query: TMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLH
++SDFDLE+AVCNHGQFMMPPNQWIPSSKTLQRPLRLS+S+SSVFVSINQ SS LLTIQIHSSS LSPQDQQ ILDQVVRMLRLTEKDEDELRKFQSLH
Subjt: TMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLH
Query: PKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQ-NRKRKRKL------IGNFPNAEEVCRMGVELLKKHNLGYRAGFIINF
P+AKQMGFGRLFRSPT+FEDALKSILLCNTTWKRTLAMAGQLCELQA+M Q RKRKRKL IGNFPNAEEVCRMGVELLKKH LGYRA +IINF
Subjt: PKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQ-NRKRKRKL------IGNFPNAEEVCRMGVELLKKHNLGYRAGFIINF
Query: AQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLE
A+ VQ+ IDLQNPN FPKIKGFGPFATAN+LMCLG YRQLPIDTETIRH+KQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAY LE
Subjt: AQRVQNATIDLQNPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLE
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A2P5ACW8 DNA glycosylase | 3.9e-81 | 48.74 | Show/hide |
Query: KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLTIQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRK
++ S F++E+AVCNHG FMM PN+W PS+KTLQRPLRL++ SSV VSI+ + S LL I++ S S ALS D IL+QV RMLR+T++DE ++R+
Subjt: KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLTIQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRK
Query: FQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARM---------------SSQNRKRKR-----------KLIGNFPNAEE
FQ +HP+AK+ GFGR+FRSP++FEDA+KSILLCN +W RTL MA LC+LQ + S++ KRKR +++GNFPNA E
Subjt: FQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARM---------------SSQNRKRKR-----------KLIGNFPNAEE
Query: VCRMGVE-LLKKHN--LGYRAGFIINFAQRVQNATID------------LQNPNN---FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQ
+ + L+K+ LGYRA I++ A+ ++ ++ L + KI+GFGPF AN+LMC+ Y +P D+ETIRH++QVHGR+
Subjt: VCRMGVE-LLKKHN--LGYRAGFIINFAQRVQNATID------------LQNPNN---FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQ
Query: FCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL
CN KT+ ++VK+IYDKYAPFQCLAY +EL+EYYE KFGKLSEL Y ISG+ L
Subjt: FCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL
|
|
| A0A2P5FT40 DNA glycosylase | 1.1e-80 | 48.04 | Show/hide |
Query: KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLTIQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRK
++ S F++E+AVCNHG FMM PN+W PS+KTLQRPLRL++ SSV VSI+ + S LL I++ S S LS D IL+QV RMLR+TE+DE ++R+
Subjt: KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQT--SSFLLTIQI--HSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRK
Query: FQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKR------------KLIGNFPNAE
FQ +HP+AK+ GFGR+FRSP++FEDA+KSILLCN +W RTL MA LC+LQ + S+++ KRKR +++GNFPNA
Subjt: FQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKR------------KLIGNFPNAE
Query: EVCRM-GVELLKKHN--LGYRAGFIINFAQRVQNATID-LQNPNN--------------FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGR
E+ + L+K+ LGYRA I++ A+ ++ ++ L+ I+GFGPF AN+LMC+ Y +P D+ETIRH++QVH R
Subjt: EVCRM-GVELLKKHN--LGYRAGFIINFAQRVQNATID-LQNPNN--------------FPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGR
Query: QFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL
+ CN KT++++VK+IYDKYAPFQCLAY +EL+EYYE KFGKLSEL Y ISG+ L
Subjt: QFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGATL
|
|
| A0A6A1W9S6 Uncharacterized protein | 6.4e-84 | 47.84 | Show/hide |
Query: EKTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQ----TSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELR
E+ + F++E+AVCNHG FMM PN WIPS+KTLQRPLRL+NS SV VSI+ T++++L IQ+H + +SPQD++ IL+QV RMLR++E+DE LR
Subjt: EKTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQ----TSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELR
Query: KFQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKRKL-------------------
+FQ+LHP+AK+ GFGR FRSP++FEDA+KS+LLCN TW RTL MA LCELQ AR S+ R KRK
Subjt: KFQSLHPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQ---------------ARMSSQNRKRKRKL-------------------
Query: -----------IGNFPNAEEVCRMGVELLKKH-NLGYRAGFIINFAQRVQNATIDLQNPNN------------FPKIKGFGPFATANLLMCLGFYRQLPI
+GNFP+++EV + L+ H NLGYRA +I+ A++V++ + L+ ++ KIKGFGPFA AN++MC+G+Y+ +P+
Subjt: -----------IGNFPNAEEVCRMGVELLKKH-NLGYRAGFIINFAQRVQNATIDLQNPNN------------FPKIKGFGPFATANLLMCLGFYRQLPI
Query: DTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGA
DTET+RH++QVHGR+ +TV EDVK +YDK+APFQ LAY EL+E+YE KFGKLSEL + Y +SG+
Subjt: DTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGKLSELCSLDYHKISGA
|
|
| A0A6J1DS88 uncharacterized protein LOC111023822 | 2.0e-117 | 69.25 | Show/hide |
Query: KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSL
+T S FDLERAVCNHG FMMPPN+WIPSSKTLQRPLRL++S +SV VSI+Q SS LL IQIH SS + SP D+Q ILDQV RMLR+TE+DE+ +R FQ+L
Subjt: KTMSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSL
Query: HPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAEEVCRMGVELLKKHNLGYRAG
H KAK++GFGRLFRSPT+FEDA+KSILLCN TW+RTLAMAGQLCELQA++ + +KRKRK GNFP A E+CRM V LL+KH +GYRA
Subjt: HPKAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMS----SQNRKRKRK-------LIGNFPNAEEVCRMGVELLKKHNLGYRAG
Query: FIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY
+II+ AQRVQN IDLQ +FPKIKGFGPF TAN+ MCLG Y +LPIDTETIRH+KQVHGRQ CN KT E VK +YDKYAPFQCLAY +ELVEYY
Subjt: FIINFAQRVQNATIDLQNPN---NFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYY
Query: ESKFGKLSELCSLDYHKISGAT
ES+FGKLSEL DY KISG T
Subjt: ESKFGKLSELCSLDYHKISGAT
|
|
| A0A6J1GJ25 uncharacterized protein LOC111454659 | 2.2e-124 | 70.85 | Show/hide |
Query: MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
+SDF+LE+AVCNHG FMM PNQWIPSSKTLQRPLRLSNS++S+ VSINQ+SS LLT+QIH S +L P+D+ ILDQV RMLRLTEKDEDE+R+FQ+LHP
Subjt: MSDFDLERAVCNHGQFMMPPNQWIPSSKTLQRPLRLSNSNSSVFVSINQTSSFLLTIQIHSSSAALSPQDQQTILDQVVRMLRLTEKDEDELRKFQSLHP
Query: KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
AKQ+GFGR+FRSP++FED +KSIL+CNT+W+RTL MA +LCE+QA+M +++KRKRK GNFPNA EVCRMGVE LK H LGYRA +++ FAQ V+
Subjt: KAKQMGFGRLFRSPTVFEDALKSILLCNTTWKRTLAMAGQLCELQARMSSQNRKRKRK---LIGNFPNAEEVCRMGVELLKKHNLGYRAGFIINFAQRVQ
Query: NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
+ I+LQ +P+ FPKIKGFGPFATAN+ MCLGFY QLPIDTETIRH+KQVHG Q+C KTV EDVKQIYD YAP+QCLAY LELV+YYE+KFGK
Subjt: NATIDLQ-------NPNNFPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHIKQVHGRQFCNNKTVREDVKQIYDKYAPFQCLAYRLELVEYYESKFGK
Query: LSELCSLDYHKISGATLNL
LSEL S DYHKISG+TL+L
Subjt: LSELCSLDYHKISGATLNL
|
|