| GenBank top hits | e value | %identity | Alignment |
|---|
| KAG6585875.1 hypothetical protein SDJN03_18608, partial [Cucurbita argyrosperma subsp. sororia] | 3.1e-130 | 73.48 | Show/hide |
Query: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
MI L LGV SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS L P+D+ AILDQV RMLRLTEKDEDE
Subjt: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
Query: LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
+R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM +S+KRKRK GNFPNA EVCRMGVE LK H LGYRA Y
Subjt: LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
Query: IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
++ FAQ V++G INLQ +P+ PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Subjt: IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
Query: EYYESKFGKLSELCSLDYNKISGTTLNL
+YYE+KFGKLSEL S DY+KISG+TL+L
Subjt: EYYESKFGKLSELCSLDYNKISGTTLNL
|
|
| XP_021905122.1 uncharacterized protein LOC110820055 isoform X2 [Carica papaya] | 7.0e-90 | 53.92 | Show/hide |
Query: LNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH-SSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELR
L+LG F+LEKAVCNHG FMM PN W PS KTL+RPLRL SN +SSV+ SI+H S+S L IQ+H +S D+ AIL+QV RMLR+++KDE+ +R
Subjt: LNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINH-SSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELR
Query: KFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAK----MSNQSRKRKRKVI---------------GNFPNAEEVCRMGV
+FQ +H AK GFGR+FRSP+LFED +KS+LLCN TW RTL MA+ LCELQ + +S + RK++++ GNFPNAEE+ +
Subjt: KFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAK----MSNQSRKRKRKVI---------------GNFPNAEEVCRMGV
Query: ELLKKH-NLGYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQC
+LL++ LGYRA Y+IN AQ V++G ++L N L KIKGFG F AN+ MC+GFY+ +P DTET+RHLKQ+HG + C++ T+ +DVK IYDKY+PFQ
Subjt: ELLKKH-NLGYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQC
Query: LAYWLELVEYYESKFGKLSELCSLDYNKISGT
LAYW EL+ YYESK GKLSEL Y ++G+
Subjt: LAYWLELVEYYESKFGKLSELCSLDYNKISGT
|
|
| XP_022156993.1 uncharacterized protein LOC111023822 [Momordica charantia] | 3.8e-120 | 69.49 | Show/hide |
Query: KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
++MI LNLG +S FDLE+AVCNHG FMM PN+WIPSSKTLQRPLRL ++S +SV VSI+ SS LL IQIHSSP SP D+QAILDQV RMLR+TE+DE
Subjt: KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
Query: DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VIGNFPNAEEVCRMGVELLK
+ +R FQ+LH KAK++GFGRLFRSPTLFEDA+KSILLCN TW+RTLAMA QLCELQAK+ +KRKRK GNFP A E+CRM V LL+
Subjt: DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VIGNFPNAEEVCRMGVELLK
Query: KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA
KH +GYRA YII+ AQRVQNG I+LQ PKIKGFGPF TAN+ MCLG Y +LPIDTETIRHLKQ+HGRQ CN KT +E VK +YDKYAPFQCLA
Subjt: KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA
Query: YWLELVEYYESKFGKLSELCSLDYNKISGTT
YW+ELVEYYES+FGKLSEL DY KISGTT
Subjt: YWLELVEYYESKFGKLSELCSLDYNKISGTT
|
|
| XP_022951918.1 uncharacterized protein LOC111454659 [Cucurbita moschata] | 2.4e-130 | 73.48 | Show/hide |
Query: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
MI L LGV SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS L P+D+ AILDQV RMLRLTEKDEDE
Subjt: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
Query: LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
+R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM +S+KRKRK GNFPNA EVCRMGVE LK H LGYRA Y
Subjt: LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
Query: IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
++ FAQ V++G INLQ +P+ PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Subjt: IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
Query: EYYESKFGKLSELCSLDYNKISGTTLNL
+YYE+KFGKLSEL S DY+KISG+TL+L
Subjt: EYYESKFGKLSELCSLDYNKISGTTLNL
|
|
| XP_038877617.1 uncharacterized protein LOC120069874 [Benincasa hispida] | 8.5e-136 | 86.53 | Show/hide |
Query: KMIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIH-SSPPLSPQDQQAILDQVVRMLRLTEKDE
K I LNLGVS SDFDLEKAVCNHGQFMM PNQWIPSSKTLQRPLRL S+S+SSVFVSIN SS LLTIQIH SS PLSPQDQQAILDQVVRMLRLTEKDE
Subjt: KMIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIH-SSPPLSPQDQQAILDQVVRMLRLTEKDE
Query: DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQ-SRKRKRKV------IGNFPNAEEVCRMGVELLKKHNL
DELRKFQSLHP+AKQMGFGRLFRSPTLFEDA+KSILLCNTTWKRTLAMA QLCELQAKM Q +RKRKRK+ IGNFPNAEEVCRMGVELLKKH L
Subjt: DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQ-SRKRKRKV------IGNFPNAEEVCRMGVELLKKHNL
Query: GYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLE
GYRA YIINFA+ VQ+G I+LQNPN+ PKIKGFGPFATAN+LMCLG YRQLPIDTETIRHLKQ+HGRQFCN KTV+EDVKQIYDKYAPFQCLAYWLE
Subjt: GYRAGYIINFAQRVQNGTINLQNPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLE
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A2P5ACW8 DNA glycosylase | 1.6e-87 | 51.23 | Show/hide |
Query: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE
++ L LG S S F++EKAVCNHG FMM+PN+W PS+KTLQRPLRL ++ SSV VSI+HS S LL I++ P LS D AIL+QV RMLR+T+
Subjt: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE
Query: KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKM---------------SNQSRKRKR-----------KVI
+DE ++R+FQ +HP+AK+ GFGR+FRSP+LFEDA+KSILLCN +W RTL MAE LC+LQ ++ SN+ KRKR +++
Subjt: KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKM---------------SNQSRKRKR-----------KVI
Query: GNFPNAEEVCRMGVE-LLKKHN--LGYRAGYIINFAQRVQNGTIN---------LQNPNH------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH
GNFPNA E+ + L+K+ LGYRA +I++ A+ ++G +N + +H + KI+GFGPF AN+LMC+ Y +P D+ETIRH
Subjt: GNFPNAEEVCRMGVE-LLKKHN--LGYRAGYIINFAQRVQNGTIN---------LQNPNH------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH
Query: LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTL
L+Q+HGR+ CNKKT+ ++VK+IYDKYAPFQCLAYW+EL+EYYE KFGKLSEL Y ISG+ L
Subjt: LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTL
|
|
| A0A2P5FT40 DNA glycosylase | 1.2e-87 | 51.09 | Show/hide |
Query: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE
++ L LG S S F++EKAVCNHG FMM+PN+W PS+KTLQRPLRL ++ SSV VSI+HS S LL I++ P LS D AIL+QV RMLR+TE
Subjt: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHS--SSFLLTIQIHSSPP---LSPQDQQAILDQVVRMLRLTE
Query: KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQ---------------AKMSNQSRKRKR------------KV
+DE ++R+FQ +HP+AK+ GFGR+FRSP+LFEDA+KSILLCN +W RTL MAE LC+LQ + SN+ KRKR ++
Subjt: KDEDELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQ---------------AKMSNQSRKRKR------------KV
Query: IGNFPNAEEVCRM-GVELLKKHN--LGYRAGYIINFAQRVQNGTIN--------LQNPNH-------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIR
+GNFPNA E+ + L+K+ LGYRA +I++ A+ ++G +N + H + I+GFGPF AN+LMC+ Y +P D+ETIR
Subjt: IGNFPNAEEVCRM-GVELLKKHN--LGYRAGYIINFAQRVQNGTIN--------LQNPNH-------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIR
Query: HLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTL
HL+Q+H R+ CNKKT+Q++VK+IYDKYAPFQCLAYW+EL+EYYE KFGKLSEL Y ISG+ L
Subjt: HLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGTTL
|
|
| A0A6A1W9S6 Uncharacterized protein | 9.9e-90 | 51.52 | Show/hide |
Query: FDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSS---FLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHP
F++EKAVCNHG FMM+PN WIPS+KTLQRPLRL +NS SV VSI+H +S + IQ+H + +SPQD++AIL+QV RMLR++E+DE LR+FQ+LHP
Subjt: FDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSS---FLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDELRKFQSLHP
Query: KAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSN-------------QSRKR--KRK---------------------------
+AK+ GFGR FRSP+LFEDAIKS+LLCN TW RTL MA+ LCELQ +++N SRKR KRK
Subjt: KAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSN-------------QSRKR--KRK---------------------------
Query: ---VIGNFPNAEEVCRMGVELLKKH-NLGYRAGYIINFAQRVQNGTINLQ--NPNH----------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH
+GNFP+++EV + L+ H NLGYRA YI+ A++V++G + L+ + +H L KIKGFGPFA AN++MC+G+Y+ +P+DTET+RH
Subjt: ---VIGNFPNAEEVCRMGVELLKKH-NLGYRAGYIINFAQRVQNGTINLQ--NPNH----------LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRH
Query: LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGT
L+Q+HGR+ K+TV EDVK +YDK+APFQ LAYW EL+E+YE KFGKLSEL + Y +SG+
Subjt: LKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELVEYYESKFGKLSELCSLDYNKISGT
|
|
| A0A6J1DS88 uncharacterized protein LOC111023822 | 1.9e-120 | 69.49 | Show/hide |
Query: KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
++MI LNLG +S FDLE+AVCNHG FMM PN+WIPSSKTLQRPLRL ++S +SV VSI+ SS LL IQIHSSP SP D+QAILDQV RMLR+TE+DE
Subjt: KKMIVLNLG-VSSDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDE
Query: DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VIGNFPNAEEVCRMGVELLK
+ +R FQ+LH KAK++GFGRLFRSPTLFEDA+KSILLCN TW+RTLAMA QLCELQAK+ +KRKRK GNFP A E+CRM V LL+
Subjt: DELRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMS----NQSRKRKRK-------VIGNFPNAEEVCRMGVELLK
Query: KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA
KH +GYRA YII+ AQRVQNG I+LQ PKIKGFGPF TAN+ MCLG Y +LPIDTETIRHLKQ+HGRQ CN KT +E VK +YDKYAPFQCLA
Subjt: KHNLGYRAGYIINFAQRVQNGTINLQNPNH---LPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLA
Query: YWLELVEYYESKFGKLSELCSLDYNKISGTT
YW+ELVEYYES+FGKLSEL DY KISGTT
Subjt: YWLELVEYYESKFGKLSELCSLDYNKISGTT
|
|
| A0A6J1GJ25 uncharacterized protein LOC111454659 | 1.2e-130 | 73.48 | Show/hide |
Query: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
MI L LGV SDF+LEKAVCNHG FMM+PNQWIPSSKTLQRPLRL SNS++S+ VSIN SSS LLT+QIHS L P+D+ AILDQV RMLRLTEKDEDE
Subjt: MIVLNLGVS-SDFDLEKAVCNHGQFMMSPNQWIPSSKTLQRPLRLISNSNSSVFVSINHSSSFLLTIQIHSSPPLSPQDQQAILDQVVRMLRLTEKDEDE
Query: LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
+R+FQ+LHP AKQ+GFGR+FRSP+LFED +KSIL+CNT+W+RTL MAE+LCE+QAKM +S+KRKRK GNFPNA EVCRMGVE LK H LGYRA Y
Subjt: LRKFQSLHPKAKQMGFGRLFRSPTLFEDAIKSILLCNTTWKRTLAMAEQLCELQAKMSNQSRKRKRK---VIGNFPNAEEVCRMGVELLKKHNLGYRAGY
Query: IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
++ FAQ V++G INLQ +P+ PKIKGFGPFATAN+ MCLGFY QLPIDTETIRHLKQ+HG Q+C KKTV EDVKQIYD YAP+QCLAYWLELV
Subjt: IINFAQRVQNGTINLQ-------NPNHLPKIKGFGPFATANLLMCLGFYRQLPIDTETIRHLKQLHGRQFCNKKTVQEDVKQIYDKYAPFQCLAYWLELV
Query: EYYESKFGKLSELCSLDYNKISGTTLNL
+YYE+KFGKLSEL S DY+KISG+TL+L
Subjt: EYYESKFGKLSELCSLDYNKISGTTLNL
|
|