| GenBank top hits | e value | %identity | Alignment |
|---|
| KAG2663507.1 hypothetical protein I3760_16G033000 [Carya illinoinensis] | 1.6e-42 | 32.04 | Show/hide |
Query: MASAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDEYH---WRLTGFYG------------FLRR-ICEIRRG----GDFNAL
M K LGF+NC V C+GKSGG+AL WN+ + +FS HIH +T +E + W LTGFYG LR + +G GDFN +
Subjt: MASAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDEYH---WRLTGFYG------------FLRR-ICEIRRG----GDFNAL
Query: LYQHENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIYE------LNHLDYHQ--------------SDHRPIELVLSPQPGCWRRS
L E G RDK ++ AF++VID L DLGF GN FTWCNRR L++L +H SDH PI L L+ G +
Subjt: LYQHENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIYE------LNHLDYHQ--------------SDHRPIELVLSPQPGCWRRS
Query: SQRISRFDETWLKQADLQQLVRDSW-GL-NREDPGLSAPQML----------------AQVSKREANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEE
+++ RF+ W+ D +Q+++D+W G+ R+D + ++ Q + ++A ++ L + + +R+ L +A +++ L
Subjt: SQRISRFDETWLKQADLQQLVRDSW-GL-NREDPGLSAPQML----------------AQVSKREANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEE
Query: ELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEW----RQDRAMI------------------LQDLQRSVDNEMNVDLLRPFTEEEI
E+ WKQRS+ +WL EGD+N+R+FH +A+ R++ N I + D G W R+D ++ LQ L + +M L PF+E+E+
Subjt: ELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEW----RQDRAMI------------------LQDLQRSVDNEMNVDLLRPFTEEEI
Query: LRALKQSHPHKA
RAL + HP KA
Subjt: LRALKQSHPHKA
|
|
| KAG6703462.1 hypothetical protein I3842_07G085800 [Carya illinoinensis] | 2.9e-41 | 30.71 | Show/hide |
Query: KRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWD---EYHWRLTGFYG------------FLRRICEIRRG-----GDFNALLYQH
K +L F NC +D KG SGGLALLW S+LS+S NHI + + +TGFYG LR +C GDFN +L+ H
Subjt: KRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWD---EYHWRLTGFYG------------FLRRICEIRRG-----GDFNALLYQH
Query: ENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCN-RRPNGTI-------------------YELNHLDYHQSDHRPIELVLSPQPGCWRRSSQRI
E RD+ S++ F+ +D+ L D+GF G ++TWCN R P + E+ H SDH PI L + R + R+
Subjt: ENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCN-RRPNGTI-------------------YELNHLDYHQSDHRPIELVLSPQPGCWRRSSQRI
Query: SRFDETWLKQADLQQLVRDSW----GLNREDPGLSAPQMLAQVSKREANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQ
RF+ W+ ++ ++V+D+W G N+ + ++ ++ ++A + +Q +G S+ + A ++ L +E+ W+QR + +WL+EGDQ
Subjt: SRFDETWLKQADLQQLVRDSW----GLNREDPGLSAPQMLAQVSKREANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQ
Query: NTRWFHRQASYRQRLNRIVGLTDDQGEWR----------------------QDRAMILQDLQRSVDNEMNVDLLRPFTEEEILRALKQSHPHKA
N+++F +AS+R+R N I L +D GEW+ Q+ + L DL V EMN DL +P+TE E+ ALKQ P A
Subjt: NTRWFHRQASYRQRLNRIVGLTDDQGEWR----------------------QDRAMILQDLQRSVDNEMNVDLLRPFTEEEILRALKQSHPHKA
|
|
| OMO59710.1 reverse transcriptase [Corchorus capsularis] | 9.5e-40 | 33.17 | Show/hide |
Query: SAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDE--YHWRLTGFYG------------FLRR-ICEIRRG----GDFNALLYQ
S +R CF V G+SGGLA+ W+ SV L+SFS +HI W+ ++ WRLTGFYG LR+ + + GDFN LL+Q
Subjt: SAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDE--YHWRLTGFYG------------FLRR-ICEIRRG----GDFNALLYQ
Query: HENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGT-IYE-------------------LNHLDYHQSDHRPIELVLSPQPGCWRRSSQR
E +G R++P +++ AF+ +D GL D+G+ GN FTW N I+E + HL SDH PI +L+ + RR Q
Subjt: HENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGT-IYE-------------------LNHLDYHQSDHRPIELVLSPQPGCWRRSSQR
Query: IS----RFDETWLKQADLQQLVRDSW----GLN--------REDPGLSAPQMLAQVSKR--EANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELY
S F+ W K+AD ++LV D W GL R+ G Q + +R E ++K+ I G+ G + ++ +L+EEE +
Subjt: IS----RFDETWLKQADLQQLVRDSW----GLN--------REDPGLSAPQMLAQVSKR--EANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELY
Query: WKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEWRQDRA-----------------------MILQDLQRSVDNEMNVDLLRPFTEEEILR
W Q SR WL EGD+NT +FH QAS R++ N I L + G D IL+ + S+ EMN LL FT EEI
Subjt: WKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEWRQDRA-----------------------MILQDLQRSVDNEMNVDLLRPFTEEEILR
Query: ALKQSHPHKA
ALKQ HP KA
Subjt: ALKQSHPHKA
|
|
| XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis] | 3.9e-41 | 32.11 | Show/hide |
Query: KRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWIT-WDEYHWRLTGFYG---------FLRRICEIRRG--------GDFNALLYQHEN
K LGF CF VD G+SGGLALLW + ++++S++HIH IT D W LTG YG R + + RG GDFN +L E
Subjt: KRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWIT-WDEYHWRLTGFYG---------FLRRICEIRRG--------GDFNALLYQHEN
Query: EGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIYE--------------------LNHLDYHQSDHRPIELVLSPQPGCWRRSSQRISR
G + ++ F+ V+ L DLG+VG+RFTW NRR + + + H SDH P L L + RR S+R+ R
Subjt: EGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIYE--------------------LNHLDYHQSDHRPIELVLSPQPGCWRRSSQRISR
Query: FDETWLKQADLQQLVRDSWGLNREDPGLSAPQMLAQVSK--------------------REANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYW
F+ W+ + + ++ WG R +S Q++ ++S A +++Q E G E QA +++ L+ +EL W
Subjt: FDETWLKQADLQQLVRDSWGLNREDPGLSAPQMLAQVSK--------------------REANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYW
Query: KQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEW----------------------RQDRAMILQDLQRSVDNEMNVDLLRPFTEEEILRAL
KQRSR WL+EGD N+R+FH +AS R+R N I+ L D+ G W R D +L ++ V EMN DLL+P+ EE+ AL
Subjt: KQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEW----------------------RQDRAMILQDLQRSVDNEMNVDLLRPFTEEEILRAL
Query: KQSHPHKA
KQ HP KA
Subjt: KQSHPHKA
|
|
| XP_042965942.1 uncharacterized protein LOC122299620 [Carya illinoinensis] | 3.6e-39 | 30.51 | Show/hide |
Query: MASAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDEY---HWRLTGFYGF------------LRRICEIRR-----GGDFNAL
M KR+LG ENC V C+G+ GGLAL W V ++L +S NHIH I +E W LTG YGF +R + + GDFN +
Subjt: MASAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDEY---HWRLTGFYGF------------LRRICEIRR-----GGDFNAL
Query: LYQHENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRP--------------------NGTIYELNHLDYHQSDHRPIELVLSPQPGCWRRS
++Q E G D+P ++ F++ ID G+ DLG+ G +TW NRR N + + H SDH P+ + + R
Subjt: LYQHENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRP--------------------NGTIYELNHLDYHQSDHRPIELVLSPQPGCWRRS
Query: SQRISRFDETWLKQADLQQLVRDSWGLNREDPGLSAPQMLAQV---------SKREANQKVQLA-----IEGLR-----GAGSRELLSQAEAQLEDVLQE
R RF+ W + +L++++W + G+ Q+ K N KV+LA + L+ G S+E +S A ++ L
Subjt: SQRISRFDETWLKQADLQQLVRDSWGLNREDPGLSAPQMLAQV---------SKREANQKVQLA-----IEGLR-----GAGSRELLSQAEAQLEDVLQE
Query: EELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEWRQDRAM------ILQDL----------------QRSVDNEMNVDLLRPFTEEE
EE W QRS+ +W++ GDQN+R+FH +AS+R++ N I L D+Q +W++ R M Q L + V + MN +L +PFT E
Subjt: EELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEWRQDRAM------ILQDL----------------QRSVDNEMNVDLLRPFTEEE
Query: ILRALKQSHPHKA
+ AL Q HP KA
Subjt: ILRALKQSHPHKA
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A2N9F2M3 Uncharacterized protein | 6.8e-44 | 32.27 | Show/hide |
Query: LGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDEYHWRLTGFYGF----LR-RICEIRRG------------GDFNALLYQHENEGSR
LG CF V+ G GGLALLWNSSV + ++S NHI + D WRLTGFYG+ LR R ++ R GDFN ++ E+ G
Subjt: LGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDEYHWRLTGFYGF----LR-RICEIRRG------------GDFNALLYQHENEGSR
Query: DKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIY--------------------ELNHLDYHQSDHRPIELVLSPQPGCWRRSSQRISRFDET
D+ L+++AAF+ ++ L DLGF G FTW NRR N + +NH+ + SDH + SP R ++ RF+ +
Subjt: DKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIY--------------------ELNHLDYHQSDHRPIELVLSPQPGCWRRSSQRISRFDET
Query: WLKQADLQQLVRDSWGLNREDPGLSAPQMLAQVSK-------REANQKVQLA---IEGLRGA-----------GSRELLSQAEAQLEDVLQEEELYWKQR
WL++ +++V+++WG+ E G +A LAQ K + + KV+ IE L+ + ++ +L ++LQ+EE++W+QR
Subjt: WLKQADLQQLVRDSWGLNREDPGLSAPQMLAQVSK-------REANQKVQLA---IEGLRGA-----------GSRELLSQAEAQLEDVLQEEELYWKQR
Query: SREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEWRQDRAMI-----------------------LQDLQRSVDNEMNVDLLRPFTEEEILRALKQ
SR WL +GD+NT++FH AS R++ N IVGL D W+ D++ I +Q + V+ EMN LL+PF EE+ AL Q
Subjt: SREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEWRQDRAMI-----------------------LQDLQRSVDNEMNVDLLRPFTEEEILRALKQ
Query: SHPHKA
+P K+
Subjt: SHPHKA
|
|
| A0A2N9GII4 Uncharacterized protein | 9.9e-43 | 31.73 | Show/hide |
Query: MASAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWIT-WDEYHWRLTGFYG---------------FLRRICEI--RRGGDFNALLY
M + LGF+N F V G+S GLALLW V + +F+ +HI I ++ WRL GFYG L + C + GDFN +L
Subjt: MASAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWIT-WDEYHWRLTGFYG---------------FLRRICEI--RRGGDFNALLY
Query: QHENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNG--------------------TIYELNHLDYHQSDHRPIELVLSPQPGCWRRSSQ
Q+E G R +P + F+ V++ +DLG+ G +FTW N R I + HL +SDH PI LV + R+ +
Subjt: QHENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNG--------------------TIYELNHLDYHQSDHRPIELVLSPQPGCWRRSSQ
Query: RISRFDETWLKQADLQQLVRDSWGLNREDPGLSAPQM------------LAQVSKR---------EANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQE
R++RF+E W D + ++R W E+ G +P LAQ SK+ A + A+ G +R L+S + ++ +L
Subjt: RISRFDETWLKQADLQQLVRDSWGLNREDPGLSAPQM------------LAQVSKR---------EANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQE
Query: EELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEW-------------------------RQDRAMILQDLQRSVDNEMNVDLLRPFT
+E++WKQRSR WLKEGD NT++FH A+ RQR N+I GL ++QG+W R + AM + D R V+ EMN L RPF
Subjt: EELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEW-------------------------RQDRAMILQDLQRSVDNEMNVDLLRPFT
Query: EEEILRALKQSHPHKA
EE+ +A+ Q HP K+
Subjt: EEEILRALKQSHPHKA
|
|
| A0A2N9IJF6 Uncharacterized protein | 9.9e-43 | 31.73 | Show/hide |
Query: MASAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWIT-WDEYHWRLTGFYG---------------FLRRICEI--RRGGDFNALLY
M + LGF+N F V G+S GLALLW V + +F+ +HI I ++ WRL GFYG L + C + GDFN +L
Subjt: MASAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWIT-WDEYHWRLTGFYG---------------FLRRICEI--RRGGDFNALLY
Query: QHENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNG--------------------TIYELNHLDYHQSDHRPIELVLSPQPGCWRRSSQ
Q+E G R +P + F+ V++ +DLG+ G +FTW N R I + HL +SDH PI LV + R+ +
Subjt: QHENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNG--------------------TIYELNHLDYHQSDHRPIELVLSPQPGCWRRSSQ
Query: RISRFDETWLKQADLQQLVRDSWGLNREDPGLSAPQM------------LAQVSKR---------EANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQE
R++RF+E W D + ++R W E+ G +P LAQ SK+ A + A+ G +R L+S + ++ +L
Subjt: RISRFDETWLKQADLQQLVRDSWGLNREDPGLSAPQM------------LAQVSKR---------EANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQE
Query: EELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEW-------------------------RQDRAMILQDLQRSVDNEMNVDLLRPFT
+E++WKQRSR WLKEGD NT++FH A+ RQR N+I GL ++QG+W R + AM + D R V+ EMN L RPF
Subjt: EELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEW-------------------------RQDRAMILQDLQRSVDNEMNVDLLRPFT
Query: EEEILRALKQSHPHKA
EE+ +A+ Q HP K+
Subjt: EEEILRALKQSHPHKA
|
|
| A0A803PHH5 Uncharacterized protein | 4.0e-44 | 31.72 | Show/hide |
Query: SAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDEYH-WRLTGFYG------------FLRRICEIRR-----GGDFNALLYQH
S + +LGF CF V+ KGKSGGL LLW+ + S+LSFS+ HI +I +E WR TGFYG L R+ + GGDFN +L Q
Subjt: SAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWITWDEYH-WRLTGFYG------------FLRRICEIRR-----GGDFNALLYQH
Query: ENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIYE-------------------LNHLDYHQSDHRPIELVLSPQPGCWRRSSQRIS
E +G KP + F+ +D L ++ F GN FTWCN R I+E + HL+ SDH P+ L Q + + S
Subjt: ENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIYE-------------------LNHLDYHQSDHRPIELVLSPQPGCWRRSSQRIS
Query: R--FDETWLKQADLQQLVRDSWGLNREDPGLSAPQMLAQVS-------------KREANQKVQ------LAIEGLRGAGSRELLSQAEAQLEDVLQEEEL
R F+ W+++ +LV +WG+ +A Q+ Q++ K+E N++++ A+ S L + E L +EE
Subjt: R--FDETWLKQADLQQLVRDSWGLNREDPGLSAPQMLAQVS-------------KREANQKVQ------LAIEGLRGAGSRELLSQAEAQLEDVLQEEEL
Query: YWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEWRQDRAMI---------------------LQDLQR----SVDNEMNVDLLRPFTEEE
+WKQRSR +WLKEGD+NT++FHR+A+ R++ N I GL D W M+ L++ QR + E+N L+ PFT++E
Subjt: YWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIVGLTDDQGEWRQDRAMI---------------------LQDLQR----SVDNEMNVDLLRPFTEEE
Query: ILRALKQSHPHKA
+L+A++ PHKA
Subjt: ILRALKQSHPHKA
|
|
| M5XQU7 Uncharacterized protein | 4.9e-42 | 32.99 | Show/hide |
Query: SAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWI--TWDEYHWRLTGFYGF------------LRRICEIRR-----GGDFNALLYQ
S K LGF+ CF VD G SGGL L W S + ++ S S +HI + D HWRLTGFYG+ LR + R GDFN LLY
Subjt: SAKRALGFENCFCVDCKGKSGGLALLWNSSVTFSLLSFSNNHIHGWI--TWDEYHWRLTGFYGF------------LRRICEIRR-----GGDFNALLYQ
Query: HENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIYE-------------------LNHLDYHQSDHRPIELVLSPQPGCWRRSSQRI
+E EG +P+ ++ AF++ I L D+GF G FTW + R NG I E ++HL+ SDH PI L SP WRR S
Subjt: HENEGSRDKPLSELAAFQNVIDSYGLLDLGFVGNRFTWCNRRPNGTIYE-------------------LNHLDYHQSDHRPIELVLSPQPGCWRRSSQRI
Query: SRFDETWLKQADLQQLVRDSWGLNREDPGLSAPQMLAQVSKREANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRW
RF+ W + D + ++ ++W + ++ QVS + LL +QL+ +L EE +WKQRS+ WLKEGD+NTR+
Subjt: SRFDETWLKQADLQQLVRDSWGLNREDPGLSAPQMLAQVSKREANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRW
Query: FHRQASYRQRLNRIVGLTDDQGEWRQDRAMI-----------------------LQDLQRSVDNEMNVDLLRPFTEEEILRALKQSHPHKA
FH++AS R++ N + GL D+ G WR+D + + ++ V +MN LL + + EI A+ Q +P KA
Subjt: FHRQASYRQRLNRIVGLTDDQGEWRQDRAMI-----------------------LQDLQRSVDNEMNVDLLRPFTEEEILRALKQSHPHKA
|
|