| GenBank top hits | e value | %identity | Alignment |
|---|
| KAG7014759.1 hypothetical protein SDJN02_22388, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.1e-227 | 83.73 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP GAVAPAVEHRTPS T+ PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KFALSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+GSGSVTPNGVRQDS+LGSGT+TP DGLLLD+Q S++GCQND NHRVSFELTGEDVARCLANKS TSTN
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGE+H+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| TYK22975.1 Hydroxyproline-rich glycoprotein family protein isoform 1 [Cucumis melo var. makuwa] | 2.4e-227 | 83.73 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
MGS+NNSVDTVNAAATAIVSAEARVQP +P KRRWGS WSLYWCFG G QKNNKRI +A LVPEP GAVAPAVEHRTPS T +PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
SEP SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KF LSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+GMG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+GSGSVTPNGVRQDS+LGSGTLTPDGL LLDNQ S++GCQND TNHRVSFELTGEDVARCLANKS+TS R +ESP TST+
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q EN+E S EAETCEFFDIKT S A EKT EDD+CYQNQRAV LGSFKEFNF+QTKGE+HNT SIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| XP_022922938.1 uncharacterized protein LOC111430767 [Cucurbita moschata] | 3.7e-228 | 83.94 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP GAVAPAVEHRTPS T+ PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KFALSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+GSGSVTPNGVRQDS+LGSGT+TP DGLLLD+Q S++GCQND NHRVSFELTGEDVARCLANKS TSTN
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGEMH+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| XP_022984784.1 uncharacterized protein LOC111482967 [Cucurbita maxima] | 9.1e-227 | 83.53 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP GAVAPAVEHRTPS T+ PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KFALSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+ SGSVTPNGVRQDS+LGSGT+TP DGLLLD+Q S++GCQND NHRVSFELTGEDVARCLANKS TSTN
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGE+H+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| XP_038895848.1 uncharacterized protein LOC120084016 isoform X1 [Benincasa hispida] | 3.4e-229 | 84.74 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFG QKNNKRI +A LVPEP GAVAPAVEHRTPS T +PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
SEPPSNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAYETQLVSPPVFSAF TEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KF LSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDGIGMG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+GSGSVTPNGVRQDS+LGSGTLTPDGL LLD+Q S+SGCQND TNHRVSFELTGEDVARCLANKSMTS R +ESP TST+
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q EN+ESS EAETCE FDIKT STA EKTS +DD+CYQNQRA+ LGSFKEFNF+QTKGE++NT SIGAEWWANEKVAVKEA+PGNNWTFFP+LQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| TrEMBL top hits | e value | %identity | Alignment |
|---|
| A0A1S3AYC5 uncharacterized protein LOC103484098 | 7.6e-227 | 83.53 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
MGS+NNSVDTVNAAATAIVSAEARVQP +P KRRWGS WSLYWCFG G QKNNKRI +A LVPEP GAVAPAVEHRTPS T +PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
S P SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KF LSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+GMG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+GSGSVTPNGVRQDS+LGSGTLTPDGL LLDNQ S++GCQND TNHRVSFELTGEDVARCLANKS+TS R +ESP TST+
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q EN+E S EAETCEFFDIKT S A EKT EDD+CYQNQRAV LGSFKEFNF+QTKGE+HNT SIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| A0A5A7SWP4 Hydroxyproline-rich glycoprotein family protein isoform 1 | 7.6e-227 | 83.53 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
MGS+NNSVDTVNAAATAIVSAEARVQP +P KRRWGS WSLYWCFG G QKNNKRI +A LVPEP GAVAPAVEHRTPS T +PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
S P SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KF LSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+GMG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+GSGSVTPNGVRQDS+LGSGTLTPDGL LLDNQ S++GCQND TNHRVSFELTGEDVARCLANKS+TS R +ESP TST+
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q EN+E S EAETCEFFDIKT S A EKT EDD+CYQNQRAV LGSFKEFNF+QTKGE+HNT SIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| A0A5D3DHC1 Hydroxyproline-rich glycoprotein family protein isoform 1 | 1.2e-227 | 83.73 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
MGS+NNSVDTVNAAATAIVSAEARVQP +P KRRWGS WSLYWCFG G QKNNKRI +A LVPEP GAVAPAVEHRTPS T +PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSIT--IPFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
SEP SNTQSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAF TEPSTAP TPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KF LSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+GMG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+GSGSVTPNGVRQDS+LGSGTLTPDGL LLDNQ S++GCQND TNHRVSFELTGEDVARCLANKS+TS R +ESP TST+
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTPDGL--------LLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q EN+E S EAETCEFFDIKT S A EKT EDD+CYQNQRAV LGSFKEFNF+QTKGE+HNT SIGAEWWANEKV VKEASPGNNWTFFP+LQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| A0A6J1E856 uncharacterized protein LOC111430767 | 1.8e-228 | 83.94 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP GAVAPAVEHRTPS T+ PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KFALSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+GSGSVTPNGVRQDS+LGSGT+TP DGLLLD+Q S++GCQND NHRVSFELTGEDVARCLANKS TSTN
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGEMH+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| A0A6J1JBI8 uncharacterized protein LOC111482967 | 4.4e-227 | 83.53 | Show/hide |
Query: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
MGSMNNSVDTVNAAATAIVSAEARVQPP+P KRRWGS WSLYWCFGNG QKNNKRI +A LVPEP GAVAPAVEHRTPS T+ PFIAPPSSP SFLQ
Subjt: MGSMNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV--GAVAPAVEHRTPSITI--PFIAPPSSPTSFLQ
Query: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
SEPPSN QSPAGLLSLTALSVNNYS NGPASIFAIGPYAY+TQLVSPPVFSAFPTEPSTAPFTPPPESVQ TTPSSPEVP+AKLLTSSLSHTNKSFGTNQ
Subjt: SEPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQ
Query: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
KFALSHCDFQPYQPYPGSP +LISPGSVISNSGTSSPFPDKHPILEFRMA+APKLLGLEHFTTRKWISRMGSGSLTPDG G+GSRLGSGTLTPDG+ MG
Subjt: KFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGMG
Query: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
SR+ SGSVTPNGVRQDS+LGSGT+TP DGLLLD+Q S++GCQND NHRVSFELTGEDVARCLANKS TSTN
Subjt: SRMGSGSVTPNGVRQDSKLGSGTLTP--------DGLLLDNQ--------TSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNC
Query: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
Q +N+ESS EAE+CEFFDIKT STA EKTS EDD+CYQNQRAVNLGSFKEFNF+QTKGE+H+T SIGAEWWANEKVAVKEASPGNNWTFFPMLQ GVS
Subjt: QKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|
| Arabidopsis top hits | e value | %identity | Alignment |
|---|
| AT1G63720.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT5G52430.1) | 6.4e-53 | 51.71 | Show/hide |
Query: NNSVDTVNAAATAIVSAEARVQPPSP--SKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV-----GAVAPAVEHRTPSITIPFIAPPSSPTSFLQS
NN DT+NAAA+AI S++ R+ SP KR+W + WSL CFG+ Q+ KRI N+ LVPEPV + +R+ T+PFIAPPSSP SF QS
Subjt: NNSVDTVNAAATAIVSAEARVQPPSP--SKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV-----GAVAPAVEHRTPSITIPFIAPPSSPTSFLQS
Query: EPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPES----VQFTTPSSPEVPYAKLLTSSLSHTNKSFG
EPPS TQSP G+LS + L NN SIFAIGPYA+ETQLVSPPVFS + TEPS+AP TPP + + TTPSSPEVP+A+L S +H S+G
Subjt: EPPSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPES----VQFTTPSSPEVPYAKLLTSSLSHTNKSFG
Query: TNQKFALSHC-DFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPIL--EFRMAEAPKLL
KF +S +FQ YQ PGSP G LISP SG +SPFPD L F++++ PKLL
Subjt: TNQKFALSHC-DFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPIL--EFRMAEAPKLL
|
|
| AT1G76660.1 FUNCTIONS IN: molecular_function unknown | 2.3e-34 | 47.44 | Show/hide |
Query: KRRWGSFWSLYWCFGNGLQKNNKRISNAALVPE--PVGAVAPAVEHR--------TPSITIPFIAPPSSPTSFLQSEPPSNTQSPAGLLSLTALSVNNYS
++RWG ++ CF + QK KRI A+ +PE V A P H+ I + +APPSSP SF S PS TQSP LSL A S
Subjt: KRRWGSFWSLYWCFGNGLQKNNKRISNAALVPE--PVGAVAPAVEHR--------TPSITIPFIAPPSSPTSFLQSEPPSNTQSPAGLLSLTALSVNNYS
Query: QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPSGYL
GP +S++A GPYA+ETQLVSPPVFS F TEPSTAPFTPPPE + T PSSP+VPYA+ LTSS+ N G + D Q Y YPGSP+ L
Subjt: QNGP-ASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSLSHTNKSFGTNQKFALSHCDFQ-PYQPYPGSPSGYL
Query: ISPGSVISNSGTSSP
SP S S G SP
Subjt: ISPGSVISNSGTSSP
|
|
| AT4G25620.1 hydroxyproline-rich glycoprotein family protein | 2.3e-111 | 53.69 | Show/hide |
Query: MGSMNN-SVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV---GAVAPA--VEHRTPSITIPFIAPPSSPTSF
M S+NN SVDTVNAAA+AIVSAE+R QP S K+R GS+WSLYWCFG+ +KNNKRI +A LVPEP AVAP + SI +PFIAPPSSP SF
Subjt: MGSMNN-SVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPV---GAVAPA--VEHRTPSITIPFIAPPSSPTSF
Query: LQSEPP--SNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSL--SHTNK
L S PP S+T P L SLT N P S F IGPYA+ETQ V+PPVFSAF TEPSTAPFTPPPES PSSPEVP+A+LLTSSL + N
Subjt: LQSEPP--SNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPESVQFTTPSSPEVPYAKLLTSSL--SHTNK
Query: SFGTNQKFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTP
G NQKF+ +H +F+ Q YPGSP G LISPG SGTSSP+P K I+EFR+ E PK LG EHFT RKW SR GSGS+TP G GSRLGSG LTP
Subjt: SFGTNQKFALSHCDFQPYQPYPGSPSGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTP
Query: DGIGMGSRMGSGSVTPNGVRQDSKLGSGTLTP-DGLLLDNQTSQ----------SGCQND---ATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPT
D GS++ SG VTPNG ++ G LTP +G LLD+Q S+ S ND HRVSFELTGEDVARCLA+K
Subjt: DGIGMGSRMGSGSVTPNGVRQDSKLGSGTLTP-DGLLLDNQTSQ----------SGCQND---ATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPT
Query: STNCQKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVK-EASPGNNWTFFPMLQS
NR S E + E T+ E SE+ Q R+ + GS KEF F+ T EM I +EWWANEKVA K + SP N+WTFFP+L+S
Subjt: STNCQKENRESSGEAETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVK-EASPGNNWTFFPMLQS
Query: G
G
Subjt: G
|
|
| AT5G52430.1 hydroxyproline-rich glycoprotein family protein | 1.1e-105 | 52.25 | Show/hide |
Query: MNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPVGAVAPAV----EHRTPSITIPFIAPPSSPTSFLQSEP
+NNSV+TVNAAATAIV+AE+RVQP S K RWG WSLY CFG QKNNKRI NA LVPEPV + P V + ++ +PFIAPPSSP SFLQS+P
Subjt: MNNSVDTVNAAATAIVSAEARVQPPSPSKRRWGSFWSLYWCFGNGLQKNNKRISNAALVPEPVGAVAPAV----EHRTPSITIPFIAPPSSPTSFLQSEP
Query: PSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPE-SVQFTTPSSPEVPYAKLLTSSLSHTNK--SFGTNQ
S + SP G LSLT+ N +S P S+F +GPYA ETQ V+PPVFSAF TEPSTAP+TPPPE SV TTPSSPEVP+A+LLTSSL T + + G NQ
Subjt: PSNTQSPAGLLSLTALSVNNYSQNGPASIFAIGPYAYETQLVSPPVFSAFPTEPSTAPFTPPPE-SVQFTTPSSPEVPYAKLLTSSLSHTNK--SFGTNQ
Query: KFALSHCDFQPYQPYPGSP-SGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGM
KF+ SH +F+ Q PGSP G LISPGSVISNSGTSSP+P K P++EFR+ E PK LG EHFT RKW SR GSGS+TP G+G G L SG LTP+
Subjt: KFALSHCDFQPYQPYPGSP-SGYLISPGSVISNSGTSSPFPDKHPILEFRMAEAPKLLGLEHFTTRKWISRMGSGSLTPDGNGIGSRLGSGTLTPDGIGM
Query: GSRMGSGSVTPNGV-----RQDSKLGSGTLTPDGLLLDNQTSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNCQKENRESSGE
G + SG++TPN Q S++ S L N S G + +HRVSFELTGEDVARCLA+K NR S + + N + E ESS
Subjt: GSRMGSGSVTPNGV-----RQDSKLGSGTLTPDGLLLDNQTSQSGCQNDATNHRVSFELTGEDVARCLANKSMTSNRITAESPNPTSTNCQKENRESSGE
Query: AETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
DI+ + E + Q + ++GS KEF F+ TK E EKVA GN+W+FFP L+SGVS
Subjt: AETCEFFDIKTSSTAAEKTSEEDDRCYQNQRAVNLGSFKEFNFEQTKGEMHNTPSIGAEWWANEKVAVKEASPGNNWTFFPMLQSGVS
|
|