; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0022201 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0022201
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionFilamentous hemagglutinin
Genome locationchr04:7462100..7466967
RNA-Seq ExpressionPI0022201
SyntenyPI0022201
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053895.1 Filamentous hemagglutinin [Cucumis melo var. makuwa]3.3e-25488.19Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELS RNVE+RCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVK---------------------------VAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
        HIFELEDNIFGEIP+P VK                           VAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPMPFVK---------------------------VAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL

Query:  NVSLFGNTSLFEVLKFPGGVTIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGT
        N SLFGNTSLFEVLKFPGG+TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GT
Subjt:  NVSLFGNTSLFEVLKFPGGVTIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGT

Query:  NSSSSQQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHS------HHHHHHHHHHRHHHHHHHH----HHQDTAYS
        N SSS+QRLKQLA TITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNA SPSPAPL HS      HHHHHHHHHHRHHHHHHHH    HHQ  AYS
Subjt:  NSSSSQQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHS------HHHHHHHHHHRHHHHHHHH----HHQDTAYS

Query:  PSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLS
        PSPGTEEHKHAPKN VSSAP+AGSSPME PTS+KRNYEATPPAFR+GYKRSSTKLRKQHHLGPIPSPSSSP SPYLRVGLPAPVSDSISASSPLSGVVLS
Subjt:  PSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLS

Query:  NVQPPNTGSGRAENFERSSPSV-PPQFSCEYSVIPHSHIISQ
        NVQPPNTGSG AENFERSSPSV PPQFSCEYS IPHSHIISQ
Subjt:  NVQPPNTGSGRAENFERSSPSV-PPQFSCEYSVIPHSHIISQ

TYK25511.1 Filamentous hemagglutinin [Cucumis melo var. makuwa]6.1e-24087.52Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELS RNVE+RCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFPGG+TIIPPQS
Subjt:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GTN SSS+QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNY
        NTVFGKVKQVRLSFLNHSLGGGGNA                                        PGTEEHKHAPKN VSSAP+AGSSPME PTS+KRNY
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNY

Query:  EATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSV-PPQFSCEYSVIPHS
        EATPPAFR+GYKRSSTKLRKQHHLGPIPSPSSSP SPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSG AENFERSSPSV PPQFSCEYS IPHS
Subjt:  EATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSV-PPQFSCEYSVIPHS

Query:  HIISQ
        HIISQ
Subjt:  HIISQ

XP_004136773.3 uncharacterized protein LOC101213172 isoform X1 [Cucumis sativus]1.4e-24792.74Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVG SSSELS RNVE+RCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRP+DSAYRDHDIVASFHA KPVPFLQ 
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFPGG+TIIPPQS
Subjt:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GTN SSS+QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHS-----HHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTS
        NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPL HS     HHHHHHHHHH HHHHHHHHHH+D AYSPSPGTEEHKHAPKN VSSAP+AGSSPME PTS
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHS-----HHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTS

Query:  KKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSV-PPQFS
        +KRNYEATPPAFR+GYKRS TKLRK H+LGPIPSPSSSPSSPYLRVG PAPVSDSISASSPLSGVVLSNVQPPNTGSG AENFERSSPSV PPQFS
Subjt:  KKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSV-PPQFS

XP_008443610.1 PREDICTED: uncharacterized protein LOC103487165 [Cucumis melo]5.5e-24989.06Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELS RNVE+RCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFPGG+TIIPPQS
Subjt:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GTN SSS+QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHH------------------------------HHQDTAYSPSPGTEE
        NTVFGKVKQVRLSFLNHSLGGGGNA SPSPAPL HSHHHHHHHHHH HHHHHHHH                              HHQ  AYSPSPGTEE
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHH------------------------------HHQDTAYSPSPGTEE

Query:  HKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNT
        HKHAPKN VSSAP+AGSSPME PTS+KRNYEATPPAFR+GYKRSSTKLRKQHHLGPIPSPSSSP SPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNT
Subjt:  HKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNT

Query:  GSGRAENFERSSPSV-PPQFS
        GSG AENFERSSPSV PPQFS
Subjt:  GSGRAENFERSSPSV-PPQFS

XP_031738527.1 uncharacterized protein LOC101213172 isoform X2 [Cucumis sativus]1.5e-22292.58Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVG SSSELS RNVE+RCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRP+DSAYRDHDIVASFHA KPVPFLQ 
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFPGG+TIIPPQS
Subjt:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GTN SSS+QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHS-----HHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTS
        NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPL HS     HHHHHHHHHH HHHHHHHHHH+D AYSPSPGTEEHKHAPKN VSSAP+AGSSPME PTS
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHS-----HHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTS

Query:  KKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLR
        +KRNYEATPPAFR+GYKRS TKLRK H+LGPIPSPSSSPSSPYLR
Subjt:  KKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLR

TrEMBL top hitse value%identityAlignment
A0A0A0LHD1 Uncharacterized protein2.1e-24693.08Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVG SSSELS RNVE+RCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRP+DSAYRDHDIVASFHA KPVPFLQ 
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFPGG+TIIPPQS
Subjt:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GTN SSS+QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNY
        NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPL HSHHH HHHH   HHHHHHHHHH+D AYSPSPGTEEHKHAPKN VSSAP+AGSSPME PTS+KRNY
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNY

Query:  EATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSV-PPQFS
        EATPPAFR+GYKRS TKLRK H+LGPIPSPSSSPSSPYLRVG PAPVSDSISASSPLSGVVLSNVQPPNTGSG AENFERSSPSV PPQFS
Subjt:  EATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSV-PPQFS

A0A1S3B8E9 uncharacterized protein LOC1034871652.7e-24989.06Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELS RNVE+RCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFPGG+TIIPPQS
Subjt:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GTN SSS+QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHH------------------------------HHQDTAYSPSPGTEE
        NTVFGKVKQVRLSFLNHSLGGGGNA SPSPAPL HSHHHHHHHHHH HHHHHHHH                              HHQ  AYSPSPGTEE
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHH------------------------------HHQDTAYSPSPGTEE

Query:  HKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNT
        HKHAPKN VSSAP+AGSSPME PTS+KRNYEATPPAFR+GYKRSSTKLRKQHHLGPIPSPSSSP SPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNT
Subjt:  HKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNT

Query:  GSGRAENFERSSPSV-PPQFS
        GSG AENFERSSPSV PPQFS
Subjt:  GSGRAENFERSSPSV-PPQFS

A0A5A7UJM2 Filamentous hemagglutinin1.6e-25488.19Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELS RNVE+RCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVK---------------------------VAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
        HIFELEDNIFGEIP+P VK                           VAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPMPFVK---------------------------VAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL

Query:  NVSLFGNTSLFEVLKFPGGVTIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGT
        N SLFGNTSLFEVLKFPGG+TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GT
Subjt:  NVSLFGNTSLFEVLKFPGGVTIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGT

Query:  NSSSSQQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHS------HHHHHHHHHHRHHHHHHHH----HHQDTAYS
        N SSS+QRLKQLA TITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNA SPSPAPL HS      HHHHHHHHHHRHHHHHHHH    HHQ  AYS
Subjt:  NSSSSQQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHS------HHHHHHHHHHRHHHHHHHH----HHQDTAYS

Query:  PSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLS
        PSPGTEEHKHAPKN VSSAP+AGSSPME PTS+KRNYEATPPAFR+GYKRSSTKLRKQHHLGPIPSPSSSP SPYLRVGLPAPVSDSISASSPLSGVVLS
Subjt:  PSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLS

Query:  NVQPPNTGSGRAENFERSSPSV-PPQFSCEYSVIPHSHIISQ
        NVQPPNTGSG AENFERSSPSV PPQFSCEYS IPHSHIISQ
Subjt:  NVQPPNTGSGRAENFERSSPSV-PPQFSCEYSVIPHSHIISQ

A0A5D3DPD6 Filamentous hemagglutinin3.0e-24087.52Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELS RNVE+RCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS
        HIFELEDNIFGEIP+P VKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLN SLFGNTSLFEVLKFPGG+TIIPPQS
Subjt:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMA+GTN SSS+QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNY
        NTVFGKVKQVRLSFLNHSLGGGGNA                                        PGTEEHKHAPKN VSSAP+AGSSPME PTS+KRNY
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNY

Query:  EATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSV-PPQFSCEYSVIPHS
        EATPPAFR+GYKRSSTKLRKQHHLGPIPSPSSSP SPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSG AENFERSSPSV PPQFSCEYS IPHS
Subjt:  EATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSV-PPQFSCEYSVIPHS

Query:  HIISQ
        HIISQ
Subjt:  HIISQ

A0A6J1J074 uncharacterized protein LOC111482272 isoform X23.0e-21679.81Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELS   V+SRCGGGGC  IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYG+WPD+  DS YRDH+IVA F A KPVPFL+N
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS
        HIFELEDNIFGEIP+PFVKVA+LSLQSLGG NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLN SLFGNTSLFEVLKFPGG+TIIPPQS
Subjt:  HIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFNDL+SQLRSGLRLS YENLYVSLSNERGST+ APT+VQSSVLMA+GTNSS+  QRLKQLAQTITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLN

Query:  NTVFGKVKQVRL-SFLNHSLGGGGNARSPSPAPLLHS------------------------HHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAP
        NTVFGKVKQVRL S LNHSL  GG ARSPSPAPL HS                        HHHHHHHHHH HHHHHH HHHQD AYSPSPGTEEHKHAP
Subjt:  NTVFGKVKQVRL-SFLNHSLGGGGNARSPSPAPLLHS------------------------HHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAP

Query:  KNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRA
        KN +SSAP+AGSSP+E P SKKRNYEATPP FR+GYK  STK+RK+ HLG IPSPSS PSSPYLRVGLPAPV+ SISASSPL GV LSNVQPP  G    
Subjt:  KNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRA

Query:  ENFERSSPSV-PPQFSCEYSVIPHS
           +RS+PSV PPQFS    V  H+
Subjt:  ENFERSSPSV-PPQFSCEYSVIPHS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)4.6e-3633.04Show/hide
Query:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSE-IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQ
        M K  +E  L +   + +L       R  G  CS    RL+ +RC+  L+LS A+ LSAIFWL P  S   +  +   +   +  + ASF   KPV  + 
Subjt:  MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSE-IRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQ

Query:  NHIFELEDNIFGEIPMP-FVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPP
         H  ++E +I   I +    KV +LSL   G  N T + FAV       +I   S SL++ +F  L      L+L  S FG  + F+VLKFPGG+T+ P 
Subjt:  NHIFELEDNIFGEIPMP-FVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPP

Query:  QSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLG
        + A +   A + F+ T+  SI  +Q   + L+      L L PYE+++  L+N++GST+  P   Q  V     T      QRL    Q I  S + NLG
Subjt:  QSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLG

Query:  LNNTVFGKVKQVRLS-FLNHSLGGGGNARSPSPAPLLHS
        L+  VFG+VK +  S +L+  +       +P+P P L S
Subjt:  LNNTVFGKVKQVRLS-FLNHSLGGGGNARSPSPAPLLHS

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein2.1e-9247.13Show/hide
Query:  MGKSEEEQPLPV--GVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFL
        MGK+E++  L V  G ++ + + RN  +RC  G C  I   +  +C+F LLLS A+FLSA+F L PF    +  D  +D  +R H IVASF   +   FL
Subjt:  MGKSEEEQPLPV--GVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFL

Query:  QNHIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPP
          +  +L+++IF E+    +KV IL+++     N+TK+VF +D D  Y +I P S S IKE FE+++IN+  L+L  SLFG T LFEVLKFPGG+T+IPP
Subjt:  QNHIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPP

Query:  QSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLG
        QSAF LQ  +I FNFTLNYSI+QIQ+NFN L+SQL++GL L+PYENLYVSLSN  GSTV  PT V SSVL+ VGT++SS   RLKQL  TIT S S NLG
Subjt:  QSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLG

Query:  LNNTVFGKVKQVRL-SFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKK
        LNNT+FGKVKQVRL SFL +S     + +SPSP+P  HS HHHHHHHHH HHHHHHH+HH             H  +PK     +P A  +P     S+K
Subjt:  LNNTVFGKVKQVRL-SFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKK

Query:  RNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDS----ISASSPLSGVVLSN-VQPPNTGSGRAENFERSSPSVPPQFSCE
        R   A PP    G +    + R Q    P P+PS+   +P+ ++  PAP+S +    +  S+PL  VV ++  QPP T        E + P  P   S  
Subjt:  RNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPVSDS----ISASSPLSGVVLSN-VQPPNTGSGRAENFERSSPSVPPQFSCE

Query:  YSVIP
          V+P
Subjt:  YSVIP

AT3G56590.1 hydroxyproline-rich glycoprotein family protein8.3e-9446.26Show/hide
Query:  MGKSE-EEQPLPVGVSSSELSGRNVESRCGGGG------CSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL + +  D  +D  ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSSELSGRNVESRCGGGG------CSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWK

Query:  PVPFLQNHIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGV
        P+ F+++++ +LE++I  EI  P  KV +L+L+ LG  N T ++FA+D + + SKIP   +SLIK  FETLV  +   RL  SLFG    FEVLKFPGG+
Subjt:  PVPFLQNHIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGV

Query:  TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSH
        T+IPPQ  F LQ AQ+ FNFTLN+SIYQIQ NF +L+SQL+ G+ L+ YENLY++LSN RGSTV  PT+V SSVL+  G++S     RLKQLAQTIT+SH
Subjt:  TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSH

Query:  SGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDP
        S NLGLN+TVFGKVKQVRLS +        +  SPSP P           H + HHH HHHHHH + A  PS               S P  G +P   P
Subjt:  SGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDP

Query:  TSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPV-SDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSVPP
        T         PP   +  +R        HH  P P+P+   S P+     PAP    +I  SSPL  VV +++ PP+  S  +E     SPS  P
Subjt:  TSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPV-SDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSVPP

AT3G56590.2 hydroxyproline-rich glycoprotein family protein8.3e-9446.26Show/hide
Query:  MGKSE-EEQPLPVGVSSSELSGRNVESRCGGGG------CSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL + +  D  +D  ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSSELSGRNVESRCGGGG------CSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWK

Query:  PVPFLQNHIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGV
        P+ F+++++ +LE++I  EI  P  KV +L+L+ LG  N T ++FA+D + + SKIP   +SLIK  FETLV  +   RL  SLFG    FEVLKFPGG+
Subjt:  PVPFLQNHIFELEDNIFGEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGV

Query:  TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSH
        T+IPPQ  F LQ AQ+ FNFTLN+SIYQIQ NF +L+SQL+ G+ L+ YENLY++LSN RGSTV  PT+V SSVL+  G++S     RLKQLAQTIT+SH
Subjt:  TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSH

Query:  SGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDP
        S NLGLN+TVFGKVKQVRLS +        +  SPSP P           H + HHH HHHHHH + A  PS               S P  G +P   P
Subjt:  SGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSPAPLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDP

Query:  TSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPV-SDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSVPP
        T         PP   +  +R        HH  P P+P+   S P+     PAP    +I  SSPL  VV +++ PP+  S  +E     SPS  P
Subjt:  TSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLRVGLPAPV-SDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSVPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGAGTGAAGAAGAACAGCCACTACCGGTTGGAGTGAGCTCCTCTGAGCTTTCTGGCCGGAATGTGGAGAGCAGATGCGGCGGCGGTGGGTGCTCTGAGATTCG
TAGACTGATTGCGGTGAGATGTGTGTTCTTCCTGTTACTATCAGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCGTTCCTATCCTATGGAAATTGGCCGGATC
GGCCTATTGATTCTGCTTATAGAGATCATGACATAGTAGCAAGTTTTCACGCTTGGAAGCCAGTTCCTTTTCTGCAAAACCATATTTTTGAGCTTGAGGATAACATTTTC
GGAGAAATACCCATGCCTTTTGTCAAGGTGGCTATCCTCTCACTACAATCATTAGGTGGACCAAATGTAACAAAAATAGTTTTTGCGGTAGATTCTGATGCCAAGTATTC
AAAAATTCCCCCAACATCTCAAAGTTTAATCAAGGAAACCTTTGAAACATTGGTTATAAATGAACCTCCTCTCAGATTGAATGTATCTTTATTTGGCAATACATCCTTAT
TCGAGGTGTTGAAATTTCCTGGAGGAGTAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACGGCACAGATATATTTCAATTTTACGTTAAATTATTCGATATAT
CAAATTCAAGTGAATTTCAATGATCTTTCCAGCCAGCTGAGGTCGGGATTACGTCTATCACCTTATGAGAATTTATATGTTAGCCTATCGAACGAAAGAGGTTCAACAGT
GGATGCACCCACTGTTGTCCAATCATCTGTCCTGATGGCAGTTGGGACCAATTCATCTTCATCGCAACAAAGGCTGAAACAGTTGGCTCAAACCATCACAAATTCTCATT
CAGGAAACCTTGGCCTGAACAACACCGTATTTGGTAAGGTCAAGCAGGTGCGTCTTTCATTTCTAAACCACTCTCTTGGTGGTGGTGGAAATGCACGGTCGCCTTCACCT
GCGCCTCTGCTTCATTCCCACCACCACCACCACCATCACCATCATCACCGCCACCACCACCACCACCACCATCACCACCACCAGGATACTGCATATTCACCAAGTCCTGG
AACAGAGGAGCACAAACATGCACCGAAGAACTGGGTCTCATCTGCTCCCAAAGCTGGTTCATCCCCAATGGAAGATCCAACTTCAAAAAAAAGAAACTACGAAGCAACCC
CACCTGCTTTTCGTTTTGGATATAAAAGGTCGTCAACAAAACTCAGAAAGCAACATCATTTAGGCCCTATTCCTTCTCCAAGCAGTTCTCCGTCGTCACCATACTTACGA
GTAGGCCTGCCAGCACCCGTCTCTGATTCTATTTCTGCATCTAGTCCTCTGTCAGGTGTAGTTCTATCTAATGTACAGCCTCCAAATACAGGCAGCGGACGTGCAGAAAA
TTTTGAAAGAAGTTCCCCTTCAGTACCACCACAATTTTCTTGTGAGTATAGTGTCATCCCTCATTCTCACATTATTTCACAGCGCACTTATTGCCTTTCTTTTCCCCAAG
AGACTTCTCCTTTTGATTTTTTAACAGTGGTTAGTATGATTGCGATTGGCCTATAG
mRNA sequenceShow/hide mRNA sequence
ATTTTGGTAAAGAAAACTGATAGATGAATGGTTGATTGATGATAGATAAACCCACCACTATAACCCCATTTAACTTCAAACACTATTCCTGTCATTAGTAGTTGTTGCGC
CACTTAAATCCAAACCAAACCACCCCATTTCTCTCTCTTTTTCTCTCGAACCCTAGATTTTCTTTTCAAATTTGCTTCTGGGTTTGTGGTGGATTAGCCCCACGAGGTGG
GATTTTGAGCTTTTATCTTGTTGTGTAAGTTAGAGTTGTTCTGGGGTTGATTTGATTTGAATTTGAGCTGTAATGGGTGAAGATAATTCAGACCCAATTGAGGGAGGCGG
CAATGAAGATTGTTAATCCATTTCACATGCATTGCTTCTATGGGAAAGAGTGAAGAAGAACAGCCACTACCGGTTGGAGTGAGCTCCTCTGAGCTTTCTGGCCGGAATGT
GGAGAGCAGATGCGGCGGCGGTGGGTGCTCTGAGATTCGTAGACTGATTGCGGTGAGATGTGTGTTCTTCCTGTTACTATCAGCGGCTGTGTTTCTTTCTGCTATTTTTT
GGCTGCCACCGTTCCTATCCTATGGAAATTGGCCGGATCGGCCTATTGATTCTGCTTATAGAGATCATGACATAGTAGCAAGTTTTCACGCTTGGAAGCCAGTTCCTTTT
CTGCAAAACCATATTTTTGAGCTTGAGGATAACATTTTCGGAGAAATACCCATGCCTTTTGTCAAGGTGGCTATCCTCTCACTACAATCATTAGGTGGACCAAATGTAAC
AAAAATAGTTTTTGCGGTAGATTCTGATGCCAAGTATTCAAAAATTCCCCCAACATCTCAAAGTTTAATCAAGGAAACCTTTGAAACATTGGTTATAAATGAACCTCCTC
TCAGATTGAATGTATCTTTATTTGGCAATACATCCTTATTCGAGGTGTTGAAATTTCCTGGAGGAGTAACTATTATTCCTCCTCAGAGTGCATTTCTTCTGCAGACGGCA
CAGATATATTTCAATTTTACGTTAAATTATTCGATATATCAAATTCAAGTGAATTTCAATGATCTTTCCAGCCAGCTGAGGTCGGGATTACGTCTATCACCTTATGAGAA
TTTATATGTTAGCCTATCGAACGAAAGAGGTTCAACAGTGGATGCACCCACTGTTGTCCAATCATCTGTCCTGATGGCAGTTGGGACCAATTCATCTTCATCGCAACAAA
GGCTGAAACAGTTGGCTCAAACCATCACAAATTCTCATTCAGGAAACCTTGGCCTGAACAACACCGTATTTGGTAAGGTCAAGCAGGTGCGTCTTTCATTTCTAAACCAC
TCTCTTGGTGGTGGTGGAAATGCACGGTCGCCTTCACCTGCGCCTCTGCTTCATTCCCACCACCACCACCACCATCACCATCATCACCGCCACCACCACCACCACCACCA
TCACCACCACCAGGATACTGCATATTCACCAAGTCCTGGAACAGAGGAGCACAAACATGCACCGAAGAACTGGGTCTCATCTGCTCCCAAAGCTGGTTCATCCCCAATGG
AAGATCCAACTTCAAAAAAAAGAAACTACGAAGCAACCCCACCTGCTTTTCGTTTTGGATATAAAAGGTCGTCAACAAAACTCAGAAAGCAACATCATTTAGGCCCTATT
CCTTCTCCAAGCAGTTCTCCGTCGTCACCATACTTACGAGTAGGCCTGCCAGCACCCGTCTCTGATTCTATTTCTGCATCTAGTCCTCTGTCAGGTGTAGTTCTATCTAA
TGTACAGCCTCCAAATACAGGCAGCGGACGTGCAGAAAATTTTGAAAGAAGTTCCCCTTCAGTACCACCACAATTTTCTTGTGAGTATAGTGTCATCCCTCATTCTCACA
TTATTTCACAGCGCACTTATTGCCTTTCTTTTCCCCAAGAGACTTCTCCTTTTGATTTTTTAACAGTGGTTAGTATGATTGCGATTGGCCTATAGGGGAATTATATATAT
GGTTTTGGTTAAGAGCAAGGGTTACCTTAGGAGATTTGGAGTGTTAGGCAGGTGCGAAGATGTGTCAACTTATAATTGAGTTAGATTTAAACCTCTAGCATCTTAGTTTA
GTTTCGCCAAGAGTTGTAATGTTGGCAATTTTTTGTTTTCTTATTCATGGAGGTAAGGTTGATTAAGTTGTAATCTAACATGCTCGTTCCCAGCTTCTGTAGGTGTTCGT
GTTTATACTATTCAATGGACACTTGCGCTATTTCTACTTGTATGGCATGTATAACCAAGGAGATAAAACCTACATGTAACATGCATATTTCAAGATGACCCCATAATTCA
GATTAGAGTTGCGGTAGCGAGATCGACGCAAAGGCGTGCTCCTGCTAGAGTTGATAAAATGATGTTTCAATGTATGTAAATACCAGATCAAGCAAGTTGGGAGATGCATT
TTTCTAGGTCAAAGTCACAGAGGTGGCAGGCCTTTTGTAATTGAGTATTGTGTATTGTGTTTTCTCTTCTCCATAAAATGTGAGGAGGGAAGAAATCAGCTAAATGCGTC
CCATTTTGTTTTGTTCTCAACTTATAATTATTATAAGGTAAATCAAGCAATTACAATTTTTTTTCAATTCACAAACAAAAAGCCTCATTTAAGCCACCTAGTTTTTTAGT
GTTGTCTTTTTTTTTATTAATTTTATTTTCATGATAATACCAACAAAAAGAAAAAAAAAGTATATAACAACAAATTAAATGTCTACTATGCAAACTTCTAAAGTTAATAT
TAAAACTAAATAAATGATTCTTTCCATTGAAGTCCTTGAGTGATGTCGTATTTATCAAGATAGCCTATGTCAAACATTCCCCATTCTTCATATCTGAATTAATGAATCCT
TTAATGTGTTATACTTAGTCAGTCTTGCTATATGACTTTTCTTCTATTTCTTAGAAAAAAAAACAAAATGACATATAATTACTATTTTTTATATATATATTTTAAAAATA
AACCATTCAAAATTAAAATTGTGAACTTTAAAATTTGAAGTTTTTTTTTTGATTCATTGACTCATATTGCATTCATTAATTATGTGGCATTACTATTGTTCAAAGTTAGT
TACTTTGTTGCTTTACCATTTGAATTTCTTAGCTTTTCTTTTAGCTTTTCCTTGTAAAAAATTGGACGGATATATTTTTTTTTGGGAGAGAATTGGATGGATATGATTAA
GAGAACATATTAGGGGTAACTATTAGAGTGGACTTTATTAGGAATTTTGGTGGTACACTTTTGACTCTTACTTTGATCAATTGGTTTTGCAAATAGTTTCATTTTTTTTC
TATTCATAACAATTCTCCTCTAAAAATAATTAAAAGGATTGTTGTTAATTTAATAAAACCTTACTTCATAATTATTTTAAATATAAATTATAAAATCCTAAATACTACAT
TGAAATTAGGTACAAATCATTGAAAACAAATTTATAATATAAAG
Protein sequenceShow/hide protein sequence
MGKSEEEQPLPVGVSSSELSGRNVESRCGGGGCSEIRRLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQNHIFELEDNIF
GEIPMPFVKVAILSLQSLGGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNVSLFGNTSLFEVLKFPGGVTIIPPQSAFLLQTAQIYFNFTLNYSIY
QIQVNFNDLSSQLRSGLRLSPYENLYVSLSNERGSTVDAPTVVQSSVLMAVGTNSSSSQQRLKQLAQTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNARSPSP
APLLHSHHHHHHHHHHRHHHHHHHHHHQDTAYSPSPGTEEHKHAPKNWVSSAPKAGSSPMEDPTSKKRNYEATPPAFRFGYKRSSTKLRKQHHLGPIPSPSSSPSSPYLR
VGLPAPVSDSISASSPLSGVVLSNVQPPNTGSGRAENFERSSPSVPPQFSCEYSVIPHSHIISQRTYCLSFPQETSPFDFLTVVSMIAIGL