; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0113331 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0113331
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionFilamentous hemagglutinin
Genome locationCMiso1.1chr04:31903297..31907561
RNA-Seq ExpressionCmc04g0113331
SyntenyCmc04g0113331
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053895.1 Filamentous hemagglutinin [Cucumis melo var. makuwa]3.5e-23292.21Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVK---------------------------VAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
        HIFELEDNIFGEIPIPSVK                           VAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPIPSVK---------------------------VAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL

Query:  NESLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGT
        NESLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGT
Subjt:  NESLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGT

Query:  NLSSSKQRLKQLAHTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHH
        NLSSSKQRLKQLAHTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHS           HHHHHHHHHHHHHHHHRHHHHHHHHH
Subjt:  NLSSSKQRLKQLAHTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHH

Query:  HHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        HHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
Subjt:  HHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

TYK25511.1 Filamentous hemagglutinin [Cucumis melo var. makuwa]1.3e-21586.77Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV
        NTVFGKVKQVRLSFLNHSLGGGGNA                                                             PGTEEHKHAPKNGV
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV

Query:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
Subjt:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

XP_004136773.3 uncharacterized protein LOC101213172 isoform X1 [Cucumis sativus]6.6e-22393.28Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVG SSSELSDRNVENRCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRP+DSAYRDHDIVASFHA KPVPFLQ 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPIPSVKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV
        NTVFGKVKQVRLSFLNHSLGGGGNA SPSPAPLPHSHHH HHHHHHHHHHHHHHHHHHHH                HH+ AAYSPSPGTEEHKHAPKNGV
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV

Query:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRS TKLRK H+LGPIPSPSSSP SPYLR
Subjt:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

XP_008443610.1 PREDICTED: uncharacterized protein LOC103487165 [Cucumis melo]1.0e-23998.09Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHS---------HHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEE
        NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHS         HHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEE
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHS---------HHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEE

Query:  HKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        HKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
Subjt:  HKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

XP_031738527.1 uncharacterized protein LOC101213172 isoform X2 [Cucumis sativus]2.6e-25192.98Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVG SSSELSDRNVENRCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRP+DSAYRDHDIVASFHA KPVPFLQ 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPIPSVKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV
        NTVFGKVKQVRLSFLNHSLGGGGNA SPSPAPLPHSHHH HHHHHHHHHHHHHHHHHHHH                HH+ AAYSPSPGTEEHKHAPKNGV
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV

Query:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLRAADMQKILKEVPLPSYHHNFLLLQMFVFILFNGHLRYFY
        SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRS TKLRK H+LGPIPSPSSSP SPYLRA DMQKILKEVPL SYHHNFLLLQ+FVFILFNGHLRYFY
Subjt:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLRAADMQKILKEVPLPSYHHNFLLLQMFVFILFNGHLRYFY

Query:  LYGTYNQGDKTYM
         YG YNQGDKTYM
Subjt:  LYGTYNQGDKTYM

TrEMBL top hitse value%identityAlignment
A0A0A0LHD1 Uncharacterized protein5.4e-22391.54Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVG SSSELSDRNVENRCGGGGCSEIR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRP+DSAYRDHDIVASFHA KPVPFLQ 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPIPSVKVAILSLQSL GPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGST+DAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV
        NTVFGKVKQVRLSFLNHSLGGGGNA SPSPAPLPHSHHH HHHHHHHHHHHH                        HH+ AAYSPSPGTEEHKHAPKNGV
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV

Query:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRS TKLRK H+LGPIPSPSSSP SPYLR
Subjt:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

A0A1S3B8E9 uncharacterized protein LOC1034871654.9e-24098.09Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHS---------HHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEE
        NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHS         HHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEE
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHS---------HHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEE

Query:  HKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        HKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
Subjt:  HKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

A0A5A7UJM2 Filamentous hemagglutinin1.7e-23292.21Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVK---------------------------VAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
        HIFELEDNIFGEIPIPSVK                           VAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL
Subjt:  HIFELEDNIFGEIPIPSVK---------------------------VAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRL

Query:  NESLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGT
        NESLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGT
Subjt:  NESLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGT

Query:  NLSSSKQRLKQLAHTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHH
        NLSSSKQRLKQLAHTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHS           HHHHHHHHHHHHHHHHRHHHHHHHHH
Subjt:  NLSSSKQRLKQLAHTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHH

Query:  HHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        HHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
Subjt:  HHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

A0A5D3DPD6 Filamentous hemagglutinin6.4e-21686.77Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN

Query:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV
        NTVFGKVKQVRLSFLNHSLGGGGNA                                                             PGTEEHKHAPKNGV
Subjt:  NTVFGKVKQVRLSFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGV

Query:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
Subjt:  SSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

A0A6J1J074 uncharacterized protein LOC111482272 isoform X29.7e-19685.59Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN
        MGKSEEEQPLPVGVSSSELSD  V++RCGGGGC  IR+LIAVRCVFFLLLSAAVFLSAIFWLPPFLSYG+WPD+  DS YRDH+IVA F A KPVPFL+N
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQN

Query:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS
        HIFELEDNIFGEIP+P VKVA+LSLQSL G NVT I+F+VD DAKYSKIPPTSQSLIKETFETLVIN+PPLRLN SLFGNTSLFEVLKFPGGITIIPPQS
Subjt:  HIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQS

Query:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN
        AFLLQTAQIYFNFTLNYSIYQIQVNF+DL+SQLRSGLRLS YENLYVSLSNERGSTM APT+VQSSVLMAIGTN  SS QRLKQLA TITNSHSGNLGLN
Subjt:  AFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLN

Query:  NTVFGKVKQVRL-SFLNHSLGGGGNAWSPSPAPLPHS---HHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAP
        NTVFGKVKQVRL S LNHSL  GG A SPSPAPLPHS   H HHHHHHHHHHH HHH HHHHHHHHH HHHHHHHHHH +HHQ AAYSPSPGTEEHKHAP
Subjt:  NTVFGKVKQVRL-SFLNHSLGGGGNAWSPSPAPLPHS---HHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAP

Query:  KNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR
        KNG+SSAPEAGSSP+E P S+KRNYEATPP FRYGYK  STK+RK+ HLG IPSPSS P SPYLR
Subjt:  KNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10790.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT3G56590.2)7.7e-3632.54Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSE-IRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQ
        M K  +E  L +   + +L +     R  G  CS    +L+ +RC+  L+LS A+ LSAIFWL P  S   +  +   +   +  + ASF   KPV  + 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSE-IRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQ

Query:  NHIFELEDNIFGEIPIP-SVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPP
         H  ++E +I   I +  + KV +LSL      N T + FAV       +I   S SL++ +F  L      L+L  S FG  + F+VLKFPGGIT+ P 
Subjt:  NHIFELEDNIFGEIPIP-SVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPP

Query:  QSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLG
        + A +   A + F+ T+  SI  +Q   D L+      L L PYE+++  L+N++GST+  P   Q  V   +   L    QRL      I  S + NLG
Subjt:  QSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLG

Query:  LNNTVFGKVKQVRLS-FLNHSLGGGGNAWSPSPAP
        L+  VFG+VK +  S +L+  +       +P+P P
Subjt:  LNNTVFGKVKQVRLS-FLNHSLGGGGNAWSPSPAP

AT3G10810.1 zinc finger (C3HC4-type RING finger) family protein1.7e-8348.71Show/hide
Query:  MGKSEEEQPLPVGVSSSELSDRNVEN-RCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQ
        MGK+E++  L V        D  V N RC  G C  I   +  +C+F LLLS A+FLSA+F L PF    +  D  +D  +R H IVASF   +   FL 
Subjt:  MGKSEEEQPLPVGVSSSELSDRNVEN-RCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQ

Query:  NHIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQ
         +  +L+++IF E+   S+KV IL+++     N+TK+VF +D D  Y +I P S S IKE FE+++IN+  L+L +SLFG T LFEVLKFPGGIT+IPPQ
Subjt:  NHIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQ

Query:  SAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGL
        SAF LQ  +I FNFTLNYSI+QIQ+NF+ L+SQL++GL L+PYENLYVSLSN  GST+  PT V SSVL+ +GT  S+S  RLKQL  TIT S S NLGL
Subjt:  SAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGL

Query:  NNTVFGKVKQVRL-SFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKN
        NNT+FGKVKQVRL SFL +S     +  SPSP+P PHS HHHHHHHHHHHHHHHHH+HHHHHHH+                               +PK 
Subjt:  NNTVFGKVKQVRL-SFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKN

Query:  GVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLRA
            +P A  +P     SRKR   A PP    G +    + R Q    P P+PS+  P   L +
Subjt:  GVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLRKQHHLGPIPSPSSSPPSPYLRA

AT3G56590.1 hydroxyproline-rich glycoprotein family protein1.2e-8654.32Show/hide
Query:  MGKSE-EEQPLPVGVSSSELSDRNVENRCGGGG------CSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL + +  D  +D  ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSSELSDRNVENRCGGGG------CSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWK

Query:  PVPFLQNHIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGI
        P+ F+++++ +LE++I  EI  P  KV +L+L+ L   N T ++FA+D + + SKIP   +SLIK  FETLV  +   RL ESLFG    FEVLKFPGGI
Subjt:  PVPFLQNHIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGI

Query:  TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSH
        T+IPPQ  F LQ AQ+ FNFTLN+SIYQIQ NF++L+SQL+ G+ L+ YENLY++LSN RGST+  PT+V SSVL+  G     S  RLKQLA TIT+SH
Subjt:  TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSH

Query:  SGNLGLNNTVFGKVKQVRL-SFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHH
        S NLGLN+TVFGKVKQVRL S L HS      + +PSP+P P +H + HHH HHHHHHH
Subjt:  SGNLGLNNTVFGKVKQVRL-SFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHH

AT3G56590.2 hydroxyproline-rich glycoprotein family protein1.2e-8654.32Show/hide
Query:  MGKSE-EEQPLPVGVSSSELSDRNVENRCGGGG------CSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWK
        MGK+  EEQ LP  VS    S RN     GGGG      C  I    ++RCV  L  SAAVFLSA+FWLPPFL + +  D  +D  ++DH IVASF   K
Subjt:  MGKSE-EEQPLPVGVSSSELSDRNVENRCGGGG------CSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWK

Query:  PVPFLQNHIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGI
        P+ F+++++ +LE++I  EI  P  KV +L+L+ L   N T ++FA+D + + SKIP   +SLIK  FETLV  +   RL ESLFG    FEVLKFPGGI
Subjt:  PVPFLQNHIFELEDNIFGEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGI

Query:  TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSH
        T+IPPQ  F LQ AQ+ FNFTLN+SIYQIQ NF++L+SQL+ G+ L+ YENLY++LSN RGST+  PT+V SSVL+  G     S  RLKQLA TIT+SH
Subjt:  TIIPPQSAFLLQTAQIYFNFTLNYSIYQIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSH

Query:  SGNLGLNNTVFGKVKQVRL-SFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHH
        S NLGLN+TVFGKVKQVRL S L HS      + +PSP+P P +H + HHH HHHHHHH
Subjt:  SGNLGLNNTVFGKVKQVRL-SFLNHSLGGGGNAWSPSPAPLPHSHHHHHHHHHHHHHHH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAGAGTGAAGAAGAACAGCCACTGCCGGTTGGAGTGAGCTCTTCTGAGCTTTCTGACCGGAATGTGGAGAACAGATGCGGCGGCGGTGGGTGCTCTGAGATTCG
TAAACTGATTGCGGTGAGATGTGTGTTCTTCCTGTTACTATCAGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCGTTCCTATCCTATGGAAATTGGCCGGATC
GGCCTATTGATTCTGCTTATAGAGATCATGACATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTCCTTTTCTGCAAAACCATATTTTTGAGCTTGAGGATAACATTTTC
GGAGAAATACCCATACCTTCTGTCAAGGTGGCTATCCTCTCACTACAATCATTAAGTGGACCAAATGTAACAAAAATAGTTTTTGCGGTAGATTCTGATGCCAAGTATTC
AAAAATTCCCCCAACATCTCAAAGCTTAATCAAGGAAACCTTTGAAACATTGGTTATAAATGAACCTCCTCTCAGATTGAATGAATCTTTATTTGGCAATACATCCTTAT
TCGAGGTGTTGAAATTTCCCGGAGGAATAACTATTATTCCTCCTCAAAGTGCATTTCTTCTTCAGACAGCGCAGATCTATTTCAATTTTACGTTAAATTATTCTATATAT
CAAATTCAAGTGAATTTCGATGATCTTTCCAGCCAGCTGAGGTCGGGATTACGTCTATCTCCTTATGAGAATTTATATGTTAGCCTATCAAACGAGAGAGGCTCAACAAT
GGATGCACCCACTGTTGTCCAATCATCTGTCCTGATGGCAATTGGGACTAATTTATCTTCATCGAAACAAAGGCTGAAACAGTTGGCTCATACCATCACAAATTCTCATT
CAGGGAACCTTGGCCTGAACAACACCGTATTTGGTAAGGTCAAGCAGGTGCGTCTTTCATTTCTAAACCACTCTCTTGGTGGTGGTGGAAATGCATGGTCGCCTTCACCT
GCACCTCTGCCTCATTCCCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCGTCATCACCA
CCACCATCACCACCACCATCACCATAACCACCACCAGCATGCTGCATATTCACCAAGTCCCGGAACAGAGGAGCACAAACATGCACCGAAAAACGGGGTCTCATCTGCTC
CCGAAGCTGGTTCATCCCCAATGGAAGGTCCAACTTCAAGAAAAAGAAACTACGAAGCAACCCCACCTGCTTTTCGTTATGGATATAAAAGATCGTCAACAAAACTCAGA
AAACAACATCATTTAGGCCCTATTCCTTCTCCAAGCAGTTCTCCACCGTCGCCGTACTTACGAGCAGCGGACATGCAGAAAATTTTGAAAGAAGTTCCCCTTCCGTCTTA
CCACCACAATTTTCTTCTACTGCAGATGTTCGTGTTTATACTATTCAATGGACACTTGCGCTATTTCTACTTGTATGGCACGTATAACCAAGGAGATAAAACCTACATGT
AA
mRNA sequenceShow/hide mRNA sequence
GAAGAAGAAAGGAATGCTTATGCGCCTCACACGGTTTCTTCTTCAACACACTGAGCTGAATCACTCATTGATTTTTTGTTTTCTCTCTCTATGAATTAAAAAATTAAAAG
TGGAAGTAAGAGATCGCCAATCGCCATAACCCATCTTCCACGGCTCTTCATTTAAGTGCGCAATTAGTGCCTTCAATTTACATTTCTCTGTTTTTGCCGCCTCTTCTTCC
CTTTTTTGCTTCTCTTGTCAACTTCTTCCATTTCTTTTTCAAATTTTTATTTTTTGTTTTTTTTAGAAAATAAATTTTGGTAAAGAAAACTGATAGATGAATGGTTGATT
GATGATAGATAAACCCACCACTATAACCCCATTTAACTTCAAACACTATTCCTGTCATTTGTAGTTGTTGCGCCACTTAAATCCAAACCAAACCACCCCATTTCTCTCTC
TTTTTCTTCTCTCGAACCCTAGATTTTCTTTTCAAATTTGCTTCTGGGTTTGCGGTGGATTAGCCCCACTAGGTGGGATTTTTGAGCTTTTATCTTGTTGTGTTAGTTAT
TTGTTGTTCTGGGGTTGATTTGATTTGAATTTGAGCTGTAATGAGTGAAGATAATTCAGACCCAATTGAGGGAGGTGGCAATGAAGATTGTTAATCCATTTCACATGCAT
TGCTTCTATGGGAAAGAGTGAAGAAGAACAGCCACTGCCGGTTGGAGTGAGCTCTTCTGAGCTTTCTGACCGGAATGTGGAGAACAGATGCGGCGGCGGTGGGTGCTCTG
AGATTCGTAAACTGATTGCGGTGAGATGTGTGTTCTTCCTGTTACTATCAGCGGCTGTGTTTCTTTCTGCTATTTTTTGGCTGCCACCGTTCCTATCCTATGGAAATTGG
CCGGATCGGCCTATTGATTCTGCTTATAGAGATCATGACATAGTAGCAAGTTTTCATGCTTGGAAGCCAGTTCCTTTTCTGCAAAACCATATTTTTGAGCTTGAGGATAA
CATTTTCGGAGAAATACCCATACCTTCTGTCAAGGTGGCTATCCTCTCACTACAATCATTAAGTGGACCAAATGTAACAAAAATAGTTTTTGCGGTAGATTCTGATGCCA
AGTATTCAAAAATTCCCCCAACATCTCAAAGCTTAATCAAGGAAACCTTTGAAACATTGGTTATAAATGAACCTCCTCTCAGATTGAATGAATCTTTATTTGGCAATACA
TCCTTATTCGAGGTGTTGAAATTTCCCGGAGGAATAACTATTATTCCTCCTCAAAGTGCATTTCTTCTTCAGACAGCGCAGATCTATTTCAATTTTACGTTAAATTATTC
TATATATCAAATTCAAGTGAATTTCGATGATCTTTCCAGCCAGCTGAGGTCGGGATTACGTCTATCTCCTTATGAGAATTTATATGTTAGCCTATCAAACGAGAGAGGCT
CAACAATGGATGCACCCACTGTTGTCCAATCATCTGTCCTGATGGCAATTGGGACTAATTTATCTTCATCGAAACAAAGGCTGAAACAGTTGGCTCATACCATCACAAAT
TCTCATTCAGGGAACCTTGGCCTGAACAACACCGTATTTGGTAAGGTCAAGCAGGTGCGTCTTTCATTTCTAAACCACTCTCTTGGTGGTGGTGGAAATGCATGGTCGCC
TTCACCTGCACCTCTGCCTCATTCCCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCACCGTC
ATCACCACCACCATCACCACCACCATCACCATAACCACCACCAGCATGCTGCATATTCACCAAGTCCCGGAACAGAGGAGCACAAACATGCACCGAAAAACGGGGTCTCA
TCTGCTCCCGAAGCTGGTTCATCCCCAATGGAAGGTCCAACTTCAAGAAAAAGAAACTACGAAGCAACCCCACCTGCTTTTCGTTATGGATATAAAAGATCGTCAACAAA
ACTCAGAAAACAACATCATTTAGGCCCTATTCCTTCTCCAAGCAGTTCTCCACCGTCGCCGTACTTACGAGCAGCGGACATGCAGAAAATTTTGAAAGAAGTTCCCCTTC
CGTCTTACCACCACAATTTTCTTCTACTGCAGATGTTCGTGTTTATACTATTCAATGGACACTTGCGCTATTTCTACTTGTATGGCACGTATAACCAAGGAGATAAAACC
TACATGTAACATGCATATTTCAGGATGACCTCATAATTCAGATTAGAGTTACGGTAGCGAGACCGACGCAAAGGCGTGCTCCTGCTAGAGTTGATAAAATGATGTTTCCA
TGTATGTAAATACCAGATGAAGCAAGTTGAGAGATGCATTTTTGTAGGTCAAAGTCACAGAGGTGGCAGGCCTTTTATAATTGTGTATTGTGTTTCTCTTCTCCATAAAA
TGTGAGGAGGGAAGAAATCTGCTAAATGCGTCTCATTTTGTTTTGTTCTCCTATAATTGTTATAAGGTCAATCAAGCAATTACAATTTTTTTTCAATGCACAAAAAGCCT
CATTTAATACCAACAAAAAGAAAATAAACTCTATAACAACAAATTAAATGTCTACTATGCAAACTTCAAAGTTGAAATTAAAACAAAAAAATGATTCTTTGCATTGAGGT
CCT
Protein sequenceShow/hide protein sequence
MGKSEEEQPLPVGVSSSELSDRNVENRCGGGGCSEIRKLIAVRCVFFLLLSAAVFLSAIFWLPPFLSYGNWPDRPIDSAYRDHDIVASFHAWKPVPFLQNHIFELEDNIF
GEIPIPSVKVAILSLQSLSGPNVTKIVFAVDSDAKYSKIPPTSQSLIKETFETLVINEPPLRLNESLFGNTSLFEVLKFPGGITIIPPQSAFLLQTAQIYFNFTLNYSIY
QIQVNFDDLSSQLRSGLRLSPYENLYVSLSNERGSTMDAPTVVQSSVLMAIGTNLSSSKQRLKQLAHTITNSHSGNLGLNNTVFGKVKQVRLSFLNHSLGGGGNAWSPSP
APLPHSHHHHHHHHHHHHHHHHHHHHHHHHHHHRHHHHHHHHHHHNHHQHAAYSPSPGTEEHKHAPKNGVSSAPEAGSSPMEGPTSRKRNYEATPPAFRYGYKRSSTKLR
KQHHLGPIPSPSSSPPSPYLRAADMQKILKEVPLPSYHHNFLLLQMFVFILFNGHLRYFYLYGTYNQGDKTYM