; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0004704 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0004704
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionflocculation protein FLO11
Genome locationchr6:6236109..6239397
RNA-Seq ExpressionLag0004704
SyntenyLag0004704
Gene Ontology termsGO:0051211 - anisotropic cell growth (biological process)
GO:0055028 - cortical microtubule (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029244.1 hypothetical protein SDJN02_07582 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0084.47Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSI+S HSHDRD YSSYSRGS+ASSGDDDLDSLQS+PIS+LDNSLSKGGIS SNNKALA SKKHRIVSS+SAPKRSLDSTIRQLDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHR LISRNSSVTTSSNASSDHGT IALDTEGSDHNQ+D  NECEKMPYH DSHEEIFAFDKMDIV+EDPIH IKSLDSG  P
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  --GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIK
          GC+PVVTGDSS++ VIPDI STS SSHVQGGDFSEVV LE DT VCSRCGCRYRVID+EEN  N CPECSREEK +GMA+S N  SVTESLS LSS+K
Subjt:  --GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIK

Query:  YEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHN
        YEAD P N+V+S VI PDSSLA TD GESRIS SV N+EQDQASYPEQGPSYL+ENF SETPVEES QHSL NHLE+GQ AV G+QPNTESG QQPLQHN
Subjt:  YEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHN

Query:  DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEI
        DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIE RMQRQLSSRKGD+ENKK E+
Subjt:  DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEI

Query:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL
        SVKSH SEV+S+GTP+N H   SFETCKQEE VDFYV   ECFSSQG TMSS KPELASENAE+DD SSI  A VEEDK ECD  R LD CTS SSRED 
Subjt:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL

Query:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD
        SGGRSVSDK+ASVTTFDCS+LEGHN+LD   FEDEHTELPTH MTTISETE  QI ++I P SQNDL I+   PL EES +PSG DQ+L PSV  TEKSD
Subjt:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD

Query:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQR
        GILE STVIVDYQGRTKV RSLTLEEATDTILFCSSIVHDLA+SAATIAIEKEKEKENEVTLEASRPMVTILGKS  +R DLRHRTGGKRVMK QKPRQR
Subjt:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQR

Query:  RVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        RV+MSTKPPIAKTENDENTDESTI+NVGLPNQVDS KPPKLESKCNCSIM
Subjt:  RVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

XP_022961987.1 flocculation protein FLO11 [Cucurbita moschata]0.0e+0084.82Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSI+S HSHDRD YSSYSRGS+ASSGDDDLDSLQS+PIS+LDNSLSKGGIS SNNKALA SKKHRIVSS+SAPKRSLDSTIRQLDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHR LISRNSSVTTSSNASSDHGT IALDTEGSDHNQ+D  NECEKMPYH DSHEEIFAFDKMDIV+EDP H IKSLDSG  P
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  --GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIK
          GC+PVVTGDSS++AVIPDI STS SSHVQGGDFSEVV LE DT VCSRCGCRYRVID+EEN  N CPECSREEK +GMA+S N  SVTESLS LSS+K
Subjt:  --GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIK

Query:  YEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHN
        YEAD P N+V+S VI PDSSLA TD GESRIS SV N+EQDQAS+PEQGPSYL+ENF SETPVEES QHSL NHLEMGQ AV G+QPNTESG QQPLQHN
Subjt:  YEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHN

Query:  DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEI
        DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIE RMQRQLSSRKGD+ENKK E+
Subjt:  DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEI

Query:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL
        SVKSH SEV+S+GTP+N H   SFETCKQEE VDFYV   ECFSSQGTTMSS KPELASENAE+DD SSI  A VEEDK ECD  R LD CTS SSRED 
Subjt:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL

Query:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD
        SGGRSVSDK+ASVTTFDCS+LEGHN+LD   FEDEHTELPTH MTTISETE  QI ++I P SQNDLSI+   PL EES VPSG DQ+L PSV  TEKSD
Subjt:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD

Query:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQR
        GILE STVIVDYQGRTKV RSLTLEEATDTILFCSSIVHDLA+SAATIAIEKEKEKENEVTLEASRPMVTILGKS  +R DLRHRTGGKRVMK QKPRQR
Subjt:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQR

Query:  RVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        RV+MSTKPPIAKTENDENTDESTI+NVGLPNQVDS KPPKLESKCNCSIM
Subjt:  RVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

XP_022996635.1 flocculation protein FLO11 [Cucurbita maxima]0.0e+0084.79Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSI+S HSHDRDRYSSYSRGS+ASSGDDDLDSLQSIPIS+LDNSLSKGGIS SNNKALA SKKHRIVSS+SAPKRSLDSTIRQLDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHR LISRNSSVTTSSNASSDHGT IALDTEGSDHNQ+D  NECEK+PYH DSHEEIFAFDKMDIV+EDPIH IKSLDSG A 
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE
        GC+PVVTGDSS++AVIPDISSTS SSHVQGGDFSEVV LE DT VCSRCGCRYRVID+EEN  N CPECSREEK +GMA+S N  SVTESLS LSS+KYE
Subjt:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE

Query:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY
         D P N+V+S VI PDSSLA TD GES IS SV N+EQD+ASYPEQGPSYL+ENF SETPVEES QHSL NHLEMGQ AV  +QPNTESG QQPLQHNDY
Subjt:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY

Query:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEISV
        QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIE RMQRQLSSRKGD+ENKK E+SV
Subjt:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEISV

Query:  KSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDLSG
        KSH SEV+S+ TP+N H   S ETCKQEE VDFYV   ECFSSQGTTMSSQKPELASENAE+DD SSI  A VEEDK ECD  R LD CTS SSRED SG
Subjt:  KSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDLSG

Query:  GRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSDGI
        GRSVSDK+ASV TFDCS+LEGHN+LD   FEDEHTELPTH MTTISETE  QI ++I P SQNDLSI+S  PL EES VPSG DQ+L PSV  TEKSDGI
Subjt:  GRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSDGI

Query:  LEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQRRV
        LE STVIVDYQGR KV RSLTLEEATDTILFCSSIVHDLA+SAATIAIEKEKEKENEVTLEASRPMVTILGKS  +R DLRHRTGGKRVMK QKPRQRRV
Subjt:  LEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQRRV

Query:  DMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        +MSTKPPIAKTENDENTDESTI+NVGLPNQVDS KPPKLESKCNCSIM
Subjt:  DMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

XP_023545976.1 flocculation protein FLO11 [Cucurbita pepo subsp. pepo]0.0e+0084.82Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSI+S HSHDRD YSSYSRGS+ASSGDDDLDSLQSIPIS+LDNSLSKGGIS SNNKALA SKKHRIVSS+SAPKRSLDSTIRQLDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHR LISRNSSVTTSSNASSDHGT IALDTEGSDHNQ D  NECEKMPYH DSHEEIFAFDKMDIV+EDPIH IKSLDSG  P
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  --GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIK
          GC+PVVTGDSS++AVIPDI STS SSHVQGGDFSEVV LE DT VCSRCGCRYRVID+EEN  N CPECSREEK +GMA+S N  SVTESLS LSS+K
Subjt:  --GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIK

Query:  YEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHN
        YEAD P N+V+S +I PDSSLA TD GESRIS SV N+EQDQASYPEQGPSYL+ENF SETPVEES QHSL NHLEMGQ AV G+QPNTESG QQPLQ N
Subjt:  YEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHN

Query:  DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEI
        DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIE RMQRQLSSRKGD+ENKK E+
Subjt:  DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEI

Query:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL
        SVKSH SEV+S+GTP+N H   SFETCKQEE VDFYV   E FSSQGTTMSSQKPE ASENAE+DD SSI  A VEEDK ECD  R LD CTS SSRED 
Subjt:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL

Query:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD
        SGGRSVSDK+ASVTTFDCS+LEGHN+LD   FEDEHTELPTH MTTISETE  QI ++I P SQNDLSI+   PL EES VPSG DQ+L PSV  TEKSD
Subjt:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD

Query:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQR
        GILE STVIVDYQGRTKV RSLTLEEATDTILFCSSIVHDLA+SAATIAIEKEKEKENEVTLEASRPMVTILGKS  +R DLRHRTGGKRVMK QKPRQR
Subjt:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQR

Query:  RVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        RV+MSTKPPIAKTENDENTDESTI+NVGLPNQVDS KPPKLESKCNCSIM
Subjt:  RVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

XP_031737323.1 serine/arginine repetitive matrix protein 2 isoform X1 [Cucumis sativus]0.0e+0083.49Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSISSSHSHDRDRYSSYSRGS+ASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSS SAPKRSLDSTIR LDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHRSLISRNSSVTTSSNASSDHGT IALDTEGSD NQDDM NECEK+ YH +SHEEIFAFDKMDIV+EDPIHDIKSLDSG A 
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE
        GC+PVVTGDSS++AV+PDISSTS SSHVQG DFSE+V LE DTVVCSRCGCRYRV DTEEN+ANLCPECSREEK L +A+SENM +VTESLS LSS+KYE
Subjt:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE

Query:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY
         D P +KVE  VI PDS+LA  DLGESRIS  V NVEQDQASYPEQGPSY+ ENF +ETP EES QHSL+NHLE+GQSAV GNQP+T SG+QQPLQ NDY
Subjt:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY

Query:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQ--LSSRKGDVENKKGEI
        Q+LRFDS EGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSS+RQIEARMQRQ  LSSRKG++ENKKGEI
Subjt:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQ--LSSRKGDVENKKGEI

Query:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL
        SVKSH +E++SSG P++ H    FETCKQ+E VDFYV N EC S QGTT SSQK ELASEN ++DDTSSI+VAVVEEDKFE D  RILDTCTS+ SRED 
Subjt:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL

Query:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD
        SGGRSVSDK+ASVT  DCSKLEGHNML D  FEDE +E+ TH M TISETE  QI +++A  SQ+D+S +SM PLEEESVV SG DQ+LTPS+   EKSD
Subjt:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD

Query:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAI----EKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQK
        GILEESTVIVDYQG+TKVVRSLTLEEATDTILFCSSIVHDLA+SAATIAI    EKEKEKENEVTLEASRPMVTILGKSNT+RSDLRHRTGGKRVMK QK
Subjt:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAI----EKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQK

Query:  PRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        PRQRRV+MSTKPPIA TENDENTDESTIRNVGLPNQVD+ KPPKLESKCNCSIM
Subjt:  PRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

TrEMBL top hitse value%identityAlignment
A0A0A0LKP5 Uncharacterized protein0.0e+0083.49Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSISSSHSHDRDRYSSYSRGS+ASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSS SAPKRSLDSTIR LDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHRSLISRNSSVTTSSNASSDHGT IALDTEGSD NQDDM NECEK+ YH +SHEEIFAFDKMDIV+EDPIHDIKSLDSG A 
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE
        GC+PVVTGDSS++AV+PDISSTS SSHVQG DFSE+V LE DTVVCSRCGCRYRV DTEEN+ANLCPECSREEK L +A+SENM +VTESLS LSS+KYE
Subjt:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE

Query:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY
         D P +KVE  VI PDS+LA  DLGESRIS  V NVEQDQASYPEQGPSY+ ENF +ETP EES QHSL+NHLE+GQSAV GNQP+T SG+QQPLQ NDY
Subjt:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY

Query:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQ--LSSRKGDVENKKGEI
        Q+LRFDS EGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSS+RQIEARMQRQ  LSSRKG++ENKKGEI
Subjt:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQ--LSSRKGDVENKKGEI

Query:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL
        SVKSH +E++SSG P++ H    FETCKQ+E VDFYV N EC S QGTT SSQK ELASEN ++DDTSSI+VAVVEEDKFE D  RILDTCTS+ SRED 
Subjt:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL

Query:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD
        SGGRSVSDK+ASVT  DCSKLEGHNML D  FEDE +E+ TH M TISETE  QI +++A  SQ+D+S +SM PLEEESVV SG DQ+LTPS+   EKSD
Subjt:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD

Query:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAI----EKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQK
        GILEESTVIVDYQG+TKVVRSLTLEEATDTILFCSSIVHDLA+SAATIAI    EKEKEKENEVTLEASRPMVTILGKSNT+RSDLRHRTGGKRVMK QK
Subjt:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAI----EKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQK

Query:  PRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        PRQRRV+MSTKPPIA TENDENTDESTIRNVGLPNQVD+ KPPKLESKCNCSIM
Subjt:  PRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

A0A1S3BAA3 uncharacterized protein LOC103487758 isoform X10.0e+0082.44Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSISSSHSHDRDRYSSYSRGS+ASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKA AFSKK RI+SS SAPKRSLDSTIR LDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHRSLISRNSSVTTSSNASSDHGT IALDTEG DHNQDDM NECEK+PYH DSHEEIFAFDK+DIV+EDPIHDIKSLD G A 
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE
        GC PVVTGDSS++AV+PDISSTS SSHVQG DFSE+V LE DTVVCSRCGCRYRV DTEEN+ NLC ECSREEK L +A+SENM +V ESLS LSS+KYE
Subjt:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE

Query:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY
         D P +KVE  VI PDS+LA  D+GESRIS SV NVEQDQ SYPEQGPSY++ENF +ETPVEES QHSL+NHLE+GQSAV GNQ +T SG+QQPLQ NDY
Subjt:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY

Query:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQ--LSSRKGDVENKKGEI
        Q+LRFDS EGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSS+RQIEAR+QRQ  LSSRKGD+ENKKGEI
Subjt:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQ--LSSRKGDVENKKGEI

Query:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL
        SVKSH +E++S+G P+N H    FETCKQ+E VDFYV N EC S QGTT SSQKPELASEN ++DDTSSI+VAVVEEDKFE D  RILDTCTS  SRED 
Subjt:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL

Query:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD
        SGGRSVSDK+ASVTT DCSKLEGHNML D  FEDE +EL T  M TISETE  +I +++A  SQ+D+S +S   LEEESVVPSG DQ+L PS+  TEKSD
Subjt:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD

Query:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAI----EKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQK
        GILEESTVIVDYQG+TKVVRSLTLEEATDTILFCSSIV DLA+SAATIAI    EKEKEKENEVTLEASRPMVTILGKSNT+RSDLRHRTGGKRVMK QK
Subjt:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAI----EKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQK

Query:  PRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        PRQRRV+MSTKPPIA TENDENTDESTIRNV LPNQVD+ KPPKLESKCNCSIM
Subjt:  PRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

A0A5A7UYP7 Serine/arginine repetitive matrix protein 20.0e+0082.67Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSISSSHSHDRDRYSSYSRGS+ASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKA AFSKK RI+SS SAPKRSLDSTIR LDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHRSLISRNSSVTTSSNASSDHGT IALDTEG DHNQDDM NECEK+PYH DSHEEIFAFDK+DIV+EDPIHDIKSLD G A 
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE
        GC PVVTGDSS++AV+PDISSTS SSHVQG DFSE+V LE DTVVCSRCGCRYRV DTEEN+ NLC ECSREEK L +A+SENM +V ESLS LSS+KYE
Subjt:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE

Query:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY
         D P +KVE  VI PDS+LA  D+GESRIS SV NVEQDQ SYPEQGPSY++ENF +ETPVEES QHSL+NHLE+GQSAV GNQ +T SG+QQPLQ NDY
Subjt:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY

Query:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQ--LSSRKGDVENKKGEI
        Q+LRFDS EGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSS+RQIEAR+QRQ  LSSRKGD+ENKKGEI
Subjt:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQ--LSSRKGDVENKKGEI

Query:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL
        SVKSH +E++S+G P+N H    FETCKQ+E VDFYV N EC S QGTT SSQKPELASEN ++DDTSSI+VAVVEEDKFE D  RILDTCTS  SRED 
Subjt:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL

Query:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD
        SGGRSVSDK+ASVTT DCSKLEGHNML D  FEDE +EL TH M TISETE  QI +++A  SQ+D+S +S   LEEESVVPSG DQ+L PS+  TEKSD
Subjt:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD

Query:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAI----EKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQK
        GILEESTVIVDYQG+TKVVRSLTLEEATDTILFCSSIV DLA+SAATIAI    EKEKEKENEVTLEASRPMVTILGKSNT+RSDLRHRTGGKRVMK QK
Subjt:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAI----EKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQK

Query:  PRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        PRQRRV+MSTKPPIA TENDENTDESTIRNV LPNQVD+ KPPKLESKCNCSIM
Subjt:  PRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

A0A6J1HBJ8 flocculation protein FLO110.0e+0084.82Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSI+S HSHDRD YSSYSRGS+ASSGDDDLDSLQS+PIS+LDNSLSKGGIS SNNKALA SKKHRIVSS+SAPKRSLDSTIRQLDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHR LISRNSSVTTSSNASSDHGT IALDTEGSDHNQ+D  NECEKMPYH DSHEEIFAFDKMDIV+EDP H IKSLDSG  P
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  --GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIK
          GC+PVVTGDSS++AVIPDI STS SSHVQGGDFSEVV LE DT VCSRCGCRYRVID+EEN  N CPECSREEK +GMA+S N  SVTESLS LSS+K
Subjt:  --GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIK

Query:  YEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHN
        YEAD P N+V+S VI PDSSLA TD GESRIS SV N+EQDQAS+PEQGPSYL+ENF SETPVEES QHSL NHLEMGQ AV G+QPNTESG QQPLQHN
Subjt:  YEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHN

Query:  DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEI
        DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIE RMQRQLSSRKGD+ENKK E+
Subjt:  DYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEI

Query:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL
        SVKSH SEV+S+GTP+N H   SFETCKQEE VDFYV   ECFSSQGTTMSS KPELASENAE+DD SSI  A VEEDK ECD  R LD CTS SSRED 
Subjt:  SVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDL

Query:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD
        SGGRSVSDK+ASVTTFDCS+LEGHN+LD   FEDEHTELPTH MTTISETE  QI ++I P SQNDLSI+   PL EES VPSG DQ+L PSV  TEKSD
Subjt:  SGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSD

Query:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQR
        GILE STVIVDYQGRTKV RSLTLEEATDTILFCSSIVHDLA+SAATIAIEKEKEKENEVTLEASRPMVTILGKS  +R DLRHRTGGKRVMK QKPRQR
Subjt:  GILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQR

Query:  RVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        RV+MSTKPPIAKTENDENTDESTI+NVGLPNQVDS KPPKLESKCNCSIM
Subjt:  RVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

A0A6J1K7D2 flocculation protein FLO110.0e+0084.79Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP
        MSPTASRSI+S HSHDRDRYSSYSRGS+ASSGDDDLDSLQSIPIS+LDNSLSKGGIS SNNKALA SKKHRIVSS+SAPKRSLDSTIRQLDRKSPNMFRP
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRP

Query:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP
        LLSSVPST FY GKASSAHR LISRNSSVTTSSNASSDHGT IALDTEGSDHNQ+D  NECEK+PYH DSHEEIFAFDKMDIV+EDPIH IKSLDSG A 
Subjt:  LLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAP

Query:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE
        GC+PVVTGDSS++AVIPDISSTS SSHVQGGDFSEVV LE DT VCSRCGCRYRVID+EEN  N CPECSREEK +GMA+S N  SVTESLS LSS+KYE
Subjt:  GCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYE

Query:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY
         D P N+V+S VI PDSSLA TD GES IS SV N+EQD+ASYPEQGPSYL+ENF SETPVEES QHSL NHLEMGQ AV  +QPNTESG QQPLQHNDY
Subjt:  ADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDY

Query:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEISV
        QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIE RMQRQLSSRKGD+ENKK E+SV
Subjt:  QTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEISV

Query:  KSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDLSG
        KSH SEV+S+ TP+N H   S ETCKQEE VDFYV   ECFSSQGTTMSSQKPELASENAE+DD SSI  A VEEDK ECD  R LD CTS SSRED SG
Subjt:  KSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDLSG

Query:  GRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSDGI
        GRSVSDK+ASV TFDCS+LEGHN+LD   FEDEHTELPTH MTTISETE  QI ++I P SQNDLSI+S  PL EES VPSG DQ+L PSV  TEKSDGI
Subjt:  GRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPDSQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSDGI

Query:  LEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQRRV
        LE STVIVDYQGR KV RSLTLEEATDTILFCSSIVHDLA+SAATIAIEKEKEKENEVTLEASRPMVTILGKS  +R DLRHRTGGKRVMK QKPRQRRV
Subjt:  LEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTILGKSNTDRSDLRHRTGGKRVMKLQKPRQRRV

Query:  DMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM
        +MSTKPPIAKTENDENTDESTI+NVGLPNQVDS KPPKLESKCNCSIM
Subjt:  DMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27850.1 unknown protein7.3e-8434.87Show/hide
Query:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLD--RKSPNMF
        +SP+ASRS+SSSHSH+RDR+SS S+GSVASSGDDDL SLQSIP+   + ++SK      N++    S+  +++S  SAP+R  +S +RQ++  +   +MF
Subjt:  MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLD--RKSPNMF

Query:  RPLLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHD--------
        RPL SS+PST  Y GK SS++  ++ R+S+ T  SN+SS   T    D +G D       +E E + Y  D HEE  AF  +++ NE   H+        
Subjt:  RPLLSSVPSTAFYLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHD--------

Query:  -IKSLDSGSAPGCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTE
         +  +D      CE     + SHQ    + SST  S HV G +F E V LE    VC RCG  Y   +   +  N+CPEC  E  ++     E  +  T 
Subjt:  -IKSLDSGSAPGCEPVVTGDSSHQAVIPDISSTSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTE

Query:  SLSRLSSIKYEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTES
        S  +LS   ++ +N +     PVI    SL    + E  I ++ + +EQ   SY EQ   +L E+ +S    E++    ++N+ +  QS+ G    +  +
Subjt:  SLSRLSSIKYEADNPINKVESPVILPDSSLATTDLGESRISKSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTES

Query:  GFQQPLQHNDYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKG
           Q    + +  +   S     + +++KRS S K PV+Q    +  T SY+  S++RD   SLRSS    + SASSS D+ SS +  + + RQ S    
Subjt:  GFQQPLQHNDYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISYDDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKG

Query:  DVENKKGEISVKSHSSEVSSSGTPSNPHLAL------SFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEE--------
        D+E  + + + KS S+  SSSG  S+   AL      SFE C  +        + E  +        +   + ++  E+     I+  V+ +        
Subjt:  DVENKKGEISVKSHSSEVSSSGTPSNPHLAL------SFETCKQEEIVDFYVGNSECFSSQGTTMSSQKPELASENAEADDTSSIAVAVVEE--------

Query:  -DKFECDNYR-ILDTCTSQ-SSREDLSGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLI--------APDSQND
         D   C+N   + +T  S+  +RE  +  RS SD  AS  T DC     H+ L + D  +    L T   TT SE E +     I         P+S+ +
Subjt:  -DKFECDNYR-ILDTCTSQ-SSREDLSGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLI--------APDSQND

Query:  LSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSDGILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASR
        L+  +++   E+S+V +  D + + S  + E    IL+ESTV+V+  G  K  RSLTLEEATDTILFCSSIVHDL + AATIA++K K+   E   E   
Subjt:  LSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSDGILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASR

Query:  PMVTILGKSNTDRSDLRHRTGG-----KRVMKLQKPRQRRVDMSTKPPIAKTENDENTDEST-IRNVGL-PNQVDSMK-PPKLESKCNCSIM
        P VT+LGKSN +R+   +  GG     KR  K  K  +++ +   K  + + ENDEN  E+  +RNVG+ P++ ++MK PP LESKCNCSIM
Subjt:  PMVTILGKSNTDRSDLRHRTGG-----KRVMKLQKPRQRRVDMSTKPPIAKTENDENTDEST-IRNVGL-PNQVDSMK-PPKLESKCNCSIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCAACTGCTTCTAGAAGCATCAGCTCATCTCATAGTCATGATCGAGATCGTTATAGCTCTTACAGCCGAGGTTCTGTGGCCTCATCTGGTGATGATGATTTAGA
CTCCCTGCAGTCAATTCCTATTAGCAGTTTAGATAATTCATTGTCAAAAGGAGGGATTTCATTTTCAAATAACAAAGCTCTGGCCTTCTCCAAGAAACACAGAATAGTTT
CTTCTACTTCTGCTCCTAAAAGATCCCTTGATTCCACTATTCGACAATTGGATCGGAAAAGCCCGAATATGTTTAGGCCACTTTTATCAAGTGTCCCCAGCACAGCTTTT
TACTTGGGCAAGGCAAGTTCTGCACATCGTTCTCTGATTTCCAGGAACTCCTCAGTCACAACTAGCAGTAATGCAAGTTCTGATCATGGTACAAGCATTGCACTTGATAC
AGAAGGGAGTGACCACAACCAGGATGATATGGCAAATGAATGTGAAAAAATGCCATATCATGATGACAGTCATGAAGAAATATTCGCATTTGATAAGATGGATATTGTTA
ATGAAGATCCAATTCATGATATTAAGTCACTTGATAGTGGCTCTGCACCCGGTTGTGAGCCGGTTGTCACTGGAGATAGCAGTCACCAAGCTGTTATCCCAGACATAAGT
TCCACTTCTTACTCTTCTCATGTTCAGGGAGGTGATTTTTCAGAGGTTGTTTTCCTTGAAGATGATACAGTTGTGTGTTCCAGATGTGGTTGCAGATATCGTGTTATCGA
CACCGAGGAAAATAATGCTAATCTTTGTCCGGAATGCAGCAGGGAAGAAAAATATCTAGGCATGGCCCTTTCAGAAAATATGGCTTCAGTTACTGAAAGCTTGTCAAGGT
TGTCTTCAATAAAATATGAAGCAGATAACCCTATCAATAAGGTGGAGTCACCGGTGATATTGCCTGACTCTTCCCTAGCTACTACTGATTTGGGTGAATCTAGGATTTCT
AAGTCGGTGGACAATGTTGAGCAAGATCAAGCATCTTATCCTGAACAAGGCCCGAGTTACCTGAAAGAAAACTTCCTCTCAGAAACACCAGTGGAGGAAAGTCATCAGCA
TAGCCTTGTCAACCATTTGGAGATGGGCCAATCAGCTGTTGGAGGGAACCAGCCCAACACTGAATCCGGATTTCAGCAACCTCTTCAGCATAACGACTATCAAACTTTGA
GATTTGATTCATCAGAAGGTGCAGGCATTTCTATATTGTTAAAGAGGTCAAGCAGCAGTAAGGGACCTGTCGTACAGGGAAGAACTTTTACTGCAAGTACCATATCGTAT
GATGATCTATCTTTTGCAAGAGATAGCATGAGCAGTTTGAGAAGCTCTATTGGACACAGCAGTTTTTCTGCATCATCATCGGCTGATTTTAGCTCATCCAGACAGATTGA
AGCCCGAATGCAACGACAACTAAGTTCAAGGAAAGGGGATGTGGAGAACAAAAAAGGTGAAATCAGCGTGAAATCTCATAGCTCTGAGGTATCTTCTTCTGGAACACCTT
CGAATCCTCATCTTGCATTAAGTTTTGAAACTTGTAAGCAAGAGGAAATTGTGGATTTTTATGTGGGCAATTCAGAATGTTTTTCAAGCCAGGGAACTACCATGTCTTCT
CAAAAACCTGAACTAGCTTCTGAAAATGCCGAAGCAGATGATACCTCTTCAATTGCGGTAGCTGTTGTAGAGGAGGATAAATTTGAATGCGACAACTATAGAATACTGGA
TACTTGTACCTCACAATCGTCTAGGGAGGACTTATCAGGTGGTAGGAGTGTCTCAGATAAAGAAGCATCAGTTACAACTTTTGACTGTTCAAAATTGGAGGGACACAACA
TGCTTGATGATGGTGATTTTGAAGATGAACATACAGAACTTCCCACTCATTCAATGACGACAATATCAGAAACAGAAACAAAACAAATAGTTCAGTTGATAGCTCCTGAT
TCACAAAATGATTTGTCAATAATGTCAATGAATCCCTTGGAGGAGGAATCTGTGGTTCCTAGTGGATCCGACCAGAATTTAACTCCATCTGTTAATATTACTGAAAAATC
TGATGGCATCCTAGAAGAATCAACGGTAATTGTCGATTATCAAGGCAGAACAAAGGTGGTAAGAAGCTTGACACTTGAAGAGGCAACGGATACAATACTTTTCTGTAGCT
CCATTGTTCATGATCTAGCCTTCTCGGCTGCTACCATAGCAATCGAAAAGGAAAAGGAAAAGGAAAATGAGGTTACATTGGAAGCCTCACGACCAATGGTTACCATTTTG
GGGAAATCTAACACTGACAGAAGTGATCTGCGCCATAGAACAGGTGGTAAACGTGTAATGAAATTACAAAAACCCAGACAACGGCGTGTCGACATGAGTACAAAGCCCCC
TATTGCTAAGACTGAGAATGACGAGAACACGGACGAGTCCACCATTCGAAATGTAGGTCTTCCTAACCAAGTGGACAGCATGAAACCTCCAAAGCTGGAATCCAAATGCA
ACTGCAGTATAATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCAACTGCTTCTAGAAGCATCAGCTCATCTCATAGTCATGATCGAGATCGTTATAGCTCTTACAGCCGAGGTTCTGTGGCCTCATCTGGTGATGATGATTTAGA
CTCCCTGCAGTCAATTCCTATTAGCAGTTTAGATAATTCATTGTCAAAAGGAGGGATTTCATTTTCAAATAACAAAGCTCTGGCCTTCTCCAAGAAACACAGAATAGTTT
CTTCTACTTCTGCTCCTAAAAGATCCCTTGATTCCACTATTCGACAATTGGATCGGAAAAGCCCGAATATGTTTAGGCCACTTTTATCAAGTGTCCCCAGCACAGCTTTT
TACTTGGGCAAGGCAAGTTCTGCACATCGTTCTCTGATTTCCAGGAACTCCTCAGTCACAACTAGCAGTAATGCAAGTTCTGATCATGGTACAAGCATTGCACTTGATAC
AGAAGGGAGTGACCACAACCAGGATGATATGGCAAATGAATGTGAAAAAATGCCATATCATGATGACAGTCATGAAGAAATATTCGCATTTGATAAGATGGATATTGTTA
ATGAAGATCCAATTCATGATATTAAGTCACTTGATAGTGGCTCTGCACCCGGTTGTGAGCCGGTTGTCACTGGAGATAGCAGTCACCAAGCTGTTATCCCAGACATAAGT
TCCACTTCTTACTCTTCTCATGTTCAGGGAGGTGATTTTTCAGAGGTTGTTTTCCTTGAAGATGATACAGTTGTGTGTTCCAGATGTGGTTGCAGATATCGTGTTATCGA
CACCGAGGAAAATAATGCTAATCTTTGTCCGGAATGCAGCAGGGAAGAAAAATATCTAGGCATGGCCCTTTCAGAAAATATGGCTTCAGTTACTGAAAGCTTGTCAAGGT
TGTCTTCAATAAAATATGAAGCAGATAACCCTATCAATAAGGTGGAGTCACCGGTGATATTGCCTGACTCTTCCCTAGCTACTACTGATTTGGGTGAATCTAGGATTTCT
AAGTCGGTGGACAATGTTGAGCAAGATCAAGCATCTTATCCTGAACAAGGCCCGAGTTACCTGAAAGAAAACTTCCTCTCAGAAACACCAGTGGAGGAAAGTCATCAGCA
TAGCCTTGTCAACCATTTGGAGATGGGCCAATCAGCTGTTGGAGGGAACCAGCCCAACACTGAATCCGGATTTCAGCAACCTCTTCAGCATAACGACTATCAAACTTTGA
GATTTGATTCATCAGAAGGTGCAGGCATTTCTATATTGTTAAAGAGGTCAAGCAGCAGTAAGGGACCTGTCGTACAGGGAAGAACTTTTACTGCAAGTACCATATCGTAT
GATGATCTATCTTTTGCAAGAGATAGCATGAGCAGTTTGAGAAGCTCTATTGGACACAGCAGTTTTTCTGCATCATCATCGGCTGATTTTAGCTCATCCAGACAGATTGA
AGCCCGAATGCAACGACAACTAAGTTCAAGGAAAGGGGATGTGGAGAACAAAAAAGGTGAAATCAGCGTGAAATCTCATAGCTCTGAGGTATCTTCTTCTGGAACACCTT
CGAATCCTCATCTTGCATTAAGTTTTGAAACTTGTAAGCAAGAGGAAATTGTGGATTTTTATGTGGGCAATTCAGAATGTTTTTCAAGCCAGGGAACTACCATGTCTTCT
CAAAAACCTGAACTAGCTTCTGAAAATGCCGAAGCAGATGATACCTCTTCAATTGCGGTAGCTGTTGTAGAGGAGGATAAATTTGAATGCGACAACTATAGAATACTGGA
TACTTGTACCTCACAATCGTCTAGGGAGGACTTATCAGGTGGTAGGAGTGTCTCAGATAAAGAAGCATCAGTTACAACTTTTGACTGTTCAAAATTGGAGGGACACAACA
TGCTTGATGATGGTGATTTTGAAGATGAACATACAGAACTTCCCACTCATTCAATGACGACAATATCAGAAACAGAAACAAAACAAATAGTTCAGTTGATAGCTCCTGAT
TCACAAAATGATTTGTCAATAATGTCAATGAATCCCTTGGAGGAGGAATCTGTGGTTCCTAGTGGATCCGACCAGAATTTAACTCCATCTGTTAATATTACTGAAAAATC
TGATGGCATCCTAGAAGAATCAACGGTAATTGTCGATTATCAAGGCAGAACAAAGGTGGTAAGAAGCTTGACACTTGAAGAGGCAACGGATACAATACTTTTCTGTAGCT
CCATTGTTCATGATCTAGCCTTCTCGGCTGCTACCATAGCAATCGAAAAGGAAAAGGAAAAGGAAAATGAGGTTACATTGGAAGCCTCACGACCAATGGTTACCATTTTG
GGGAAATCTAACACTGACAGAAGTGATCTGCGCCATAGAACAGGTGGTAAACGTGTAATGAAATTACAAAAACCCAGACAACGGCGTGTCGACATGAGTACAAAGCCCCC
TATTGCTAAGACTGAGAATGACGAGAACACGGACGAGTCCACCATTCGAAATGTAGGTCTTCCTAACCAAGTGGACAGCATGAAACCTCCAAAGCTGGAATCCAAATGCA
ACTGCAGTATAATGTGA
Protein sequenceShow/hide protein sequence
MSPTASRSISSSHSHDRDRYSSYSRGSVASSGDDDLDSLQSIPISSLDNSLSKGGISFSNNKALAFSKKHRIVSSTSAPKRSLDSTIRQLDRKSPNMFRPLLSSVPSTAF
YLGKASSAHRSLISRNSSVTTSSNASSDHGTSIALDTEGSDHNQDDMANECEKMPYHDDSHEEIFAFDKMDIVNEDPIHDIKSLDSGSAPGCEPVVTGDSSHQAVIPDIS
STSYSSHVQGGDFSEVVFLEDDTVVCSRCGCRYRVIDTEENNANLCPECSREEKYLGMALSENMASVTESLSRLSSIKYEADNPINKVESPVILPDSSLATTDLGESRIS
KSVDNVEQDQASYPEQGPSYLKENFLSETPVEESHQHSLVNHLEMGQSAVGGNQPNTESGFQQPLQHNDYQTLRFDSSEGAGISILLKRSSSSKGPVVQGRTFTASTISY
DDLSFARDSMSSLRSSIGHSSFSASSSADFSSSRQIEARMQRQLSSRKGDVENKKGEISVKSHSSEVSSSGTPSNPHLALSFETCKQEEIVDFYVGNSECFSSQGTTMSS
QKPELASENAEADDTSSIAVAVVEEDKFECDNYRILDTCTSQSSREDLSGGRSVSDKEASVTTFDCSKLEGHNMLDDGDFEDEHTELPTHSMTTISETETKQIVQLIAPD
SQNDLSIMSMNPLEEESVVPSGSDQNLTPSVNITEKSDGILEESTVIVDYQGRTKVVRSLTLEEATDTILFCSSIVHDLAFSAATIAIEKEKEKENEVTLEASRPMVTIL
GKSNTDRSDLRHRTGGKRVMKLQKPRQRRVDMSTKPPIAKTENDENTDESTIRNVGLPNQVDSMKPPKLESKCNCSIM