; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002594 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002594
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHyccin
Genome locationscaffold6:3291179..3292453
RNA-Seq ExpressionSpg002594
SyntenySpg002594
Gene Ontology termsGO:0046854 - phosphatidylinositol phosphorylation (biological process)
GO:0072659 - protein localization to plasma membrane (biological process)
GO:0005829 - cytosol (cellular component)
GO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR018619 - Hyccin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576864.1 Family With Sequence Similarity 126 Member B-like protein, partial [Cucurbita argyrosperma subsp. sororia]5.6e-21292.31Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTT  +NP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+SG+QAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQIS MPSWSKLEFCRSAASWAGQ+CCCKREFDKED  EI GFSEKRALE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NSDD EPK +RIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

XP_004137170.1 uncharacterized protein LOC101215901 [Cucumis sativus]6.7e-21392.74Show/hide
Query:  MDFHRNPSLNNRHSPSP-SSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS
        MDFHRNPS+NNRHS SP SSSASSTT  HNP   TATA+AD+DPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS
Subjt:  MDFHRNPSLNNRHSPSP-SSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS

Query:  ALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSP
        ALSSS+SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPV+VSIPDLSQPSLYHSP
Subjt:  ALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSP

Query:  RTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADE
          KPNSGAQAQ RPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQ+CCC REFDKEDG ++GGFSEKRALEY DE
Subjt:  RTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADE

Query:  IEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAK-
        IED SEEMG+LQIEKCGNNS+DSEPKG RIPLPWELLQPVLRILGHCLLAPLNSQDVKD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR AAK 
Subjt:  IEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAK-

Query:  -AATANASSNANTPSKDKKPEILLVSK
         AA AN+SSNANTPSKDKKPEILLVSK
Subjt:  -AATANASSNANTPSKDKKPEILLVSK

XP_022984223.1 uncharacterized protein LOC111482604 [Cucurbita maxima]4.3e-21292.54Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTT  HNP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+S AQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQI  MPSWSKLEFCRSAASWAGQ+CCCKREFDKED  EI GFSEKRALE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NSDDSEPK +RIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

XP_023552230.1 uncharacterized protein LOC111809964 [Cucurbita pepo subsp. pepo]4.3e-21292.54Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTT  HNP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+S AQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQIS MPSWSKLEFCRSAASWAGQ+CCCKREFDKED  EI GFSEKRALE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NSDD EPK +RIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

XP_038905210.1 uncharacterized protein LOC120091307 [Benincasa hispida]4.2e-21593.26Show/hide
Query:  MDFHRNPSLNNRHSPSP-SSSASSTTVPHNPSAT---TATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SP SSSASSTT PHNPSA+   TATA+AD+DPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYS 
Subjt:  MDFHRNPSLNNRHSPSP-SSSASSTTVPHNPSAT---TATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSAL+SS+SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLS PSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KPNSGAQAQ RPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQ+CCC+REFDKEDG +IGGFSEKRALEY
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMG+LQIEKCGNNS+DSE KG RIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK--AATANASSNANTPSKDKKPEILLVSK
        AK  AA AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK--AATANASSNANTPSKDKKPEILLVSK

TrEMBL top hitse value%identityAlignment
A0A0A0KZ90 Uncharacterized protein3.2e-21392.74Show/hide
Query:  MDFHRNPSLNNRHSPSP-SSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS
        MDFHRNPS+NNRHS SP SSSASSTT  HNP   TATA+AD+DPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS
Subjt:  MDFHRNPSLNNRHSPSP-SSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS

Query:  ALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSP
        ALSSS+SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPV+VSIPDLSQPSLYHSP
Subjt:  ALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSP

Query:  RTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADE
          KPNSGAQAQ RPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQ+CCC REFDKEDG ++GGFSEKRALEY DE
Subjt:  RTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADE

Query:  IEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAK-
        IED SEEMG+LQIEKCGNNS+DSEPKG RIPLPWELLQPVLRILGHCLLAPLNSQDVKD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR AAK 
Subjt:  IEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAK-

Query:  -AATANASSNANTPSKDKKPEILLVSK
         AA AN+SSNANTPSKDKKPEILLVSK
Subjt:  -AATANASSNANTPSKDKKPEILLVSK

A0A1S3C1D0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034957691.0e-21191.82Show/hide
Query:  MDFHRNPSLNNRHSPSP-SSSASSTTVPH--NPSAT-TATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPS+NNRHS SP SSSASSTT P   NP+AT +A+A+AD+DPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
Subjt:  MDFHRNPSLNNRHSPSP-SSSASSTTVPH--NPSAT-TATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPV+VSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY
        HSP  KPNSGAQAQ+RPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKL FCRSAASWAGQ+CCC REFDKEDG ++GGFSEKRALEY
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIED SEEMG+LQIEKCGNNS+DSEPKG RIPLPWELLQP+LRILGHCLL PLNSQDVKD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AKAATANASSNANTPSKDKKPEILLVSK
        AKAA AN+SSNANTPSKDKKPEILLVSK
Subjt:  AKAATANASSNANTPSKDKKPEILLVSK

A0A6J1E3K7 uncharacterized protein LOC1114304891.8e-21192.07Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTT  +NP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDS SLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+SG+QAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQIS MPSWSKLEFCRSAASWAGQ+CCCKREFDKED  EI GFSEKR LE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NSDDSEPK +RIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

A0A6J1FWG4 uncharacterized protein LOC1114479765.7e-21091.51Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSA
        MDFHRNPS+NNRHSPSP+SS SSTTVPHNPSAT  TATAD+DPMHSWWESVSKARSRIHALSSILPPH DSFFLSS+ADSDRPALSLLSSHDAYSVISSA
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSA

Query:  LSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPR
        LSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPS PSLAGFEAVLLALYSSEVKSRAGKPVLV+IPDLSQPSLYHSP 
Subjt:  LSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPR

Query:  TKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADEI
         KPNS AQAQ RPSVGVL PSLEPQNAVKSTKRACI+GVALDCYYKQISQMPSWSKLE CRSAASWAGQ+CCCKREFDKEDG +I GFSEKRALEY DEI
Subjt:  TKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADEI

Query:  EDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAKAA
        EDVS +MG LQ+E CGNNSDDSEPKG+RIPLPWELLQP+LRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR AAKAA
Subjt:  EDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAKAA

Query:  TANASSNANTPSKDKKPEILLVSK
        TA   +NA TPSKDKK EILLVSK
Subjt:  TANASSNANTPSKDKKPEILLVSK

A0A6J1J9X7 uncharacterized protein LOC1114826042.1e-21292.54Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTT  HNP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+S AQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQI  MPSWSKLEFCRSAASWAGQ+CCCKREFDKED  EI GFSEKRALE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NSDDSEPK +RIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

SwissProt top hitse value%identityAlignment
Q5R977 Protein FAM126B3.8e-0931.65Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +  +IP LS+PS+YH P T       
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN

Query:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
         GA  Q    + V+   L PQ     +  R  ++   + CY   I  MP+ S    CR
Subjt:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Q5ZM13 Hyccin7.1e-0830.41Show/hide
Query:  LSSVADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSS
        +SS A + +   +L+SS   Y VI    S  +     +P+CH L++ + S +  L    L FLP L   YL+      S S  L S    EA+LL +Y+ 
Subjt:  LSSVADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSS

Query:  EVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
        E+  + G  K +  +IP LS+PS+YH P +        GA +Q   S  V S     +  + +  R  ++   L CY   +S MP+ S    C+
Subjt:  EVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Q6P121 Hyccin4.6e-0730.57Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVS--IPDLSQPSLYHSPRT----KPN
        +P+CH L++ + S +P L+   L FLP L   YLS    T++  P        EA+LL +Y+ E+  + G+  ++S  IP LS+PS+YH P +       
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVS--IPDLSQPSLYHSPRT----KPN

Query:  SGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
         GA A    S  V S     +    +  R  ++   L CY   +S M   S    C+
Subjt:  SGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Q8C729 Protein FAM126B4.9e-0931.65Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +  +IP LS+PS+YH P T       
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN

Query:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
         GA  Q    + V+   L PQ     +  R  ++   + CY   I  MP+ S    CR
Subjt:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Q8IXS8 Protein FAM126B3.8e-0931.65Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +  +IP LS+PS+YH P T       
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN

Query:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
         GA  Q    + V+   L PQ     +  R  ++   + CY   I  MP+ S    CR
Subjt:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Arabidopsis top hitse value%identityAlignment
AT5G21050.1 LOCATED IN: chloroplast1.2e-5540.74Show/hide
Query:  SSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPA-LSLLSSHDAYS-VISSALSSSVSGSGSDPLCH
        S S+SS   P +P+    T  +++   ++  ES +K ++ I +LS+I            V +++ P+ +++L   +A S  ISS L    SG+G + LC 
Subjt:  SSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPA-LSLLSSHDAYS-VISSALSSSVSGSGSDPLCH

Query:  WLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPR--TKPNSGAQAQSRP
        WLYDTF S++P L+L+VL F+PL++ LYLSRV       P     AGFEAVLLALY+ E  SRAG+ + V+IPDLS PS+YH  +  T+ N+        
Subjt:  WLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPR--TKPNSGAQAQSRP

Query:  SVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADEIEDVS-EEMGKLQI
        ++ V+S +L+P   V+ST+RA IVGVAL+ YY +IS+MP  SKL FC S   WAGQN                G +E+ +      + D S  E   + I
Subjt:  SVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADEIEDVS-EEMGKLQI

Query:  EKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLA-PLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQL
                 SE    RIPLPWELLQP+LRILGHCLL   +  +++ +AA+ A + LY R+ HD+ P+ ILAT SL++L
Subjt:  EKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLA-PLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQL

AT5G64090.1 FUNCTIONS IN: molecular_function unknown5.8e-13060.26Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNPSATTATAT------------ADSDPMHSWWESVSKARSRIHALSSILPPHSDSFF-------LSSVADSD
        MDF   PS     SPSPSSS SS+T     S TT TAT            AD DPMHSWWESVSK RSRI +LSS+L    DS F       +SS+ADSD
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTVPHNPSATTATAT------------ADSDPMHSWWESVSKARSRIHALSSILPPHSDSFF-------LSSVADSD

Query:  RPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGK
        RPALSLLSS  AYS+IS++L +  SGSGSDPLC WLY+T+LSSDP LRLVVLSF PLL  +YLSR+H  SSDS SLPSL+GFEAVLLA+Y++EVK+RAGK
Subjt:  RPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGK

Query:  PVLVSIPDLSQPSLYHSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKE-
        P+LV IPDLSQPSLYH+PR   +    +    SVGVLSP LEPQ AVKSTKRA IVGV L CY+K+ISQMP+WSKLEFC+ +ASWAGQ+C CK + D++ 
Subjt:  PVLVSIPDLSQPSLYHSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKE-

Query:  -----------DGSEIGGFSEKRALEYADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYA
                   D S   G S  R+LE  ++ + ++    + Q+    N       +G RIPLPWEL QP LRILGHCLL+PLN++DVKDAAS AVR LYA
Subjt:  -----------DGSEIGGFSEKRALEYADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYA

Query:  RASHDLVPQVILATRSLIQLDNRTRTAAK---AATANASSNANTPSKDKKPEILLVSK
        RASHDL PQ ILATRSL+ LD   RT+ K   A T N SSN NTPSK KKPEILL SK
Subjt:  RASHDLVPQVILATRSLIQLDNRTRTAAK---AATANASSNANTPSKDKKPEILLVSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTCCACCGGAATCCTTCCCTCAACAACCGCCATTCCCCCTCCCCCTCCTCCTCCGCCTCCTCCACCACCGTCCCCCACAACCCCTCCGCCACCACCGCCACCGC
CACCGCCGACAGCGACCCTATGCACTCCTGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCCCTTTCCTCCATCCTTCCCCCTCACTCCGACTCCTTTTTCC
TCTCCTCCGTCGCCGATTCCGACCGCCCGGCCCTCTCTCTTCTGTCGTCTCACGACGCTTACTCCGTCATCTCCTCCGCCCTCTCCTCCTCCGTCTCTGGATCTGGTTCT
GACCCTCTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCTTTCCTTCCCCTTCTTTCCTCTTTGTACCTTTCTCGCGT
CCATTCCACTTCCTCCGATTCCCCTTCCCTCCCTTCCCTTGCCGGCTTTGAGGCTGTGCTTCTCGCCCTTTATTCCTCTGAGGTTAAGTCCCGGGCTGGGAAGCCTGTTC
TTGTCTCGATTCCTGATCTTTCGCAGCCTTCTCTTTATCATTCTCCTCGGACTAAGCCCAATTCTGGTGCCCAAGCTCAATCCCGGCCGTCCGTTGGGGTTCTTTCCCCT
TCACTCGAGCCACAAAACGCGGTGAAGTCGACCAAGAGAGCTTGCATTGTTGGCGTCGCTCTTGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCT
TGAATTCTGCCGATCTGCAGCGTCGTGGGCTGGGCAAAATTGTTGCTGCAAGAGAGAGTTTGACAAAGAAGACGGTTCGGAAATTGGTGGGTTTTCGGAGAAAAGGGCGT
TGGAGTATGCAGATGAGATAGAGGATGTTTCGGAAGAAATGGGTAAACTACAAATTGAGAAGTGCGGGAACAATTCCGACGATTCAGAACCTAAGGGGTACAGAATTCCG
CTGCCATGGGAGCTTTTGCAGCCAGTGCTTAGAATTTTAGGACATTGTTTACTGGCTCCTTTGAATTCGCAAGATGTTAAGGATGCAGCTTCCGTTGCTGTCAGGTGTTT
ATATGCGAGGGCATCTCATGATTTGGTACCACAGGTAATATTGGCAACTCGGAGTCTTATTCAGCTCGACAACAGAACTCGAACGGCTGCAAAGGCTGCAACAGCAAATG
CTTCGTCTAATGCAAATACACCCAGCAAAGATAAGAAACCAGAAATCCTATTGGTCTCAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGACTTCCACCGGAATCCTTCCCTCAACAACCGCCATTCCCCCTCCCCCTCCTCCTCCGCCTCCTCCACCACCGTCCCCCACAACCCCTCCGCCACCACCGCCACCGC
CACCGCCGACAGCGACCCTATGCACTCCTGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCCCTTTCCTCCATCCTTCCCCCTCACTCCGACTCCTTTTTCC
TCTCCTCCGTCGCCGATTCCGACCGCCCGGCCCTCTCTCTTCTGTCGTCTCACGACGCTTACTCCGTCATCTCCTCCGCCCTCTCCTCCTCCGTCTCTGGATCTGGTTCT
GACCCTCTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCTTTCCTTCCCCTTCTTTCCTCTTTGTACCTTTCTCGCGT
CCATTCCACTTCCTCCGATTCCCCTTCCCTCCCTTCCCTTGCCGGCTTTGAGGCTGTGCTTCTCGCCCTTTATTCCTCTGAGGTTAAGTCCCGGGCTGGGAAGCCTGTTC
TTGTCTCGATTCCTGATCTTTCGCAGCCTTCTCTTTATCATTCTCCTCGGACTAAGCCCAATTCTGGTGCCCAAGCTCAATCCCGGCCGTCCGTTGGGGTTCTTTCCCCT
TCACTCGAGCCACAAAACGCGGTGAAGTCGACCAAGAGAGCTTGCATTGTTGGCGTCGCTCTTGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCT
TGAATTCTGCCGATCTGCAGCGTCGTGGGCTGGGCAAAATTGTTGCTGCAAGAGAGAGTTTGACAAAGAAGACGGTTCGGAAATTGGTGGGTTTTCGGAGAAAAGGGCGT
TGGAGTATGCAGATGAGATAGAGGATGTTTCGGAAGAAATGGGTAAACTACAAATTGAGAAGTGCGGGAACAATTCCGACGATTCAGAACCTAAGGGGTACAGAATTCCG
CTGCCATGGGAGCTTTTGCAGCCAGTGCTTAGAATTTTAGGACATTGTTTACTGGCTCCTTTGAATTCGCAAGATGTTAAGGATGCAGCTTCCGTTGCTGTCAGGTGTTT
ATATGCGAGGGCATCTCATGATTTGGTACCACAGGTAATATTGGCAACTCGGAGTCTTATTCAGCTCGACAACAGAACTCGAACGGCTGCAAAGGCTGCAACAGCAAATG
CTTCGTCTAATGCAAATACACCCAGCAAAGATAAGAAACCAGAAATCCTATTGGTCTCAAAATAA
Protein sequenceShow/hide protein sequence
MDFHRNPSLNNRHSPSPSSSASSTTVPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSALSSSVSGSGS
DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRTKPNSGAQAQSRPSVGVLSP
SLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQNCCCKREFDKEDGSEIGGFSEKRALEYADEIEDVSEEMGKLQIEKCGNNSDDSEPKGYRIP
LPWELLQPVLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAKAATANASSNANTPSKDKKPEILLVSK