; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019795 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019795
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionHyccin
Genome locationchr5:45585338..45586612
RNA-Seq ExpressionLag0019795
SyntenyLag0019795
Gene Ontology termsGO:0046854 - phosphatidylinositol phosphorylation (biological process)
GO:0072659 - protein localization to plasma membrane (biological process)
GO:0005829 - cytosol (cellular component)
GO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR018619 - Hyccin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576864.1 Family With Sequence Similarity 126 Member B-like protein, partial [Cucurbita argyrosperma subsp. sororia]2.8e-21192.31Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTTA +NP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+SG+QAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQIS MPSWSKLEFCRSAASWAGQDCCCKREFDKED  EI GFSEKRALE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NS+D EPK +RIPLPWELLQPVLRILGHCLLAPLN QDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

XP_004137170.1 uncharacterized protein LOC101215901 [Cucumis sativus]1.5e-21292.97Show/hide
Query:  MDFHRNPSLNNRHSPSP-SSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS
        MDFHRNPS+NNRHS SP SSSASSTTA HNP   TATA+AD+DPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS
Subjt:  MDFHRNPSLNNRHSPSP-SSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS

Query:  ALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSP
        ALSSS+SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPV+VSIPDLSQPSLYHSP
Subjt:  ALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSP

Query:  RTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADE
          KPNSGAQAQ RPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCC REFDKEDG ++GGFSEKRALEY DE
Subjt:  RTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADE

Query:  IEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAK-
        IED SEEMG+LQIEKCGNNS DSEPKG RIPLPWELLQPVLRILGHCLLAPLN QDVKD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR AAK 
Subjt:  IEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAK-

Query:  -AATANASSNANTPSKDKKPEILLVSK
         AA AN+SSNANTPSKDKKPEILLVSK
Subjt:  -AATANASSNANTPSKDKKPEILLVSK

XP_022984223.1 uncharacterized protein LOC111482604 [Cucurbita maxima]2.1e-21192.54Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTTA HNP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+S AQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQI  MPSWSKLEFCRSAASWAGQDCCCKREFDKED  EI GFSEKRALE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NS+DSEPK +RIPLPWELLQPVLRILGHCLLAPLN QDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

XP_023552230.1 uncharacterized protein LOC111809964 [Cucurbita pepo subsp. pepo]2.1e-21192.54Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTTA HNP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+S AQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQIS MPSWSKLEFCRSAASWAGQDCCCKREFDKED  EI GFSEKRALE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NS+D EPK +RIPLPWELLQPVLRILGHCLLAPLN QDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

XP_038905210.1 uncharacterized protein LOC120091307 [Benincasa hispida]9.3e-21593.49Show/hide
Query:  MDFHRNPSLNNRHSPSP-SSSASSTTAPHNPSAT---TATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SP SSSASSTTAPHNPSA+   TATA+AD+DPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYS 
Subjt:  MDFHRNPSLNNRHSPSP-SSSASSTTAPHNPSAT---TATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSAL+SS+SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLS PSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KPNSGAQAQ RPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCC+REFDKEDG +IGGFSEKRALEY
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMG+LQIEKCGNNS DSE KG RIPLPWELLQPVLRILGHCLLAPLN QDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK--AATANASSNANTPSKDKKPEILLVSK
        AK  AA AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK--AATANASSNANTPSKDKKPEILLVSK

TrEMBL top hitse value%identityAlignment
A0A0A0KZ90 Uncharacterized protein7.2e-21392.97Show/hide
Query:  MDFHRNPSLNNRHSPSP-SSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS
        MDFHRNPS+NNRHS SP SSSASSTTA HNP   TATA+AD+DPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS
Subjt:  MDFHRNPSLNNRHSPSP-SSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISS

Query:  ALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSP
        ALSSS+SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPV+VSIPDLSQPSLYHSP
Subjt:  ALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSP

Query:  RTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADE
          KPNSGAQAQ RPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCC REFDKEDG ++GGFSEKRALEY DE
Subjt:  RTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADE

Query:  IEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAK-
        IED SEEMG+LQIEKCGNNS DSEPKG RIPLPWELLQPVLRILGHCLLAPLN QDVKD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR AAK 
Subjt:  IEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAK-

Query:  -AATANASSNANTPSKDKKPEILLVSK
         AA AN+SSNANTPSKDKKPEILLVSK
Subjt:  -AATANASSNANTPSKDKKPEILLVSK

A0A1S3C1D0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034957693.9e-21191.82Show/hide
Query:  MDFHRNPSLNNRHSPSP-SSSASSTTAPH--NPSAT-TATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPS+NNRHS SP SSSASSTT P   NP+AT +A+A+AD+DPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
Subjt:  MDFHRNPSLNNRHSPSP-SSSASSTTAPH--NPSAT-TATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPV+VSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY
        HSP  KPNSGAQAQ+RPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKL FCRSAASWAGQDCCC REFDKEDG ++GGFSEKRALEY
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIED SEEMG+LQIEKCGNNS DSEPKG RIPLPWELLQP+LRILGHCLL PLN QDVKD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AKAATANASSNANTPSKDKKPEILLVSK
        AKAA AN+SSNANTPSKDKKPEILLVSK
Subjt:  AKAATANASSNANTPSKDKKPEILLVSK

A0A6J1E3K7 uncharacterized protein LOC1114304898.8e-21192.07Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTTA +NP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDS SLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+SG+QAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQIS MPSWSKLEFCRSAASWAGQDCCCKREFDKED  EI GFSEKR LE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NS+DSEPK +RIPLPWELLQPVLRILGHCLLAPLN QDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

A0A6J1FWG4 uncharacterized protein LOC1114479762.4e-20891.04Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSA
        MDFHRNPS+NNRHSPSP+SS SSTT PHNPSAT  TATAD+DPMHSWWESVSKARSRIHALSSILPPH DSFFLSS+ADSDRPALSLLSSHDAYSVISSA
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSA

Query:  LSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPR
        LSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPS PSLAGFEAVLLALYSSEVKSRAGKPVLV+IPDLSQPSLYHSP 
Subjt:  LSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPR

Query:  TKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADEI
         KPNS AQAQ RPSVGVL PSLEPQNAVKSTKRACI+GVALDCYYKQISQMPSWSKLE CRSAASWAGQDCCCKREFDKEDG +I GFSEKRALEY DEI
Subjt:  TKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADEI

Query:  EDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAKAA
        EDVS +MG LQ+E CGNNS+DSEPKG+RIPLPWELLQP+LRILGHCLLAPLN QDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTR AAKAA
Subjt:  EDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAKAA

Query:  TANASSNANTPSKDKKPEILLVSK
        TA   +NA TPSKDKK EILLVSK
Subjt:  TANASSNANTPSKDKKPEILLVSK

A0A6J1J9X7 uncharacterized protein LOC1114826041.0e-21192.54Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV
        MDFHRNPSLNNRHS SPSSS+SSTTA HNP    SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAY  
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNP----SATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSV

Query:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
        ISSALSSS +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY
        HSPR KP+S AQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQI  MPSWSKLEFCRSAASWAGQDCCCKREFDKED  EI GFSEKRALE+
Subjt:  HSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEY

Query:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA
         DEIEDVSEEMGKLQIEK G+NS+DSEPK +RIPLPWELLQPVLRILGHCLLAPLN QDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNR R A
Subjt:  ADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTA

Query:  AK-AATANASSNANTPSKDKKPEILLVSK
        AK AA+AN+SSNANTPSKDKKPEILLVSK
Subjt:  AK-AATANASSNANTPSKDKKPEILLVSK

SwissProt top hitse value%identityAlignment
Q5R977 Protein FAM126B3.8e-0931.65Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +  +IP LS+PS+YH P T       
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN

Query:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
         GA  Q    + V+   L PQ     +  R  ++   + CY   I  MP+ S    CR
Subjt:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Q5ZM13 Hyccin7.1e-0830.41Show/hide
Query:  LSSVADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSS
        +SS A + +   +L+SS   Y VI    S  +     +P+CH L++ + S +  L    L FLP L   YL+      S S  L S    EA+LL +Y+ 
Subjt:  LSSVADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSS

Query:  EVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
        E+  + G  K +  +IP LS+PS+YH P +        GA +Q   S  V S     +  + +  R  ++   L CY   +S MP+ S    C+
Subjt:  EVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Q6P121 Hyccin3.5e-0730.57Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVS--IPDLSQPSLYHSPRT----KPN
        +P+CH L++ + S +P L+   L FLP L   YLS    T++  P        EA+LL +Y+ E+  + G+  ++S  IP LS+PS+YH P +       
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVS--IPDLSQPSLYHSPRT----KPN

Query:  SGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
         GA A    S  V S     +    +  R  ++   L CY   +S M   S    C+
Subjt:  SGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Q8C729 Protein FAM126B4.9e-0931.65Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +  +IP LS+PS+YH P T       
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN

Query:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
         GA  Q    + V+   L PQ     +  R  ++   + CY   I  MP+ S    CR
Subjt:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Q8IXS8 Protein FAM126B3.8e-0931.65Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +Y+ E+  + G  K +  +IP LS+PS+YH P T       
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRT----KPN

Query:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR
         GA  Q    + V+   L PQ     +  R  ++   + CY   I  MP+ S    CR
Subjt:  SGAQAQSRPSVGVLSPSLEPQ-NAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCR

Arabidopsis top hitse value%identityAlignment
AT5G21050.1 LOCATED IN: chloroplast1.0e-5440.74Show/hide
Query:  SSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPA-LSLLSSHDAYS-VISSALSSSVSGSGSDPLCH
        S S+SS  +P +P+    T  +++   ++  ES +K ++ I +LS+I            V +++ P+ +++L   +A S  ISS L    SG+G + LC 
Subjt:  SSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPA-LSLLSSHDAYS-VISSALSSSVSGSGSDPLCH

Query:  WLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPR--TKPNSGAQAQSRP
        WLYDTF S++P L+L+VL F+PL++ LYLSRV       P     AGFEAVLLALY+ E  SRAG+ + V+IPDLS PS+YH  +  T+ N+        
Subjt:  WLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPR--TKPNSGAQAQSRP

Query:  SVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADEIEDVSEEMGKLQIE
        ++ V+S +L+P   V+ST+RA IVGVAL+ YY +IS+MP  SKL FC S   WAGQ+                G +E+ +      + D S    +  + 
Subjt:  SVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADEIEDVSEEMGKLQIE

Query:  KCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQD--VKDAASVAVRCLYARASHDLVPQVILATRSLIQL
          G +  DS     RIPLPWELLQP+LRILGHCLL  L ++D  + +AA+ A + LY R+ HD+ P+ ILAT SL++L
Subjt:  KCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQD--VKDAASVAVRCLYARASHDLVPQVILATRSLIQL

AT5G64090.1 FUNCTIONS IN: molecular_function unknown9.8e-13060.48Show/hide
Query:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNPSATTATAT------------ADSDPMHSWWESVSKARSRIHALSSILPPHSDSFF-------LSSVADSD
        MDF   PS     SPSPSSS SS+T     S TT TAT            AD DPMHSWWESVSK RSRI +LSS+L    DS F       +SS+ADSD
Subjt:  MDFHRNPSLNNRHSPSPSSSASSTTAPHNPSATTATAT------------ADSDPMHSWWESVSKARSRIHALSSILPPHSDSFF-------LSSVADSD

Query:  RPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGK
        RPALSLLSS  AYS+IS++L +  SGSGSDPLC WLY+T+LSSDP LRLVVLSF PLL  +YLSR+H  SSDS SLPSL+GFEAVLLA+Y++EVK+RAGK
Subjt:  RPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGK

Query:  PVLVSIPDLSQPSLYHSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKE-
        P+LV IPDLSQPSLYH+PR   +    +    SVGVLSP LEPQ AVKSTKRA IVGV L CY+K+ISQMP+WSKLEFC+ +ASWAGQDC CK + D++ 
Subjt:  PVLVSIPDLSQPSLYHSPRTKPNSGAQAQSRPSVGVLSPSLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKE-

Query:  -----------DGSEIGGFSEKRALEYADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYA
                   D S   G S  R+LE  ++ + ++    + Q+    N       +G RIPLPWEL QP LRILGHCLL+PLN +DVKDAAS AVR LYA
Subjt:  -----------DGSEIGGFSEKRALEYADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIPLPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYA

Query:  RASHDLVPQVILATRSLIQLDNRTRTAAK---AATANASSNANTPSKDKKPEILLVSK
        RASHDL PQ ILATRSL+ LD   RT+ K   A T N SSN NTPSK KKPEILL SK
Subjt:  RASHDLVPQVILATRSLIQLDNRTRTAAK---AATANASSNANTPSKDKKPEILLVSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTCCACCGGAATCCTTCCCTCAACAACCGACATTCCCCCTCCCCCTCCTCCTCCGCCTCCTCCACCACCGCCCCCCATAACCCCTCCGCCACCACCGCCACCGC
CACCGCCGACAGCGACCCTATGCACTCCTGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCCCTTTCCTCCATCCTTCCCCCTCACTCCGACTCCTTTTTCC
TCTCCTCCGTCGCCGATTCCGACCGCCCGGCCCTATCTCTTTTGTCGTCTCACGACGCCTACTCCGTCATCTCCTCCGCCCTCTCCTCCTCCGTCTCTGGATCTGGTTCT
GACCCTCTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCTTTCCTTCCCCTTCTTTCCTCCTTGTACCTTTCTCGCGT
CCATTCCACTTCCTCCGATTCCCCTTCTCTCCCTTCCCTTGCCGGCTTTGAGGCTGTGCTTCTCGCGCTTTATTCCTCTGAGGTTAAGTCCCGGGCTGGGAAGCCTGTTC
TTGTCTCGATTCCTGATCTTTCGCAGCCTTCTCTTTATCATTCTCCTCGGACTAAGCCCAATTCTGGTGCCCAAGCTCAATCCCGGCCGTCCGTTGGGGTTCTTTCCCCT
TCGCTCGAGCCACAAAACGCGGTGAAGTCGACCAAGAGAGCTTGCATTGTTGGCGTCGCTCTTGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCT
TGAATTCTGCCGATCTGCAGCGTCGTGGGCTGGGCAAGATTGTTGCTGCAAGAGAGAGTTTGACAAAGAAGACGGTTCAGAAATTGGTGGGTTTTCGGAGAAAAGGGCGT
TGGAGTATGCAGATGAGATAGAGGATGTTTCAGAAGAAATGGGTAAACTACAAATTGAGAAGTGTGGGAACAATTCCGAGGATTCAGAACCTAAGGGGTACAGAATTCCG
CTGCCATGGGAGCTTTTGCAGCCAGTGCTTAGAATTTTAGGACATTGTTTACTGGCTCCTTTGAATTTGCAAGATGTTAAGGATGCAGCTTCCGTTGCTGTCAGGTGTTT
ATATGCGAGGGCATCTCATGATTTGGTACCACAGGTAATATTGGCAACTCGGAGTCTTATTCAGCTCGACAACAGAACTCGAACGGCTGCAAAGGCTGCAACAGCAAATG
CTTCGTCTAATGCAAATACACCCAGCAAAGATAAGAAACCAGAAATCCTATTGGTCTCAAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGACTTCCACCGGAATCCTTCCCTCAACAACCGACATTCCCCCTCCCCCTCCTCCTCCGCCTCCTCCACCACCGCCCCCCATAACCCCTCCGCCACCACCGCCACCGC
CACCGCCGACAGCGACCCTATGCACTCCTGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCCCTTTCCTCCATCCTTCCCCCTCACTCCGACTCCTTTTTCC
TCTCCTCCGTCGCCGATTCCGACCGCCCGGCCCTATCTCTTTTGTCGTCTCACGACGCCTACTCCGTCATCTCCTCCGCCCTCTCCTCCTCCGTCTCTGGATCTGGTTCT
GACCCTCTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCTTTCCTTCCCCTTCTTTCCTCCTTGTACCTTTCTCGCGT
CCATTCCACTTCCTCCGATTCCCCTTCTCTCCCTTCCCTTGCCGGCTTTGAGGCTGTGCTTCTCGCGCTTTATTCCTCTGAGGTTAAGTCCCGGGCTGGGAAGCCTGTTC
TTGTCTCGATTCCTGATCTTTCGCAGCCTTCTCTTTATCATTCTCCTCGGACTAAGCCCAATTCTGGTGCCCAAGCTCAATCCCGGCCGTCCGTTGGGGTTCTTTCCCCT
TCGCTCGAGCCACAAAACGCGGTGAAGTCGACCAAGAGAGCTTGCATTGTTGGCGTCGCTCTTGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCT
TGAATTCTGCCGATCTGCAGCGTCGTGGGCTGGGCAAGATTGTTGCTGCAAGAGAGAGTTTGACAAAGAAGACGGTTCAGAAATTGGTGGGTTTTCGGAGAAAAGGGCGT
TGGAGTATGCAGATGAGATAGAGGATGTTTCAGAAGAAATGGGTAAACTACAAATTGAGAAGTGTGGGAACAATTCCGAGGATTCAGAACCTAAGGGGTACAGAATTCCG
CTGCCATGGGAGCTTTTGCAGCCAGTGCTTAGAATTTTAGGACATTGTTTACTGGCTCCTTTGAATTTGCAAGATGTTAAGGATGCAGCTTCCGTTGCTGTCAGGTGTTT
ATATGCGAGGGCATCTCATGATTTGGTACCACAGGTAATATTGGCAACTCGGAGTCTTATTCAGCTCGACAACAGAACTCGAACGGCTGCAAAGGCTGCAACAGCAAATG
CTTCGTCTAATGCAAATACACCCAGCAAAGATAAGAAACCAGAAATCCTATTGGTCTCAAAATAA
Protein sequenceShow/hide protein sequence
MDFHRNPSLNNRHSPSPSSSASSTTAPHNPSATTATATADSDPMHSWWESVSKARSRIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSALSSSVSGSGS
DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRTKPNSGAQAQSRPSVGVLSP
SLEPQNAVKSTKRACIVGVALDCYYKQISQMPSWSKLEFCRSAASWAGQDCCCKREFDKEDGSEIGGFSEKRALEYADEIEDVSEEMGKLQIEKCGNNSEDSEPKGYRIP
LPWELLQPVLRILGHCLLAPLNLQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRTAAKAATANASSNANTPSKDKKPEILLVSK