; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0010074 (gene) of Chayote v1 genome

Gene IDSed0010074
OrganismSechium edule (Chayote v1)
DescriptionHyccin
Genome locationLG07:9000526..9002314
RNA-Seq ExpressionSed0010074
SyntenySed0010074
Gene Ontology termsGO:0046854 - phosphatidylinositol phosphorylation (biological process)
GO:0072659 - protein localization to plasma membrane (biological process)
GO:0005829 - cytosol (cellular component)
GO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR018619 - Hyccin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576864.1 Family With Sequence Similarity 126 Member B-like protein, partial [Cucurbita argyrosperma subsp. sororia]7.2e-18882.87Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAASTAAA-------------AAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA
        MDFHRN SL++RHS SPSSS++ST AA             A AD+DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAY A
Subjt:  MDFHRNSSLSHRHSPSPSSSAASTAAA-------------AAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA

Query:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY
        ++SALSS+ +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVHS      S+PSLAGFEAVLLA+YSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE
        HSPR+KP+S  Q+QSRP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQIS MPSWSKLEFCRSAA+WAGQDCCCKREFDKEDD   I GFSEKRA+E
Subjt:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE

Query:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT
        +    E+EDVS+EMGKLQIEK G+NSDD EPK FRIPLPW++LQPVLR+LGHCLLAPLNSQDVKDAAS+AVRCLYARASHDLVPQVILATRSLIQLDNR 
Subjt:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT

Query:  RAAAK--APTNASSNANTPSKDKKPEILLVSK
        RAAAK  A  N+SSNANTPSKDKKPEILLVSK
Subjt:  RAAAK--APTNASSNANTPSKDKKPEILLVSK

XP_004137170.1 uncharacterized protein LOC101215901 [Cucumis sativus]2.6e-19083.61Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAAST-------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSAVASALS
        MDFHRN S+++RHS SPSSS+AS+        A A+AD DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAYS ++SALS
Subjt:  MDFHRNSSLSHRHSPSPSSSAAST-------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSAVASALS

Query:  SAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSK
        S++SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVHS      S+PSLAGFEAVLLA+YSSEVKSRAGKPV+VSIPDLSQPSLYHSP +K
Subjt:  SAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSK

Query:  PNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVE
        PNS  Q+Q RP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQISQMPSWSKLEFCRSAA+WAGQDCCC REFDKE DGF +GGFSEKRA+EY    E
Subjt:  PNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVE

Query:  VEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAK-
        +ED S+EMG+LQIEKCGNNS+DSEPKG RIPLPW++LQPVLR+LGHCLLAPLNSQDVKD AS+AVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAK 
Subjt:  VEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAK-

Query:  --APTNASSNANTPSKDKKPEILLVSK
          A  N+SSNANTPSKDKKPEILLVSK
Subjt:  --APTNASSNANTPSKDKKPEILLVSK

XP_008455651.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103495769 [Cucumis melo]7.2e-18881.9Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAAST-------------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA
        MDFHRN S+++RHS SPSSS+AS+             +A+A+AD DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAYS 
Subjt:  MDFHRNSSLSHRHSPSPSSSAAST-------------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA

Query:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY
        ++SALSS+ SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVHS      S+PSLAGFEAVLLA+YSSEVKSRAGKPV+VSIPDLSQPSLY
Subjt:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE
        HSP +KPNS  Q+Q+RP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQISQMPSWSKL FCRSAA+WAGQDCCC REFDKE DG  +GGFSEKRA+E
Subjt:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE

Query:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT
        Y    E+ED S+EMG+LQIEKCGNNS+DSEPKG RIPLPW++LQP+LR+LGHCLL PLNSQDVKD AS+AVRCLYARASHDLVPQVILATRSLIQLDNRT
Subjt:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT

Query:  RAAAK-APTNASSNANTPSKDKKPEILLVSK
        RAAAK A  N+SSNANTPSKDKKPEILLVSK
Subjt:  RAAAK-APTNASSNANTPSKDKKPEILLVSK

XP_022922509.1 uncharacterized protein LOC111430489 [Cucurbita moschata]4.2e-18882.87Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAASTAAA-------------AAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA
        MDFHRN SL++RHS SPSSS++ST AA             A AD+DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAY A
Subjt:  MDFHRNSSLSHRHSPSPSSSAASTAAA-------------AAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA

Query:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVH-----SASVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY
        ++SALSS+ +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVH     S+S+PSLAGFEAVLLA+YSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVH-----SASVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE
        HSPR+KP+S  Q+QSRP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQIS MPSWSKLEFCRSAA+WAGQDCCCKREFDKEDD   I GFSEKR +E
Subjt:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE

Query:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT
        +    E+EDVS+EMGKLQIEK G+NSDDSEPK FRIPLPW++LQPVLR+LGHCLLAPLNSQDVKDAAS+AVRCLYARASHDLVPQVILATRSLIQLDNR 
Subjt:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT

Query:  RAAAK--APTNASSNANTPSKDKKPEILLVSK
        RAAAK  A  N+SSNANTPSKDKKPEILLVSK
Subjt:  RAAAK--APTNASSNANTPSKDKKPEILLVSK

XP_038905210.1 uncharacterized protein LOC120091307 [Benincasa hispida]4.5e-19083.14Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAAST-------------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA
        MDFHRN SL++RHS SPSSS+AS+              A A+AD DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAYSA
Subjt:  MDFHRNSSLSHRHSPSPSSSAAST-------------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA

Query:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY
        ++SAL+S++SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVHS      S+PSLAGFEAVLLA+YSSEVKSRAGKPVLVSIPDLS PSLY
Subjt:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE
        HSPR+KPNS  Q+Q RP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQISQMPSWSKLEFCRSAA+WAGQDCCC+REFDKE DG  IGGFSEKRA+E
Subjt:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE

Query:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT
        Y    E+EDVS+EMG+LQIEKCGNNS+DSE KG RIPLPW++LQPVLR+LGHCLLAPLNSQDVKDAAS+AVRCLYARASHDLVPQVILATRSLIQLDNRT
Subjt:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT

Query:  RAAAK---APTNASSNANTPSKDKKPEILLVSK
        RAAAK   A  N+SSNANTPSKDKKPEILLVSK
Subjt:  RAAAK---APTNASSNANTPSKDKKPEILLVSK

TrEMBL top hitse value%identityAlignment
A0A0A0KZ90 Uncharacterized protein1.3e-19083.61Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAAST-------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSAVASALS
        MDFHRN S+++RHS SPSSS+AS+        A A+AD DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAYS ++SALS
Subjt:  MDFHRNSSLSHRHSPSPSSSAAST-------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSAVASALS

Query:  SAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSK
        S++SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVHS      S+PSLAGFEAVLLA+YSSEVKSRAGKPV+VSIPDLSQPSLYHSP +K
Subjt:  SAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSK

Query:  PNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVE
        PNS  Q+Q RP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQISQMPSWSKLEFCRSAA+WAGQDCCC REFDKE DGF +GGFSEKRA+EY    E
Subjt:  PNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVE

Query:  VEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAK-
        +ED S+EMG+LQIEKCGNNS+DSEPKG RIPLPW++LQPVLR+LGHCLLAPLNSQDVKD AS+AVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAK 
Subjt:  VEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAK-

Query:  --APTNASSNANTPSKDKKPEILLVSK
          A  N+SSNANTPSKDKKPEILLVSK
Subjt:  --APTNASSNANTPSKDKKPEILLVSK

A0A1S3C1D0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034957693.5e-18881.9Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAAST-------------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA
        MDFHRN S+++RHS SPSSS+AS+             +A+A+AD DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAYS 
Subjt:  MDFHRNSSLSHRHSPSPSSSAAST-------------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA

Query:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY
        ++SALSS+ SGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVHS      S+PSLAGFEAVLLA+YSSEVKSRAGKPV+VSIPDLSQPSLY
Subjt:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE
        HSP +KPNS  Q+Q+RP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQISQMPSWSKL FCRSAA+WAGQDCCC REFDKE DG  +GGFSEKRA+E
Subjt:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE

Query:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT
        Y    E+ED S+EMG+LQIEKCGNNS+DSEPKG RIPLPW++LQP+LR+LGHCLL PLNSQDVKD AS+AVRCLYARASHDLVPQVILATRSLIQLDNRT
Subjt:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT

Query:  RAAAK-APTNASSNANTPSKDKKPEILLVSK
        RAAAK A  N+SSNANTPSKDKKPEILLVSK
Subjt:  RAAAK-APTNASSNANTPSKDKKPEILLVSK

A0A6J1E3K7 uncharacterized protein LOC1114304892.0e-18882.87Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAASTAAA-------------AAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA
        MDFHRN SL++RHS SPSSS++ST AA             A AD+DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAY A
Subjt:  MDFHRNSSLSHRHSPSPSSSAASTAAA-------------AAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA

Query:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVH-----SASVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY
        ++SALSS+ +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVH     S+S+PSLAGFEAVLLA+YSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVH-----SASVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE
        HSPR+KP+S  Q+QSRP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQIS MPSWSKLEFCRSAA+WAGQDCCCKREFDKEDD   I GFSEKR +E
Subjt:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE

Query:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT
        +    E+EDVS+EMGKLQIEK G+NSDDSEPK FRIPLPW++LQPVLR+LGHCLLAPLNSQDVKDAAS+AVRCLYARASHDLVPQVILATRSLIQLDNR 
Subjt:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT

Query:  RAAAK--APTNASSNANTPSKDKKPEILLVSK
        RAAAK  A  N+SSNANTPSKDKKPEILLVSK
Subjt:  RAAAK--APTNASSNANTPSKDKKPEILLVSK

A0A6J1FWG4 uncharacterized protein LOC1114479761.6e-18582.55Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAAST-------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSAVASALS
        MDFHRN S+++RHSPSP+SS +ST       A  A AD DPMHSWWESVSKARSRIHALSSILPP+ DSFFLSS+ADSDRPALSLLSSHDAYS ++SALS
Subjt:  MDFHRNSSLSHRHSPSPSSSAAST-------AAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSAVASALS

Query:  SAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSK
        S+VSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVHS      S PSLAGFEAVLLA+YSSEVKSRAGKPVLV+IPDLSQPSLYHSP +K
Subjt:  SAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSK

Query:  PNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVE
        PNS  Q+Q RP VGVL PSLEPQ+AVKSTKRACI+GV LDCYYKQISQMPSWSKLE CRSAA+WAGQDCCCKREFDKE DG  I GFSEKRA+EYG   E
Subjt:  PNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVE

Query:  VEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKA
        +EDVS +MG LQ+E CGNNSDDSEPKGFRIPLPW++LQP+LR+LGHCLLAPLNSQDVKDAAS+AVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKA
Subjt:  VEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKA

Query:  PTNASSNANTPSKDKKPEILLVSK
         T  ++NA TPSKDKK EILLVSK
Subjt:  PTNASSNANTPSKDKKPEILLVSK

A0A6J1J9X7 uncharacterized protein LOC1114826046.0e-18882.87Show/hide
Query:  MDFHRNSSLSHRHSPSPSSSAASTAAA-------------AAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA
        MDFHRN SL++RHS SPSSS++ST AA             A AD+DPMHSWWESVSKARSRIHALSSILPP+SDSFFLSSVADSDRPALSLLSSHDAY A
Subjt:  MDFHRNSSLSHRHSPSPSSSAASTAAA-------------AAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSA

Query:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY
        ++SALSS+ +GSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYL RVHS      S+PSLAGFEAVLLA+YSSEVKSRAGKPVLVSIPDLSQPSLY
Subjt:  VASALSSAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA-----SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLY

Query:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE
        HSPR+KP+S  Q+QSRP VGVLSPSLEPQ+AVKSTKRACIVGV LDCYYKQI  MPSWSKLEFCRSAA+WAGQDCCCKREFDKEDD   I GFSEKRA+E
Subjt:  HSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVE

Query:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT
        +    E+EDVS+EMGKLQIEK G+NSDDSEPK FRIPLPW++LQPVLR+LGHCLLAPLNSQDVKDAAS+AVRCLYARASHDLVPQVILATRSLIQLDNR 
Subjt:  YGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRT

Query:  RAAAK--APTNASSNANTPSKDKKPEILLVSK
        RAAAK  A  N+SSNANTPSKDKKPEILLVSK
Subjt:  RAAAK--APTNASSNANTPSKDKKPEILLVSK

SwissProt top hitse value%identityAlignment
Q5R977 Protein FAM126B2.1e-0932.68Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRSKPNSDPQSQSRP
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S    EA+LL IY+ E+  + G  K +  +IP LS+PS+YH P S   S   ++   
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRSKPNSDPQSQSRP

Query:  C----VGVLSPSLEPQ-DAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCR
        C    + V+   L PQ +   +  R  ++   + CY   I  MP+ S    CR
Subjt:  C----VGVLSPSLEPQ-DAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCR

Q6P121 Hyccin1.2e-0741.57Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAG-FEAVLLAIYSSEVKSRAGKPVLVS--IPDLSQPSLYHSPRS
        +P+CH L++ + S +P L+   L FLP L   YL  V +A  P  +G  EA+LL IY+ E+  + G+  ++S  IP LS+PS+YH P S
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAG-FEAVLLAIYSSEVKSRAGKPVLVS--IPDLSQPSLYHSPRS

Q6P9N1 Hyccin1.2e-0730.92Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRS----KPNSDPQS
        +P+CH L++ + S +  L    L FLP L   YL    S  V S    EA+LL +Y+ E+  + G  K +  +IP LS+PS+YH P S           S
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRS----KPNSDPQS

Query:  QSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCR
        Q      V S     ++ + +  R  ++   L CY   ++ MPS S    C+
Subjt:  QSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCR

Q8C729 Protein FAM126B6.2e-0931.58Show/hide
Query:  AVASALSSAVSGSGS---DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAG--KPVLVSIPDLSQPSL
        A+  AL   +  S +   +P+CH L++ + SS+  L+   L FLP L  +YL    S    S    EA+LL IY+ E+  + G  K +  +IP LS+PS+
Subjt:  AVASALSSAVSGSGS---DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAG--KPVLVSIPDLSQPSL

Query:  YHSPRSKPNSDPQSQSRPC----VGVLSPSLEPQ-DAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCR
        YH P S   S   ++   C    + V+   L PQ +   +  R  ++   + CY   I  MP+ S    CR
Subjt:  YHSPRSKPNSDPQSQSRPC----VGVLSPSLEPQ-DAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCR

Q8IXS8 Protein FAM126B2.1e-0932.68Show/hide
Query:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRSKPNSDPQSQSRP
        +P+CH L++ + SS+  L+   L FLP L  +YL    S    S    EA+LL IY+ E+  + G  K +  +IP LS+PS+YH P S   S   ++   
Subjt:  DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAG--KPVLVSIPDLSQPSLYHSPRSKPNSDPQSQSRP

Query:  C----VGVLSPSLEPQ-DAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCR
        C    + V+   L PQ +   +  R  ++   + CY   I  MP+ S    CR
Subjt:  C----VGVLSPSLEPQ-DAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCR

Arabidopsis top hitse value%identityAlignment
AT5G21050.1 LOCATED IN: chloroplast1.0e-5138.3Show/hide
Query:  NSSLSHRHSPSPSSSAASTAAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPA-LSLLSSHDAYS-AVASALSSAVSGSGSDP
        +SS SH   PSP+ +  S       ++       ES +K ++ I +LS+I            V +++ P+ +++L   +A S A++S L    SG+G + 
Subjt:  NSSLSHRHSPSPSSSAASTAAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPA-LSLLSSHDAYS-AVASALSSAVSGSGSDP

Query:  LCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSKPNSDPQSQSRPCVGV
        LC WLYDTF S++P L+L+VL F+PL++ LYL RV        AGFEAVLLA+Y+ E  SRAG+ + V+IPDLS PS+YH   SK  +   + +   + V
Subjt:  LCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSKPNSDPQSQSRPCVGV

Query:  LSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVEVEDVSDEMGKLQIEK
        +S +L+P   V+ST+RA IVGV L+ YY +IS+MP  SKL FC S   WAGQ+   ++             + E+  V  GG                  
Subjt:  LSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVEVEDVSDEMGKLQIEK

Query:  CGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLA-PLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQL
               SE    RIPLPW++LQP+LR+LGHCLL   +  +++ +AA+ A + LY R+ HD+ P+ ILAT SL++L
Subjt:  CGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLA-PLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQL

AT5G64090.1 FUNCTIONS IN: molecular_function unknown3.0e-12358.62Show/hide
Query:  NSSLSHRHS--PSPSSSAASTAA---AAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFF-------LSSVADSDRPALSLLSSHDAYSAVASALS
        +SS  HR     +P+++AA+ +    +AAAD DPMHSWWESVSK RSRI +LSS+L  + DS F       +SS+ADSDRPALSLLSS  AYS ++++L 
Subjt:  NSSLSHRHS--PSPSSSAASTAA---AAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFF-------LSSVADSDRPALSLLSSHDAYSAVASALS

Query:  SAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA---SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSKPN
        +  SGSGSDPLC WLY+T+LSSDP LRLVVLSF PLL  +YL R+HS+   S+PSL+GFEAVLLAIY++EVK+RAGKP+LV IPDLSQPSLYH+PR+  +
Subjt:  SAVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLFRVHSA---SVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSKPN

Query:  SDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDD-------GFGIGGFSEKRAVEY
            S     VGVLSP LEPQ AVKSTKRA IVGVGL CY+K+ISQMP+WSKLEFC+ +A+WAGQDC CK + D+++D       GF   G S       
Subjt:  SDPQSQSRPCVGVLSPSLEPQDAVKSTKRACIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDD-------GFGIGGFSEKRAVEY

Query:  GGVVEVEDVSDEMGKLQIEK--CGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNR
        G  +E+E+  D +   + E+    N       +G RIPLPW++ QP LR+LGHCLL+PLN++DVKDAAS AVR LYARASHDL PQ ILATRSL+ LD  
Subjt:  GGVVEVEDVSDEMGKLQIEK--CGNNSDDSEPKGFRIPLPWDILQPVLRMLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNR

Query:  TRAAAKA----PTNASSNANTPSKDKKPEILLVSK
         R + K       N SSN NTPSK KKPEILL SK
Subjt:  TRAAAKA----PTNASSNANTPSKDKKPEILLVSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTCCACCGGAATTCTTCCCTCAGCCACCGCCATTCCCCGTCCCCCTCCTCCTCCGCCGCCTCCACCGCCGCCGCCGCCGCCGCCGATGCCGACCCCATGCACTC
CTGGTGGGAATCCGTCTCCAAAGCCCGTTCTCGCATCCACGCCCTTTCCTCCATCCTTCCCCCCAATTCCGACTCCTTCTTCCTCTCCTCCGTCGCCGATTCCGACCGCC
CGGCGCTGTCTCTCCTCTCCTCTCACGACGCTTACTCCGCCGTCGCCTCCGCCCTCTCCTCCGCCGTCTCCGGATCTGGCTCCGACCCTCTCTGCCACTGGCTCTACGAC
ACTTTCCTCTCTTCCGATCCTCATCTCCGCCTCGTCGTTCTTTCGTTCCTTCCTCTTCTTTCCTCTCTGTACCTCTTTCGCGTTCATTCCGCTTCTGTTCCTTCTCTCGC
CGGATTTGAGGCTGTGCTTCTCGCGATTTACTCCTCTGAGGTTAAGTCTAGGGCTGGGAAGCCTGTTCTTGTCTCGATTCCGGATTTGTCTCAGCCTTCTCTTTATCATT
CTCCTCGGAGTAAGCCTAATTCTGATCCCCAATCTCAATCCAGGCCGTGTGTTGGGGTTCTTTCCCCTTCTCTCGAGCCACAGGACGCGGTGAAGTCGACGAAACGAGCT
TGCATTGTTGGCGTTGGTCTTGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCTTGAATTCTGCCGATCTGCGGCGGCGTGGGCTGGGCAGGATTG
TTGCTGCAAGAGGGAGTTTGATAAAGAAGATGATGGTTTCGGAATTGGCGGGTTTTCGGAGAAAAGGGCGGTGGAGTATGGGGGGGTTGTTGAGGTTGAGGATGTCTCTG
ATGAAATGGGAAAACTTCAAATTGAGAAATGTGGGAATAATTCTGATGATTCAGAGCCTAAGGGGTTTAGAATTCCATTGCCGTGGGACATTTTGCAGCCAGTTCTTAGA
ATGTTAGGACATTGTTTACTGGCTCCTTTGAATTCACAAGATGTTAAGGATGCAGCTTCCATTGCTGTGAGGTGTTTATATGCGAGGGCATCTCATGATTTGGTGCCGCA
GGTAATATTGGCGACTCGGAGTCTTATTCAGCTCGATAACAGAACTCGAGCGGCTGCAAAGGCTCCGACAAATGCTTCTTCTAATGCTAATACACCCAGCAAAGATAAGA
AACCAGAAATCCTATTGGTCTCAAAATAA
mRNA sequenceShow/hide mRNA sequence
GTTTGAAAAAGTAGAAGGTCAATATTGAAAAAGAAAAAAAAAAGCATAAAACTCAATTTGTCTACTACACCATTGACCCATTGGCTTCGTCGCCGCCGTCCCCCTGAAAC
CAGGGGGCTCTGAAATGGATTTCCACCGGAATTCTTCCCTCAGCCACCGCCATTCCCCGTCCCCCTCCTCCTCCGCCGCCTCCACCGCCGCCGCCGCCGCCGCCGATGCC
GACCCCATGCACTCCTGGTGGGAATCCGTCTCCAAAGCCCGTTCTCGCATCCACGCCCTTTCCTCCATCCTTCCCCCCAATTCCGACTCCTTCTTCCTCTCCTCCGTCGC
CGATTCCGACCGCCCGGCGCTGTCTCTCCTCTCCTCTCACGACGCTTACTCCGCCGTCGCCTCCGCCCTCTCCTCCGCCGTCTCCGGATCTGGCTCCGACCCTCTCTGCC
ACTGGCTCTACGACACTTTCCTCTCTTCCGATCCTCATCTCCGCCTCGTCGTTCTTTCGTTCCTTCCTCTTCTTTCCTCTCTGTACCTCTTTCGCGTTCATTCCGCTTCT
GTTCCTTCTCTCGCCGGATTTGAGGCTGTGCTTCTCGCGATTTACTCCTCTGAGGTTAAGTCTAGGGCTGGGAAGCCTGTTCTTGTCTCGATTCCGGATTTGTCTCAGCC
TTCTCTTTATCATTCTCCTCGGAGTAAGCCTAATTCTGATCCCCAATCTCAATCCAGGCCGTGTGTTGGGGTTCTTTCCCCTTCTCTCGAGCCACAGGACGCGGTGAAGT
CGACGAAACGAGCTTGCATTGTTGGCGTTGGTCTTGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCTTGAATTCTGCCGATCTGCGGCGGCGTGG
GCTGGGCAGGATTGTTGCTGCAAGAGGGAGTTTGATAAAGAAGATGATGGTTTCGGAATTGGCGGGTTTTCGGAGAAAAGGGCGGTGGAGTATGGGGGGGTTGTTGAGGT
TGAGGATGTCTCTGATGAAATGGGAAAACTTCAAATTGAGAAATGTGGGAATAATTCTGATGATTCAGAGCCTAAGGGGTTTAGAATTCCATTGCCGTGGGACATTTTGC
AGCCAGTTCTTAGAATGTTAGGACATTGTTTACTGGCTCCTTTGAATTCACAAGATGTTAAGGATGCAGCTTCCATTGCTGTGAGGTGTTTATATGCGAGGGCATCTCAT
GATTTGGTGCCGCAGGTAATATTGGCGACTCGGAGTCTTATTCAGCTCGATAACAGAACTCGAGCGGCTGCAAAGGCTCCGACAAATGCTTCTTCTAATGCTAATACACC
CAGCAAAGATAAGAAACCAGAAATCCTATTGGTCTCAAAATAAACTATCTGAATGTCATTTGGTTTGCAGGTAACAGATGTTGTATAGTGAATCTGAAGTGATTGAACAT
TATTTTGTTTAGGATAATACTGTGTGGATGTCTGTACTCTTTATATCATGGTTAAATCTCTCACCAAGTTACAAGGCTTTTGAGTTGTACTTTTTTAGGATTTTATAGAA
CTTGTGTTCTGTGGCTGAGTAGACCTCTTTGGTTAGAATTTCATGTGCTTTCCTATACATGGAGGGTGACATTTTTTCTTCTTCTTTTTCAATTTTTTTTTATTATGGTT
GGTGTGTATTGTTTAGGCTATAAATAACTCAAGTTTTATACATTGTCATGTTCATTTCGATAGTGTATTTTTCAGATTTATGACATATACAATACATTTGGGTTTGTTTA
TTTTATCTATACTAATGGTTATTAAGTTG
Protein sequenceShow/hide protein sequence
MDFHRNSSLSHRHSPSPSSSAASTAAAAAADADPMHSWWESVSKARSRIHALSSILPPNSDSFFLSSVADSDRPALSLLSSHDAYSAVASALSSAVSGSGSDPLCHWLYD
TFLSSDPHLRLVVLSFLPLLSSLYLFRVHSASVPSLAGFEAVLLAIYSSEVKSRAGKPVLVSIPDLSQPSLYHSPRSKPNSDPQSQSRPCVGVLSPSLEPQDAVKSTKRA
CIVGVGLDCYYKQISQMPSWSKLEFCRSAAAWAGQDCCCKREFDKEDDGFGIGGFSEKRAVEYGGVVEVEDVSDEMGKLQIEKCGNNSDDSEPKGFRIPLPWDILQPVLR
MLGHCLLAPLNSQDVKDAASIAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAPTNASSNANTPSKDKKPEILLVSK