; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC10G202980 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC10G202980
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionCarbohydrate-binding-like fold, putative isoform 2
Genome locationCmU531Chr10:34526257..34529669
RNA-Seq ExpressionCmUC10G202980
SyntenyCmUC10G202980
Gene Ontology termsGO:2001070 - starch binding (molecular function)
InterPro domainsIPR002044 - Carbohydrate binding module family 20
IPR013783 - Immunoglobulin-like fold
IPR013784 - Carbohydrate-binding-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650984.1 hypothetical protein Csa_001314 [Cucumis sativus]4.3e-17776.07Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQADT QNDAVENQE +QSKTVRVKFQ
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ

Query:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE
        L KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFIL+GITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSEE
Subjt:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE

Query:  EKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        EKIVNQEE+SPIAPE LM E+NLTYP+EELI N  KDSIA K SVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS 
Subjt:  EKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLPE
                  N +  E  +    DTKITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHE EDDGS+ G NESN++KLPE
Subjt:  PKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLPE

Query:  NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        NIQ NQK DP+V+A QEMEAKSSY     E+DTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

XP_011651865.1 uncharacterized protein LOC101213899 isoform X1 [Cucumis sativus]3.4e-17475.58Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQ-ADT-QNDAVENQEINQSKTVRVKF
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQ ADT QNDAVENQE +QSKTVRVKF
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQ-ADT-QNDAVENQEINQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE
        QL KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFIL+GITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSE
Subjt:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE

Query:  EEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS
        EEKIVNQEE+SPIAPE LM E+NLTYP+EELI N  KDSIA K SVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS
Subjt:  EEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS

Query:  YPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLP
                   N +  E  +    DTKITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHE EDDGS+ G NESN++KLP
Subjt:  YPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLP

Query:  E--NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        E  NIQ NQK DP+V+A QEMEAKSSY     E+DTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  E--NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

XP_011651866.1 phosphoglucan, water dikinase, chloroplastic isoform X2 [Cucumis sativus]1.4e-17575.74Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQADT QNDAVENQE +QSKTVRVKFQ
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ

Query:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE
        L KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFIL+GITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSEE
Subjt:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE

Query:  EKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        EKIVNQEE+SPIAPE LM E+NLTYP+EELI N  KDSIA K SVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS 
Subjt:  EKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLPE
                  N +  E  +    DTKITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHE EDDGS+ G NESN++KLPE
Subjt:  PKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLPE

Query:  --NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
          NIQ NQK DP+V+A QEMEAKSSY     E+DTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  --NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

XP_011651867.1 uncharacterized protein LOC101213899 isoform X3 [Cucumis sativus]1.1e-17575.91Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQ-ADT-QNDAVENQEINQSKTVRVKF
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQ ADT QNDAVENQE +QSKTVRVKF
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQ-ADT-QNDAVENQEINQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE
        QL KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFIL+GITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSE
Subjt:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE

Query:  EEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS
        EEKIVNQEE+SPIAPE LM E+NLTYP+EELI N  KDSIA K SVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS
Subjt:  EEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS

Query:  YPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLP
                   N +  E  +    DTKITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHE EDDGS+ G NESN++KLP
Subjt:  YPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLP

Query:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        ENIQ NQK DP+V+A QEMEAKSSY     E+DTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

XP_038906171.1 uncharacterized protein LOC120092050 [Benincasa hispida]4.4e-19880.93Show/hide
Query:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE
        MK LATS SI+ N+ PSSYF A SLKERLLSGGPEFISYRR WKLA+  L+HLVP R GGIDLISCFSS  QADTQNDAVENQE NQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIV
        CTFGEHFFVVGDDPIFGSWDV+SAIPLNWADGHQWAAEV+IPVGKTIQFKFIL+G TGNVVWQPGPDRTF+PWETSNTII+SEDWDSAESRI S EEKIV
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIV

Query:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKES
        NQEE+S IA EKL+I+ENLTYPNEELI NTNKDSIAEK SVE IDGSNI A EENGS+ISASEEN SNVSLSEDN +SIS SKENA+ LVA NIS PKES
Subjt:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKES

Query:  FILNTGN--------NSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNY
        FILNT N        NSNGETTITS+SDTKITEEILEND+KD       N  VQESFVN GVPILVPGLPPTPTTSNQ APP+E +DDGSIDG N++N+ 
Subjt:  FILNTGN--------NSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNY

Query:  KLPENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
         LPENIQ NQKPDPDV+A QEME KSSYEEI++E+DTN IEN+S+LQE N +IVQNDITWGHKTLKKFLS L
Subjt:  KLPENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

TrEMBL top hitse value%identityAlignment
A0A0A0LA83 CBM20 domain-containing protein6.7e-17675.74Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQADT QNDAVENQE +QSKTVRVKFQ
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ

Query:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE
        L KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFIL+GITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSEE
Subjt:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE

Query:  EKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        EKIVNQEE+SPIAPE LM E+NLTYP+EELI N  KDSIA K SVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS 
Subjt:  EKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLPE
                  N +  E  +    DTKITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHE EDDGS+ G NESN++KLPE
Subjt:  PKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLPE

Query:  --NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
          NIQ NQK DP+V+A QEMEAKSSY     E+DTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  --NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

A0A1S3B5P4 uncharacterized protein LOC103486305 isoform X21.5e-15970.36Show/hide
Query:  MKTLATSNSIVTNTAPSSYF----SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQAD-TQNDAVENQEINQSKTVRVKF
        MKTL TSNSI+ N +PSSYF    S+SS+KERLLS GPEFISYRR WKLA+S LQH VPLR GGID ISCFSS QQAD  Q+DA+ENQE +QSKTVRVKF
Subjt:  MKTLATSNSIVTNTAPSSYF----SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQAD-TQNDAVENQEINQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE
        QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFIL+GITGNV WQPGPDRTFQPWETSNTII+SEDWDSAESRILSE
Subjt:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE

Query:  EEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS
        EEKIVNQEE SPIAPE LM+E NLTYPNEELI NTNKDSIA K SVE IDGSNIPALEENG +ISASEEN SNVSL   N +SIS S E           
Subjt:  EEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS

Query:  YPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLP
                                   IT+EILEND +D          VQES V+  VPILVPGLPP            + E DGS+ G NESN++KLP
Subjt:  YPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLP

Query:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        E+ QN QK DP+V+A QEME KSSYEEI++E+DTN  ENQS+LQE NN+IVQNDITWGHKTLKKFLS L
Subjt:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

A0A5D3DMY0 Carbohydrate-binding-like fold, putative isoform 21.5e-15970.36Show/hide
Query:  MKTLATSNSIVTNTAPSSYF----SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQAD-TQNDAVENQEINQSKTVRVKF
        MKTL TSNSI+ N +PSSYF    S+SS+KERLLS GPEFISYRR WKLA+S LQH VPLR GGID ISCFSS QQAD  Q+DA+ENQE +QSKTVRVKF
Subjt:  MKTLATSNSIVTNTAPSSYF----SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQAD-TQNDAVENQEINQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE
        QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFIL+GITGNV WQPGPDRTFQPWETSNTII+SEDWDSAESRILSE
Subjt:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE

Query:  EEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS
        EEKIVNQEE SPIAPE LM+E NLTYPNEELI NTNKDSIA K SVE IDGSNIPALEENG +ISASEEN SNVSL   N +SIS S E           
Subjt:  EEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS

Query:  YPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLP
                                   IT+EILEND +D          VQES V+  VPILVPGLPP            + E DGS+ G NESN++KLP
Subjt:  YPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NESNNYKLP

Query:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        E+ QN QK DP+V+A QEME KSSYEEI++E+DTN  ENQS+LQE NN+IVQNDITWGHKTLKKFLS L
Subjt:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

A0A6J1F2P2 uncharacterized protein LOC1114416391.3e-16869.29Show/hide
Query:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE
        MKTLATSNSI+ N A  S FSASSLKERLL GGPEF+SYRRH KL SS LQHLV LR GGI+ +SCFSS QQADTQN+ VENQ  NQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIV
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH WAAEV+IPVGK IQFKF+L+G TGNVVWQPGPDR FQPWETSNTII+SEDWDSA+SR+LSEEE IV
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIV

Query:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVE----LIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        NQ+++SP+ PEKLMIE++     +         SI EKSSVE    LI G NI A EENGS++SASEENT                    KD++A NI  
Subjt:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVE----LIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESFILNTGN--------NSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NE
         KES+ILNT N        N NGETTI SQS+TK TEE+LEN +K+ TAKI +N DVQESF+NYGVP+LVPGLPPTPTTSNQ+AP HE +DDGSIDG NE
Subjt:  PKESFILNTGN--------NSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NE

Query:  SNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE------EIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        SN++KLPENIQ     DPDV+ E EMEAKSSYE      EI++E+DTNKI N+S+LQE N++IVQNDITWGHKTLKKF S L
Subjt:  SNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE------EIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

A0A6J1J7C1 uncharacterized protein LOC1114820358.7e-16868.67Show/hide
Query:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE
        MKTLATSNSI+ N A  S FSAS LKERLL GGPEF+SYRRH KL SS LQHLV LR GGI+ + CFSS QQADTQN+ VENQ+ NQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIV
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH WAAEV+IPVGK IQFKF+L+G TGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRIL EEE I+
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIV

Query:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVE----LIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        NQ+E+SP+  EKLMIE++L    +         SI EKSSVE    +I G NI A EENGS++SASEENT                    KD++  NI  
Subjt:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSIAEKSSVE----LIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESFILNTGN--------NSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NE
        PKES+ILNT N        N NGETTI SQS+TK  EE+LEN +K+ TAKI +N DVQESF+NYGVP+LVPGLPPTPTTSNQ+AP HE EDDGSIDG NE
Subjt:  PKESFILNTGN--------NSNGETTITSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDG-NE

Query:  SNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE------EIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        SN++KLPENIQ     DPDV+ E EME KSSYE      EI++E+DTNKI N+S+LQE N +IV+NDITWGHKTLKKF S L
Subjt:  SNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE------EIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

SwissProt top hitse value%identityAlignment
O30565 Cyclomaltodextrin glucanotransferase4.7e-0928.17Show/hide
Query:  ASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVD
        +++E+Q  VP    G   +S  ++    +T++ A E  E+     V V+F +    T  G + ++VG+    G+WD   AI P+     ++   W  ++ 
Subjt:  ASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVD

Query:  IPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIIS
        +P GK +++K+I +   GNV WQ G +RT+    T    +IS
Subjt:  IPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIIS

P0DN29 Glucoamylase ARB_02327-11.1e-1041.46Show/hide
Query:  VKFQLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLN---WADG-HQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTF
        V+F+L      GE  F+VG  P  GSWDV  A+PLN   +AD  HQW  ++++P     ++KFI R   G VVW+  P+R +
Subjt:  VKFQLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLN---WADG-HQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTF

P30921 Cyclomaltodextrin glucanotransferase4.0e-0830.77Show/hide
Query:  SELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVDIP
        ++++  +P   GG+  I   +S   A T ++  +N E+     V V+F +    T  G++ ++ G     G+WD   AI PL     +Q   W  +V +P
Subjt:  SELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVDIP

Query:  VGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDW
         GKTI+FKF L+     V W+ G + TF    TS T  I+ +W
Subjt:  VGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDW

P31746 Cyclomaltodextrin glucanotransferase1.8e-0827.27Show/hide
Query:  ASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECTF-GEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVD
        +++++   VP   GG   +S  ++   A+ ++   +  E+     V V+F +    T  G + ++VG+    G+WD   AI P+     +Q   W  ++ 
Subjt:  ASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECTF-GEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVD

Query:  IPVGKTIQFKFILRGITGNVVWQPGPDRTF-QPWETSNTIIIS
        +P GK +++K+I +   GNVVWQ G +RT+  P   ++T++I+
Subjt:  IPVGKTIQFKFILRGITGNVVWQPGPDRTF-QPWETSNTIIIS

P31797 Cyclomaltodextrin glucanotransferase6.1e-0930.65Show/hide
Query:  QQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAIPLNW----ADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQP
        Q  +   + A +N E+  +  V V+F +    T  G++ ++VG+    G+WD + AI   +         W  +V +P GKTI+FKFI +   GNV W+ 
Subjt:  QQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAIPLNW----ADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQP

Query:  GPDRTF-QPWETSNTIIISEDWDS
        G +  +  P  T+  II+  DW +
Subjt:  GPDRTF-QPWETSNTIIISEDWDS

Arabidopsis top hitse value%identityAlignment
AT5G01260.1 Carbohydrate-binding-like fold8.4e-3836.03Show/hide
Query:  ISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQS-KTVRVKFQLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQ
        I + R     SS +   VPLR   I            D+Q + VE++EI  S KTVRV+FQL+KEC FGEHFF+VGDDP+FG  WD  +A+PLNW+DG+ 
Subjt:  ISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQS-KTVRVKFQLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQ

Query:  WAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDS
        W  ++D+PVG+ ++FK +L+  TG ++WQPGP+R  + WET+ TI I EDWD+A+ +++ EE+ +                     Y N   I + ++D 
Subjt:  WAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDS

Query:  IAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESFILNTG
        +    SV+    S++ A+ EN   +S      S+ S+  + T   S     A++++   +   +ES +L  G
Subjt:  IAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESFILNTG

AT5G01260.2 Carbohydrate-binding-like fold5.8e-3929.03Show/hide
Query:  ISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQS-KTVRVKFQLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQ
        I + R     SS +   VPLR   I            D+Q + VE++EI  S KTVRV+FQL+KEC FGEHFF+VGDDP+FG  WD  +A+PLNW+DG+ 
Subjt:  ISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQS-KTVRVKFQLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQ

Query:  WAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDS
        W  ++D+PVG+ ++FK +L+  TG ++WQPGP+R  + WET+ TI I EDWD+A+ +++ E                            E+ +  TN  S
Subjt:  WAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDS

Query:  IAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKD
        I                                    SED    + + ++N+  +   N  Y  +    N+  +   E T+   +      E+++     
Subjt:  IAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESFILNTGNNSNGETTITSQSDTKITEEILENDQKD

Query:  ATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDGNESNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYEEIK--EEEDTNKIE
                   +  F     P+LVPGL P     N+     E  ++G  +     + K     + N+K     ++  E   KS  E +K  E+   N +E
Subjt:  ATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDGNESNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYEEIK--EEEDTNKIE

Query:  NQSELQETN-----NNIVQNDITWGHKTLKKFLS
         + +  ET      + + +NDI WG +TL K LS
Subjt:  NQSELQETN-----NNIVQNDITWGHKTLKKFLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCCTAGCGACCTCCAACTCCATCGTCACCAACACTGCACCTTCTTCTTACTTCTCTGCTTCTTCTCTGAAAGAGCGTCTTCTTTCCGGAGGACCTGAA
TTCATCTCGTATCGGAGGCATTGGAAATTGGCTAGTTCTGAACTTCAGCATTTGGTACCTTTGCGCCCGGGAGGCATCGACTTGATTTCTTGCTTCTCGTCTCAA
CAACAGGCAGATACTCAGAATGATGCAGTTGAGAATCAAGAAATAAATCAATCAAAGACCGTTCGTGTCAAATTCCAGCTACAGAAAGAGTGCACATTTGGGGAG
CATTTCTTTGTAGTAGGTGATGATCCAATTTTTGGTTCCTGGGACGTTACAAGTGCAATACCTTTAAACTGGGCCGATGGGCATCAATGGGCAGCAGAAGTGGAT
ATTCCTGTTGGAAAAACAATCCAGTTCAAATTCATACTTCGAGGAATAACTGGAAATGTTGTATGGCAACCTGGTCCTGATCGAACATTTCAACCCTGGGAAACA
TCCAATACAATCATCATTTCTGAAGATTGGGATAGCGCTGAATCTCGGATACTAAGTGAAGAAGAAAAAATTGTTAACCAGGAGGAGAATTCTCCCATTGCCCCA
GAAAAGTTAATGATTGAGGAGAACCTCACTTATCCAAACGAAGAACTGATCCACAATACAAATAAGGATTCAATAGCAGAAAAATCATCAGTGGAACTGATTGAT
GGCAGTAACATCCCAGCTTTAGAAGAAAATGGCAGTAGTATCTCTGCTTCTGAAGAGAATACCAGTAACGTTTCTCTTTCAGAGGATAATACTAACAGCATTTCC
GCTTCAAAAGAGAATGCCAAAGATCTCGTGGCAGGGAATATTAGCTACCCAAAGGAGAGTTTCATTCTCAATACAGGTAACAATTCGAATGGGGAGACAACAATT
ACATCCCAGAGTGATACGAAGATAACAGAGGAAATTTTGGAAAATGATCAGAAAGATGCAACAGCGAAGATCCTTAAGAACACGGATGTTCAAGAAAGCTTTGTT
AACTATGGAGTTCCCATTCTAGTTCCTGGTTTACCTCCAACACCAACAACATCAAATCAGAATGCACCTCCACATGAAGCTGAAGATGATGGTTCCATCGATGGT
AATGAATCAAACAATTATAAACTACCTGAGAACATTCAAAATAATCAGAAACCGGATCCTGATGTTCTGGCTGAACAAGAGATGGAGGCAAAGTCAAGTTATGAA
GAGATTAAAGAGGAGGAGGACACAAATAAAATTGAGAATCAGTCCGAATTGCAGGAAACCAACAACAATATCGTTCAAAATGACATAACATGGGGTCATAAAACC
CTGAAGAAGTTCCTCTCCATTCTGTACTATGTTGTTGGAAGATTTCAACTTTCTAGAACCCCGCTGGGGGATGGAAGCTGCTGTTGGGAGTTGTACATATGGCCC
ACCCAATTCACTATATTGTATACTAATTGGGGTGTAAATACAAAGACTACGAAATTTGGTTCATTGGTGAATCTGGACTGCAATGGTATTGAAGATTCTGGTGAT
TAG
mRNA sequenceShow/hide mRNA sequence
GCGCGCGAGGCTTAAGCTCTGAGACTGTTTGTCATTGTCAGTAGCAGAGTGAGATTTTGATTGCGAGTGATGAAAACCCTAGCGACCTCCAACTCCATCGTCACC
AACACTGCACCTTCTTCTTACTTCTCTGCTTCTTCTCTGAAAGAGCGTCTTCTTTCCGGAGGACCTGAATTCATCTCGTATCGGAGGCATTGGAAATTGGCTAGT
TCTGAACTTCAGCATTTGGTACCTTTGCGCCCGGGAGGCATCGACTTGATTTCTTGCTTCTCGTCTCAACAACAGGCAGATACTCAGAATGATGCAGTTGAGAAT
CAAGAAATAAATCAATCAAAGACCGTTCGTGTCAAATTCCAGCTACAGAAAGAGTGCACATTTGGGGAGCATTTCTTTGTAGTAGGTGATGATCCAATTTTTGGT
TCCTGGGACGTTACAAGTGCAATACCTTTAAACTGGGCCGATGGGCATCAATGGGCAGCAGAAGTGGATATTCCTGTTGGAAAAACAATCCAGTTCAAATTCATA
CTTCGAGGAATAACTGGAAATGTTGTATGGCAACCTGGTCCTGATCGAACATTTCAACCCTGGGAAACATCCAATACAATCATCATTTCTGAAGATTGGGATAGC
GCTGAATCTCGGATACTAAGTGAAGAAGAAAAAATTGTTAACCAGGAGGAGAATTCTCCCATTGCCCCAGAAAAGTTAATGATTGAGGAGAACCTCACTTATCCA
AACGAAGAACTGATCCACAATACAAATAAGGATTCAATAGCAGAAAAATCATCAGTGGAACTGATTGATGGCAGTAACATCCCAGCTTTAGAAGAAAATGGCAGT
AGTATCTCTGCTTCTGAAGAGAATACCAGTAACGTTTCTCTTTCAGAGGATAATACTAACAGCATTTCCGCTTCAAAAGAGAATGCCAAAGATCTCGTGGCAGGG
AATATTAGCTACCCAAAGGAGAGTTTCATTCTCAATACAGGTAACAATTCGAATGGGGAGACAACAATTACATCCCAGAGTGATACGAAGATAACAGAGGAAATT
TTGGAAAATGATCAGAAAGATGCAACAGCGAAGATCCTTAAGAACACGGATGTTCAAGAAAGCTTTGTTAACTATGGAGTTCCCATTCTAGTTCCTGGTTTACCT
CCAACACCAACAACATCAAATCAGAATGCACCTCCACATGAAGCTGAAGATGATGGTTCCATCGATGGTAATGAATCAAACAATTATAAACTACCTGAGAACATT
CAAAATAATCAGAAACCGGATCCTGATGTTCTGGCTGAACAAGAGATGGAGGCAAAGTCAAGTTATGAAGAGATTAAAGAGGAGGAGGACACAAATAAAATTGAG
AATCAGTCCGAATTGCAGGAAACCAACAACAATATCGTTCAAAATGACATAACATGGGGTCATAAAACCCTGAAGAAGTTCCTCTCCATTCTGTACTATGTTGTT
GGAAGATTTCAACTTTCTAGAACCCCGCTGGGGGATGGAAGCTGCTGTTGGGAGTTGTACATATGGCCCACCCAATTCACTATATTGTATACTAATTGGGGTGTA
AATACAAAGACTACGAAATTTGGTTCATTGGTGAATCTGGACTGCAATGGTATTGAAGATTCTGGTGATTAGAGTCTTCTAAGTCATTTGGATTTGTATGTATAT
GTTACAAAGGATGCATTCTCAAGTTTCAAGTACCCTTCTAGTTATTCTTGAATATTGTTTTGTTTTCAAG
Protein sequenceShow/hide protein sequence
MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECTFGE
HFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILRGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKIVNQEENSPIAP
EKLMIEENLTYPNEELIHNTNKDSIAEKSSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESFILNTGNNSNGETTI
TSQSDTKITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEAEDDGSIDGNESNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE
EIKEEEDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSILYYVVGRFQLSRTPLGDGSCCWELYIWPTQFTILYTNWGVNTKTTKFGSLVNLDCNGIEDSGD