; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G199410 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G199410
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionCarbohydrate-binding-like fold, putative isoform 2
Genome locationCiama_Chr10:34764363..34767740
RNA-Seq ExpressionCaUC10G199410
SyntenyCaUC10G199410
Gene Ontology termsGO:2001070 - starch binding (molecular function)
InterPro domainsIPR002044 - Carbohydrate binding module family 20
IPR013783 - Immunoglobulin-like fold
IPR013784 - Carbohydrate-binding-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650984.1 hypothetical protein Csa_001314 [Cucumis sativus]3.0e-17876.28Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQADT QNDAVENQE +QSKTVRVKFQ
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ

Query:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE
        L KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFILQGITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSEE
Subjt:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE

Query:  EKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        EK VNQEE+SPIAPE LM E+NLTYP+EELI N  KDSI  KPSVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS 
Subjt:  EKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLPE
                  N +  E  +    DT+ITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHEVEDDGS+ G NESN++KLPE
Subjt:  PKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLPE

Query:  NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        NIQ NQK DP+V+A QEMEAKSSY     EDDTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

XP_011651865.1 uncharacterized protein LOC101213899 isoform X1 [Cucumis sativus]2.3e-17575.8Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQ-ADT-QNDAVENQEINQSKTVRVKF
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQ ADT QNDAVENQE +QSKTVRVKF
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQ-ADT-QNDAVENQEINQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE
        QL KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFILQGITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSE
Subjt:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE

Query:  EEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS
        EEK VNQEE+SPIAPE LM E+NLTYP+EELI N  KDSI  KPSVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS
Subjt:  EEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS

Query:  YPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLP
                   N +  E  +    DT+ITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHEVEDDGS+ G NESN++KLP
Subjt:  YPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLP

Query:  E--NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        E  NIQ NQK DP+V+A QEMEAKSSY     EDDTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  E--NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

XP_011651866.1 phosphoglucan, water dikinase, chloroplastic isoform X2 [Cucumis sativus]9.5e-17775.96Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQADT QNDAVENQE +QSKTVRVKFQ
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ

Query:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE
        L KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFILQGITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSEE
Subjt:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE

Query:  EKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        EK VNQEE+SPIAPE LM E+NLTYP+EELI N  KDSI  KPSVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS 
Subjt:  EKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLPE
                  N +  E  +    DT+ITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHEVEDDGS+ G NESN++KLPE
Subjt:  PKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLPE

Query:  --NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
          NIQ NQK DP+V+A QEMEAKSSY     EDDTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  --NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

XP_011651867.1 uncharacterized protein LOC101213899 isoform X3 [Cucumis sativus]7.3e-17776.12Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQ-ADT-QNDAVENQEINQSKTVRVKF
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQ ADT QNDAVENQE +QSKTVRVKF
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQ-ADT-QNDAVENQEINQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE
        QL KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFILQGITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSE
Subjt:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE

Query:  EEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS
        EEK VNQEE+SPIAPE LM E+NLTYP+EELI N  KDSI  KPSVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS
Subjt:  EEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS

Query:  YPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLP
                   N +  E  +    DT+ITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHEVEDDGS+ G NESN++KLP
Subjt:  YPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLP

Query:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        ENIQ NQK DP+V+A QEMEAKSSY     EDDTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

XP_038906171.1 uncharacterized protein LOC120092050 [Benincasa hispida]2.0e-19880.93Show/hide
Query:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE
        MK LATS SI+ N+ PSSYF A SLKERLLSGGPEFISYRR WKLA+  L+HLVP R GGIDLISCFSS  QADTQNDAVENQE NQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFV
        CTFGEHFFVVGDDPIFGSWDV+SAIPLNWADGHQWAAEV+IPVGKTIQFKFILQG TGNVVWQPGPDRTF+PWETSNTII+SEDWDSAESRI S EEK V
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFV

Query:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKES
        NQEE+S IA EKL+I+ENLTYPNEELI NTNKDSI EKPSVE IDGSNI A EENGS+ISASEEN SNVSLSEDN +SIS SKENA+ LVA NIS PKES
Subjt:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKES

Query:  VILNTGN--------NSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNY
         ILNT N        NSNGETTITS+SDT+ITEEILEND+KD       N  VQESFVN GVPILVPGLPPTPTTSNQ APP+EV+DDGSIDG N++N+ 
Subjt:  VILNTGN--------NSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNY

Query:  KLPENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
         LPENIQ NQKPDPDV+A QEME KSSYEEI++EDDTN IEN+S+LQE N +IVQNDITWGHKTLKKFLS L
Subjt:  KLPENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

TrEMBL top hitse value%identityAlignment
A0A0A0LA83 CBM20 domain-containing protein4.6e-17775.96Show/hide
Query:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ
        MKTL T NSI+ N +PSSYF  S+SSLKERLLSGGPEFISYRR WKLA+S LQHLVPLR GGID I SCF+S QQADT QNDAVENQE +QSKTVRVKFQ
Subjt:  MKTLATSNSIVTNTAPSSYF--SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLI-SCFSSQQQADT-QNDAVENQEINQSKTVRVKFQ

Query:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE
        L KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFILQGITGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRILSEE
Subjt:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEE

Query:  EKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        EK VNQEE+SPIAPE LM E+NLTYP+EELI N  KDSI  KPSVELIDGSNI ALEENG +ISASEEN +NVSL E + +SIS S +NAKDLVAGNIS 
Subjt:  EKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLPE
                  N +  E  +    DT+ITEE LEND KD          VQES V+  VPILVPGLPPT T SNQNAPPHEVEDDGS+ G NESN++KLPE
Subjt:  PKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLPE

Query:  --NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
          NIQ NQK DP+V+A QEMEAKSSY     EDDTN IENQS+LQE NN++VQND+TWGHKTLKKFLS L
Subjt:  --NIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

A0A1S3B5P4 uncharacterized protein LOC103486305 isoform X23.9e-16070.58Show/hide
Query:  MKTLATSNSIVTNTAPSSYF----SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQAD-TQNDAVENQEINQSKTVRVKF
        MKTL TSNSI+ N +PSSYF    S+SS+KERLLS GPEFISYRR WKLA+S LQH VPLR GGID ISCFSS QQAD  Q+DA+ENQE +QSKTVRVKF
Subjt:  MKTLATSNSIVTNTAPSSYF----SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQAD-TQNDAVENQEINQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE
        QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFILQGITGNV WQPGPDRTFQPWETSNTII+SEDWDSAESRILSE
Subjt:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE

Query:  EEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS
        EEK VNQEE SPIAPE LM+E NLTYPNEELI NTNKDSI  K SVE IDGSNIPALEENG +ISASEEN SNVSL   N +SIS S             
Subjt:  EEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS

Query:  YPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLP
                                  EIT+EILEND +D          VQES V+  VPILVPGLPP            +VE DGS+ G NESN++KLP
Subjt:  YPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLP

Query:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        E+ QN QK DP+V+A QEME KSSYEEI++EDDTN  ENQS+LQE NN+IVQNDITWGHKTLKKFLS L
Subjt:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

A0A5D3DMY0 Carbohydrate-binding-like fold, putative isoform 23.9e-16070.58Show/hide
Query:  MKTLATSNSIVTNTAPSSYF----SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQAD-TQNDAVENQEINQSKTVRVKF
        MKTL TSNSI+ N +PSSYF    S+SS+KERLLS GPEFISYRR WKLA+S LQH VPLR GGID ISCFSS QQAD  Q+DA+ENQE +QSKTVRVKF
Subjt:  MKTLATSNSIVTNTAPSSYF----SASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQAD-TQNDAVENQEINQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE
        QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGK IQFKFILQGITGNV WQPGPDRTFQPWETSNTII+SEDWDSAESRILSE
Subjt:  QLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSE

Query:  EEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS
        EEK VNQEE SPIAPE LM+E NLTYPNEELI NTNKDSI  K SVE IDGSNIPALEENG +ISASEEN SNVSL   N +SIS S             
Subjt:  EEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNIS

Query:  YPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLP
                                  EIT+EILEND +D          VQES V+  VPILVPGLPP            +VE DGS+ G NESN++KLP
Subjt:  YPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NESNNYKLP

Query:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        E+ QN QK DP+V+A QEME KSSYEEI++EDDTN  ENQS+LQE NN+IVQNDITWGHKTLKKFLS L
Subjt:  ENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

A0A6J1F2P2 uncharacterized protein LOC1114416393.0e-16869.29Show/hide
Query:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE
        MKTLATSNSI+ N A  S FSASSLKERLL GGPEF+SYRRH KL SS LQHLV LR GGI+ +SCFSS QQADTQN+ VENQ  NQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFV
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH WAAEV+IPVGK IQFKF+LQG TGNVVWQPGPDR FQPWETSNTII+SEDWDSA+SR+LSEEE  V
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFV

Query:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVE----LIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        NQ+++SP+ PEKLMIE++     +         SI EK SVE    LI G NI A EENGS++SASEENT                    KD++A NI  
Subjt:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVE----LIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESVILNTGN--------NSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NE
         KES ILNT N        N NGETTI SQS+T+ TEE+LEN +K+ TAKI +N DVQESF+NYGVP+LVPGLPPTPTTSNQ+AP HEV+DDGSIDG NE
Subjt:  PKESVILNTGN--------NSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NE

Query:  SNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE------EIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        SN++KLPENIQ     DPDV+ E EMEAKSSYE      EI++EDDTNKI N+S+LQE N++IVQNDITWGHKTLKKF S L
Subjt:  SNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE------EIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

A0A6J1J7C1 uncharacterized protein LOC1114820351.9e-16768.67Show/hide
Query:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE
        MKTLATSNSI+ N A  S FSAS LKERLL GGPEF+SYRRH KL SS LQHLV LR GGI+ + CFSS QQADTQN+ VENQ+ NQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFV
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH WAAEV+IPVGK IQFKF+LQG TGNVVWQPGPDRTFQPWETSNTII+SEDWDSAESRIL EEE  +
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFV

Query:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVE----LIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY
        NQ+E+SP+  EKLMIE++L    +         SI EK SVE    +I G NI A EENGS++SASEENT                    KD++  NI  
Subjt:  NQEENSPIAPEKLMIEENLTYPNEELIHNTNKDSITEKPSVE----LIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISY

Query:  PKESVILNTGN--------NSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NE
        PKES ILNT N        N NGETTI SQS+T+  EE+LEN +K+ TAKI +N DVQESF+NYGVP+LVPGLPPTPTTSNQ+AP HEVEDDGSIDG NE
Subjt:  PKESVILNTGN--------NSNGETTITSQSDTEITEEILENDQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDG-NE

Query:  SNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE------EIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL
        SN++KLPENIQ     DPDV+ E EME KSSYE      EI++EDDTNKI N+S+LQE N +IV+NDITWGHKTLKKF S L
Subjt:  SNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYE------EIKEEDDTNKIENQSELQETNNNIVQNDITWGHKTLKKFLSIL

SwissProt top hitse value%identityAlignment
O30565 Cyclomaltodextrin glucanotransferase8.0e-0928.17Show/hide
Query:  ASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVD
        +++E+Q  VP    G   +S  ++    +T++ A E  E+     V V+F +    T  G + ++VG+    G+WD   AI P+     ++   W  ++ 
Subjt:  ASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVD

Query:  IPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIIS
        +P GK +++K+I +   GNV WQ G +RT+    T    +IS
Subjt:  IPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIIS

P0DN29 Glucoamylase ARB_02327-12.5e-1040.24Show/hide
Query:  VKFQLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLN---WADG-HQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTF
        V+F+L      GE  F+VG  P  GSWDV  A+PLN   +AD  HQW  ++++P     ++KFI +   G VVW+  P+R +
Subjt:  VKFQLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLN---WADG-HQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTF

P30921 Cyclomaltodextrin glucanotransferase3.0e-0831.94Show/hide
Query:  SELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVDIP
        ++++  +P   GG+  I   +S   A T ++  +N E+     V V+F +    T  G++ ++ G     G+WD   AI PL     +Q   W  +V +P
Subjt:  SELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVDIP

Query:  VGKTIQFKFI-LQGITGNVVWQPGPDRTFQPWETSNTIIISEDW
         GKTI+FKF+  QG T  V W+ G + TF    TS T  I+ +W
Subjt:  VGKTIQFKFI-LQGITGNVVWQPGPDRTFQPWETSNTIIISEDW

P31746 Cyclomaltodextrin glucanotransferase3.0e-0827.27Show/hide
Query:  ASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECTF-GEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVD
        +++++   VP   GG   +S  ++   A+ ++   +  E+     V V+F +    T  G + ++VG+    G+WD   AI P+     +Q   W  ++ 
Subjt:  ASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECTF-GEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WAAEVD

Query:  IPVGKTIQFKFILQGITGNVVWQPGPDRTF-QPWETSNTIIIS
        +P GK +++K+I +   GNVVWQ G +RT+  P   ++T++I+
Subjt:  IPVGKTIQFKFILQGITGNVVWQPGPDRTF-QPWETSNTIIIS

P31797 Cyclomaltodextrin glucanotransferase1.0e-0830.65Show/hide
Query:  QQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAIPLNW----ADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQP
        Q  +   + A +N E+  +  V V+F +    T  G++ ++VG+    G+WD + AI   +         W  +V +P GKTI+FKFI +   GNV W+ 
Subjt:  QQQADTQNDAVENQEINQSKTVRVKFQLQKECT-FGEHFFVVGDDPIFGSWDVTSAIPLNW----ADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQP

Query:  GPDRTF-QPWETSNTIIISEDWDS
        G +  +  P  T+  II+  DW +
Subjt:  GPDRTF-QPWETSNTIIISEDWDS

Arabidopsis top hitse value%identityAlignment
AT5G01260.1 Carbohydrate-binding-like fold9.3e-3734.11Show/hide
Query:  ISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQS-KTVRVKFQLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQ
        I + R     SS +   VPLR   I            D+Q + VE++EI  S KTVRV+FQL+KEC FGEHFF+VGDDP+FG  WD  +A+PLNW+DG+ 
Subjt:  ISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQS-KTVRVKFQLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQ

Query:  WAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDS
        W  ++D+PVG+ ++FK +L+  TG ++WQPGP+R  + WET+ TI I EDWD+A+ +++ EE+ FV     S I  E            +E++      S
Subjt:  WAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDS

Query:  ITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESVILNTG-----NNSNGETTITSQSDTEITEEIL
        + +  SV  +         EN   +S      S+ S+  + T   S     A++++   +   +ES +L  G     +  N +  + ++   E   E++
Subjt:  ITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESVILNTG-----NNSNGETTITSQSDTEITEEIL

AT5G01260.2 Carbohydrate-binding-like fold1.1e-3730.88Show/hide
Query:  ISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQS-KTVRVKFQLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQ
        I + R     SS +   VPLR   I            D+Q + VE++EI  S KTVRV+FQL+KEC FGEHFF+VGDDP+FG  WD  +A+PLNW+DG+ 
Subjt:  ISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQS-KTVRVKFQLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQ

Query:  WAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDS
        W  ++D+PVG+ ++FK +L+  TG ++WQPGP+R  + WET+ TI I EDWD+A+ +++ EE+ FV                    Y N   I + ++D 
Subjt:  WAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFVNQEENSPIAPEKLMIEENLTYPNEELIHNTNKDS

Query:  ITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKD
        +             + ++++N S ++   EN   VS      +S S   E                    T   SNG  T       E+ +E +  +++ 
Subjt:  ITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESVILNTGNNSNGETTITSQSDTEITEEILENDQKD

Query:  ATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDGNESNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYEEIK--EEDDTNKIE
                            P+LVPGL P     N+     EV ++G  +     + K     + N+K     ++  E   KS  E +K  E+   N +E
Subjt:  ATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDGNESNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYEEIK--EEDDTNKIE

Query:  NQSELQETN-----NNIVQNDITWGHKTLKKFLS
         + +  ET      + + +NDI WG +TL K LS
Subjt:  NQSELQETN-----NNIVQNDITWGHKTLKKFLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCCTAGCGACCTCCAACTCCATCGTCACCAACACTGCACCTTCTTCTTACTTCTCTGCTTCTTCTCTGAAAGAGCGTCTTCTTTCCGGAGGACCTGAATTCAT
CTCGTATCGGAGGCATTGGAAATTGGCTAGTTCTGAACTTCAGCATTTGGTACCTTTGCGCCCGGGAGGCATCGACTTGATTTCTTGCTTCTCGTCTCAACAACAGGCAG
ATACTCAGAATGATGCAGTTGAGAATCAAGAAATAAATCAATCAAAGACCGTTCGTGTCAAATTCCAGCTGCAGAAAGAGTGCACATTTGGGGAGCATTTCTTTGTAGTA
GGTGATGATCCAATTTTTGGTTCCTGGGACGTTACAAGTGCAATACCTTTAAACTGGGCCGATGGGCATCAATGGGCAGCAGAAGTGGATATTCCTGTTGGAAAAACAAT
CCAGTTCAAATTCATACTTCAAGGAATAACTGGAAATGTTGTATGGCAACCTGGTCCTGATCGAACATTTCAACCCTGGGAAACATCCAATACAATCATCATTTCTGAAG
ATTGGGATAGCGCTGAATCTCGGATACTGAGTGAAGAAGAAAAATTTGTTAACCAAGAGGAGAATTCTCCCATTGCCCCAGAAAAGTTAATGATTGAGGAGAACCTCACT
TATCCAAACGAAGAACTGATCCACAATACAAATAAGGATTCAATAACAGAAAAACCATCAGTGGAACTGATTGATGGCAGTAACATCCCAGCTTTAGAAGAAAATGGCAG
TAGTATCTCTGCTTCTGAAGAGAATACCAGTAACGTTTCTCTTTCAGAGGATAATACTAACAGCATTTCCGCTTCAAAAGAGAATGCCAAAGATCTCGTGGCAGGGAATA
TTAGCTACCCAAAGGAGAGTGTCATTCTCAATACAGGTAACAATTCGAATGGGGAGACAACAATTACATCCCAGAGTGATACAGAGATAACAGAGGAAATTTTGGAAAAT
GATCAGAAAGATGCAACAGCGAAGATCCTTAAGAACACGGATGTTCAAGAAAGCTTTGTTAACTATGGAGTTCCCATTCTAGTTCCTGGTTTACCTCCAACACCAACAAC
ATCAAATCAGAATGCACCTCCACATGAAGTTGAAGATGATGGTTCCATCGATGGTAATGAATCAAACAATTATAAACTACCTGAGAACATTCAAAATAATCAGAAACCGG
ATCCTGATGTTCTGGCTGAACAAGAGATGGAGGCAAAGTCAAGTTATGAAGAGATTAAAGAGGAGGATGACACAAATAAAATTGAGAATCAGTCCGAATTGCAGGAAACC
AACAACAATATCGTTCAAAATGACATAACATGGGGTCATAAAACCCTGAAGAAGTTCCTCTCCATTCTGTACTATGTTGTTGGAAGATTTCAACTTTCTAGAACCCCACT
GGGGGATGGAAGCTGCTGTTGGGAGTTGTACATATGGCCCACCCAATTCACTATATTGTATACTAATTGGGGTGTAAAGACAAAGACTACGAAATTTGGTTCATTGGTGA
GTCTGGACTGCAATGGTATTGAAGATTCTGGTAATTAG
mRNA sequenceShow/hide mRNA sequence
GCGCGCGAGGCTTAAGCTCTGAGACTGTTTGTCATTGTCAGTAGCAGAGTGAGATTTTGATTGCGAGTGATGAAAACCCTAGCGACCTCCAACTCCATCGTCACCAACAC
TGCACCTTCTTCTTACTTCTCTGCTTCTTCTCTGAAAGAGCGTCTTCTTTCCGGAGGACCTGAATTCATCTCGTATCGGAGGCATTGGAAATTGGCTAGTTCTGAACTTC
AGCATTTGGTACCTTTGCGCCCGGGAGGCATCGACTTGATTTCTTGCTTCTCGTCTCAACAACAGGCAGATACTCAGAATGATGCAGTTGAGAATCAAGAAATAAATCAA
TCAAAGACCGTTCGTGTCAAATTCCAGCTGCAGAAAGAGTGCACATTTGGGGAGCATTTCTTTGTAGTAGGTGATGATCCAATTTTTGGTTCCTGGGACGTTACAAGTGC
AATACCTTTAAACTGGGCCGATGGGCATCAATGGGCAGCAGAAGTGGATATTCCTGTTGGAAAAACAATCCAGTTCAAATTCATACTTCAAGGAATAACTGGAAATGTTG
TATGGCAACCTGGTCCTGATCGAACATTTCAACCCTGGGAAACATCCAATACAATCATCATTTCTGAAGATTGGGATAGCGCTGAATCTCGGATACTGAGTGAAGAAGAA
AAATTTGTTAACCAAGAGGAGAATTCTCCCATTGCCCCAGAAAAGTTAATGATTGAGGAGAACCTCACTTATCCAAACGAAGAACTGATCCACAATACAAATAAGGATTC
AATAACAGAAAAACCATCAGTGGAACTGATTGATGGCAGTAACATCCCAGCTTTAGAAGAAAATGGCAGTAGTATCTCTGCTTCTGAAGAGAATACCAGTAACGTTTCTC
TTTCAGAGGATAATACTAACAGCATTTCCGCTTCAAAAGAGAATGCCAAAGATCTCGTGGCAGGGAATATTAGCTACCCAAAGGAGAGTGTCATTCTCAATACAGGTAAC
AATTCGAATGGGGAGACAACAATTACATCCCAGAGTGATACAGAGATAACAGAGGAAATTTTGGAAAATGATCAGAAAGATGCAACAGCGAAGATCCTTAAGAACACGGA
TGTTCAAGAAAGCTTTGTTAACTATGGAGTTCCCATTCTAGTTCCTGGTTTACCTCCAACACCAACAACATCAAATCAGAATGCACCTCCACATGAAGTTGAAGATGATG
GTTCCATCGATGGTAATGAATCAAACAATTATAAACTACCTGAGAACATTCAAAATAATCAGAAACCGGATCCTGATGTTCTGGCTGAACAAGAGATGGAGGCAAAGTCA
AGTTATGAAGAGATTAAAGAGGAGGATGACACAAATAAAATTGAGAATCAGTCCGAATTGCAGGAAACCAACAACAATATCGTTCAAAATGACATAACATGGGGTCATAA
AACCCTGAAGAAGTTCCTCTCCATTCTGTACTATGTTGTTGGAAGATTTCAACTTTCTAGAACCCCACTGGGGGATGGAAGCTGCTGTTGGGAGTTGTACATATGGCCCA
CCCAATTCACTATATTGTATACTAATTGGGGTGTAAAGACAAAGACTACGAAATTTGGTTCATTGGTGAGTCTGGACTGCAATGGTATTGAAGATTCTGGTAATTAGAGT
CTTCTAAGTCATTTGGATTTGTATGTATATGTTACAAAGGATGCATTCTCAAGTTTCAAGTACCCTTCTAGTTATTCTTGAATATTGTTTTGTTTTCAAG
Protein sequenceShow/hide protein sequence
MKTLATSNSIVTNTAPSSYFSASSLKERLLSGGPEFISYRRHWKLASSELQHLVPLRPGGIDLISCFSSQQQADTQNDAVENQEINQSKTVRVKFQLQKECTFGEHFFVV
GDDPIFGSWDVTSAIPLNWADGHQWAAEVDIPVGKTIQFKFILQGITGNVVWQPGPDRTFQPWETSNTIIISEDWDSAESRILSEEEKFVNQEENSPIAPEKLMIEENLT
YPNEELIHNTNKDSITEKPSVELIDGSNIPALEENGSSISASEENTSNVSLSEDNTNSISASKENAKDLVAGNISYPKESVILNTGNNSNGETTITSQSDTEITEEILEN
DQKDATAKILKNTDVQESFVNYGVPILVPGLPPTPTTSNQNAPPHEVEDDGSIDGNESNNYKLPENIQNNQKPDPDVLAEQEMEAKSSYEEIKEEDDTNKIENQSELQET
NNNIVQNDITWGHKTLKKFLSILYYVVGRFQLSRTPLGDGSCCWELYIWPTQFTILYTNWGVKTKTTKFGSLVSLDCNGIEDSGN