CuGenDBv2

Sgr014473 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr014473
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Carbohydrate-binding-like fold, putative isoform 2
Genome location: tig00000589:680076..683154
RNA-Seq Expression: Sgr014473
Synteny: Sgr014473
Gene Ontology terms: GO:2001070 - starch binding (molecular function)
InterPro domains: IPR002044 - Carbohydrate binding module family 20
IPR013783 - Immunoglobulin-like fold
IPR013784 - Carbohydrate-binding-like fold
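
The genome location above follows a contig:start..end pattern. A minimal sketch of splitting such a string into its parts is given here; the names GenomeLocation and parse_location are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'tig00000589:680076..683154'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00000589:680076..683154")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene span for this record works out to 3079 bp.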


Homology
GenBank top hits (e value, % identity, alignment)
KAG6580479.1 putative LRR receptor-like serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.5e-173; % identity: 68.55)
Query:  METLASSNSIVGNNTAPGYFYA-SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE
        M+TLA+SNSI+GNN AP  F A SLKERLLCGGPE +SYRR RKL +SGL+HLVSLRRGGI  LSC SS+QQADTQN+ V N+ TNQSKTVRVKFQLQKE
Subjt:  METLASSNSIVGNNTAPGYFYA-SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE

Query:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV
        CTFGE F VVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTIIVSEDWDSA+SRMLSEEE IV
Subjt:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV

Query:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD
        NQD+ +P+VPEKLMI +S                S ALAD SI +KSSVESH  LI    I+ASEEN SN+S+SE                    EN KD
Subjt:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD

Query:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD
        I+A NI   KES+ILNTSNK VS+VY N NGETT  S S+TK TE +L+N E++ T KI  N DVQESFIN+GVP+LVPGLPPTPT SNQ A QHE + D
Subjt:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD

Query:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL
         SI+GINESNDHKLPE   DP VV E EM AK SYEE+         +QSE+RQEDDTNKI N SDLQE+ND I+QNDITWGHKTL KFF++L+LL
Subjt:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL

KAG7017230.1 hypothetical protein SDJN02_19093 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 4.1e-174; % identity: 68.75)
Query:  METLASSNSIVGNNTAPGYFYA-SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE
        M+TLA+SNSI+GNN AP  F A SLKERLLCGGPE +SYRR RKL +SGL+HLVSLRRGGI  LSC SS+QQADTQN+ V N+DTNQSKTVRVKFQLQKE
Subjt:  METLASSNSIVGNNTAPGYFYA-SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE

Query:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV
        CTFGE F VVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTIIVSEDWDSA+SRMLSEEE IV
Subjt:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV

Query:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD
        NQD+ +P+VPEKLMI +S                S ALAD SI +KSSVESH  LI    I+ASEEN SN+S+SE                    EN KD
Subjt:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD

Query:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD
        I+A NI   KES+ILNTSNK VS+VY+N NGETT  S S+TK TE +L+N E++ T KI  N DVQESFIN+GVP+LVPGLPPTPT SNQ A QHE + D
Subjt:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD

Query:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL
         SI+GINESNDHKLPE   DP VV E EM AK SYEE+         +QSE+RQEDDTNKI N SDLQE+ND I+QNDITWGHKTL KFF++L+LL
Subjt:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL

XP_022145584.1 uncharacterized protein LOC111015001 [Momordica charantia] (e value: 1.1e-195; % identity: 74.16)
Query:  METLASSNSIVGNNTAPGYFYA---SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQ
        METLA+SNSI+ NNTAP  F A   SL+ERLLCGGPE ISYR P K  +SGL+HL SLRRGGI   + LSS+ Q DTQNDAV N+DTNQ KTVRVKFQLQ
Subjt:  METLASSNSIVGNNTAPGYFYA---SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQ

Query:  KECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEK
        KECTFGEQFLVVGDDP+ GSW+VTSAIPLNWADGHQW AEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTI+VSEDWDS ES  L+EEEK
Subjt:  KECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEK

Query:  IVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENA
        +VNQ+ED+PIV E LMIV+    PN+ LIHNTN EPSVAL DTSIA+KSSVESH ELIDFS I+AS+EN S+IS+ + +A +ISL EE AS IS     A
Subjt:  IVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENA

Query:  KDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDE
        K+I+AENISY KESFILN+SNK VS+VY+NSNGE+TFT  SDTKITEGI ++ E+ ATVKILGN DVQES IN  VPILVPGLPPTPT SN+AA QHE E
Subjt:  KDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDE

Query:  YDVSINGINESNDHKLPED--------PYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNL
         DVSINGINESN H+LPE+        PY+VAE+E+ AKLSY ED+EEDD+ED LQSE+RQ+DD NKIEN SDLQEIN+DIL+ND+TWGHKTL+K   NL
Subjt:  YDVSINGINESNDHKLPED--------PYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNL

Query:  KLL
        K L
Subjt:  KLL

XP_022934469.1 uncharacterized protein LOC111441639 [Cucurbita moschata] (e value: 3.5e-173; % identity: 68.55)
Query:  METLASSNSIVGNNTAPGYFYA-SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE
        M+TLA+SNSI+GNN AP  F A SLKERLLCGGPE +SYRR RKL +SGL+HLVSLRRGGI  LSC SS+QQADTQN+ V N+ TNQSKTVRVKFQLQKE
Subjt:  METLASSNSIVGNNTAPGYFYA-SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE

Query:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV
        CTFGE F VVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTIIVSEDWDSA+SRMLSEEE IV
Subjt:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV

Query:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD
        NQD+ +P+VPEKLMI +S                S ALAD SI +KSSVESH  LI    I+ASEEN SN+S+SE                    EN KD
Subjt:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD

Query:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD
        I+A NI   KES+ILNTSNK VS+VY N NGETT  S S+TK TE +L+N E++ T KI  N DVQESFIN+GVP+LVPGLPPTPT SNQ A QHE + D
Subjt:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD

Query:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL
         SI+GINESNDHKLPE   DP VV E EM AK SYEE+         +QSE+RQEDDTNKI N SDLQE+ND I+QNDITWGHKTL KFF++L+LL
Subjt:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL

XP_038906171.1 uncharacterized protein LOC120092050 [Benincasa hispida] (e value: 7.8e-173; % identity: 69)
Query:  METLASSNSIVGNNTAPGYFYASLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKEC
        M+ LA+S SI+ N+T   YF  SLKERLL GGPE ISYRRP KL   GL HLV  RRGGI L+SC SS  QADTQNDAV N++TNQSKTVRVKFQLQKEC
Subjt:  METLASSNSIVGNNTAPGYFYASLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKEC

Query:  TFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVN
        TFGE F VVGDDPIFGSWDV+SAIPLNWADGHQW AEVEIPVGKTIQFKF+LQG TGNVVWQPGPDRTF+PWET+NTIIVSEDWDSAESR+ S EEKIVN
Subjt:  TFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVN

Query:  QDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKDI
        Q+ED+ I  EKL+I E+L YPN+ELI NTN +        SIA+K SVES    ID S I+ASEEN SNIS+SE NA+++SLSE+  SSIS SKENA+ +
Subjt:  QDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKDI

Query:  IAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYDV
        +AENIS PKESFILNTSNKAVS+V++NSNGETT TS SDTKITE IL+NDE+D  V    N  VQESF+N GVPILVPGLPPTPT SNQ A  +E + D 
Subjt:  IAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYDV

Query:  SINGINESNDHKLPE--------DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL
        SI+GIN++ND  LPE        DP V+A QEM  K SYE              E+RQEDDTN IEN SDLQEIN DI+QNDITWGHKTL KF ++L+LL
Subjt:  SINGINESNDHKLPE--------DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LA83 CBM20 domain-containing protein (e value: 7.4e-145; % identity: 61.54)
Query:  METLASSNSIVGNNTAPGYF---YASLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLL-SCLSSNQQADT-QNDAVVNRDTNQSKTVRVKFQ
        M+TL + NSI+ N +   YF    +SLKERLL GGPE ISYRRP KL  SGL+HLV LRRGGI  + SC +S QQADT QNDAV N++T+QSKTVRVKFQ
Subjt:  METLASSNSIVGNNTAPGYF---YASLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLL-SCLSSNQQADT-QNDAVVNRDTNQSKTVRVKFQ

Query:  LQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEE
        L KECTFGE F VVGDDPIFGSWDVTSAIPLNWADGHQW AEV+IPVGK IQFKF+LQG TGNVVWQPGPDRTFQPWET+NTIIVSEDWDSAESR+LSEE
Subjt:  LQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEE

Query:  EKIVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKE
        EKIVNQ+ED+PI PE LM  ++L YP++ELI N        +   SIA+K SV    ELID S I+A EEN  NIS+SE N T++SL E   SSIS S +
Subjt:  EKIVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKE

Query:  NAKDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHE
        NAKD++A NI           SNKAVS+VY +           DTKITE  L+ND +D          VQES ++  VPILVPGLPPT T SNQ A  HE
Subjt:  NAKDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHE

Query:  DEYDVSINGINESNDHKLPE----------DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKF
         E D S+ GINESNDHKLPE          DP VVA QEM AK SY                   EDDTN IEN SDLQEIN+D++QND+TWGHKTL KF
Subjt:  DEYDVSINGINESNDHKLPE----------DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKF

Query:  FTNLKLL
         ++L+LL
Subjt:  FTNLKLL

A0A5D3DMY0 Carbohydrate-binding-like fold, putative isoform 2 (e value: 2.3e-130; % identity: 56.94)
Query:  METLASSNSIVGNNTAPGYF-----YASLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQAD-TQNDAVVNRDTNQSKTVRVKF
        M+TL +SNSI+ N +   YF      +S+KERLL  GPE ISYRRP KL  SGL+H V LRRGGI  +SC SS QQAD  Q+DA+ N++T+QSKTVRVKF
Subjt:  METLASSNSIVGNNTAPGYF-----YASLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQAD-TQNDAVVNRDTNQSKTVRVKF

Query:  QLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSE
        QLQKECTFGE F VVGDDPIFGSWDVTSAIPLNWADGHQW AEV+IPVGK IQFKF+LQG TGNV WQPGPDRTFQPWET+NTIIVSEDWDSAESR+LSE
Subjt:  QLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSE

Query:  EEKIVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSK
        EEKIVNQ+E +PI PE LM+  +L YPN+ELI NTN +        SIA K SVES    ID S I A EEN  NIS+SE N +++SL     SSIS S 
Subjt:  EEKIVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSK

Query:  ENAKDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQH
        E                                              IT+ IL+ND +D          VQES ++  VPILVPGLPP            
Subjt:  ENAKDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQH

Query:  EDEYDVSINGINESNDHKLPE------DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTN
        + E D S++GINESNDHKLPE      DP VVA QEM  K SYE              E+RQEDDTN  EN SDLQEIN+DI+QNDITWGHKTL KF ++
Subjt:  EDEYDVSINGINESNDHKLPE------DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTN

Query:  LKLL
        L+LL
Subjt:  LKLL

A0A6J1CVP4 uncharacterized protein LOC111015001 (e value: 5.4e-196; % identity: 74.16)
Query:  METLASSNSIVGNNTAPGYFYA---SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQ
        METLA+SNSI+ NNTAP  F A   SL+ERLLCGGPE ISYR P K  +SGL+HL SLRRGGI   + LSS+ Q DTQNDAV N+DTNQ KTVRVKFQLQ
Subjt:  METLASSNSIVGNNTAPGYFYA---SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQ

Query:  KECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEK
        KECTFGEQFLVVGDDP+ GSW+VTSAIPLNWADGHQW AEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTI+VSEDWDS ES  L+EEEK
Subjt:  KECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEK

Query:  IVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENA
        +VNQ+ED+PIV E LMIV+    PN+ LIHNTN EPSVAL DTSIA+KSSVESH ELIDFS I+AS+EN S+IS+ + +A +ISL EE AS IS     A
Subjt:  IVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENA

Query:  KDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDE
        K+I+AENISY KESFILN+SNK VS+VY+NSNGE+TFT  SDTKITEGI ++ E+ ATVKILGN DVQES IN  VPILVPGLPPTPT SN+AA QHE E
Subjt:  KDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDE

Query:  YDVSINGINESNDHKLPED--------PYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNL
         DVSINGINESN H+LPE+        PY+VAE+E+ AKLSY ED+EEDD+ED LQSE+RQ+DD NKIEN SDLQEIN+DIL+ND+TWGHKTL+K   NL
Subjt:  YDVSINGINESNDHKLPED--------PYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNL

Query:  KLL
        K L
Subjt:  KLL

A0A6J1F2P2 uncharacterized protein LOC111441639 (e value: 1.7e-173; % identity: 68.55)
Query:  METLASSNSIVGNNTAPGYFYA-SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE
        M+TLA+SNSI+GNN AP  F A SLKERLLCGGPE +SYRR RKL +SGL+HLVSLRRGGI  LSC SS+QQADTQN+ V N+ TNQSKTVRVKFQLQKE
Subjt:  METLASSNSIVGNNTAPGYFYA-SLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE

Query:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV
        CTFGE F VVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTIIVSEDWDSA+SRMLSEEE IV
Subjt:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV

Query:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD
        NQD+ +P+VPEKLMI +S                S ALAD SI +KSSVESH  LI    I+ASEEN SN+S+SE                    EN KD
Subjt:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD

Query:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD
        I+A NI   KES+ILNTSNK VS+VY N NGETT  S S+TK TE +L+N E++ T KI  N DVQESFIN+GVP+LVPGLPPTPT SNQ A QHE + D
Subjt:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD

Query:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL
         SI+GINESNDHKLPE   DP VV E EM AK SYEE+         +QSE+RQEDDTNKI N SDLQE+ND I+QNDITWGHKTL KFF++L+LL
Subjt:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL

A0A6J1J7C1 uncharacterized protein LOC111482035 (e value: 2.5e-172; % identity: 67.74)
Query:  METLASSNSIVGNNTAPGYFYAS-LKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE
        M+TLA+SNSI+GNN AP  F AS LKERLLCGGPE +SYRR RKL +SGL+HLVSLRRGGI  L C SS+QQADTQN+ V N+DTNQSKTVRVKFQLQKE
Subjt:  METLASSNSIVGNNTAPGYFYAS-LKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKE

Query:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV
        CTFGE F VVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDRTFQPWET+NTIIVSEDWDSAESR+L EEE I+
Subjt:  CTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIV

Query:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD
        NQDE +P+V EKLMI +SL                 ALAD SI +KSSVESH  +I    I+ASEEN SN+S+SE                    EN KD
Subjt:  NQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKD

Query:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD
        I+  NI  PKES+ILNTSNKAVS+VY+N NGETT  S S+TK  E +L+N E++ T KI  N DVQESFIN+GVP+LVPGLPPTPT SNQ A QHE E D
Subjt:  IIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYD

Query:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL
         SI+GINESNDHKLPE   DP VV E EM  K SYEE+         +QSE+RQEDDTNKI N SDLQE+N  I++NDITWGHKTL KFF++L+LL
Subjt:  VSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL

SwissProt top hits (e value, % identity, alignment)
P0DN29 Glucoamylase ARB_02327-1 (e value: 3.0e-10; % identity: 39.02)
Query:  VKFQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLN---WADG-HQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTF
        V+F+L      GE   +VG  P  GSWDV  A+PLN   +AD  HQW  ++E+P     ++KF+ + + G VVW+  P+R +
Subjt:  VKFQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLN---WADG-HQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTF

P30270 Alpha-amylase (e value: 5.3e-07; % identity: 28.57)
Query:  FQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDW
        F +     +GE   V GD    G+WD   A+ L+ A    W+ +V +  G   Q+K++ +   G  VW+ G +RT     TT  + +++ W
Subjt:  FQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDW

P31746 Cyclomaltodextrin glucanotransferase (e value: 4.1e-07; % identity: 27.69)
Query:  GGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTF-GEQFLVVGDDPIFGSWDVTSAI-PLNWADGHQ---WEAEVEIPVGKTIQFKFVL
        GG + LS +++   A+ ++      +      V V+F +    T  G    +VG+    G+WD   AI P+     +Q   W  ++ +P GK +++K++ 
Subjt:  GGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTF-GEQFLVVGDDPIFGSWDVTSAI-PLNWADGHQ---WEAEVEIPVGKTIQFKFVL

Query:  QGKTGNVVWQPGPDRTF-QPWETTNTIIVS
        + + GNVVWQ G +RT+  P   T+T++++
Subjt:  QGKTGNVVWQPGPDRTF-QPWETTNTIIVS

P31797 Cyclomaltodextrin glucanotransferase (e value: 1.8e-07; % identity: 30.6)
Query:  GIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECT-FGEQFLVVGDDPIFGSWDVTSAIPLNW----ADGHQWEAEVEIPVGKTIQFKFVLQ
        G + ++  SS+ Q     D   N +   +  V V+F +    T  G+   +VG+    G+WD + AI   +         W  +V +P GKTI+FKF+ +
Subjt:  GIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECT-FGEQFLVVGDDPIFGSWDVTSAIPLNW----ADGHQWEAEVEIPVGKTIQFKFVLQ

Query:  GKTGNVVWQPGPDRTF-QPWETTNTIIVSEDWDS
           GNV W+ G +  +  P  TT  IIV  DW +
Subjt:  GKTGNVVWQPGPDRTF-QPWETTNTIIVSEDWDS

Q6ZY51 Phosphoglucan, water dikinase, chloroplastic (e value: 3.1e-07; % identity: 26.28)
Query:  LRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQF
        L H     R     L+C +++  + T  +    +D + +K VR+  +L  +  FG+   + G     GSW   S  PLNW++ + W  E+E+  G+ +++
Subjt:  LRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQF

Query:  KFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDED
        KFV+    G++ W+ G +R  +   + N  +V   WD+    +   +E  V  D+D
Subjt:  KFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDED

Arabidopsis top hits (e value, % identity, alignment)
AT5G01260.1 Carbohydrate-binding-like fold (e value: 1.5e-36; % identity: 40.45)
Query:  LSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGS-WDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPG
        L S+   D+Q +         +KTVRV+FQL+KEC FGE F +VGDDP+FG  WD  +A+PLNW+DG+ W  ++++PVG+ ++FK +L+ +TG ++WQPG
Subjt:  LSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGS-WDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPG

Query:  PDRTFQPWETTNTIIVSEDWDSAESRMLSEEE--------KIVNQDEDAPI----VPEKLMIVESLAYPNKELIHNTN
        P+R  + WET  TI + EDWD+A+ +M+ EE+         I ++DED  +        ++ VE+  Y + E   N++
Subjt:  PDRTFQPWETTNTIIVSEDWDSAESRMLSEEE--------KIVNQDEDAPI----VPEKLMIVESLAYPNKELIHNTN

AT5G01260.2 Carbohydrate-binding-like fold (e value: 3.4e-33; % identity: 27.31)
Query:  LSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGS-WDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPG
        L S+   D+Q +         +KTVRV+FQL+KEC FGE F +VGDDP+FG  WD  +A+PLNW+DG+ W  ++++PVG+ ++FK +L+ +TG ++WQPG
Subjt:  LSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGS-WDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPG

Query:  PDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASE
        P+R  + WET  TI + EDWD+A+ +M+ EE+           VP          Y N   I + + +  +     S+ Q SSV              + 
Subjt:  PDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDEDAPIVPEKLMIVESLAYPNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASE

Query:  ENDSNISSSEVNATDISLSEETASSISPSKENAKDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDV
        EN   +S      +  S+  E     S     A+++I E +   +ES                                                     
Subjt:  ENDSNISSSEVNATDISLSEETASSISPSKENAKDIIAENISYPKESFILNTSNKAVSDVYNNSNGETTFTSHSDTKITEGILKNDEEDATVKILGNTDV

Query:  QESFINHGVPILVPGLPPTPTISNQAASQHEDEYDVSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHS
                 P+LVPGL P   + N+            +  INE      PE        AE+   AK+     FE+ ++E     E RQ +   + +   
Subjt:  QESFINHGVPILVPGLPPTPTISNQAASQHEDEYDVSINGINESNDHKLPE---DPYVVAEQEMGAKLSYEEDFEEDDEEDDLQSELRQEDDTNKIENHS

Query:  DLQEIN--DDILQNDITWGHKTLVKFFTNLKL
        + + +   D + +NDI WG +TL K  +N +L
Subjt:  DLQEIN--DDILQNDITWGHKTLVKFFTNLKL

AT5G26570.1 catalytics;carbohydrate kinases;phosphoglucan, water dikinases (e value: 2.2e-08; % identity: 26.28)
Query:  LRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQF
        L H     R     L+C +++  + T  +    +D + +K VR+  +L  +  FG+   + G     GSW   S  PLNW++ + W  E+E+  G+ +++
Subjt:  LRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQF

Query:  KFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDED
        KFV+    G++ W+ G +R  +   + N  +V   WD+    +   +E  V  D+D
Subjt:  KFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDED

AT5G26570.2 catalytics;carbohydrate kinases;phosphoglucan, water dikinases (e value: 2.2e-08; % identity: 26.28)
Query:  LRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQF
        L H     R     L+C +++  + T  +    +D + +K VR+  +L  +  FG+   + G     GSW   S  PLNW++ + W  E+E+  G+ +++
Subjt:  LRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVGDDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQF

Query:  KFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDED
        KFV+    G++ W+ G +R  +   + N  +V   WD+    +   +E  V  D+D
Subjt:  KFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDED


Sequences
CDS sequence
ATGGAAACCCTAGCGAGCTCCAACTCCATCGTCGGCAACAACACAGCTCCTGGTTACTTCTATGCTTCGCTGAAAGAGCGTCTTCTTTGTGGAGGACCTGAAATCATCTC
TTATCGGAGGCCTCGGAAACTACCTACTTCTGGACTTCGGCATTTGGTATCTTTGCGGCGGGGAGGCATCCACCTGCTTTCTTGTCTCTCTTCCAATCAACAGGCAGATA
CTCAGAATGATGCAGTTGTGAATCGAGACACGAATCAATCAAAGACTGTTCGTGTCAAATTCCAGCTGCAGAAAGAGTGCACGTTTGGGGAGCAATTCCTTGTAGTAGGT
GACGATCCAATTTTTGGTTCCTGGGACGTTACAAGTGCAATACCATTAAACTGGGCAGATGGGCATCAATGGGAAGCAGAAGTGGAGATTCCTGTTGGAAAAACAATCCA
GTTCAAATTCGTACTTCAAGGAAAAACTGGAAATGTTGTATGGCAACCTGGTCCTGATCGAACATTTCAACCCTGGGAAACAACTAATACAATCATCGTTTCTGAAGATT
GGGACAGTGCTGAATCACGGATGCTAAGTGAAGAAGAAAAAATTGTTAACCAGGATGAGGATGCTCCCATTGTCCCAGAAAAGTTAATGATTGTGGAGAGCTTGGCTTAT
CCAAACAAAGAACTGATCCACAATACCAATACGGAACCGTCAGTTGCTCTTGCTGATACTTCAATAGCACAAAAATCATCTGTGGAATCACACGGAGAATTGATTGATTT
CAGTAAAATCACAGCTTCGGAAGAAAATGATAGTAATATCTCATCATCAGAAGTCAATGCCACTGACATCTCTCTTTCAGAGGAGACCGCTAGTAGCATTTCTCCTTCAA
AAGAGAATGCCAAAGATATCATAGCAGAGAATATAAGCTACCCGAAGGAGAGCTTCATTCTCAATACAAGTAACAAGGCTGTCAGCGACGTATACAACAATTCAAATGGG
GAGACAACATTTACATCCCATAGTGATACAAAGATAACAGAGGGAATATTGAAGAATGATGAGGAAGATGCAACAGTTAAGATCCTTGGGAATACAGATGTTCAAGAAAG
CTTCATTAACCATGGAGTTCCCATTCTAGTTCCTGGTTTACCTCCAACACCAACAATATCAAATCAGGCAGCATCTCAACATGAAGACGAATATGATGTTTCCATCAATG
GAATTAATGAATCTAACGACCATAAACTACCTGAGGATCCTTATGTTGTGGCTGAACAAGAGATGGGGGCGAAGTTAAGTTATGAAGAAGACTTTGAAGAAGATGACGAA
GAAGATGACCTCCAAAGTGAACTTAGACAAGAGGATGACACAAATAAAATTGAGAATCATTCCGACTTGCAGGAAATCAACGATGATATCCTTCAAAATGACATAACATG
GGGACATAAAACCCTGGTGAAGTTCTTCACCAATTTAAAATTGCTTTAG
mRNA sequence
ATGGAAACCCTAGCGAGCTCCAACTCCATCGTCGGCAACAACACAGCTCCTGGTTACTTCTATGCTTCGCTGAAAGAGCGTCTTCTTTGTGGAGGACCTGAAATCATCTC
TTATCGGAGGCCTCGGAAACTACCTACTTCTGGACTTCGGCATTTGGTATCTTTGCGGCGGGGAGGCATCCACCTGCTTTCTTGTCTCTCTTCCAATCAACAGGCAGATA
CTCAGAATGATGCAGTTGTGAATCGAGACACGAATCAATCAAAGACTGTTCGTGTCAAATTCCAGCTGCAGAAAGAGTGCACGTTTGGGGAGCAATTCCTTGTAGTAGGT
GACGATCCAATTTTTGGTTCCTGGGACGTTACAAGTGCAATACCATTAAACTGGGCAGATGGGCATCAATGGGAAGCAGAAGTGGAGATTCCTGTTGGAAAAACAATCCA
GTTCAAATTCGTACTTCAAGGAAAAACTGGAAATGTTGTATGGCAACCTGGTCCTGATCGAACATTTCAACCCTGGGAAACAACTAATACAATCATCGTTTCTGAAGATT
GGGACAGTGCTGAATCACGGATGCTAAGTGAAGAAGAAAAAATTGTTAACCAGGATGAGGATGCTCCCATTGTCCCAGAAAAGTTAATGATTGTGGAGAGCTTGGCTTAT
CCAAACAAAGAACTGATCCACAATACCAATACGGAACCGTCAGTTGCTCTTGCTGATACTTCAATAGCACAAAAATCATCTGTGGAATCACACGGAGAATTGATTGATTT
CAGTAAAATCACAGCTTCGGAAGAAAATGATAGTAATATCTCATCATCAGAAGTCAATGCCACTGACATCTCTCTTTCAGAGGAGACCGCTAGTAGCATTTCTCCTTCAA
AAGAGAATGCCAAAGATATCATAGCAGAGAATATAAGCTACCCGAAGGAGAGCTTCATTCTCAATACAAGTAACAAGGCTGTCAGCGACGTATACAACAATTCAAATGGG
GAGACAACATTTACATCCCATAGTGATACAAAGATAACAGAGGGAATATTGAAGAATGATGAGGAAGATGCAACAGTTAAGATCCTTGGGAATACAGATGTTCAAGAAAG
CTTCATTAACCATGGAGTTCCCATTCTAGTTCCTGGTTTACCTCCAACACCAACAATATCAAATCAGGCAGCATCTCAACATGAAGACGAATATGATGTTTCCATCAATG
GAATTAATGAATCTAACGACCATAAACTACCTGAGGATCCTTATGTTGTGGCTGAACAAGAGATGGGGGCGAAGTTAAGTTATGAAGAAGACTTTGAAGAAGATGACGAA
GAAGATGACCTCCAAAGTGAACTTAGACAAGAGGATGACACAAATAAAATTGAGAATCATTCCGACTTGCAGGAAATCAACGATGATATCCTTCAAAATGACATAACATG
GGGACATAAAACCCTGGTGAAGTTCTTCACCAATTTAAAATTGCTTTAG
Protein sequence
METLASSNSIVGNNTAPGYFYASLKERLLCGGPEIISYRRPRKLPTSGLRHLVSLRRGGIHLLSCLSSNQQADTQNDAVVNRDTNQSKTVRVKFQLQKECTFGEQFLVVG
DDPIFGSWDVTSAIPLNWADGHQWEAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRMLSEEEKIVNQDEDAPIVPEKLMIVESLAY
PNKELIHNTNTEPSVALADTSIAQKSSVESHGELIDFSKITASEENDSNISSSEVNATDISLSEETASSISPSKENAKDIIAENISYPKESFILNTSNKAVSDVYNNSNG
ETTFTSHSDTKITEGILKNDEEDATVKILGNTDVQESFINHGVPILVPGLPPTPTISNQAASQHEDEYDVSINGINESNDHKLPEDPYVVAEQEMGAKLSYEEDFEEDDE
EDDLQSELRQEDDTNKIENHSDLQEINDDILQNDITWGHKTLVKFFTNLKLL
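
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is one way to do that, assuming the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly. The translate function and the constant names are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# T, C, A, G at each position. Using this table is an assumption here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a multi-line
    sequence can be pasted in directly. Codons that contain characters
    other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed in this record.
print(translate("ATGGAAACCCTAGCGAGCTCC"))  # METLASS
print(translate("AAATTGCTTTAG"))           # KLL
```

Passing the full CDS to translate and comparing the result with the listed protein sequence would confirm that the two entries agree.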