CuGenDBv2

CmoCh14G000650 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G000650
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: CBM20 domain-containing protein
Genome location: Cmo_Chr14:327123..330696
RNA-Seq Expression: CmoCh14G000650
Synteny: CmoCh14G000650
Gene Ontology terms: GO:2001070 - starch binding (molecular function)
InterPro domains: IPR002044 - Carbohydrate binding module family 20
IPR013783 - Immunoglobulin-like fold
IPR013784 - Carbohydrate-binding-like fold
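
The genome location field above can be split into a chromosome name and coordinates. The following is a minimal sketch, not part of the database record; the function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which is an assumption about this database's convention.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split a 'chrom:start..end' string into its parts.

    Returns (chromosome, start, end, span_length). The span length
    assumes 1-based inclusive coordinates (an assumption, not stated
    in the record).
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("Cmo_Chr14:327123..330696")
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption, the gene spans 3574 bases on Cmo_Chr14.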


Homology
GenBank top hits | e value | % identity | Alignment
KAG6580479.1 putative LRR receptor-like serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.1e-250 | % identity: 100
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

KAG7017230.1 hypothetical protein SDJN02_19093 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.6e-249 | % identity: 99.56
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQ TNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVY NPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

XP_022934469.1 uncharacterized protein LOC111441639 [Cucurbita moschata] | e value: 1.1e-250 | % identity: 100
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

XP_022983429.1 uncharacterized protein LOC111482035 [Cucurbita maxima] | e value: 7.1e-237 | % identity: 94.91
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSAS LKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL CFSSHQQADTQNEVVENQ TNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR FQPWETSNTIIVSEDWDSA+SR+L EEENI+
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
        NQD+HSPVV EKLMIEDS FALADASIVEKSSVESHEV+ILG NISASEENGSNVSASEENTKDIM SNIIS KESYILNTSNK VSEVY NPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKR EEVLENYEKE TAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEV+DDGSIDGINESNDHKLPENIQDPDVVVELEME KSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVN SIV+NDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

XP_023527439.1 uncharacterized protein LOC111790671 [Cucurbita pepo subsp. pepo] | e value: 7.8e-236 | % identity: 95.18
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAP SFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL CFS HQQADTQNEVVENQ TNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR FQPWETSNTIIVSEDWDSA+SR+LSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVE HEVLILGGNISASEENGSNVSASEENTKDIMASNIIS KESYILNTSNK VSEVY NPNGETT+I
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYG----VPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAK
        S SETKRTEEVLENYEKE TAKIP    VQESFINYG    VPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAK
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYG----VPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAK

Query:  SSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        SSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  SSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LA83 CBM20 domain-containing protein | e value: 6.4e-143 | % identity: 63.41
Query:  MKTLATSNSIIGNNAAPSSF---SASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL-SCFSSHQQADT-QNEVVENQGTNQSKTVRVKF
        MKTL T NSII  N +PSS+   S+SSLKERLL GGPEF+SYRR  KL +SGLQHLV LRRGGI+F+ SCF+S+QQADT QN+ VENQ T+QSKTVRVKF
Subjt:  MKTLATSNSIIGNNAAPSSF---SASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL-SCFSSHQQADT-QNEVVENQGTNQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSE
        QL KECTFGEHF+VVGDDP FGSWDVTSAIPLNWADGH WAAEV+IPVGK IQFKF+LQG TGNVVWQPGPDR FQPWETSNTIIVSEDWDSA+SR+LSE
Subjt:  QLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSE

Query:  EENIVNQDDHSPVVPEKLMIEDSSF--------ALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDI---------MASNIISTKESYI
        EE IVNQ++ SP+ PE LM ED+           +   SI  K SVE    LI G NISA EENG N+SASEEN  ++         ++ +  + K+   
Subjt:  EENIVNQDDHSPVVPEKLMIEDSSF--------ALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDI---------MASNIISTKESYI

Query:  LNTSNKDVSEVYGNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKL
         N SNK VSEVY             +TK TEE LEN  K++         VQES ++  VP+LVPGLPPT T SNQ+AP HEV+DDGS+ GINESNDHKL
Subjt:  LNTSNKDVSEVYGNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKL

Query:  PE--NIQ-----DPDVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        PE  NIQ     DP+VV   EMEAKSSY           EDDTN I N+SDLQE+N+ +VQND+TWGHKTLKKF SSLRLL
Subjt:  PE--NIQ-----DPDVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

A0A1S3B6C3 uncharacterized protein LOC103486305 isoform X3 | e value: 4.6e-133 | % identity: 61.03
Query:  MKTLATSNSIIGNNAAPSSF----SASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQ-AD-TQNEVVENQGTNQSKTVRVK
        MKTL TSNSII N +  S F    S+SS+KERLL  GPEF+SYRR  KL +SGLQH V LRRGGI+F+SCFSS+QQ AD  Q++ +ENQ T+QSKTVRVK
Subjt:  MKTLATSNSIIGNNAAPSSF----SASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQ-AD-TQNEVVENQGTNQSKTVRVK

Query:  FQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLS
        FQLQKECTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH WAAEV+IPVGK IQFKF+LQG TGNV WQPGPDR FQPWETSNTIIVSEDWDSA+SR+LS
Subjt:  FQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLS

Query:  EEENIVNQDDHSPVVPEKLMIE-DSSFALADA-------SIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDV
        EEE IVNQ+++SP+ PE LM+E + ++   +        SI  K SVES    I G NI A EENG N+SASEEN  ++                     
Subjt:  EEENIVNQDDHSPVVPEKLMIE-DSSFALADA-------SIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDV

Query:  SEVYGNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQ-DP
              P G  + IS S  + T+E+LEN  +++         VQES ++  VP+LVPGLPP            +V+ DGS+ GINESNDHKLPENIQ DP
Subjt:  SEVYGNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQ-DP

Query:  DVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        +VV   EME KSSYE      EIRQEDDTN   N+SDLQE+N+ IVQNDITWGHKTLKKF SSLRLL
Subjt:  DVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

A0A6J1CVP4 uncharacterized protein LOC111015001 | e value: 3.4e-160 | % identity: 64.24
Query:  MKTLATSNSIIGNNAAPSSFSAS--SLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQ
        M+TLATSNSII NN AP  FSAS  SL+ERLLCGGPEF+SYR   K  SSGLQHL SLRRGGI+F +  SSH Q DTQN+ VENQ TNQ KTVRVKFQLQ
Subjt:  MKTLATSNSIIGNNAAPSSFSAS--SLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQ

Query:  KECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEEN
        KECTFGE F VVGDDP  GSW+VTSAIPLNWADGH WAAEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTI+VSEDWDS +S  L+EEE 
Subjt:  KECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEEN

Query:  IVNQDDHSPVVPEKLMIED------------SSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEEN----------------TKDIMASNI
        +VNQ++ SP+V E LMI D             S AL D SI EKSSVESHE LI    ISAS+ENGS++SA +E+                 K+I+A NI
Subjt:  IVNQDDHSPVVPEKLMIED------------SSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEEN----------------TKDIMASNI

Query:  ISTKESYILNTSNKDVSEVYGNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGI
           KES+ILN+SNK VSEVY N NGE+T   QS+TK TE + E++EK  T KI  NADVQES IN  VP+LVPGLPPTPTTSN+ APQHEV+ D SI+GI
Subjt:  ISTKESYILNTSNKDVSEVYGNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGI

Query:  NESNDHKLPENI-----QDPDVVVELEMEAKSSY--------EENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        NESN H+LPEN+     Q P +V E E+EAK SY        +E+ +QSEIRQ+DD NKI N SDLQE+N+ I++ND+TWGHKTL K  ++L+ L
Subjt:  NESNDHKLPENI-----QDPDVVVELEMEAKSSY--------EENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

A0A6J1F2P2 uncharacterized protein LOC111441639 | e value: 5.4e-251 | % identity: 100
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

A0A6J1J7C1 uncharacterized protein LOC111482035 | e value: 3.4e-237 | % identity: 94.91
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSAS LKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL CFSSHQQADTQNEVVENQ TNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR FQPWETSNTIIVSEDWDSA+SR+L EEENI+
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII
        NQD+HSPVV EKLMIEDS FALADASIVEKSSVESHEV+ILG NISASEENGSNVSASEENTKDIM SNIIS KESYILNTSNK VSEVY NPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKR EEVLENYEKE TAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEV+DDGSIDGINESNDHKLPENIQDPDVVVELEME KSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVN SIV+NDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

SwissProt top hits | e value | % identity | Alignment
P0DN29 Glucoamylase ARB_02327-1 | e value: 2.8e-10 | % identity: 41.25
Query:  VKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLN---WADG-HLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR
        V+F+L      GE  F+VG  P  GSWDV  A+PLN   +AD  H W  ++E+P   A ++KF+ +   G VVW+  P+R
Subjt:  VKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLN---WADG-HLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR

P30270 Alpha-amylase | e value: 6.4e-07 | % identity: 26.47
Query:  GTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSE
        GT Q+      F +     +GE+ +V GD  + G+WD   A+ L+ A   +W  +V +  G   Q+K++ +   G  VW+ G +R      T+  + +++
Subjt:  GTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSE

Query:  DW
         W
Subjt:  DW

P31797 Cyclomaltodextrin glucanotransferase | e value: 5.8e-08 | % identity: 32.04
Query:  VRVKFQLQKECT-FGEHFFVVGDDPSFGSWDVTSAIPLNW----ADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIF-QPWETSNTIIVSED
        V V+F +    T  G++ ++VG+    G+WD + AI   +         W  +V +P GK I+FKF+ +   GNV W+ G + ++  P  T+  IIV  D
Subjt:  VRVKFQLQKECT-FGEHFFVVGDDPSFGSWDVTSAIPLNW----ADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIF-QPWETSNTIIVSED

Query:  WDS
        W +
Subjt:  WDS

P36914 Glucoamylase | e value: 3.8e-07 | % identity: 32.53
Query:  TVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLN----WADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR
        TV V F ++    +GE   +VG     GSW+ +SA  LN      D  LW   + +P G++ ++KF+ + + G V W+  P+R
Subjt:  TVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLN----WADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR

Q6ZY51 Phosphoglucan, water dikinase, chloroplastic | e value: 6.2e-10 | % identity: 27.52
Query:  LRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQG
        LR          +S    + Q +  +  GT     VR+  +L  +  FG+H  + G     GSW   S  PLNW++   W  E+E+  G+ +++KFV+  
Subjt:  LRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQG

Query:  ETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD
          G++ W+ G +R+ +   + N  +V   WD A    L   + + N DD
Subjt:  ETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD

Arabidopsis top hits | e value | % identity | Alignment
AT2G04270.1 RNAse E/G-like | e value: 1.5e-06 | % identity: 27.1
Query:  LQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADG-HLWAAEVEIPVGKAIQFKFVLQ---GETGNVVWQPGPDRIFQPWETSN---TIIVSEDWDSAD
        ++ +    EH +V GD  + GSW+   AI +   +  + W A+V+I  G   ++ ++L+   G + +V+W+PGP        + N    II+ + W S  
Subjt:  LQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADG-HLWAAEVEIPVGKAIQFKFVLQ---GETGNVVWQPGPDRIFQPWETSN---TIIVSEDWDSAD

Query:  SRMLSEE
            S+E
Subjt:  SRMLSEE

AT5G01260.1 Carbohydrate-binding-like fold | e value: 8.0e-37 | % identity: 33.6
Query:  DTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGS-WDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQP
        D+Q  V + +    +KTVRV+FQL+KEC FGEHFF+VGDDP FG  WD  +A+PLNW+DG++W  ++++PVG+ ++FK +L+ +TG ++WQPGP+R  + 
Subjt:  DTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGS-WDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQP

Query:  WETSNTIIVSEDWDSADSRMLSEEENI-------VNQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDI
        WET+ TI + EDWD+AD +M+ EE+ +       +  +D   V+   +    S  A+ +A  V   S ++        + S   E     S      +++
Subjt:  WETSNTIIVSEDWDSADSRMLSEEENI-------VNQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDI

Query:  MASNIISTKESYILNTSNKDVSEVYGNPNGETTIISQSETKRTEEVL
        +   + + +ES +L      +S++    N +  +I++ + +   EV+
Subjt:  MASNIISTKESYILNTSNKDVSEVYGNPNGETTIISQSETKRTEEVL

AT5G01260.2 Carbohydrate-binding-like fold | e value: 2.6e-40 | % identity: 32.9
Query:  DTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGS-WDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQP
        D+Q  V + +    +KTVRV+FQL+KEC FGEHFF+VGDDP FG  WD  +A+PLNW+DG++W  ++++PVG+ ++FK +L+ +TG ++WQPGP+R  + 
Subjt:  DTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGS-WDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQP

Query:  WETSNTIIVSEDWDSADSRMLSEEENIVNQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIIS
        WET+ TI + EDWD+AD +M+ EE+           VP       SS    D   V  S  ++  V+ +      S+E+  N S S ++ K +  SN   
Subjt:  WETSNTIIVSEDWDSADSRMLSEEENIVNQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIIS

Query:  TKESYILNTSNKDVSEVYGNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINE
        T    I                         E   TEE                    ES      PVLVPGL P    S+ D  Q EV ++G  +   E
Subjt:  TKESYILNTSNKDVSEVYGNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINE

Query:  SNDHKLPENIQDPDVVVELEMEAKSSYEENVVQSEIRQ----EDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRL
         +  + P+  ++    V+     + S +E V   E RQ    E++  ++  E++     D + +NDI WG +TL K  S+ RL
Subjt:  SNDHKLPENIQDPDVVVELEMEAKSSYEENVVQSEIRQ----EDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRL

AT5G26570.1 catalytics;carbohydrate kinases;phosphoglucan, water dikinases | e value: 4.4e-11 | % identity: 27.52
Query:  LRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQG
        LR          +S    + Q +  +  GT     VR+  +L  +  FG+H  + G     GSW   S  PLNW++   W  E+E+  G+ +++KFV+  
Subjt:  LRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQG

Query:  ETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD
          G++ W+ G +R+ +   + N  +V   WD A    L   + + N DD
Subjt:  ETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD

AT5G26570.2 catalytics;carbohydrate kinases;phosphoglucan, water dikinases | e value: 4.4e-11 | % identity: 27.52
Query:  LRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQG
        LR          +S    + Q +  +  GT     VR+  +L  +  FG+H  + G     GSW   S  PLNW++   W  E+E+  G+ +++KFV+  
Subjt:  LRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQG

Query:  ETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD
          G++ W+ G +R+ +   + N  +V   WD A    L   + + N DD
Subjt:  ETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD
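
The % identity values in the hit tables above appear consistent with the usual BLAST-style definition: identical positions divided by alignment length. The following is a minimal sketch under that assumption; the function name `percent_identity` is hypothetical. It counts the letters on the unlabeled middle row of each alignment block, where a space or `+` marks a non-identical position.

```python
def percent_identity(query_rows: list[str], midline_rows: list[str]) -> float:
    """Estimate percent identity from BLAST-style alignment rows.

    query_rows:   the aligned query segments (gaps included), used only
                  to measure the alignment length.
    midline_rows: the middle rows; a letter marks an identical position,
                  while ' ' or '+' marks a mismatch, gap, or conservative
                  substitution.
    """
    aligned_length = sum(len(row) for row in query_rows)
    if aligned_length == 0:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return round(100.0 * identical / aligned_length, 2)


# Toy alignment: 6 aligned positions, 5 of them identical.
print(percent_identity(["MKTLAT"], ["MKT AT"]))
```

As a rough check against the record, two non-identical positions over a 452-residue alignment give 450/452, or about 99.56%, which agrees with the value listed for KAG7017230.1.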


Sequences
CDS sequence
ATGAAAACCCTAGCGACCTCCAACTCCATCATCGGTAACAATGCAGCTCCTTCTTCCTTCTCTGCTTCTTCGCTGAAAGAGCGTCTTCTTTGTGGAGGACCTGAA
TTCGTCTCTTATCGGAGGCATCGGAAATTGACTAGTTCTGGACTTCAGCATTTGGTATCGTTGCGCCGGGGAGGCATTGAATTTCTTTCTTGCTTCTCGTCTCAT
CAGCAGGCAGATACTCAGAATGAGGTAGTTGAGAATCAAGGCACGAATCAATCAAAGACAGTTCGCGTCAAATTCCAGCTGCAGAAAGAGTGTACATTTGGGGAG
CATTTCTTTGTAGTAGGTGATGATCCAAGTTTTGGTTCCTGGGACGTTACAAGTGCAATACCTTTAAACTGGGCAGATGGGCATCTATGGGCAGCAGAAGTGGAG
ATTCCTGTTGGAAAAGCAATCCAATTCAAATTCGTACTTCAAGGAGAAACTGGGAATGTCGTATGGCAACCTGGTCCTGATCGAATATTTCAACCCTGGGAAACA
TCTAATACAATCATCGTTTCTGAAGATTGGGATAGTGCTGACTCACGGATGCTCAGTGAAGAAGAAAACATTGTTAACCAGGATGATCATTCTCCCGTTGTCCCA
GAAAAGTTAATGATTGAGGATTCATCATTTGCTCTTGCCGATGCTTCAATAGTAGAAAAATCATCGGTGGAATCGCATGAAGTGTTGATTCTTGGCGGTAACATC
TCAGCTTCAGAAGAAAATGGCAGTAATGTCTCTGCTTCAGAAGAGAATACCAAAGATATTATGGCATCGAATATAATCTCAACAAAGGAGAGCTACATTCTCAAT
ACAAGTAACAAGGATGTGAGCGAGGTATACGGCAATCCAAATGGGGAGACAACAATTATATCCCAGAGTGAAACAAAGAGAACAGAGGAAGTTTTGGAAAATTAT
GAGAAAGAAGAAACAGCGAAGATCCCTAGGAATGCGGATGTTCAAGAAAGCTTTATCAACTATGGCGTTCCTGTTCTAGTTCCTGGTTTACCTCCAACACCAACA
ACCTCAAATCAGGATGCACCTCAACATGAAGTCAAAGATGATGGTTCCATCGATGGAATTAATGAATCTAACGATCATAAACTACCTGAGAACATACAGGATCCT
GATGTCGTGGTAGAACTAGAGATGGAAGCGAAGTCAAGTTATGAAGAAAATGTCGTCCAAAGTGAAATTAGACAAGAGGATGACACAAATAAAATTGCGAATGAA
TCTGATTTGCAGGAAGTCAACGATAGTATCGTTCAGAATGACATAACATGGGGTCATAAAACCCTGAAGAAGTTCTTCTCCAGTTTAAGGTTGCTTTAG
mRNA sequence
ATGCACGTGAGAAGTCGTTTTCCCTTATCCTCACGAGGCCCCACCGTTATATAATGGCAACCACCAGAAACTCCATTAAGGCGCGAGACCTAAGCTCTGAGAGTG
TCATTGTCAGTGGCAGACTAAGATTGTGGAGAGGATGAAAACCCTAGCGACCTCCAACTCCATCATCGGTAACAATGCAGCTCCTTCTTCCTTCTCTGCTTCTTC
GCTGAAAGAGCGTCTTCTTTGTGGAGGACCTGAATTCGTCTCTTATCGGAGGCATCGGAAATTGACTAGTTCTGGACTTCAGCATTTGGTATCGTTGCGCCGGGG
AGGCATTGAATTTCTTTCTTGCTTCTCGTCTCATCAGCAGGCAGATACTCAGAATGAGGTAGTTGAGAATCAAGGCACGAATCAATCAAAGACAGTTCGCGTCAA
ATTCCAGCTGCAGAAAGAGTGTACATTTGGGGAGCATTTCTTTGTAGTAGGTGATGATCCAAGTTTTGGTTCCTGGGACGTTACAAGTGCAATACCTTTAAACTG
GGCAGATGGGCATCTATGGGCAGCAGAAGTGGAGATTCCTGTTGGAAAAGCAATCCAATTCAAATTCGTACTTCAAGGAGAAACTGGGAATGTCGTATGGCAACC
TGGTCCTGATCGAATATTTCAACCCTGGGAAACATCTAATACAATCATCGTTTCTGAAGATTGGGATAGTGCTGACTCACGGATGCTCAGTGAAGAAGAAAACAT
TGTTAACCAGGATGATCATTCTCCCGTTGTCCCAGAAAAGTTAATGATTGAGGATTCATCATTTGCTCTTGCCGATGCTTCAATAGTAGAAAAATCATCGGTGGA
ATCGCATGAAGTGTTGATTCTTGGCGGTAACATCTCAGCTTCAGAAGAAAATGGCAGTAATGTCTCTGCTTCAGAAGAGAATACCAAAGATATTATGGCATCGAA
TATAATCTCAACAAAGGAGAGCTACATTCTCAATACAAGTAACAAGGATGTGAGCGAGGTATACGGCAATCCAAATGGGGAGACAACAATTATATCCCAGAGTGA
AACAAAGAGAACAGAGGAAGTTTTGGAAAATTATGAGAAAGAAGAAACAGCGAAGATCCCTAGGAATGCGGATGTTCAAGAAAGCTTTATCAACTATGGCGTTCC
TGTTCTAGTTCCTGGTTTACCTCCAACACCAACAACCTCAAATCAGGATGCACCTCAACATGAAGTCAAAGATGATGGTTCCATCGATGGAATTAATGAATCTAA
CGATCATAAACTACCTGAGAACATACAGGATCCTGATGTCGTGGTAGAACTAGAGATGGAAGCGAAGTCAAGTTATGAAGAAAATGTCGTCCAAAGTGAAATTAG
ACAAGAGGATGACACAAATAAAATTGCGAATGAATCTGATTTGCAGGAAGTCAACGATAGTATCGTTCAGAATGACATAACATGGGGTCATAAAACCCTGAAGAA
GTTCTTCTCCAGTTTAAGGTTGCTTTAGCATCACAAATTCATTCTTATTGCTTTACTATGTTGTTCCCCAAGAAATTTTCAACTTACTGTGACCCAATTGGGTGG
ATGGAAGCTGCTGTTTGGGAATTGTACATATGGTCTGCCACTTCACTATATTCTAATACTACTTGGTGTAAATACACAGACTACGAAATTGGTTCATTAGCAAAT
CTGGACTGCAATGGTTTTGCAAGAATCTGGTAATTATAGTCTTTAGTCATTTGGATTGTATGTATATGTTTAATTTTAAATTTCAAGAACTCTGGTCATATATTG
TTTTGCATTATAATCAGAACGTATGAGTTAAAAGTGAGTTCATAATTAAAGAAACTCTTAGATCATGATGTTACTGTGAATGGGCAACAACTGCGTAGGGC
Protein sequence
MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQGTNQSKTVRVKFQLQKECTFGE
HFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDDHSPVVP
EKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYGNPNGETTIISQSETKRTEEVLENY
EKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANE
SDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
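
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code; the function name `translate` and the table-building approach are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGAAAACCCTAGCGACCTCC"))
```

The first seven codons translate to MKTLATS, matching the start of the protein sequence. Passing the full CDS, with line breaks included, should reproduce the whole protein if the record is internally consistent.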