CuGenDBv2

Carg12483 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg12483
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: CBM20 domain-containing protein
Genome location: Carg_Chr14:311742..315739
RNA-Seq Expression: Carg12483
Synteny: Carg12483
Gene Ontology terms: GO:2001070 - starch binding (molecular function)
InterPro domains: IPR002044 - Carbohydrate binding module family 20
IPR013783 - Immunoglobulin-like fold
IPR013784 - Carbohydrate-binding-like fold


Homology
GenBank top hits | e value | % identity | Alignment
KAG6580479.1 putative LRR receptor-like serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.0e-248 | % identity: 99.56
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQ TNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVY NPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

KAG7017230.1 hypothetical protein SDJN02_19093 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 5.6e-250 | % identity: 100
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

XP_022934469.1 uncharacterized protein LOC111441639 [Cucurbita moschata] | e value: 1.0e-248 | % identity: 99.56
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQ TNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVY NPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

XP_022983429.1 uncharacterized protein LOC111482035 [Cucurbita maxima] | e value: 1.1e-237 | % identity: 95.35
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSAS LKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL CFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR FQPWETSNTIIVSEDWDSA+SR+L EEENI+
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII
        NQD+HSPVV EKLMIEDS FALADASIVEKSSVESHEV+ILG NISASEENGSNVSASEENTKDIM SNIIS KESYILNTSNK VSEVYSNPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKR EEVLENYEKE TAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEV+DDGSIDGINESNDHKLPENIQDPDVVVELEME KSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVN SIV+NDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

XP_023527439.1 uncharacterized protein LOC111790671 [Cucurbita pepo subsp. pepo] | e value: 2.7e-236 | % identity: 95.39
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAP SFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL CFS HQQADTQNEVVENQDTNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR FQPWETSNTIIVSEDWDSA+SR+LSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVE HEVLILGGNISASEENGSNVSASEENTKDIMASNIIS KESYILNTSNK VSEVY+NPNGETT+I
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYG----VPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAK
        S SETKRTEEVLENYEKE TAKIP    VQESFINYG    VPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAK
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYG----VPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAK

Query:  SSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        SSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  SSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LA83 CBM20 domain-containing protein | e value: 4.9e-143 | % identity: 63.41
Query:  MKTLATSNSIIGNNAAPSSF---SASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL-SCFSSHQQADT-QNEVVENQDTNQSKTVRVKF
        MKTL T NSII  N +PSS+   S+SSLKERLL GGPEF+SYRR  KL +SGLQHLV LRRGGI+F+ SCF+S+QQADT QN+ VENQ+T+QSKTVRVKF
Subjt:  MKTLATSNSIIGNNAAPSSF---SASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL-SCFSSHQQADT-QNEVVENQDTNQSKTVRVKF

Query:  QLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSE
        QL KECTFGEHF+VVGDDP FGSWDVTSAIPLNWADGH WAAEV+IPVGK IQFKF+LQG TGNVVWQPGPDR FQPWETSNTIIVSEDWDSA+SR+LSE
Subjt:  QLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSE

Query:  EENIVNQDDHSPVVPEKLMIEDSSF--------ALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDI---------MASNIISTKESYI
        EE IVNQ++ SP+ PE LM ED+           +   SI  K SVE    LI G NISA EENG N+SASEEN  ++         ++ +  + K+   
Subjt:  EENIVNQDDHSPVVPEKLMIEDSSF--------ALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDI---------MASNIISTKESYI

Query:  LNTSNKDVSEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKL
         N SNK VSEVY +           +TK TEE LEN  K++         VQES ++  VP+LVPGLPPT T SNQ+AP HEV+DDGS+ GINESNDHKL
Subjt:  LNTSNKDVSEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKL

Query:  PE--NIQ-----DPDVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        PE  NIQ     DP+VV   EMEAKSSY           EDDTN I N+SDLQE+N+ +VQND+TWGHKTLKKF SSLRLL
Subjt:  PE--NIQ-----DPDVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

A0A1S3B6C3 uncharacterized protein LOC103486305 isoform X3 | e value: 2.1e-133 | % identity: 61.24
Query:  MKTLATSNSIIGNNAAPSSF----SASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQ-AD-TQNEVVENQDTNQSKTVRVK
        MKTL TSNSII N +  S F    S+SS+KERLL  GPEF+SYRR  KL +SGLQH V LRRGGI+F+SCFSS+QQ AD  Q++ +ENQ+T+QSKTVRVK
Subjt:  MKTLATSNSIIGNNAAPSSF----SASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQ-AD-TQNEVVENQDTNQSKTVRVK

Query:  FQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLS
        FQLQKECTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH WAAEV+IPVGK IQFKF+LQG TGNV WQPGPDR FQPWETSNTIIVSEDWDSA+SR+LS
Subjt:  FQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLS

Query:  EEENIVNQDDHSPVVPEKLMIE-DSSFALADA-------SIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDV
        EEE IVNQ+++SP+ PE LM+E + ++   +        SI  K SVES    I G NI A EENG N+SASEEN  ++                     
Subjt:  EEENIVNQDDHSPVVPEKLMIE-DSSFALADA-------SIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDV

Query:  SEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQ-DP
            S P G  + IS S  + T+E+LEN  +++         VQES ++  VP+LVPGLPP            +V+ DGS+ GINESNDHKLPENIQ DP
Subjt:  SEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQ-DP

Query:  DVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        +VV   EME KSSYE      EIRQEDDTN   N+SDLQE+N+ IVQNDITWGHKTLKKF SSLRLL
Subjt:  DVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

A0A6J1CVP4 uncharacterized protein LOC111015001 | e value: 5.2e-161 | % identity: 64.65
Query:  MKTLATSNSIIGNNAAPSSFSAS--SLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQ
        M+TLATSNSII NN AP  FSAS  SL+ERLLCGGPEF+SYR   K  SSGLQHL SLRRGGI+F +  SSH Q DTQN+ VENQDTNQ KTVRVKFQLQ
Subjt:  MKTLATSNSIIGNNAAPSSFSAS--SLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQ

Query:  KECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEEN
        KECTFGE F VVGDDP  GSW+VTSAIPLNWADGH WAAEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTI+VSEDWDS +S  L+EEE 
Subjt:  KECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEEN

Query:  IVNQDDHSPVVPEKLMIED------------SSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEEN----------------TKDIMASNI
        +VNQ++ SP+V E LMI D             S AL D SI EKSSVESHE LI    ISAS+ENGS++SA +E+                 K+I+A NI
Subjt:  IVNQDDHSPVVPEKLMIED------------SSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEEN----------------TKDIMASNI

Query:  ISTKESYILNTSNKDVSEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGI
           KES+ILN+SNK VSEVYSN NGE+T   QS+TK TE + E++EK  T KI  NADVQES IN  VP+LVPGLPPTPTTSN+ APQHEV+ D SI+GI
Subjt:  ISTKESYILNTSNKDVSEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGI

Query:  NESNDHKLPENI-----QDPDVVVELEMEAKSSY--------EENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        NESN H+LPEN+     Q P +V E E+EAK SY        +E+ +QSEIRQ+DD NKI N SDLQE+N+ I++ND+TWGHKTL K  ++L+ L
Subjt:  NESNDHKLPENI-----QDPDVVVELEMEAKSSY--------EENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

A0A6J1F2P2 uncharacterized protein LOC111441639 | e value: 5.1e-249 | % identity: 99.56
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQ TNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII
        NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVY NPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

A0A6J1J7C1 uncharacterized protein LOC111482035 | e value: 5.3e-238 | % identity: 95.35
Query:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
        MKTLATSNSIIGNNAAPSSFSAS LKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFL CFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE
Subjt:  MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKE

Query:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV
        CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR FQPWETSNTIIVSEDWDSA+SR+L EEENI+
Subjt:  CTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIV

Query:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII
        NQD+HSPVV EKLMIEDS FALADASIVEKSSVESHEV+ILG NISASEENGSNVSASEENTKDIM SNIIS KESYILNTSNK VSEVYSNPNGETTII
Subjt:  NQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTII

Query:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE
        SQSETKR EEVLENYEKE TAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEV+DDGSIDGINESNDHKLPENIQDPDVVVELEME KSSYE
Subjt:  SQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYE

Query:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL
        ENVVQSEIRQEDDTNKIANESDLQEVN SIV+NDITWGHKTLKKFFSSLRLL
Subjt:  ENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRLL

SwissProt top hits | e value | % identity | Alignment
P0DN29 Glucoamylase ARB_02327-1 | e value: 2.8e-10 | % identity: 41.25
Query:  VKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLN---WADG-HLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR
        V+F+L      GE  F+VG  P  GSWDV  A+PLN   +AD  H W  ++E+P   A ++KF+ +   G VVW+  P+R
Subjt:  VKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLN---WADG-HLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR

P30270 Alpha-amylase | e value: 1.1e-06 | % identity: 26.37
Query:  FQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDW
        F +     +GE+ +V GD  + G+WD   A+ L+ A   +W  +V +  G   Q+K++ +   G  VW+ G +R      T+  + +++ W
Subjt:  FQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDW

P31797 Cyclomaltodextrin glucanotransferase | e value: 9.9e-08 | % identity: 30.09
Query:  ENQDTNQSKTVRVKFQLQKECT-FGEHFFVVGDDPSFGSWDVTSAIPLNW----ADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIF-QPWE
        +N +   +  V V+F +    T  G++ ++VG+    G+WD + AI   +         W  +V +P GK I+FKF+ +   GNV W+ G + ++  P  
Subjt:  ENQDTNQSKTVRVKFQLQKECT-FGEHFFVVGDDPSFGSWDVTSAIPLNW----ADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIF-QPWE

Query:  TSNTIIVSEDWDS
        T+  IIV  DW +
Subjt:  TSNTIIVSEDWDS

P36914 Glucoamylase | e value: 3.8e-07 | % identity: 32.53
Query:  TVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLN----WADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR
        TV V F ++    +GE   +VG     GSW+ +SA  LN      D  LW   + +P G++ ++KF+ + + G V W+  P+R
Subjt:  TVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLN----WADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDR

Q6ZY51 Phosphoglucan, water dikinase, chloroplastic | e value: 1.1e-09 | % identity: 28.39
Query:  LQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQF
        L H     R     L+C ++   + T  E  + +D + +K VR+  +L  +  FG+H  + G     GSW   S  PLNW++   W  E+E+  G+ +++
Subjt:  LQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQF

Query:  KFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD
        KFV+    G++ W+ G +R+ +   + N  +V   WD A    L   + + N DD
Subjt:  KFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD

Arabidopsis top hits | e value | % identity | Alignment
AT5G01260.1 Carbohydrate-binding-like fold | e value: 1.4e-36 | % identity: 33.6
Query:  DTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGS-WDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQP
        D+Q  V + +    +KTVRV+FQL+KEC FGEHFF+VGDDP FG  WD  +A+PLNW+DG++W  ++++PVG+ ++FK +L+ +TG ++WQPGP+R  + 
Subjt:  DTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGS-WDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQP

Query:  WETSNTIIVSEDWDSADSRMLSEEENI-------VNQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDI
        WET+ TI + EDWD+AD +M+ EE+ +       +  +D   V+   +    S  A+ +A  V   S ++        + S   E     S      +++
Subjt:  WETSNTIIVSEDWDSADSRMLSEEENI-------VNQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDI

Query:  MASNIISTKESYILNTSNKDVSEVYSNPNGETTIISQSETKRTEEVL
        +   + + +ES +L      +S++    N +  +I++ + +   EV+
Subjt:  MASNIISTKESYILNTSNKDVSEVYSNPNGETTIISQSETKRTEEVL

AT5G01260.2 Carbohydrate-binding-like fold | e value: 3.5e-40 | % identity: 32.9
Query:  DTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGS-WDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQP
        D+Q  V + +    +KTVRV+FQL+KEC FGEHFF+VGDDP FG  WD  +A+PLNW+DG++W  ++++PVG+ ++FK +L+ +TG ++WQPGP+R  + 
Subjt:  DTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGS-WDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQP

Query:  WETSNTIIVSEDWDSADSRMLSEEENIVNQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIIS
        WET+ TI + EDWD+AD +M+ EE+           VP       SS    D   V  S  ++  V+ +      S+E+  N S S ++ K +  SN   
Subjt:  WETSNTIIVSEDWDSADSRMLSEEENIVNQDDHSPVVPEKLMIEDSSFALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIIS

Query:  TKESYILNTSNKDVSEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINE
        T    I                         E   TEE                    ES      PVLVPGL P    S+ D  Q EV ++G  +   E
Subjt:  TKESYILNTSNKDVSEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINE

Query:  SNDHKLPENIQDPDVVVELEMEAKSSYEENVVQSEIRQ----EDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRL
         +  + P+  ++    V+     + S +E V   E RQ    E++  ++  E++     D + +NDI WG +TL K  S+ RL
Subjt:  SNDHKLPENIQDPDVVVELEMEAKSSYEENVVQSEIRQ----EDDTNKIANESDLQEVNDSIVQNDITWGHKTLKKFFSSLRL

AT5G26570.1 catalytics;carbohydrate kinases;phosphoglucan, water dikinases | e value: 7.5e-11 | % identity: 28.39
Query:  LQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQF
        L H     R     L+C ++   + T  E  + +D + +K VR+  +L  +  FG+H  + G     GSW   S  PLNW++   W  E+E+  G+ +++
Subjt:  LQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQF

Query:  KFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD
        KFV+    G++ W+ G +R+ +   + N  +V   WD A    L   + + N DD
Subjt:  KFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD

AT5G26570.2 catalytics;carbohydrate kinases;phosphoglucan, water dikinases | e value: 7.5e-11 | % identity: 28.39
Query:  LQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQF
        L H     R     L+C ++   + T  E  + +D + +K VR+  +L  +  FG+H  + G     GSW   S  PLNW++   W  E+E+  G+ +++
Subjt:  LQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVVGDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQF

Query:  KFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD
        KFV+    G++ W+ G +R+ +   + N  +V   WD A    L   + + N DD
Subjt:  KFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDD


Sequences
CDS sequence
ATGAAAACCCTAGCGACCTCCAACTCCATCATCGGTAACAATGCAGCTCCTTCTTCCTTCTCTGCTTCTTCGCTGAAAGAGCGTCTTCTTTGTGGAGGACCTGAATTCGT
CTCTTATCGGAGGCATCGGAAATTGACTAGTTCTGGACTTCAGCATTTGGTATCGTTGCGCCGGGGAGGCATTGAATTTCTTTCTTGCTTCTCGTCTCATCAGCAGGCAG
ATACTCAGAATGAGGTAGTTGAGAATCAAGACACGAATCAATCAAAGACAGTTCGCGTCAAATTCCAGCTGCAGAAAGAGTGTACATTTGGGGAGCATTTCTTTGTAGTA
GGTGATGATCCAAGTTTTGGTTCCTGGGACGTTACAAGTGCAATACCTTTAAACTGGGCAGATGGGCATCTATGGGCAGCAGAAGTGGAGATTCCTGTTGGAAAAGCAAT
CCAATTCAAATTCGTACTTCAAGGAGAAACTGGGAATGTTGTATGGCAACCTGGTCCTGATCGAATATTTCAACCCTGGGAAACATCTAATACAATCATCGTTTCTGAAG
ATTGGGATAGTGCTGACTCACGGATGCTCAGTGAAGAAGAAAACATTGTTAACCAGGATGATCATTCTCCCGTTGTCCCAGAAAAGTTAATGATTGAGGATTCATCATTT
GCTCTTGCCGATGCTTCAATAGTAGAAAAATCATCGGTGGAATCGCATGAAGTGTTGATTCTTGGCGGTAACATCTCAGCTTCAGAAGAAAATGGCAGTAATGTCTCTGC
TTCAGAAGAGAATACCAAAGATATTATGGCATCGAATATAATCTCAACAAAGGAGAGCTACATTCTCAATACAAGTAACAAGGATGTGAGCGAGGTATACAGCAATCCAA
ATGGGGAGACAACAATTATATCCCAGAGTGAAACAAAGAGAACAGAGGAAGTTTTGGAAAATTATGAGAAAGAAGAAACAGCGAAGATCCCTAGGAATGCGGATGTTCAA
GAAAGCTTTATCAACTATGGCGTTCCTGTTCTAGTTCCTGGTTTACCTCCAACACCAACAACCTCAAATCAGGATGCACCTCAACATGAAGTCAAAGATGATGGTTCCAT
CGATGGAATTAATGAATCTAACGATCATAAACTACCTGAGAACATACAGGATCCTGATGTCGTGGTAGAACTAGAGATGGAAGCGAAGTCAAGTTATGAAGAAAATGTCG
TCCAAAGTGAAATTAGACAAGAGGATGACACAAATAAAATTGCGAATGAATCTGATTTGCAGGAAGTCAACGATAGTATCGTTCAGAATGACATAACATGGGGTCATAAA
ACCCTGAAGAAGTTCTTCTCCAGTTTAAGGTTGCTTTAG
mRNA sequence
TGCACGTGAGAAGTCGTTTTCCCTTATCCTCACGAGGCCCCACCGTTATATAATGGCAACCACCAGAAACTCCATTAAGGCGCGAGACCTAAGCTCTGAGAGTGTCATTG
TCAGTGGCAGACTAAGATTGTGGAGAGGATGAAAACCCTAGCGACCTCCAACTCCATCATCGGTAACAATGCAGCTCCTTCTTCCTTCTCTGCTTCTTCGCTGAAAGAGC
GTCTTCTTTGTGGAGGACCTGAATTCGTCTCTTATCGGAGGCATCGGAAATTGACTAGTTCTGGACTTCAGCATTTGGTATCGTTGCGCCGGGGAGGCATTGAATTTCTT
TCTTGCTTCTCGTCTCATCAGCAGGCAGATACTCAGAATGAGGTAGTTGAGAATCAAGACACGAATCAATCAAAGACAGTTCGCGTCAAATTCCAGCTGCAGAAAGAGTG
TACATTTGGGGAGCATTTCTTTGTAGTAGGTGATGATCCAAGTTTTGGTTCCTGGGACGTTACAAGTGCAATACCTTTAAACTGGGCAGATGGGCATCTATGGGCAGCAG
AAGTGGAGATTCCTGTTGGAAAAGCAATCCAATTCAAATTCGTACTTCAAGGAGAAACTGGGAATGTTGTATGGCAACCTGGTCCTGATCGAATATTTCAACCCTGGGAA
ACATCTAATACAATCATCGTTTCTGAAGATTGGGATAGTGCTGACTCACGGATGCTCAGTGAAGAAGAAAACATTGTTAACCAGGATGATCATTCTCCCGTTGTCCCAGA
AAAGTTAATGATTGAGGATTCATCATTTGCTCTTGCCGATGCTTCAATAGTAGAAAAATCATCGGTGGAATCGCATGAAGTGTTGATTCTTGGCGGTAACATCTCAGCTT
CAGAAGAAAATGGCAGTAATGTCTCTGCTTCAGAAGAGAATACCAAAGATATTATGGCATCGAATATAATCTCAACAAAGGAGAGCTACATTCTCAATACAAGTAACAAG
GATGTGAGCGAGGTATACAGCAATCCAAATGGGGAGACAACAATTATATCCCAGAGTGAAACAAAGAGAACAGAGGAAGTTTTGGAAAATTATGAGAAAGAAGAAACAGC
GAAGATCCCTAGGAATGCGGATGTTCAAGAAAGCTTTATCAACTATGGCGTTCCTGTTCTAGTTCCTGGTTTACCTCCAACACCAACAACCTCAAATCAGGATGCACCTC
AACATGAAGTCAAAGATGATGGTTCCATCGATGGAATTAATGAATCTAACGATCATAAACTACCTGAGAACATACAGGATCCTGATGTCGTGGTAGAACTAGAGATGGAA
GCGAAGTCAAGTTATGAAGAAAATGTCGTCCAAAGTGAAATTAGACAAGAGGATGACACAAATAAAATTGCGAATGAATCTGATTTGCAGGAAGTCAACGATAGTATCGT
TCAGAATGACATAACATGGGGTCATAAAACCCTGAAGAAGTTCTTCTCCAGTTTAAGGTTGCTTTAGCATCACAAATTCATTCTTATTGCTTTACTATGTTGTTCCCCAA
GAAATTTTCAACTTACTGTGACCCAATTGGGTGGATGGAAGCTGCTGTTTGGGAATTGTACATATGGTCTGCCACTTCACTATATTCTAATACTACTTGGTGTAAATACA
CAGACTACGAAATTGGTTCATTAGCAAATCTGGACTGCAATGGTTTTGCAAGAATCTGGTAATTATAGTCTTTAGTCATTTGGATTGTATGTATATGTTTAATTTTAAAT
TTCAAGAACTCTGGTCATATATTGTTTTGCATTATAATCAGAACGTATGAGTTAAAAGTGAGTTCATAATTAAAGAAACTCTTAGATCATGATGTTACTGTGAATGGGCA
ACAACTGCGTAGGGCTGAAAAGTTGATCATTAAAGACTGCATATAGCGTACGCCACAAATAGTGTTGTTGGGTTGCCTAGTAATGTGTTATTTTGTCACTTCGAGGTAGA
TCTCACACAGAACAAGGCCAAATTAAATGAAGGTTGCCCTGGTTGATGATCGTACAGTAATCGATACTCCCAATGCGTGGGGGCATTCGTTTTCTAGGTGGGTAACTAAT
CGCGAATCCAGGTGTTGACCCAAATTGATTCAATGCTGAGTTAGCTTGGCTTAATTCTTCTTGTTCCATGCTGTGACTACTAAGTAGTAAAATTTTATGGTTTGATATTG
TCCGCTTTGACTCGTTATGTATCGTTGTTAATCACAATTTTTAAAACGTGTCTACTAAGGAGAGATGTTTACACTTATAAGAAATGTTTTGTTCTACTTTCTAATCGACA
TG
Protein sequence
MKTLATSNSIIGNNAAPSSFSASSLKERLLCGGPEFVSYRRHRKLTSSGLQHLVSLRRGGIEFLSCFSSHQQADTQNEVVENQDTNQSKTVRVKFQLQKECTFGEHFFVV
GDDPSFGSWDVTSAIPLNWADGHLWAAEVEIPVGKAIQFKFVLQGETGNVVWQPGPDRIFQPWETSNTIIVSEDWDSADSRMLSEEENIVNQDDHSPVVPEKLMIEDSSF
ALADASIVEKSSVESHEVLILGGNISASEENGSNVSASEENTKDIMASNIISTKESYILNTSNKDVSEVYSNPNGETTIISQSETKRTEEVLENYEKEETAKIPRNADVQ
ESFINYGVPVLVPGLPPTPTTSNQDAPQHEVKDDGSIDGINESNDHKLPENIQDPDVVVELEMEAKSSYEENVVQSEIRQEDDTNKIANESDLQEVNDSIVQNDITWGHK
TLKKFFSSLRLL
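
The CDS and protein sequences listed for Carg12483 can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the `translate` helper and the variable names are illustrative assumptions, and it assumes the standard nuclear genetic code (NCBI translation table 1). Only the first and last 18 nucleotides of the CDS above are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last 18 nucleotides of the Carg12483 CDS shown above.
cds_start = "ATGAAAACCCTAGCGACC"
cds_end = "AGTTTAAGGTTGCTTTAG"

print(translate(cds_start))  # MKTLAT
print(translate(cds_end))    # SLRLL
```

Both fragments agree with the start (`MKTLAT...`) and end (`...SLRLL`) of the protein sequence above. Pasting the full CDS into `translate` would be expected to give the complete protein under the same assumption.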