; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0014073 (gene) of Chayote v1 genome

Gene IDSed0014073
OrganismSechium edule (Chayote v1)
DescriptionCBM20 domain-containing protein
Genome locationLG02:35727210..35733495
RNA-Seq ExpressionSed0014073
SyntenySed0014073
Gene Ontology termsGO:2001070 - starch binding (molecular function)
InterPro domainsIPR002044 - Carbohydrate binding module family 20
IPR013783 - Immunoglobulin-like fold
IPR013784 - Carbohydrate-binding-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580479.1 putative LRR receptor-like serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. sororia]8.2e-14659.38Show/hide
Query:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE
        MKTLAT NSIIGN A P +FSASSLKERLL  GP+F+SYRR  KL  SGLQHL  LRRGGIEF+SCFSSH Q DTQN+ VENQ TN +KTVRVKFQLQKE
Subjt:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE

Query:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV
        CTFGEHFF+VGDDP FGSWDVT+AIPLNWADGH W AEVEIPVGK IQFKFVLQG TGNV WQP PDR FQ WE++NTIIV+EDWDSA+SR+L  EE  V
Subjt:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV

Query:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA
        NQD+ SP+V EKL++                                                     EDSS AL+D S+ EKS +ESH  L+    +SA
Subjt:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA

Query:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPG
         EEN SNVS SE+N     ASNI +TKE+ +   L T NK VSEV  + NGE TI +QS+T  TE  L+  EK+ T  I  N DVQ SFI+YGVP+LVPG
Subjt:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPG

Query:  LPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDI
        LPPT  T         +QDAP HE++DD +I+G N     KLP+  QDPD VV E  ME KSSYEE+VVQSEI  EDD+NKI NE         I+QNDI
Subjt:  LPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDI

Query:  TWGHKTLKKFFSSFRLL
        TWGHKTLKKFFSS RLL
Subjt:  TWGHKTLKKFFSSFRLL

KAG7017230.1 hypothetical protein SDJN02_19093 [Cucurbita argyrosperma subsp. argyrosperma]4.4e-14759.77Show/hide
Query:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE
        MKTLAT NSIIGN A P +FSASSLKERLL  GP+F+SYRR  KL  SGLQHL  LRRGGIEF+SCFSSH Q DTQN+ VENQDTN +KTVRVKFQLQKE
Subjt:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE

Query:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV
        CTFGEHFF+VGDDP FGSWDVT+AIPLNWADGH W AEVEIPVGK IQFKFVLQG TGNV WQP PDR FQ WE++NTIIV+EDWDSA+SR+L  EE  V
Subjt:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV

Query:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA
        NQD+ SP+V EKL++                                                     EDSS AL+D S+ EKS +ESH  L+    +SA
Subjt:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA

Query:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPG
         EEN SNVS SE+N     ASNI +TKE+ +   L T NK VSEV S+ NGE TI +QS+T  TE  L+  EK+ T  I  N DVQ SFI+YGVP+LVPG
Subjt:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPG

Query:  LPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDI
        LPPT  T         +QDAP HE++DD +I+G N     KLP+  QDPD VV E  ME KSSYEE+VVQSEI  EDD+NKI NE         I+QNDI
Subjt:  LPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDI

Query:  TWGHKTLKKFFSSFRLL
        TWGHKTLKKFFSS RLL
Subjt:  TWGHKTLKKFFSSFRLL

XP_022934469.1 uncharacterized protein LOC111441639 [Cucurbita moschata]8.2e-14659.38Show/hide
Query:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE
        MKTLAT NSIIGN A P +FSASSLKERLL  GP+F+SYRR  KL  SGLQHL  LRRGGIEF+SCFSSH Q DTQN+ VENQ TN +KTVRVKFQLQKE
Subjt:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE

Query:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV
        CTFGEHFF+VGDDP FGSWDVT+AIPLNWADGH W AEVEIPVGK IQFKFVLQG TGNV WQP PDR FQ WE++NTIIV+EDWDSA+SR+L  EE  V
Subjt:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV

Query:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA
        NQD+ SP+V EKL++                                                     EDSS AL+D S+ EKS +ESH  L+    +SA
Subjt:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA

Query:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPG
         EEN SNVS SE+N     ASNI +TKE+ +   L T NK VSEV  + NGE TI +QS+T  TE  L+  EK+ T  I  N DVQ SFI+YGVP+LVPG
Subjt:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPG

Query:  LPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDI
        LPPT  T         +QDAP HE++DD +I+G N     KLP+  QDPD VV E  ME KSSYEE+VVQSEI  EDD+NKI NE         I+QNDI
Subjt:  LPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDI

Query:  TWGHKTLKKFFSSFRLL
        TWGHKTLKKFFSS RLL
Subjt:  TWGHKTLKKFFSSFRLL

XP_022983429.1 uncharacterized protein LOC111482035 [Cucurbita maxima]3.7e-14659.96Show/hide
Query:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE
        MKTLAT NSIIGN A P +FSAS LKERLL  GP+F+SYRR  KL  SGLQHL  LRRGGIEF+ CFSSH Q DTQN+ VENQDTN +KTVRVKFQLQKE
Subjt:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE

Query:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV
        CTFGEHFF+VGDDP FGSWDVT+AIPLNWADGH W AEVEIPVGK IQFKFVLQG TGNV WQP PDRTFQ WE++NTIIV+EDWDSAESRIL  EE  +
Subjt:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV

Query:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA
        NQDE SP+VSEKL++E+  +         AL++ S+ E + +ES   +I G NIS SEENG                                  S VSA
Subjt:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA

Query:  LEENTSNVSLSEDNASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPGLPPTT
         EENT ++ +     SNI + KE+ +   L T NKAVSEV S+ NGE TI +QS+T   E  L+  EK+ T  I  N DVQ SFI+YGVP+LVPGLPPT 
Subjt:  LEENTSNVSLSEDNASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPGLPPTT

Query:  ATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDITWGHK
         T         +QDAP HE+EDD +I+G N     KLP+  QDPD VV E  ME KSSYEE+VVQSEI  EDD+NKI NE         I++NDITWGHK
Subjt:  ATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDITWGHK

Query:  TLKKFFSSFRLL
        TLKKFFSS RLL
Subjt:  TLKKFFSSFRLL

XP_023527439.1 uncharacterized protein LOC111790671 [Cucurbita pepo subsp. pepo]1.0e-14058.35Show/hide
Query:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE
        MKTLAT NSIIGN A P +FSASSLKERLL  GP+F+SYRR  KL  SGLQHL  LRRGGIEF+ CFS H Q DTQN+ VENQDTN +KTVRVKFQLQKE
Subjt:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE

Query:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV
        CTFGEHFF+VGDDP FGSWDVT+AIPLNWADGH W AEVEIPVGK IQFKFVLQG TGNV WQP PDRTFQ WE++NTIIV+EDWDSAESRIL  EE  V
Subjt:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV

Query:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA
        NQD+ SP+V EKL++                                                     EDSS AL+D S+ EKS +E H  L+    +SA
Subjt:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA

Query:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYG----VPI
         EEN SNVS SE+N     ASNI + KE+ +   L T NKAVSEV ++ NGE T+ + S+T  TE  L+  EK+ T  I     VQ SFI+YG    VP+
Subjt:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYG----VPI

Query:  LVPGLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------II
        LVPGLPPT  T         +QDAP HE++DD +I+G N     KLP+  QDPD VV E  ME KSSYEE+VVQSEI  EDD+NKI NE         I+
Subjt:  LVPGLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------II

Query:  QNDITWGHKTLKKFFSSFRLL
        QNDITWGHKTLKKFFSS RLL
Subjt:  QNDITWGHKTLKKFFSSFRLL

TrEMBL top hitse value%identityAlignment
A0A0A0LA83 CBM20 domain-containing protein1.6e-12355.02Show/hide
Query:  MKTLATYNSIIGNAALPPNF--SASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFI-SCFSSHSQVDT-QNDEVENQDTNITKTVRVKFQ
        MKTL TYNSII N +    F  S+SSLKERLLS GP+FISYRRP KLA+SGLQHL PLRRGGI+FI SCF+S+ Q DT QND VENQ+T+ +KTVRVKFQ
Subjt:  MKTLATYNSIIGNAALPPNF--SASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFI-SCFSSHSQVDT-QNDEVENQDTNITKTVRVKFQ

Query:  LQKECTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAE
        L KECTFGEHF++VGDDP+FGSWDVT+AIPLNWADGHQW AEV+IPVGK IQFKF+LQGITGNV WQP PDRTFQ WE++NTIIV+EDWDSAESRIL  E
Subjt:  LQKECTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAE

Query:  EKTVNQDEDSPIVSEKLLVEE-LTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDD
        EK VNQ+EDSPI  E L+ E+ LTY +  +E    + + S+A    +    ELIDGSNIS  EENG                                  
Subjt:  EKTVNQDEDSPIVSEKLLVEE-LTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDD

Query:  SIVSALEENTSNVSLSEDNASNIPATKENTVAEALETG---NKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPIL
          +SA EEN +NVSL E + S+I  + +N  A+ L  G   NKAVSEV  D           DT ITE +L+ + KD          VQ S +D  VPIL
Subjt:  SIVSALEENTSNVSLSEDNASNIPATKENTVAEALETG---NKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPIL

Query:  VPGLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLP--------QKQDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENEIIQND
        VPGLPPT            +Q+AP HE+EDD ++ G N     KLP        QK DP+ VVA Q ME KSSYE+D   + I ++ D  +I N+++QND
Subjt:  VPGLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLP--------QKQDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENEIIQND

Query:  ITWGHKTLKKFFSSFRLL
        +TWGHKTLKKF SS RLL
Subjt:  ITWGHKTLKKFFSSFRLL

A0A5D3DMY0 Carbohydrate-binding-like fold, putative isoform 21.1e-11151.63Show/hide
Query:  MKTLATYNSIIGNAALPPNF----SASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVD-TQNDEVENQDTNITKTVRVKF
        MKTL T NSII N +    F    S+SS+KERLLS GP+FISYRRP KLA+SGLQH  PLRRGGI+FISCFSS+ Q D  Q+D +ENQ+T+ +KTVRVKF
Subjt:  MKTLATYNSIIGNAALPPNF----SASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVD-TQNDEVENQDTNITKTVRVKF

Query:  QLQKECTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCA
        QLQKECTFGEHFF+VGDDP+FGSWDVT+AIPLNWADGHQW AEV+IPVGK IQFKF+LQGITGNVEWQP PDRTFQ WE++NTIIV+EDWDSAESRIL  
Subjt:  QLQKECTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCA

Query:  EEKTVNQDEDSPIVSEKLLVE-ELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVD
        EEK VNQ+E SPI  E L+VE  LTY N  +E     ++ S+A    +ES    IDGSNI   EENG                                 
Subjt:  EEKTVNQDEDSPIVSEKLLVE-ELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVD

Query:  DSIVSALEENTSNVSLSEDNASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVP
           +SA EEN SNVSL   N S+I                       SDSN            IT+  L+ + +D          VQ S +D  VPILVP
Subjt:  DSIVSALEENTSNVSLSEDNASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVP

Query:  GLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQKQ----DPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSN---------KIENEII
        GLPP                    ++E D +++G N     KLP+ Q    DP+ VVA Q ME KSSYE      EI  EDD+N         +I N+I+
Subjt:  GLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQKQ----DPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSN---------KIENEII

Query:  QNDITWGHKTLKKFFSSFRLL
        QNDITWGHKTLKKF SS RLL
Subjt:  QNDITWGHKTLKKFFSSFRLL

A0A6J1CVP4 uncharacterized protein LOC1110150011.5e-13756.03Show/hide
Query:  MKTLATYNSIIGNAALPPNFSAS--SLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQ
        M+TLAT NSII N   PP FSAS  SL+ERLL  GP+FISYR P K A SGLQHL  LRRGGI+F +  SSH+QVDTQND VENQDTN  KTVRVKFQLQ
Subjt:  MKTLATYNSIIGNAALPPNFSAS--SLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQ

Query:  KECTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEK
        KECTFGE F +VGDDP+ GSW+VT+AIPLNWADGHQW AEVEIPVGKTIQFKFVLQG TGNV WQP PDRTFQ WE+TNTI+V+EDWDS ES  L  EEK
Subjt:  KECTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEK

Query:  TVNQDEDSPIVSEKLLV---EELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDD
         VNQ+EDSPIVSE L++    E    N  KE  VAL +TS+AE + +ES  ELID S IS S+ENG                                  
Subjt:  TVNQDEDSPIVSEKLLV---EELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDD

Query:  SIVSALEENTSNVSLSEDNASNIPATKENTVAE---------ALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFID
        S +SAL+E+  N+SL E+NAS+I A K N VAE          L + NK VSEV S+SNGE T T QSDT ITE   +  EK ATV ILGN DVQ S I+
Subjt:  SIVSALEENTSNVSLSEDNASNIPATKENTVAE---------ALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFID

Query:  YGVPILVPGLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQ------KQDPDAVVAEQGMEGKSSY--------EEDVVQSEIIHEDD
          VPILVPGLPPT  T         ++ AP HE+E DV+ING N     +LP+      KQ P  +VAE+ +E K SY        +ED +QSEI  +DD
Subjt:  YGVPILVPGLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQ------KQDPDAVVAEQGMEGKSSY--------EEDVVQSEIIHEDD

Query:  SNKIE---------NEIIQNDITWGHKTLKKFFSSFRLL
         NKIE         N+I++ND+TWGHKTL K  ++ + L
Subjt:  SNKIE---------NEIIQNDITWGHKTLKKFFSSFRLL

A0A6J1F2P2 uncharacterized protein LOC1114416394.0e-14659.38Show/hide
Query:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE
        MKTLAT NSIIGN A P +FSASSLKERLL  GP+F+SYRR  KL  SGLQHL  LRRGGIEF+SCFSSH Q DTQN+ VENQ TN +KTVRVKFQLQKE
Subjt:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE

Query:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV
        CTFGEHFF+VGDDP FGSWDVT+AIPLNWADGH W AEVEIPVGK IQFKFVLQG TGNV WQP PDR FQ WE++NTIIV+EDWDSA+SR+L  EE  V
Subjt:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV

Query:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA
        NQD+ SP+V EKL++                                                     EDSS AL+D S+ EKS +ESH  L+    +SA
Subjt:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA

Query:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPG
         EEN SNVS SE+N     ASNI +TKE+ +   L T NK VSEV  + NGE TI +QS+T  TE  L+  EK+ T  I  N DVQ SFI+YGVP+LVPG
Subjt:  LEENTSNVSLSEDN-----ASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPG

Query:  LPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDI
        LPPT  T         +QDAP HE++DD +I+G N     KLP+  QDPD VV E  ME KSSYEE+VVQSEI  EDD+NKI NE         I+QNDI
Subjt:  LPPTTATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDI

Query:  TWGHKTLKKFFSSFRLL
        TWGHKTLKKFFSS RLL
Subjt:  TWGHKTLKKFFSSFRLL

A0A6J1J7C1 uncharacterized protein LOC1114820351.8e-14659.96Show/hide
Query:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE
        MKTLAT NSIIGN A P +FSAS LKERLL  GP+F+SYRR  KL  SGLQHL  LRRGGIEF+ CFSSH Q DTQN+ VENQDTN +KTVRVKFQLQKE
Subjt:  MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKE

Query:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV
        CTFGEHFF+VGDDP FGSWDVT+AIPLNWADGH W AEVEIPVGK IQFKFVLQG TGNV WQP PDRTFQ WE++NTIIV+EDWDSAESRIL  EE  +
Subjt:  CTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTV

Query:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA
        NQDE SP+VSEKL++E+  +         AL++ S+ E + +ES   +I G NIS SEENG                                  S VSA
Subjt:  NQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSA

Query:  LEENTSNVSLSEDNASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPGLPPTT
         EENT ++ +     SNI + KE+ +   L T NKAVSEV S+ NGE TI +QS+T   E  L+  EK+ T  I  N DVQ SFI+YGVP+LVPGLPPT 
Subjt:  LEENTSNVSLSEDNASNIPATKENTVAEALETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPGLPPTT

Query:  ATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDITWGHK
         T         +QDAP HE+EDD +I+G N     KLP+  QDPD VV E  ME KSSYEE+VVQSEI  EDD+NKI NE         I++NDITWGHK
Subjt:  ATEASSQYQDPDQDAPLHELEDDVAINGNN-----KLPQK-QDPDAVVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENE---------IIQNDITWGHK

Query:  TLKKFFSSFRLL
        TLKKFFSS RLL
Subjt:  TLKKFFSSFRLL

SwissProt top hitse value%identityAlignment
P08704 Cyclomaltodextrin glucanotransferase2.1e-0626.36Show/hide
Query:  VDTQNDEVENQDTNITKTVRVKFQLQKECTF-GEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQ---GITGNVEWQPDPDR
        + +Q+D+ EN          + F      T  G+  +++G+ P  G WD+T A+ ++     QW+A +E+P    +++K V +     T NVEWQ   + 
Subjt:  VDTQNDEVENQDTNITKTVRVKFQLQKECTF-GEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQ---GITGNVEWQPDPDR

Query:  TFQAWESTNT
         F + ++  T
Subjt:  TFQAWESTNT

P0DN29 Glucoamylase ARB_02327-14.7e-1141.46Show/hide
Query:  VKFQLQKECTFGEHFFLVGDDPMFGSWDVTNAIPLN---WADG-HQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTF
        V+F+L      GE  FLVG  P  GSWDV  A+PLN   +AD  HQW  ++E+P     ++KF+ +   G V W+ DP+R +
Subjt:  VKFQLQKECTFGEHFFLVGDDPMFGSWDVTNAIPLN---WADG-HQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTF

P30270 Alpha-amylase1.3e-0523.47Show/hide
Query:  TKTVRVKFQLQKECTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDW
        T      F +     +GE+ ++ GD    G+WD   A+ L+ A    W  +V +  G   Q+K++ +   G   W+   +RT     +T  + + + W
Subjt:  TKTVRVKFQLQKECTFGEHFFLVGDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDW

P31746 Cyclomaltodextrin glucanotransferase1.0e-0530Show/hide
Query:  VRVKFQLQKECTF-GEHFFLVGDDPMFGSWDVTNAI-PLNWADGHQ---WTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDW
        V V+F +    T  G + ++VG+    G+WD   AI P+     +Q   W  ++ +P GK +++K++ +   GNV WQ   +RT+ +  +T T  V  +W
Subjt:  VRVKFQLQKECTF-GEHFFLVGDDPMFGSWDVTNAI-PLNWADGHQ---WTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDW

P31797 Cyclomaltodextrin glucanotransferase4.6e-0627.68Show/hide
Query:  ENQDTNITKTVRVKFQLQKECT-FGEHFFLVGDDPMFGSWDVTNAIPLNW----ADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWES
        +N +      V V+F +    T  G++ ++VG+    G+WD + AI   +         W  +V +P GKTI+FKF+ +   GNV W+   +  +    +
Subjt:  ENQDTNITKTVRVKFQLQKECT-FGEHFFLVGDDPMFGSWDVTNAIPLNW----ADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWES

Query:  TNTIIVTEDWDS
        T   I+  DW +
Subjt:  TNTIIVTEDWDS

Arabidopsis top hitse value%identityAlignment
AT5G01260.1 Carbohydrate-binding-like fold8.8e-3738.94Show/hide
Query:  PLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKECTFGEHFFLVGDDPMFGS-WDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVL
        PLR   I+        SQV+ +++E+E  +    KTVRV+FQL+KEC FGEHFF+VGDDP+FG  WD   A+PLNW+DG+ WT ++++PVG+ ++FK +L
Subjt:  PLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKECTFGEHFFLVGDDPMFGS-WDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVL

Query:  QGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEE--------KTVNQDEDSPIVSEK-----LLVEELTYINGFKETPVALSETSVAENT
        +  TG + WQP P+R  + WE+  TI + EDWD+A+ +++  E+           ++DED  + S +     + VE   Y++   E+    S +  +E T
Subjt:  QGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEE--------KTVNQDEDSPIVSEK-----LLVEELTYINGFKETPVALSETSVAENT

Query:  LMESQGEL
        +  S G L
Subjt:  LMESQGEL

AT5G01260.2 Carbohydrate-binding-like fold6.5e-3232.94Show/hide
Query:  PLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKECTFGEHFFLVGDDPMFGS-WDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVL
        PLR   I+        SQV+ +++E+E  +    KTVRV+FQL+KEC FGEHFF+VGDDP+FG  WD   A+PLNW+DG+ WT ++++PVG+ ++FK +L
Subjt:  PLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKECTFGEHFFLVGDDPMFGS-WDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVL

Query:  QGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTVNQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGS-
        +  TG + WQP P+R  + WE+  TI + EDWD+A+ +++  EE  V     S I SE              E  V  S    +    +E+ G + D S 
Subjt:  QGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTVNQDEDSPIVSEKLLVEELTYINGFKETPVALSETSVAENTLMESQGELIDGS-

Query:  -NISGSEENGAFEENTNEDSSVALSDTSVAEKSLM--ESHGELVDDSIVSALEENTSNVSLSEDNASNIPAT--KENTVAEALETGN-KAVSEVQSDSNG
         N S S ++    E T E S+ AL+   V ++++   E    LV   I  +  +N     ++E  A   P    K+   AE  +    KA+S  +     
Subjt:  -NISGSEENGAFEENTNEDSSVALSDTSVAEKSLM--ESHGELVDDSIVSALEENTSNVSLSEDNASNIPAT--KENTVAEALETGN-KAVSEVQSDSNG

Query:  EMTITAQSDTMITEANLKKEEKD--ATVTILGNTDVQ
         +    Q      E   ++ E +   T  +L   D+Q
Subjt:  EMTITAQSDTMITEANLKKEEKD--ATVTILGNTDVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCCTAGCGACCTACAACTCCATCATCGGCAACGCTGCGCTTCCCCCTAACTTCTCTGCTTCTTCTTTGAAAGAGCGTCTTCTTTCTGCCGGACCGGATTTCAT
CTCTTATCGGAGGCCGATGAAATTGGCTGATTCTGGACTTCAGCATTTGGCACCTTTGCGCCGCGGAGGCATCGAATTCATTTCTTGCTTCTCCTCTCATTCTCAGGTAG
ATACTCAAAATGATGAAGTTGAGAATCAAGATACAAATATAACAAAGACTGTTCGCGTCAAATTCCAGCTGCAGAAAGAGTGCACGTTTGGGGAGCATTTCTTTTTAGTA
GGTGATGATCCAATGTTTGGTTCTTGGGACGTTACAAATGCAATACCTTTAAACTGGGCTGATGGGCATCAATGGACAGCAGAAGTGGAGATTCCCGTTGGGAAAACAAT
CCAGTTCAAATTCGTACTTCAAGGAATAACTGGAAATGTTGAATGGCAACCAGATCCTGATCGAACGTTTCAAGCCTGGGAATCAACTAATACAATCATCGTTACTGAAG
ATTGGGACAGTGCTGAATCACGGATACTATGTGCAGAAGAAAAAACTGTTAACCAGGATGAGGATTCTCCCATTGTCTCAGAAAAGTTATTGGTTGAGGAGCTCACTTAT
ATAAACGGATTTAAGGAAACACCAGTTGCCCTTTCCGAAACTTCTGTAGCAGAAAACACATTGATGGAGTCGCAGGGAGAATTGATTGATGGCAGTAATATCTCAGGTTC
GGAAGAAAATGGCGCGTTTGAAGAGAATACCAATGAAGATTCATCAGTTGCACTTTCTGATACTTCAGTAGCAGAAAAATCATTGATGGAGTCACACGGAGAATTGGTTG
ATGACAGTATTGTCTCTGCATTAGAAGAGAATACCAGTAACGTTTCTCTTTCAGAGGATAATGCTAGCAACATTCCTGCTACAAAAGAGAATACCGTGGCAGAGGCTCTT
GAAACCGGTAACAAGGCTGTCAGCGAGGTACAAAGCGATTCAAATGGGGAGATGACAATTACAGCCCAGAGTGATACAATGATAACAGAGGCGAACTTGAAGAAAGAAGA
GAAAGATGCAACCGTTACGATCCTTGGGAATACAGATGTTCAAGGAAGCTTCATTGACTATGGAGTTCCCATTCTCGTTCCTGGTTTGCCTCCAACAACAGCAACAGAAG
CATCGAGTCAGTATCAGGATCCGGATCAGGATGCACCTCTACATGAACTCGAGGATGATGTTGCCATCAATGGAAATAATAAACTACCTCAGAAACAGGATCCTGATGCT
GTTGTGGCTGAACAAGGGATGGAGGGGAAGTCTAGTTATGAAGAAGATGTCGTCCAAAGTGAAATTATACATGAGGATGACTCAAATAAAATCGAGAATGAAATCATTCA
AAATGACATAACATGGGGCCATAAAACCCTGAAGAAGTTTTTCTCCAGTTTTAGATTGCTTTAG
mRNA sequenceShow/hide mRNA sequence
AAAGCAAGCAGATCCTCCATTAAAGGCGGAGCCAAAGCTCTGTGAGTGTTTGTCAATATCTGCAGCGAACTGAGATTCTGAGAGCGATGAAAACCCTAGCGACCTACAAC
TCCATCATCGGCAACGCTGCGCTTCCCCCTAACTTCTCTGCTTCTTCTTTGAAAGAGCGTCTTCTTTCTGCCGGACCGGATTTCATCTCTTATCGGAGGCCGATGAAATT
GGCTGATTCTGGACTTCAGCATTTGGCACCTTTGCGCCGCGGAGGCATCGAATTCATTTCTTGCTTCTCCTCTCATTCTCAGGTAGATACTCAAAATGATGAAGTTGAGA
ATCAAGATACAAATATAACAAAGACTGTTCGCGTCAAATTCCAGCTGCAGAAAGAGTGCACGTTTGGGGAGCATTTCTTTTTAGTAGGTGATGATCCAATGTTTGGTTCT
TGGGACGTTACAAATGCAATACCTTTAAACTGGGCTGATGGGCATCAATGGACAGCAGAAGTGGAGATTCCCGTTGGGAAAACAATCCAGTTCAAATTCGTACTTCAAGG
AATAACTGGAAATGTTGAATGGCAACCAGATCCTGATCGAACGTTTCAAGCCTGGGAATCAACTAATACAATCATCGTTACTGAAGATTGGGACAGTGCTGAATCACGGA
TACTATGTGCAGAAGAAAAAACTGTTAACCAGGATGAGGATTCTCCCATTGTCTCAGAAAAGTTATTGGTTGAGGAGCTCACTTATATAAACGGATTTAAGGAAACACCA
GTTGCCCTTTCCGAAACTTCTGTAGCAGAAAACACATTGATGGAGTCGCAGGGAGAATTGATTGATGGCAGTAATATCTCAGGTTCGGAAGAAAATGGCGCGTTTGAAGA
GAATACCAATGAAGATTCATCAGTTGCACTTTCTGATACTTCAGTAGCAGAAAAATCATTGATGGAGTCACACGGAGAATTGGTTGATGACAGTATTGTCTCTGCATTAG
AAGAGAATACCAGTAACGTTTCTCTTTCAGAGGATAATGCTAGCAACATTCCTGCTACAAAAGAGAATACCGTGGCAGAGGCTCTTGAAACCGGTAACAAGGCTGTCAGC
GAGGTACAAAGCGATTCAAATGGGGAGATGACAATTACAGCCCAGAGTGATACAATGATAACAGAGGCGAACTTGAAGAAAGAAGAGAAAGATGCAACCGTTACGATCCT
TGGGAATACAGATGTTCAAGGAAGCTTCATTGACTATGGAGTTCCCATTCTCGTTCCTGGTTTGCCTCCAACAACAGCAACAGAAGCATCGAGTCAGTATCAGGATCCGG
ATCAGGATGCACCTCTACATGAACTCGAGGATGATGTTGCCATCAATGGAAATAATAAACTACCTCAGAAACAGGATCCTGATGCTGTTGTGGCTGAACAAGGGATGGAG
GGGAAGTCTAGTTATGAAGAAGATGTCGTCCAAAGTGAAATTATACATGAGGATGACTCAAATAAAATCGAGAATGAAATCATTCAAAATGACATAACATGGGGCCATAA
AACCCTGAAGAAGTTTTTCTCCAGTTTTAGATTGCTTTAGTTCGGACACTGGAAACAAGTTCTCTACAGGTTTCATTTTTAGTTCTGTACCATGTTGTTCCCAAAGAAAA
ATTCAATTTTCAGGGACCCAATTGTGGATGGATGGAAGCTGCTTCAGGGAGAGTTGTACATGTGGTCTTCCATTTCCACCACCTCACTATATTGATGGAATCCGTTCATT
GGTGAATCTGGACTGCAATGGTATGAAAGATTCTGGTACTTTGAGTCTCAGTTCATTTGGGTTGTTTATGTTTATAATTTATTAAAGATGCAAACGTTTGTTTGTTGGCC
TGCGTTTTCAAGTACCATTGTTGTCTATTGTTCTGTTTTCATAAGCCTAATAAATTCTATTCATATCATCAGTCCCCATCATTTCTACCTTTGTTATGTGTTAGAACGTT
TCTTATAGTCCACAAGAGATTCAAATTATTAACCTCTTGGTGTAGGCTACAAATTAGTTGAACTATGGTTGAATTGGTTGTATTATAATTTAGTAATGTATCCAATTTTC
TT
Protein sequenceShow/hide protein sequence
MKTLATYNSIIGNAALPPNFSASSLKERLLSAGPDFISYRRPMKLADSGLQHLAPLRRGGIEFISCFSSHSQVDTQNDEVENQDTNITKTVRVKFQLQKECTFGEHFFLV
GDDPMFGSWDVTNAIPLNWADGHQWTAEVEIPVGKTIQFKFVLQGITGNVEWQPDPDRTFQAWESTNTIIVTEDWDSAESRILCAEEKTVNQDEDSPIVSEKLLVEELTY
INGFKETPVALSETSVAENTLMESQGELIDGSNISGSEENGAFEENTNEDSSVALSDTSVAEKSLMESHGELVDDSIVSALEENTSNVSLSEDNASNIPATKENTVAEAL
ETGNKAVSEVQSDSNGEMTITAQSDTMITEANLKKEEKDATVTILGNTDVQGSFIDYGVPILVPGLPPTTATEASSQYQDPDQDAPLHELEDDVAINGNNKLPQKQDPDA
VVAEQGMEGKSSYEEDVVQSEIIHEDDSNKIENEIIQNDITWGHKTLKKFFSSFRLL