; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019718 (gene) of Snake gourd v1 genome

Gene IDTan0019718
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCBM20 domain-containing protein
Genome locationLG08:73086576..73091900
RNA-Seq ExpressionTan0019718
SyntenyTan0019718
Gene Ontology termsGO:2001070 - starch binding (molecular function)
InterPro domainsIPR002044 - Carbohydrate binding module family 20
IPR013783 - Immunoglobulin-like fold
IPR013784 - Carbohydrate-binding-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580479.1 putative LRR receptor-like serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. sororia]1.7e-17570.1Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE
        MKTLATSNSIIGNN AP  FSASSLKERLL GGPEF+SYRR RKL +SGLQHLV LRRG I  LSCFSS  QADTQN+ +ENQ TNQSKTVRVKFQLQ+E
Subjt:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE

Query:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV
        CTFGE FF+VGDDPSFGSWDVT+AIPLNWADGH W AEVEIP+GK IQFKFVLQG+TGNV WQP PDR FQPWET+NTII+SEDWD A+ RMLSEEE IV
Subjt:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV

Query:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD
        NQD  S +VPEKLMIE                DSS ALAD S+ EKSSVESHE  I G NISASEENGSNVS                    AS+EN KD
Subjt:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD

Query:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK
        IMA NI S KES+ILNTS+K V +VY N NGET I SQ +TK TEE+L+  E + T KI RN DV ESF+++GVP+LVPGLPPTPT SNQDA QHE  VK
Subjt:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK

Query:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI
        DD SI+GINESNDHKLPENI     QDPDVV E EMEAKSSY E+VVQSEIRQED TN+  N+         IVQNDITWGHKTLKKF S+LRL+
Subjt:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI

KAG7017230.1 hypothetical protein SDJN02_19093 [Cucurbita argyrosperma subsp. argyrosperma]8.8e-17770.51Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE
        MKTLATSNSIIGNN AP  FSASSLKERLL GGPEF+SYRR RKL +SGLQHLV LRRG I  LSCFSS  QADTQN+ +ENQDTNQSKTVRVKFQLQ+E
Subjt:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE

Query:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV
        CTFGE FF+VGDDPSFGSWDVT+AIPLNWADGH W AEVEIP+GK IQFKFVLQG+TGNV WQP PDR FQPWET+NTII+SEDWD A+ RMLSEEE IV
Subjt:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV

Query:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD
        NQD  S +VPEKLMIE                DSS ALAD S+ EKSSVESHE  I G NISASEENGSNVS                    AS+EN KD
Subjt:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD

Query:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK
        IMA NI S KES+ILNTS+K V +VYSN NGET I SQ +TK TEE+L+  E + T KI RN DV ESF+++GVP+LVPGLPPTPT SNQDA QHE  VK
Subjt:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK

Query:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI
        DD SI+GINESNDHKLPENI     QDPDVV E EMEAKSSY E+VVQSEIRQED TN+  N+         IVQNDITWGHKTLKKF S+LRL+
Subjt:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI

XP_022934469.1 uncharacterized protein LOC111441639 [Cucurbita moschata]1.7e-17570.1Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE
        MKTLATSNSIIGNN AP  FSASSLKERLL GGPEF+SYRR RKL +SGLQHLV LRRG I  LSCFSS  QADTQN+ +ENQ TNQSKTVRVKFQLQ+E
Subjt:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE

Query:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV
        CTFGE FF+VGDDPSFGSWDVT+AIPLNWADGH W AEVEIP+GK IQFKFVLQG+TGNV WQP PDR FQPWET+NTII+SEDWD A+ RMLSEEE IV
Subjt:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV

Query:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD
        NQD  S +VPEKLMIE                DSS ALAD S+ EKSSVESHE  I G NISASEENGSNVS                    AS+EN KD
Subjt:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD

Query:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK
        IMA NI S KES+ILNTS+K V +VY N NGET I SQ +TK TEE+L+  E + T KI RN DV ESF+++GVP+LVPGLPPTPT SNQDA QHE  VK
Subjt:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK

Query:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI
        DD SI+GINESNDHKLPENI     QDPDVV E EMEAKSSY E+VVQSEIRQED TN+  N+         IVQNDITWGHKTLKKF S+LRL+
Subjt:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI

XP_022983429.1 uncharacterized protein LOC111482035 [Cucurbita maxima]7.7e-17368.69Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE
        MKTLATSNSIIGNN AP  FSAS LKERLL GGPEF+SYRR RKL +SGLQHLV LRRG I  L CFSS  QADTQN+ +ENQDTNQSKTVRVKFQLQ+E
Subjt:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE

Query:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV
        CTFGE FF+VGDDPSFGSWDVT+AIPLNWADGH W AEVEIP+GK IQFKFVLQG+TGNV WQP PDRTFQPWET+NTII+SEDWD AE R+L EEE I+
Subjt:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV

Query:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD
        NQD+ S +V EKLMIE                DS  ALAD S+ EKSSVESHE  I G NISASEENGSNVS                    AS+EN KD
Subjt:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD

Query:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK
        IM  NI SPKES+ILNTS+K+V +VYSN NGET I SQ +TK  EE+L+  E + T KI RN DV ESF+++GVP+LVPGLPPTPT SNQDA QHE  V+
Subjt:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK

Query:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI
        DD SI+GINESNDHKLPENI     QDPDVV E EME KSSY E+VVQSEIRQED TN+  N+         IV+NDITWGHKTLKKF S+LRL+
Subjt:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI

XP_038906171.1 uncharacterized protein LOC120092050 [Benincasa hispida]8.0e-17871.72Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE
        MK LATS SII N+T   YF A SLKERLLSGGPEFISYRRP KLA  GL+HLVP RRG I+L+SCFSS  QADTQND +ENQ+TNQSKTVRVKFQLQ+E
Subjt:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE

Query:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV
        CTFGE FF+VGDDP FGSWDV++AIPLNWADGH+W AEVEIP+GKTIQFKF+LQG TGNV WQP PDRTF+PWET+NTII+SEDWD AE R+ S EEKIV
Subjt:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV

Query:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD
        NQ++DSSI  EKL+I+ENLTYPN+ELI  TNKD        S+AEK SV    ESI+GSNISASEENGSN+SA EENASNVSLSEDN S+I  SKENA+ 
Subjt:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD

Query:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK
        ++A+NISSPKESFILNTS+K+V +V+SNSNGET ITS+ DTKITEEIL+ DE D  V    N  V ESF++ GVPILVPGLPPTPT SNQ A  +E  VK
Subjt:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK

Query:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEN---------DIVQNDITWGHKTLKKFLSNLRLI
        DD SI+GIN++ND  LPENIQ NQK DPDV+A QEME KSSY       EIRQED TN  EN         DIVQNDITWGHKTLKKFLS+LRL+
Subjt:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEN---------DIVQNDITWGHKTLKKFLSNLRLI

TrEMBL top hitse value%identityAlignment
A0A0A0LA83 CBM20 domain-containing protein1.6e-14763.62Show/hide
Query:  MKTLATSNSIIGNNTAPPYF--SASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLL-SCFSSQSQADT-QNDEIENQDTNQSKTVRVKFQ
        MKTL T NSII N +   YF  S+SSLKERLLSGGPEFISYRRP KLA SGLQHLVPLRRG I+ + SCF+S  QADT QND +ENQ+T+QSKTVRVKFQ
Subjt:  MKTLATSNSIIGNNTAPPYF--SASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLL-SCFSSQSQADT-QNDEIENQDTNQSKTVRVKFQ

Query:  LQRECTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEE
        L +ECTFGE F++VGDDP FGSWDVT+AIPLNWADGH+W AEV+IP+GK IQFKF+LQG TGNV WQP PDRTFQPWET+NTII+SEDWD AE R+LSEE
Subjt:  LQRECTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEE

Query:  EKIVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKE
        EKIVNQ++DS I PE LM E+NLTYP++ELI    KD        S+A K SV    E I+GSNISA EENG N+SA EEN +NVSL E + S+I  S +
Subjt:  EKIVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKE

Query:  NAKDIMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHE
        NAKD++A NI           S+K+V +VY +           DTKITEE L   ENDA     ++  V ES +D  VPILVPGLPPT T SNQ+A  HE
Subjt:  NAKDIMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHE

Query:  VVVKDDDSINGINESNDHKLPE--NIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTENDIVQNDITWGHKTLKKFLSNLRLI
          V+DD S+ GINESNDHKLPE  NIQ NQK DP+VVA QEMEAKSSY +D   + I  +    +  ND+VQND+TWGHKTLKKFLS+LRL+
Subjt:  VVVKDDDSINGINESNDHKLPE--NIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTENDIVQNDITWGHKTLKKFLSNLRLI

A0A5D3DMY0 Carbohydrate-binding-like fold, putative isoform 22.0e-13459Show/hide
Query:  MKTLATSNSIIGNNTAPPYF----SASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQAD-TQNDEIENQDTNQSKTVRVKF
        MKTL TSNSII N +   YF    S+SS+KERLLS GPEFISYRRP KLA SGLQH VPLRRG I+ +SCFSS  QAD  Q+D +ENQ+T+QSKTVRVKF
Subjt:  MKTLATSNSIIGNNTAPPYF----SASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQAD-TQNDEIENQDTNQSKTVRVKF

Query:  QLQRECTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSE
        QLQ+ECTFGE FF+VGDDP FGSWDVT+AIPLNWADGH+W AEV+IP+GK IQFKF+LQG TGNVEWQP PDRTFQPWET+NTII+SEDWD AE R+LSE
Subjt:  QLQRECTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSE

Query:  EEKIVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASK
        EEKIVNQ++ S I PE LM+E NLTYPN+ELI  TNKD        S+A K SV    ESI+GSNI A EENG N+SA EEN SNVSL   N S+I  S 
Subjt:  EEKIVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASK

Query:  ENAKDIMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQH
        E                                              IT+EIL+ D  D  V+        ES +D  VPILVPGLPP            
Subjt:  ENAKDIMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQH

Query:  EVVVKDDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTE---------NDIVQNDITWGHKTLKKFLSNLRLI
           V+ D S++GINESNDHKLPE+   N ++DP+VVA QEME KSSY       EIRQED TN TE         NDIVQNDITWGHKTLKKFLS+LRL+
Subjt:  EVVVKDDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTE---------NDIVQNDITWGHKTLKKFLSNLRLI

A0A6J1CVP4 uncharacterized protein LOC1110150012.5e-16966.53Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAS--SLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQ
        M+TLATSNSII NNTAPP FSAS  SL+ERLL GGPEFISYR P K A+SGLQHL  LRRG I   +  SS +Q DTQND +ENQDTNQ KTVRVKFQLQ
Subjt:  MKTLATSNSIIGNNTAPPYFSAS--SLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQ

Query:  RECTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEK
        +ECTFGEQF +VGDDP  GSW+VT+AIPLNWADGH+W AEVEIP+GKTIQFKFVLQGKTGNV WQP PDRTFQPWETTNTI++SEDWD  E   L+EEEK
Subjt:  RECTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEK

Query:  IVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENA
        +VNQ++DS IV E LMI +    PN+ LI+ TNK+ SVAL DTS+AEKSSVESHEE I+ S ISAS+ENGS++SAL+E+A N+SL E+N S+I A    A
Subjt:  IVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENA

Query:  KDIMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVV
        K+I+A+NIS  KESFILN+S+K V +VYSNSNGE+  T Q DTKITE I +  E  ATVKIL N DV ES ++  VPILVPGLPPTPT SN+ A QHE  
Subjt:  KDIMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVV

Query:  VKDDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDV--------VQSEIRQEDYTNQTE---------NDIVQNDITWGHKTLKKFLS
        V+ D SINGINESN H+LPEN+QMN KQ P +VAE+E+EAK SY ED         +QSEIRQ+D  N+ E         NDI++ND+TWGHKTL K L+
Subjt:  VKDDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDV--------VQSEIRQEDYTNQTE---------NDIVQNDITWGHKTLKKFLS

Query:  NLRLI
        NL+ +
Subjt:  NLRLI

A0A6J1F2P2 uncharacterized protein LOC1114416398.1e-17670.1Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE
        MKTLATSNSIIGNN AP  FSASSLKERLL GGPEF+SYRR RKL +SGLQHLV LRRG I  LSCFSS  QADTQN+ +ENQ TNQSKTVRVKFQLQ+E
Subjt:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE

Query:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV
        CTFGE FF+VGDDPSFGSWDVT+AIPLNWADGH W AEVEIP+GK IQFKFVLQG+TGNV WQP PDR FQPWET+NTII+SEDWD A+ RMLSEEE IV
Subjt:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV

Query:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD
        NQD  S +VPEKLMIE                DSS ALAD S+ EKSSVESHE  I G NISASEENGSNVS                    AS+EN KD
Subjt:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD

Query:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK
        IMA NI S KES+ILNTS+K V +VY N NGET I SQ +TK TEE+L+  E + T KI RN DV ESF+++GVP+LVPGLPPTPT SNQDA QHE  VK
Subjt:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK

Query:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI
        DD SI+GINESNDHKLPENI     QDPDVV E EMEAKSSY E+VVQSEIRQED TN+  N+         IVQNDITWGHKTLKKF S+LRL+
Subjt:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI

A0A6J1J7C1 uncharacterized protein LOC1114820353.7e-17368.69Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE
        MKTLATSNSIIGNN AP  FSAS LKERLL GGPEF+SYRR RKL +SGLQHLV LRRG I  L CFSS  QADTQN+ +ENQDTNQSKTVRVKFQLQ+E
Subjt:  MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRE

Query:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV
        CTFGE FF+VGDDPSFGSWDVT+AIPLNWADGH W AEVEIP+GK IQFKFVLQG+TGNV WQP PDRTFQPWET+NTII+SEDWD AE R+L EEE I+
Subjt:  CTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIV

Query:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD
        NQD+ S +V EKLMIE                DS  ALAD S+ EKSSVESHE  I G NISASEENGSNVS                    AS+EN KD
Subjt:  NQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKD

Query:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK
        IM  NI SPKES+ILNTS+K+V +VYSN NGET I SQ +TK  EE+L+  E + T KI RN DV ESF+++GVP+LVPGLPPTPT SNQDA QHE  V+
Subjt:  IMAKNISSPKESFILNTSSKSVGKVYSNSNGETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVK

Query:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI
        DD SI+GINESNDHKLPENI     QDPDVV E EME KSSY E+VVQSEIRQED TN+  N+         IV+NDITWGHKTLKKF S+LRL+
Subjt:  DDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKSSYSEDVVQSEIRQEDYTNQTEND---------IVQNDITWGHKTLKKFLSNLRLI

SwissProt top hitse value%identityAlignment
P08704 Cyclomaltodextrin glucanotransferase9.0e-0730.56Show/hide
Query:  TQNDEIENQDTNQSKTVRVKFQLQRECTF-GEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGK---TGNVEWQPDPDRTF
        +Q+D+ EN  T QS    + F      T  G+  +I+G+ P  G WD+T A+ ++     +W+A +E+P    +++K V + +   T NVEWQ   +  F
Subjt:  TQNDEIENQDTNQSKTVRVKFQLQRECTF-GEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGK---TGNVEWQPDPDRTF

Query:  QPWETTNT
           +T  T
Subjt:  QPWETTNT

P0DN29 Glucoamylase ARB_02327-14.6e-1139.02Show/hide
Query:  VKFQLQRECTFGEQFFIVGDDPSFGSWDVTNAIPLN---WADG-HEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTF
        V+F+L      GE  F+VG  P  GSWDV  A+PLN   +AD  H+W  ++E+P     ++KF+ + + G V W+ DP+R +
Subjt:  VKFQLQRECTFGEQFFIVGDDPSFGSWDVTNAIPLN---WADG-HEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTF

P30270 Alpha-amylase5.8e-0625.27Show/hide
Query:  FQLQRECTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDW
        F +     +GE  ++ GD  + G+WD   A+ L+ A    W  +V +  G   Q+K++ +   G   W+   +RT     TT  + +++ W
Subjt:  FQLQRECTFGEQFFIVGDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDW

P31797 Cyclomaltodextrin glucanotransferase1.1e-0732.26Show/hide
Query:  SSQSQADTQNDEIENQDTNQSKTVRVKFQLQRECT-FGEQFFIVGDDPSFGSWDVTNAIPLNW----ADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEW
        S Q+ A   N E+   D      V V+F +    T  G+  +IVG+    G+WD + AI   +         W  +V +P GKTI+FKF+ +   GNV W
Subjt:  SSQSQADTQNDEIENQDTNQSKTVRVKFQLQRECT-FGEQFFIVGDDPSFGSWDVTNAIPLNW----ADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEW

Query:  QPDPDRTF-QPWETTNTIIISEDW
        +   +  +  P  TT  II+  DW
Subjt:  QPDPDRTF-QPWETTNTIIISEDW

P36914 Glucoamylase1.4e-0732.94Show/hide
Query:  TVRVKFQLQRECTFGEQFFIVGDDPSFGSWDVTNAIPLN----WADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTF
        TV V F ++    +GE   IVG     GSW+ ++A  LN      D   WT  + +P G++ ++KF+ + + G V W+ DP+R +
Subjt:  TVRVKFQLQRECTFGEQFFIVGDDPSFGSWDVTNAIPLN----WADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTF

Arabidopsis top hitse value%identityAlignment
AT5G01260.1 Carbohydrate-binding-like fold8.6e-3739.22Show/hide
Query:  ISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRECTFGEQFFIVGDDPSFGS-WDVTNAIPLNWADGHEW
        I + R     +S +   VPLR  +I         SQ + +++EIE      +KTVRV+FQL++EC FGE FFIVGDDP FG  WD   A+PLNW+DG+ W
Subjt:  ISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRECTFGEQFFIVGDDPSFGS-WDVTNAIPLNWADGHEW

Query:  TAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSS
        T ++++P+G+ ++FK +L+ +TG + WQP P+R  + WET  TI I EDWD A+L+M+ EE+  V     SSI  E            DE++    ++SS
Subjt:  TAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSS

Query:  VA-------LADTSMAEKSSVESHEESIEGSN
        V        ++D S    S     E+++E SN
Subjt:  VA-------LADTSMAEKSSVESHEESIEGSN

AT5G01260.2 Carbohydrate-binding-like fold3.8e-3730.62Show/hide
Query:  ISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRECTFGEQFFIVGDDPSFGS-WDVTNAIPLNWADGHEW
        I + R     +S +   VPLR  +I         SQ + +++EIE      +KTVRV+FQL++EC FGE FFIVGDDP FG  WD   A+PLNW+DG+ W
Subjt:  ISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRECTFGEQFFIVGDDPSFGS-WDVTNAIPLNWADGHEW

Query:  TAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSS
        T ++++P+G+ ++FK +L+ +TG + WQP P+R  + WET  TI I EDWD A+L+M+ EE+  V     SSI  E            DE++    ++SS
Subjt:  TAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIVNQDKDSSIVPEKLMIEENLTYPNDELINKTNKDSS

Query:  VALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKDIMAKNISSPKESFILNTSSKSVGKVYSNSNGETRI
        V                           + EN   VS  +E+A N S S                I ++    P                   SNG    
Subjt:  VALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKDIMAKNISSPKESFILNTSSKSVGKVYSNSNGETRI

Query:  TSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVKDDDSINGINESNDHKLPENIQMNQKQDPDVVAEQE
                  E++KE                  F +   P+LVPGL P   +S+ D  Q EV          INE      PE   +++KQ+P     ++
Subjt:  TSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVKDDDSINGINESNDHKLPENIQMNQKQDPDVVAEQE

Query:  MEAKS------SYSEDVVQSEIRQEDYTNQ-----------TENDIVQNDITWGHKTLKKFLSNLRL
         + K+      S  E V   E RQ +   +           T + + +NDI WG +TL K LSN RL
Subjt:  MEAKS------SYSEDVVQSEIRQEDYTNQ-----------TENDIVQNDITWGHKTLKKFLSNLRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCCTAGCGACCTCCAACTCTATCATCGGCAACAATACGGCTCCTCCTTACTTCTCTGCTTCTTCTTTGAAAGAGCGTCTTCTTTCTGGAGGACCTGAATTCAT
CTCTTATCGGAGGCCGCGGAAATTGGCTGCTTCTGGACTTCAGCATTTGGTGCCTTTGCGCCGGGGAGCCATCAACTTGCTTTCTTGCTTCTCGTCTCAATCGCAGGCAG
ATACTCAGAATGATGAAATTGAGAATCAAGATACAAATCAATCAAAGACCGTTCGCGTCAAATTCCAACTACAGAGAGAGTGCACGTTTGGGGAGCAATTCTTTATAGTA
GGTGATGATCCAAGTTTTGGTTCCTGGGACGTTACAAATGCAATACCTTTAAACTGGGCAGATGGGCATGAATGGACAGCTGAAGTGGAGATTCCTATTGGAAAAACTAT
CCAGTTCAAATTCGTGCTTCAAGGAAAAACTGGAAATGTTGAATGGCAACCTGATCCTGACCGAACATTCCAACCCTGGGAAACAACTAATACAATCATCATTTCTGAAG
ATTGGGATTGTGCTGAATTACGGATGTTAAGTGAAGAAGAAAAAATTGTTAACCAGGATAAGGATTCTTCCATTGTCCCAGAAAAGTTAATGATCGAGGAGAACCTCACT
TATCCAAACGACGAACTGATCAACAAAACAAATAAGGATTCATCAGTTGCACTTGCCGATACTTCAATGGCAGAAAAATCATCAGTGGAGTCACATGAAGAATCGATTGA
GGGCAGTAATATATCCGCTTCAGAAGAAAATGGCAGTAATGTCTCTGCATTAGAAGAGAATGCCAGTAACGTTTCTCTTTCAGAGGACAACACTAGCAACATTCCTGCTT
CAAAAGAGAATGCCAAAGATATCATGGCAAAGAATATAAGCTCCCCGAAGGAGAGCTTCATTTTGAATACAAGTAGCAAGTCCGTTGGCAAGGTATACAGCAATTCAAAT
GGGGAGACAAGAATTACATCCCAGTGTGATACAAAGATAACAGAGGAAATTTTGAAGGAAGATGAGAACGATGCAACCGTTAAGATCCTTCGGAATACAGATGTTCTAGA
AAGCTTCATGGACCATGGAGTTCCCATTCTAGTTCCTGGTTTACCTCCAACACCAACAATATCAAATCAGGATGCATCTCAACATGAAGTCGTGGTCAAAGATGATGATT
CCATCAATGGAATTAATGAATCTAACGATCATAAACTACCTGAGAACATTCAGATGAATCAGAAACAGGATCCTGATGTTGTGGCTGAACAAGAGATGGAGGCGAAGTCA
AGCTATAGCGAAGATGTCGTCCAAAGTGAAATTAGACAGGAGGATTACACAAATCAAACTGAAAATGATATCGTTCAAAATGACATAACATGGGGTCATAAAACCCTGAA
GAAGTTCCTCTCCAATTTAAGATTGATTTAG
mRNA sequenceShow/hide mRNA sequence
CACTGCTATATAATGGTAATTGGCAACCACCAGAAACTCACCAGAAAGTTAGAAACTCCATTAAAGGCGCGAGGCCTAAGCTCTGAGAGTGTTTGTCATTGTCAGTAGCA
GACTGAGATTCTGAGAGTGAGAGTGATGAAAACCCTAGCGACCTCCAACTCTATCATCGGCAACAATACGGCTCCTCCTTACTTCTCTGCTTCTTCTTTGAAAGAGCGTC
TTCTTTCTGGAGGACCTGAATTCATCTCTTATCGGAGGCCGCGGAAATTGGCTGCTTCTGGACTTCAGCATTTGGTGCCTTTGCGCCGGGGAGCCATCAACTTGCTTTCT
TGCTTCTCGTCTCAATCGCAGGCAGATACTCAGAATGATGAAATTGAGAATCAAGATACAAATCAATCAAAGACCGTTCGCGTCAAATTCCAACTACAGAGAGAGTGCAC
GTTTGGGGAGCAATTCTTTATAGTAGGTGATGATCCAAGTTTTGGTTCCTGGGACGTTACAAATGCAATACCTTTAAACTGGGCAGATGGGCATGAATGGACAGCTGAAG
TGGAGATTCCTATTGGAAAAACTATCCAGTTCAAATTCGTGCTTCAAGGAAAAACTGGAAATGTTGAATGGCAACCTGATCCTGACCGAACATTCCAACCCTGGGAAACA
ACTAATACAATCATCATTTCTGAAGATTGGGATTGTGCTGAATTACGGATGTTAAGTGAAGAAGAAAAAATTGTTAACCAGGATAAGGATTCTTCCATTGTCCCAGAAAA
GTTAATGATCGAGGAGAACCTCACTTATCCAAACGACGAACTGATCAACAAAACAAATAAGGATTCATCAGTTGCACTTGCCGATACTTCAATGGCAGAAAAATCATCAG
TGGAGTCACATGAAGAATCGATTGAGGGCAGTAATATATCCGCTTCAGAAGAAAATGGCAGTAATGTCTCTGCATTAGAAGAGAATGCCAGTAACGTTTCTCTTTCAGAG
GACAACACTAGCAACATTCCTGCTTCAAAAGAGAATGCCAAAGATATCATGGCAAAGAATATAAGCTCCCCGAAGGAGAGCTTCATTTTGAATACAAGTAGCAAGTCCGT
TGGCAAGGTATACAGCAATTCAAATGGGGAGACAAGAATTACATCCCAGTGTGATACAAAGATAACAGAGGAAATTTTGAAGGAAGATGAGAACGATGCAACCGTTAAGA
TCCTTCGGAATACAGATGTTCTAGAAAGCTTCATGGACCATGGAGTTCCCATTCTAGTTCCTGGTTTACCTCCAACACCAACAATATCAAATCAGGATGCATCTCAACAT
GAAGTCGTGGTCAAAGATGATGATTCCATCAATGGAATTAATGAATCTAACGATCATAAACTACCTGAGAACATTCAGATGAATCAGAAACAGGATCCTGATGTTGTGGC
TGAACAAGAGATGGAGGCGAAGTCAAGCTATAGCGAAGATGTCGTCCAAAGTGAAATTAGACAGGAGGATTACACAAATCAAACTGAAAATGATATCGTTCAAAATGACA
TAACATGGGGTCATAAAACCCTGAAGAAGTTCCTCTCCAATTTAAGATTGATTTAGCATCACAACTTTATTCAGATTCAGACACTGGAAAAAAGACTTCCAGATTTCCTT
TTTTGTGTACATATGGTCTACCACTTCAATTGTTGTATGCTACATGGGTTGTAAATACAAAGACGACTATGAAATCGGTTCATTGGCGAATCTGGACTTGCAATGGTATG
AAAGATTCTGGTAATTAGAGTCTCTTCTTAAATCTTAGTCATTTGGATTGTATATGTTCAATTCTATAAATACAAAAACGTTCGTTAGTAGCCTGCATTTTAAAGTACCC
TTCTTGTCATATATTGTCTTGTTTTCAAGGCTTCATATATTCTGTTTCTATCATGAACCTCACCATTTCTACCTTCATGCCACATTAGAAATTTTTTAGTTCAACATGCA
TATGAGAATCACATCTAGTTAAACTATAATGCGC
Protein sequenceShow/hide protein sequence
MKTLATSNSIIGNNTAPPYFSASSLKERLLSGGPEFISYRRPRKLAASGLQHLVPLRRGAINLLSCFSSQSQADTQNDEIENQDTNQSKTVRVKFQLQRECTFGEQFFIV
GDDPSFGSWDVTNAIPLNWADGHEWTAEVEIPIGKTIQFKFVLQGKTGNVEWQPDPDRTFQPWETTNTIIISEDWDCAELRMLSEEEKIVNQDKDSSIVPEKLMIEENLT
YPNDELINKTNKDSSVALADTSMAEKSSVESHEESIEGSNISASEENGSNVSALEENASNVSLSEDNTSNIPASKENAKDIMAKNISSPKESFILNTSSKSVGKVYSNSN
GETRITSQCDTKITEEILKEDENDATVKILRNTDVLESFMDHGVPILVPGLPPTPTISNQDASQHEVVVKDDDSINGINESNDHKLPENIQMNQKQDPDVVAEQEMEAKS
SYSEDVVQSEIRQEDYTNQTENDIVQNDITWGHKTLKKFLSNLRLI