; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027250 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027250
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCBM20 domain-containing protein
Genome locationchr10:46191713..46194697
RNA-Seq ExpressionLag0027250
SyntenyLag0027250
Gene Ontology termsGO:2001070 - starch binding (molecular function)
InterPro domainsIPR002044 - Carbohydrate binding module family 20
IPR013783 - Immunoglobulin-like fold
IPR013784 - Carbohydrate-binding-like fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580479.1 putative LRR receptor-like serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. sororia]2.0e-17370.52Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE
        MKTLATSNSIIGNN AP  FSA+SLKERLL GGPEF+SYRR RK   SGLQHLV LRRG I+ LSCFSS  QADTQ + +ENQ TNQSKTVRVKF LQKE
Subjt:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTIIVSEDWDSA+SR+LSEEE I 
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID

Query:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP
        NQD+ SP+VPEKL+IE                DSS ALAD SI E+SSVES +E +I G NI A EENGSNV          SASEEN KD++A NI + 
Subjt:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP

Query:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES
        KES+ILNTSN+ +SEVY N NGETTI SQS+TK TEE+L+N EK+ T KI  N +VQ SFINYGVP+LVPGLPPTP TSNQ APQHEV+ D SI+GINES
Subjt:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES

Query:  DDHKPPEEKSSYKEDDDEDYVLQSEV---------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        +DHK PE         D D V++ E+         V+QSE RQEDDTNKI N+SDLQE+N+ IVQNDI WGHKTLKKFFSSLRLL
Subjt:  DDHKPPEEKSSYKEDDDEDYVLQSEV---------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

KAG7017230.1 hypothetical protein SDJN02_19093 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-17470.93Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE
        MKTLATSNSIIGNN AP  FSA+SLKERLL GGPEF+SYRR RK   SGLQHLV LRRG I+ LSCFSS  QADTQ + +ENQDTNQSKTVRVKF LQKE
Subjt:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTIIVSEDWDSA+SR+LSEEE I 
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID

Query:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP
        NQD+ SP+VPEKL+IE                DSS ALAD SI E+SSVES +E +I G NI A EENGSNV          SASEEN KD++A NI + 
Subjt:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP

Query:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES
        KES+ILNTSN+ +SEVYSN NGETTI SQS+TK TEE+L+N EK+ T KI  N +VQ SFINYGVP+LVPGLPPTP TSNQ APQHEV+ D SI+GINES
Subjt:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES

Query:  DDHKPPEEKSSYKEDDDEDYVLQSEV---------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        +DHK PE         D D V++ E+         V+QSE RQEDDTNKI N+SDLQE+N+ IVQNDI WGHKTLKKFFSSLRLL
Subjt:  DDHKPPEEKSSYKEDDDEDYVLQSEV---------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

XP_022934469.1 uncharacterized protein LOC111441639 [Cucurbita moschata]2.0e-17370.52Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE
        MKTLATSNSIIGNN AP  FSA+SLKERLL GGPEF+SYRR RK   SGLQHLV LRRG I+ LSCFSS  QADTQ + +ENQ TNQSKTVRVKF LQKE
Subjt:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTIIVSEDWDSA+SR+LSEEE I 
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID

Query:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP
        NQD+ SP+VPEKL+IE                DSS ALAD SI E+SSVES +E +I G NI A EENGSNV          SASEEN KD++A NI + 
Subjt:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP

Query:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES
        KES+ILNTSN+ +SEVY N NGETTI SQS+TK TEE+L+N EK+ T KI  N +VQ SFINYGVP+LVPGLPPTP TSNQ APQHEV+ D SI+GINES
Subjt:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES

Query:  DDHKPPEEKSSYKEDDDEDYVLQSEV---------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        +DHK PE         D D V++ E+         V+QSE RQEDDTNKI N+SDLQE+N+ IVQNDI WGHKTLKKFFSSLRLL
Subjt:  DDHKPPEEKSSYKEDDDEDYVLQSEV---------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

XP_022983429.1 uncharacterized protein LOC111482035 [Cucurbita maxima]1.5e-17371.43Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE
        MKTLATSNSIIGNN AP  FSA+ LKERLL GGPEF+SYRR RK   SGLQHLV LRRG I+ L CFSS  QADTQ + +ENQDTNQSKTVRVKF LQKE
Subjt:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDRTFQPWET+NTIIVSEDWDSAESRIL EEE I 
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID

Query:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP
        NQDE SP+V EKL+IE                DS  ALAD SI E+SSVES +E MI G NI A EENGSNV          SASEEN KD++  NI +P
Subjt:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP

Query:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES
        KES+ILNTSN+A+SEVYSN NGETTI SQS+TK  EE+L+N EK+ T KI  N +VQ SFINYGVP+LVPGLPPTP TSNQ APQHEVE D SI+GINES
Subjt:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES

Query:  DDHKPPEEKSSYKEDDDEDYVLQSEV-------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        +DHK PE      +D D    L+ EV       V+QSE RQEDDTNKI N+SDLQE+N  IV+NDI WGHKTLKKFFSSLRLL
Subjt:  DDHKPPEEKSSYKEDDDEDYVLQSEV-------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

XP_038906171.1 uncharacterized protein LOC120092050 [Benincasa hispida]4.3e-17672.6Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE
        MK LATS SII N+T   YF A SLKERLLSGGPEFISYRRP K A  GL+HLVP RRG IDL+SCFSS  QADTQ DA+ENQ+TNQSKTVRVKF LQKE
Subjt:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID
        CTFGEHFFVVGDDPIFGSWDV+SAIPLNWADGHQW AEVEIPVGKTIQFKF+LQG TGNVVWQPGPDRTF+PWET+NTIIVSEDWDSAESRI S EEKI 
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID

Query:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASE----------ENAK
        NQ+EDS I  EKL+I+ENLTYPNEELI NTNKD        SIAE+ SVES     IDGSNI A EENGSN+S SE N SN+S SE          ENA+
Subjt:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASE----------ENAK

Query:  DLVAENISNPKESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVEC
         LVAENIS+PKESFILNTSN+A+SEV+SNSNGETTITS+SDTKITEEIL+NDEKD  +    N  VQ SF+N GVPILVPGLPPTP TSNQ AP +EV+ 
Subjt:  DLVAENISNPKESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVEC

Query:  DDSINGINESDDHKPPEEKSSYKEDDDEDYVLQSEVVIQS---ETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        D SI+GIN+++D   PE      +  D D +   E+ ++S   E RQEDDTN IEN+SDLQEIN DIVQNDI WGHKTLKKF SSLRLL
Subjt:  DDSINGINESDDHKPPEEKSSYKEDDDEDYVLQSEVVIQS---ETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

TrEMBL top hitse value%identityAlignment
A0A0A0LA83 CBM20 domain-containing protein1.1e-15065.1Show/hide
Query:  MKTLATSNSIIGNNTAPPYF--SAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLL-SCFSSLPQADT-QTDAIENQDTNQSKTVRVKFL
        MKTL T NSII N +   YF  S++SLKERLLSGGPEFISYRRP K A SGLQHLVPLRRG ID + SCF+S  QADT Q DA+ENQ+T+QSKTVRVKF 
Subjt:  MKTLATSNSIIGNNTAPPYF--SAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLL-SCFSSLPQADT-QTDAIENQDTNQSKTVRVKFL

Query:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEE
        L KECTFGEHF+VVGDDPIFGSWDVTSAIPLNWADGHQW AEV+IPVGK IQFKF+LQG TGNVVWQPGPDRTFQPWET+NTIIVSEDWDSAESRILSEE
Subjt:  LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEE

Query:  EKIDNQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSN----------ISASE
        EKI NQ+EDSPI PE L+ E+NLTYP+EELI N  KD        SIA + SV     E+IDGSNI ALEENG N+S SE N +N          IS S 
Subjt:  EKIDNQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSN----------ISASE

Query:  ENAKDLVAENISNPKESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQH
        +NAKDLVA NI           SN+A+SEVY +           DTKITEE L+ND KD          VQ S ++  VPILVPGLPPT   SNQ AP H
Subjt:  ENAKDLVAENISNPKESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQH

Query:  EVECDDSINGINESDDHKPPEEKSSYKEDDDEDYVLQSEVVIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        EVE D S+ GINES+DHK PE ++  K    +  V+  +  +++++  EDDTN IENQSDLQEINND+VQND+ WGHKTLKKF SSLRLL
Subjt:  EVECDDSINGINESDDHKPPEEKSSYKEDDDEDYVLQSEVVIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

A0A5D3DMY0 Carbohydrate-binding-like fold, putative isoform 22.4e-14062.6Show/hide
Query:  MKTLATSNSIIGNNTAPPYF----SAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQAD-TQTDAIENQDTNQSKTVRVKF
        MKTL TSNSII N +   YF    S++S+KERLLS GPEFISYRRP K A SGLQH VPLRRG ID +SCFSS  QAD  Q+DA+ENQ+T+QSKTVRVKF
Subjt:  MKTLATSNSIIGNNTAPPYF----SAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQAD-TQTDAIENQDTNQSKTVRVKF

Query:  LLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSE
         LQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQW AEV+IPVGK IQFKF+LQG TGNV WQPGPDRTFQPWET+NTIIVSEDWDSAESRILSE
Subjt:  LLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSE

Query:  EEKIDNQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAE
        EEKI NQ+E SPI PE L++E NLTYPNEELI NTNKD        SIA + SVES     IDGSNI ALEENG N+S SE N SN+S    N   +   
Subjt:  EEKIDNQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAE

Query:  NISNPKESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSIN
                              S+SN           +IT+EIL+ND +D          VQ S ++  VPILVPGLPP            +VE D S++
Subjt:  NISNPKESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSIN

Query:  GINESDDHKPPEEKSSYKEDDDEDYVLQSEVVIQS---ETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        GINES+DHK PE ++  K   D + V   E+  +S   E RQEDDTN  ENQSDLQEINNDIVQNDI WGHKTLKKF SSLRLL
Subjt:  GINESDDHKPPEEKSSYKEDDDEDYVLQSEVVIQS---ETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

A0A6J1CVP4 uncharacterized protein LOC1110150011.4e-16967.79Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAA--SLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQ
        M+TLATSNSII NNTAPP FSA+  SL+ERLL GGPEFISYR P K A SGLQHL  LRRG I   +  SS  Q DTQ DA+ENQDTNQ KTVRVKF LQ
Subjt:  MKTLATSNSIIGNNTAPPYFSAA--SLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQ

Query:  KECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEK
        KECTFGE F VVGDDP+ GSW+VTSAIPLNWADGHQW AEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTI+VSEDWDS ES  L+EEEK
Subjt:  KECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEK

Query:  IDNQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEEN------AKDL
        + NQ+EDSPIV E L+I +    PNE LIHNTNK+ SVAL DTSIAE+SSVES +EE+ID S I A +ENGS++S  + +  NIS  EEN      AK++
Subjt:  IDNQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEEN------AKDL

Query:  VAENISNPKESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDD
        VAENIS  KESFILN+SN+ +SEVYSNSNGE+T T QSDTKITE I ++ EK +T+KIL N +VQ S IN  VPILVPGLPPTP TSN+AAPQHEVE D 
Subjt:  VAENISNPKESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDD

Query:  SINGINESDDHKPPEE-------------------KSSYKEDDDEDYVLQSEVVIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSL
        SINGINES+ H+ PE                    K SY ED +ED     E  +QSE RQ+DD NKIEN+SDLQEINNDI++ND+ WGHKTL K  ++L
Subjt:  SINGINESDDHKPPEE-------------------KSSYKEDDDEDYVLQSEVVIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSL

Query:  RLL
        + L
Subjt:  RLL

A0A6J1F2P2 uncharacterized protein LOC1114416399.6e-17470.52Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE
        MKTLATSNSIIGNN AP  FSA+SLKERLL GGPEF+SYRR RK   SGLQHLV LRRG I+ LSCFSS  QADTQ + +ENQ TNQSKTVRVKF LQKE
Subjt:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDR FQPWET+NTIIVSEDWDSA+SR+LSEEE I 
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID

Query:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP
        NQD+ SP+VPEKL+IE                DSS ALAD SI E+SSVES +E +I G NI A EENGSNV          SASEEN KD++A NI + 
Subjt:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP

Query:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES
        KES+ILNTSN+ +SEVY N NGETTI SQS+TK TEE+L+N EK+ T KI  N +VQ SFINYGVP+LVPGLPPTP TSNQ APQHEV+ D SI+GINES
Subjt:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES

Query:  DDHKPPEEKSSYKEDDDEDYVLQSEV---------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        +DHK PE         D D V++ E+         V+QSE RQEDDTNKI N+SDLQE+N+ IVQNDI WGHKTLKKFFSSLRLL
Subjt:  DDHKPPEEKSSYKEDDDEDYVLQSEV---------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

A0A6J1J7C1 uncharacterized protein LOC1114820357.4e-17471.43Show/hide
Query:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE
        MKTLATSNSIIGNN AP  FSA+ LKERLL GGPEF+SYRR RK   SGLQHLV LRRG I+ L CFSS  QADTQ + +ENQDTNQSKTVRVKF LQKE
Subjt:  MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKE

Query:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID
        CTFGEHFFVVGDDP FGSWDVTSAIPLNWADGH W AEVEIPVGK IQFKFVLQG+TGNVVWQPGPDRTFQPWET+NTIIVSEDWDSAESRIL EEE I 
Subjt:  CTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKID

Query:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP
        NQDE SP+V EKL+IE                DS  ALAD SI E+SSVES +E MI G NI A EENGSNV          SASEEN KD++  NI +P
Subjt:  NQDEDSPIVPEKLIIEENLTYPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNP

Query:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES
        KES+ILNTSN+A+SEVYSN NGETTI SQS+TK  EE+L+N EK+ T KI  N +VQ SFINYGVP+LVPGLPPTP TSNQ APQHEVE D SI+GINES
Subjt:  KESFILNTSNEAISEVYSNSNGETTITSQSDTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINES

Query:  DDHKPPEEKSSYKEDDDEDYVLQSEV-------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL
        +DHK PE      +D D    L+ EV       V+QSE RQEDDTNKI N+SDLQE+N  IV+NDI WGHKTLKKFFSSLRLL
Subjt:  DDHKPPEEKSSYKEDDDEDYVLQSEV-------VIQSETRQEDDTNKIENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL

SwissProt top hitse value%identityAlignment
O30565 Cyclomaltodextrin glucanotransferase2.3e-0729.2Show/hide
Query:  DTQTDAIENQDTNQSKTVRVKFLLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDR
        +T++ A E  +      V V+F +    T  G + ++VG+    G+WD   AI P+     ++   W  ++ +P GK +++K++ +   GNV WQ G +R
Subjt:  DTQTDAIENQDTNQSKTVRVKFLLQKECT-FGEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDR

Query:  TF-QPWETTNTII
        T+  P   T+T+I
Subjt:  TF-QPWETTNTII

P0DN29 Glucoamylase ARB_02327-11.7e-1040.24Show/hide
Query:  VKFLLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLN---WADG-HQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTF
        V+F L      GE  F+VG  P  GSWDV  A+PLN   +AD  HQW  ++E+P     ++KF+ + + G VVW+  P+R +
Subjt:  VKFLLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLN---WADG-HQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTF

P31746 Cyclomaltodextrin glucanotransferase1.8e-0727.27Show/hide
Query:  SSLPQADTQTDAIENQDTNQSKTVRVKFLLQKECTF-GEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WGAEVEIPVGKTIQFKFVLQGKTGNVVW
        S +  A+ ++   +  +      V V+F +    T  G + ++VG+    G+WD   AI P+     +Q   W  ++ +P GK +++K++ + + GNVVW
Subjt:  SSLPQADTQTDAIENQDTNQSKTVRVKFLLQKECTF-GEHFFVVGDDPIFGSWDVTSAI-PLNWADGHQ---WGAEVEIPVGKTIQFKFVLQGKTGNVVW

Query:  QPGPDRTF-QPWETTNTIIVS
        Q G +RT+  P   T+T++++
Subjt:  QPGPDRTF-QPWETTNTIIVS

P31797 Cyclomaltodextrin glucanotransferase4.2e-0932.17Show/hide
Query:  AIENQDTNQSKTVRVKFLLQKECT-FGEHFFVVGDDPIFGSWDVTSAIPLNW----ADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTF-QP
        A +N +   +  V V+F++    T  G++ ++VG+    G+WD + AI   +         W  +V +P GKTI+FKF+ +   GNV W+ G +  +  P
Subjt:  AIENQDTNQSKTVRVKFLLQKECT-FGEHFFVVGDDPIFGSWDVTSAIPLNW----ADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTF-QP

Query:  WETTNTIIVSEDWDS
          TT  IIV  DW +
Subjt:  WETTNTIIVSEDWDS

Q6ZY51 Phosphoglucan, water dikinase, chloroplastic4.2e-0928.48Show/hide
Query:  LQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKT---VRVKFLLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKT
        + H V L   S  L +  S L    T +  IE Q   +  +   VR+   L  +  FG+H  + G     GSW   S  PLNW++ + W  E+E+  G+ 
Subjt:  LQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKT---VRVKFLLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKT

Query:  IQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKIDNQDE
        +++KFV+    G++ W+ G +R  +   + N  +V   WD A    L   +++ N D+
Subjt:  IQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKIDNQDE

Arabidopsis top hitse value%identityAlignment
AT5G01260.1 Carbohydrate-binding-like fold1.5e-3837.98Show/hide
Query:  ISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQW
        I + R   +  S +   VPLR  SI            D+Q +  + +    +KTVRV+F L+KEC FGEHFF+VGDDP+FG  WD  +A+PLNW+DG+ W
Subjt:  ISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQW

Query:  GAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEE--------KIDNQDEDSPI----VPEKLIIEENLTYPN
          ++++PVG+ ++FK +L+ +TG ++WQPGP+R  + WET  TI + EDWD+A+ +++ EE+         I ++DED  +        ++  EN  Y +
Subjt:  GAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEE--------KIDNQDEDSPI----VPEKLIIEENLTYPN

Query:  EELIHNTN
        +E   N++
Subjt:  EELIHNTN

AT5G01260.2 Carbohydrate-binding-like fold1.5e-3837.98Show/hide
Query:  ISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQW
        I + R   +  S +   VPLR  SI            D+Q +  + +    +KTVRV+F L+KEC FGEHFF+VGDDP+FG  WD  +A+PLNW+DG+ W
Subjt:  ISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKECTFGEHFFVVGDDPIFGS-WDVTSAIPLNWADGHQW

Query:  GAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEE--------KIDNQDEDSPI----VPEKLIIEENLTYPN
          ++++PVG+ ++FK +L+ +TG ++WQPGP+R  + WET  TI + EDWD+A+ +++ EE+         I ++DED  +        ++  EN  Y +
Subjt:  GAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEE--------KIDNQDEDSPI----VPEKLIIEENLTYPN

Query:  EELIHNTN
        +E   N++
Subjt:  EELIHNTN

AT5G26570.1 catalytics;carbohydrate kinases;phosphoglucan, water dikinases3.0e-1028.48Show/hide
Query:  LQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKT---VRVKFLLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKT
        + H V L   S  L +  S L    T +  IE Q   +  +   VR+   L  +  FG+H  + G     GSW   S  PLNW++ + W  E+E+  G+ 
Subjt:  LQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKT---VRVKFLLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKT

Query:  IQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKIDNQDE
        +++KFV+    G++ W+ G +R  +   + N  +V   WD A    L   +++ N D+
Subjt:  IQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKIDNQDE

AT5G26570.2 catalytics;carbohydrate kinases;phosphoglucan, water dikinases3.0e-1028.48Show/hide
Query:  LQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKT---VRVKFLLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKT
        + H V L   S  L +  S L    T +  IE Q   +  +   VR+   L  +  FG+H  + G     GSW   S  PLNW++ + W  E+E+  G+ 
Subjt:  LQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKT---VRVKFLLQKECTFGEHFFVVGDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKT

Query:  IQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKIDNQDE
        +++KFV+    G++ W+ G +R  +   + N  +V   WD A    L   +++ N D+
Subjt:  IQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKIDNQDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACCCTAGCGACCTCCAACTCCATCATCGGCAACAATACAGCTCCGCCTTACTTCTCTGCTGCTTCTTTGAAGGAGCGTCTTCTTTCTGGAGGACCTGAATTCAT
CTCTTATCGGAGGCCTCGGAAATCGGCTGGTTCTGGACTTCAGCATTTGGTGCCTTTGCGCCGAGGAAGCATTGACTTGCTTTCTTGCTTCTCGTCTCTTCCGCAGGCAG
ATACTCAGACTGATGCAATTGAGAATCAAGATACAAATCAATCAAAGACTGTTCGCGTCAAATTCCTGTTACAGAAAGAGTGCACGTTTGGGGAGCATTTCTTTGTAGTA
GGTGATGATCCAATTTTTGGTTCTTGGGACGTTACGAGTGCAATACCTTTAAACTGGGCAGATGGGCATCAATGGGGGGCAGAAGTGGAGATTCCTGTTGGAAAAACAAT
CCAGTTCAAATTCGTACTTCAAGGAAAAACTGGAAATGTTGTATGGCAACCTGGTCCTGATCGAACGTTTCAACCCTGGGAAACAACTAATACAATCATCGTTTCTGAAG
ATTGGGATAGTGCCGAATCACGGATACTAAGTGAAGAAGAAAAAATTGATAACCAGGATGAGGATTCTCCCATTGTCCCAGAAAAGTTAATCATTGAGGAGAACCTCACT
TATCCAAATGAAGAACTGATCCACAATACAAATAAGGATTCATCAGTTGCTCTTGCTGATACTTCAATAGCAGAACAATCATCGGTGGAATCACAATACGAAGAAATGAT
TGATGGCAGTAACATCTTAGCTTTGGAAGAAAATGGCAGTAACGTTTCTCTTTCAGAGAATAACACTAGCAACATTTCTGCTTCAGAAGAGAATGCCAAAGATCTCGTGG
CAGAGAATATAAGCAACCCGAAGGAGAGCTTCATTCTCAATACAAGTAATGAAGCCATCAGCGAGGTATACAGCAATTCAAATGGGGAGACAACAATTACATCCCAGAGT
GATACAAAGATAACAGAGGAAATTTTGAAGAATGATGAGAAAGATTCAACAATGAAGATCCTTAGTAATAGAGAAGTTCAAGTAAGCTTCATCAACTATGGAGTTCCCAT
TCTAGTTCCTGGTTTACCTCCTACACCAGTAACATCAAATCAGGCAGCACCTCAACATGAAGTCGAATGTGATGACTCCATCAATGGAATTAATGAATCTGATGATCATA
AACCACCTGAGGAAAAGTCAAGTTACAAAGAAGACGACGACGAAGACTATGTCCTCCAAAGTGAAGTCGTCATCCAAAGTGAAACTAGACAAGAGGATGACACAAATAAA
ATTGAGAATCAATCCGACTTGCAGGAAATCAACAACGATATCGTTCAAAATGACATAATTTGGGGTCATAAAACTCTGAAGAAGTTCTTCTCCAGTTTAAGATTGCTTTA
G
mRNA sequenceShow/hide mRNA sequence
ATGAAAACCCTAGCGACCTCCAACTCCATCATCGGCAACAATACAGCTCCGCCTTACTTCTCTGCTGCTTCTTTGAAGGAGCGTCTTCTTTCTGGAGGACCTGAATTCAT
CTCTTATCGGAGGCCTCGGAAATCGGCTGGTTCTGGACTTCAGCATTTGGTGCCTTTGCGCCGAGGAAGCATTGACTTGCTTTCTTGCTTCTCGTCTCTTCCGCAGGCAG
ATACTCAGACTGATGCAATTGAGAATCAAGATACAAATCAATCAAAGACTGTTCGCGTCAAATTCCTGTTACAGAAAGAGTGCACGTTTGGGGAGCATTTCTTTGTAGTA
GGTGATGATCCAATTTTTGGTTCTTGGGACGTTACGAGTGCAATACCTTTAAACTGGGCAGATGGGCATCAATGGGGGGCAGAAGTGGAGATTCCTGTTGGAAAAACAAT
CCAGTTCAAATTCGTACTTCAAGGAAAAACTGGAAATGTTGTATGGCAACCTGGTCCTGATCGAACGTTTCAACCCTGGGAAACAACTAATACAATCATCGTTTCTGAAG
ATTGGGATAGTGCCGAATCACGGATACTAAGTGAAGAAGAAAAAATTGATAACCAGGATGAGGATTCTCCCATTGTCCCAGAAAAGTTAATCATTGAGGAGAACCTCACT
TATCCAAATGAAGAACTGATCCACAATACAAATAAGGATTCATCAGTTGCTCTTGCTGATACTTCAATAGCAGAACAATCATCGGTGGAATCACAATACGAAGAAATGAT
TGATGGCAGTAACATCTTAGCTTTGGAAGAAAATGGCAGTAACGTTTCTCTTTCAGAGAATAACACTAGCAACATTTCTGCTTCAGAAGAGAATGCCAAAGATCTCGTGG
CAGAGAATATAAGCAACCCGAAGGAGAGCTTCATTCTCAATACAAGTAATGAAGCCATCAGCGAGGTATACAGCAATTCAAATGGGGAGACAACAATTACATCCCAGAGT
GATACAAAGATAACAGAGGAAATTTTGAAGAATGATGAGAAAGATTCAACAATGAAGATCCTTAGTAATAGAGAAGTTCAAGTAAGCTTCATCAACTATGGAGTTCCCAT
TCTAGTTCCTGGTTTACCTCCTACACCAGTAACATCAAATCAGGCAGCACCTCAACATGAAGTCGAATGTGATGACTCCATCAATGGAATTAATGAATCTGATGATCATA
AACCACCTGAGGAAAAGTCAAGTTACAAAGAAGACGACGACGAAGACTATGTCCTCCAAAGTGAAGTCGTCATCCAAAGTGAAACTAGACAAGAGGATGACACAAATAAA
ATTGAGAATCAATCCGACTTGCAGGAAATCAACAACGATATCGTTCAAAATGACATAATTTGGGGTCATAAAACTCTGAAGAAGTTCTTCTCCAGTTTAAGATTGCTTTA
G
Protein sequenceShow/hide protein sequence
MKTLATSNSIIGNNTAPPYFSAASLKERLLSGGPEFISYRRPRKSAGSGLQHLVPLRRGSIDLLSCFSSLPQADTQTDAIENQDTNQSKTVRVKFLLQKECTFGEHFFVV
GDDPIFGSWDVTSAIPLNWADGHQWGAEVEIPVGKTIQFKFVLQGKTGNVVWQPGPDRTFQPWETTNTIIVSEDWDSAESRILSEEEKIDNQDEDSPIVPEKLIIEENLT
YPNEELIHNTNKDSSVALADTSIAEQSSVESQYEEMIDGSNILALEENGSNVSLSENNTSNISASEENAKDLVAENISNPKESFILNTSNEAISEVYSNSNGETTITSQS
DTKITEEILKNDEKDSTMKILSNREVQVSFINYGVPILVPGLPPTPVTSNQAAPQHEVECDDSINGINESDDHKPPEEKSSYKEDDDEDYVLQSEVVIQSETRQEDDTNK
IENQSDLQEINNDIVQNDIIWGHKTLKKFFSSLRLL