CuGenDBv2

Sgr029699 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029699
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Endoglucanase
Genome location: tig00153449:1961090..1972904
RNA-Seq Expression: Sgr029699
Synteny: Sgr029699
Gene Ontology terms:
GO:0000272 - polysaccharide catabolic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0016853 - isomerase activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR001701 - Glycoside hydrolase family 9
IPR008928 - Six-hairpin glycosidase superfamily
IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR012341 - Six-hairpin glycosidase-like superfamily
IPR025114 - Beta-carotene isomerase D27-like, C-terminal
IPR025610 - Transcription factor MYC/MYB N-terminal
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
IPR045084 - Transcription factor AIB/MYC-like
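
The genome location in the record above follows a `contig:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# contig name, a colon, start, two dots, end
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'tig00153449:1961090..1972904'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return GenomeLocation(match["contig"], start, end)


loc = parse_location("tig00153449:1961090..1972904")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the reported length is the genomic span of the gene model, which would include any introns and so is expected to be longer than the CDS.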


Homology
GenBank top hits (e value, % identity, alignment)
KAA8525087.1 hypothetical protein F0562_007049 [Nyssa sinensis] (e value: 9.8e-188; % identity: 63.69)
Query:  TLSGRKKRSKVIWCGIAEASGEPTPVGQKPSTMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLS
        +L  R+   + I CGIAE SGEP P GQK     G  +     + +   E      G E     W++ YDYESFVDVS+ VMQG+SR+QQQQVVREVLLS
Subjt:  TLSGRKKRSKVIWCGIAEASGEPTPVGQKPSTMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLS

Query:  MLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFED
        MLPPGAPAQFRKLFPPT+WA EFNA++TVPFF WLVGPSEV+EVEVNG+KQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTM PNFED
Subjt:  MLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFED

Query:  MSCEMIYGQAPPPFEEDPVSNQPCYTNLCSMANPSASLCHKL-----------------------------------QVDLVGGYYDAGDNVKFGLPMAF
        MSCEM+YGQ PPPFEEDPVS QPC+ ++CSMANPS+     L                                    VDLVGGYYDAGDNVKFGLPMAF
Subjt:  MSCEMIYGQAPPPFEEDPVSNQPCYTNLCSMANPSASLCHKL-----------------------------------QVDLVGGYYDAGDNVKFGLPMAF

Query:  TTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKA
        TTTLLAWSVIEFG SM  +I+NARAAVRW +DYLLKAAT+ P TLYVQVGDPN+DH+CWERPEDMDTPR+VYK++TQNPGSDVAAETAAALAAASIVFK 
Subjt:  TTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKA

Query:  SDPSYSTKLLDAALKT-------------SIEVLTAIPSIQWSVHFTVL----------TPDTTY-----SNGHILGADDDDYTFSWDDKRPGTKILLSQ
        SDPSYS+KLL  ++K              S+  +       +S +   L          + D +Y     SNGH +GADDDDY+FSWDDKR GTKILLSQ
Subjt:  SDPSYSTKLLDAALKT-------------SIEVLTAIPSIQWSVHFTVL----------TPDTTY-----SNGHILGADDDDYTFSWDDKRPGTKILLSQ

Query:  DFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
         FLEKN++EFQ+YK HSDN+ICS+IPG  +F +QYTP
Subjt:  DFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

KAF2313527.1 hypothetical protein GH714_011458 [Hevea brasiliensis] (e value: 1.6e-182; % identity: 64.23)
Query:  IWCGIAEASGEPTPVGQKPS-----------TMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLS
        I C IAE +GEP P+GQK             T+  R K    A  ++DG      EG+E G    W  YDY+SFVDVSRRVMQG++RLQQQQVVREVLLS
Subjt:  IWCGIAEASGEPTPVGQKPS-----------TMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLS

Query:  MLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFED
        MLPPGAP QFRKLFPPTRWA EFNA++TVPFFQWLVGPSEVVEVEVNG+KQRSGVHIKKCRYLENSGCVG+CVNMCKIPTQDFFTNEFGLPLTM PNFED
Subjt:  MLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFED

Query:  MSCEMIYGQAPPPFEEDPVSNQPCYTNLCS--------------------------------MANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTT
        MSCEM+YGQAPPPFEEDP S QPC+ ++                                    N   S      V+LVGGYYDAGDNVKFGLPMAFTTT
Subjt:  MSCEMIYGQAPPPFEEDPVSNQPCYTNLCS--------------------------------MANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTT

Query:  LLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDP
        LLAWSVIEFG SM  +IEN +AA+RW +DYLLKAATA P TLYVQVGDPNLDHKCWERPEDMDTPR+VYK+ TQNPGSDVAAETAAALAAASIVFK SDP
Subjt:  LLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDP

Query:  SYSTKLLDAALKT-----------SIEVLTAI-----------PSIQWSVHFTVLTPDT------TYSNGHILGADDDDYTFSWDDKRPGTKILLSQDFL
        SYS+KLL  A+K            S  + +A+             + W   +               SNGH +GADDDDY+FSWDDKR GTKILLS+ FL
Subjt:  SYSTKLLDAALKT-----------SIEVLTAI-----------PSIQWSVHFTVLTPDT------TYSNGHILGADDDDYTFSWDDKRPGTKILLSQDFL

Query:  EKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
        +K  EEFQV+KAHSDNYICSLIPGTSSF +QYTP
Subjt:  EKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

KAG7028209.1 Transcription factor bHLH28, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.2e-131; % identity: 60.84)
Query:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD
        SPNS+A       STLQQ+LQFILH+R EWWAYSIFW A+KD +  NLV  WRDGHFRGTRDF ARPSK   G A QLISFGFD              + 
Subjt:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD

Query:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL
         V+R E  DF DLEWYYTVS+TR F + DNV+GRVFDS +YVWLTADDGL+                 + F     GVLELGSS+LIKQDWSLAQ AKS+
Subjt:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL

Query:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKE--------PGGSGSGGVGGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEA
        FG   A F P  ++N +           APPCS V K E         GG G GG GGSSSDSLSDNSDGNFI   +NKRG++P+KS   S+PPVNHVEA
Subjt:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKE--------PGGSGSGGVGGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEA

Query:  ERQRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKL------KSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNV
        ERQRR KLN+RFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQ++E+KL           SS     L++N                    N+ NNV
Subjt:  ERQRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKL------KSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNV

Query:  EVNIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL
        EV IIGSEAMVRVQCRDEN+PSARLLNVLRDLGLQ+HHAS SSVNDLMLQDVVVR+PQG  + EKALR AILQRL
Subjt:  EVNIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL

XP_022942541.1 transcription factor MYC2-like [Cucurbita moschata] (e value: 4.9e-131; % identity: 60.89)
Query:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD
        SPNS+A       STLQQ+LQFILH+R EWWAYSIFW A+KD +  NLV  WRDGHFRGTRDF ARPSK   G A QLISFGFD              + 
Subjt:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD

Query:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL
         V+R E  DF DLEWYYTVS+TR F + DNV+GRVFDS +YVWLTADDGL+                 + F     GVLELGSS+LIKQDWSLAQ AKS+
Subjt:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL

Query:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKEPGGSGSGGV------GGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEAER
        FG   A F P  ++N +           APPCS V K E G SG GG       GGSSSDSLSDNSDGNF+   +NKRG++P+KS   S+PPVNHVEAER
Subjt:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKEPGGSGSGGV------GGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEAER

Query:  QRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKL------KSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEV
        QRR KLN+RFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQ++E+KL           SS     L++N                    N+ NNVEV
Subjt:  QRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKL------KSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEV

Query:  NIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL
         IIGSEAMVRVQCRDEN+PSARLLNVLRDLGLQ+HHAS SSVNDLMLQDVVVR+PQG  + EKALR AILQRL
Subjt:  NIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL

XP_023539515.1 transcription factor MYC2-like [Cucurbita pepo subsp. pepo] (e value: 1.3e-131; % identity: 61.36)
Query:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD
        SPNS+A       STLQQ+LQFILH+R EWWAYSIFW A+KD +  NLV  WRDGHFRGTRDF ARPSK   G A QLISFGFD              + 
Subjt:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD

Query:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL
         V+R E  DF DLEWYYTVS+TR F + DNV+GRVFDS +YVWLTADDGL+                 + F     GVLELGSS+LIKQDWSLAQ AKS+
Subjt:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL

Query:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKE-----PGGSGSGGVGGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEAERQ
        FG   A F P  ++N +           APPCS VAK E      GG G GG GGSSSDSLSDNSDGNFI   +NKRG++P+KS   S+PPVNHVEAERQ
Subjt:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKE-----PGGSGSGGVGGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEAERQ

Query:  RRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKL-----KSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEVNI
        RR KLN+RFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQ++E+KL          SS     L++N                    N+ NNVEV I
Subjt:  RRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKL-----KSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEVNI

Query:  IGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL
        IGSEAMVRVQCRDEN+PSARLLNVLRDLGLQ+HHAS SSVNDLMLQDVVVR+P G  + EKALR AILQRL
Subjt:  IGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL

TrEMBL top hits (e value, % identity, alignment)
A0A5J5A6W7 Endoglucanase (e value: 4.7e-188; % identity: 63.69)
Query:  TLSGRKKRSKVIWCGIAEASGEPTPVGQKPSTMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLS
        +L  R+   + I CGIAE SGEP P GQK     G  +     + +   E      G E     W++ YDYESFVDVS+ VMQG+SR+QQQQVVREVLLS
Subjt:  TLSGRKKRSKVIWCGIAEASGEPTPVGQKPSTMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLS

Query:  MLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFED
        MLPPGAPAQFRKLFPPT+WA EFNA++TVPFF WLVGPSEV+EVEVNG+KQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTM PNFED
Subjt:  MLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFED

Query:  MSCEMIYGQAPPPFEEDPVSNQPCYTNLCSMANPSASLCHKL-----------------------------------QVDLVGGYYDAGDNVKFGLPMAF
        MSCEM+YGQ PPPFEEDPVS QPC+ ++CSMANPS+     L                                    VDLVGGYYDAGDNVKFGLPMAF
Subjt:  MSCEMIYGQAPPPFEEDPVSNQPCYTNLCSMANPSASLCHKL-----------------------------------QVDLVGGYYDAGDNVKFGLPMAF

Query:  TTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKA
        TTTLLAWSVIEFG SM  +I+NARAAVRW +DYLLKAAT+ P TLYVQVGDPN+DH+CWERPEDMDTPR+VYK++TQNPGSDVAAETAAALAAASIVFK 
Subjt:  TTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKA

Query:  SDPSYSTKLLDAALKT-------------SIEVLTAIPSIQWSVHFTVL----------TPDTTY-----SNGHILGADDDDYTFSWDDKRPGTKILLSQ
        SDPSYS+KLL  ++K              S+  +       +S +   L          + D +Y     SNGH +GADDDDY+FSWDDKR GTKILLSQ
Subjt:  SDPSYSTKLLDAALKT-------------SIEVLTAIPSIQWSVHFTVL----------TPDTTY-----SNGHILGADDDDYTFSWDDKRPGTKILLSQ

Query:  DFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
         FLEKN++EFQ+YK HSDN+ICS+IPG  +F +QYTP
Subjt:  DFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

A0A6A6MKM2 Endoglucanase (e value: 7.8e-183; % identity: 64.23)
Query:  IWCGIAEASGEPTPVGQKPS-----------TMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLS
        I C IAE +GEP P+GQK             T+  R K    A  ++DG      EG+E G    W  YDY+SFVDVSRRVMQG++RLQQQQVVREVLLS
Subjt:  IWCGIAEASGEPTPVGQKPS-----------TMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLS

Query:  MLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFED
        MLPPGAP QFRKLFPPTRWA EFNA++TVPFFQWLVGPSEVVEVEVNG+KQRSGVHIKKCRYLENSGCVG+CVNMCKIPTQDFFTNEFGLPLTM PNFED
Subjt:  MLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFED

Query:  MSCEMIYGQAPPPFEEDPVSNQPCYTNLCS--------------------------------MANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTT
        MSCEM+YGQAPPPFEEDP S QPC+ ++                                    N   S      V+LVGGYYDAGDNVKFGLPMAFTTT
Subjt:  MSCEMIYGQAPPPFEEDPVSNQPCYTNLCS--------------------------------MANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTT

Query:  LLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDP
        LLAWSVIEFG SM  +IEN +AA+RW +DYLLKAATA P TLYVQVGDPNLDHKCWERPEDMDTPR+VYK+ TQNPGSDVAAETAAALAAASIVFK SDP
Subjt:  LLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDP

Query:  SYSTKLLDAALKT-----------SIEVLTAI-----------PSIQWSVHFTVLTPDT------TYSNGHILGADDDDYTFSWDDKRPGTKILLSQDFL
        SYS+KLL  A+K            S  + +A+             + W   +               SNGH +GADDDDY+FSWDDKR GTKILLS+ FL
Subjt:  SYSTKLLDAALKT-----------SIEVLTAI-----------PSIQWSVHFTVLTPDT------TYSNGHILGADDDDYTFSWDDKRPGTKILLSQDFL

Query:  EKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
        +K  EEFQV+KAHSDNYICSLIPGTSSF +QYTP
Subjt:  EKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

A0A6J1E9T4 transcription factor MYC2-like isoform X1 (e value: 4.1e-131; % identity: 59.25)
Query:  MDDL-VSPSSSPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKDMAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDI
        MDD  +S SSSPNS A        T+QQRLQFILH+ P+W+AYSIFW A+KD AGNL+ +W DGHFRGTR          +GG  QLISFGFD       
Subjt:  MDDL-VSPSSSPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKDMAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDI

Query:  AFHDLFYDDLVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADD----------------GLNHVGFFGDFRGVLELGSSDLIKQDW
               D  V+R+E GDF+DLE YYT+SI++ +G  DNV+GRVFDSS+Y+WLT D+                G+  + F     GVLELGSS+LIK+DW
Subjt:  AFHDLFYDDLVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADD----------------GLNHVGFFGDFRGVLELGSSDLIKQDW

Query:  SLAQSAKSLFGAAGACFTPLNDQND-----------QAPPCSGVAKKEPGGSGSGGVGGSSSDSLSDNSDGNFIPAKN-----NKRGKKPAKS---SSPP
        SLAQ AK+LFG      T    ++            QAPPCSG+ K+E  G G GG GGSSSDSLSDNSDGNF+   +     NK+GK+ AK+   S+ P
Subjt:  SLAQSAKSLFGAAGACFTPLNDQND-----------QAPPCSGVAKKEPGGSGSGGVGGSSSDSLSDNSDGNFIPAKN-----NKRGKKPAKS---SSPP

Query:  VNHVEAERQRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKLK-SQHQ-SSSMSQVLDQNR---SPVEQTITSAYYAMNNNNNN
        VNHVEAERQRR KLNHRFYALRSVVPNVSKMDKASLLADAV+YINELK+K+Q MESKLK SQHQ SSS+S   DQ R   S +EQT++S  YAM  NN  
Subjt:  VNHVEAERQRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKLK-SQHQ-SSSMSQVLDQNR---SPVEQTITSAYYAMNNNNNN

Query:  NNNNVEVNIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRLD
        NNN VEV ++G+EA+VRV CRDENYPSARL+NVL+DLGLQV  AS SSVND+MLQDVV+R+PQG A+REK LRTAILQRL+
Subjt:  NNNNVEVNIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRLD

A0A6J1FP53 transcription factor MYC2-like (e value: 2.4e-131; % identity: 60.89)
Query:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD
        SPNS+A       STLQQ+LQFILH+R EWWAYSIFW A+KD +  NLV  WRDGHFRGTRDF ARPSK   G A QLISFGFD              + 
Subjt:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD

Query:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL
         V+R E  DF DLEWYYTVS+TR F + DNV+GRVFDS +YVWLTADDGL+                 + F     GVLELGSS+LIKQDWSLAQ AKS+
Subjt:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL

Query:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKEPGGSGSGGV------GGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEAER
        FG   A F P  ++N +           APPCS V K E G SG GG       GGSSSDSLSDNSDGNF+   +NKRG++P+KS   S+PPVNHVEAER
Subjt:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKEPGGSGSGGV------GGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEAER

Query:  QRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKL------KSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEV
        QRR KLN+RFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQ++E+KL           SS     L++N                    N+ NNVEV
Subjt:  QRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKL------KSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEV

Query:  NIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL
         IIGSEAMVRVQCRDEN+PSARLLNVLRDLGLQ+HHAS SSVNDLMLQDVVVR+PQG  + EKALR AILQRL
Subjt:  NIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL

A0A6J1KYA9 transcription factor MYC2-like (e value: 1.4e-128; % identity: 60.78)
Query:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD
        SPNS+A       STLQQ+LQFILH+R EWWAYSIFW A+KD +  NLV  WRDGHFRGTRDF ARPSK   G A QLISFGFD              + 
Subjt:  SPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKD-MAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDD

Query:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL
         V++ +  DF DLEWYYT+S+TR F + DNV+GRVFDS +YVWLTADDGL+                 + F     GVLELGSS+LIKQDWSLAQ AKS+
Subjt:  LVNRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLN----------------HVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSL

Query:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKEPGGSG---SGGVGGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEAERQRR
        FG   A F P  ++N +           APPCS V K E G SG    GG GGSSSDSLS NSDGNFI   +NKRG++P KS   S+PPVNHVEAERQRR
Subjt:  FGAAGACFTPLNDQNDQ-----------APPCSGVAKKEPGGSG---SGGVGGSSSDSLSDNSDGNFIPAKNNKRGKKPAKS---SSPPVNHVEAERQRR

Query:  LKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKLKSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEVNIIGSEAMV
         KLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSK+Q++E+ L             +   S    T+        N   N+ NNVEV IIGSEAMV
Subjt:  LKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKLKSQHQSSSMSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEVNIIGSEAMV

Query:  RVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL
        RVQCRDEN+PSARLLNVLRDLGLQ+HHAS SSVNDLMLQDVVVR+PQG  + EKALR A+LQRL
Subjt:  RVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRL

SwissProt top hits (e value, % identity, alignment)
O81416 Endoglucanase 17 (e value: 1.4e-67; % identity: 52.63)
Query:  LQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINT
        L VDLVGGYYDAGDN+KFG PMAFTTT+L+WSVIEFG  M  E++NA+ A+RW +DYLLK AT+ PDT+YVQVGD N DH CWERPEDMDT RSV+K++ 
Subjt:  LQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINT

Query:  QNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKTSIEVLTAIPSIQWSVHFTVLTPDT-----TYS----------------------------
          PGSDVAAETAAALAAA+IVF+ SDPSYS  LL  A+      + A        +   L PD      +YS                            
Subjt:  QNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKTSIEVLTAIPSIQWSVHFTVLTPDT-----TYS----------------------------

Query:  NGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
        NG ILGA + D TF WD+K  G +ILL++ FL +N +    YK H+DN+ICS+IPG     +QYTP
Subjt:  NGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

P05522 Endoglucanase 1 (e value: 2.7e-84; % identity: 62.55)
Query:  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQN
        VDLVGGYYDAGDN+KFGLPMAFTTT+LAW +IEFG  M  ++ENARAA+RW +DYLLKA+TA  ++LYVQVG+PN DH+CWERPEDMDTPR+VYK++TQN
Subjt:  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQN

Query:  PGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKT-------------SIEVLTA-------------IPSIQWSVHFTVLTPDTTY--SNGHILGA
        PGSDVAAETAAALAAASIVF  SD SYSTKLL  A+K              S+  +               +    W    +      TY  SNGH LGA
Subjt:  PGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKT-------------SIEVLTA-------------IPSIQWSVHFTVLTPDTTY--SNGHILGA

Query:  DDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
        DDDDY+FSWDDKR GTK+LLS+ FL+   EE Q+YK H+DNYICSLIPGTSSF +QYTP
Subjt:  DDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

Q6YXT7 Endoglucanase 19 (e value: 6.6e-78; % identity: 58.3)
Query:  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQN
        VDLVGGYYDAGDNVKFGLPMAF+TT+LAWSV++FG  MG E+ NARAAVRWG+DYLLKAATA P  LYVQV DPN DH+CWERPEDMDTPRSVY++    
Subjt:  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQN

Query:  PGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKTSIEVLTAIPSIQWSVHFTVLTPDTTYS----------------------------NGHILGA
        PGSDVA ETAAALAA+S+VF+ +DP+YS +LL AA +          S   S+  +V     +YS                            NG  LGA
Subjt:  PGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKTSIEVLTAIPSIQWSVHFTVLTPDTTYS----------------------------NGHILGA

Query:  DDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
         DDDY+FSWDDKR GTK+LL++ FL       ++YKAHSD+YICSL+PGT+SF S+YTP
Subjt:  DDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

Q6Z715 Endoglucanase 4 (e value: 2.1e-76; % identity: 57.25)
Query:  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMG----------------REIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERP
        VDL GGYYDAGDNVKFGLPMAFT T+L+WSVIEFGD M                  +++NARAAVRWG+DYLLKAATA PDTLYVQV DP  DH+CWERP
Subjt:  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMG----------------REIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERP

Query:  EDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAA-------------LKTSIEVLTAIPSIQWSVHFTVL---------TPD-
        EDMDTPRSVYK+  Q+PGSDVA ETAAALAAASIVF+ SDPSYS KLLDAA                S+  +        S H  +L         +P+ 
Subjt:  EDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAA-------------LKTSIEVLTAIPSIQWSVHFTVL---------TPD-

Query:  ----TTY--SNGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
             +Y  SNGH LGA+ DD+TFSWDDKR  TK      FL+  ++  Q+YKAH+DNYICSL+PG + F SQYTP
Subjt:  ----TTY--SNGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

Q9SRX3 Endoglucanase 1 (e value: 5.6e-69; % identity: 50.52)
Query:  FEEDPVSNQPCYTNLCSMANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQ
        FE       P    +   +N   S    L VDLVGGYYDAGDN+KFG PMAFTTT+L+WS+IEFG  M  E+ NA+ A+RW +D+LLK AT+ PDT+YVQ
Subjt:  FEEDPVSNQPCYTNLCSMANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQ

Query:  VGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKT---------------SIEVLTAIPS-------
        VGDPN+DH CWERPEDMDTPRSV+K++  NPGSD+A E AAALAAASIVF+  DPSYS  LL  A+                 + EV     S       
Subjt:  VGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKT---------------SIEVLTAIPS-------

Query:  IQW-SVHFTVLTPDTTY-----SNGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
        + W +      T + TY     +NG ILGAD+ D  FSWD+K  G +ILLS++FL +  +  + YK H+D++ICS++PG SS  SQYTP
Subjt:  IQW-SVHFTVLTPDTTY-----SNGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

Arabidopsis top hits (e value, % identity, alignment)
AT1G02800.1 cellulase 2 (e value: 4.0e-70; % identity: 50.52)
Query:  FEEDPVSNQPCYTNLCSMANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQ
        FE       P    +   +N   S    L VDLVGGYYDAGDN+KFG PMAFTTT+L+WS+IEFG  M  E+ NA+ A+RW +D+LLK AT+ PDT+YVQ
Subjt:  FEEDPVSNQPCYTNLCSMANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQ

Query:  VGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKT---------------SIEVLTAIPS-------
        VGDPN+DH CWERPEDMDTPRSV+K++  NPGSD+A E AAALAAASIVF+  DPSYS  LL  A+                 + EV     S       
Subjt:  VGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKT---------------SIEVLTAIPS-------

Query:  IQW-SVHFTVLTPDTTY-----SNGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
        + W +      T + TY     +NG ILGAD+ D  FSWD+K  G +ILLS++FL +  +  + YK H+D++ICS++PG SS  SQYTP
Subjt:  IQW-SVHFTVLTPDTTY-----SNGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP

AT1G22880.1 cellulase 5 (e value: 5.7e-61; % identity: 47.64)
Query:  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQN
        VDL GGYYDAGDNVKF  PMAFTTT+L+WS +E+G  MG E++N+R A+RW +DYLLK A A P  LYV VGDPN DHKCWERPEDMDTPR+VY ++  N
Subjt:  VDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQN

Query:  PGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKT---SIEVLTAIPS-------------------IQWSVHFTVLTPDTTYSNGHI--LGADDDD
        PGSDVAAETAAALAA+S+VF+  DP YS  LL  A K    +I+   A  +                   + W   +     +  Y    I  LG  D  
Subjt:  PGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKT---SIEVLTAIPS-------------------IQWSVHFTVLTPDTTYSNGHI--LGADDDD

Query:  YTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYT
          FSWD+K  G  +LLS+  +      F++YK  ++N++C ++P + S  ++YT
Subjt:  YTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYT

AT1G64680.1 unknown protein (e value: 4.3e-93; % identity: 69.57)
Query:  CGIAEASGEPTPVGQKPSTMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLSMLPPGAPAQFRKL
        CGIAE SGEP P+G K     G ++ V   + +   +     + ++     +WE YDYESFV+VS+RVMQG+SR+QQQ+ VREVLLSMLPPGAP QFRKL
Subjt:  CGIAEASGEPTPVGQKPSTMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESFVDVSRRVMQGKSRLQQQQVVREVLLSMLPPGAPAQFRKL

Query:  FPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFEDMSCEMIYGQAPPP
        FPPT+WAAEFNA++TVPFF WLVGPS+V+EVEVNG+KQRSGV IKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPN+EDMSCEMIYGQAPP 
Subjt:  FPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFFTNEFGLPLTMNPNFEDMSCEMIYGQAPPP

Query:  FEEDPVSNQPCYTNLCSMANPSASLCHKLQ
        FEED  + QPC  ++CSM+NPS+ +C KL+
Subjt:  FEEDPVSNQPCYTNLCSMANPSASLCHKLQ

AT1G71380.1 cellulase 3 (e value: 1.6e-63; % identity: 44.77)
Query:  NPNF-EDMSCEMIY--GQAPPPFEEDPVSNQPCYTNLCSMANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAA
        NPN+ E +S  +++  GQ   P    P   Q  +     +++ SA+      VDL GGYYDAGDNVKF LPMAFTTT+L+WS +E+G  MG E+ENAR  
Subjt:  NPNF-EDMSCEMIY--GQAPPPFEEDPVSNQPCYTNLCSMANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAA

Query:  VRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAA---LKTSIEVLTA
        +RW +DYLLK A A P  LYV VGDPN+DHKCWERPEDMDTPR+VY ++  NPGSDVAAETAAALAAAS+VF+  D  YS  LL  A   ++ +I+   A
Subjt:  VRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAA---LKTSIEVLTA

Query:  I-------------------PSIQWSVHFTVLTPDTTYSNGHI--LGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSS
                              + W   + +   +  Y    I  LG  D    FSWD+K  G  +LLS+  L      F+ YK  ++N+IC ++P + S
Subjt:  I-------------------PSIQWSVHFTVLTPDTTYSNGHI--LGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSS

Query:  FGSQYT
          +QYT
Subjt:  FGSQYT

AT4G02290.1 glycosyl hydrolase 9B13 (e value: 9.7e-69; % identity: 52.63)
Query:  LQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINT
        L VDLVGGYYDAGDN+KFG PMAFTTT+L+WSVIEFG  M  E++NA+ A+RW +DYLLK AT+ PDT+YVQVGD N DH CWERPEDMDT RSV+K++ 
Subjt:  LQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRWGSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINT

Query:  QNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKTSIEVLTAIPSIQWSVHFTVLTPDT-----TYS----------------------------
          PGSDVAAETAAALAAA+IVF+ SDPSYS  LL  A+      + A        +   L PD      +YS                            
Subjt:  QNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKTSIEVLTAIPSIQWSVHFTVLTPDT-----TYS----------------------------

Query:  NGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP
        NG ILGA + D TF WD+K  G +ILL++ FL +N +    YK H+DN+ICS+IPG     +QYTP
Subjt:  NGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTP


Sequences
CDS sequence
ATGGACGACCTCGTCTCTCCTTCTTCCTCCCCCAATTCCCTCGCTTCCTTCTGCCACCAGACGCCGTCGACGCTCCAGCAGCGCCTTCAGTTCATCCTCCACGACCGCCC
CGAGTGGTGGGCCTATTCCATTTTCTGGCAGGCTACGAAAGACATGGCTGGGAATCTGGTTCTAACGTGGAGAGACGGCCATTTCCGCGGCACTAGGGACTTCGCCGCCC
GACCTTCGAAAACGAACGACGGCGGCGCCGGACAACTCATTAGCTTTGGGTTTGACTCCGGGAGGAAGAAGGATATAGCATTTCACGATCTGTTCTACGATGATTTGGTG
AACAGAATGGAAGGTGGGGATTTTACAGACTTGGAATGGTATTACACCGTGTCTATAACGCGATCGTTTGGCGTCGCCGATAATGTCATCGGCCGGGTGTTCGACTCCAG
CTCCTACGTCTGGTTGACGGCGGACGACGGCCTGAATCACGTTGGTTTTTTTGGCGACTTCCGCGGCGTTCTCGAATTGGGTTCTTCTGATCTGATCAAACAGGATTGGA
GTTTGGCCCAGTCTGCAAAATCACTTTTTGGAGCTGCTGGTGCATGTTTTACGCCATTGAATGATCAAAACGATCAAGCACCGCCGTGTTCCGGTGTGGCGAAGAAAGAA
CCGGGCGGCAGTGGCAGTGGCGGCGTCGGAGGGTCGTCGTCCGACTCGCTCTCAGATAACTCCGATGGGAATTTCATACCAGCCAAAAACAACAAGAGAGGGAAGAAACC
GGCGAAGTCATCATCGCCACCGGTGAACCATGTGGAAGCAGAGAGGCAGCGCCGGCTGAAGCTAAACCATAGATTCTACGCTCTCCGATCAGTGGTTCCAAATGTATCAA
AGATGGACAAAGCTTCATTGCTGGCCGATGCAGTCATCTACATCAACGAGCTCAAGTCCAAGATTCAAGCAATGGAGTCCAAGCTAAAATCACAGCACCAGAGCAGCAGC
ATGAGCCAAGTACTCGATCAGAACAGGAGCCCGGTCGAGCAGACGATCACGTCAGCATATTACGCGATGAACAACAACAACAACAACAACAATAACAACGTCGAGGTGAA
TATTATTGGGTCGGAAGCGATGGTGAGAGTTCAATGCCGAGACGAGAATTACCCGTCGGCGAGGCTCCTGAATGTGCTCCGAGACCTGGGGCTTCAAGTTCATCATGCGA
GTTTCTCAAGCGTAAACGATCTGATGCTGCAAGATGTTGTTGTTAGGATTCCTCAAGGAGCAGCCATGAGAGAGAAGGCCTTAAGAACTGCCATACTTCAACGACTGGAT
ATGTTGATTATCTACATAGTGTCCAATCCTTCACACCCCACAGGAGATAAGGAGTTTCGTGGAGTGGTGAAATGCGTAGCGATGGGAAAGAACACCAACGGTGAAAGCAC
TCTGTCGGGTCGTAAGAAAAGGAGTAAAGTGATATGGTGTGGAATTGCAGAGGCATCAGGGGAGCCGACGCCGGTGGGGCAAAAACCAAGTACAATGATGGGGCGTTTGA
AAGGTGTTCATGACGCTGTTTGCTCGGAAGATGGAGAAGTTTGCAAAAGAGCAGAGGGAGAAGAAGACGGCGGCGGCGCATGGTGGGAGTTTTATGACTACGAAAGCTTC
GTGGACGTGTCGAGAAGAGTAATGCAAGGGAAGTCTCGGCTGCAGCAGCAGCAGGTGGTGCGAGAGGTTCTCTTGTCTATGCTTCCTCCAGGAGCGCCTGCTCAGTTCAG
GAAATTGTTCCCGCCAACAAGGTGGGCGGCTGAGTTCAATGCCTCAATAACAGTGCCATTTTTTCAGTGGTTAGTCGGCCCGTCGGAGGTTGTGGAAGTGGAGGTAAATG
GAATAAAGCAAAGAAGTGGAGTTCATATAAAGAAGTGCAGGTACCTTGAGAACAGTGGGTGTGTGGGTATGTGCGTGAATATGTGCAAGATACCTACACAAGATTTCTTC
ACCAATGAATTTGGGCTCCCTCTCACCATGAATCCTAATTTTGAAGACATGAGTTGTGAGATGATATATGGCCAAGCTCCACCGCCATTTGAAGAGGATCCAGTATCCAA
TCAACCTTGCTACACAAATTTATGTTCCATGGCGAATCCTAGTGCCTCCTTATGTCATAAATTGCAAGTTGACCTTGTTGGTGGCTACTATGACGCTGGGGATAATGTCA
AGTTTGGCTTGCCAATGGCCTTCACTACTACATTGCTGGCTTGGAGTGTCATTGAGTTTGGCGACTCGATGGGGAGAGAGATTGAGAATGCAAGAGCAGCTGTCCGATGG
GGGTCGGATTATCTATTAAAGGCTGCCACTGCTGCCCCTGACACCTTATATGTTCAAGTGGGAGACCCAAACCTTGATCACAAATGCTGGGAAAGGCCTGAAGATATGGA
CACACCGCGCAGTGTGTATAAGATAAACACTCAAAATCCAGGCTCTGATGTAGCAGCAGAGACCGCGGCTGCGCTGGCTGCGGCTTCGATTGTGTTCAAAGCCTCCGACC
CTTCTTATTCTACCAAATTGCTCGACGCGGCCTTGAAAACAAGCATAGAGGTTCTTACAGCGATTCCCTCCATTCAATGGTCTGTCCATTTTACTGTTCTTACTCCGGAT
ACAACGTACTCCAATGGCCACATACTGGGTGCTGATGACGACGACTACACGTTCAGCTGGGACGACAAGCGCCCTGGAACAAAGATCCTTCTCTCCCAGGATTTCCTAGA
GAAAAACTCAGAGGAATTCCAAGTCTATAAAGCACACTCTGATAATTACATATGCTCTCTGATTCCAGGGACTTCCAGTTTCGGTTCCCAATATACTCCTG
mRNA sequence
ATGGACGACCTCGTCTCTCCTTCTTCCTCCCCCAATTCCCTCGCTTCCTTCTGCCACCAGACGCCGTCGACGCTCCAGCAGCGCCTTCAGTTCATCCTCCACGACCGCCC
CGAGTGGTGGGCCTATTCCATTTTCTGGCAGGCTACGAAAGACATGGCTGGGAATCTGGTTCTAACGTGGAGAGACGGCCATTTCCGCGGCACTAGGGACTTCGCCGCCC
GACCTTCGAAAACGAACGACGGCGGCGCCGGACAACTCATTAGCTTTGGGTTTGACTCCGGGAGGAAGAAGGATATAGCATTTCACGATCTGTTCTACGATGATTTGGTG
AACAGAATGGAAGGTGGGGATTTTACAGACTTGGAATGGTATTACACCGTGTCTATAACGCGATCGTTTGGCGTCGCCGATAATGTCATCGGCCGGGTGTTCGACTCCAG
CTCCTACGTCTGGTTGACGGCGGACGACGGCCTGAATCACGTTGGTTTTTTTGGCGACTTCCGCGGCGTTCTCGAATTGGGTTCTTCTGATCTGATCAAACAGGATTGGA
GTTTGGCCCAGTCTGCAAAATCACTTTTTGGAGCTGCTGGTGCATGTTTTACGCCATTGAATGATCAAAACGATCAAGCACCGCCGTGTTCCGGTGTGGCGAAGAAAGAA
CCGGGCGGCAGTGGCAGTGGCGGCGTCGGAGGGTCGTCGTCCGACTCGCTCTCAGATAACTCCGATGGGAATTTCATACCAGCCAAAAACAACAAGAGAGGGAAGAAACC
GGCGAAGTCATCATCGCCACCGGTGAACCATGTGGAAGCAGAGAGGCAGCGCCGGCTGAAGCTAAACCATAGATTCTACGCTCTCCGATCAGTGGTTCCAAATGTATCAA
AGATGGACAAAGCTTCATTGCTGGCCGATGCAGTCATCTACATCAACGAGCTCAAGTCCAAGATTCAAGCAATGGAGTCCAAGCTAAAATCACAGCACCAGAGCAGCAGC
ATGAGCCAAGTACTCGATCAGAACAGGAGCCCGGTCGAGCAGACGATCACGTCAGCATATTACGCGATGAACAACAACAACAACAACAACAATAACAACGTCGAGGTGAA
TATTATTGGGTCGGAAGCGATGGTGAGAGTTCAATGCCGAGACGAGAATTACCCGTCGGCGAGGCTCCTGAATGTGCTCCGAGACCTGGGGCTTCAAGTTCATCATGCGA
GTTTCTCAAGCGTAAACGATCTGATGCTGCAAGATGTTGTTGTTAGGATTCCTCAAGGAGCAGCCATGAGAGAGAAGGCCTTAAGAACTGCCATACTTCAACGACTGGAT
ATGTTGATTATCTACATAGTGTCCAATCCTTCACACCCCACAGGAGATAAGGAGTTTCGTGGAGTGGTGAAATGCGTAGCGATGGGAAAGAACACCAACGGTGAAAGCAC
TCTGTCGGGTCGTAAGAAAAGGAGTAAAGTGATATGGTGTGGAATTGCAGAGGCATCAGGGGAGCCGACGCCGGTGGGGCAAAAACCAAGTACAATGATGGGGCGTTTGA
AAGGTGTTCATGACGCTGTTTGCTCGGAAGATGGAGAAGTTTGCAAAAGAGCAGAGGGAGAAGAAGACGGCGGCGGCGCATGGTGGGAGTTTTATGACTACGAAAGCTTC
GTGGACGTGTCGAGAAGAGTAATGCAAGGGAAGTCTCGGCTGCAGCAGCAGCAGGTGGTGCGAGAGGTTCTCTTGTCTATGCTTCCTCCAGGAGCGCCTGCTCAGTTCAG
GAAATTGTTCCCGCCAACAAGGTGGGCGGCTGAGTTCAATGCCTCAATAACAGTGCCATTTTTTCAGTGGTTAGTCGGCCCGTCGGAGGTTGTGGAAGTGGAGGTAAATG
GAATAAAGCAAAGAAGTGGAGTTCATATAAAGAAGTGCAGGTACCTTGAGAACAGTGGGTGTGTGGGTATGTGCGTGAATATGTGCAAGATACCTACACAAGATTTCTTC
ACCAATGAATTTGGGCTCCCTCTCACCATGAATCCTAATTTTGAAGACATGAGTTGTGAGATGATATATGGCCAAGCTCCACCGCCATTTGAAGAGGATCCAGTATCCAA
TCAACCTTGCTACACAAATTTATGTTCCATGGCGAATCCTAGTGCCTCCTTATGTCATAAATTGCAAGTTGACCTTGTTGGTGGCTACTATGACGCTGGGGATAATGTCA
AGTTTGGCTTGCCAATGGCCTTCACTACTACATTGCTGGCTTGGAGTGTCATTGAGTTTGGCGACTCGATGGGGAGAGAGATTGAGAATGCAAGAGCAGCTGTCCGATGG
GGGTCGGATTATCTATTAAAGGCTGCCACTGCTGCCCCTGACACCTTATATGTTCAAGTGGGAGACCCAAACCTTGATCACAAATGCTGGGAAAGGCCTGAAGATATGGA
CACACCGCGCAGTGTGTATAAGATAAACACTCAAAATCCAGGCTCTGATGTAGCAGCAGAGACCGCGGCTGCGCTGGCTGCGGCTTCGATTGTGTTCAAAGCCTCCGACC
CTTCTTATTCTACCAAATTGCTCGACGCGGCCTTGAAAACAAGCATAGAGGTTCTTACAGCGATTCCCTCCATTCAATGGTCTGTCCATTTTACTGTTCTTACTCCGGAT
ACAACGTACTCCAATGGCCACATACTGGGTGCTGATGACGACGACTACACGTTCAGCTGGGACGACAAGCGCCCTGGAACAAAGATCCTTCTCTCCCAGGATTTCCTAGA
GAAAAACTCAGAGGAATTCCAAGTCTATAAAGCACACTCTGATAATTACATATGCTCTCTGATTCCAGGGACTTCCAGTTTCGGTTCCCAATATACTCCTG
Protein sequence
MDDLVSPSSSPNSLASFCHQTPSTLQQRLQFILHDRPEWWAYSIFWQATKDMAGNLVLTWRDGHFRGTRDFAARPSKTNDGGAGQLISFGFDSGRKKDIAFHDLFYDDLV
NRMEGGDFTDLEWYYTVSITRSFGVADNVIGRVFDSSSYVWLTADDGLNHVGFFGDFRGVLELGSSDLIKQDWSLAQSAKSLFGAAGACFTPLNDQNDQAPPCSGVAKKE
PGGSGSGGVGGSSSDSLSDNSDGNFIPAKNNKRGKKPAKSSSPPVNHVEAERQRRLKLNHRFYALRSVVPNVSKMDKASLLADAVIYINELKSKIQAMESKLKSQHQSSS
MSQVLDQNRSPVEQTITSAYYAMNNNNNNNNNNVEVNIIGSEAMVRVQCRDENYPSARLLNVLRDLGLQVHHASFSSVNDLMLQDVVVRIPQGAAMREKALRTAILQRLD
MLIIYIVSNPSHPTGDKEFRGVVKCVAMGKNTNGESTLSGRKKRSKVIWCGIAEASGEPTPVGQKPSTMMGRLKGVHDAVCSEDGEVCKRAEGEEDGGGAWWEFYDYESF
VDVSRRVMQGKSRLQQQQVVREVLLSMLPPGAPAQFRKLFPPTRWAAEFNASITVPFFQWLVGPSEVVEVEVNGIKQRSGVHIKKCRYLENSGCVGMCVNMCKIPTQDFF
TNEFGLPLTMNPNFEDMSCEMIYGQAPPPFEEDPVSNQPCYTNLCSMANPSASLCHKLQVDLVGGYYDAGDNVKFGLPMAFTTTLLAWSVIEFGDSMGREIENARAAVRW
GSDYLLKAATAAPDTLYVQVGDPNLDHKCWERPEDMDTPRSVYKINTQNPGSDVAAETAAALAAASIVFKASDPSYSTKLLDAALKTSIEVLTAIPSIQWSVHFTVLTPD
TTYSNGHILGADDDDYTFSWDDKRPGTKILLSQDFLEKNSEEFQVYKAHSDNYICSLIPGTSSFGSQYTPX
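
The protein sequence above ends in `X`, which may reflect a CDS whose length is not a multiple of three, so that the last codon is incomplete. This is an interpretation, not something the record states. A minimal Python sketch for checking it on any pasted CDS is shown below; `codon_summary` is a hypothetical helper, and it assumes the input is plain nucleotide text in which line breaks or spaces can be ignored.

```python
from typing import Tuple


def codon_summary(cds: str) -> Tuple[int, int]:
    """Return (complete codons, leftover nucleotides) for a CDS string.

    Whitespace (for example, line breaks from a wrapped listing) is removed
    before counting. Only A, C, G, T and N are accepted.
    """
    cleaned = "".join(cds.split()).upper()
    invalid = set(cleaned) - set("ACGTN")
    if invalid:
        raise ValueError(f"unexpected characters in CDS: {sorted(invalid)}")
    return divmod(len(cleaned), 3)


# Toy input: 13 nucleotides wrapped over two lines.
complete, leftover = codon_summary("ATGGACGAC\nCTCG")
print(complete, leftover)
```

A non-zero leftover means the final codon is partial. Comparing the number of complete codons with the protein length (excluding a trailing `X`) is one way to confirm that the two listings describe the same reading frame.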