; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021305 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021305
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionFYVE-type domain-containing protein
Genome locationChr05:7528358..7536361
RNA-Seq ExpressionHG10021305
SyntenyHG10021305
Gene Ontology termsGO:0030245 - cellulose catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008810 - cellulase activity (molecular function)
GO:0035091 - phosphatidylinositol binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR000306 - FYVE zinc finger
IPR001701 - Glycoside hydrolase family 9
IPR007461 - Ysc84 actin-binding domain
IPR008928 - Six-hairpin glycosidase superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR012341 - Six-hairpin glycosidase-like superfamily
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017455 - Zinc finger, FYVE-related
IPR018221 - Glycoside hydrolase family 9, His active site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057068.1 SH3 domain-containing protein [Cucumis melo var. makuwa]5.4e-29192.75Show/hide
Query:  SFSSMEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCF
        SFSSME+NHPCSPT+      S A  ASDYPN   G +SDD  SPLGYPKLDSFRE+NASLGPQCS  EKT +FRESDSES    GEMQKS DQN+GNCF
Subjt:  SFSSMEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCF

Query:  YYDSPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALN
        YYDSPLLEDTGVWIPISVPPMSE+DHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN GCRFSWISSHMVEQAL+
Subjt:  YYDSPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALN

Query:  ELAHSLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKV
        ELAHSLTEANFGNIREI+EAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKV
Subjt:  ELAHSLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKV

Query:  SNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPS
        SNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITA+KVGAVVTYNVGTGLVVARREDGSWSPPS
Subjt:  SNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPS

Query:  AISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARF
        AISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLA+GAG+SAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARF
Subjt:  AISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARF

Query:  YGNQSLTASDILLGSLPRPPAAAILYRALAELYQKINK
        YGNQS+TASDILLGSLPRPPAAAILYRAL +LYQKINK
Subjt:  YGNQSLTASDILLGSLPRPPAAAILYRALAELYQKINK

XP_004147420.1 uncharacterized protein LOC101212249 [Cucumis sativus]1.1e-28892.13Show/hide
Query:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS
        ME+N PCSPT+      S A  ASDYPN H G +SDD  SPLGYPKLDSFRELNAS GPQCS  EK+ EFRESDSES    GE+ KSCDQN+GNCFYYDS
Subjt:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS

Query:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
        PLLEDTGVWIPISVPPMSE+DHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN GCR SWISSHMVEQALNELAH
Subjt:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH

Query:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
        SLTEANFGNIREI+EAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
Subjt:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA

Query:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS
        QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNT+RAYNKVGSLKPEKLIPDAILGQAKGLAIITA+KVGAVVTYNVGTGLVVARREDGSWSPPSAISS
Subjt:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS

Query:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
        IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLA+GAG+SAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
Subjt:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ

Query:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK
        S+TASDILLGSLPRPPAAA+LYRAL +LYQKINK
Subjt:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK

XP_008443606.1 PREDICTED: uncharacterized protein LOC103487151 isoform X1 [Cucumis melo]3.9e-28992.7Show/hide
Query:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS
        ME+NHPCSPT+      S A  ASDYPN   G +SDD  SPLGYPKLDSFRE+NASLGPQCS  EKT +FRESDSES    GEMQKS DQN+GNCFYYDS
Subjt:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS

Query:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
        PLLEDTGVWIPISVPPMSE+DHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN GCRFSWISSHMVEQAL+ELAH
Subjt:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH

Query:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
        SLTEANFGNIREI+EAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
Subjt:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA

Query:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS
        QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITA+KVGAVVTYNVGTGLVVARREDGSWSPPSAISS
Subjt:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS

Query:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
        IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLA+GAG+SAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
Subjt:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ

Query:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK
        S+TASDILLGSLPRPPAAAILYRAL +LYQKINK
Subjt:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK

XP_022954495.1 uncharacterized protein LOC111456746 [Cucurbita moschata]4.9e-28490.26Show/hide
Query:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS
        MEE HP SPT+S+ +E      ASD+P  HP  ISD  PSPLGYPKLDSFRE   SLGP CSSSEK +E R+SDSESLQSIGE+QKSCDQN+GNCFYYDS
Subjt:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS

Query:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
        PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPE +SGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN+GCRFSWISSH+VEQALNELAH
Subjt:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH

Query:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
        SLTEANFGNI+EI+EAEPPRWLSDSAA+SCMLCGVKFHPIMCSRHHCRFCGGIFCG CS+GRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
Subjt:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA

Query:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS
        QLPT DLTDLSTLRSWLNFPWGQSME+EIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAI+TA+KVGAVVTYNVGTGLVVARREDGSWSPPSAIS+
Subjt:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS

Query:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
        IGMGWGAQIGGELTDFIIVLRT DAVNAFSG+MHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRT+ENARFYGNQ
Subjt:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ

Query:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK
        S+TA+DILLGSLPRPPAAAILY ALA+LY KI+K
Subjt:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK

XP_038894446.1 uncharacterized protein LOC120083029 [Benincasa hispida]1.1e-30496.25Show/hide
Query:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS
        MEENHPCSPTDSQ +EASR AKASDYPNP P FISDD  SPLGYPKLDSFRELN SLGPQCS S K ++FRESDSESLQSIGE+QKSCDQN+GNCFYYDS
Subjt:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS

Query:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
        PLLEDTGVWIPISVPPMSE+DHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
Subjt:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH

Query:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
        SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
Subjt:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA

Query:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS
        QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITA+KVGAVVTYNVGTGLVVARREDGSWSPPSAISS
Subjt:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS

Query:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
        IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFG CYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
Subjt:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ

Query:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK
        SLTASDILLGSLPRPPAAAILYRALA+LYQKI+K
Subjt:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK

TrEMBL top hitse value%identityAlignment
A0A0A0LUK2 FYVE-type domain-containing protein5.5e-28992.13Show/hide
Query:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS
        ME+N PCSPT+      S A  ASDYPN H G +SDD  SPLGYPKLDSFRELNAS GPQCS  EK+ EFRESDSES    GE+ KSCDQN+GNCFYYDS
Subjt:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS

Query:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
        PLLEDTGVWIPISVPPMSE+DHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN GCR SWISSHMVEQALNELAH
Subjt:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH

Query:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
        SLTEANFGNIREI+EAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
Subjt:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA

Query:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS
        QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNT+RAYNKVGSLKPEKLIPDAILGQAKGLAIITA+KVGAVVTYNVGTGLVVARREDGSWSPPSAISS
Subjt:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS

Query:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
        IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLA+GAG+SAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
Subjt:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ

Query:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK
        S+TASDILLGSLPRPPAAA+LYRAL +LYQKINK
Subjt:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK

A0A1S3B969 uncharacterized protein LOC103487151 isoform X11.9e-28992.7Show/hide
Query:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS
        ME+NHPCSPT+      S A  ASDYPN   G +SDD  SPLGYPKLDSFRE+NASLGPQCS  EKT +FRESDSES    GEMQKS DQN+GNCFYYDS
Subjt:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS

Query:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
        PLLEDTGVWIPISVPPMSE+DHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN GCRFSWISSHMVEQAL+ELAH
Subjt:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH

Query:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
        SLTEANFGNIREI+EAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
Subjt:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA

Query:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS
        QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITA+KVGAVVTYNVGTGLVVARREDGSWSPPSAISS
Subjt:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS

Query:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
        IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLA+GAG+SAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
Subjt:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ

Query:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK
        S+TASDILLGSLPRPPAAAILYRAL +LYQKINK
Subjt:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK

A0A5A7UPM6 SH3 domain-containing protein2.6e-29192.75Show/hide
Query:  SFSSMEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCF
        SFSSME+NHPCSPT+      S A  ASDYPN   G +SDD  SPLGYPKLDSFRE+NASLGPQCS  EKT +FRESDSES    GEMQKS DQN+GNCF
Subjt:  SFSSMEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCF

Query:  YYDSPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALN
        YYDSPLLEDTGVWIPISVPPMSE+DHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN GCRFSWISSHMVEQAL+
Subjt:  YYDSPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALN

Query:  ELAHSLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKV
        ELAHSLTEANFGNIREI+EAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKV
Subjt:  ELAHSLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKV

Query:  SNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPS
        SNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITA+KVGAVVTYNVGTGLVVARREDGSWSPPS
Subjt:  SNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPS

Query:  AISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARF
        AISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLA+GAG+SAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARF
Subjt:  AISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARF

Query:  YGNQSLTASDILLGSLPRPPAAAILYRALAELYQKINK
        YGNQS+TASDILLGSLPRPPAAAILYRAL +LYQKINK
Subjt:  YGNQSLTASDILLGSLPRPPAAAILYRALAELYQKINK

A0A6J1GSK3 uncharacterized protein LOC1114567462.4e-28490.26Show/hide
Query:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS
        MEE HP SPT+S+ +E      ASD+P  HP  ISD  PSPLGYPKLDSFRE   SLGP CSSSEK +E R+SDSESLQSIGE+QKSCDQN+GNCFYYDS
Subjt:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS

Query:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
        PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPE +SGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN+GCRFSWISSH+VEQALNELAH
Subjt:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH

Query:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
        SLTEANFGNI+EI+EAEPPRWLSDSAA+SCMLCGVKFHPIMCSRHHCRFCGGIFCG CS+GRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
Subjt:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA

Query:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS
        QLPT DLTDLSTLRSWLNFPWGQSME+EIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAI+TA+KVGAVVTYNVGTGLVVARREDGSWSPPSAIS+
Subjt:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS

Query:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
        IGMGWGAQIGGELTDFIIVLRT DAVNAFSG+MHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRT+ENARFYGNQ
Subjt:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ

Query:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK
        S+TA+DILLGSLPRPPAAAILY ALA+LY KI+K
Subjt:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK

A0A6J1JWU8 uncharacterized protein LOC1114904092.0e-28389.89Show/hide
Query:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS
        MEE HP SPT+S+ +E      ASD+P  HP  ISD+ PSPLGYPKLDSF+E   SLGP CSS EK +E R+SDSESLQSIGE++KSCDQN+GNCFYYDS
Subjt:  MEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYDS

Query:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH
        PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFP+G+SGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSN+GCRFSWISSH+VEQALNELAH
Subjt:  PLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAH

Query:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
        SLTEANFGNI+EI+EAEPPRWLSDSAA+SCMLCGVKFHPIMCSRHHCRFCGGIFCG CS+GRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA
Subjt:  SLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAA

Query:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS
        QLPT DLTDLSTLRSWLNFPWGQSME+EIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITA+KVGAVVTYNVGTGLVVARREDGSWSPPSAIS+
Subjt:  QLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISS

Query:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ
        IGMGWGAQIGGELTDFIIVLRT DAVNAFSG+MHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRT+ENARFYGNQ
Subjt:  IGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQ

Query:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK
        S+TA+DILLGSLPRPPAAAILY ALA+LY KI+K
Subjt:  SLTASDILLGSLPRPPAAAILYRALAELYQKINK

SwissProt top hitse value%identityAlignment
P0C1U4 Endoglucanase 98.8e-11147.05Show/hide
Query:  AGRYPKRSPVKFRGDSGLEDGVSSNKPD-GLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQT
        +G+ PK + V +RG+S ++DG+S       LVGG+YD+G+ +KF F  A+++TLLSWSVIEY  KY  + EL H++D I+WG +Y LK F +   T D+ 
Subjt:  AGRYPKRSPVKFRGDSGLEDGVSSNKPD-GLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQT

Query:  IIYSQVGSTSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQGTYTSVDSCGG
        ++    G+TS  G+ Q ND+ CW RPED+ Y RPV +C A  SDLA E+ A+L+A+S+VFK++  YS +L   A  LF    K     +G Y+   + G 
Subjt:  IIYSQVGSTSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQGTYTSVDSCGG

Query:  EARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSANITEILMCSY
        +A KFYNS+SY DE +W G+W++ ATGN+SYL  AT     + A +     D G+F+W+NK +   VLL+RL  F   GYPYE  L    N T I+MCSY
Subjt:  EARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSANITEILMCSY

Query:  I-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYVVGYGNNFPTH
        + I + FNRT GG+I         LQ    A+FL+ LY DYL+        C    P F  +E LRNF+R+Q   + YILG NP+KMSYVVGYGN +P  
Subjt:  I-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYVVGYGNNFPTH

Query:  VHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA
        VHHR ASIP +G  Y C  G +W  +K  NPNI+ GAMVAGPD+ D F D R    +TE T+A NAGLVAAL+A
Subjt:  VHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA

Q38890 Endoglucanase 252.2e-10946.69Show/hide
Query:  KLFSA---GRYPKRSPVKFRGDSGLEDGVSSNKP--DGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVA
        K F+A   G+ PK + V +RG+SGL+DG          LVGG+YD+G+ +KF F  AY +T+LSWSVIEY  KY    EL HVK++I+WGT+Y LK F  
Subjt:  KLFSA---GRYPKRSPVKFRGDSGLEDGVSSNKP--DGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVA

Query:  PNATSDQ-TIIYSQVGS-TSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQG
         N+T+D    + SQVGS  ++DGN   ND+ CW RPEDM Y RPV+ C+   SDLA E+ AAL+++S+VFK++  YS +L   A+ +++       + +G
Subjt:  PNATSDQ-TIIYSQVGS-TSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQG

Query:  TYTSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSA
         Y++  +   E+ KFYNSS Y DE IW G W+++ATGN +YL   T     + A +       G+F+W+NK +   +LL+RL  F   GYPYE  L    
Subjt:  TYTSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSA

Query:  NITEILMCSYI-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYV
        N T I+MCSY+ I   FNRT GG+I         LQ +  A+FL+ LYSDYLD        C    P F +   LR+F+RSQ   ++YILG NP KMSYV
Subjt:  NITEILMCSYI-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYV

Query:  VGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA
        VG+G  +P HVHHR ASIP +   Y+C  G +W  SK  NPN + GAMVAGPDK D + D R    +TEPT+A NAGLVAAL+A
Subjt:  VGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA

Q7XUK4 Endoglucanase 125.5e-10543.97Show/hide
Query:  AGRYPKRSPVKFRGDSGLEDGVS-SNKPDGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQT
        +GR PK + +K+RG+SGL DG   ++   GLVGG+YD+G+N+KF F  A+++T+LSWSVIEY  KY  + E DHV+++I+WGT+YLL  F +  +T D+ 
Subjt:  AGRYPKRSPVKFRGDSGLEDGVS-SNKPDGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQT

Query:  IIYSQVGSTSNDGNEQTNDNCWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQGTYTSVDSCGGE
         +YSQVG    +G +  +  CW RPEDM Y RPV    + A DL GE+ AAL+A+S+VF+++  YS +L   A  +++          G  T        
Subjt:  IIYSQVGSTSNDGNEQTNDNCWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQGTYTSVDSCGGE

Query:  ARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSANITEILMCSYI
           +YNS+SY DE +W+  W+++ATGN +Y+ +ATD    + A++  + +D  +F+W+NK     +LL+RL  F + GYPYE +L    N T + MC+Y 
Subjt:  ARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSANITEILMCSYI

Query:  ID-QHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYVVGYGNNFPTHV
             FN T GG+  F    G  LQ     SFL+ LY+DY++ +      C    P F T+++LR+F+RSQ   +NYILGDNP KMSYVVGYG  +P  +
Subjt:  ID-QHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYVVGYGNNFPTHV

Query:  HHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA
        HHR AS P +G  YSC  G +W  +K ++PN+L GAMV GPDK D F D R      EPT+  NAGLVAAL+A
Subjt:  HHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA

Q84R49 Endoglucanase 104.9e-10946.62Show/hide
Query:  AGRYPKRSPVKFRGDSGLEDGVS-SNKPDGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQT
        +G  PK + V +RG+S ++DG+S S     LVGGFYD+G+ +KF +  A+++T+LSWSVIEY  KY  + ELDHVK++I+WGT+YLLK F +   T D+ 
Subjt:  AGRYPKRSPVKFRGDSGLEDGVS-SNKPDGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQT

Query:  IIYSQVGSTSNDGNEQTNDNCWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQGTYTSVDSCGGE
        +    VG TS  G +  +  CW RPED+ Y RPV++C +  SDLA E+ AAL+A+S+VFK+   YS +L + A+ L+    K    ++G Y+     G +
Subjt:  IIYSQVGSTSNDGNEQTNDNCWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQGTYTSVDSCGGE

Query:  ARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEA-SIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSANITEILMCSY
           FYNS+SY DE +W G W++FATGN +YL+ AT     + A +    S + G+F W++K     VLL+RL  F   GYPYE  L    N T+ +MCSY
Subjt:  ARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEA-SIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSANITEILMCSY

Query:  I-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYVVGYGNNFPTH
        + +   FN T GGMI         LQ    A+FL+ LYSDYLD        C    P F T E LR F+RSQ   L+Y+LG NP+KMSYVVG+GN +P  
Subjt:  I-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYVVGYGNNFPTH

Query:  VHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA
         HHR ASIP +G  Y C  G +W  +K  NPNIL GA+VAGPD+ D F D R    +TEPT+A+NAGLVAALI+
Subjt:  VHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA

Q9STW8 Endoglucanase 216.1e-10446.31Show/hide
Query:  TDKKLFSA---GRYPKR-SPVKFRGDSGLEDGVSSNKP--DGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLK
        T  K F+A   G+ PK    V +R DS L+DG          LVGG+YD+G+++KF F  +Y +T+LSWSVIEY  KY    EL+HVK++I+WGT+Y LK
Subjt:  TDKKLFSA---GRYPKR-SPVKFRGDSGLEDGVSSNKP--DGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLK

Query:  VFVAPNATSDQT-IIYSQVGS-TSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDP
         F   N+++D   ++  QVGS  S  G+E  ND+ CW RPED+ Y R VS+C +  SDLA E+ AAL+++S+VFK++  YS  L   A+ L+   T    
Subjt:  VFVAPNATSDQT-IIYSQVGS-TSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDP

Query:  SEQGTYTSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYAL
        + +  Y+     G E+ KFYNSS + DEL+W G WL++ATGN +YL   T     + A +   S   G+F+W+NK     +LLTR+  F   GYPYE  L
Subjt:  SEQGTYTSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYAL

Query:  GVSANITEILMCSYI-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMK
            N T  +MCSY+   + FNRT GG+I         LQ  A A+FL+ L+SDYL+        C    P F T E LRNFSRSQ   ++YILG NP K
Subjt:  GVSANITEILMCSYI-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMK

Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA
        MSYVVGYG  +P  VHHR ASIP      +C  G +W  SK +NPN ++GAMVAGPDK D F D R    +TEPT+A NAGLVAAL+A
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA

Arabidopsis top hitse value%identityAlignment
AT1G29800.1 RING/FYVE/PHD-type zinc finger family protein2.2e-20269.33Show/hide
Query:  RELNASLGPQCSS---SEKTIEFRESDSESLQSIGEMQKSCD---QNKGNCFYYDSPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWS
        RE+ AS G   +    S+     RE DS S +   E  K  +   Q  G  F+YD+PL E+TGVWIP+SVPPM E DHEEW++G   NGG FPEGD GW 
Subjt:  RELNASLGPQCSS---SEKTIEFRESDSESLQSIGEMQKSCD---QNKGNCFYYDSPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWS

Query:  QCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAHSLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSR
        Q F+ +KELTMWDV+V+MLLAARGK  +L+S +   C  +++S H++EQA  ++AH+LTEANFGN REI+E EPP+WL DSAAS+CMLC V+FHPIMCSR
Subjt:  QCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAHSLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSR

Query:  HHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAY-NKVG
        HHCR+CGGIFC DCSKG+SL+PVKFRV+DPQRVCDVC VRLESVQPYLMD+VS AAQLPTHDLTDLSTLRSW+NFPWGQSME+EIYKATNT+R Y  KVG
Subjt:  HHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAY-NKVG

Query:  SLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSA
        S + E+ IPDAIL QAKGLA+IT  +VG +VTY +GTGLVVARR+DGSWSPPSAISS G+GWGAQ GGE  DFIIVLRT +A+  F  N HL +GAG+SA
Subjt:  SLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSA

Query:  AVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQSLTASDILLGSLPRPPAAAILYRALAELYQKI
        AVGV GR  EAD+RAG GG+ +CYTYSCSKGAFVGCSL+GSI TTRT EN+RFYG+QSL ASDILLGSLPRPPAAA LYRAL +LYQK+
Subjt:  AVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQSLTASDILLGSLPRPPAAAILYRALAELYQKI

AT1G29800.2 RING/FYVE/PHD-type zinc finger family protein1.3e-20269.04Show/hide
Query:  RELNASLGPQCSS---SEKTIEFRESDSESLQSIGEMQKSCD---QNKGNCFYYDSPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWS
        RE+ AS G   +    S+     RE DS S +   E  K  +   Q  G  F+YD+PL E+TGVWIP+SVPPM E DHEEW++G   NGG FPEGD GW 
Subjt:  RELNASLGPQCSS---SEKTIEFRESDSESLQSIGEMQKSCD---QNKGNCFYYDSPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWS

Query:  QCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAHSLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSR
        Q F+ +KELTMWDV+V+MLLAARGK  +L+S +   C  +++S H++EQA  ++AH+LTEANFGN REI+E EPP+WL DSAAS+CMLC V+FHPIMCSR
Subjt:  QCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAHSLTEANFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSR

Query:  HHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAY-NKVG
        HHCR+CGGIFC DCSKG+SL+PVKFRV+DPQRVCDVC VRLESVQPYLMD+VS AAQLPTHDLTDLSTLRSW+NFPWGQSME+EIYKATNT+R Y  KVG
Subjt:  HHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAY-NKVG

Query:  SLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSA
        S + E+ IPDAIL QAKGLA+IT  +VG +VTY +GTGLVVARR+DGSWSPPSAISS G+GWGAQ GGE  DFIIVLRT +A+  F  N HL +GAG+SA
Subjt:  SLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSA

Query:  AVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQSLTASDILLGSLPRPPAAAILYRALAELYQKINK
        AVGV GR  EAD+RAG GG+ +CYTYSCSKGAFVGCSL+GSI TTRT EN+RFYG+QSL ASDILLGSLPRPPAAA LYRAL +LYQK+ +
Subjt:  AVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQSLTASDILLGSLPRPPAAAILYRALAELYQKINK

AT3G43230.1 RING/FYVE/PHD-type zinc finger family protein2.4e-11147.74Show/hide
Query:  EDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQC--FEGEK--ELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELA
        ED   W   S+P   E ++     G +++ G    G+   +     +G +  E+ + +V+  ++    G+  +     N     S++ S          +
Subjt:  EDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQC--FEGEK--ELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELA

Query:  HSLTEA---------NFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQP
          +  A         N+   +E++EAEPP WL DS AS+CM C   F  I C RHHCRFCGGIFC +CSKGR L+P +FR  +PQRVCD C  RL+ +Q 
Subjt:  HSLTEA---------NFGNIREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQP

Query:  YLMDKVSNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDG
         L++ +SNA Q+  HD+ D +  R WLN P G SME EIYKA NT+R Y +V  L PEK IP A+L +AKGLAIIT  K GA+++Y +GTGLV++RR DG
Subjt:  YLMDKVSNAAQLPTHDLTDLSTLRSWLNFPWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDG

Query:  SWSPPSAISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRT
        SWS PSAI S+G+GWGAQIGGEL DFIIVL    AV  F   MH ++GAG SAA G IGR  EAD+RAGD G G CYTYS SKGAFVG SL+G++V TR 
Subjt:  SWSPPSAISSIGMGWGAQIGGELTDFIIVLRTSDAVNAFSGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRT

Query:  QENARFYGNQSLTASDILLGSLPRPPAAAILYRALAELYQKI
          N +FYG+  L+ SDILLG + +P AA  LY AL ELY ++
Subjt:  QENARFYGNQSLTASDILLGSLPRPPAAAILYRALAELYQKI

AT4G24260.1 glycosyl hydrolase 9A34.4e-10546.31Show/hide
Query:  TDKKLFSA---GRYPKR-SPVKFRGDSGLEDGVSSNKP--DGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLK
        T  K F+A   G+ PK    V +R DS L+DG          LVGG+YD+G+++KF F  +Y +T+LSWSVIEY  KY    EL+HVK++I+WGT+Y LK
Subjt:  TDKKLFSA---GRYPKR-SPVKFRGDSGLEDGVSSNKP--DGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLK

Query:  VFVAPNATSDQT-IIYSQVGS-TSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDP
         F   N+++D   ++  QVGS  S  G+E  ND+ CW RPED+ Y R VS+C +  SDLA E+ AAL+++S+VFK++  YS  L   A+ L+   T    
Subjt:  VFVAPNATSDQT-IIYSQVGS-TSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDP

Query:  SEQGTYTSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYAL
        + +  Y+     G E+ KFYNSS + DEL+W G WL++ATGN +YL   T     + A +   S   G+F+W+NK     +LLTR+  F   GYPYE  L
Subjt:  SEQGTYTSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYAL

Query:  GVSANITEILMCSYI-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMK
            N T  +MCSY+   + FNRT GG+I         LQ  A A+FL+ L+SDYL+        C    P F T E LRNFSRSQ   ++YILG NP K
Subjt:  GVSANITEILMCSYI-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMK

Query:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA
        MSYVVGYG  +P  VHHR ASIP      +C  G +W  SK +NPN ++GAMVAGPDK D F D R    +TEPT+A NAGLVAAL+A
Subjt:  MSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA

AT5G49720.1 glycosyl hydrolase 9A11.5e-11046.69Show/hide
Query:  KLFSA---GRYPKRSPVKFRGDSGLEDGVSSNKP--DGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVA
        K F+A   G+ PK + V +RG+SGL+DG          LVGG+YD+G+ +KF F  AY +T+LSWSVIEY  KY    EL HVK++I+WGT+Y LK F  
Subjt:  KLFSA---GRYPKRSPVKFRGDSGLEDGVSSNKP--DGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVA

Query:  PNATSDQ-TIIYSQVGS-TSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQG
         N+T+D    + SQVGS  ++DGN   ND+ CW RPEDM Y RPV+ C+   SDLA E+ AAL+++S+VFK++  YS +L   A+ +++       + +G
Subjt:  PNATSDQ-TIIYSQVGS-TSNDGNEQTNDN-CWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQG

Query:  TYTSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSA
         Y++  +   E+ KFYNSS Y DE IW G W+++ATGN +YL   T     + A +       G+F+W+NK +   +LL+RL  F   GYPYE  L    
Subjt:  TYTSVDSCGGEARKFYNSSSYTDELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSA

Query:  NITEILMCSYI-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYV
        N T I+MCSY+ I   FNRT GG+I         LQ +  A+FL+ LYSDYLD        C    P F +   LR+F+RSQ   ++YILG NP KMSYV
Subjt:  NITEILMCSYI-IDQHFNRTPGGMILFRPHDGAWLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGF-TLENLRNFSRSQASALNYILGDNPMKMSYV

Query:  VGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA
        VG+G  +P HVHHR ASIP +   Y+C  G +W  SK  NPN + GAMVAGPDK D + D R    +TEPT+A NAGLVAAL+A
Subjt:  VGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNILSGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTGACTGATAAAAAGCTTTTCTCAGCTGGTAGGTACCCGAAACGTAGTCCAGTCAAGTTTCGGGGAGATTCAGGCTTGGAAGATGGGGTTTCGAGCAATAAACC
AGATGGTCTCGTTGGTGGTTTCTATGATTCTGGAAACAACATGAAGTTCACTTTCACTACAGCTTATACCATAACTCTTTTAAGCTGGAGTGTGATCGAGTATCATCCGA
AGTATGCAGACATGAACGAGCTTGATCATGTAAAGGACATCATCAGATGGGGAACTGAATATTTGCTCAAAGTTTTTGTAGCCCCAAATGCAACTTCTGATCAAACTATA
ATATATTCTCAGGTTGGCAGTACCAGTAATGATGGTAATGAACAAACTAATGACAACTGCTGGCAAAGACCAGAAGACATGATGTACCACAGACCTGTTTCAAAATGTGA
TGCCCGGGCTTCGGACCTTGCTGGAGAAATCGTTGCAGCATTATCAGCTTCATCGTTAGTATTCAAAGAAGATAACAACTATTCAGGAGAATTAGCAAAAGCTGCAGAGA
AATTGTTTGAGCATGTGACTAAATTAGACCCAAGCGAGCAAGGAACTTACACCTCGGTTGATTCTTGTGGAGGAGAAGCAAGAAAGTTCTATAACTCGTCAAGTTACACG
GATGAATTGATATGGGCAGGAACTTGGCTGTTCTTTGCTACTGGAAACACTTCATATCTTGCATATGCCACTGATGCTGTCCGATTTCAGTTAGCACAGAGCGAGGAAGC
AAGTATTGACAGAGGGATTTTCAATTGGAACAATAAGTTCAGTGCAACTATGGTATTATTGACACGCCTTCTCTACTTTCATGATATTGGCTACCCATATGAATATGCCT
TAGGAGTATCAGCAAATATCACAGAAATCCTCATGTGTTCTTATATCATTGATCAACACTTCAACAGGACACCTGGAGGAATGATTCTCTTTAGGCCTCATGATGGAGCA
TGGCTCCAGTTAGCTGCAACAGCATCATTTCTCAGTAAATTGTACAGTGATTACCTTGATCTTTTGGGAGCATCTTACATGAGTTGCATTTTCGCCAATCCTGGCTTTAC
TTTGGAAAATTTGCGGAACTTCTCCAGGTCTCAGGCAAGTGCTCTCAACTACATACTTGGGGACAATCCAATGAAAATGAGCTACGTAGTTGGCTATGGAAACAACTTTC
CCACCCATGTCCACCATAGGGCTGCCTCAATTCCTTGGGATGGTCAATTCTATTCATGTGCAGAGGGAGATAGATGGCTGTTATCAAAGGCTTCAAATCCAAATATTCTT
TCCGGAGCCATGGTGGCTGGACCAGACAAGTTTGACTATTTCTCAGATGAAAGAGACAGACCTTGGTTTACTGAACCAACCATAGCAAGCAATGCAGGTTTAGTTGCTGC
ACTCATTGCTCGACATGACTATCCAGAACGATCCGAACTGGGAGAGTTGAATTTGCGATCTGGGGACAAGGCATTACATGATAAGCAGAGGAAGAAGAGGAAAGAGAGGA
GGAAAATGGAGAGCGGCGATGGAAGATGGGAGCTCCTTTCCATGGAAATTGAGGGAAAAGCTGTGGGAATTTGGATCCCACAGTTACAATTACAGAGCTCAAAGCTGGAT
GGGTGCTTTAGAGCATGTTCATCATTTTCTTCGATGGAGGAGAATCATCCGTGTTCTCCGACTGATTCCCAAGTCGTCGAAGCTTCTAGGGCTGCTAAGGCTTCCGATTA
TCCCAATCCCCATCCGGGTTTTATTTCTGATGATTATCCTTCTCCTCTGGGTTATCCAAAACTTGATTCTTTTCGAGAGCTTAACGCTTCGTTGGGCCCTCAATGTTCTT
CGAGTGAAAAAACTATTGAATTTAGAGAAAGTGATTCTGAGTCACTCCAAAGCATTGGGGAAATGCAAAAGTCTTGTGATCAGAATAAGGGGAATTGCTTCTATTATGAT
TCGCCTCTGTTGGAAGATACTGGAGTATGGATACCCATTTCTGTTCCACCCATGTCTGAAACTGATCATGAGGAGTGGGCCAAAGGTTTTCACTTGAATGGTGGTTGTTT
TCCTGAAGGTGACTCGGGGTGGAGCCAGTGCTTTGAGGGAGAAAAAGAGTTGACAATGTGGGATGTGATGGTAGAAATGTTGCTTGCAGCTCGAGGAAAAGTCGGTTCGC
TGGCTTCCACCAGTAATAGTGGGTGTAGGTTTTCATGGATATCTAGCCATATGGTGGAGCAGGCTTTGAATGAACTGGCTCATTCACTCACTGAAGCTAATTTTGGAAAT
ATAAGAGAAATTATTGAAGCAGAGCCACCCAGATGGTTGTCTGATAGTGCTGCTTCTTCTTGTATGTTATGTGGTGTGAAGTTTCATCCAATCATGTGCTCCAGGCATCA
CTGTCGGTTTTGCGGTGGAATATTTTGTGGGGATTGCTCTAAAGGGAGGAGCTTGCTGCCTGTAAAATTTCGTGTTGCAGATCCACAACGAGTGTGTGATGTATGTAATG
TGCGACTTGAATCTGTGCAACCATATTTAATGGATAAAGTGAGTAATGCTGCGCAGCTACCAACCCATGACCTGACGGACCTCAGTACATTGAGGTCATGGTTAAACTTC
CCTTGGGGACAGTCAATGGAACATGAGATTTATAAAGCAACAAACACCATTCGAGCATATAATAAGGTGGGTTCTTTAAAGCCTGAGAAACTAATTCCAGATGCGATTCT
AGGACAAGCAAAAGGCCTAGCAATTATTACTGCTATCAAAGTGGGAGCAGTGGTGACTTACAATGTAGGAACTGGTCTCGTAGTTGCACGTAGAGAAGATGGCTCATGGT
CTCCTCCATCTGCAATTTCATCAATTGGTATGGGTTGGGGAGCTCAGATTGGAGGAGAACTGACCGACTTTATAATCGTCTTGAGAACAAGTGATGCTGTCAATGCATTC
AGCGGTAATATGCACTTGGCAATTGGGGCTGGCGTGAGTGCTGCAGTTGGAGTTATTGGAAGGACGGCTGAAGCCGACGTACGAGCTGGTGATGGTGGTTTTGGCTCCTG
CTATACATACAGTTGTAGTAAAGGTGCATTTGTTGGATGTTCACTTAAAGGAAGCATTGTTACGACTCGGACGCAAGAAAATGCTCGATTCTACGGCAATCAGTCACTAA
CTGCCTCCGACATCCTCCTTGGTTCACTCCCAAGACCACCTGCTGCCGCCATTCTGTATCGTGCTCTTGCTGAACTTTATCAGAAGATCAATAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCACTGACTGATAAAAAGCTTTTCTCAGCTGGTAGGTACCCGAAACGTAGTCCAGTCAAGTTTCGGGGAGATTCAGGCTTGGAAGATGGGGTTTCGAGCAATAAACC
AGATGGTCTCGTTGGTGGTTTCTATGATTCTGGAAACAACATGAAGTTCACTTTCACTACAGCTTATACCATAACTCTTTTAAGCTGGAGTGTGATCGAGTATCATCCGA
AGTATGCAGACATGAACGAGCTTGATCATGTAAAGGACATCATCAGATGGGGAACTGAATATTTGCTCAAAGTTTTTGTAGCCCCAAATGCAACTTCTGATCAAACTATA
ATATATTCTCAGGTTGGCAGTACCAGTAATGATGGTAATGAACAAACTAATGACAACTGCTGGCAAAGACCAGAAGACATGATGTACCACAGACCTGTTTCAAAATGTGA
TGCCCGGGCTTCGGACCTTGCTGGAGAAATCGTTGCAGCATTATCAGCTTCATCGTTAGTATTCAAAGAAGATAACAACTATTCAGGAGAATTAGCAAAAGCTGCAGAGA
AATTGTTTGAGCATGTGACTAAATTAGACCCAAGCGAGCAAGGAACTTACACCTCGGTTGATTCTTGTGGAGGAGAAGCAAGAAAGTTCTATAACTCGTCAAGTTACACG
GATGAATTGATATGGGCAGGAACTTGGCTGTTCTTTGCTACTGGAAACACTTCATATCTTGCATATGCCACTGATGCTGTCCGATTTCAGTTAGCACAGAGCGAGGAAGC
AAGTATTGACAGAGGGATTTTCAATTGGAACAATAAGTTCAGTGCAACTATGGTATTATTGACACGCCTTCTCTACTTTCATGATATTGGCTACCCATATGAATATGCCT
TAGGAGTATCAGCAAATATCACAGAAATCCTCATGTGTTCTTATATCATTGATCAACACTTCAACAGGACACCTGGAGGAATGATTCTCTTTAGGCCTCATGATGGAGCA
TGGCTCCAGTTAGCTGCAACAGCATCATTTCTCAGTAAATTGTACAGTGATTACCTTGATCTTTTGGGAGCATCTTACATGAGTTGCATTTTCGCCAATCCTGGCTTTAC
TTTGGAAAATTTGCGGAACTTCTCCAGGTCTCAGGCAAGTGCTCTCAACTACATACTTGGGGACAATCCAATGAAAATGAGCTACGTAGTTGGCTATGGAAACAACTTTC
CCACCCATGTCCACCATAGGGCTGCCTCAATTCCTTGGGATGGTCAATTCTATTCATGTGCAGAGGGAGATAGATGGCTGTTATCAAAGGCTTCAAATCCAAATATTCTT
TCCGGAGCCATGGTGGCTGGACCAGACAAGTTTGACTATTTCTCAGATGAAAGAGACAGACCTTGGTTTACTGAACCAACCATAGCAAGCAATGCAGGTTTAGTTGCTGC
ACTCATTGCTCGACATGACTATCCAGAACGATCCGAACTGGGAGAGTTGAATTTGCGATCTGGGGACAAGGCATTACATGATAAGCAGAGGAAGAAGAGGAAAGAGAGGA
GGAAAATGGAGAGCGGCGATGGAAGATGGGAGCTCCTTTCCATGGAAATTGAGGGAAAAGCTGTGGGAATTTGGATCCCACAGTTACAATTACAGAGCTCAAAGCTGGAT
GGGTGCTTTAGAGCATGTTCATCATTTTCTTCGATGGAGGAGAATCATCCGTGTTCTCCGACTGATTCCCAAGTCGTCGAAGCTTCTAGGGCTGCTAAGGCTTCCGATTA
TCCCAATCCCCATCCGGGTTTTATTTCTGATGATTATCCTTCTCCTCTGGGTTATCCAAAACTTGATTCTTTTCGAGAGCTTAACGCTTCGTTGGGCCCTCAATGTTCTT
CGAGTGAAAAAACTATTGAATTTAGAGAAAGTGATTCTGAGTCACTCCAAAGCATTGGGGAAATGCAAAAGTCTTGTGATCAGAATAAGGGGAATTGCTTCTATTATGAT
TCGCCTCTGTTGGAAGATACTGGAGTATGGATACCCATTTCTGTTCCACCCATGTCTGAAACTGATCATGAGGAGTGGGCCAAAGGTTTTCACTTGAATGGTGGTTGTTT
TCCTGAAGGTGACTCGGGGTGGAGCCAGTGCTTTGAGGGAGAAAAAGAGTTGACAATGTGGGATGTGATGGTAGAAATGTTGCTTGCAGCTCGAGGAAAAGTCGGTTCGC
TGGCTTCCACCAGTAATAGTGGGTGTAGGTTTTCATGGATATCTAGCCATATGGTGGAGCAGGCTTTGAATGAACTGGCTCATTCACTCACTGAAGCTAATTTTGGAAAT
ATAAGAGAAATTATTGAAGCAGAGCCACCCAGATGGTTGTCTGATAGTGCTGCTTCTTCTTGTATGTTATGTGGTGTGAAGTTTCATCCAATCATGTGCTCCAGGCATCA
CTGTCGGTTTTGCGGTGGAATATTTTGTGGGGATTGCTCTAAAGGGAGGAGCTTGCTGCCTGTAAAATTTCGTGTTGCAGATCCACAACGAGTGTGTGATGTATGTAATG
TGCGACTTGAATCTGTGCAACCATATTTAATGGATAAAGTGAGTAATGCTGCGCAGCTACCAACCCATGACCTGACGGACCTCAGTACATTGAGGTCATGGTTAAACTTC
CCTTGGGGACAGTCAATGGAACATGAGATTTATAAAGCAACAAACACCATTCGAGCATATAATAAGGTGGGTTCTTTAAAGCCTGAGAAACTAATTCCAGATGCGATTCT
AGGACAAGCAAAAGGCCTAGCAATTATTACTGCTATCAAAGTGGGAGCAGTGGTGACTTACAATGTAGGAACTGGTCTCGTAGTTGCACGTAGAGAAGATGGCTCATGGT
CTCCTCCATCTGCAATTTCATCAATTGGTATGGGTTGGGGAGCTCAGATTGGAGGAGAACTGACCGACTTTATAATCGTCTTGAGAACAAGTGATGCTGTCAATGCATTC
AGCGGTAATATGCACTTGGCAATTGGGGCTGGCGTGAGTGCTGCAGTTGGAGTTATTGGAAGGACGGCTGAAGCCGACGTACGAGCTGGTGATGGTGGTTTTGGCTCCTG
CTATACATACAGTTGTAGTAAAGGTGCATTTGTTGGATGTTCACTTAAAGGAAGCATTGTTACGACTCGGACGCAAGAAAATGCTCGATTCTACGGCAATCAGTCACTAA
CTGCCTCCGACATCCTCCTTGGTTCACTCCCAAGACCACCTGCTGCCGCCATTCTGTATCGTGCTCTTGCTGAACTTTATCAGAAGATCAATAAATGA
Protein sequenceShow/hide protein sequence
MSLTDKKLFSAGRYPKRSPVKFRGDSGLEDGVSSNKPDGLVGGFYDSGNNMKFTFTTAYTITLLSWSVIEYHPKYADMNELDHVKDIIRWGTEYLLKVFVAPNATSDQTI
IYSQVGSTSNDGNEQTNDNCWQRPEDMMYHRPVSKCDARASDLAGEIVAALSASSLVFKEDNNYSGELAKAAEKLFEHVTKLDPSEQGTYTSVDSCGGEARKFYNSSSYT
DELIWAGTWLFFATGNTSYLAYATDAVRFQLAQSEEASIDRGIFNWNNKFSATMVLLTRLLYFHDIGYPYEYALGVSANITEILMCSYIIDQHFNRTPGGMILFRPHDGA
WLQLAATASFLSKLYSDYLDLLGASYMSCIFANPGFTLENLRNFSRSQASALNYILGDNPMKMSYVVGYGNNFPTHVHHRAASIPWDGQFYSCAEGDRWLLSKASNPNIL
SGAMVAGPDKFDYFSDERDRPWFTEPTIASNAGLVAALIARHDYPERSELGELNLRSGDKALHDKQRKKRKERRKMESGDGRWELLSMEIEGKAVGIWIPQLQLQSSKLD
GCFRACSSFSSMEENHPCSPTDSQVVEASRAAKASDYPNPHPGFISDDYPSPLGYPKLDSFRELNASLGPQCSSSEKTIEFRESDSESLQSIGEMQKSCDQNKGNCFYYD
SPLLEDTGVWIPISVPPMSETDHEEWAKGFHLNGGCFPEGDSGWSQCFEGEKELTMWDVMVEMLLAARGKVGSLASTSNSGCRFSWISSHMVEQALNELAHSLTEANFGN
IREIIEAEPPRWLSDSAASSCMLCGVKFHPIMCSRHHCRFCGGIFCGDCSKGRSLLPVKFRVADPQRVCDVCNVRLESVQPYLMDKVSNAAQLPTHDLTDLSTLRSWLNF
PWGQSMEHEIYKATNTIRAYNKVGSLKPEKLIPDAILGQAKGLAIITAIKVGAVVTYNVGTGLVVARREDGSWSPPSAISSIGMGWGAQIGGELTDFIIVLRTSDAVNAF
SGNMHLAIGAGVSAAVGVIGRTAEADVRAGDGGFGSCYTYSCSKGAFVGCSLKGSIVTTRTQENARFYGNQSLTASDILLGSLPRPPAAAILYRALAELYQKINK