; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG02G012630 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG02G012630
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGlycoside hydrolase, family 43
Genome locationCG_Chr02:26145842..26156026
RNA-Seq ExpressionClCG02G012630
SyntenyClCG02G012630
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domainsIPR006710 - Glycoside hydrolase, family 43
IPR023296 - Glycosyl hydrolase, five-bladed beta-propellor domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045836.1 Glycoside hydrolase, family 43 [Cucumis melo var. makuwa]1.1e-29575.9Show/hide
Query:  MGDRRNGGYGGDSSSGEEDGDAQWRAAIDSVAVSSVFISSLTNGLPATSTTTASNSDDDFELNLGAQPPKQYQIKAQKLLDNILETTLELVEHSNSVPYD
        MGDRR+  +GGDSSSGEEDGDA+WRAAIDSV VSSVFISSLTNG+PATS  T S  DDDFELNL AQPPK YQIKAQKLLDNILETTLELVEHSNSVP  
Subjt:  MGDRRNGGYGGDSSSGEEDGDAQWRAAIDSVAVSSVFISSLTNGLPATSTTTASNSDDDFELNLGAQPPKQYQIKAQKLLDNILETTLELVEHSNSVPYD

Query:  DDSKSSEGGIRLFKNAPVGVVFDHVDELQRPTKRPKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKRACEKSIARLEAKEAAVKATAKREEERVAEL
        DDSKSSEGGIRLFKNAPVGVVFDHVDEL RPTK+PKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKR CEKSIARLEAKEAA+KA AKREEERVA+L
Subjt:  DDSKSSEGGIRLFKNAPVGVVFDHVDELQRPTKRPKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKRACEKSIARLEAKEAAVKATAKREEERVAEL

Query:  KKSPRHSRFPLSPPSHRPTSSSSAPLRQRRTPQQSDEGYRSGLSLSHHSQHLS-FDLSPPAQSSSAASPSSPPISKPTPLFSWPDTFACVPPPLPESWFW
        KK                                           S+HS  L+ F LSPP   S  A P+SP              F+   PP       
Subjt:  KKSPRHSRFPLSPPSHRPTSSSSAPLRQRRTPQQSDEGYRSGLSLSHHSQHLS-FDLSPPAQSSSAASPSSPPISKPTPLFSWPDTFACVPPPLPESWFW

Query:  YCLTHRS------FEVIKAAVRHGKRNFWTFL---DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLH
         C T  +      FE+I AAV HGKRN WTFL   DERM+ RNK+RKSTTLRCD+ S CLI++VIGSLM CILLL L S+I+ KDE+GQGIQIRTSH LH
Subjt:  YCLTHRS------FEVIKAAVRHGKRNFWTFL---DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLH

Query:  FRELEEVEEENFQI--PRKRSPRALKRKPLKRTTTLIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSET
         REL+EVEEEN QI  P KR  R  KR+P KRTT LIDEFLDEDSQLR KFFPDHKTS+DPM+ GNDSMFYYPGRVWLDT G PIQAHGGGV+ DERS+T
Subjt:  FRELEEVEEENFQI--PRKRSPRALKRKPLKRTTTLIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSET

Query:  YYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDY
        YYWYGEYKDGPTYHAH+KGAARVDIIGVGCYSSKDLWTWKNEGIVL AEE DET DLHKSNVLERPKVIYNSRTGKYVMWMHID+VNYTKASVG+AISDY
Subjt:  YYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDY

Query:  PTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAA
        P GPFHYL+SKRPH FDSRDMTIFKDD+GTAYLIYSSE NSELHIGPLSEDYL+VTNV RRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAA
Subjt:  PTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAA

Query:  GSIMGPWETMGNPCIGGNKMFRLATF
         SIMGPWET+GNPCIG NKMFRLATF
Subjt:  GSIMGPWETMGNPCIGGNKMFRLATF

XP_004148025.3 uncharacterized protein LOC101203100 [Cucumis sativus]2.4e-20789.77Show/hide
Query:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT
        DERMKMRN+YRKST LRCDAGS CLI++VIGSLMGCILLL LYS+IS  DEIGQGI +RTSH LHF ELEEVEEEN QI  PRKRSPRA KR+P K+TTT
Subjt:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT

Query:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
        LIDEFLDEDSQLRHKFFPD K S+DPM+TGNDSMFYYPGRVWLDTEG PIQAHGGGVL DERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
Subjt:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD

Query:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY
        LWTWKNEGIVLTAEE DET DLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVG+AISDYPTGPF YLYSK+PH FDSRDMTIFKDDDGTAYLIY
Subjt:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY

Query:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        SSEDNSELH+G LS+DYLDVTNV RR+LIGQHREAPALFKHQGTYYM+TSGCTGWAPNEAL HAA SIMGPWETMGNPCIGGNKMFRLATF
Subjt:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

XP_008457848.1 PREDICTED: uncharacterized protein LOC103497430 [Cucumis melo]3.3e-20489Show/hide
Query:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT
        DERMKMRN+YRKST LRCDAGS CLI++VIGSLMGCILLL LYS+I   DEIGQ I +RTSH LHF ELEEVEEEN QI  PRKRSPRA KR+P K+TTT
Subjt:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT

Query:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
        LIDEFLDEDSQ+RHKFFPD KTS+DPM+TGNDSMFYYPGRVWLDTEG PIQAHGGGVL DERS TYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
Subjt:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD

Query:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY
        LWTWKNEGIVLTAEE DET DLHKSNVLERPKVIYNSRT KYVMWMHIDDVNYTKASVG+AISDYPTGPF YLYSKRPH  DSRDMTIFKDDDGTAYLIY
Subjt:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY

Query:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        SSEDNSELH+G LSEDYLDVTNV RRILIGQHREAPALFKHQGTYYM+TSGCTGWAPNEAL HAA SIMGPWETMGNPC+GGNKMFRLATF
Subjt:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

XP_022927964.1 uncharacterized protein LOC111434812 [Cucurbita moschata]2.8e-20387.72Show/hide
Query:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT
        D++M MRN+YRKST LRCDAGS CLI++VIGSLMGCILLL L S +S KDEIG+GIQ+RTS  LHFRELEEVEEEN QI  PRKRSPRA KR+P K+T T
Subjt:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT

Query:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
        LIDEFLDEDSQLRHKFFPDHKTSVDPM+ G+DSMFYYPGRVWLDTEG PIQAHGGGVL DERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
Subjt:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD

Query:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY
        LWTW+NEGIVLTAEE +ET DLHKSNVLERPKVIYNSRT KYVMWMHIDD NYTKASVG+A+SDYPTGPF YLYSKRPH FDSRDMTIFKDDDGTAYL+Y
Subjt:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY

Query:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        SSEDNSELHIGPLSEDYLDVTNV +RIL+GQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA+ SIMGPWET+GNPCIGGNK+FRLATF
Subjt:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

XP_038901231.1 uncharacterized protein LOC120088188 [Benincasa hispida]8.9e-21092.53Show/hide
Query:  MKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTTLID
        MKMRN+YRKSTTLRC  GS CLI++VIGSLMGCILLL LYS+ S KDEIGQGIQ+RTSH LHFRELEEVEEEN QI  PRKRSPRA KR+P KRT TLID
Subjt:  MKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTTLID

Query:  EFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWT
        EFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEG PIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWT
Subjt:  EFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWT

Query:  WKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIYSSE
        WKNEGIVL AEE DET DLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVG+AISDYPTGPF YLYSKRPH FDSRDMTIFKDDDGTAYLIYSSE
Subjt:  WKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIYSSE

Query:  DNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        DNSELHIGPLS+DYLDVTNV RRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAA SIMGPWETMGNPCIGGNKMFRLATF
Subjt:  DNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

TrEMBL top hitse value%identityAlignment
A0A0A0LJS1 Uncharacterized protein3.3e-20283.78Show/hide
Query:  FEVIKAAVRHGKRNFWTFLD---ERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQ
        FE+I A V HGKR  WTFL+   ERM+MRNK+RKSTTLRCD+ S CLI++VIGSLM CILLL L S  S K+E+GQGIQIRTSH LH REL+EVEEEN Q
Subjt:  FEVIKAAVRHGKRNFWTFLD---ERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQ

Query:  I--PRKRSPRALKRKPLKRTTTLIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTY
        I  P KR  RA KR+P KR T LIDEFLDEDSQLR KFFPDHKT +DPM+TGNDSMFYYPGRVWLDTEG PIQAHGGGV+ DERSETYYWYGEYKDGPTY
Subjt:  I--PRKRSPRALKRKPLKRTTTLIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTY

Query:  HAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRP
        HAH+KGAARVDIIG+GCYSSKDLW+WKNEGIVL AEE DET DLHKSNVLERPKVIYNSRTGKYVMWMHID+VNYTKASVG+AISDYP GPFHYL SKRP
Subjt:  HAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRP

Query:  HRFDSRDMTIFKDDDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNP
        H FDSRDMTIFKDD+GTAYLIYSS+ NSELH+GPLSEDYLDVTNV RR+LIGQHREAPALFKH+GTYYMITSGCTGWAPNEALAHAA SIMGPWET+GNP
Subjt:  HRFDSRDMTIFKDDDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNP

Query:  CIGGNKMFRLATF
        CIG NKMFRLATF
Subjt:  CIGGNKMFRLATF

A0A0A0LPY3 Uncharacterized protein1.2e-20789.77Show/hide
Query:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT
        DERMKMRN+YRKST LRCDAGS CLI++VIGSLMGCILLL LYS+IS  DEIGQGI +RTSH LHF ELEEVEEEN QI  PRKRSPRA KR+P K+TTT
Subjt:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT

Query:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
        LIDEFLDEDSQLRHKFFPD K S+DPM+TGNDSMFYYPGRVWLDTEG PIQAHGGGVL DERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
Subjt:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD

Query:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY
        LWTWKNEGIVLTAEE DET DLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVG+AISDYPTGPF YLYSK+PH FDSRDMTIFKDDDGTAYLIY
Subjt:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY

Query:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        SSEDNSELH+G LS+DYLDVTNV RR+LIGQHREAPALFKHQGTYYM+TSGCTGWAPNEAL HAA SIMGPWETMGNPCIGGNKMFRLATF
Subjt:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

A0A1S3C6G0 uncharacterized protein LOC1034974301.6e-20489Show/hide
Query:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT
        DERMKMRN+YRKST LRCDAGS CLI++VIGSLMGCILLL LYS+I   DEIGQ I +RTSH LHF ELEEVEEEN QI  PRKRSPRA KR+P K+TTT
Subjt:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT

Query:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
        LIDEFLDEDSQ+RHKFFPD KTS+DPM+TGNDSMFYYPGRVWLDTEG PIQAHGGGVL DERS TYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
Subjt:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD

Query:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY
        LWTWKNEGIVLTAEE DET DLHKSNVLERPKVIYNSRT KYVMWMHIDDVNYTKASVG+AISDYPTGPF YLYSKRPH  DSRDMTIFKDDDGTAYLIY
Subjt:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY

Query:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        SSEDNSELH+G LSEDYLDVTNV RRILIGQHREAPALFKHQGTYYM+TSGCTGWAPNEAL HAA SIMGPWETMGNPC+GGNKMFRLATF
Subjt:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

A0A5A7TUM5 Glycoside hydrolase, family 435.1e-29675.9Show/hide
Query:  MGDRRNGGYGGDSSSGEEDGDAQWRAAIDSVAVSSVFISSLTNGLPATSTTTASNSDDDFELNLGAQPPKQYQIKAQKLLDNILETTLELVEHSNSVPYD
        MGDRR+  +GGDSSSGEEDGDA+WRAAIDSV VSSVFISSLTNG+PATS  T S  DDDFELNL AQPPK YQIKAQKLLDNILETTLELVEHSNSVP  
Subjt:  MGDRRNGGYGGDSSSGEEDGDAQWRAAIDSVAVSSVFISSLTNGLPATSTTTASNSDDDFELNLGAQPPKQYQIKAQKLLDNILETTLELVEHSNSVPYD

Query:  DDSKSSEGGIRLFKNAPVGVVFDHVDELQRPTKRPKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKRACEKSIARLEAKEAAVKATAKREEERVAEL
        DDSKSSEGGIRLFKNAPVGVVFDHVDEL RPTK+PKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKR CEKSIARLEAKEAA+KA AKREEERVA+L
Subjt:  DDSKSSEGGIRLFKNAPVGVVFDHVDELQRPTKRPKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKRACEKSIARLEAKEAAVKATAKREEERVAEL

Query:  KKSPRHSRFPLSPPSHRPTSSSSAPLRQRRTPQQSDEGYRSGLSLSHHSQHLS-FDLSPPAQSSSAASPSSPPISKPTPLFSWPDTFACVPPPLPESWFW
        KK                                           S+HS  L+ F LSPP   S  A P+SP              F+   PP       
Subjt:  KKSPRHSRFPLSPPSHRPTSSSSAPLRQRRTPQQSDEGYRSGLSLSHHSQHLS-FDLSPPAQSSSAASPSSPPISKPTPLFSWPDTFACVPPPLPESWFW

Query:  YCLTHRS------FEVIKAAVRHGKRNFWTFL---DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLH
         C T  +      FE+I AAV HGKRN WTFL   DERM+ RNK+RKSTTLRCD+ S CLI++VIGSLM CILLL L S+I+ KDE+GQGIQIRTSH LH
Subjt:  YCLTHRS------FEVIKAAVRHGKRNFWTFL---DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLH

Query:  FRELEEVEEENFQI--PRKRSPRALKRKPLKRTTTLIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSET
         REL+EVEEEN QI  P KR  R  KR+P KRTT LIDEFLDEDSQLR KFFPDHKTS+DPM+ GNDSMFYYPGRVWLDT G PIQAHGGGV+ DERS+T
Subjt:  FRELEEVEEENFQI--PRKRSPRALKRKPLKRTTTLIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSET

Query:  YYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDY
        YYWYGEYKDGPTYHAH+KGAARVDIIGVGCYSSKDLWTWKNEGIVL AEE DET DLHKSNVLERPKVIYNSRTGKYVMWMHID+VNYTKASVG+AISDY
Subjt:  YYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDY

Query:  PTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAA
        P GPFHYL+SKRPH FDSRDMTIFKDD+GTAYLIYSSE NSELHIGPLSEDYL+VTNV RRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAA
Subjt:  PTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAA

Query:  GSIMGPWETMGNPCIGGNKMFRLATF
         SIMGPWET+GNPCIG NKMFRLATF
Subjt:  GSIMGPWETMGNPCIGGNKMFRLATF

A0A6J1EJG7 uncharacterized protein LOC1114348121.3e-20387.72Show/hide
Query:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT
        D++M MRN+YRKST LRCDAGS CLI++VIGSLMGCILLL L S +S KDEIG+GIQ+RTS  LHFRELEEVEEEN QI  PRKRSPRA KR+P K+T T
Subjt:  DERMKMRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT

Query:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
        LIDEFLDEDSQLRHKFFPDHKTSVDPM+ G+DSMFYYPGRVWLDTEG PIQAHGGGVL DERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
Subjt:  LIDEFLDEDSQLRHKFFPDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD

Query:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY
        LWTW+NEGIVLTAEE +ET DLHKSNVLERPKVIYNSRT KYVMWMHIDD NYTKASVG+A+SDYPTGPF YLYSKRPH FDSRDMTIFKDDDGTAYL+Y
Subjt:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY

Query:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        SSEDNSELHIGPLSEDYLDVTNV +RIL+GQHREAPALFKHQGTYYMITSGCTGWAPNEALAHA+ SIMGPWET+GNPCIGGNK+FRLATF
Subjt:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G49880.1 glycosyl hydrolase family protein 431.0e-15568.29Show/hide
Query:  MRNKY-RKSTTLRCDAGSSCLIAMVIGSLMGCILL--LGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTTLI
        M+NK+ +K+T LRC          ++ +++GC+ +  L +  S S   ++    Q+   H +  RELE VEEEN  +  PRKRSPRA+KRKP K  TTL+
Subjt:  MRNKY-RKSTTLRCDAGSSCLIAMVIGSLMGCILL--LGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQI--PRKRSPRALKRKPLKRTTTLI

Query:  DEFLDEDSQLRHKFFPDHKTSVDPMM--TGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD
        +EFLDE+SQ+RH FFPD K++  P    T + S +Y+PGR+W DTEG PIQAHGGG+L D+ S+ YYWYGEYKDGPTY +HKKGAARVDIIGVGCYSSKD
Subjt:  DEFLDEDSQLRHKFFPDHKTSVDPMM--TGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKD

Query:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY
        LWTWKNEG+VL AEE DET DLHKSNVLERPKVIYNS TGKYVMWMHIDD NYTKASVG+AISD PTGPF YLYS+ PH FDSRDMT++KDDD  AYLIY
Subjt:  LWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIY

Query:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        SSEDNS LHIGPL+E+YLDV  V++RI++GQHREAPA+FKHQ TYYMITSGCTGWAPNEALAHAA SIMGPWET+GNPC+GGN +FR  TF
Subjt:  SSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

AT3G49890.1 unknown protein9.0e-3546.94Show/hide
Query:  GGDSSSGEEDGDAQWRAAIDSVAVSSVFISSLTNGLPATSTTTASNSDDDFELNLGAQPPKQY---QIKAQKLLDNILETTLELVEHSNSVPYDDDSKSS
        GGDSSS  ED D +WRAAI+S+A ++V+ +S T   PA    T S++  DF L      PK+    QIK + LL+ ++E TL+ VE   ++P  +D   +
Subjt:  GGDSSSGEEDGDAQWRAAIDSVAVSSVFISSLTNGLPATSTTTASNSDDDFELNLGAQPPKQY---QIKAQKLLDNILETTLELVEHSNSVPYDDDSKSS

Query:  EGGIRLFKNAPVGVVFDHVDELQRPTKRPKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKRACEKSIARLEAKEAAVKATAKREEERVAELKK
        + G+RLFK    G+VFDHVDE++ P K+P + P K +   SK+FK++++S+AV+G DI+TAA  A +K+ ARL+AKE A K  AK+EEER+AELKK
Subjt:  EGGIRLFKNAPVGVVFDHVDELQRPTKRPKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKRACEKSIARLEAKEAAVKATAKREEERVAELKK

AT5G67540.1 Arabinanase/levansucrase/invertase3.0e-14766.41Show/hide
Query:  YRKSTTLRCDAGSSCLIAM--VIGSLMGCILLLGLYSSISCKD-EIGQGI---QIR-TSHLLH--FRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT
        Y  S  LR  AG  C  ++  ++ +++G  L+  L S  S KD  I Q +   Q++   HL H   REL  VEEE  ++  PRKRSPR  KR+  ++   
Subjt:  YRKSTTLRCDAGSSCLIAM--VIGSLMGCILLLGLYSSISCKD-EIGQGI---QIR-TSHLLH--FRELEEVEEENFQI--PRKRSPRALKRKPLKRTTT

Query:  LIDEFLDEDSQLRHKFFPDHKTSV--DPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSS
        L++EFLD+ S +RH FFP  KT+        GN++ +Y+PG++W+DT+G PIQAHGGG+LLD +S TYYWYGEYKDGPTYHAHKKG ARVDIIGVGCYSS
Subjt:  LIDEFLDEDSQLRHKFFPDHKTSV--DPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSS

Query:  KDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYL
        KDLWTWKNEGIVL AEE ++T DLHKSNVLERPKVIYN +T KYVMWMHIDD NYTKASVG+AIS+ PTGPF YLYSKRPH FDSRDMT+FKDDDG AYL
Subjt:  KDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYL

Query:  IYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        IYSSE NS LHIGPL+EDYLDVT V++R+++GQHREAPA+FKHQ  YYM+TS CTGWAPNEALAHAA SIMGPWE +GNPCIGGNK+FRL TF
Subjt:  IYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF

AT5G67540.2 Arabinanase/levansucrase/invertase3.4e-15166.25Show/hide
Query:  MKMRNKY-RKSTTLRCDAGSSCLIAM--VIGSLMGCILLLGLYSSISCKD-EIGQGI---QIR-TSHLLH--FRELEEVEEENFQI--PRKRSPRALKRK
        MK  NKY +KST+L C+    C  ++  ++ +++G  L+  L S  S KD  I Q +   Q++   HL H   REL  VEEE  ++  PRKRSPR  KR+
Subjt:  MKMRNKY-RKSTTLRCDAGSSCLIAM--VIGSLMGCILLLGLYSSISCKD-EIGQGI---QIR-TSHLLH--FRELEEVEEENFQI--PRKRSPRALKRK

Query:  PLKRTTTLIDEFLDEDSQLRHKFFPDHKTSV--DPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDII
          ++   L++EFLD+ S +RH FFP  KT+        GN++ +Y+PG++W+DT+G PIQAHGGG+LLD +S TYYWYGEYKDGPTYHAHKKG ARVDII
Subjt:  PLKRTTTLIDEFLDEDSQLRHKFFPDHKTSV--DPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKD
        GVGCYSSKDLWTWKNEGIVL AEE ++T DLHKSNVLERPKVIYN +T KYVMWMHIDD NYTKASVG+AIS+ PTGPF YLYSKRPH FDSRDMT+FKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAEE-DETRDLHKSNVLERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKD

Query:  DDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF
        DDG AYLIYSSE NS LHIGPL+EDYLDVT V++R+++GQHREAPA+FKHQ  YYM+TS CTGWAPNEALAHAA SIMGPWE +GNPCIGGNK+FRL TF
Subjt:  DDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPALFKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGATCGGAGAAACGGCGGCTATGGTGGCGACAGCAGCAGCGGAGAGGAAGACGGCGACGCCCAATGGAGAGCCGCCATTGATTCTGTTGCTGTCTCGTCTGTGTT
TATCTCGTCCTTGACTAATGGCCTTCCGGCTACTTCGACAACCACAGCTTCAAACTCGGATGATGATTTTGAGCTTAATCTCGGTGCTCAGCCGCCCAAGCAATATCAAA
TCAAGGCACAGAAGCTATTGGATAATATTTTGGAAACTACTCTAGAGTTGGTGGAACATTCCAATTCTGTTCCTTATGATGATGATTCCAAATCCAGTGAAGGTGGAATT
CGTTTGTTTAAAAATGCTCCAGTGGGGGTTGTGTTTGATCATGTGGATGAGCTTCAACGCCCCACAAAGAGACCTAAAATTCTTCCAGGGAAAGAAATTAACGAGAAATC
AAAGAAGTTCAAGCAGCAGCTCCGATCCGTGGCCGTTGAAGGAGAAGACATAATAACTGCTGCAAAACGTGCCTGTGAGAAGTCGATCGCTAGGCTTGAAGCTAAAGAAG
CAGCAGTGAAAGCAACTGCCAAAAGAGAGGAAGAAAGGGTAGCCGAACTGAAAAAGAGCCCACGTCACTCACGATTTCCTCTCTCTCCGCCGTCGCACCGCCCCACTTCC
TCGTCGTCCGCCCCCCTCCGACAGCGTCGCACGCCACAACAGTCCGACGAAGGTTACAGATCGGGTCTCTCTCTCTCACATCACTCTCAACATCTCTCCTTCGATCTCTC
TCCCCCTGCCCAGTCTTCCAGCGCCGCCTCACCGTCCTCTCCGCCCATCAGCAAACCAACGCCACTGTTTTCCTGGCCGGACACATTCGCCTGTGTTCCGCCGCCGTTGC
CGGAATCGTGGTTTTGGTATTGTTTAACCCATAGAAGTTTTGAGGTGATCAAAGCGGCTGTCCGGCATGGGAAACGGAATTTTTGGACATTTTTGGACGAGAGGATGAAG
ATGAGGAACAAATACAGGAAGTCAACTACTTTACGTTGCGATGCAGGGAGCAGCTGTTTGATAGCTATGGTGATTGGGAGTCTAATGGGGTGTATTCTTCTACTGGGGTT
ATATTCTTCTATAAGCTGCAAGGATGAGATAGGACAAGGTATCCAAATTCGAACAAGTCATCTCCTTCACTTCCGAGAACTGGAAGAGGTGGAAGAAGAAAACTTTCAAA
TTCCCCGTAAGAGATCCCCACGTGCATTGAAGCGAAAACCACTAAAGAGAACGACCACACTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTT
CCTGATCATAAAACTTCCGTTGATCCAATGATGACAGGAAATGATAGTATGTTCTATTATCCAGGGAGAGTTTGGCTGGATACTGAAGGGATTCCCATTCAAGCTCATGG
AGGTGGAGTGTTACTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTTGACATTA
TAGGAGTCGGTTGCTACTCTTCCAAAGACTTGTGGACGTGGAAAAATGAAGGCATTGTTTTGACAGCGGAAGAAGACGAGACCCGTGATCTTCACAAATCCAATGTGCTC
GAGAGGCCGAAAGTAATCTACAATTCAAGGACTGGAAAATACGTAATGTGGATGCATATTGATGATGTGAACTATACAAAGGCTTCTGTTGGTATTGCCATCAGTGATTA
CCCAACCGGTCCATTCCATTATCTCTACAGCAAAAGACCTCATAGATTTGACAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTACCTCATTTACTCAT
CCGAAGACAATAGTGAGCTTCATATAGGACCTCTCTCAGAAGATTATCTCGACGTGACCAATGTAGTGAGAAGGATTCTCATTGGCCAACACCGAGAAGCACCGGCTTTG
TTCAAACACCAGGGAACTTACTATATGATCACATCAGGGTGCACAGGATGGGCACCGAATGAGGCACTGGCACACGCAGCAGGGTCGATAATGGGTCCATGGGAGACAAT
GGGAAACCCATGTATAGGAGGAAACAAGATGTTTCGACTGGCTACATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGATCGGAGAAACGGCGGCTATGGTGGCGACAGCAGCAGCGGAGAGGAAGACGGCGACGCCCAATGGAGAGCCGCCATTGATTCTGTTGCTGTCTCGTCTGTGTT
TATCTCGTCCTTGACTAATGGCCTTCCGGCTACTTCGACAACCACAGCTTCAAACTCGGATGATGATTTTGAGCTTAATCTCGGTGCTCAGCCGCCCAAGCAATATCAAA
TCAAGGCACAGAAGCTATTGGATAATATTTTGGAAACTACTCTAGAGTTGGTGGAACATTCCAATTCTGTTCCTTATGATGATGATTCCAAATCCAGTGAAGGTGGAATT
CGTTTGTTTAAAAATGCTCCAGTGGGGGTTGTGTTTGATCATGTGGATGAGCTTCAACGCCCCACAAAGAGACCTAAAATTCTTCCAGGGAAAGAAATTAACGAGAAATC
AAAGAAGTTCAAGCAGCAGCTCCGATCCGTGGCCGTTGAAGGAGAAGACATAATAACTGCTGCAAAACGTGCCTGTGAGAAGTCGATCGCTAGGCTTGAAGCTAAAGAAG
CAGCAGTGAAAGCAACTGCCAAAAGAGAGGAAGAAAGGGTAGCCGAACTGAAAAAGAGCCCACGTCACTCACGATTTCCTCTCTCTCCGCCGTCGCACCGCCCCACTTCC
TCGTCGTCCGCCCCCCTCCGACAGCGTCGCACGCCACAACAGTCCGACGAAGGTTACAGATCGGGTCTCTCTCTCTCACATCACTCTCAACATCTCTCCTTCGATCTCTC
TCCCCCTGCCCAGTCTTCCAGCGCCGCCTCACCGTCCTCTCCGCCCATCAGCAAACCAACGCCACTGTTTTCCTGGCCGGACACATTCGCCTGTGTTCCGCCGCCGTTGC
CGGAATCGTGGTTTTGGTATTGTTTAACCCATAGAAGTTTTGAGGTGATCAAAGCGGCTGTCCGGCATGGGAAACGGAATTTTTGGACATTTTTGGACGAGAGGATGAAG
ATGAGGAACAAATACAGGAAGTCAACTACTTTACGTTGCGATGCAGGGAGCAGCTGTTTGATAGCTATGGTGATTGGGAGTCTAATGGGGTGTATTCTTCTACTGGGGTT
ATATTCTTCTATAAGCTGCAAGGATGAGATAGGACAAGGTATCCAAATTCGAACAAGTCATCTCCTTCACTTCCGAGAACTGGAAGAGGTGGAAGAAGAAAACTTTCAAA
TTCCCCGTAAGAGATCCCCACGTGCATTGAAGCGAAAACCACTAAAGAGAACGACCACACTGATTGATGAATTTCTTGATGAAGATTCACAGCTTAGGCACAAATTCTTT
CCTGATCATAAAACTTCCGTTGATCCAATGATGACAGGAAATGATAGTATGTTCTATTATCCAGGGAGAGTTTGGCTGGATACTGAAGGGATTCCCATTCAAGCTCATGG
AGGTGGAGTGTTACTCGATGAAAGATCTGAAACATACTATTGGTATGGAGAGTATAAAGATGGCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTTGACATTA
TAGGAGTCGGTTGCTACTCTTCCAAAGACTTGTGGACGTGGAAAAATGAAGGCATTGTTTTGACAGCGGAAGAAGACGAGACCCGTGATCTTCACAAATCCAATGTGCTC
GAGAGGCCGAAAGTAATCTACAATTCAAGGACTGGAAAATACGTAATGTGGATGCATATTGATGATGTGAACTATACAAAGGCTTCTGTTGGTATTGCCATCAGTGATTA
CCCAACCGGTCCATTCCATTATCTCTACAGCAAAAGACCTCATAGATTTGACAGTAGAGACATGACAATCTTCAAAGATGATGATGGTACAGCCTACCTCATTTACTCAT
CCGAAGACAATAGTGAGCTTCATATAGGACCTCTCTCAGAAGATTATCTCGACGTGACCAATGTAGTGAGAAGGATTCTCATTGGCCAACACCGAGAAGCACCGGCTTTG
TTCAAACACCAGGGAACTTACTATATGATCACATCAGGGTGCACAGGATGGGCACCGAATGAGGCACTGGCACACGCAGCAGGGTCGATAATGGGTCCATGGGAGACAAT
GGGAAACCCATGTATAGGAGGAAACAAGATGTTTCGACTGGCTACATTCTAG
Protein sequenceShow/hide protein sequence
MGDRRNGGYGGDSSSGEEDGDAQWRAAIDSVAVSSVFISSLTNGLPATSTTTASNSDDDFELNLGAQPPKQYQIKAQKLLDNILETTLELVEHSNSVPYDDDSKSSEGGI
RLFKNAPVGVVFDHVDELQRPTKRPKILPGKEINEKSKKFKQQLRSVAVEGEDIITAAKRACEKSIARLEAKEAAVKATAKREEERVAELKKSPRHSRFPLSPPSHRPTS
SSSAPLRQRRTPQQSDEGYRSGLSLSHHSQHLSFDLSPPAQSSSAASPSSPPISKPTPLFSWPDTFACVPPPLPESWFWYCLTHRSFEVIKAAVRHGKRNFWTFLDERMK
MRNKYRKSTTLRCDAGSSCLIAMVIGSLMGCILLLGLYSSISCKDEIGQGIQIRTSHLLHFRELEEVEEENFQIPRKRSPRALKRKPLKRTTTLIDEFLDEDSQLRHKFF
PDHKTSVDPMMTGNDSMFYYPGRVWLDTEGIPIQAHGGGVLLDERSETYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVLTAEEDETRDLHKSNVL
ERPKVIYNSRTGKYVMWMHIDDVNYTKASVGIAISDYPTGPFHYLYSKRPHRFDSRDMTIFKDDDGTAYLIYSSEDNSELHIGPLSEDYLDVTNVVRRILIGQHREAPAL
FKHQGTYYMITSGCTGWAPNEALAHAAGSIMGPWETMGNPCIGGNKMFRLATF