; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003622 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003622
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionArabinanase/levansucrase/invertase
Genome locationscaffold963:157499..164995
RNA-Seq ExpressionMS003622
SyntenyMS003622
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domainsIPR006710 - Glycoside hydrolase, family 43
IPR023296 - Glycosyl hydrolase, five-bladed beta-propellor domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148025.3 uncharacterized protein LOC101203100 [Cucumis sativus]5.7e-26188.19Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        ML+Y+GD K+E MKMRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+L+++S I+  DEIG+GI L+TSHHL F ELEEV+EENIQIPPPR KRSPRA 
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKTTTLIDEFLDEDSQ+RHKFFPD K S+DPM TGNDSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTWKNEGIVLTA ET+ETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDD NYTKASVGVAISDYPTGPFDYLYSK+PHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYLIYSS+DNSELH+G L++DYLDVTNV RR+ IG HREAPALFKHQGTYYMVTSGCTGWAPNEAL HA+ES++GPWETMGNPCIGGNKMFRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG
        F+QSTFVLPLPS+P+LFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLW RVSIYWHRKWRLPQG
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG

XP_022149504.1 uncharacterized protein LOC111017920 [Momordica charantia]1.1e-29199.37Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPI+RKDEI RGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPM TGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK
        FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK

XP_022927964.1 uncharacterized protein LOC111434812 [Cucurbita moschata]2.2e-26588.82Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP++RKDEIGRGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAA
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKT TLIDEFLDEDSQ+RHKFFPDHKTSVDPM  G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTW+NEGIVLTA ETNETHDLHKSNVLERPKVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYL+YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFKHQGTYYM+TSGCTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG
        F+QSTFVLPLPSHP LFIFMADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG

XP_022974783.1 uncharacterized protein LOC111473520 [Cucurbita maxima]3.0e-26288.4Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP++RK EIGRGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAA
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKT TLIDEFLDEDSQ+RHKFFPDHKTSVDPM  G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTWKNEGIVLTA ETNETHDLHKSNVLERPKVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYL YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFK QGTYYM+TSGCTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG
        F+QSTFVL LPSHP LFIFMADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG

XP_023553112.1 uncharacterized protein LOC111810610 [Cucurbita pepo subsp. pepo]6.5e-26588.61Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        MLHY+GD K+++M MRN+YRKS  LRC+AGSRC ISV+IGSL+GCIL+LH+ SP++RKDEIGRGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAA
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKT TLIDEFLDEDSQ+RHKFFPDHKTSVDPM  G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTW+NEGIVLTA ETNETHDLHKSNVLERPKVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYL+YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFKHQGTYYM+TSGCTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG
        F+QSTFVLPLPSHP LFIFMADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG

TrEMBL top hitse value%identityAlignment
A0A0A0LPY3 Uncharacterized protein5.2e-26087.76Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        ML+Y+GD K+E MKMRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+L+++S I+  DEIG+GI L+TSHHL F ELEEV+EENIQIPPPR KRSPRA 
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKTTTLIDEFLDEDSQ+RHKFFPD K S+DPM TGNDSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTWKNEGIVLTA ET+ETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDD NYTKASVGVAISDYPTGPFDYLYSK+PHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYLIYSS+DNSELH+G L++DYLDVTNV RR+ IG HREAPALFKHQGTYYMVTSGCTGWAPNEAL HA+ES++GPWETMGNPCIGGNKMFRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG
        F+QSTFVLPLPS+P+LFIFMADRWNPADLRDSRYVWLPLMVGGLVD+PLDYNF FPLW RVSIYWHRKWRLPQG
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG

A0A6J1D780 uncharacterized protein LOC1110179205.1e-29299.37Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPI+RKDEI RGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPM TGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK
        FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK

A0A6J1EJG7 uncharacterized protein LOC1114348121.1e-26588.82Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP++RKDEIGRGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAA
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKT TLIDEFLDEDSQ+RHKFFPDHKTSVDPM  G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTW+NEGIVLTA ETNETHDLHKSNVLERPKVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYL+YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFKHQGTYYM+TSGCTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG
        F+QSTFVLPLPSHP LFIFMADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG

A0A6J1IHC3 uncharacterized protein LOC1114735201.5e-26288.4Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA
        MLHY+GD K+++M MRN+YRKST LRC+AGSRC ISV+IGSL+GCIL+LH+ SP++RK EIGRGI+L+TS HL FRELEEV+EENIQIPPPR KRSPRAA
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAA

Query:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        KRRPKKT TLIDEFLDEDSQ+RHKFFPDHKTSVDPM  G+DSM+YYPGRVWLDTEGNPIQAHGGG+LFDERSE+YYWYGEYKDGPTYHAHKKGAARVDII
Subjt:  KRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTWKNEGIVLTA ETNETHDLHKSNVLERPKVIYNSRT KYVMWMHIDDANYTKASVGVA+SDYPTGPFDYLYSKRPHGFDSRDMTIFKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDGTAYL YSS+DNSELHIGPL+EDYLDVTNV +RI +G HREAPALFK QGTYYM+TSGCTGWAPNEALAHASES++GPWET+GNPCIGGNK+FRLATF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG
        F+QSTFVL LPSHP LFIFMADRWNPADLRDSRY+WLPLMVGGLVD+PLDYNFGFPLW RVSIYWHRKWRLPQG
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQG

A0A6J1L3W6 uncharacterized protein LOC1114996589.8e-25987.08Show/hide
Query:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDE-IGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRA
        MLHY GD K+EEMKM+NKYRKSTTLRC+AGS   +SV+IGSL+GCIL+L ++SPI+RKD  IGR I+L+TSH L FRELEEV+EE IQIPPPRGKRSPRA
Subjt:  MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDE-IGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRA

Query:  AKRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDI
        AKRRPK+TTTLIDEFLDEDS +RHKFFPD KTS+DP  TGNDSM+YYPGRVWLDTEGNPIQAHGGG+L+DE SE++YWYGEYKDGPTYHAHKKGAARVDI
Subjt:  AKRRPKKTTTLIDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDI

Query:  IGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFK
        IGVGCYSSKDLWTWKNEGIVLTA ETNETHDLHKSNVLERPKVIYNS+TGKYVMWMH+DDANYTKASVG+AISDYPTGPFDYLYSKRPHGFDSRDMTIFK
Subjt:  IGVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFK

Query:  DDDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLAT
        DDDGTAYL+YSS DNSELHIGPLTEDYLDVTNV RRI IG HREAPALFKHQGTYYM+TSGCTGWAPNEAL HA+ES++GPWETM NPCIGGNKMFRLAT
Subjt:  DDDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLAT

Query:  FFAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK
        FFAQSTFVLPLPS+PSLFIFMADRWNPA+LRDSRY+WLPLMVGGLVDEPLDYNFGFPLW RVSIYWHRKW+LP+G +L K
Subjt:  FFAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G49880.1 glycosyl hydrolase family protein 435.0e-20771.18Show/hide
Query:  MRNKY-RKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQ-TSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLID
        M+NK+ +K+T LRC+         ++ ++VGC+ ++H+    +R   +   +  Q   HH   RELE V+EENI +PPPR KRSPRA KR+PK  TTL++
Subjt:  MRNKY-RKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQ-TSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTLID

Query:  EFLDEDSQIRHKFFPDHKTSVDPM--NTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDL
        EFLDE+SQIRH FFPD K++  P   +T + S YY+PGR+W DTEGNPIQAHGGGILFD+ S+ YYWYGEYKDGPTY +HKKGAARVDIIGVGCYSSKDL
Subjt:  EFLDEDSQIRHKFFPDHKTSVDPM--NTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDL

Query:  WTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLIYS
        WTWKNEG+VL A ET+ETHDLHKSNVLERPKVIYNS TGKYVMWMHIDDANYTKASVGVAISD PTGPFDYLYS+ PHGFDSRDMT++KDDD  AYLIYS
Subjt:  WTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLIYS

Query:  SDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPL
        S+DNS LHIGPLTE+YLDV  V++RI +G HREAPA+FKHQ TYYM+TSGCTGWAPNEALAHA+ES++GPWET+GNPC+GGN +FR  TFFAQSTFV+PL
Subjt:  SDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPL

Query:  PSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGR
        P  P +FIFMADRWNPADLRDSRY+WLPL+VGG  D PL+Y+FGFP+W RVS+YWHR+WRLP  R
Subjt:  PSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGR

AT5G67540.1 Arabinanase/levansucrase/invertase1.7e-19970.47Show/hide
Query:  YRKSTTLRCNAGS-RCFISVIIGSLVGCILILHIFSPINRKD-EIGRGI---ELQTSHHLR---FRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTL
        Y  S  LR  AG  R  +  I+ ++VG  L+ H+ S  +RKD  I + +   +LQ  HHL     REL  V+EE +++PPPR KRSPR +KRR +K   L
Subjt:  YRKSTTLRCNAGS-RCFISVIIGSLVGCILILHIFSPINRKD-EIGRGI---ELQTSHHLR---FRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTL

Query:  IDEFLDEDSQIRHKFFPDHKTSV--DPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSK
        ++EFLD+ S IRH FFP  KT+      + GN++ YY+PG++W+DT+GNPIQAHGGGIL D +S +YYWYGEYKDGPTYHAHKKG ARVDIIGVGCYSSK
Subjt:  IDEFLDEDSQIRHKFFPDHKTSV--DPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSK

Query:  DLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLI
        DLWTWKNEGIVL A ETN+THDLHKSNVLERPKVIYN +T KYVMWMHIDDANYTKASVGVAIS+ PTGPF+YLYSKRPHGFDSRDMT+FKDDDG AYLI
Subjt:  DLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLI

Query:  YSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVL
        YSS+ NS LHIGPLTEDYLDVT V++R+ +G HREAPA+FKHQ  YYMVTS CTGWAPNEALAHA+ES++GPWE +GNPCIGGNK+FRL TFFAQST+V+
Subjt:  YSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVL

Query:  PLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLP
        PLP  P  FIFMADRWNPADLRDSRYVWLPL++GG  D+PL++NFGFP W RVSIYWH KWRLP
Subjt:  PLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLP

AT5G67540.2 Arabinanase/levansucrase/invertase3.4e-20370.55Show/hide
Query:  MKMRNKY-RKSTTLRCN--AGSRCFISVIIGSLVGCILILHIFSPINRKD-EIGRGI---ELQTSHHLR---FRELEEVDEENIQIPPPRGKRSPRAAKR
        MK  NKY +KST+L CN   G R  +  I+ ++VG  L+ H+ S  +RKD  I + +   +LQ  HHL     REL  V+EE +++PPPR KRSPR +KR
Subjt:  MKMRNKY-RKSTTLRCN--AGSRCFISVIIGSLVGCILILHIFSPINRKD-EIGRGI---ELQTSHHLR---FRELEEVDEENIQIPPPRGKRSPRAAKR

Query:  RPKKTTTLIDEFLDEDSQIRHKFFPDHKTSV--DPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII
        R +K   L++EFLD+ S IRH FFP  KT+      + GN++ YY+PG++W+DT+GNPIQAHGGGIL D +S +YYWYGEYKDGPTYHAHKKG ARVDII
Subjt:  RPKKTTTLIDEFLDEDSQIRHKFFPDHKTSV--DPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDII

Query:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD
        GVGCYSSKDLWTWKNEGIVL A ETN+THDLHKSNVLERPKVIYN +T KYVMWMHIDDANYTKASVGVAIS+ PTGPF+YLYSKRPHGFDSRDMT+FKD
Subjt:  GVGCYSSKDLWTWKNEGIVLTAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKD

Query:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF
        DDG AYLIYSS+ NS LHIGPLTEDYLDVT V++R+ +G HREAPA+FKHQ  YYMVTS CTGWAPNEALAHA+ES++GPWE +GNPCIGGNK+FRL TF
Subjt:  DDGTAYLIYSSDDNSELHIGPLTEDYLDVTNVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATF

Query:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLP
        FAQST+V+PLP  P  FIFMADRWNPADLRDSRYVWLPL++GG  D+PL++NFGFP W RVSIYWH KWRLP
Subjt:  FAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLMVGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGCATTATATTGGAGATAACAAGGAGGAGGAAATGAAGATGAGGAACAAATACAGGAAATCAACCACTTTACGTTGCAATGCAGGGAGTAGATGTTTCATATCTGT
GATAATAGGGAGTCTAGTGGGGTGTATTCTTATACTACATATATTTTCTCCTATAAACCGCAAGGATGAGATAGGTCGGGGCATCGAACTTCAAACAAGTCACCACCTTC
GCTTCCGTGAACTTGAAGAGGTAGATGAGGAAAACATTCAAATTCCCCCTCCAAGGGGTAAGAGATCCCCACGTGCAGCGAAGCGAAGACCAAAGAAAACAACCACGCTA
ATTGATGAATTTCTTGATGAAGATTCACAGATTAGACACAAGTTCTTTCCTGATCATAAAACTTCCGTTGATCCAATGAACACAGGAAATGATAGTATGTACTATTATCC
AGGGAGAGTTTGGCTGGATACTGAAGGGAATCCCATTCAAGCTCATGGAGGTGGAATTTTGTTCGATGAAAGATCTGAATCATACTATTGGTATGGAGAGTATAAAGATG
GCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTCGACATTATAGGAGTTGGTTGCTACTCCTCCAAAGACCTGTGGACATGGAAAAATGAAGGCATTGTTTTG
ACAGCGGTAGAAACAAATGAGACCCATGATCTTCACAAATCCAATGTACTCGAGAGGCCTAAAGTTATCTACAATTCGAGGACGGGAAAATACGTAATGTGGATGCATAT
AGACGATGCGAATTATACAAAAGCTTCCGTGGGTGTTGCCATCAGCGATTACCCCACCGGTCCATTCGATTATCTTTACAGCAAAAGACCCCATGGATTTGACAGCAGAG
ACATGACAATCTTTAAAGATGATGACGGTACAGCGTATCTCATTTACTCATCTGATGACAATAGTGAACTTCATATAGGGCCTCTCACAGAAGATTATCTCGACGTGACC
AACGTCGTGAGAAGGATTTTCATTGGCTACCACCGGGAAGCGCCAGCTTTGTTCAAACACCAGGGAACTTACTATATGGTCACGTCGGGATGCACAGGATGGGCACCGAA
TGAGGCACTGGCACACGCATCAGAGTCGATGTTGGGTCCATGGGAGACGATGGGAAATCCATGTATAGGAGGAAACAAGATGTTTCGACTAGCTACATTCTTTGCCCAGA
GCACATTTGTTCTTCCCCTACCTTCACATCCTAGCTTGTTCATCTTCATGGCAGACCGATGGAACCCCGCAGATCTTAGAGACTCAAGATACGTTTGGTTGCCGTTGATG
GTCGGAGGACTTGTCGATGAACCACTCGACTACAATTTCGGGTTCCCTTTGTGGCCAAGAGTGTCCATATATTGGCATAGAAAGTGGAGGCTTCCTCAGGGCAGGAGTCT
GTCGAAA
mRNA sequenceShow/hide mRNA sequence
ATGCTGCATTATATTGGAGATAACAAGGAGGAGGAAATGAAGATGAGGAACAAATACAGGAAATCAACCACTTTACGTTGCAATGCAGGGAGTAGATGTTTCATATCTGT
GATAATAGGGAGTCTAGTGGGGTGTATTCTTATACTACATATATTTTCTCCTATAAACCGCAAGGATGAGATAGGTCGGGGCATCGAACTTCAAACAAGTCACCACCTTC
GCTTCCGTGAACTTGAAGAGGTAGATGAGGAAAACATTCAAATTCCCCCTCCAAGGGGTAAGAGATCCCCACGTGCAGCGAAGCGAAGACCAAAGAAAACAACCACGCTA
ATTGATGAATTTCTTGATGAAGATTCACAGATTAGACACAAGTTCTTTCCTGATCATAAAACTTCCGTTGATCCAATGAACACAGGAAATGATAGTATGTACTATTATCC
AGGGAGAGTTTGGCTGGATACTGAAGGGAATCCCATTCAAGCTCATGGAGGTGGAATTTTGTTCGATGAAAGATCTGAATCATACTATTGGTATGGAGAGTATAAAGATG
GCCCCACCTACCATGCTCACAAAAAGGGAGCTGCACGGGTCGACATTATAGGAGTTGGTTGCTACTCCTCCAAAGACCTGTGGACATGGAAAAATGAAGGCATTGTTTTG
ACAGCGGTAGAAACAAATGAGACCCATGATCTTCACAAATCCAATGTACTCGAGAGGCCTAAAGTTATCTACAATTCGAGGACGGGAAAATACGTAATGTGGATGCATAT
AGACGATGCGAATTATACAAAAGCTTCCGTGGGTGTTGCCATCAGCGATTACCCCACCGGTCCATTCGATTATCTTTACAGCAAAAGACCCCATGGATTTGACAGCAGAG
ACATGACAATCTTTAAAGATGATGACGGTACAGCGTATCTCATTTACTCATCTGATGACAATAGTGAACTTCATATAGGGCCTCTCACAGAAGATTATCTCGACGTGACC
AACGTCGTGAGAAGGATTTTCATTGGCTACCACCGGGAAGCGCCAGCTTTGTTCAAACACCAGGGAACTTACTATATGGTCACGTCGGGATGCACAGGATGGGCACCGAA
TGAGGCACTGGCACACGCATCAGAGTCGATGTTGGGTCCATGGGAGACGATGGGAAATCCATGTATAGGAGGAAACAAGATGTTTCGACTAGCTACATTCTTTGCCCAGA
GCACATTTGTTCTTCCCCTACCTTCACATCCTAGCTTGTTCATCTTCATGGCAGACCGATGGAACCCCGCAGATCTTAGAGACTCAAGATACGTTTGGTTGCCGTTGATG
GTCGGAGGACTTGTCGATGAACCACTCGACTACAATTTCGGGTTCCCTTTGTGGCCAAGAGTGTCCATATATTGGCATAGAAAGTGGAGGCTTCCTCAGGGCAGGAGTCT
GTCGAAA
Protein sequenceShow/hide protein sequence
MLHYIGDNKEEEMKMRNKYRKSTTLRCNAGSRCFISVIIGSLVGCILILHIFSPINRKDEIGRGIELQTSHHLRFRELEEVDEENIQIPPPRGKRSPRAAKRRPKKTTTL
IDEFLDEDSQIRHKFFPDHKTSVDPMNTGNDSMYYYPGRVWLDTEGNPIQAHGGGILFDERSESYYWYGEYKDGPTYHAHKKGAARVDIIGVGCYSSKDLWTWKNEGIVL
TAVETNETHDLHKSNVLERPKVIYNSRTGKYVMWMHIDDANYTKASVGVAISDYPTGPFDYLYSKRPHGFDSRDMTIFKDDDGTAYLIYSSDDNSELHIGPLTEDYLDVT
NVVRRIFIGYHREAPALFKHQGTYYMVTSGCTGWAPNEALAHASESMLGPWETMGNPCIGGNKMFRLATFFAQSTFVLPLPSHPSLFIFMADRWNPADLRDSRYVWLPLM
VGGLVDEPLDYNFGFPLWPRVSIYWHRKWRLPQGRSLSK