; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cp4.1LG09g03210 (gene) of Cucurbita pepo (MU-CU-16) v4.1 genome

Gene IDCp4.1LG09g03210
OrganismCucurbita pepo var. pepo MU-CU-16 (Cucurbita pepo (MU-CU-16) v4.1)
DescriptionProtein of unknown function (DUF789)
Genome locationCp4.1LG09:1898471..1909841
RNA-Seq ExpressionCp4.1LG09g03210
SyntenyCp4.1LG09g03210
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573941.1 hypothetical protein SDJN03_27828, partial [Cucurbita argyrosperma subsp. sororia]8.40e-29789.21Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFG VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADK            
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
                     ISDLASRFP+LKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT MGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL

Query:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFS+RDLAPY
Subjt:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

KAG7013005.1 hypothetical protein SDJN02_25760 [Cucurbita argyrosperma subsp. argyrosperma]0.094.49Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFG VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSF GFENIFG
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
        FTASHFSLFFCIQISDLASRFP+LKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT MGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL

Query:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFS+RDLAPY
Subjt:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

XP_022945839.1 uncharacterized protein LOC111449961 isoform X2 [Cucurbita moschata]5.42e-29388.33Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFG VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPI EAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLG LEDCSSDEAESCNSQGCLLFEYLERD PYSREPLADK            
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
                     ISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYH LHT MGG QSPQLPFVAYPCKTDAKKVPLRIFGL
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL

Query:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLR+LQVNHPDFVFFSRRDLAPY
Subjt:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

XP_022968120.1 uncharacterized protein LOC111467452 isoform X2 [Cucurbita maxima]8.04e-29588.33Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFG VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKP+PQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQP+FVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERD PYSREPLADK            
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
                     ISDLASRFPQLKT+RSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT MGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL

Query:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        ASYKFKGSSLWMRNGGVEHQLANKLSREADKWL+ELQVNHPDFVFF RRDLAPY
Subjt:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

XP_023541256.1 uncharacterized protein LOC111801477 [Cucurbita pepo subsp. pepo]1.30e-30090.09Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADK            
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
                     ISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL

Query:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
Subjt:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

TrEMBL top hitse value%identityAlignment
A0A6J1DC61 uncharacterized protein LOC1110187378.72e-24776.79Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPV------PQPAVSPLSNLERFLQSVT
        MLGAGVRFG  RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPV------PQPAVSPLSNLERFLQSVT

Query:  PSVPAQFLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQW
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KP                    +W
Subjt:  PSVPAQFLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQW

Query:  GEETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPG
        GEE+DSDYRDSSSDGSSDSETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERDLPYSREPLADK      
Subjt:  GEETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPG

Query:  FENIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTD-AKKV
                           I DLASRFPQLKT+RSCDLLP SWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT + G QS Q+PFVAYPCKTD A+K+
Subjt:  FENIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTD-AKKV

Query:  PLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR LQVNHPDF+FFSRRD  PY
Subjt:  PLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

A0A6J1G225 uncharacterized protein LOC111449961 isoform X22.62e-29388.33Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFG VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPI EAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLG LEDCSSDEAESCNSQGCLLFEYLERD PYSREPLADK            
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
                     ISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYH LHT MGG QSPQLPFVAYPCKTDAKKVPLRIFGL
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL

Query:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLR+LQVNHPDFVFFSRRDLAPY
Subjt:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

A0A6J1G242 uncharacterized protein LOC111449961 isoform X13.99e-29187.75Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFG VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPI EAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLG LEDCSSDEAESCNSQGCLLFEYLERD PYSREPLADK            
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQ---LPFVAYPCKTDAKKVPLRI
                     ISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYH LHT MGG QSPQ   LPFVAYPCKTDAKKVPLRI
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQ---LPFVAYPCKTDAKKVPLRI

Query:  FGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        FGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLR+LQVNHPDFVFFSRRDLAPY
Subjt:  FGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

A0A6J1HWA7 uncharacterized protein LOC111467452 isoform X23.89e-29588.33Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFG VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKP+PQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQP+FVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERD PYSREPLADK            
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
                     ISDLASRFPQLKT+RSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT MGGLQSPQLPFVAYPCKTDAKKVPLRIFGL
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGL

Query:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        ASYKFKGSSLWMRNGGVEHQLANKLSREADKWL+ELQVNHPDFVFF RRDLAPY
Subjt:  ASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

A0A6J1HYQ4 uncharacterized protein LOC111467452 isoform X15.93e-29387.75Show/hide
Query:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFG VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKP+PQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS
        FLSKSALRGWKTSDSERQP+FVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKP                    QWGEETDS
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDS

Query:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG
        DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERD PYSREPLADK            
Subjt:  DYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFG

Query:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGG---LQSPQLPFVAYPCKTDAKKVPLRI
                     ISDLASRFPQLKT+RSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT MGG   LQSPQLPFVAYPCKTDAKKVPLRI
Subjt:  FTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGG---LQSPQLPFVAYPCKTDAKKVPLRI

Query:  FGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY
        FGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL+ELQVNHPDFVFF RRDLAPY
Subjt:  FGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRRDLAPY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)8.4e-9553.8Show/hide
Query:  SNLERFLQSVTPSVPAQFLSKSALRGWKTSDSERQ-PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAI
        SN+ERFL SVTPSVPA +LSK+ +R    SD E Q PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y    +   S   +    
Subjt:  SNLERFLQSVTPSVPAQFLSKSALRGWKTSDSERQ-PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAI

Query:  FGSDSVLLYEGQWGEETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYS
                   + GEE++SD+RDSSS+GSS SE++R + + +E        I+A  RMD+LSLR +H    ED SSD+ E  +SQG L+FEYLERDLPY 
Subjt:  FGSDSVLLYEGQWGEETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYS

Query:  REPLADKASSFPGFENIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPF
        REP ADK                         +SDLASRFP+LKTLRSCDLLP SW SVAWYPIY+IPTG TLKDLDACFLTYHSLHT   G        
Subjt:  REPLADKASSFPGFENIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPF

Query:  VAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRR
             +   +K+ L +FGLASYK +G S+W   GG  HQLAN L + AD WLR  QVNHPDF+FF RR
Subjt:  VAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRR

AT2G01260.1 Protein of unknown function (DUF789)1.6e-9847.46Show/hide
Query:  MLGAGVRFGCVR-GEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPA
        MLGAG +    R G+D FY S++ R+   +++ D+L R Q   S  PS A      Q                    +P+    SNL+RFL+SVTPSVPA
Subjt:  MLGAGVRFGCVR-GEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPA

Query:  QFLSKSALRGWKTSDSERQ--PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGE
        QFLSK+ LR  +  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y    +   S  ++               + G+
Subjt:  QFLSKSALRGWKTSDSERQ--PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGE

Query:  ETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFE
         +DSD+RDSSSD SSDS+++R                    R+D +SLRDQH    ED SSD+ E   SQG L+FEYLERDLPY REP ADK        
Subjt:  ETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFE

Query:  NIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLR
                         + DLA++FP+L TLRSCDLL  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT+ GG  S Q   +  P   +++K+ L 
Subjt:  NIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLR

Query:  IFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRR
        +FGLASYKF+G SLW   GG EHQL N L + ADKWL    V+HPDF+FF RR
Subjt:  IFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFSRR

AT2G01260.2 Protein of unknown function (DUF789)2.7e-7746.68Show/hide
Query:  MLGAGVRFGCVR-GEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPA
        MLGAG +    R G+D FY S++ R+   +++ D+L R Q   S  PS A      Q                    +P+    SNL+RFL+SVTPSVPA
Subjt:  MLGAGVRFGCVR-GEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPA

Query:  QFLSKSALRGWKTSDSERQ--PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGE
        QFLSK+ LR  +  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y    +   S  ++               + G+
Subjt:  QFLSKSALRGWKTSDSERQ--PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGE

Query:  ETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFE
         +DSD+RDSSSD SSDS+++R                    R+D +SLRDQH    ED SSD+ E   SQG L+FEYLERDLPY REP ADK        
Subjt:  ETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFE

Query:  NIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGG
                         + DLA++FP+L TLRSCDLL  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT+ GG
Subjt:  NIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGG

AT4G16100.1 Protein of unknown function (DUF789)3.8e-7140.22Show/hide
Query:  VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPL-------SNLERFLQSVTPSVPAQFLS
        +RGE+RFY+    RK    R+  RL   +       +  + D  ++    E         +   VP    S         SNL RFL   TP V  Q L 
Subjt:  VRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPL-------SNLERFLQSVTPSVPAQFLS

Query:  KSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDSDY-
         ++ +GW+T + E +PYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY   S    +                   + GEE+D D  
Subjt:  KSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDSDY-

Query:  RDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAE-SCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFGF
        RD SSDGS+D         CRE   +          + R SL ++        SSDE+E S NS G L+FEYLE  +P+ REPL DK             
Subjt:  RDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAE-SCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFGF

Query:  TASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGLA
                    IS+L+S+FP L+T RSCDL P SW+SVAWYPIYRIP GQ+L++LDACFLT+HSL T   G  + +    +      + K+PL  FGLA
Subjt:  TASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGLA

Query:  SYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFF
        SYKFK S     +   E+Q    L R A++WLR L+V  PDF  F
Subjt:  SYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFF

AT5G49220.1 Protein of unknown function (DUF789)1.4e-6837.87Show/hide
Query:  GAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEH----------------ASATPSCAVKDVSV---QYPITEAGDRVVSDEATKPVPQPAV-SP
        G  +    +RGE+RFY+    R+     Q  +  R ++                 A+  P    K + V   +  +  +G  V +  +        V S 
Subjt:  GAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEH----------------ASATPSCAVKDVSV---QYPITEAGDRVVSDEATKPVPQPAV-SP

Query:  LSNLERFLQSVTPSVPAQFLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQ
         SNL+RFL+  TP VPA+     +    KT +S+   YFVL DLWE+F EWSAYGAGV     PL ++  D  VQYYVPYLSGIQLY  +   KP     
Subjt:  LSNLERFLQSVTPSVPAQFLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQ

Query:  EFAIFGSDSVLLYEGQWGEETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERD
        +                         + SS+GSS+S T               P   +   ++R+SL+DQ +      SS EAE  N QG LLFEYLE +
Subjt:  EFAIFGSDSVLLYEGQWGEETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERD

Query:  LPYSREPLADKASSFPGFENIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSP
         P+ REPLA+K                         ISDLASR P+L T RSCDLLP SW+SV+WYPIYRIP G TL++LDACFLT+HSL T      +P
Subjt:  LPYSREPLADKASSFPGFENIFGFTASHFSLFFCIQISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSP

Query:  QLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFS
            +       + K+PL  FGLASYK K  S+W +N   E Q    L + ADKWL+ LQV+HPD+ FF+
Subjt:  QLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNHPDFVFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGAGCAGGTGTTCGGTTTGGTTGCGTTAGAGGTGAGGACCGGTTTTACGATTCATCGAGAGCCAGGAAGGGCCTTCTCAGCCGTCAAAATGATAGGCTCTGTAG
ACCTCAAGAACACGCTTCAGCTACTCCATCTTGCGCGGTTAAGGATGTTTCCGTGCAGTATCCGATTACAGAGGCTGGGGACCGTGTCGTCTCTGATGAAGCTACTAAAC
CAGTTCCCCAGCCGGCTGTTTCTCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCTTCTGTGCCTGCTCAGTTTCTCTCTAAGAGTGCGTTGAGAGGTTGG
AAGACGAGCGATTCGGAGAGGCAGCCTTACTTTGTGCTTGGTGATTTGTGGGAAACTTTCAAGGAGTGGAGCGCTTATGGCGCTGGAGTGCCTCTGTTATTGAATAACAC
TGATGGTGTGGTTCAGTATTATGTCCCGTATTTGTCTGGCATACAATTGTATGGCACGGAATCGTCTACAAAGCCAAGCTTCTCAACTCAGGAATTTGCAATTTTTGGTT
CTGATTCTGTTTTACTTTATGAGGGGCAATGGGGTGAGGAAACTGATAGTGACTACAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACAAAGAGAAGAATAAAA
CACTGTAGAGAACCACCCCACCATAATGATCCGTCTATCACAGCTCCTCTTAGAATGGATAGATTGTCTTTGAGGGACCAGCATTTGGGACGTCTCGAGGACTGCTCCAG
CGATGAGGCTGAATCTTGCAATTCTCAAGGTTGCCTTCTATTTGAGTATCTTGAAAGAGACCTACCGTATTCACGCGAACCTTTGGCAGACAAGGCAAGTAGTTTTCCGG
GTTTTGAGAATATATTCGGGTTTACAGCTTCTCATTTTTCTTTGTTTTTTTGCATACAGATATCGGACCTTGCCTCTCGTTTCCCTCAGCTGAAAACATTGAGAAGCTGT
GACCTACTACCATGTAGTTGGATATCTGTGGCATGGTACCCAATTTACAGGATACCAACTGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACATACCATTCTCT
ACATACTACAATGGGAGGCCTTCAAAGCCCTCAATTGCCATTTGTGGCATATCCTTGTAAGACGGATGCCAAAAAAGTTCCTTTAAGAATTTTTGGACTTGCTTCATACA
AGTTTAAAGGGTCATCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAATTGGCAAACAAGCTCTCGCGGGAAGCTGATAAGTGGTTAAGAGAACTCCAGGTCAATCAC
CCAGATTTCGTATTCTTCAGCCGACGAGATTTAGCACCTTACTGA
mRNA sequenceShow/hide mRNA sequence
CAAGGGCTACCGAAAATCTCCTACTTCGCGCCCCTCGAATCATGAGGGAGCCCACTTATTGCAATTTGCAGACTCTTCGTGTAATTTAGTCTGTTAGGTAGAAGTTTTCA
CATCTCAATAAAGAATGTTTTGTTTCCTTCTCCAACTGACGTGGGATCTCACTATCCACTCCTTTGGGGCCAACGTCCTCGTTGACACTTGTTTCTTGAAGATTATCAGA
TCGCACCGCCATTTCTCTTCCACTTCCGCTTCGTCGCCGTCGTCGTCGTCGTCGTTGACTTTGGTTAGGGATTGTTCTTGCAACTTGCAGACTAGTGTTCCTTAGTTGGT
GGTAGACTGTCGAGATGTTAGGAGCAGGTGTTCGGTTTGGTTGCGTTAGAGGTGAGGACCGGTTTTACGATTCATCGAGAGCCAGGAAGGGCCTTCTCAGCCGTCAAAAT
GATAGGCTCTGTAGACCTCAAGAACACGCTTCAGCTACTCCATCTTGCGCGGTTAAGGATGTTTCCGTGCAGTATCCGATTACAGAGGCTGGGGACCGTGTCGTCTCTGA
TGAAGCTACTAAACCAGTTCCCCAGCCGGCTGTTTCTCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCTTCTGTGCCTGCTCAGTTTCTCTCTAAGAGTG
CGTTGAGAGGTTGGAAGACGAGCGATTCGGAGAGGCAGCCTTACTTTGTGCTTGGTGATTTGTGGGAAACTTTCAAGGAGTGGAGCGCTTATGGCGCTGGAGTGCCTCTG
TTATTGAATAACACTGATGGTGTGGTTCAGTATTATGTCCCGTATTTGTCTGGCATACAATTGTATGGCACGGAATCGTCTACAAAGCCAAGCTTCTCAACTCAGGAATT
TGCAATTTTTGGTTCTGATTCTGTTTTACTTTATGAGGGGCAATGGGGTGAGGAAACTGATAGTGACTACAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACAA
AGAGAAGAATAAAACACTGTAGAGAACCACCCCACCATAATGATCCGTCTATCACAGCTCCTCTTAGAATGGATAGATTGTCTTTGAGGGACCAGCATTTGGGACGTCTC
GAGGACTGCTCCAGCGATGAGGCTGAATCTTGCAATTCTCAAGGTTGCCTTCTATTTGAGTATCTTGAAAGAGACCTACCGTATTCACGCGAACCTTTGGCAGACAAGGC
AAGTAGTTTTCCGGGTTTTGAGAATATATTCGGGTTTACAGCTTCTCATTTTTCTTTGTTTTTTTGCATACAGATATCGGACCTTGCCTCTCGTTTCCCTCAGCTGAAAA
CATTGAGAAGCTGTGACCTACTACCATGTAGTTGGATATCTGTGGCATGGTACCCAATTTACAGGATACCAACTGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTC
ACATACCATTCTCTACATACTACAATGGGAGGCCTTCAAAGCCCTCAATTGCCATTTGTGGCATATCCTTGTAAGACGGATGCCAAAAAAGTTCCTTTAAGAATTTTTGG
ACTTGCTTCATACAAGTTTAAAGGGTCATCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAATTGGCAAACAAGCTCTCGCGGGAAGCTGATAAGTGGTTAAGAGAAC
TCCAGGTCAATCACCCAGATTTCGTATTCTTCAGCCGACGAGATTTAGCACCTTACTGATACGATACTCTTACAAACTAGAAAACGAAAGTCGACAAGTCGTGGCCCTAC
AAAAGAACCCCGTAATTAGTTGGATTAAGCTGTATGCCTGTTGAAGGAGTGGTACACTCGTTTGCTTTCGTCAAGGTGGGGGATTGAAGGAAGGTTAAGGAATGTGATAG
GTTGCAAAATACTGAAGCTTTAGTTGGTTGTTGGTGTGCCGTGCCTGATGTATTAACAAAGACCGATAGAAAGTGAAGTCTAAAGATGTAAGAAGAGAACATAGCTGTAA
GACTCGCTGGTTTAGCAAGGTATGGTCGTCATGTGAAGATGAAAGAATGTGAAAGCAAGAGGATCCTTACTCGTAAGCAATGCTCAATGGCCAAGTGTTCTGTCCTGATC
TGTAGAAATGGTAAGTGTACGGTAAGAAAGCCTCTAGTTTCTGCTTTACAGAGGGCAGGGGATCCACTAATGGAAACACAGCGTTGGCTTGCTTGCCTGGTGTTCATAGG
ACCTGTAACAAAGTTTCTGATTCCACCATACCAATGGCGGAGTTGATGCTGTGGAAGAAGAAGAGAGAATCGGCGGCGATGTTGGCCGGAATCAGCGGCGTTTGGTTGCT
GCTTGAGGTCTTGGACTACCACTTTGTTACTCTGCTTTGCTATCTACTCATCATTACTATGCTTCTTCTCTTCTCTTGGAACAAATTGGCTCTTCTCATCAACAGGGCTC
CACCGAATGTGCATGATTTTGAGATCTCAAACGCTACTCTCACTCGTTTCTTGGATACATTCAACTGGTTGGTCCACCACTTCTTTCAAATTTCAACTGGCCAAAACTTC
AAGCTATTTGCAATGGTATTGGGTAGCATTTGGTTGGTATCACTCATTGGAGAGCTAACAAACTCATTGAATCTCATATACATTGTGTTTTTGAGCTTACAAACAGTTCC
AATTGTGTTGGACAAGTACGAGGAGGAGATTCATAAGCTTGTCTCCAACCTCAAGTCTTCCATGGACACCTTCTTCCACACCTTACACTCCAATTTTCTCACCAAAATCC
CAAGAAGAACCCATATCAAACAAACTTAGTTCCTAATACCCTATATTTCAATCACTTATTTCGTCCCGAACCCGTCTTAGATTCAATCAAAATCGAGTTGTATTCCATGT
TTTGAAATACGTGTTTGATAGTAGATTTAGAAATCCATTAATGTATCAACCAATTTGAACATATTTTAACTGATTAATATTTTAACAATGTATCTTCTACTAAAAGGTTA
GACGTTCAAAGTCATTAATGTATCAATCAATTTGAACCTATCTGATTATTTC
Protein sequenceShow/hide protein sequence
MLGAGVRFGCVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPITEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQFLSKSALRGW
KTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPSFSTQEFAIFGSDSVLLYEGQWGEETDSDYRDSSSDGSSDSETKRRIK
HCREPPHHNDPSITAPLRMDRLSLRDQHLGRLEDCSSDEAESCNSQGCLLFEYLERDLPYSREPLADKASSFPGFENIFGFTASHFSLFFCIQISDLASRFPQLKTLRSC
DLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTTMGGLQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRELQVNH
PDFVFFSRRDLAPY