CuGenDBv2

CmoCh18G010090 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh18G010090
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF789)
Genome location: Cmo_Chr18:10968926..10971658
RNA-Seq Expression: CmoCh18G010090
Synteny: CmoCh18G010090
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
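
The genome location field packs the sequence name and the start and end coordinates into one string. A minimal Python sketch of splitting it and computing the span follows. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption; this record does not state the convention.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("Cmo_Chr18:10968926..10971658")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene model spans 2733 bp on Cmo_Chr18.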


Homology
GenBank top hits (columns: hit, e-value, % identity, alignment)
KAG6573941.1 hypothetical protein SDJN03_27828, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 2.1e-235 | identity: 97.34%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPI EAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLG LEDCSSDEAESCNSQGCLLFEYLERD PYSREPLADKISDLASRFP+LKTLRSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYH LHTAMGG    QSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLR+LQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFFS+RDLAPY
Subjt:  DFVFFSRRDLAPY

XP_022945838.1 uncharacterized protein LOC111449961 isoform X1 [Cucurbita moschata] | e-value: 1.1e-244 | identity: 100%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFFSRRDLAPY
Subjt:  DFVFFSRRDLAPY

XP_022945839.1 uncharacterized protein LOC111449961 isoform X2 [Cucurbita moschata] | e-value: 3.3e-241 | identity: 99.27%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYHFLHTAMGG   PQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFFSRRDLAPY
Subjt:  DFVFFSRRDLAPY

XP_022968119.1 uncharacterized protein LOC111467452 isoform X1 [Cucurbita maxima] | e-value: 2.9e-237 | identity: 97.09%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPI EAGDRVVSDEATKP+PQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQP+FVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLG LEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKT+RSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYH LHTAMGG  S QSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL++LQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFF RRDLAPY
Subjt:  DFVFFSRRDLAPY

XP_022968120.1 uncharacterized protein LOC111467452 isoform X2 [Cucurbita maxima] | e-value: 2.1e-235 | identity: 96.85%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPI EAGDRVVSDEATKP+PQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQP+FVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLG LEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKT+RSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYH LHTAMGG    QSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL++LQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFF RRDLAPY
Subjt:  DFVFFSRRDLAPY
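
In these BLAST-style alignment blocks, the unlabeled middle row repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The following Python sketch tallies those column types from midline rows. The function names are hypothetical, and it assumes each midline string has already been sliced to the aligned columns with its internal and trailing spaces preserved. The percentage it yields is computed over all columns passed in, which may differ from how the database derives its reported % identity.

```python
def midline_counts(midline):
    """Count identical, conservative ('+') and remaining columns in a midline row."""
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = midline.count("+")
    other = len(midline) - identical - positive
    return identical, positive, other

def percent_identity(midlines):
    """Percent of identical columns over all columns in the given midline rows."""
    identical = columns = 0
    for row in midlines:
        ident, _, _ = midline_counts(row)
        identical += ident
        columns += len(row)
    return 100.0 * identical / columns if columns else 0.0

# Midline of the final block of the XP_022968120.1 alignment.
print(midline_counts("DFVFF RRDLAPY"))
# Midline of the final block of the KAG6573941.1 alignment.
print(midline_counts("DFVFFS+RDLAPY"))
```

The first call reports 12 identical columns and one blank column; the second reports 12 identical columns and one conservative substitution.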

TrEMBL top hits (columns: hit, e-value, % identity, alignment)
A0A6J1DC61 uncharacterized protein LOC111018737 | e-value: 4.0e-200 | identity: 84.52%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPV------PQPAVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PI     RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPV------PQPAVSPLSNLERFLQSVT

Query:  PSVPAQFLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDS

Query:  ETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADKI DLASRFPQLKT+RSCDLLP SWI
Subjt:  ETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTD-AKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLR
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+ G   PQS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTD-AKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLR

Query:  DLQVNHPDFVFFSRRDLAPY
         LQVNHPDF+FFSRRD  PY
Subjt:  DLQVNHPDFVFFSRRDLAPY

A0A6J1G225 uncharacterized protein LOC111449961 isoform X2 | e-value: 1.6e-241 | identity: 99.27%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYHFLHTAMGG   PQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFFSRRDLAPY
Subjt:  DFVFFSRRDLAPY

A0A6J1G242 uncharacterized protein LOC111449961 isoform X1 | e-value: 5.3e-245 | identity: 100%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFFSRRDLAPY
Subjt:  DFVFFSRRDLAPY

A0A6J1HWA7 uncharacterized protein LOC111467452 isoform X2 | e-value: 1.0e-235 | identity: 96.85%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPI EAGDRVVSDEATKP+PQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQP+FVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLG LEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKT+RSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYH LHTAMGG    QSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL++LQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFF RRDLAPY
Subjt:  DFVFFSRRDLAPY

A0A6J1HYQ4 uncharacterized protein LOC111467452 isoform X1 | e-value: 1.4e-237 | identity: 97.09%
Query:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ
        MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPI EAGDRVVSDEATKP+PQPAVSPLSNLERFLQSVTPSVPAQ
Subjt:  MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQ

Query:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
        FLSKSALRGWKTSDSERQP+FVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI
Subjt:  FLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRI

Query:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP
        KHCREPPHHNDPSITAPLRMDRLSLRDQHLG LEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKT+RSCDLLPCSWISVAWYP
Subjt:  KHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYP

Query:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP
        IYRIPTGQTLKDLDACFLTYH LHTAMGG  S QSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL++LQVNHP
Subjt:  IYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHP

Query:  DFVFFSRRDLAPY
        DFVFF RRDLAPY
Subjt:  DFVFFSRRDLAPY

SwissProt top hits (columns: hit, e-value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e-value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) | e-value: 2.7e-100 | identity: 60.73%
Query:  SNLERFLQSVTPSVPAQFLSKSALRGWKTSDSERQ-PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GTESSTKPRQWGE
        SN+ERFL SVTPSVPA +LSK+ +R    SD E Q PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y       SS + R+ GE
Subjt:  SNLERFLQSVTPSVPAQFLSKSALRGWKTSDSERQ-PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GTESSTKPRQWGE

Query:  ETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRF
        E++SD+RDSSS+GSS SE++R + + +E        I+A  RMD+LSLR +   H ED SSD+ E  +SQG L+FEYLERD PY REP ADK+SDLASRF
Subjt:  ETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRF

Query:  PQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVE
        P+LKTLRSCDLLP SW SVAWYPIY+IPTG TLKDLDACFLTYH LHT   GP          V  P +   +K+ L +FGLASYK +G S+W   GG  
Subjt:  PQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVE

Query:  HQLANKLSREADKWLRDLQVNHPDFVFFSRR
        HQLAN L + AD WLR  QVNHPDF+FF RR
Subjt:  HQLANKLSREADKWLRDLQVNHPDFVFFSRR

AT2G01260.1 Protein of unknown function (DUF789) | e-value: 8.2e-105 | identity: 52.4%
Query:  MLGAGVRFGRVR-GEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPA
        MLGAG +  R R G+D FY S++ R+   +++ D+L R Q   S  PS A      Q                    +P+    SNL+RFL+SVTPSVPA
Subjt:  MLGAGVRFGRVR-GEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPA

Query:  QFLSKSALRGWKTSDSERQ--PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GTESSTKPRQWGEETDSDYRDSSSDGSS
        QFLSK+ LR  +  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y      +SS K R+ G+ +DSD+RDSSSD SS
Subjt:  QFLSKSALRGWKTSDSERQ--PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GTESSTKPRQWGEETDSDYRDSSSDGSS

Query:  DSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCS
        DS+++R                    R+D +SLRDQ   H ED SSD+ E   SQG L+FEYLERD PY REP ADK+ DLA++FP+L TLRSCDLL  S
Subjt:  DSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL
        W SVAWYPIYRIPTG TLKDLDACFLTYH LHT+ GG  S QS  L         +++K+ L +FGLASYKF+G SLW   GG EHQL N L + ADKWL
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL

Query:  RDLQVNHPDFVFFSRR
            V+HPDF+FF RR
Subjt:  RDLQVNHPDFVFFSRR

AT2G01260.2 Protein of unknown function (DUF789) | e-value: 1.4e-83 | identity: 52.82%
Query:  MLGAGVRFGRVR-GEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPA
        MLGAG +  R R G+D FY S++ R+   +++ D+L R Q   S  PS A      Q                    +P+    SNL+RFL+SVTPSVPA
Subjt:  MLGAGVRFGRVR-GEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPA

Query:  QFLSKSALRGWKTSDSERQ--PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GTESSTKPRQWGEETDSDYRDSSSDGSS
        QFLSK+ LR  +  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y      +SS K R+ G+ +DSD+RDSSSD SS
Subjt:  QFLSKSALRGWKTSDSERQ--PYFVLGDLWETFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GTESSTKPRQWGEETDSDYRDSSSDGSS

Query:  DSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCS
        DS+++R                    R+D +SLRDQ   H ED SSD+ E   SQG L+FEYLERD PY REP ADK+ DLA++FP+L TLRSCDLL  S
Subjt:  DSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGG
        W SVAWYPIYRIPTG TLKDLDACFLTYH LHT+ GG
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGG

AT4G16100.1 Protein of unknown function (DUF789) | e-value: 8.5e-78 | identity: 43.98%
Query:  RVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKD--VSVQYPIIEAGDRVVSDEATKPVPQPAVSPL-----SNLERFLQSVTPSVPAQFL
        R+RGE+RFY+    RK    R+  RL   +       +  + D  + V+   I+  +   + + + P    + +       SNL RFL   TP V  Q L
Subjt:  RVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKD--VSVQYPIIEAGDRVVSDEATKPVPQPAVSPL-----SNLERFLQSVTPSVPAQFL

Query:  SKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLY--GTESSTKPRQWGEETDSDY-RDSSSDGSSDSETKRR
          ++ +GW+T + E +PYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY   + + T  R+ GEE+D D  RD SSDGS+D      
Subjt:  SKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLY--GTESSTKPRQWGEETDSDY-RDSSSDGSSDSETKRR

Query:  IKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAE-SCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAW
           CRE   +          + R SL ++        SSDE+E S NS G L+FEYLE   P+ REPL DKIS+L+S+FP L+T RSCDL P SW+SVAW
Subjt:  IKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAE-SCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAW

Query:  YPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVN
        YPIYRIP GQ+L++LDACFLT+H L T   G  + +       +      + K+PL  FGLASYKFK S     +   E+Q    L R A++WLR L+V 
Subjt:  YPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVN

Query:  HPDFVFF
         PDF  F
Subjt:  HPDFVFF

AT5G49220.1 Protein of unknown function (DUF789) | e-value: 8.8e-75 | identity: 42.46%
Query:  GAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEH----------------ASATPSCAVKDVSV---QYPIIEAGDRVVSDEATKPVPQPAV-SP
        G  +    +RGE+RFY+    R+     Q  +  R ++                 A+  P    K + V   +  ++ +G  V +  +        V S 
Subjt:  GAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEH----------------ASATPSCAVKDVSV---QYPIIEAGDRVVSDEATKPVPQPAV-SP

Query:  LSNLERFLQSVTPSVPAQFLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGE
         SNL+RFL+  TP VPA+     +    KT +S+   YFVL DLWE+F EWSAYGAGV     PL ++  D  VQYYVPYLSGIQLY  +   KPR    
Subjt:  LSNLERFLQSVTPSVPAQFLSKSALRGWKTSDSERQPYFVLGDLWETFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGE

Query:  ETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRF
               + SS+GSS+S T               P   +   ++R+SL+DQ +      SS EAE  N QG LLFEYLE + P+ REPLA+KISDLASR 
Subjt:  ETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRMDRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRF

Query:  PQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTD--AKKVPLRIFGLASYKFKGSSLWMRNGG
        P+L T RSCDLLP SW+SV+WYPIYRIP G TL++LDACFLT+H L TA      PQS      A  C     + K+PL  FGLASYK K  S+W +N  
Subjt:  PQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGPQSPQSPQLPFVAYPCKTD--AKKVPLRIFGLASYKFKGSSLWMRNGG

Query:  VEHQLANKLSREADKWLRDLQVNHPDFVFFS
         E Q    L + ADKWL+ LQV+HPD+ FF+
Subjt:  VEHQLANKLSREADKWLRDLQVNHPDFVFFS


Sequences
CDS sequence
ATGTTAGGAGCAGGTGTTCGGTTTGGTCGCGTCAGAGGTGAGGACCGGTTTTACGATTCATCGAGAGCCAGGAAGGGCCTTCTCAGCCGTCAAAATGATAGGCTCTGTAG
ACCTCAAGAACACGCTTCAGCTACTCCATCTTGCGCGGTTAAGGATGTTTCCGTGCAGTATCCGATTATAGAGGCTGGGGACCGTGTCGTCTCTGATGAAGCTACTAAAC
CAGTTCCCCAGCCGGCTGTTTCTCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCGTCTGTGCCTGCTCAGTTTCTCTCTAAGAGTGCGTTGAGAGGTTGG
AAGACGAGCGATTCGGAGAGGCAGCCTTACTTTGTGCTTGGTGATTTGTGGGAAACTTTCAAGGAGTGGAGCGCTTATGGCGCTGGAGTGCCTCTGTTATTGAATAACAC
TGATGGTGTGGTTCAATATTATGTCCCGTATTTGTCTGGCATACAATTGTATGGCACGGAATCGTCTACAAAGCCAAGGCAATGGGGCGAGGAAACTGATAGTGACTATA
GAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACAAAGAGAAGAATAAAACACTGTAGAGAACCACCCCACCATAATGATCCGTCTATCACGGCTCCTCTTCGAATG
GATAGATTGTCTTTGAGGGACCAGCATTTGGGACATCTCGAGGACTGCTCCAGCGATGAGGCTGAATCTTGCAATTCTCAAGGTTGCCTTCTATTTGAGTATCTTGAAAG
AGACCAACCGTATTCACGTGAACCTCTGGCAGACAAGATATCGGACCTTGCGTCTCGTTTCCCTCAGCTGAAAACATTGAGAAGTTGTGACCTACTACCATGTAGTTGGA
TATCTGTGGCATGGTACCCAATTTACAGGATACCAACTGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACATACCATTTTCTACATACCGCAATGGGAGGCCCT
CAAAGCCCTCAAAGCCCTCAATTGCCATTTGTGGCATATCCTTGTAAGACGGATGCCAAAAAAGTTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAAAGGCTC
ATCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAATTGGCAAACAAGCTCTCGCGGGAAGCTGATAAGTGGTTAAGAGATCTCCAGGTCAATCACCCAGATTTCGTAT
TCTTCAGCCGACGAGATTTAGCACCTTACTGA
mRNA sequence
TTTCAATATTTAATTTCTTTATTTTAAAAAAGAAAAAAGAAAAAAAATCTCCCTCATTCTTAATCCTAATCGGTTTTTCGTCGAGCCGCTTTGATTTCCTCCCATTCCAT
TCGACTTCGTGTTATTCAAAGCTACTGATTCTCTTCTCTCTTGAAATTCCTCATCCGCGATCTGATTTCCCCGGAAAGATCTGACAATTGCTGTTTCTGCTGTCTGGATT
TCAGATTATCAGATCGCACCGCCATTTCTCTTCCACTTCCGCTTCGTCGCCGTCGTCGTCGTCGTCGTTGACTTTGGTTAGGGATTGTTCTTGCCACTTGCAGACTAGTG
TTCCTTAGTTGGTGGTAGACTGTCGAGATGTTAGGAGCAGGTGTTCGGTTTGGTCGCGTCAGAGGTGAGGACCGGTTTTACGATTCATCGAGAGCCAGGAAGGGCCTTCT
CAGCCGTCAAAATGATAGGCTCTGTAGACCTCAAGAACACGCTTCAGCTACTCCATCTTGCGCGGTTAAGGATGTTTCCGTGCAGTATCCGATTATAGAGGCTGGGGACC
GTGTCGTCTCTGATGAAGCTACTAAACCAGTTCCCCAGCCGGCTGTTTCTCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCGTCTGTGCCTGCTCAGTTT
CTCTCTAAGAGTGCGTTGAGAGGTTGGAAGACGAGCGATTCGGAGAGGCAGCCTTACTTTGTGCTTGGTGATTTGTGGGAAACTTTCAAGGAGTGGAGCGCTTATGGCGC
TGGAGTGCCTCTGTTATTGAATAACACTGATGGTGTGGTTCAATATTATGTCCCGTATTTGTCTGGCATACAATTGTATGGCACGGAATCGTCTACAAAGCCAAGGCAAT
GGGGCGAGGAAACTGATAGTGACTATAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACAAAGAGAAGAATAAAACACTGTAGAGAACCACCCCACCATAATGAT
CCGTCTATCACGGCTCCTCTTCGAATGGATAGATTGTCTTTGAGGGACCAGCATTTGGGACATCTCGAGGACTGCTCCAGCGATGAGGCTGAATCTTGCAATTCTCAAGG
TTGCCTTCTATTTGAGTATCTTGAAAGAGACCAACCGTATTCACGTGAACCTCTGGCAGACAAGATATCGGACCTTGCGTCTCGTTTCCCTCAGCTGAAAACATTGAGAA
GTTGTGACCTACTACCATGTAGTTGGATATCTGTGGCATGGTACCCAATTTACAGGATACCAACTGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACATACCAT
TTTCTACATACCGCAATGGGAGGCCCTCAAAGCCCTCAAAGCCCTCAATTGCCATTTGTGGCATATCCTTGTAAGACGGATGCCAAAAAAGTTCCTTTAAGAATTTTTGG
ACTTGCTTCATACAAGTTTAAAGGCTCATCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAATTGGCAAACAAGCTCTCGCGGGAAGCTGATAAGTGGTTAAGAGATC
TCCAGGTCAATCACCCAGATTTCGTATTCTTCAGCCGACGAGATTTAGCACCTTACTGA
Protein sequence
MLGAGVRFGRVRGEDRFYDSSRARKGLLSRQNDRLCRPQEHASATPSCAVKDVSVQYPIIEAGDRVVSDEATKPVPQPAVSPLSNLERFLQSVTPSVPAQFLSKSALRGW
KTSDSERQPYFVLGDLWETFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGTESSTKPRQWGEETDSDYRDSSSDGSSDSETKRRIKHCREPPHHNDPSITAPLRM
DRLSLRDQHLGHLEDCSSDEAESCNSQGCLLFEYLERDQPYSREPLADKISDLASRFPQLKTLRSCDLLPCSWISVAWYPIYRIPTGQTLKDLDACFLTYHFLHTAMGGP
QSPQSPQLPFVAYPCKTDAKKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHPDFVFFSRRDLAPY
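
The protein sequence should follow from translating the CDS codon by codon. A minimal Python sketch of that check follows. It assumes the standard genetic code applies to this nuclear gene, and the function name `translate` is hypothetical. Codons containing characters other than A, C, G, or T would raise a `KeyError` in this sketch.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGTTAGGAGCAGGTGTTCGG"))  # prints MLGAGVR
```

The seven residues match the start of the protein sequence; passing the full CDS text to `translate` and comparing against the listed protein would extend the same check to the whole record.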