CuGenDBv2

Tan0011077 (gene) of Snake gourd v1 genome

Gene ID: Tan0011077
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Protein of unknown function (DUF789)
Genome location: LG04:6962248..6966350
RNA-Seq Expression: Tan0011077
Synteny: Tan0011077
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789


Homology
GenBank top hits | e value | % identity | Alignment
KAG6573941.1 hypothetical protein SDJN03_27828, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.1e-217 | % identity: 91.3
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   IT AGDRVVSDEATKPV    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERDLPYSREPLADKI DLASRFP+LKT+RSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTA+GG QSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLR+LQVNH
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH

Query:  PDFLFFSRRDLTPY
        PDF+FFS+RDL PY
Subjt:  PDFLFFSRRDLTPY

XP_022945838.1 uncharacterized protein LOC111449961 isoform X1 [Cucurbita moschata] | e value: 2.2e-216 | % identity: 90.89
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   I  AGDRVVSDEATKPV    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERD PYSREPLADKI DLASRFPQLKT+RSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG---PQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ
        AWYPIYRIPTGQTLKDLDACFLTYH LHTA+GG   PQSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG---PQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ

Query:  VNHPDFLFFSRRDLTPY
        VNHPDF+FFSRRDL PY
Subjt:  VNHPDFLFFSRRDLTPY

XP_022945839.1 uncharacterized protein LOC111449961 isoform X2 [Cucurbita moschata] | e value: 5.1e-218 | % identity: 91.55
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   I  AGDRVVSDEATKPV    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERD PYSREPLADKI DLASRFPQLKT+RSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH
        AWYPIYRIPTGQTLKDLDACFLTYH LHTA+GGPQSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH

Query:  PDFLFFSRRDLTPY
        PDF+FFSRRDL PY
Subjt:  PDFLFFSRRDLTPY

XP_022968120.1 uncharacterized protein LOC111467452 isoform X2 [Cucurbita maxima] | e value: 9.7e-217 | % identity: 90.82
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   IT AGDRVVSDEATKP+    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERD PYSREPLADKI DLASRFPQLKTMRSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTA+GG QSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL++LQVNH
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH

Query:  PDFLFFSRRDLTPY
        PDF+FF RRDL PY
Subjt:  PDFLFFSRRDLTPY

XP_023541256.1 uncharacterized protein LOC111801477 [Cucurbita pepo subsp. pepo] | e value: 3.3e-217 | % identity: 91.3
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFG  RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   IT AGDRVVSDEATKPV    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERDLPYSREPLADKI DLASRFPQLKT+RSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH
        AWYPIYRIPTGQTLKDLDACFLTYHSLHT +GG QSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLR+LQVNH
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH

Query:  PDFLFFSRRDLTPY
        PDF+FFSRRDL PY
Subjt:  PDFLFFSRRDLTPY

TrEMBL top hits | e value | % identity | Alignment
A0A6J1DC61 uncharacterized protein LOC111018737 | e value: 7.0e-213 | % identity: 89.69
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAI--SNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGRGRGEDRFYDSSRAR+GLLSRQNDRLCRPQEDASATPSC  KD+S+HS IT    RV SDEATKPVA+   NPQPVVSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAI--SNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGWRT DSE QPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGME S KPRRWGEESDSDYRDSSSDGSSDS
Subjt:  PSVPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDS

Query:  DTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWI
        +TKRRIKH RE  HHNDPSITAPLR+DRLSLRDQ +GL EDCSSDEAES NS+G LLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWI
Subjt:  DTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSD-AEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAI GPQS QVPFVAYPCK+D AEK+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSD-AEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ

Query:  VNHPDFLFFSRRDLTPY
        VNHPDFLFFSRRD TPY
Subjt:  VNHPDFLFFSRRDLTPY

A0A6J1G225 uncharacterized protein LOC111449961 isoform X2 | e value: 2.5e-218 | % identity: 91.55
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   I  AGDRVVSDEATKPV    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERD PYSREPLADKI DLASRFPQLKT+RSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH
        AWYPIYRIPTGQTLKDLDACFLTYH LHTA+GGPQSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH

Query:  PDFLFFSRRDLTPY
        PDF+FFSRRDL PY
Subjt:  PDFLFFSRRDLTPY

A0A6J1G242 uncharacterized protein LOC111449961 isoform X1 | e value: 1.0e-216 | % identity: 90.89
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   I  AGDRVVSDEATKPV    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERD PYSREPLADKI DLASRFPQLKT+RSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG---PQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ
        AWYPIYRIPTGQTLKDLDACFLTYH LHTA+GG   PQSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG---PQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ

Query:  VNHPDFLFFSRRDLTPY
        VNHPDF+FFSRRDL PY
Subjt:  VNHPDFLFFSRRDLTPY

A0A6J1HWA7 uncharacterized protein LOC111467452 isoform X2 | e value: 4.7e-217 | % identity: 90.82
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   IT AGDRVVSDEATKP+    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERD PYSREPLADKI DLASRFPQLKTMRSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTA+GG QSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL++LQVNH
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNH

Query:  PDFLFFSRRDLTPY
        PDF+FF RRDL PY
Subjt:  PDFLFFSRRDLTPY

A0A6J1HYQ4 uncharacterized protein LOC111467452 isoform X1 | e value: 1.5e-215 | % identity: 90.17
Query:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS
        MLGAGVRFGR RGEDRFYDSSRARKGLLSRQNDRLCRPQE ASATPSCA KD SV   IT AGDRVVSDEATKP+    PQP VSPLSNLERFLQSVTPS
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPS

Query:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT
        VPAQFLSKSALRGW+TSDSERQP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG ESSTKPR+WGEE+DSDYRDSSSDGSSDS+T
Subjt:  VPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDT

Query:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV
        KRRIKH REPPHHNDPSITAPLRMDRLSLRDQ LG LEDCSSDEAESCNSQG LLFEYLERD PYSREPLADKI DLASRFPQLKTMRSCDLLP SWISV
Subjt:  KRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGP---QSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTA+GG    QSPQ+PFVAYPCK+DA+KVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWL++LQ
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGP---QSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQ

Query:  VNHPDFLFFSRRDLTPY
        VNHPDF+FF RRDL PY
Subjt:  VNHPDFLFFSRRDLTPY

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G15030.1 Protein of unknown function (DUF789) | e value: 1.1e-101 | % identity: 60.98
Query:  SNLERFLQSVTPSVPAQFLSKSALRGWRTSDSERQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRWGE
        SN+ERFL SVTPSVPA +LSK+ +R    SD E Q PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y     + SS + RR GE
Subjt:  SNLERFLQSVTPSVPAQFLSKSALRGWRTSDSERQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRWGE

Query:  ESDSDYRDSSSDGSSDSDTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRF
        ES+SD+RDSSS+GSS S+++R + +++E        I+A  RMD+LSLR +     ED SSD+ E  +SQG L+FEYLERDLPY REP ADK+ DLASRF
Subjt:  ESDSDYRDSSSDGSSDSDTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRF

Query:  PQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQL
        P+LKT+RSCDLLP SW SVAWYPIY+IPTG TLKDLDACFLTYHSLHT   GP            +   EK+ L +FGLASYK +G S+W   GG  HQL
Subjt:  PQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQL

Query:  ANKLSREADKWLRDLQVNHPDFLFFSRR
        AN L + AD WLR  QVNHPDF+FF RR
Subjt:  ANKLSREADKWLRDLQVNHPDFLFFSRR

AT2G01260.1 Protein of unknown function (DUF789) | e value: 2.5e-109 | % identity: 54.68
Query:  MLGAGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTP
        MLGAG +  RGR G+D FY S++ R+   +++ D+L R Q D S  PS A    S H             +  +P  +S+        SNL+RFL+SVTP
Subjt:  MLGAGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTP

Query:  SVPAQFLSKSALRGWRTSDSERQ--PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRWGEESDSDYRDSSS
        SVPAQFLSK+ LR  R  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++SS K RR G+ SDSD+RDSSS
Subjt:  SVPAQFLSKSALRGWRTSDSERQ--PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRWGEESDSDYRDSSS

Query:  DGSSDSDTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDL
        D SSDSD++R                    R+D +SLRDQ     ED SSD+ E   SQG L+FEYLERDLPY REP ADK+LDLA++FP+L T+RSCDL
Subjt:  DGSSDSDTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDL

Query:  LPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKW
        L  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT+ GG  S Q   +  P   ++EK+ L +FGLASYKF+G SLW   GG EHQL N L + ADKW
Subjt:  LPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKW

Query:  LRDLQVNHPDFLFFSRR
        L    V+HPDFLFF RR
Subjt:  LRDLQVNHPDFLFFSRR

AT2G01260.2 Protein of unknown function (DUF789) | e value: 4.5e-87 | % identity: 54.84
Query:  MLGAGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTP
        MLGAG +  RGR G+D FY S++ R+   +++ D+L R Q D S  PS A    S H             +  +P  +S+        SNL+RFL+SVTP
Subjt:  MLGAGVRFGRGR-GEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTP

Query:  SVPAQFLSKSALRGWRTSDSERQ--PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRWGEESDSDYRDSSS
        SVPAQFLSK+ LR  R  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++SS K RR G+ SDSD+RDSSS
Subjt:  SVPAQFLSKSALRGWRTSDSERQ--PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMESSTKPRRWGEESDSDYRDSSS

Query:  DGSSDSDTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDL
        D SSDSD++R                    R+D +SLRDQ     ED SSD+ E   SQG L+FEYLERDLPY REP ADK+LDLA++FP+L T+RSCDL
Subjt:  DGSSDSDTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDL

Query:  LPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG
        L  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT+ GG
Subjt:  LPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGG

AT4G16100.1 Protein of unknown function (DUF789) | e value: 4.2e-77 | % identity: 45.3
Query:  RGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASV---HSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPSVPAQFL
        R RGE+RFY+    RK    R+  RL   + +     +    D  +      I    +   SD +      S      +  SNL RFL   TP V  Q L
Subjt:  RGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASV---HSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPSVPAQFL

Query:  SKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLY--GMESSTKPRRWGEESDSDY-RDSSSDGSSDSDTKRR
          ++ +GWRT + E +PYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY     + T  RR GEESD D  RD SSDGS+D    R 
Subjt:  SKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLY--GMESSTKPRRWGEESDSDY-RDSSSDGSSDSDTKRR

Query:  IKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAE-SCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISVAW
        +  N                + R SL ++        SSDE+E S NS G L+FEYLE  +P+ REPL DKI +L+S+FP L+T RSCDL P SW+SVAW
Subjt:  IKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAE-SCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISVAW

Query:  YPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHPD
        YPIYRIP GQ+L++LDACFLT+HSL T   G  + +    +    S   K+PL  FGLASYKFK S     +   E+Q    L R A++WLR L+V  PD
Subjt:  YPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHPD

Query:  FLFF
        F  F
Subjt:  FLFF

AT5G49220.1 Protein of unknown function (DUF789) | e value: 2.6e-74 | % identity: 43.84
Query:  RGEDRFYDSSRARK-----GLLSRQNDRLCRPQED-----------ASATPSCAAKDASVHSS---ITGAGDRVVSDEATKPVAISNPQPVVSPLSNLER
        RGE+RFY+    R+      L  +  ++  R  ED           A+  P    K   V  S   +  +G  V +  +    + S    V+S  SNL+R
Subjt:  RGEDRFYDSSRARK-----GLLSRQNDRLCRPQED-----------ASATPSCAAKDASVHSS---ITGAGDRVVSDEATKPVAISNPQPVVSPLSNLER

Query:  FLQSVTPSVPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDY
        FL+  TP VPA+     +    +T +S+   YFVL DLWE+F EWSAYGAGV     PL ++  D  VQYYVPYLSGIQLY ++   KPR     +    
Subjt:  FLQSVTPSVPAQFLSKSALRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDY

Query:  RDSSSDGSSDSDTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTM
         + SS+GSS+S T               P   +   ++R+SL+DQ   +    SS EAE  N QG LLFEYLE + P+ REPLA+KI DLASR P+L T 
Subjt:  RDSSSDGSSDSDTKRRIKHNREPPHHNDPSITAPLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTM

Query:  RSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSD--AEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKL
        RSCDLLP SW+SV+WYPIYRIP G TL++LDACFLT+HSL TA   PQS      A  C     + K+PL  FGLASYK K  S+W +N   E Q    L
Subjt:  RSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIGGPQSPQVPFVAYPCKSD--AEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKL

Query:  SREADKWLRDLQVNHPDFLFFS
         + ADKWL+ LQV+HPD+ FF+
Subjt:  SREADKWLRDLQVNHPDFLFFS


Sequences
CDS sequence
ATGTTAGGAGCGGGTGTACGGTTTGGTCGCGGCAGGGGAGAGGACAGGTTTTACGATTCATCGAGAGCGAGGAAGGGCCTTCTCAGTCGTCAAAACGATAGGCTGTGTAG
ACCTCAAGAGGACGCTTCGGCTACTCCCTCTTGCGCGGCTAAGGATGCTTCGGTGCATTCTTCGATTACAGGGGCTGGGGACCGTGTGGTGTCTGATGAAGCTACTAAAC
CAGTTGCCATTTCTAATCCCCAGCCCGTTGTTTCTCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCATCTGTGCCTGCTCAGTTTCTCTCCAAGAGTGCG
TTGAGAGGCTGGAGGACGAGCGATTCGGAGAGGCAACCTTACTTTGTGCTTGGTGATTTGTGGGAGGCTTTCAAGGAGTGGAGCGCTTATGGTGCTGGAGTGCCTCTTTT
ATTGAATAACACTGATGGTGTAGTTCAGTATTATGTCCCGTATTTGTCTGGTATACAATTGTACGGCATGGAATCGTCTACAAAGCCAAGGCGATGGGGTGAGGAAAGTG
ACAGTGACTACAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGATACAAAGAGAAGAATAAAACACAATAGAGAACCACCCCACCACAATGATCCGTCTATCACAGCT
CCTCTTAGAATGGATAGATTGTCTTTGAGGGACCAGCGTTTGGGACTTCTTGAGGACTGCTCCAGTGATGAGGCTGAATCTTGCAATTCTCAAGGTGGCCTTCTATTTGA
GTATCTTGAAAGAGACCTACCGTATTCACGTGAACCGTTGGCAGACAAGATATTGGACCTTGCTTCTCGCTTCCCTCAGCTGAAAACAATGAGAAGTTGTGACCTACTAC
CGTATAGTTGGATATCTGTGGCATGGTACCCAATTTACAGGATACCAACTGGTCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACATACCATTCTCTACACACAGCA
ATCGGAGGCCCTCAAAGCCCACAAGTGCCATTTGTGGCATATCCTTGTAAGTCAGATGCCGAAAAGGTTCCTCTAAGAATTTTTGGACTTGCTTCATATAAGTTTAAAGG
GTCGTCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAATTGGCAAACAAGCTCTCGCGGGAAGCTGATAAGTGGTTAAGAGATCTCCAGGTCAATCACCCAGATTTCC
TGTTCTTCAGCCGCCGAGATTTAACACCTTACTGA
mRNA sequence
ATTTCATTCCCTCGAAAAAAAAAAAAAACTCTATCATTCTTAATCGTAATCGTTTTTTCGTCGAACCGTTTTGATCTGTTTACCATTCTCACCCTTATGAAAACCCGCCT
CTTTCTTCCAATTCGATTCACTGTGTTTTTTCTTTTCGTCTTCGTTTCAGACTTTGTGTTATTCATCGCTACTGATTGATTCTTGGATCGATTATCGTTTTTGTTGAAAT
TCTTCATCGGCGATCTGATTTCCCCAGGCAGATCCGACAATTGCTGTTTCTGCTGTTTGGATTTCGGATTATCAGATCGCTCCGCCATTTTCTTCCAGTTCCAGTTCCAG
TTTGTCGTCGTTGACTTTGGTGAGGGATTGTCATTGCAACTTACAGAGTAGTGTTCATCGTTCATCAGTTGGTGGTAGACTGTCGAGATGTTAGGAGCGGGTGTACGGTT
TGGTCGCGGCAGGGGAGAGGACAGGTTTTACGATTCATCGAGAGCGAGGAAGGGCCTTCTCAGTCGTCAAAACGATAGGCTGTGTAGACCTCAAGAGGACGCTTCGGCTA
CTCCCTCTTGCGCGGCTAAGGATGCTTCGGTGCATTCTTCGATTACAGGGGCTGGGGACCGTGTGGTGTCTGATGAAGCTACTAAACCAGTTGCCATTTCTAATCCCCAG
CCCGTTGTTTCTCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCATCTGTGCCTGCTCAGTTTCTCTCCAAGAGTGCGTTGAGAGGCTGGAGGACGAGCGA
TTCGGAGAGGCAACCTTACTTTGTGCTTGGTGATTTGTGGGAGGCTTTCAAGGAGTGGAGCGCTTATGGTGCTGGAGTGCCTCTTTTATTGAATAACACTGATGGTGTAG
TTCAGTATTATGTCCCGTATTTGTCTGGTATACAATTGTACGGCATGGAATCGTCTACAAAGCCAAGGCGATGGGGTGAGGAAAGTGACAGTGACTACAGAGATTCAAGT
AGTGATGGTAGTAGTGATTCTGATACAAAGAGAAGAATAAAACACAATAGAGAACCACCCCACCACAATGATCCGTCTATCACAGCTCCTCTTAGAATGGATAGATTGTC
TTTGAGGGACCAGCGTTTGGGACTTCTTGAGGACTGCTCCAGTGATGAGGCTGAATCTTGCAATTCTCAAGGTGGCCTTCTATTTGAGTATCTTGAAAGAGACCTACCGT
ATTCACGTGAACCGTTGGCAGACAAGATATTGGACCTTGCTTCTCGCTTCCCTCAGCTGAAAACAATGAGAAGTTGTGACCTACTACCGTATAGTTGGATATCTGTGGCA
TGGTACCCAATTTACAGGATACCAACTGGTCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACATACCATTCTCTACACACAGCAATCGGAGGCCCTCAAAGCCCACA
AGTGCCATTTGTGGCATATCCTTGTAAGTCAGATGCCGAAAAGGTTCCTCTAAGAATTTTTGGACTTGCTTCATATAAGTTTAAAGGGTCGTCATTGTGGATGCGAAATG
GTGGAGTTGAGCATCAATTGGCAAACAAGCTCTCGCGGGAAGCTGATAAGTGGTTAAGAGATCTCCAGGTCAATCACCCAGATTTCCTGTTCTTCAGCCGCCGAGATTTA
ACACCTTACTGATACAATACTCTTACCGACTAGAAAACGAAAGTTGACAAGTTGTGGCCCTGCAAAAGAATCCCTTAATTAGTTGGATTAAGTTGTGTATGCTTTTTGAA
GGAGTGGTTCACTCGTTTGCTTTCGTTAAGGCGGGGGGCTGAAGGAAGGTTAAGGAATGGGATAGTTCGCAAAATACTGAAGCTTTAGATGGTCGCGCGCATATGCATGG
TGTAATAACAAAGACCGATAGGAAGTGAAGGCTAAAAAATGTAAGAAGAAAACATTGCTGTGAGGCACACTGGTTTAGTGAGGTAGATGTTGGTTCGACATGTATTAATA
AGCAGGTTTTCTTGTTTGCTTTCACCTTGTACTTTATTTATCCCTAGAAGGGCCATCATTTGAGAAATCAAGCCTTCCTTAATCCTTCTGACTGACAGTCCTAATTTTGT
TAGAATGTAATGTTATTTAAGGGGATTTTAATAGGTTTTATTCAGGCATTGAACTATACTGCAGGAGTTTCCTTGACCATGAACTGGTTATAAGTATAACGATTAGAAAT
GTTTATGGAAAGCTGTACAATACGCCCTGAACTCGGGTATTTATCTGTTCTTGGTTTGGTTTTCTTCTTTTCCATATCAATAGAAGTGATGAAAGTTTTAAGCCATGTGC
ATACCCTGCTTAATTAAGAACAGGGCCTGTTTATAATGACTTTCTAAGTACTTAGAAAGTCATTCCATAGGAAACCATTCAGTTCTGACTTTTTACGTGCTTAAAAATGG
CCCTAAGATGTATTTGAAGATGTGGGAGTTCTTTCCTTGGCTTTTCTACTGGTCATTTTTTTTTGTCAGGTATCGACCACCATCATGAAGTGAGAGATGAAGATGAAACA
ATGTGAATGAAACCAAGTGGATCCCGACTCCTGAGCAATGCTTGATGGCCTCGTGTTTGTTTTGTTCTGATCTGTAGAAATGGTAAGTGTTGGTAACAGAAACTCTACTT
TCTGCTTTACAGAGCCAGGGACCACTAATGGGAAAGAGCGTTGGCTTGCGTGGTGGTCATAAGACTCGTAATAAAGTTTCTGATTCTACCATACTAGGTAAGTTTTCTAA
ATTGCTGTTTAATATTTCTATTAATAATGATGCTAAATGTTAATTGAGTGAATAGATATTGCATTAGTATTGGAATTCTTTTCGGTGTTTGTAAAGAGGCTGAAGAATCT
GGACGTGGC
Protein sequence
MLGAGVRFGRGRGEDRFYDSSRARKGLLSRQNDRLCRPQEDASATPSCAAKDASVHSSITGAGDRVVSDEATKPVAISNPQPVVSPLSNLERFLQSVTPSVPAQFLSKSA
LRGWRTSDSERQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMESSTKPRRWGEESDSDYRDSSSDGSSDSDTKRRIKHNREPPHHNDPSITA
PLRMDRLSLRDQRLGLLEDCSSDEAESCNSQGGLLFEYLERDLPYSREPLADKILDLASRFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA
IGGPQSPQVPFVAYPCKSDAEKVPLRIFGLASYKFKGSSLWMRNGGVEHQLANKLSREADKWLRDLQVNHPDFLFFSRRDLTPY