CuGenDBv2

MC09g1497 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g1497
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF789)
Genome location: MC09:20834061..20841010
RNA-Seq Expression: MC09g1497
Synteny: MC09g1497
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
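
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits that string into its parts and computes the length of the interval. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("MC09:20834061..20841010")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

The span covers the whole gene interval on the chromosome, so it is expected to be longer than the mRNA and CDS sequences listed further down.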


Homology
GenBank top hits | e value | % identity | Alignment
KAG6573941.1 hypothetical protein SDJN03_27828, partial [Cucurbita argyrosperma subsp. sororia] | e value: 5.69e-253 | % identity: 84.73
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERDLPYSREPLADK S  DLASRFP+LKT+RSCDLLP S
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF
        WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA+ G QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR 
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF

Query:  LQVNHPDFLFFSRRDATPY
        LQVNHPDF+FFS+RD  PY
Subjt:  LQVNHPDFLFFSRRDATPY
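
In these alignments the middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a blank for a mismatch or gap. As a rough sketch, the snippet below tallies those three cases for the final segment of the KAG6573941.1 alignment above. The function name `tally_midline` is hypothetical. The counts apply to this one segment only; the % identity reported for the hit is computed over the full alignment.

```python
def tally_midline(query: str, midline: str) -> dict[str, int]:
    """Count identical, conservative and other columns in one alignment segment."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    other = len(midline) - identical - conservative
    return {"identical": identical, "conservative": conservative, "other": other}


# Final segment of the KAG6573941.1 alignment, copied from the record above.
query = "LQVNHPDFLFFSRRDATPY"
midline = "LQVNHPDF+FFS+RD  PY"

counts = tally_midline(query, midline)
print(counts)
```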

XP_022150656.1 uncharacterized protein LOC111018737 [Momordica charantia] | e value: 1.85e-307 | % identity: 99.28
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV
        MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV

Query:  PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK
        PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK
Subjt:  PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK

Query:  RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYSWIS
        RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADK   LDLASRFPQLKTMRSCDLLPYSWIS
Subjt:  RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYSWIS

Query:  VAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQV
        VAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQV
Subjt:  VAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQV

Query:  NHPDFLFFSRRDATPY
        NHPDFLFFSRRDATPY
Subjt:  NHPDFLFFSRRDATPY

XP_022945839.1 uncharacterized protein LOC111449961 isoform X2 [Cucurbita moschata] | e value: 8.08e-253 | % identity: 84.73
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PI     RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK S  DLASRFPQLKT+RSCDLLP S
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF
        WISVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+ GPQS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR 
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF

Query:  LQVNHPDFLFFSRRDATPY
        LQVNHPDF+FFSRRD  PY
Subjt:  LQVNHPDFLFFSRRDATPY

XP_022968120.1 uncharacterized protein LOC111467452 isoform X2 [Cucurbita maxima] | e value: 9.41e-252 | % identity: 84.25
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKP+      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK S  DLASRFPQLKTMRSCDLLP S
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF
        WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA+ G QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WL+ 
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF

Query:  LQVNHPDFLFFSRRDATPY
        LQVNHPDF+FF RRD  PY
Subjt:  LQVNHPDFLFFSRRDATPY

XP_023541256.1 uncharacterized protein LOC111801477 [Cucurbita pepo subsp. pepo] | e value: 2.31e-252 | % identity: 84.73
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFG  RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERDLPYSREPLADK S  DLASRFPQLKT+RSCDLLP S
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF
        WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT + G QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR 
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF

Query:  LQVNHPDFLFFSRRDATPY
        LQVNHPDF+FFSRRD  PY
Subjt:  LQVNHPDFLFFSRRDATPY

TrEMBL top hits | e value | % identity | Alignment
A0A6J1DC61 uncharacterized protein LOC111018737 | e value: 8.96e-308 | % identity: 99.28
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV
        MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV

Query:  PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK
        PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK
Subjt:  PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK

Query:  RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYSWIS
        RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADK   LDLASRFPQLKTMRSCDLLPYSWIS
Subjt:  RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYSWIS

Query:  VAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQV
        VAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQV
Subjt:  VAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQV

Query:  NHPDFLFFSRRDATPY
        NHPDFLFFSRRDATPY
Subjt:  NHPDFLFFSRRDATPY

A0A6J1G225 uncharacterized protein LOC111449961 isoform X2 | e value: 3.91e-253 | % identity: 84.73
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PI     RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK S  DLASRFPQLKT+RSCDLLP S
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF
        WISVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+ GPQS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR 
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF

Query:  LQVNHPDFLFFSRRDATPY
        LQVNHPDF+FFSRRD  PY
Subjt:  LQVNHPDFLFFSRRDATPY

A0A6J1G242 uncharacterized protein LOC111449961 isoform X1 | e value: 5.93e-251 | % identity: 84.12
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PI     RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK S  DLASRFPQLKT+RSCDLLP S
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQ---VPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHW
        WISVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+ GPQS Q   +PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD W
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQ---VPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHW

Query:  LRFLQVNHPDFLFFSRRDATPY
        LR LQVNHPDF+FFSRRD  PY
Subjt:  LRFLQVNHPDFLFFSRRDATPY

A0A6J1HWA7 uncharacterized protein LOC111467452 isoform X2 | e value: 4.56e-252 | % identity: 84.25
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKP+      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK S  DLASRFPQLKTMRSCDLLP S
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF
        WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA+ G QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WL+ 
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRF

Query:  LQVNHPDFLFFSRRDATPY
        LQVNHPDF+FF RRD  PY
Subjt:  LQVNHPDFLFFSRRDATPY

A0A6J1HYQ4 uncharacterized protein LOC111467452 isoform X1 | e value: 4.86e-250 | % identity: 83.65
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKP+      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK S  DLASRFPQLKTMRSCDLLP S
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYS

Query:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGP---QSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHW
        WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA+ G    QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD W
Subjt:  WISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGP---QSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHW

Query:  LRFLQVNHPDFLFFSRRDATPY
        L+ LQVNHPDF+FF RRD  PY
Subjt:  LRFLQVNHPDFLFFSRRDATPY

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G15030.1 Protein of unknown function (DUF789) | e value: 1.5e-103 | % identity: 61.45
Query:  SNLERFLQSVTPSVPAQFFSKSSLRGWRTCDSETQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGE
        SN+ERFL SVTPSVPA + SK+ +R     D E+Q PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y     +  S + RR GE
Subjt:  SNLERFLQSVTPSVPAQFFSKSSLRGWRTCDSETQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGE

Query:  ESDSDYRDSSSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLAS
        ES+SD+RDSSS+GSS SE++R + +++E +           R+D+LSLR +H    ED SSD+ E  +S+GRL+FEYLERDLPY REP ADK S  DLAS
Subjt:  ESDSDYRDSSSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLAS

Query:  RFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGP-QSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGV
        RFP+LKT+RSCDLLP SW SVAWYPIY+IPTG TLKDLDACFLTYHSLHT  +GP  +T    V  P   +S EK+ L +FGLASYK +G S+W   GG 
Subjt:  RFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGP-QSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGV

Query:  EHQLANSLSQAADHWLRFLQVNHPDFLFFSRR
         HQLANSL QAAD+WLR  QVNHPDF+FF RR
Subjt:  EHQLANSLSQAADHWLRFLQVNHPDFLFFSRR

AT2G01260.1 Protein of unknown function (DUF789) | e value: 5.1e-107 | % identity: 53.79
Query:  MLGAGVRFGRGR-GEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQ---PVVSPLSNLERFLQSV
        MLGAG +  RGR G+D FY S++ RR   +++ D+L R Q D S  PS                         + P+P+ Q   P     SNL+RFL+SV
Subjt:  MLGAGVRFGRGR-GEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQ---PVVSPLSNLERFLQSV

Query:  TPSVPAQFFSKSSLRGWRTCD--SETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGEESDSDYRDS
        TPSVPAQF SK+ LR  R  D  ++  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++ S K RR G+ SDSD+RDS
Subjt:  TPSVPAQFFSKSSLRGWRTCD--SETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGEESDSDYRDS

Query:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMR
        SSD SSDS+++R                    R+D +SLRDQH    ED SSD+ E   S+GRL+FEYLERDLPY REP ADK   LDLA++FP+L T+R
Subjt:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMR

Query:  SCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQ
        SCDLL  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT+  G  S Q   +  P     +EK+ L +FGLASYKF+G SLW   GG EHQL NSL Q
Subjt:  SCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQ

Query:  AADHWLRFLQVNHPDFLFFSRR
        AAD WL    V+HPDFLFF RR
Subjt:  AADHWLRFLQVNHPDFLFFSRR

AT2G01260.2 Protein of unknown function (DUF789) | e value: 1.2e-84 | % identity: 53.33
Query:  MLGAGVRFGRGR-GEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQ---PVVSPLSNLERFLQSV
        MLGAG +  RGR G+D FY S++ RR   +++ D+L R Q D S  PS                         + P+P+ Q   P     SNL+RFL+SV
Subjt:  MLGAGVRFGRGR-GEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQ---PVVSPLSNLERFLQSV

Query:  TPSVPAQFFSKSSLRGWRTCD--SETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGEESDSDYRDS
        TPSVPAQF SK+ LR  R  D  ++  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++ S K RR G+ SDSD+RDS
Subjt:  TPSVPAQFFSKSSLRGWRTCD--SETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGEESDSDYRDS

Query:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMR
        SSD SSDS+++R                    R+D +SLRDQH    ED SSD+ E   S+GRL+FEYLERDLPY REP ADK   LDLA++FP+L T+R
Subjt:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMR

Query:  SCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRG
        SCDLL  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT+  G
Subjt:  SCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRG

AT4G16100.1 Protein of unknown function (DUF789) | e value: 9.5e-77 | % identity: 45.34
Query:  RGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDE-ATKPVAVP---NPNPQPVVSPLSNLERFLQSVTPSVPAQFF
        R RGE+RFY+    R+    R+  RL   + +     +  + D  +          +E +T   +VP   +       +  SNL RFL   TP V  Q  
Subjt:  RGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDE-ATKPVAVP---NPNPQPVVSPLSNLERFLQSVTPSVPAQFF

Query:  SKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPS---AKPRRWGEESDSDY-RDSSSDGSSDSETKR
          +S +GWRT + E +PYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY  +PS      RR GEESD D  RD SSDGS+D     
Subjt:  SKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPS---AKPRRWGEESDSDY-RDSSSDGSSDSETKR

Query:  RIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAE-SFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYSWIS
             REL  +          + R SL ++        SSDE+E S NS G L+FEYLE  +P+ REPL DK SN  L+S+FP L+T RSCDL P SW+S
Subjt:  RIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAE-SFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYSWIS

Query:  VAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQV
        VAWYPIYRIP GQ+L++LDACFLT+HSL T  RG  + +    +   K+ ++ K+PL  FGLASYKFK S     +   E+Q   +L + A+ WLR L+V
Subjt:  VAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQV

Query:  NHPDFLFF
          PDF  F
Subjt:  NHPDFLFF

AT5G49220.1 Protein of unknown function (DUF789) | e value: 4.7e-76 | % identity: 44.89
Query:  RGEDRFYDSSRARR-----GLLSRQNDRLCRPQED-----------ASATPSCVVKDSSLHSPITHRVAS-DEATKPVAVPNPNPQPVVSPLSNLERFLQ
        RGE+RFY+    RR      L  +  ++  R  ED           A+  P    K   +    +  V S  E     +  +     V+S  SNL+RFL+
Subjt:  RGEDRFYDSSRARR-----GLLSRQNDRLCRPQED-----------ASATPSCVVKDSSLHSPITHRVAS-DEATKPVAVPNPNPQPVVSPLSNLERFLQ

Query:  SVTPSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDS
          TP VPA+ F   S    +T +S+   YFVL DLWE+F EWSAYGAGV     PL ++  D  VQYYVPYLSGIQLY ++P  KPR     +     + 
Subjt:  SVTPSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDS

Query:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMR
        SS+GSS+S T               P   +   ++R+SL+DQ   +    SS EAE  N +GRLLFEYLE + P+ REPLA+K S  DLASR P+L T R
Subjt:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMR

Query:  SCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCK-TDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLS
        SCDLLP SW+SV+WYPIYRIP G TL++LDACFLT+HSL TA   PQS      A  C  +  + K+PL  FGLASYK K  S+W +N   E Q   SL 
Subjt:  SCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCK-TDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLS

Query:  QAADHWLRFLQVNHPDFLFFS
        QAAD WL+ LQV+HPD+ FF+
Subjt:  QAADHWLRFLQVNHPDFLFFS


Sequences
CDS sequence
ATGTTAGGAGCCGGTGTACGGTTTGGTCGCGGCAGGGGAGAGGACCGGTTCTACGATTCATCCAGAGCGCGCAGAGGCCTTCTCAGTCGTCAAAATGATCGGCTGTGTAG
ACCTCAAGAAGACGCTTCCGCTACTCCATCCTGCGTCGTTAAGGATTCTTCGCTGCATTCTCCGATTACGCACCGCGTCGCCTCTGATGAAGCTACTAAACCAGTTGCCG
TTCCTAATCCTAATCCCCAGCCGGTTGTTTCCCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCCTCTGTGCCTGCTCAGTTTTTCTCCAAGAGTTCGTTG
AGAGGTTGGAGGACGTGCGATTCGGAGACGCAACCGTACTTCGTGCTTGGGGATTTGTGGGAGGCCTTCAAGGAGTGGAGCGCTTATGGGGCAGGAGTGCCTCTGCTATT
GAATAACACTGATGGTGTGGTTCAATATTACGTCCCGTATTTGTCTGGTATACAACTGTACGGCATGGAACCGTCGGCAAAGCCAAGGCGATGGGGTGAGGAAAGTGACA
GTGACTATAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACCAAGAGAAGAATAAAACACACTAGAGAGCTACTCCACCATAATGATCCGTCTATCACAGCTCCT
CTTAGAATAGATAGATTGTCTTTGAGGGATCAGCACATGGGACTTCATGAAGACTGCTCCAGTGATGAGGCTGAATCTTTCAATTCTGAAGGTCGCCTTCTATTCGAGTA
TCTTGAAAGAGACCTACCGTATTCACGTGAACCTTTGGCTGACAAGGCAAGTAATTTGGACCTTGCTTCGCGCTTCCCTCAGCTGAAAACAATGAGAAGTTGTGACCTAC
TACCATATAGTTGGATATCTGTGGCATGGTACCCAATTTACAGAATACCAACCGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACGTACCATTCTCTACATACA
GCAATCAGAGGCCCTCAAAGCACACAAGTGCCATTTGTGGCATATCCTTGCAAGACGGATAGTGCCGAAAAGATTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTT
TAAAGGGTCGTCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAGTTGGCAAACTCCCTCTCGCAGGCTGCAGATCACTGGTTAAGATTTCTCCAGGTCAATCACCCGG
ATTTCCTGTTCTTCAGCCGCCGAGATGCAACACCTTACTGA
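
The CDS above and the protein sequence listed for this gene should correspond codon by codon. As a minimal sketch, the snippet below translates the first 42 nucleotides of the CDS with the standard genetic code and prints the result for comparison with the start of the protein sequence. The helper `translate` and the compact codon-table construction are illustrative and not part of the database. Use of the standard nuclear code is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 42 nucleotides of the CDS sequence above.
cds_head = "ATGTTAGGAGCCGGTGTACGGTTTGGTCGCGGCAGGGGAGAG"
protein_head = translate(cds_head)
print(protein_head)
```

Passing the full CDS to the same function would yield the complete protein followed by `*` for the terminal stop codon.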
mRNA sequence
ATGGAAAGTCCGAAGCAGCACGTTTGGTTTCATAGAGAAAGAGAGAGAGATTTTTCTTTTCTTCAAATATATATTTTTTTAAATTTACTTTTTGGAATGAAAAAGGAGAT
TTCCAATTTCTTTTAATCCTAATCGTCTCTTCATCGAACCGTTTTGATGTGTCTACCATTCTCACCCATCTGAAAACCCGGCTCTTCCTCCCGTTCGATTCACTGTGTGT
TTTCGTCTTCGTTTCGGCTTCATATACATCGCCGCTGTTCCTCCGATCGGTTCTTGTTTCCTTGAATTTCTTCGCCGGCGATCTGATTTACCAGGCAGATCCGACGATTG
CTGTTTCTGCCGTTTGGAATTTCGACAATCGGATCGCTCCTCATTTTCTTCGAGTTGGAGTTCGTCGTCGTTGACTTTGGTTAGGGATTTCTAATAGGAACTTACAGAGT
AGTGTTTGAGTTATCAGTAGATAGACTGTGGAGATGTTAGGAGCCGGTGTACGGTTTGGTCGCGGCAGGGGAGAGGACCGGTTCTACGATTCATCCAGAGCGCGCAGAGG
CCTTCTCAGTCGTCAAAATGATCGGCTGTGTAGACCTCAAGAAGACGCTTCCGCTACTCCATCCTGCGTCGTTAAGGATTCTTCGCTGCATTCTCCGATTACGCACCGCG
TCGCCTCTGATGAAGCTACTAAACCAGTTGCCGTTCCTAATCCTAATCCCCAGCCGGTTGTTTCCCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCCTCT
GTGCCTGCTCAGTTTTTCTCCAAGAGTTCGTTGAGAGGTTGGAGGACGTGCGATTCGGAGACGCAACCGTACTTCGTGCTTGGGGATTTGTGGGAGGCCTTCAAGGAGTG
GAGCGCTTATGGGGCAGGAGTGCCTCTGCTATTGAATAACACTGATGGTGTGGTTCAATATTACGTCCCGTATTTGTCTGGTATACAACTGTACGGCATGGAACCGTCGG
CAAAGCCAAGGCGATGGGGTGAGGAAAGTGACAGTGACTATAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACCAAGAGAAGAATAAAACACACTAGAGAGCTA
CTCCACCATAATGATCCGTCTATCACAGCTCCTCTTAGAATAGATAGATTGTCTTTGAGGGATCAGCACATGGGACTTCATGAAGACTGCTCCAGTGATGAGGCTGAATC
TTTCAATTCTGAAGGTCGCCTTCTATTCGAGTATCTTGAAAGAGACCTACCGTATTCACGTGAACCTTTGGCTGACAAGGCAAGTAATTTGGACCTTGCTTCGCGCTTCC
CTCAGCTGAAAACAATGAGAAGTTGTGACCTACTACCATATAGTTGGATATCTGTGGCATGGTACCCAATTTACAGAATACCAACCGGGCAAACATTAAAGGATCTTGAT
GCTTGCTTTCTCACGTACCATTCTCTACATACAGCAATCAGAGGCCCTCAAAGCACACAAGTGCCATTTGTGGCATATCCTTGCAAGACGGATAGTGCCGAAAAGATTCC
TTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAAAGGGTCGTCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAGTTGGCAAACTCCCTCTCGCAGGCTGCAGATC
ACTGGTTAAGATTTCTCCAGGTCAATCACCCGGATTTCCTGTTCTTCAGCCGCCGAGATGCAACACCTTACTGATACTATATTCTTACCAACTAGAAAGCAAAGCATAAC
GAGTTGTGGCCCTGGAAAAGAATCCTGTAATTTTGTGCGTTGCGCTGTATGCTTGGTGAAAGGGCGGTTCTCTCGTTTGCTTTCACTAAGGTGAGGGGAACTGGAGGGGA
AGGCTAAGGAATGGAATAGGTCCCCAAAAACTGAAGGCTGTAGATGGTCGGCAAGCACATGCCTGATGTAATAACAAAGACCAATGGAAAGTGCATGGTAAAAATGTAAG
AGAAAAAGCCTTGGAACTGGAAGGTAGACGTTGGTTCGACGTGTATTAATAAGCAGGTTTCCTTGCCTTTTTAAATCTCTCTGACCGACAGTCCTAATTTTGTTAGAATG
TAATGTTATTTAACGGGATATTAATAGGTTTGATTCAGGAGCTCAGATACAGCAGAGCATTGAACTGTATCGCAGGAGTTCCCTTGTCCACGAACTGGTTATAACCTTTA
GAAATGTTTATGGAAAGCGTTCAATACGCCCTGAACTTGGGTGTTTTTCCGTTCTTGGTTTCCTTAAAATGGCTTTGAGGCTTGATGAAACAATGGGAATCCAAGCGGAT
CCGGAGCAATGGAATGCTTGATGACCTTGTTTTTGTTCTCTTCTGTAGAAATGTAAATGTACGGTACCAGCATCTACTGGAGGGGGCCATTCGTAGGAACAGCGTCGGCT
TCCCTGGTGCTCGTAAGTGTACTGTGAGAAAGAGAGAGAGAAAAGAAAAAGGGACGATGCCGATAATGGCGGTTTCTGCTATTGTATTGGCCACCATCACTTCTCTGCAT
CTCATCGCCTTCGTCCTCGCCGTCGGCGCCGAGCGCCGCCGCAGCACGGCCAAGGTTGTGCCGGATGAGTACGACGAGCAGACGTACTGCGTGTACGGCACCGACGCGTC
GACGGTGTACGGACTGTCGGCGTTCGGTTTGCTTCTGATAAGCCAGACGGTGGTTAACGGCGTGACCAAGTGTCTGTGCTGTGGGAAAGGTTTAATCAGCGGATCAAAAA
CCACCGCCGTCGCCATTTTCTTCTTCGTATTCTCCTGGATGGGGTTTGTGGGAGCGGAGATTTGCCTGCTGGCGGGATCGGCGAGGAACGCGTACCACACCAAGTACAGG
GCGGCGTTCGGCGGGGAAAATCTGTCGTGTGCGACGCTGCGGAAGGGGGTGTTCGCCGGCGCCGGCGCCATGACGGTGGTGTCGTTGGTGGGGTCGGTTGTGTACTACGT
GGCGCACTCGAGGGCCGACACCGGAGGATGGGTGAAGCAGCGGCGGAATGAGAATGAGAATGATGGGCTGCCCATGGCGGCGGCGCCTTACGATCTGAAACAGAACGCCT
AGGGTTTGGTTTTTGGTTTTTGATTTATTATATTAGAAATCAGAGGGAAAGGGAAGTGTTTGGTTTTTGGTTGGTAGAGAAAAAGAGAGGGAAACTGTTAATTAGTCAAA
TCCACCTTTCTTTTTCTTTTTCTTTTTTGTTTTCTTTTTGGGTCTTTTGCTTTATTTATGTATATTTAGATTTTAGGGTTGCTTTACGACATTTGGGTCTCCAACGTTTG
ATTCGTTCTCTGCTGTGATGGTGCCATTTGGCTGTGTTTATGGTCTTTTTATTTTTACAGTGAGA
Protein sequence
MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSVPAQFFSKSSL
RGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETKRRIKHTRELLHHNDPSITAP
LRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASNLDLASRFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT
AIRGPQSTQVPFVAYPCKTDSAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNHPDFLFFSRRDATPY