CuGenDBv2

MS015365 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS015365
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Protein of unknown function (DUF789)
Genome location: scaffold2:2895442..2897959
RNA-Seq Expression: MS015365
Synteny: MS015365
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
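
The genome location in this record follows the common `sequence:start..end` pattern. As a minimal sketch, the Python below parses that string and derives the length of the span. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'scaffold:start..end' into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


scaffold, start, end = parse_location("scaffold2:2895442..2897959")
print(scaffold, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 2518 positions on scaffold2.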


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573941.1 hypothetical protein SDJN03_27828, partial [Cucurbita argyrosperma subsp. sororia]; e value: 6.9e-199; % identity: 85.44
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERDLPYSREPLADK SDLASRFP+LKT+RSCDLLP SWI
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA+ G QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ

Query:  VNHPDFLFFSRR
        VNHPDF+FFS+R
Subjt:  VNHPDFLFFSRR

XP_022150656.1 uncharacterized protein LOC111018737 [Momordica charantia]; e value: 4.4e-238; % identity: 99.27
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV
        MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV

Query:  PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK
        PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK
Subjt:  PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK

Query:  RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWISVA
        RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADK  DLASRFPQLKTMRSCDLLPYSWISVA
Subjt:  RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWISVA

Query:  WYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNH
        WYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTD AEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNH
Subjt:  WYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNH

Query:  PDFLFFSRR
        PDFLFFSRR
Subjt:  PDFLFFSRR

XP_022945839.1 uncharacterized protein LOC111449961 isoform X2 [Cucurbita moschata]; e value: 9.0e-199; % identity: 85.44
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PI     RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK SDLASRFPQLKT+RSCDLLP SWI
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+ GPQS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ

Query:  VNHPDFLFFSRR
        VNHPDF+FFSRR
Subjt:  VNHPDFLFFSRR

XP_022968120.1 uncharacterized protein LOC111467452 isoform X2 [Cucurbita maxima]; e value: 5.8e-198; % identity: 84.95
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKP+      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK SDLASRFPQLKTMRSCDLLP SWI
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA+ G QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WL+ LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ

Query:  VNHPDFLFFSRR
        VNHPDF+FF RR
Subjt:  VNHPDFLFFSRR

XP_023541256.1 uncharacterized protein LOC111801477 [Cucurbita pepo subsp. pepo]; e value: 2.0e-198; % identity: 85.44
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFG  RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERDLPYSREPLADK SDLASRFPQLKT+RSCDLLP SWI
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYHSLHT + G QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ

Query:  VNHPDFLFFSRR
        VNHPDF+FFSRR
Subjt:  VNHPDFLFFSRR

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DC61 uncharacterized protein LOC111018737; e value: 2.1e-238; % identity: 99.27
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV
        MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSV

Query:  PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK
        PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK
Subjt:  PAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETK

Query:  RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWISVA
        RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADK  DLASRFPQLKTMRSCDLLPYSWISVA
Subjt:  RRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWISVA

Query:  WYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNH
        WYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTD AEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNH
Subjt:  WYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNH

Query:  PDFLFFSRR
        PDFLFFSRR
Subjt:  PDFLFFSRR

A0A6J1G225 uncharacterized protein LOC111449961 isoform X2; e value: 4.4e-199; % identity: 85.44
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PI     RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK SDLASRFPQLKT+RSCDLLP SWI
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+ GPQS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ

Query:  VNHPDFLFFSRR
        VNHPDF+FFSRR
Subjt:  VNHPDFLFFSRR

A0A6J1G242 uncharacterized protein LOC111449961 isoform X1; e value: 1.8e-197; % identity: 84.82
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PI     RV SDEATKPV      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QPYFVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK SDLASRFPQLKT+RSCDLLP SWI
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRG---PQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLR
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+ G   PQS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WLR
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRG---PQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLR

Query:  FLQVNHPDFLFFSRR
         LQVNHPDF+FFSRR
Subjt:  FLQVNHPDFLFFSRR

A0A6J1HWA7 uncharacterized protein LOC111467452 isoform X2; e value: 2.8e-198; % identity: 84.95
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKP+      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK SDLASRFPQLKTMRSCDLLP SWI
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA+ G QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WL+ LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQ

Query:  VNHPDFLFFSRR
        VNHPDF+FF RR
Subjt:  VNHPDFLFFSRR

A0A6J1HYQ4 uncharacterized protein LOC111467452 isoform X1; e value: 9.1e-197; % identity: 84.34
Query:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT
        MLGAGVRFGR RGEDRFYDSSRAR+GLLSRQNDRLCRPQE ASATPSC VKD S+  PIT    RV SDEATKP+      PQP VSPLSNLERFLQSVT
Subjt:  MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITH---RVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVT

Query:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS
        PSVPAQF SKS+LRGW+T DSE QP+FVLGDLWE FKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYG E S KPR+WGEE+DSDYRDSSSDGSSDS
Subjt:  PSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDS

Query:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI
        ETKRRIKH RE  HHNDPSITAPLR+DRLSLRDQH+G  EDCSSDEAES NS+G LLFEYLERD PYSREPLADK SDLASRFPQLKTMRSCDLLP SWI
Subjt:  ETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGP---QSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLR
        SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTA+ G    QS Q+PFVAYPCKTD A+K+PLRIFGLASYKFKGSSLWMRNGGVEHQLAN LS+ AD WL+
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGP---QSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLR

Query:  FLQVNHPDFLFFSRR
         LQVNHPDF+FF RR
Subjt:  FLQVNHPDFLFFSRR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789); e value: 1.1e-104; % identity: 61.52
Query:  SNLERFLQSVTPSVPAQFFSKSSLRGWRTCDSETQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGE
        SN+ERFL SVTPSVPA + SK+ +R     D E+Q PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y     +  S + RR GE
Subjt:  SNLERFLQSVTPSVPAQFFSKSSLRGWRTCDSETQ-PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGE

Query:  ESDSDYRDSSSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRF
        ES+SD+RDSSS+GSS SE++R + +++E +           R+D+LSLR +H    ED SSD+ E  +S+GRL+FEYLERDLPY REP ADK SDLASRF
Subjt:  ESDSDYRDSSSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRF

Query:  PQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGP-QSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEH
        P+LKT+RSCDLLP SW SVAWYPIY+IPTG TLKDLDACFLTYHSLHT  +GP  +T    V  P   +  EK+ L +FGLASYK +G S+W   GG  H
Subjt:  PQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGP-QSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEH

Query:  QLANSLSQAADHWLRFLQVNHPDFLFFSRR
        QLANSL QAAD+WLR  QVNHPDF+FF RR
Subjt:  QLANSLSQAADHWLRFLQVNHPDFLFFSRR

AT2G01260.1 Protein of unknown function (DUF789); e value: 6.0e-108; % identity: 53.81
Query:  MLGAGVRFGRGR-GEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQ---PVVSPLSNLERFLQSV
        MLGAG +  RGR G+D FY S++ RR   +++ D+L R Q D S  PS                         + P+P+ Q   P     SNL+RFL+SV
Subjt:  MLGAGVRFGRGR-GEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQ---PVVSPLSNLERFLQSV

Query:  TPSVPAQFFSKSSLRGWRTCD--SETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGEESDSDYRDS
        TPSVPAQF SK+ LR  R  D  ++  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++ S K RR G+ SDSD+RDS
Subjt:  TPSVPAQFFSKSSLRGWRTCD--SETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGEESDSDYRDS

Query:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSC
        SSD SSDS+++R                    R+D +SLRDQH    ED SSD+ E   S+GRL+FEYLERDLPY REP ADK  DLA++FP+L T+RSC
Subjt:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSC

Query:  DLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAA
        DLL  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT+  G  S Q   +  P +   +EK+ L +FGLASYKF+G SLW   GG EHQL NSL QAA
Subjt:  DLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAA

Query:  DHWLRFLQVNHPDFLFFSRR
        D WL    V+HPDFLFF RR
Subjt:  DHWLRFLQVNHPDFLFFSRR

AT2G01260.2 Protein of unknown function (DUF789); e value: 1.4e-85; % identity: 53.35
Query:  MLGAGVRFGRGR-GEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQ---PVVSPLSNLERFLQSV
        MLGAG +  RGR G+D FY S++ RR   +++ D+L R Q D S  PS                         + P+P+ Q   P     SNL+RFL+SV
Subjt:  MLGAGVRFGRGR-GEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQ---PVVSPLSNLERFLQSV

Query:  TPSVPAQFFSKSSLRGWRTCD--SETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGEESDSDYRDS
        TPSVPAQF SK+ LR  R  D  ++  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y     ++ S K RR G+ SDSD+RDS
Subjt:  TPSVPAQFFSKSSLRGWRTCD--SETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----GMEPSAKPRRWGEESDSDYRDS

Query:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSC
        SSD SSDS+++R                    R+D +SLRDQH    ED SSD+ E   S+GRL+FEYLERDLPY REP ADK  DLA++FP+L T+RSC
Subjt:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSC

Query:  DLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRG
        DLL  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT+  G
Subjt:  DLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRG

AT4G16100.1 Protein of unknown function (DUF789); e value: 1.1e-77; % identity: 45.32
Query:  RGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDE-ATKPVAVP---NPNPQPVVSPLSNLERFLQSVTPSVPAQFF
        R RGE+RFY+    R+    R+  RL   + +     +  + D  +          +E +T   +VP   +       +  SNL RFL   TP V  Q  
Subjt:  RGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDE-ATKPVAVP---NPNPQPVVSPLSNLERFLQSVTPSVPAQFF

Query:  SKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPS---AKPRRWGEESDSDY-RDSSSDGSSDSETKR
          +S +GWRT + E +PYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY  +PS      RR GEESD D  RD SSDGS+D     
Subjt:  SKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPS---AKPRRWGEESDSDY-RDSSSDGSSDSETKR

Query:  RIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAE-SFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWISVA
             REL  +          + R SL ++        SSDE+E S NS G L+FEYLE  +P+ REPL DK S+L+S+FP L+T RSCDL P SW+SVA
Subjt:  RIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAE-SFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWISVA

Query:  WYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNH
        WYPIYRIP GQ+L++LDACFLT+HSL T  RG  + +    +   K+  + K+PL  FGLASYKFK S     +   E+Q   +L + A+ WLR L+V  
Subjt:  WYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNH

Query:  PDFLFF
        PDF  F
Subjt:  PDFLFF

AT5G49220.1 Protein of unknown function (DUF789); e value: 1.4e-77; % identity: 45.11
Query:  RGEDRFYDSSRARR-----GLLSRQNDRLCRPQED-----------ASATPSCVVKDSSLHSPITHRVAS-DEATKPVAVPNPNPQPVVSPLSNLERFLQ
        RGE+RFY+    RR      L  +  ++  R  ED           A+  P    K   +    +  V S  E     +  +     V+S  SNL+RFL+
Subjt:  RGEDRFYDSSRARR-----GLLSRQNDRLCRPQED-----------ASATPSCVVKDSSLHSPITHRVAS-DEATKPVAVPNPNPQPVVSPLSNLERFLQ

Query:  SVTPSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDS
          TP VPA+ F   S    +T +S+   YFVL DLWE+F EWSAYGAGV     PL ++  D  VQYYVPYLSGIQLY ++P  KPR     +     + 
Subjt:  SVTPSVPAQFFSKSSLRGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDS

Query:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSC
        SS+GSS+S T               P   +   ++R+SL+DQ   +    SS EAE  N +GRLLFEYLE + P+ REPLA+K SDLASR P+L T RSC
Subjt:  SSDGSSDSETKRRIKHTRELLHHNDPSITAPLRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSC

Query:  DLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCK-TDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQA
        DLLP SW+SV+WYPIYRIP G TL++LDACFLT+HSL TA   PQS      A  C  +  + K+PL  FGLASYK K  S+W +N   E Q   SL QA
Subjt:  DLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAIRGPQSTQVPFVAYPCK-TDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQA

Query:  ADHWLRFLQVNHPDFLFFS
        AD WL+ LQV+HPD+ FF+
Subjt:  ADHWLRFLQVNHPDFLFFS
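
In the alignment blocks of this section, the unlabelled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, the Python below counts identities and positives for one block from that middle line. The function name `midline_stats` is hypothetical, and the figures it returns describe a single block only; the % identity values reported for each hit are computed by the search tool over the whole alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    `query` is the text after 'Query:' and `midline` is the unlabelled
    line beneath it, aligned column by column.
    """
    columns = len(query)
    # Trailing spaces in the midline are often stripped; pad to be safe.
    midline = midline.ljust(columns)[:columns]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, columns


# Final block of the first GenBank hit, copied from the alignment text.
ident, pos, cols = midline_stats("VNHPDFLFFSRR", "VNHPDF+FFS+R")
print(f"{ident}/{cols} identical, {pos}/{cols} positive")
```

Summing the three counts over every block of a hit and dividing identities by columns gives an estimate comparable to the reported % identity, provided the blocks were copied without losing leading or trailing whitespace.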


Sequences
CDS sequence
ATGTTAGGAGCCGGTGTACGGTTTGGTCGCGGCAGGGGAGAGGACCGGTTCTACGATTCATCCAGAGCGCGCAGAGGCCTTCTCAGTCGTCAAAATGATCGGCTGTGTAG
ACCTCAAGAAGACGCTTCCGCTACTCCATCCTGCGTCGTTAAGGATTCTTCGCTGCATTCTCCGATTACGCACCGCGTCGCCTCTGATGAAGCTACTAAACCAGTTGCCG
TTCCTAATCCTAATCCCCAGCCGGTTGTTTCCCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCCTCTGTGCCTGCTCAGTTTTTCTCCAAGAGTTCGTTG
AGAGGTTGGAGGACGTGCGATTCGGAGACGCAACCGTACTTCGTGCTTGGGGATTTGTGGGAGGCCTTCAAGGAGTGGAGCGCTTATGGGGCAGGAGTGCCTCTGCTATT
GAATAACACTGATGGTGTGGTTCAATATTACGTCCCGTATTTGTCTGGTATACAACTGTACGGCATGGAACCGTCGGCAAAGCCAAGGCGATGGGGTGAGGAAAGTGACA
GTGACTATAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACCAAGAGAAGAATAAAACACACTAGAGAGCTACTCCACCATAATGATCCGTCTATCACAGCTCCT
CTTAGAATAGATAGATTGTCTTTGAGGGATCAGCACATGGGACTTCATGAAGACTGCTCCAGTGATGAGGCTGAATCTTTCAATTCTGAAGGTCGCCTTCTATTCGAGTA
TCTTGAAAGAGACCTACCGTATTCACGTGAACCTTTGGCTGACAAGGCAAGTGACCTTGCTTCGCGCTTCCCTCAGCTGAAAACAATGAGAAGTTGTGACCTACTACCAT
ATAGTTGGATATCTGTGGCATGGTACCCAATTTACAGAATACCAACCGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACGTACCATTCTCTACATACAGCAATC
AGAGGCCCTCAAAGCACACAAGTGCCATTTGTGGCATATCCTTGCAAGACGGATGGTGCCGAAAAGATTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAAAGG
GTCGTCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAGTTGGCAAACTCCCTCTCGCAGGCTGCAGATCACTGGTTAAGATTTCTCCAGGTCAATCACCCGGATTTCC
TGTTCTTCAGCCGCCGA
mRNA sequence
ATGTTAGGAGCCGGTGTACGGTTTGGTCGCGGCAGGGGAGAGGACCGGTTCTACGATTCATCCAGAGCGCGCAGAGGCCTTCTCAGTCGTCAAAATGATCGGCTGTGTAG
ACCTCAAGAAGACGCTTCCGCTACTCCATCCTGCGTCGTTAAGGATTCTTCGCTGCATTCTCCGATTACGCACCGCGTCGCCTCTGATGAAGCTACTAAACCAGTTGCCG
TTCCTAATCCTAATCCCCAGCCGGTTGTTTCCCCGTTAAGTAATCTCGAGCGCTTCTTGCAGTCGGTTACTCCCTCTGTGCCTGCTCAGTTTTTCTCCAAGAGTTCGTTG
AGAGGTTGGAGGACGTGCGATTCGGAGACGCAACCGTACTTCGTGCTTGGGGATTTGTGGGAGGCCTTCAAGGAGTGGAGCGCTTATGGGGCAGGAGTGCCTCTGCTATT
GAATAACACTGATGGTGTGGTTCAATATTACGTCCCGTATTTGTCTGGTATACAACTGTACGGCATGGAACCGTCGGCAAAGCCAAGGCGATGGGGTGAGGAAAGTGACA
GTGACTATAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACCAAGAGAAGAATAAAACACACTAGAGAGCTACTCCACCATAATGATCCGTCTATCACAGCTCCT
CTTAGAATAGATAGATTGTCTTTGAGGGATCAGCACATGGGACTTCATGAAGACTGCTCCAGTGATGAGGCTGAATCTTTCAATTCTGAAGGTCGCCTTCTATTCGAGTA
TCTTGAAAGAGACCTACCGTATTCACGTGAACCTTTGGCTGACAAGGCAAGTGACCTTGCTTCGCGCTTCCCTCAGCTGAAAACAATGAGAAGTTGTGACCTACTACCAT
ATAGTTGGATATCTGTGGCATGGTACCCAATTTACAGAATACCAACCGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTCACGTACCATTCTCTACATACAGCAATC
AGAGGCCCTCAAAGCACACAAGTGCCATTTGTGGCATATCCTTGCAAGACGGATGGTGCCGAAAAGATTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAAAGG
GTCGTCATTGTGGATGCGAAATGGTGGAGTTGAGCATCAGTTGGCAAACTCCCTCTCGCAGGCTGCAGATCACTGGTTAAGATTTCTCCAGGTCAATCACCCGGATTTCC
TGTTCTTCAGCCGCCGA
Protein sequence
MLGAGVRFGRGRGEDRFYDSSRARRGLLSRQNDRLCRPQEDASATPSCVVKDSSLHSPITHRVASDEATKPVAVPNPNPQPVVSPLSNLERFLQSVTPSVPAQFFSKSSL
RGWRTCDSETQPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYGMEPSAKPRRWGEESDSDYRDSSSDGSSDSETKRRIKHTRELLHHNDPSITAP
LRIDRLSLRDQHMGLHEDCSSDEAESFNSEGRLLFEYLERDLPYSREPLADKASDLASRFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTAI
RGPQSTQVPFVAYPCKTDGAEKIPLRIFGLASYKFKGSSLWMRNGGVEHQLANSLSQAADHWLRFLQVNHPDFLFFSRR
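
The protein sequence corresponds to the CDS read three nucleotides at a time. As a minimal sketch of that relationship, the Python below translates a nucleotide string with the standard genetic code. It assumes the standard nuclear code applies to this gene, and `translate` is a hypothetical helper written for illustration, not a CuGenDB function. The example input is the first 21 nucleotides of the CDS listed above.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; unknown codons become 'X', stops '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First seven codons of the CDS sequence in this record.
print(translate("ATGTTAGGAGCCGGTGTACGG"))
```

Joining the CDS lines into one string and passing it to `translate` allows a direct comparison with the protein sequence listed in this record.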