CuGenDBv2

Sed0002548 (gene) of Chayote v1 genome

Gene ID: Sed0002548
Organism: Sechium edule (Chayote v1)
Description: Protein of unknown function (DUF789)
Genome location: LG07:36609239..36612797
RNA-Seq Expression: Sed0002548
Synteny: Sed0002548
Gene Ontology terms: NA
InterPro domains: IPR008507 - Protein of unknown function DUF789
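The genome location string can be split into its parts to get the span of the gene. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation; the page itself does not state this). The helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("LG07:36609239..36612797")
print(chrom, start, end, length)
```

Under that assumption the gene spans 3559 bp on LG07, which is longer than the mRNA sequence further down and would be consistent with the presence of introns.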


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573941.1 hypothetical protein SDJN03_27828, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.3e-193 | % identity: 82.65
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGR RGEDRFYDSSRAR+GLLSRQ+DRLCRPQ+ ASAT S AVKD+  +     A  RV SD+A KP+    P+P VSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQFLSKSALR W+TSD ER+PYFVLGDLWE FKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYG ES  KPR+WGEE+DSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI++ REP HH DPSITAPLRM+RLSLRDQHLG LEDCSSDEAESCN QG LLFEYLERD PYSREPLADKI DLASRFP LKTLRSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+GG Q+PQ+PFVAYPCKTDA+KVPLR+FGLASYKFKGSSLWM+NGGVEHQLANKLS+EADKWLR+LQV
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV

Query:  NHPDFLFFSRQDLTP
        NHPDF+FFS++DL P
Subjt:  NHPDFLFFSRQDLTP

XP_022150656.1 uncharacterized protein LOC111018737 [Momordica charantia] | e value: 3.3e-193 | % identity: 83.41
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKP--LPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGRGRGEDRFYDSSRARRGLLSRQ+DRLCRPQ+DASAT S  VKD  +  S I  RVASD+A KP  +P  NP+PVVSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKP--LPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQF SKS+LR WRT D E +PYFVLGDLWEAFKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYGME   KPRRWGEESDSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI+++RE LHH DPSITAPLR++RLSLRDQH+GL EDCSSDEAES N +G LLFEYLERD PYSREPLADKILDLASRFP LKT+RSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTD-AEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTAI GPQ+ QVPFVAYPCKTD AEK+PLR+FGLASYKFKGSSLWM+NGGVEHQLAN LSQ AD WLR LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTD-AEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQ

Query:  VNHPDFLFFSRQDLTP
        VNHPDFLFFSR+D TP
Subjt:  VNHPDFLFFSRQDLTP

XP_022945838.1 uncharacterized protein LOC111449961 isoform X1 [Cucurbita moschata] | e value: 1.2e-195 | % identity: 83.25
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGR RGEDRFYDSSRAR+GLLSRQ+DRLCRPQ+ ASAT S AVKD+  +   I A  RV SD+A KP+    P+P VSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQFLSKSALR W+TSD ER+PYFVLGDLWE FKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYG ES  KPR+WGEE+DSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI++ REP HH DPSITAPLRM+RLSLRDQHLG LEDCSSDEAESCN QG LLFEYLERDQPYSREPLADKI DLASRFP LKTLRSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGG---PQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRD
        SVAWYPIYRIPTGQTLKDLDACFLTYH+LHTA+GG   PQ+PQ+PFVAYPCKTDA+KVPLR+FGLASYKFKGSSLWM+NGGVEHQLANKLS+EADKWLRD
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGG---PQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRD

Query:  LQVNHPDFLFFSRQDLTP
        LQVNHPDF+FFSR+DL P
Subjt:  LQVNHPDFLFFSRQDLTP

XP_022945839.1 uncharacterized protein LOC111449961 isoform X2 [Cucurbita moschata] | e value: 2.9e-197 | % identity: 83.86
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGR RGEDRFYDSSRAR+GLLSRQ+DRLCRPQ+ ASAT S AVKD+  +   I A  RV SD+A KP+    P+P VSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQFLSKSALR W+TSD ER+PYFVLGDLWE FKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYG ES  KPR+WGEE+DSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI++ REP HH DPSITAPLRM+RLSLRDQHLG LEDCSSDEAESCN QG LLFEYLERDQPYSREPLADKI DLASRFP LKTLRSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV
        SVAWYPIYRIPTGQTLKDLDACFLTYH+LHTA+GGPQ+PQ+PFVAYPCKTDA+KVPLR+FGLASYKFKGSSLWM+NGGVEHQLANKLS+EADKWLRDLQV
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV

Query:  NHPDFLFFSRQDLTP
        NHPDF+FFSR+DL P
Subjt:  NHPDFLFFSRQDLTP

XP_022968120.1 uncharacterized protein LOC111467452 isoform X2 [Cucurbita maxima] | e value: 4.4e-193 | % identity: 82.17
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGR RGEDRFYDSSRAR+GLLSRQ+DRLCRPQ+ ASAT S AVKD+  +     A  RV SD+A KP+    P+P VSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQFLSKSALR W+TSD ER+P+FVLGDLWE FKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYG ES  KPR+WGEE+DSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI++ REP HH DPSITAPLRM+RLSLRDQHLG LEDCSSDEAESCN QG LLFEYLERDQPYSREPLADKI DLASRFP LKT+RSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+GG Q+PQ+PFVAYPCKTDA+KVPLR+FGLASYKFKGSSLWM+NGGVEHQLANKLS+EADKWL++LQV
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV

Query:  NHPDFLFFSRQDLTP
        NHPDF+FF R+DL P
Subjt:  NHPDFLFFSRQDLTP
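In these alignments the middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those symbols for a single alignment block; `midline_stats` is a hypothetical helper name, and the % identity reported for each hit covers the whole alignment, not one block.

```python
def midline_stats(midline):
    """Return (identical, positive, columns) for one BLAST midline block.

    Letters count as identities, '+' as conservative substitutions;
    positives are identities plus conservative substitutions.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positive = identical + midline.count("+")
    return identical, positive, len(midline)

# Midline of the final block of the Cucurbita maxima hit above.
identical, positive, columns = midline_stats("NHPDF+FF R+DL P")
print(f"{identical}/{columns} identical, {positive}/{columns} positive")
```

Summing the counts over every block of a hit and dividing by the total number of columns should give a figure close to the reported % identity, provided the midline text is copied with its spaces intact.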

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DC61 uncharacterized protein LOC111018737 | e value: 1.6e-193 | % identity: 83.41
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKP--LPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGRGRGEDRFYDSSRARRGLLSRQ+DRLCRPQ+DASAT S  VKD  +  S I  RVASD+A KP  +P  NP+PVVSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKP--LPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQF SKS+LR WRT D E +PYFVLGDLWEAFKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYGME   KPRRWGEESDSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI+++RE LHH DPSITAPLR++RLSLRDQH+GL EDCSSDEAES N +G LLFEYLERD PYSREPLADKILDLASRFP LKT+RSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTD-AEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQ
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTAI GPQ+ QVPFVAYPCKTD AEK+PLR+FGLASYKFKGSSLWM+NGGVEHQLAN LSQ AD WLR LQ
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTD-AEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQ

Query:  VNHPDFLFFSRQDLTP
        VNHPDFLFFSR+D TP
Subjt:  VNHPDFLFFSRQDLTP

A0A6J1G225 uncharacterized protein LOC111449961 isoform X2 | e value: 1.4e-197 | % identity: 83.86
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGR RGEDRFYDSSRAR+GLLSRQ+DRLCRPQ+ ASAT S AVKD+  +   I A  RV SD+A KP+    P+P VSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQFLSKSALR W+TSD ER+PYFVLGDLWE FKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYG ES  KPR+WGEE+DSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI++ REP HH DPSITAPLRM+RLSLRDQHLG LEDCSSDEAESCN QG LLFEYLERDQPYSREPLADKI DLASRFP LKTLRSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV
        SVAWYPIYRIPTGQTLKDLDACFLTYH+LHTA+GGPQ+PQ+PFVAYPCKTDA+KVPLR+FGLASYKFKGSSLWM+NGGVEHQLANKLS+EADKWLRDLQV
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV

Query:  NHPDFLFFSRQDLTP
        NHPDF+FFSR+DL P
Subjt:  NHPDFLFFSRQDLTP

A0A6J1G242 uncharacterized protein LOC111449961 isoform X1 | e value: 6.0e-196 | % identity: 83.25
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGR RGEDRFYDSSRAR+GLLSRQ+DRLCRPQ+ ASAT S AVKD+  +   I A  RV SD+A KP+    P+P VSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQFLSKSALR W+TSD ER+PYFVLGDLWE FKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYG ES  KPR+WGEE+DSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI++ REP HH DPSITAPLRM+RLSLRDQHLG LEDCSSDEAESCN QG LLFEYLERDQPYSREPLADKI DLASRFP LKTLRSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGG---PQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRD
        SVAWYPIYRIPTGQTLKDLDACFLTYH+LHTA+GG   PQ+PQ+PFVAYPCKTDA+KVPLR+FGLASYKFKGSSLWM+NGGVEHQLANKLS+EADKWLRD
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGG---PQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRD

Query:  LQVNHPDFLFFSRQDLTP
        LQVNHPDF+FFSR+DL P
Subjt:  LQVNHPDFLFFSRQDLTP

A0A6J1HWA7 uncharacterized protein LOC111467452 isoform X2 | e value: 2.1e-193 | % identity: 82.17
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGR RGEDRFYDSSRAR+GLLSRQ+DRLCRPQ+ ASAT S AVKD+  +     A  RV SD+A KP+    P+P VSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQFLSKSALR W+TSD ER+P+FVLGDLWE FKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYG ES  KPR+WGEE+DSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI++ REP HH DPSITAPLRM+RLSLRDQHLG LEDCSSDEAESCN QG LLFEYLERDQPYSREPLADKI DLASRFP LKT+RSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+GG Q+PQ+PFVAYPCKTDA+KVPLR+FGLASYKFKGSSLWM+NGGVEHQLANKLS+EADKWL++LQV
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQV

Query:  NHPDFLFFSRQDLTP
        NHPDF+FF R+DL P
Subjt:  NHPDFLFFSRQDLTP

A0A6J1HYQ4 uncharacterized protein LOC111467452 isoform X1 | e value: 6.8e-192 | % identity: 81.58
Query:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF
        MLGA VRFGR RGEDRFYDSSRAR+GLLSRQ+DRLCRPQ+ ASAT S AVKD+  +     A  RV SD+A KP+    P+P VSPLSNLERFLQSVTP 
Subjt:  MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAA--RVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPF

Query:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS
        VPAQFLSKSALR W+TSD ER+P+FVLGDLWE FKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLYG ES  KPR+WGEE+DSDYRDSSSDG  SSDS
Subjt:  VPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDS

Query:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI
        E KRRI++ REP HH DPSITAPLRM+RLSLRDQHLG LEDCSSDEAESCN QG LLFEYLERDQPYSREPLADKI DLASRFP LKT+RSCDLL  SWI
Subjt:  EIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWI

Query:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGP---QNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRD
        SVAWYPIYRIPTGQTLKDLDACFLTYH LHTA+GG    Q+PQ+PFVAYPCKTDA+KVPLR+FGLASYKFKGSSLWM+NGGVEHQLANKLS+EADKWL++
Subjt:  SVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGP---QNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRD

Query:  LQVNHPDFLFFSRQDLTP
        LQVNHPDF+FF R+DL P
Subjt:  LQVNHPDFLFFSRQDLTP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G15030.1 Protein of unknown function (DUF789) | e value: 4.6e-100 | % identity: 59.39
Query:  SNLERFLQSVTPFVPAQFLSKSALRSWRTSDLERR-PYFVLGDLWEAFKEWSAYGAGVPLLLNNS-DGVVQYYVPYLSGIQLY----GMESFIKPRRWGE
        SN+ERFL SVTP VPA +LSK+ +R    SD+E + PYF+LGD+WE+F EWSAYG GVPL LNN+ D V QYYVP LSGIQ+Y     + S ++ RR GE
Subjt:  SNLERFLQSVTPFVPAQFLSKSALRSWRTSDLERR-PYFVLGDLWEAFKEWSAYGAGVPLLLNNS-DGVVQYYVPYLSGIQLY----GMESFIKPRRWGE

Query:  ESDSDYRDSSSDGSTSSDSEIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLAS
        ES+SD+RDSSS+GS+   SE +R + YS+E +           RM++LSLR +H    ED SSD+ E  + QG L+FEYLERD PY REP ADK+ DLAS
Subjt:  ESDSDYRDSSSDGSTSSDSEIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLAS

Query:  RFPPLKTLRSCDLLTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEH
        RFP LKTLRSCDLL SSW SVAWYPIY+IPTG TLKDLDACFLTYH LHT   GP            +   EK+ L VFGLASYK +G S+W   GG  H
Subjt:  RFPPLKTLRSCDLLTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEH

Query:  QLANKLSQEADKWLRDLQVNHPDFLFFSRQ
        QLAN L Q AD WLR  QVNHPDF+FF R+
Subjt:  QLANKLSQEADKWLRDLQVNHPDFLFFSRQ

AT2G01260.1 Protein of unknown function (DUF789) | e value: 3.7e-105 | % identity: 53.24
Query:  MLGASVRFGRGR-GEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPFV
        MLGA  +  RGR G+D FY S++ RR   +++ D+L R Q D S   SSA                     +P  +S+        SNL+RFL+SVTP V
Subjt:  MLGASVRFGRGR-GEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPFV

Query:  PAQFLSKSALRSWRTSDLERR--PYFVLGDLWEAFKEWSAYGAGVPLLLNNS-DGVVQYYVPYLSGIQLY----GMESFIKPRRWGEESDSDYRDSSSDG
        PAQFLSK+ LR  R  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN+ D V+QYYVP LS IQ+Y     ++S +K RR G+ SDSD+RDSSSD 
Subjt:  PAQFLSKSALRSWRTSDLERR--PYFVLGDLWEAFKEWSAYGAGVPLLLNNS-DGVVQYYVPYLSGIQLY----GMESFIKPRRWGEESDSDYRDSSSDG

Query:  STSSDSEIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDL
        S+ SDSE                       R++ +SLRDQH    ED SSD+ E    QG L+FEYLERD PY REP ADK+LDLA++FP L TLRSCDL
Subjt:  STSSDSEIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDL

Query:  LTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKW
        L SSW SVAWYPIYRIPTG TLKDLDACFLTYH LHT+ GG  + Q   +  P   ++EK+ L VFGLASYKF+G SLW   GG EHQL N L Q ADKW
Subjt:  LTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKW

Query:  LRDLQVNHPDFLFFSRQ
        L    V+HPDFLFF R+
Subjt:  LRDLQVNHPDFLFFSRQ

AT2G01260.2 Protein of unknown function (DUF789) | e value: 5.1e-83 | % identity: 53.08
Query:  MLGASVRFGRGR-GEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPFV
        MLGA  +  RGR G+D FY S++ RR   +++ D+L R Q D S   SSA                     +P  +S+        SNL+RFL+SVTP V
Subjt:  MLGASVRFGRGR-GEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPFV

Query:  PAQFLSKSALRSWRTSDLERR--PYFVLGDLWEAFKEWSAYGAGVPLLLNNS-DGVVQYYVPYLSGIQLY----GMESFIKPRRWGEESDSDYRDSSSDG
        PAQFLSK+ LR  R  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN+ D V+QYYVP LS IQ+Y     ++S +K RR G+ SDSD+RDSSSD 
Subjt:  PAQFLSKSALRSWRTSDLERR--PYFVLGDLWEAFKEWSAYGAGVPLLLNNS-DGVVQYYVPYLSGIQLY----GMESFIKPRRWGEESDSDYRDSSSDG

Query:  STSSDSEIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDL
        S+ SDSE                       R++ +SLRDQH    ED SSD+ E    QG L+FEYLERD PY REP ADK+LDLA++FP L TLRSCDL
Subjt:  STSSDSEIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDL

Query:  LTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGG
        L SSW SVAWYPIYRIPTG TLKDLDACFLTYH LHT+ GG
Subjt:  LTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGG

AT4G16100.1 Protein of unknown function (DUF789) | e value: 9.7e-74 | % identity: 43.84
Query:  RGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKD----LPARSSSIAARVASDDAAKPLPVSNPRPVV-SPLSNLERFLQSVTPFVPAQFL
        R RGE+RFY+    R+    R+  RL   + +    ++  + D    +  +        ++ D + P  VS+      +  SNL RFL   TP V  Q L
Subjt:  RGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKD----LPARSSSIAARVASDDAAKPLPVSNPRPVV-SPLSNLERFLQSVTPFVPAQFL

Query:  SKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLY--GMESFIKPRRWGEESDSDY-RDSSSDGSTSSDSEIK
          ++ + WRT + E RPYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY     +    RR GEESD D  RD SSDGS     E+ 
Subjt:  SKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLY--GMESFIKPRRWGEESDSDY-RDSSSDGSTSSDSEIK

Query:  RRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAE-SCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWISV
        + +   R  L  K P I +                    SSDE+E S N  G L+FEYLE   P+ REPL DKI +L+S+FP L+T RSCDL  SSW+SV
Subjt:  RRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAE-SCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQVNH
        AWYPIYRIP GQ+L++LDACFLT+H L T   G  N +    +      + K+PL  FGLASYKFK S    ++   E+Q    L + A++WLR L+V  
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQVNH

Query:  PDFLFF
        PDF  F
Subjt:  PDFLFF

AT5G49220.1 Protein of unknown function (DUF789) | e value: 4.8e-73 | % identity: 43.06
Query:  GASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKPLPVSNPRP----------------------
        G S+     RGE+RFY+    RR     Q  +  R +Q     +   + D   R    AA VA     K L VS  +                       
Subjt:  GASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKPLPVSNPRP----------------------

Query:  VVSPLSNLERFLQSVTPFVPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPR
        V+S  SNL+RFL+  TP VPA+     +    +T + +   YFVL DLWE+F EWSAYGAGV     PL ++ +D  VQYYVPYLSGIQLY ++   KPR
Subjt:  VVSPLSNLERFLQSVTPFVPAQFLSKSALRSWRTSDLERRPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPR

Query:  RWGEESDSDYRDSSSDGSTSSDSEIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKIL
             +     + SS+GS++S +                 P   +   + R+SL+DQ   +    SS EAE  NPQG LLFEYLE + P+ REPLA+KI 
Subjt:  RWGEESDSDYRDSSSDGSTSSDSEIKRRIQYSREPLHHKDPSITAPLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKIL

Query:  DLASRFPPLKTLRSCDLLTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNG
        DLASR P L T RSCDLL SSW+SV+WYPIYRIP G TL++LDACFLT+H L TA   PQ+      + P    + K+PL  FGLASYK K  S+W QN 
Subjt:  DLASRFPPLKTLRSCDLLTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTAIGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNG

Query:  GVEHQLANKLSQEADKWLRDLQVNHPDFLFFS
          E Q    L Q ADKWL+ LQV+HPD+ FF+
Subjt:  GVEHQLANKLSQEADKWLRDLQVNHPDFLFFS


Sequences
CDS sequence
ATGTTAGGCGCCAGTGTTCGGTTCGGCCGCGGCAGAGGAGAGGACAGGTTCTACGATTCGTCCAGAGCGAGACGAGGTCTTCTCAGTCGTCAAAGCGACAGGCTGTGCAG
ACCTCAACAAGACGCTTCCGCTACTCAATCTTCGGCGGTGAAGGATTTGCCGGCGCGTTCTTCTTCGATTGCAGCGCGTGTTGCCTCCGATGATGCTGCTAAACCACTGC
CCGTTTCTAATCCCCGACCGGTCGTGTCTCCGTTGAGTAATCTCGAGCGGTTCTTGCAGTCGGTTACTCCGTTTGTGCCTGCTCAGTTTCTCTCCAAGAGTGCGTTGAGA
AGTTGGAGAACTAGCGATTTGGAGAGGCGGCCGTATTTTGTGCTTGGGGATTTGTGGGAGGCTTTTAAGGAATGGAGCGCTTATGGGGCTGGAGTGCCTCTGTTGTTGAA
TAACTCTGATGGGGTGGTTCAGTATTATGTTCCCTATTTGTCTGGTATACAGTTGTATGGGATGGAATCGTTTATCAAACCAAGGCGATGGGGTGAGGAAAGTGATAGTG
ATTACAGAGATTCAAGTAGTGATGGTAGTACTAGTAGTGATTCTGAAATAAAGAGAAGAATACAATACAGTAGGGAACCACTCCACCATAAAGATCCGTCTATAACAGCT
CCTCTCAGAATGGAAAGATTGTCTTTAAGGGACCAGCATTTGGGACTTCTTGAGGACTGCTCCAGTGATGAGGCTGAATCTTGCAACCCTCAAGGTCACCTTCTATTTGA
GTATCTTGAAAGAGACCAACCGTATTCACGTGAACCTTTGGCAGACAAGATATTGGACCTTGCTTCTCGCTTCCCTCCGCTGAAAACATTGAGAAGTTGTGACCTACTAA
CAAGTAGTTGGATATCTGTGGCATGGTACCCAATTTACAGGATACCAACTGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTTACATACCATTATCTACACACAGCA
ATCGGAGGACCTCAAAACCCACAAGTGCCATTTGTGGCATATCCTTGTAAAACGGATGCTGAAAAGGTTCCTTTAAGAGTTTTTGGACTTGCTTCATACAAGTTTAAAGG
GTCCTCATTGTGGATGCAAAATGGTGGAGTTGAGCATCAATTGGCAAACAAGCTTTCGCAGGAAGCCGACAAGTGGTTAAGAGATCTCCAGGTCAATCACCCAGATTTTC
TATTCTTCAGCCGCCAAGATCTAACTCCTTAG
mRNA sequence
GGAGATTTTCAAAAAAAGAAAAAAGAAAAAGAAAAAGGAAATCCCAATTCTTAATCGTAATCGGTTTTCGTCGAACCGTTTTGATCTGTTTACCCCTTTTGTGAAAACCC
GCTTCCCCATTCGATTCACTGTGTTCTTCATCTTCTCCGATTCACTCATTCTCATTTCCCTTCCGATCCTTCACCGGCGAGATCTGATTTCCTCCGGCAGATCCGCCCAT
CGCCGTTTCTGCTCTTCCGATCTCCAATTACCAGATCGCTCCGCCGTTGACTTCGGCCAGTCTCCGCCACTGTCCAGATGTTAGGCGCCAGTGTTCGGTTCGGCCGCGGC
AGAGGAGAGGACAGGTTCTACGATTCGTCCAGAGCGAGACGAGGTCTTCTCAGTCGTCAAAGCGACAGGCTGTGCAGACCTCAACAAGACGCTTCCGCTACTCAATCTTC
GGCGGTGAAGGATTTGCCGGCGCGTTCTTCTTCGATTGCAGCGCGTGTTGCCTCCGATGATGCTGCTAAACCACTGCCCGTTTCTAATCCCCGACCGGTCGTGTCTCCGT
TGAGTAATCTCGAGCGGTTCTTGCAGTCGGTTACTCCGTTTGTGCCTGCTCAGTTTCTCTCCAAGAGTGCGTTGAGAAGTTGGAGAACTAGCGATTTGGAGAGGCGGCCG
TATTTTGTGCTTGGGGATTTGTGGGAGGCTTTTAAGGAATGGAGCGCTTATGGGGCTGGAGTGCCTCTGTTGTTGAATAACTCTGATGGGGTGGTTCAGTATTATGTTCC
CTATTTGTCTGGTATACAGTTGTATGGGATGGAATCGTTTATCAAACCAAGGCGATGGGGTGAGGAAAGTGATAGTGATTACAGAGATTCAAGTAGTGATGGTAGTACTA
GTAGTGATTCTGAAATAAAGAGAAGAATACAATACAGTAGGGAACCACTCCACCATAAAGATCCGTCTATAACAGCTCCTCTCAGAATGGAAAGATTGTCTTTAAGGGAC
CAGCATTTGGGACTTCTTGAGGACTGCTCCAGTGATGAGGCTGAATCTTGCAACCCTCAAGGTCACCTTCTATTTGAGTATCTTGAAAGAGACCAACCGTATTCACGTGA
ACCTTTGGCAGACAAGATATTGGACCTTGCTTCTCGCTTCCCTCCGCTGAAAACATTGAGAAGTTGTGACCTACTAACAAGTAGTTGGATATCTGTGGCATGGTACCCAA
TTTACAGGATACCAACTGGGCAAACATTAAAGGATCTTGATGCTTGCTTTCTTACATACCATTATCTACACACAGCAATCGGAGGACCTCAAAACCCACAAGTGCCATTT
GTGGCATATCCTTGTAAAACGGATGCTGAAAAGGTTCCTTTAAGAGTTTTTGGACTTGCTTCATACAAGTTTAAAGGGTCCTCATTGTGGATGCAAAATGGTGGAGTTGA
GCATCAATTGGCAAACAAGCTTTCGCAGGAAGCCGACAAGTGGTTAAGAGATCTCCAGGTCAATCACCCAGATTTTCTATTCTTCAGCCGCCAAGATCTAACTCCTTAGT
GATACAATGCTCTTACCAATTAGAAAACGAAGGTTGACCGGTTGTGGCCATGCAAAAGAATCCCCTATTTTGTTGGATTAAGCTGTTTGCTTGTTGGAGTAATGGTTCAC
TCGTTTGCGTTCGTTACAGTTGGGGACTGAAGGAAGGTTAATGAATGAGGTAGGCCGCAAAATAGTGAAGCTTTAGACATCCATGCGGTCCATGCGACACATGCATATGA
TGTAATAACAAAGACCGATAGAAAGTGAAGGCTATATAAATGTAAGCAGAAAACATTGCTGTAAGGCTCGCTGGTTTAGCAAGGTAGAGTTGGGGTTTGAACTTCGACAT
GTATTAATAAGCAGGTTTTCTTGTTTGCTTTCACTTTTTTTTTTCTCTATTTATTCCTAGAAGAGGCCATCATTTGAGAAATCAGGCCTTATCCCTCTGACTAACGGTTC
TAATTTTTGTTAGAATGTAATGTTATTTAAGAGGTTTTCCTTAGGGCATTGAACTGTTGCTGTGTACTGCAGGAGTTTCAGTGACCATGAACTGGTGTTTACGGACGAAA
GCATATAATACTCACTGAACTTGAGT
Protein sequence
MLGASVRFGRGRGEDRFYDSSRARRGLLSRQSDRLCRPQQDASATQSSAVKDLPARSSSIAARVASDDAAKPLPVSNPRPVVSPLSNLERFLQSVTPFVPAQFLSKSALR
SWRTSDLERRPYFVLGDLWEAFKEWSAYGAGVPLLLNNSDGVVQYYVPYLSGIQLYGMESFIKPRRWGEESDSDYRDSSSDGSTSSDSEIKRRIQYSREPLHHKDPSITA
PLRMERLSLRDQHLGLLEDCSSDEAESCNPQGHLLFEYLERDQPYSREPLADKILDLASRFPPLKTLRSCDLLTSSWISVAWYPIYRIPTGQTLKDLDACFLTYHYLHTA
IGGPQNPQVPFVAYPCKTDAEKVPLRVFGLASYKFKGSSLWMQNGGVEHQLANKLSQEADKWLRDLQVNHPDFLFFSRQDLTP
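The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) applies and using only the first 33 and last 18 nucleotides of the CDS as listed; `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# Opening and closing stretches of the CDS sequence for Sed0002548.
print(translate("ATGTTAGGCGCCAGTGTTCGGTTCGGCCGCGGC"))
print(translate("CAAGATCTAACTCCTTAG"))
```

The two fragments translate to the start (`MLGASVRFGRG`) and the end (`QDLTP` followed by a stop) of the listed protein sequence. Running the same function over the full CDS, with line breaks removed, would check the whole record.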