; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G022510 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G022510
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionProtein of unknown function (DUF789)
Genome locationCG_Chr05:34416993..34419871
RNA-Seq ExpressionClCG05G022510
SyntenyClCG05G022510
Gene Ontology termsNA
InterPro domainsIPR008507 - Protein of unknown function DUF789


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446634.1 PREDICTED: uncharacterized protein LOC103489306 [Cucumis melo]2.9e-20588.43Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP
        MLGAGVRFGR KGEDRFYDSSRARKGLLSRQNDRL  T+QQDASATTPSYADK+ VSTRP DRLASDEATKPVP       VS LSNLERFLQSVTP VP
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP

Query:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR
        AQFLSKSALRGWRTCD +T+PYF+LGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESST  RRWGEESDSDYRDSSSDGSSDSET +
Subjt:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR

Query:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV
        RIKH REP HHNDP+ITDPLRMDRLSLR+QH GLHEDCSSDEAESFNS GRLLFEYLERDLP+     + D +ISDLASRFPQLKTMRSCDLLPYSWISV
Subjt:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTP+RDSQ+PPIPF AYPCKTNGA KVPLRIFGLASYKFNGSSLWMRNGGVEHQLA  LSRAAD WLRDL VN
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN

Query:  HPDFLFFSRRDATPY
        HPDFLFFSRRDATPY
Subjt:  HPDFLFFSRRDATPY

XP_011655788.1 uncharacterized protein LOC101209750 [Cucumis sativus]1.5e-20487.95Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP
        MLGAGVRFGR KGEDRFYDSSRARKGLLSRQNDRL  T+QQDASATTPSYADK+ VSTRP DRL SDEATKPVP       VS LSNLERFLQSVTP VP
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP

Query:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR
        AQFLSKSALRGWRTCD +T+PYF+LGDLWEAFKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLY MESST  RRWGEESDSDYRDSSSDGSSDSET R
Subjt:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR

Query:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV
        RIKH REP HHNDP+ITDPLRMDRLSLRDQH GLHEDCSSDEAESFNS GRLLFEYLERDLP+     + D +ISDLASRFPQLKTMRSCDLLPYSWISV
Subjt:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTP+RDSQ+PP PF AYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLA  LSRAA+ WLRDL VN
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN

Query:  HPDFLFFSRRDATPY
        HPDFLFFSRRDATPY
Subjt:  HPDFLFFSRRDATPY

XP_022150656.1 uncharacterized protein LOC111018737 [Momordica charantia]1.2e-19583.97Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRP--GDRLASDEATKP--VPFSNPQPVVSPLSNLERFLQSVTP
        MLGAGVRFGRG+GEDRFYDSSRAR+GLLSRQNDRL C  Q+DASA TPS   KD S       R+ASDEATKP  VP  NPQPVVSPLSNLERFLQSVTP
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRP--GDRLASDEATKP--VPFSNPQPVVSPLSNLERFLQSVTP

Query:  SVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSE
        SVPAQF SKS+LRGWRTCDS+T+PYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLY ME S  PRRWGEESDSDYRDSSSDGSSDSE
Subjt:  SVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSE

Query:  TKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSW
        TKRRIKH RE LHHNDP+IT PLR+DRLSLRDQH GLHEDCSSDEAESFNS GRLLFEYLERDLP++   +    +I DLASRFPQLKTMRSCDLLPYSW
Subjt:  TKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSW

Query:  ISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDL
        ISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT IR  Q+  +PF AYPCKT+ AEK+PLRIFGLASYKF GSSLWMRNGGVEHQLA +LS+AAD WLR L
Subjt:  ISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDL

Query:  QVNHPDFLFFSRRDATPY
        QVNHPDFLFFSRRDATPY
Subjt:  QVNHPDFLFFSRRDATPY

XP_022980279.1 uncharacterized protein LOC111479669 [Cucurbita maxima]7.0e-19182.65Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDV------STRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSV
        MLG GVRFGRG+GEDRFYDSSRARKGLLSRQNDRL    QQ ASATTPS A  DV      +TRPGDRL SDEAT+PV  SNPQP VSPLSNLERFLQS 
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDV------STRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSV

Query:  TPSVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSD
        TPSVPAQFLSKSALRG R CDS+T+PYFVL DLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESST PRRW EESDSDYRDSSSDGSSD
Subjt:  TPSVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSD

Query:  SETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPY
        SE KRRIKH REP HH+DP IT P RMDRLSLRDQH G H+DCSSDEAESFNS GRLLFEYLERDLP+     + D +ISDLASRFPQLKTMRSCDLLP+
Subjt:  SETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPY

Query:  SWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLR
        SWISVAWYPIYRIPTGQTLKDLDACFLTYH LHTPIR  ++P +P   YPCKT+GA+K+PLRIFGLASYKFNGSSLWMRNGGVEHQLA  LS+AAD+WLR
Subjt:  SWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLR

Query:  DLQVNHPDFLFFSRR
         LQVNHPDF FFSR+
Subjt:  DLQVNHPDFLFFSRR

XP_038892909.1 uncharacterized protein LOC120081811 [Benincasa hispida]4.5e-21490.58Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVPA
        MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRL CT QQDASATTPSYADKDVSTRPGDRL SDEATKPVPFSN QPVVSPLSNLERFLQSVTPS+ A
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVPA

Query:  QFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKRR
        QFLSKSALRGWRTCD +T+PYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKRR
Subjt:  QFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKRR

Query:  IKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISVA
        IKH REP HHNDPTITDPLRMDRLSLRDQ  GLHEDCSSDEAES NS G L+FEYLERDLP++   +    +ISDLAS FPQLKTMRSCDLLPYSWISVA
Subjt:  IKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISVA

Query:  WYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVNH
        WYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQ+PPIPF AY CKT+GAEKV LRIFGLASYKFNGSSLW+RNGGVEHQLA +LSRAAD+WLRDL VNH
Subjt:  WYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVNH

Query:  PDFLFFSRRDATPY
        PDFLFFSRRDATPY
Subjt:  PDFLFFSRRDATPY

TrEMBL top hitse value%identityAlignment
A0A0A0KR63 Uncharacterized protein7.0e-20587.95Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP
        MLGAGVRFGR KGEDRFYDSSRARKGLLSRQNDRL  T+QQDASATTPSYADK+ VSTRP DRL SDEATKPVP       VS LSNLERFLQSVTP VP
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP

Query:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR
        AQFLSKSALRGWRTCD +T+PYF+LGDLWEAFKEWSAYGAGVPLLLNN+DGVVQYYVPYLSGIQLY MESST  RRWGEESDSDYRDSSSDGSSDSET R
Subjt:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR

Query:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV
        RIKH REP HHNDP+ITDPLRMDRLSLRDQH GLHEDCSSDEAESFNS GRLLFEYLERDLP+     + D +ISDLASRFPQLKTMRSCDLLPYSWISV
Subjt:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTP+RDSQ+PP PF AYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLA  LSRAA+ WLRDL VN
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN

Query:  HPDFLFFSRRDATPY
        HPDFLFFSRRDATPY
Subjt:  HPDFLFFSRRDATPY

A0A1S3BG78 uncharacterized protein LOC1034893061.4e-20588.43Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP
        MLGAGVRFGR KGEDRFYDSSRARKGLLSRQNDRL  T+QQDASATTPSYADK+ VSTRP DRLASDEATKPVP       VS LSNLERFLQSVTP VP
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP

Query:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR
        AQFLSKSALRGWRTCD +T+PYF+LGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESST  RRWGEESDSDYRDSSSDGSSDSET +
Subjt:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR

Query:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV
        RIKH REP HHNDP+ITDPLRMDRLSLR+QH GLHEDCSSDEAESFNS GRLLFEYLERDLP+     + D +ISDLASRFPQLKTMRSCDLLPYSWISV
Subjt:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTP+RDSQ+PPIPF AYPCKTNGA KVPLRIFGLASYKFNGSSLWMRNGGVEHQLA  LSRAAD WLRDL VN
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN

Query:  HPDFLFFSRRDATPY
        HPDFLFFSRRDATPY
Subjt:  HPDFLFFSRRDATPY

A0A5D3CBA2 DUF789 domain-containing protein1.7e-19088.24Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP
        MLGAGVRFGR KGEDRFYDSSRARKGLLSRQNDRL  T+QQDASATTPSYADK+ VSTRP DRLASDEATKPVP       VS LSNLERFLQSVTP VP
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKD-VSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVP

Query:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR
        AQFLSKSALRGWRTCD +T+PYF+LGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESST  RRWGEESDSDYRDSSSDGSSDSET +
Subjt:  AQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKR

Query:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV
        RIKH REP HHNDP+ITDPLRMDRLSLR+QH GLHEDCSSDEAESFNS GRLLFEYLERDLP+     + D +ISDLASRFPQLKTMRSCDLLPYSWISV
Subjt:  RIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAAD
        AWYPIYRIPTGQTLKDLDACFLTYHSLHTP+RDSQ+PPIPF AYPCKTNGA KVPLRIFGLASYKFNGSSLWMRNGGVEHQLA  LSRAAD
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAAD

A0A6J1DC61 uncharacterized protein LOC1110187376.0e-19683.97Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRP--GDRLASDEATKP--VPFSNPQPVVSPLSNLERFLQSVTP
        MLGAGVRFGRG+GEDRFYDSSRAR+GLLSRQNDRL C  Q+DASA TPS   KD S       R+ASDEATKP  VP  NPQPVVSPLSNLERFLQSVTP
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRP--GDRLASDEATKP--VPFSNPQPVVSPLSNLERFLQSVTP

Query:  SVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSE
        SVPAQF SKS+LRGWRTCDS+T+PYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLY ME S  PRRWGEESDSDYRDSSSDGSSDSE
Subjt:  SVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSE

Query:  TKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSW
        TKRRIKH RE LHHNDP+IT PLR+DRLSLRDQH GLHEDCSSDEAESFNS GRLLFEYLERDLP++   +    +I DLASRFPQLKTMRSCDLLPYSW
Subjt:  TKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSW

Query:  ISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDL
        ISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT IR  Q+  +PF AYPCKT+ AEK+PLRIFGLASYKF GSSLWMRNGGVEHQLA +LS+AAD WLR L
Subjt:  ISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDL

Query:  QVNHPDFLFFSRRDATPY
        QVNHPDFLFFSRRDATPY
Subjt:  QVNHPDFLFFSRRDATPY

A0A6J1IYU8 uncharacterized protein LOC1114796693.4e-19182.65Show/hide
Query:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDV------STRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSV
        MLG GVRFGRG+GEDRFYDSSRARKGLLSRQNDRL    QQ ASATTPS A  DV      +TRPGDRL SDEAT+PV  SNPQP VSPLSNLERFLQS 
Subjt:  MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDV------STRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSV

Query:  TPSVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSD
        TPSVPAQFLSKSALRG R CDS+T+PYFVL DLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESST PRRW EESDSDYRDSSSDGSSD
Subjt:  TPSVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSD

Query:  SETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPY
        SE KRRIKH REP HH+DP IT P RMDRLSLRDQH G H+DCSSDEAESFNS GRLLFEYLERDLP+     + D +ISDLASRFPQLKTMRSCDLLP+
Subjt:  SETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPY

Query:  SWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLR
        SWISVAWYPIYRIPTGQTLKDLDACFLTYH LHTPIR  ++P +P   YPCKT+GA+K+PLRIFGLASYKFNGSSLWMRNGGVEHQLA  LS+AAD+WLR
Subjt:  SWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLR

Query:  DLQVNHPDFLFFSRR
         LQVNHPDF FFSR+
Subjt:  DLQVNHPDFLFFSRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15030.1 Protein of unknown function (DUF789)1.2e-9558.68Show/hide
Query:  SNLERFLQSVTPSVPAQFLSKSAL--RGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTMPRRWG
        SN+ERFL SVTPSVPA +LSK+ +  RG    +SQ  PYF+LGD+WE+F EWSAYG GVPL LNN  D V QYYVP LSGIQ+Y    A+ SS   RR G
Subjt:  SNLERFLQSVTPSVPAQFLSKSAL--RGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTMPRRWG

Query:  EESDSDYRDSSSDGSSDSETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLA
        EES+SD+RDSSS+GSS SE++R + + +E +           RMD+LSLR +H    ED SSD+ E  +S GRL+FEYLERDLP+      F  ++SDLA
Subjt:  EESDSDYRDSSSDGSSDSETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLA

Query:  SRFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPC--KTNGAEKVPLRIFGLASYKFNGSSLWMRNG
        SRFP+LKT+RSCDLLP SW SVAWYPIY+IPTG TLKDLDACFLTYHSLHTP    Q P +   +          EK+ L +FGLASYK  G S+W   G
Subjt:  SRFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPC--KTNGAEKVPLRIFGLASYKFNGSSLWMRNG

Query:  GVEHQLAYNLSRAADQWLRDLQVNHPDFLFFSRR
        G  HQLA +L +AAD WLR  QVNHPDF+FF RR
Subjt:  GVEHQLAYNLSRAADQWLRDLQVNHPDFLFFSRR

AT2G01260.1 Protein of unknown function (DUF789)6.9e-9650.72Show/hide
Query:  MLGAGVRFGRGK-GEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRPGDRLASDEATKPVPFSNP-QPVVSPLSNLERFLQSVTPSV
        MLGAG +  RG+ G+D FY S++ R+   +++ D+L               A  DVS  P        ++ P P     +P     SNL+RFL+SVTPSV
Subjt:  MLGAGVRFGRGK-GEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRPGDRLASDEATKPVPFSNP-QPVVSPLSNLERFLQSVTPSV

Query:  PAQFLSKSALRGWRTCDSQTR--PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTMPRRWGEESDSDYRDSSSDG
        PAQFLSK+ LR  R  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y    A++SS   RR G+ SDSD+RDSSSD 
Subjt:  PAQFLSKSALRGWRTCDSQTR--PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTMPRRWGEESDSDYRDSSSDG

Query:  SSDSETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDL
        SSDS+++R                    R+D +SLRDQH    ED SSD+ E   S GRL+FEYLERDLP  +    F  ++ DLA++FP+L T+RSCDL
Subjt:  SSDSETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDL

Query:  LPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQ
        L  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT      +        P     +EK+ L +FGLASYKF G SLW   GG EHQL  +L +AAD+
Subjt:  LPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQ

Query:  WLRDLQVNHPDFLFFSRR
        WL    V+HPDFLFF RR
Subjt:  WLRDLQVNHPDFLFFSRR

AT2G01260.2 Protein of unknown function (DUF789)9.4e-7751.63Show/hide
Query:  MLGAGVRFGRGK-GEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRPGDRLASDEATKPVPFSNP-QPVVSPLSNLERFLQSVTPSV
        MLGAG +  RG+ G+D FY S++ R+   +++ D+L               A  DVS  P        ++ P P     +P     SNL+RFL+SVTPSV
Subjt:  MLGAGVRFGRGK-GEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRPGDRLASDEATKPVPFSNP-QPVVSPLSNLERFLQSVTPSV

Query:  PAQFLSKSALRGWRTCDSQTR--PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTMPRRWGEESDSDYRDSSSDG
        PAQFLSK+ LR  R  D   +  PYFVLGD+W++F EWSAYG GVPL+LNN  D V+QYYVP LS IQ+Y    A++SS   RR G+ SDSD+RDSSSD 
Subjt:  PAQFLSKSALRGWRTCDSQTR--PYFVLGDLWEAFKEWSAYGAGVPLLLNNT-DGVVQYYVPYLSGIQLY----AMESSTMPRRWGEESDSDYRDSSSDG

Query:  SSDSETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDL
        SSDS+++R                    R+D +SLRDQH    ED SSD+ E   S GRL+FEYLERDLP  +    F  ++ DLA++FP+L T+RSCDL
Subjt:  SSDSETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDL

Query:  LPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT
        L  SW SVAWYPIYRIPTG TLKDLDACFLTYHSLHT
Subjt:  LPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHT

AT4G16100.1 Protein of unknown function (DUF789)7.5e-7444.47Show/hide
Query:  RGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVST------RPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVPAQFL
        R +GE+RFY+    RK    R+  RL     +          D+ +        +P +   SD +      S      +  SNL RFL   TP V  Q L
Subjt:  RGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVST------RPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVPAQFL

Query:  SKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLY--AMESSTMPRRWGEESDSDY-RDSSSDGSSDSETKRR
          ++ +GWRT + + RPYF+L DLW++F+EWSAYG GVPLLLN  D VVQYYVPYLSGIQLY     + T  RR GEESD D  RD SSDGS+D      
Subjt:  SKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLY--AMESSTMPRRWGEESDSDY-RDSSSDGSSDSETKRR

Query:  IKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAE-SFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV
           CRE L  N         + R SL ++        SSDE+E S NS G L+FEYLE  +PF         +IS+L+S+FP L+T RSCDL P SW+SV
Subjt:  IKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAE-SFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISV

Query:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN
        AWYPIYRIP GQ+L++LDACFLT+HSL TP R +            K+  + K+PL  FGLASYKF  S     +   E+Q    L R A++WLR L+V 
Subjt:  AWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVN

Query:  HPDFLFF
         PDF  F
Subjt:  HPDFLFF

AT5G49220.1 Protein of unknown function (DUF789)3.0e-6747.01Show/hide
Query:  VVSPLSNLERFLQSVTPSVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPR
        V+S  SNL+RFL+  TP VPA+     +    +T +S    YFVL DLWE+F EWSAYGAGV     PL ++  D  VQYYVPYLSGIQLY ++    PR
Subjt:  VVSPLSNLERFLQSVTPSVPAQFLSKSALRGWRTCDSQTRPYFVLGDLWEAFKEWSAYGAGV-----PLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPR

Query:  RWGEESDSDYRDSSSDGSSDSETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQIS
             +     + SS+GSS+S T               P       ++R+SL+DQ   +    SS EAE  N  GRLLFEYLE + PF    +    +IS
Subjt:  RWGEESDSDYRDSSSDGSSDSETKRRIKHCREPLHHNDPTITDPLRMDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQIS

Query:  DLASRFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCK-TNGAEKVPLRIFGLASYKFNGSSLWMR
        DLASR P+L T RSCDLLP SW+SV+WYPIYRIP G TL++LDACFLT+HSL T          P +A  C  +  + K+PL  FGLASYK    S+W +
Subjt:  DLASRFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPIRDSQTPPIPFAAYPCK-TNGAEKVPLRIFGLASYKFNGSSLWMR

Query:  NGGVEHQLAYNLSRAADQWLRDLQVNHPDFLFFS
        N   E Q   +L +AAD+WL+ LQV+HPD+ FF+
Subjt:  NGGVEHQLAYNLSRAADQWLRDLQVNHPDFLFFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGAGCGGGTGTACGGTTTGGTCGCGGCAAGGGAGAGGACCGGTTCTACGATTCATCCAGAGCGAGGAAAGGCCTCCTCAGTCGTCAAAATGATAGGCTATGTTG
TACAACTCAACAAGACGCTTCTGCTACTACTCCATCCTACGCGGATAAGGATGTTTCGACACGCCCTGGGGACCGTTTAGCTTCTGACGAAGCTACTAAACCAGTTCCGT
TTTCTAATCCACAGCCTGTTGTTTCACCGTTGAGTAATCTCGAGCGATTCTTGCAGTCGGTTACTCCCTCTGTGCCTGCTCAGTTTCTCTCTAAGAGTGCGTTAAGAGGT
TGGAGGACGTGCGATTCGCAGACGCGGCCTTACTTTGTGCTTGGGGATTTGTGGGAAGCTTTCAAAGAGTGGAGTGCTTACGGGGCAGGAGTGCCTCTTTTATTGAATAA
CACTGATGGTGTTGTTCAGTATTATGTCCCCTACTTGTCTGGCATACAATTGTATGCCATGGAATCGTCTACAATGCCAAGGCGATGGGGTGAAGAAAGCGACAGTGACT
ACAGAGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACAAAGAGAAGAATAAAACACTGTAGAGAACCACTCCATCATAATGATCCAACTATCACAGATCCTCTTAGA
ATGGATAGATTGTCTTTGAGGGACCAGCACTCGGGACTTCATGAGGACTGCTCCAGTGATGAGGCTGAATCTTTCAATTCTACAGGTCGTCTTCTATTTGAGTATCTTGA
AAGAGACCTACCCTTCACATTTTTCTCTGTTATTTTTGACATCCAGATATCGGACCTTGCTTCACGCTTCCCGCAGTTGAAAACAATGAGAAGCTGTGACCTCCTACCAT
ATAGTTGGATATCCGTGGCATGGTACCCAATTTACCGGATACCAACTGGGCAAACCTTAAAGGATCTTGATGCTTGCTTTCTCACGTACCATTCTCTACACACACCAATC
AGAGATTCTCAAACCCCACCAATACCATTTGCGGCATATCCTTGTAAGACGAATGGTGCCGAAAAGGTTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAACGG
GTCGTCGTTGTGGATGCGAAATGGTGGAGTAGAGCATCAATTGGCATACAACCTATCGCGAGCAGCTGATCAGTGGTTAAGAGATCTCCAAGTCAATCATCCAGATTTCC
TGTTCTTCAGCCGCCGAGATGCAACACCTTACTGA
mRNA sequenceShow/hide mRNA sequence
TTGGAACGAAAAAAAGGAGATATTAATTTTCTTGACCAAAAAAAGAAAGGAAAAAAAATGTCTCTGAATTCTAGTCGTTTTTTCGTCGAACCGTTTTGATCTGTTTGCCA
TTCTTCATTTCTCACCCTTTCTGAAAACCCGTCTCTTCCCATTCGATTCACTGTGTTCTTCTTATCGATCTCCTGTTTATTCATCGTTACTCTTTCTCCGATCGCTTCTT
TTCTCTTTTCTTTCTGGAATTTCTTCATCGGCGATCTGGTTTCCTAAGCAGATCCGACCAATTGTTGTTTCTGCTGTTTGGATTTCCGATAATCACATCGCATCCTCATT
TTCTTCAATTTCCACTTCCTCCCCGTTGACTTTCATTACGGATTGTTACTACAACTTCCAGAGGTGAGTAGTGTTGTTCATCACTTCTTGGTGCTAGACTGTCGAGATGT
TAGGAGCGGGTGTACGGTTTGGTCGCGGCAAGGGAGAGGACCGGTTCTACGATTCATCCAGAGCGAGGAAAGGCCTCCTCAGTCGTCAAAATGATAGGCTATGTTGTACA
ACTCAACAAGACGCTTCTGCTACTACTCCATCCTACGCGGATAAGGATGTTTCGACACGCCCTGGGGACCGTTTAGCTTCTGACGAAGCTACTAAACCAGTTCCGTTTTC
TAATCCACAGCCTGTTGTTTCACCGTTGAGTAATCTCGAGCGATTCTTGCAGTCGGTTACTCCCTCTGTGCCTGCTCAGTTTCTCTCTAAGAGTGCGTTAAGAGGTTGGA
GGACGTGCGATTCGCAGACGCGGCCTTACTTTGTGCTTGGGGATTTGTGGGAAGCTTTCAAAGAGTGGAGTGCTTACGGGGCAGGAGTGCCTCTTTTATTGAATAACACT
GATGGTGTTGTTCAGTATTATGTCCCCTACTTGTCTGGCATACAATTGTATGCCATGGAATCGTCTACAATGCCAAGGCGATGGGGTGAAGAAAGCGACAGTGACTACAG
AGATTCAAGTAGTGATGGTAGTAGTGATTCTGAAACAAAGAGAAGAATAAAACACTGTAGAGAACCACTCCATCATAATGATCCAACTATCACAGATCCTCTTAGAATGG
ATAGATTGTCTTTGAGGGACCAGCACTCGGGACTTCATGAGGACTGCTCCAGTGATGAGGCTGAATCTTTCAATTCTACAGGTCGTCTTCTATTTGAGTATCTTGAAAGA
GACCTACCCTTCACATTTTTCTCTGTTATTTTTGACATCCAGATATCGGACCTTGCTTCACGCTTCCCGCAGTTGAAAACAATGAGAAGCTGTGACCTCCTACCATATAG
TTGGATATCCGTGGCATGGTACCCAATTTACCGGATACCAACTGGGCAAACCTTAAAGGATCTTGATGCTTGCTTTCTCACGTACCATTCTCTACACACACCAATCAGAG
ATTCTCAAACCCCACCAATACCATTTGCGGCATATCCTTGTAAGACGAATGGTGCCGAAAAGGTTCCTTTAAGAATTTTTGGACTTGCTTCATACAAGTTTAACGGGTCG
TCGTTGTGGATGCGAAATGGTGGAGTAGAGCATCAATTGGCATACAACCTATCGCGAGCAGCTGATCAGTGGTTAAGAGATCTCCAAGTCAATCATCCAGATTTCCTGTT
CTTCAGCCGCCGAGATGCAACACCTTACTGA
Protein sequenceShow/hide protein sequence
MLGAGVRFGRGKGEDRFYDSSRARKGLLSRQNDRLCCTTQQDASATTPSYADKDVSTRPGDRLASDEATKPVPFSNPQPVVSPLSNLERFLQSVTPSVPAQFLSKSALRG
WRTCDSQTRPYFVLGDLWEAFKEWSAYGAGVPLLLNNTDGVVQYYVPYLSGIQLYAMESSTMPRRWGEESDSDYRDSSSDGSSDSETKRRIKHCREPLHHNDPTITDPLR
MDRLSLRDQHSGLHEDCSSDEAESFNSTGRLLFEYLERDLPFTFFSVIFDIQISDLASRFPQLKTMRSCDLLPYSWISVAWYPIYRIPTGQTLKDLDACFLTYHSLHTPI
RDSQTPPIPFAAYPCKTNGAEKVPLRIFGLASYKFNGSSLWMRNGGVEHQLAYNLSRAADQWLRDLQVNHPDFLFFSRRDATPY