; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0004340 (gene) of Chayote v1 genome

Gene IDSed0004340
OrganismSechium edule (Chayote v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationLG05:30308729..30310773
RNA-Seq ExpressionSed0004340
SyntenySed0004340
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608203.1 hypothetical protein SDJN03_01545, partial [Cucurbita argyrosperma subsp. sororia]2.5e-21079.67Show/hide
Query:  LLRFPPLPLFQAPVLNLPFSLLASSVSFFLFPQNVQQFSFHLSYWGLE-VWFTVYEPLSGEMQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLG
        LL+F  LP F    +N P   L +S ++   P       F+LS WGLE + FTVYEP SGEMQPQ SSRID+GDLKAQIVKKLGNDKSKRYFFYLSRFLG
Subjt:  LLRFPPLPLFQAPVLNLPFSLLASSVSFFLFPQNVQQFSFHLSYWGLE-VWFTVYEPLSGEMQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLG

Query:  QKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRENGPEQAGSAFP--NQNQTMPVWPNGVLSESPWKGRS
        QKLSK EFDK CVRVLGREN+QLHN+LI+SILKNACVAKTPPPIN SGH+QS+LQAS NS CRE+GPE  GS FP  NQNQ MP+WPNGVL  SP KGRS
Subjt:  QKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRENGPEQAGSAFP--NQNQTMPVWPNGVLSESPWKGRS

Query:  VLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPENDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSS
        VLRGKFRDRPSPLGPNGK TCLSYQS+GTED  SKVITENG+V MCDYQRPVQHL+AVAELPENDIDGA+ RPSEKPRIHPTEAA+ ED +EVEQSDP S
Subjt:  VLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPENDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSS

Query:  FLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSG-DFLSCFDSFGLSDSETVRKRMEQIATAQGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARST
         LRG L+PPLGIPF S SVGGARK LPVGSS SG DFLSC+DS GLSD ETVRKRMEQIATAQGLEGVS+EC +ILNNTLDVYLKQLIKSCL+LVR+RST
Subjt:  FLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSG-DFLSCFDSFGLSDSETVRKRMEQIATAQGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARST

Query:  FEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
         EHTGH +QKQQNQGKVINGM PSNH  VQN NG SEVLQEKSLECSASLLDFKVAME+NPKQLGEDWPL+LEKISMRAFE+
Subjt:  FEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE

XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]6.8e-20886.84Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK CVRVLGREN+QLHNQLI+SILKNACVAKTPPPIN SGH+QSVLQAS NS 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
        CRE+GPEQ GSAFPNQNQ+ P+WPNGVL  SP KGRS LRGKFRDRPSPLGPNGK+TCLSYQSTG+ED  SKVITENG+VT+CDYQRPV++LQ+VAELPE
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG
        NDIDGA+QRPSEKPRIHPTEAAI E+ EEVEQSDP SFLRG L+PPLGIPF S SVGGARK LPV SS S DFLSC+DS GLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL
        LEGVSMEC SILNNTLDVYLKQLIKSCL+LVRARSTFEH+GH +QKQQNQGKV+NGMWP+NHLRVQN NG SEVLQEKSLECS SLLDFKVAMELNPKQL
Subjt:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]4.8e-20686.6Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK CVRVLGREN+QLHNQLI+SILKNACVAKTPPPIN SGH+QSVL AS NS 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
        CRE+GPEQ GSAFPNQNQ+ P+WPNGVL  SP KGRSVLRGKFRDRPSPLGPNGK TCLSYQSTG+ED  SKVITENG+VT+CDYQRPVQ+LQ+VAELPE
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG
        NDIDGA+QRPSEKPRIHPTEAAI E+ EEVEQSDP  FLRG L+PPLGIPF S SVGGARK LPV SS S DFLSC+DS GLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL
        LEGVSMEC +ILNNTLDVYLKQLIKSCL+LVRARSTFEH+GH +QKQQNQGKV+NGMWP+NHLRVQN NG SEVLQEKSLECS SLLDFKVAMELNPKQL
Subjt:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

XP_023524221.1 uncharacterized protein LOC111788192 [Cucurbita pepo subsp. pepo]8.6e-20385.68Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSK EFDK CVRVLGREN+QLHN+LI+SILKNACVAKTPPPIN SGH+QS+LQAS NS 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
        CRE+GPE  GS FPNQNQ MP+WPNGVL  SP KGRSVLRGKFRDRPSPLGPNGK TCLSYQS+GTED  SKVITENG+V MCDYQRPVQHL+AVAELPE
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSS-SSGDFLSCFDSFGLSDSETVRKRMEQIATAQ
        NDIDGA+ RPSEKPRIHPTEAA+ ED +EVEQSDP S LRG L+PPLGIPF S SVGGARK LPVGSS SS DFLSC+DS GLSDSETVRKRMEQIATAQ
Subjt:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSS-SSGDFLSCFDSFGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVS+EC +ILNNTLDVYLKQLIKSCL+LVR+RST EHTGH +QKQQNQGKVINGM PSNH  VQN NG SEVLQEKSLECSASLLDFKVAME+NPKQ
Subjt:  GLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPL+LEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]1.8e-20586.6Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGND+SKRYFFYLSRFLGQKLSKVEFDK CVRVLGREN+QLHNQLI+SILKNACVAKTPP IN SGH+QSVLQ S  S 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
        CR++GPEQ GSAFPNQNQ++P+W NGVL  SP KGRSVLRGKFRDRPSPLGPNGK TCLSYQSTGTED  SKVITENG+VTMCDYQRPVQHLQAVAELPE
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG
        NDIDGA+ RPSEKPRIHPTEAAI E+ EEVEQSDP SFLRG L+PPLGIPF S SVGGARK LPV SS S DFLSC+DS GLSDS TVRKRMEQIATAQG
Subjt:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL
        LEGVSMEC +ILNNTLDVYLKQLIKSCL+LVRARSTFEHTGH +QKQQNQGKV+N MWP+NHLRVQN NG SEVLQEKSLECS SLLDFKVAMELNPKQL
Subjt:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKI MRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein3.3e-20886.84Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK CVRVLGREN+QLHNQLI+SILKNACVAKTPPPIN SGH+QSVLQAS NS 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
        CRE+GPEQ GSAFPNQNQ+ P+WPNGVL  SP KGRS LRGKFRDRPSPLGPNGK+TCLSYQSTG+ED  SKVITENG+VT+CDYQRPV++LQ+VAELPE
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG
        NDIDGA+QRPSEKPRIHPTEAAI E+ EEVEQSDP SFLRG L+PPLGIPF S SVGGARK LPV SS S DFLSC+DS GLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL
        LEGVSMEC SILNNTLDVYLKQLIKSCL+LVRARSTFEH+GH +QKQQNQGKV+NGMWP+NHLRVQN NG SEVLQEKSLECS SLLDFKVAMELNPKQL
Subjt:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

A0A1S4E5S7 uncharacterized protein LOC1035037572.3e-20686.6Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK CVRVLGREN+QLHNQLI+SILKNACVAKTPPPIN SGH+QSVL AS NS 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
        CRE+GPEQ GSAFPNQNQ+ P+WPNGVL  SP KGRSVLRGKFRDRPSPLGPNGK TCLSYQSTG+ED  SKVITENG+VT+CDYQRPVQ+LQ+VAELPE
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG
        NDIDGA+QRPSEKPRIHPTEAAI E+ EEVEQSDP  FLRG L+PPLGIPF S SVGGARK LPV SS S DFLSC+DS GLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL
        LEGVSMEC +ILNNTLDVYLKQLIKSCL+LVRARSTFEH+GH +QKQQNQGKV+NGMWP+NHLRVQN NG SEVLQEKSLECS SLLDFKVAMELNPKQL
Subjt:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein2.3e-20686.6Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK CVRVLGREN+QLHNQLI+SILKNACVAKTPPPIN SGH+QSVL AS NS 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
        CRE+GPEQ GSAFPNQNQ+ P+WPNGVL  SP KGRSVLRGKFRDRPSPLGPNGK TCLSYQSTG+ED  SKVITENG+VT+CDYQRPVQ+LQ+VAELPE
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG
        NDIDGA+QRPSEKPRIHPTEAAI E+ EEVEQSDP  FLRG L+PPLGIPF S SVGGARK LPV SS S DFLSC+DS GLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL
        LEGVSMEC +ILNNTLDVYLKQLIKSCL+LVRARSTFEH+GH +QKQQNQGKV+NGMWP+NHLRVQN NG SEVLQEKSLECS SLLDFKVAMELNPKQL
Subjt:  LEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

A0A6J1CPD1 uncharacterized protein LOC1110128831.6e-20285.48Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK CVRVLGREN+QLHNQLI+SILKNACVAKTPPPINVSGH+QSVLQAS NS 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRG-KFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELP
        CRE+GPEQ GSAFPNQNQT+P+W NGVL  SP KGRS+LR  KFRDRPSPLGPNGK TCLSY STGTED GSKVITENG+VT+CDYQRPVQHLQAVAELP
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRG-KFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQ
        ENDI+GA+QRPSEKPRIHPTEAAI ED EEVEQSDP SFLRG L+PPLGIPF S SVGGAR+ LP+G  +SGDF SC+DS GLSD+ETVRKRMEQIATAQ
Subjt:  ENDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQN-GNGWSEVLQEKSLECSASLLDFKVAMELNPK
        GLEGVSMECS+ILN+TLD+YLKQLIKSCL+LVR+RST EHTGH +QKQQNQGKVINGMWPSNHLRVQN  NG  EVLQEKSL+CS SLLDFKVAMELNPK
Subjt:  GLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQN-GNGWSEVLQEKSLECSASLLDFKVAMELNPK

Query:  QLGEDWPLLLEKISMRAFEE
        QLGEDWPLLLEKI MR FEE
Subjt:  QLGEDWPLLLEKISMRAFEE

A0A6J1IVJ1 uncharacterized protein LOC1114809993.0e-20184.96Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQPQ SSRID+GDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSK EFDK CVRVLGREN+QLHN+LI+SILKNACVAKTPPPIN SGH+QS+LQAS NS 
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
        CRE+GPE  GS FPNQNQ MP+WPNGVL  SP KGRSVLRGKFRDRPSPLGPNGK T LSYQS+GTED  SKVITENG+V +CDYQRPVQHL+AVAELPE
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSS-SSGDFLSCFDSFGLSDSETVRKRMEQIATAQ
        NDIDGA+ RPSEKPRIHPTEAA+ ED +EVEQS+P S LRG L+PPLGIPF S SVGGA K LPVGSS SS DFLSC+DS GLSDSETVRKRMEQIATAQ
Subjt:  NDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSS-SSGDFLSCFDSFGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVS+EC +ILNNTLDVYLKQLIKSCL+LVR+RST EHTGH +QKQQNQGK+INGMWPSNH  VQN NG SEVLQEKSLECSASLLDFKVAMELNPKQ
Subjt:  GLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPL+LEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein8.7e-3631.4Show/hide
Query:  SRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRENGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK C + +GREN+ LHN+L++SILKNA VAK+PPP                        
Subjt:  SRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRENGP

Query:  EQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPENDIDGA
              +P ++    ++ + V   SP K RS    KFRDRPSPLGP GK   L    T T D                                      
Subjt:  EQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPENDIDGA

Query:  LQRPSEKPRIHPTEAAIREDEEEVEQ--SDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQGLEGV
         +  S+  R+ P E    ED EEVEQ    PS   R  L  PLG+ F   S   AR +   G +      +C  S  L D  T+R R+E+    +G++ +
Subjt:  LQRPSEKPRIHPTEAAIREDEEEVEQ--SDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQIATAQGLEGV

Query:  SMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQLGEDW
        SM+ +++LN  L+ Y+++LI+ CL L                                             Q+K    + S+LDF  AME+NP+ LGE+W
Subjt:  SMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPKQLGEDW

Query:  PLLLEKISMRAFEE
        P+ LEKI  RA EE
Subjt:  PLLLEKISMRAFEE

AT2G24530.1 unknown protein2.7e-10651.42Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQ  Q  RI + +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHNQLI+SIL+NA VAK+PPP + +GHS       +   
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRG-KFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELP
         R +G EQ+G+  PN +Q  PVW NGVL  SP K RS ++  K RDRPSPLG NGK   + +Q    ED    V  ENG     DYQR  +++       
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRG-KFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGALQRPSEKPRIHPTE----AAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQI
         ++ DG   RP EKPRI   E     ++R+D+ + EQ+  +  +   L+ PLGIPF S SVGG+ +T+PV  S++ + +SC+DS GL D E +RKRME I
Subjt:  ENDIDGALQRPSEKPRIHPTE----AAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQI

Query:  ATAQGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTG-HSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAME
        A AQGLEGVSMEC+  LNN LDVYLK+LI SC DLV ARST    G   + KQQ+Q K++NG+WP+N L++Q  NG S++ Q+     S S+LDF+ AME
Subjt:  ATAQGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTG-HSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAME

Query:  LNPKQLGEDWPLLLEKISMRAFEE
        LNP+QLGEDWP L E+IS+R+FEE
Subjt:  LNPKQLGEDWPLLLEKISMRAFEE

AT4G31440.1 unknown protein3.0e-8143.87Show/hide
Query:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC
        MQ  Q  RID+ +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK+C R+LGREN+ LHN+LI+SIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSC

Query:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE
         +E+GPE++ S  P+  +      NGVL++   +  +      RD+P PLG NGK                        +    Y RP ++         
Subjt:  CRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGALQRPSEKPRIHPTEAAI----REDEEEVE-QSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQI
        ++ D A   P+E+  +   +       R+DE +V   S P       ++ PLGIPF S SVGG R+T+PV +S++   +SC+DS GLSD+E +RKRME I
Subjt:  NDIDGALQRPSEKPRIHPTEAAI----REDEEEVE-QSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFDSFGLSDSETVRKRMEQI

Query:  ATAQGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTG-HSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAME
        A  QGL GVS ECS +LNN LD+YLK+L+KSC+DL  ARS     G HS++KQQ++ +++NG+  +N   +Q  N  S++ +E+    S SLLDF+VAME
Subjt:  ATAQGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTG-HSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAME

Query:  LNPKQLGEDWPLLLEKISMRAFEE
        LNP QLGEDWPLL E+IS+  FEE
Subjt:  LNPKQLGEDWPLLLEKISMRAFEE

AT4G33890.1 unknown protein1.4e-4133.57Show/hide
Query:  QQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK C++ +GR+N+ LHN+LI+SI+KNAC+AK+PP I   G   S ++         
Subjt:  QQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRE

Query:  NGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPENDI
        NG  +  S      Q  P+  +   S S  K RS    K RDRPSPLGP GK   L   +T  E+  SK                    Q+  EL     
Subjt:  NGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPENDI

Query:  DGALQRPSEKPRIHPTEAAIREDEEEVEQ---SDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDF--LSCFDSFGLSDSETVRKRMEQIATA
             RP       P E    E+ EEVEQ     PS   R  L  PLG+  +S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    
Subjt:  DGALQRPSEKPRIHPTEAAIREDEEEVEQ---SDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDF--LSCFDSFGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPK
        +GL+ ++M+  S+LN+ LDV++++LI+ CL L   R   +                         RV+  N   +  Q+       S+ DF+  MELN +
Subjt:  QGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPK

Query:  QLGEDWPLLLEKISMRAFEE
         LGEDWP+ +EKI  RA ++
Subjt:  QLGEDWPLLLEKISMRAFEE

AT4G33890.2 unknown protein1.4e-4133.57Show/hide
Query:  QQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK C++ +GR+N+ LHN+LI+SI+KNAC+AK+PP I   G   S ++         
Subjt:  QQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRENVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRE

Query:  NGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPENDI
        NG  +  S      Q  P+  +   S S  K RS    K RDRPSPLGP GK   L   +T  E+  SK                    Q+  EL     
Subjt:  NGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTEDIGSKVITENGHVTMCDYQRPVQHLQAVAELPENDI

Query:  DGALQRPSEKPRIHPTEAAIREDEEEVEQ---SDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDF--LSCFDSFGLSDSETVRKRMEQIATA
             RP       P E    E+ EEVEQ     PS   R  L  PLG+  +S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    
Subjt:  DGALQRPSEKPRIHPTEAAIREDEEEVEQ---SDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDF--LSCFDSFGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPK
        +GL+ ++M+  S+LN+ LDV++++LI+ CL L   R   +                         RV+  N   +  Q+       S+ DF+  MELN +
Subjt:  QGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLDFKVAMELNPK

Query:  QLGEDWPLLLEKISMRAFEE
         LGEDWP+ +EKI  RA ++
Subjt:  QLGEDWPLLLEKISMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGGAAATTGGAAGAAGGAGGAAAAAAGAAAAGAAAAGACCGAGAGGGAGAAGCAAAGCAAGAGGGCGAAGGTGGGAGGATTGGAACCCATTCACTGCAACCTTTG
GTGTTCTTCGCTGCAAATTCAATTCCTCCAAATTTCTTCAATCACTCAAAATCCCCCCTTCGATTTTCATCTCCGTTCTTCAATTCCCCCTTTTCTTCTTCTTCTTCAAC
TTCCCCTTTCCCCTTTTCACTTCACTGATCCCCTTCTCCCCACTCCCAACTCCCCAAATTCCACTCCACAATCGAGGGTTTCTGTTCTTCTCCGATTCCCGCCTCTCCCG
CTCTTCCAGGCTCCTGTTCTAAACCTACCCTTTTCGCTTCTCGCAAGTTCTGTTTCTTTTTTTCTGTTCCCCCAAAATGTACAGCAATTCAGCTTCCATTTGAGTTACTG
GGGTTTGGAGGTTTGGTTTACAGTGTATGAACCGCTTTCTGGAGAAATGCAACCTCAGCAGAGCTCCAGAATTGATGTAGGCGACTTGAAAGCTCAGATAGTTAAGAAAC
TTGGGAACGATAAGTCTAAGCGGTACTTCTTCTACTTGAGCAGATTTTTGGGTCAGAAGCTGAGTAAGGTTGAGTTTGATAAGGCGTGTGTTCGCGTGCTAGGACGGGAG
AATGTTCAGCTACACAATCAGTTGATTAAGTCGATATTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATTAATGTGTCTGGACATTCACAATCTGTGCTACAAGC
TTCGACCAACTCTTGTTGTAGGGAAAATGGCCCTGAACAAGCTGGATCTGCTTTTCCAAATCAGAATCAAACTATGCCAGTTTGGCCAAATGGGGTTCTTTCGGAATCCC
CTTGGAAGGGGAGATCTGTTTTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTGGGTCCAAATGGAAAAACTACTTGTCTTTCGTATCAATCGACAGGTACTGAAGAT
ATCGGTAGCAAAGTTATTACAGAGAATGGTCATGTAACCATGTGTGACTATCAGAGACCGGTGCAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGG
AGCACTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCGATTCGTGAAGATGAAGAAGAGGTTGAACAGTCGGATCCCTCAAGCTTTCTGAGAGGTT
CTCTAGTTCCACCGCTCGGCATTCCGTTTCTCTCAGGTAGCGTTGGTGGGGCACGCAAGACCTTGCCAGTTGGTAGCAGTAGTAGTGGTGATTTTCTGAGTTGTTTTGAC
AGTTTTGGATTGTCTGATTCAGAGACAGTTAGAAAACGCATGGAGCAAATAGCAACTGCGCAAGGGCTTGAAGGAGTTTCTATGGAATGTTCTAGCATCCTGAATAATAC
TTTGGATGTGTACCTGAAGCAATTGATCAAGTCTTGCCTTGATTTGGTTAGAGCAAGGTCTACATTCGAACATACGGGACACTCCGTCCAGAAGCAACAGAATCAAGGAA
AGGTTATAAATGGTATGTGGCCCAGTAACCACCTGCGTGTACAGAATGGAAATGGGTGGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGTTCAGCATCATTGCTCGAT
TTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTATTGGAAAAAATTAGTATGCGTGCCTTTGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGGAAATTGGAAGAAGGAGGAAAAAAGAAAAGAAAAGACCGAGAGGGAGAAGCAAAGCAAGAGGGCGAAGGTGGGAGGATTGGAACCCATTCACTGCAACCTTTG
GTGTTCTTCGCTGCAAATTCAATTCCTCCAAATTTCTTCAATCACTCAAAATCCCCCCTTCGATTTTCATCTCCGTTCTTCAATTCCCCCTTTTCTTCTTCTTCTTCAAC
TTCCCCTTTCCCCTTTTCACTTCACTGATCCCCTTCTCCCCACTCCCAACTCCCCAAATTCCACTCCACAATCGAGGGTTTCTGTTCTTCTCCGATTCCCGCCTCTCCCG
CTCTTCCAGGCTCCTGTTCTAAACCTACCCTTTTCGCTTCTCGCAAGTTCTGTTTCTTTTTTTCTGTTCCCCCAAAATGTACAGCAATTCAGCTTCCATTTGAGTTACTG
GGGTTTGGAGGTTTGGTTTACAGTGTATGAACCGCTTTCTGGAGAAATGCAACCTCAGCAGAGCTCCAGAATTGATGTAGGCGACTTGAAAGCTCAGATAGTTAAGAAAC
TTGGGAACGATAAGTCTAAGCGGTACTTCTTCTACTTGAGCAGATTTTTGGGTCAGAAGCTGAGTAAGGTTGAGTTTGATAAGGCGTGTGTTCGCGTGCTAGGACGGGAG
AATGTTCAGCTACACAATCAGTTGATTAAGTCGATATTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATTAATGTGTCTGGACATTCACAATCTGTGCTACAAGC
TTCGACCAACTCTTGTTGTAGGGAAAATGGCCCTGAACAAGCTGGATCTGCTTTTCCAAATCAGAATCAAACTATGCCAGTTTGGCCAAATGGGGTTCTTTCGGAATCCC
CTTGGAAGGGGAGATCTGTTTTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTGGGTCCAAATGGAAAAACTACTTGTCTTTCGTATCAATCGACAGGTACTGAAGAT
ATCGGTAGCAAAGTTATTACAGAGAATGGTCATGTAACCATGTGTGACTATCAGAGACCGGTGCAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGG
AGCACTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCGATTCGTGAAGATGAAGAAGAGGTTGAACAGTCGGATCCCTCAAGCTTTCTGAGAGGTT
CTCTAGTTCCACCGCTCGGCATTCCGTTTCTCTCAGGTAGCGTTGGTGGGGCACGCAAGACCTTGCCAGTTGGTAGCAGTAGTAGTGGTGATTTTCTGAGTTGTTTTGAC
AGTTTTGGATTGTCTGATTCAGAGACAGTTAGAAAACGCATGGAGCAAATAGCAACTGCGCAAGGGCTTGAAGGAGTTTCTATGGAATGTTCTAGCATCCTGAATAATAC
TTTGGATGTGTACCTGAAGCAATTGATCAAGTCTTGCCTTGATTTGGTTAGAGCAAGGTCTACATTCGAACATACGGGACACTCCGTCCAGAAGCAACAGAATCAAGGAA
AGGTTATAAATGGTATGTGGCCCAGTAACCACCTGCGTGTACAGAATGGAAATGGGTGGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGTTCAGCATCATTGCTCGAT
TTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTATTGGAAAAAATTAGTATGCGTGCCTTTGAGGAATAA
Protein sequenceShow/hide protein sequence
MLGNWKKEEKRKEKTEREKQSKRAKVGGLEPIHCNLWCSSLQIQFLQISSITQNPPFDFHLRSSIPPFLLLLQLPLSPFHFTDPLLPTPNSPNSTPQSRVSVLLRFPPLP
LFQAPVLNLPFSLLASSVSFFLFPQNVQQFSFHLSYWGLEVWFTVYEPLSGEMQPQQSSRIDVGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKACVRVLGRE
NVQLHNQLIKSILKNACVAKTPPPINVSGHSQSVLQASTNSCCRENGPEQAGSAFPNQNQTMPVWPNGVLSESPWKGRSVLRGKFRDRPSPLGPNGKTTCLSYQSTGTED
IGSKVITENGHVTMCDYQRPVQHLQAVAELPENDIDGALQRPSEKPRIHPTEAAIREDEEEVEQSDPSSFLRGSLVPPLGIPFLSGSVGGARKTLPVGSSSSGDFLSCFD
SFGLSDSETVRKRMEQIATAQGLEGVSMECSSILNNTLDVYLKQLIKSCLDLVRARSTFEHTGHSVQKQQNQGKVINGMWPSNHLRVQNGNGWSEVLQEKSLECSASLLD
FKVAMELNPKQLGEDWPLLLEKISMRAFEE