; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G017530 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G017530
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationCG_Chr05:29855614..29858020
RNA-Seq ExpressionClCG05G017530
SyntenyClCG05G017530
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]6.8e-21792.93Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQ SNNSP
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPE TGSAFPNQNQ+ P+W NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDSSSKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPLSFL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQNSN +SE LQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWPLLLEK
Subjt:  GEDWPLLLEK

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]7.5e-21692.93Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSP
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPE TGSAFPNQNQ+ P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPL FL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN+N +SE LQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWPLLLEK
Subjt:  GEDWPLLLEK

XP_022142878.1 uncharacterized protein LOC111012883 [Momordica charantia]1.1e-21190.24Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQ SNNSP
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPE TGSAFPNQNQ +P+WSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        ENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQ DPLSFL GPLLPPLGIPFCSASVGGAR+ALP+ +SGDF SCYDSIGLSD+ETVRKRMEQIATAQGL
Subjt:  ENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN-SNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        EGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGKVINGMWP+NHLRVQN SN + E LQEKSL+CSVSLLDFKVAMELNPKQL
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN-SNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWPLLLEK
Subjt:  GEDWPLLLEK

XP_022976270.1 uncharacterized protein LOC111476715 [Cucurbita maxima]3.8e-21291.73Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQ  QSSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQ SNN+P
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CRED PE TGSAFPNQNQ+IP+W+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQ DPLSFL GPLLPPLGIPFCSASVGGARKALPVSSSG   DFLSCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSN +SE L+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEK
        LGEDWPLLLEK
Subjt:  LGEDWPLLLEK

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]8.5e-22094.39Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP  SSRIDLGDLKAQIVKKLGND+SKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQPSN SP
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPE TGSAFPNQNQ+IP+WSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDS+SKVITENGNVTMCDYQRPVQHLQAVAELPE
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDIDGAV RPSEKPRIHPTEAAILEEGEEVEQ DPLSFL GPLLPPLGIPFCSASVGGARKALPV+SSG  DFLSCYDSIGLSDS TVRKRMEQIATAQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKV+N MWPTNHLRVQNSN +SE LQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWPLLLEK
Subjt:  GEDWPLLLEK

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein3.3e-21792.93Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQ SNNSP
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPE TGSAFPNQNQ+ P+W NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDSSSKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPLSFL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQNSN +SE LQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWPLLLEK
Subjt:  GEDWPLLLEK

A0A1S4E5S7 uncharacterized protein LOC1035037573.6e-21692.93Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSP
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPE TGSAFPNQNQ+ P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPL FL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN+N +SE LQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWPLLLEK
Subjt:  GEDWPLLLEK

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein3.6e-21692.93Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSP
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPE TGSAFPNQNQ+ P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQ DPL FL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN+N +SE LQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWPLLLEK
Subjt:  GEDWPLLLEK

A0A6J1CPD1 uncharacterized protein LOC1110128835.4e-21290.24Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP  SSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQ SNNSP
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPE TGSAFPNQNQ +P+WSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        ENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQ DPLSFL GPLLPPLGIPFCSASVGGAR+ALP+ +SGDF SCYDSIGLSD+ETVRKRMEQIATAQGL
Subjt:  ENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN-SNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        EGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGKVINGMWP+NHLRVQN SN + E LQEKSL+CSVSLLDFKVAMELNPKQL
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN-SNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWPLLLEK
Subjt:  GEDWPLLLEK

A0A6J1IIZ9 uncharacterized protein LOC1114767151.9e-21291.73Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQ  QSSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQ SNN+P
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CRED PE TGSAFPNQNQ+IP+W+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQ DPLSFL GPLLPPLGIPFCSASVGGARKALPVSSSG   DFLSCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSN +SE L+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEK
        LGEDWPLLLEK
Subjt:  LGEDWPLLLEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein8.4e-4031.93Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGP

Query:  EHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGA
              +P ++    L+ + V P SPRK RS    KFRDRPSPLGP GK   L   +T  ++S SK                 Q L              
Subjt:  EHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGA

Query:  VQRPSEKPRIHPTEAAILEEGEEVEQL--DPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSM
                   P E   +E+GEEVEQ+   P      PL  PLG+ F   S    +      +  +  +C  S  L D  T+R R+E+    +G++ +SM
Subjt:  VQRPSEKPRIHPTEAAILEEGEEVEQL--DPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSM

Query:  ECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPL
        +  N+LN  L+ Y+++LI+ CL L                                             Q+K    +VS+LDF  AME+NP+ LGE+WP+
Subjt:  ECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPL

Query:  LLEK
         LEK
Subjt:  LLEK

AT2G24530.1 unknown protein2.6e-11052.78Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQ  Q  RI L +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP
         R DG E +G+  PN +Q+ P+WSNGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +++       
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATA
         ++ DG   RP EKPRI   E   A  + + +  E+   ++    PL+ PLGIPFCSASVGG+ + +PVS++ + +SCYDS GL D E +RKRME IA A
Subjt:  ENDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNP
        QGLEGVSMEC   LNN LDVYLK+LI SC +LV ARST    G   I KQQ+Q K++NG+WPTN L++Q  N  S+  Q+     SVS+LDF+ AMELNP
Subjt:  QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEK
        +QLGEDWP L E+
Subjt:  KQLGEDWPLLLEK

AT4G31440.1 unknown protein1.5e-8446.72Show/hide
Query:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQ  Q  RIDL +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
         +EDGPE + S  P+  +N    SNGVL    R G    R   RD+P PLG NGK+                       +    Y RP ++         
Subjt:  CREDGPEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTE--AAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG
        ++ D A   P+E+  +   +  AA +   +E  Q+  LS  + P++ PLGIPFCSASVGG R+ +PVS+S   +SCYDS GLSD+E +RKRME IA  QG
Subjt:  NDIDGAVQRPSEKPRIHPTE--AAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQ
        L GVS EC  +LNN LD+YLK+L+KSC++L  ARS     G H ++KQQ++ +++NG+   N   +Q SN  S+  +E+    SVSLLDF+VAMELNP Q
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEK
        LGEDWPLL E+
Subjt:  LGEDWPLLLEK

AT4G33890.1 unknown protein3.3e-4434.15Show/hide
Query:  SSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDG
        SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N        
Subjt:  SSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDG

Query:  PEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDG
                   +Q  PL  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL       
Subjt:  PEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDG

Query:  AVQRPSEKPRIHPTEAAILEEGEEVEQL---DPLSFLSGPLLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG
           RP       P E   +EEGEEVEQ+    P      PL  PLG+   S   G  RK++      S S +  +C ++  L D+ T+R R+E+    +G
Subjt:  AVQRPSEKPRIHPTEAAILEEGEEVEQL---DPLSFLSGPLLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        L+ ++M+  ++LN+ LDV++++LI+ CL L   R   +                         RV+  N Q  + Q+      VS+ DF+  MELN + L
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWP+ +EK
Subjt:  GEDWPLLLEK

AT4G33890.2 unknown protein3.3e-4434.15Show/hide
Query:  SSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDG
        SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N        
Subjt:  SSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDG

Query:  PEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDG
                   +Q  PL  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL       
Subjt:  PEHTGSAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDG

Query:  AVQRPSEKPRIHPTEAAILEEGEEVEQL---DPLSFLSGPLLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG
           RP       P E   +EEGEEVEQ+    P      PL  PLG+   S   G  RK++      S S +  +C ++  L D+ T+R R+E+    +G
Subjt:  AVQRPSEKPRIHPTEAAILEEGEEVEQL---DPLSFLSGPLLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL
        L+ ++M+  ++LN+ LDV++++LI+ CL L   R   +                         RV+  N Q  + Q+      VS+ DF+  MELN + L
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEK
        GEDWP+ +EK
Subjt:  GEDWPLLLEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCCGCAAAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATT
TTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACATACTGGA
TCTGCCTTCCCAAATCAGAATCAGAATATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCCCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTCGGGATAGGCC
AAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTG
ACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAA
GCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTTGGATCCCTTAAGCTTCCTGAGTGGTCCTCTACTTCCGCCTCTTGGTATTCCATTTTGTTCAGCTAGTGTTGG
TGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTG
CAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTAGATGTATACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGA
GCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAA
TGTGCAATCTGAATTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTT
TGCTGTTGGAGAAAGGCTTTAAAATTGACCTGGGATGTCTCACCGTTCTTCCCATCTCACTTGGTCTCTCCCTTTTGGTTCAGGCTCCAAGTTTCCATGCTAAAATATTC
ATTCATGTTGGCCCCATTTGCCACTACCATATGAATCACCTGGGTGAGAGCTTCAGGTGCTCTCAATTGCAAATTTTCTTTCCCGCTCAGATAACCTACAGTTCTATGTG
CTGCAAAATATTTTCTTCATTTTTAGCTGAATATGGTCGTATTATGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTCCGCAAAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATT
TTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACATACTGGA
TCTGCCTTCCCAAATCAGAATCAGAATATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCCCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTCGGGATAGGCC
AAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTG
ACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAA
GCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTTGGATCCCTTAAGCTTCCTGAGTGGTCCTCTACTTCCGCCTCTTGGTATTCCATTTTGTTCAGCTAGTGTTGG
TGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTG
CAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTAGATGTATACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGA
GCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAA
TGTGCAATCTGAATTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTT
TGCTGTTGGAGAAAGGCTTTAAAATTGACCTGGGATGTCTCACCGTTCTTCCCATCTCACTTGGTCTCTCCCTTTTGGTTCAGGCTCCAAGTTTCCATGCTAAAATATTC
ATTCATGTTGGCCCCATTTGCCACTACCATATGAATCACCTGGGTGAGAGCTTCAGGTGCTCTCAATTGCAAATTTTCTTTCCCGCTCAGATAACCTACAGTTCTATGTG
CTGCAAAATATTTTCTTCATTTTTAGCTGAATATGGTCGTATTATGAAGTAA
Protein sequenceShow/hide protein sequence
MQPPQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEHTG
SAFPNQNQNIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTE
AAILEEGEEVEQLDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVR
ARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNVQSEFLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKGFKIDLGCLTVLPISLGLSLLVQAPSFHAKIF
IHVGPICHYHMNHLGESFRCSQLQIFFPAQITYSSMCCKIFSSFLAEYGRIMK