; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G017580 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G017580
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationCG_Chr05:29920267..29921517
RNA-Seq ExpressionClCG05G017580
SyntenyClCG05G017580
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]8.5e-22192.58Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQ SNNSP
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFP+QNQS P+W NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+ED SSKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PLSFL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSC ELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWPLLLEK+ MRAFEE
Subjt:  GEDWPLLLEKVFMRAFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]4.2e-22092.82Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSP
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFP+QNQS P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+ED SSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PL FL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSC ELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWPLLLEK+ MRAFEE
Subjt:  GEDWPLLLEKVFMRAFEE

XP_022142878.1 uncharacterized protein LOC111012883 [Momordica charantia]7.0e-21589.47Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQ SNNSP
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFP+QNQ++P+WSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTED  SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        ENDI+ AVQRPSEKPRIHPTEAAILE+GEEVEQS+PLSFL GPLLPPLGIPFCSASVGGAR+ALP+ +SGDF SCYDSIGLSD+ETVRKRMEQIATAQGL
Subjt:  ENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN-INGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        EGVSMEC NILN+TLD+YLKQLIKSC ELVR+RST EHTGHPIQKQQNQGKVINGMWP+NHLRVQN  NGR EVLQEKSL+CSVSLLDFKVAMELNPKQL
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN-INGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWPLLLEK+ MR FEE
Subjt:  GEDWPLLLEKVFMRAFEE

XP_022976270.1 uncharacterized protein LOC111476715 [Cucurbita maxima]4.1e-21591.17Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQ + SSRIDLGDLKAQIVKKL NDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQ SNN+P
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CRED PEQTGSAFP+QNQSIP+W+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDID +VQRPS KPRI PTEA+ILEEGEEVEQS+PLSFL GPLLPPLGIPFCSASVGGARKALPVSSSG   DFLSCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSC ELVR RSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN NGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKVFMRAFEE
        LGEDWPLLLEK+ MRAFEE
Subjt:  LGEDWPLLLEKVFMRAFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]1.1e-22394.02Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP+HSSRIDLGDLKAQIVKKL ND+SKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQPSN SP
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFP+QNQSIP+WSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTED +SKVITENGNVTMCDYQRPVQHLQAVAELPE
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDID AV RPSEKPRIHPTEAAILEEGEEVEQS+PLSFL GPLLPPLGIPFCSASVGGARKALPV+SSG  DFLSCYDSIGLSDS TVRKRMEQIATAQG
Subjt:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSC ELVRARSTFEHTGHPIQKQQNQGKV+N MWPTNHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWPLLLEK+ MRAFEE
Subjt:  GEDWPLLLEKVFMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein4.1e-22192.58Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQ SNNSP
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFP+QNQS P+W NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+ED SSKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PLSFL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSC ELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWPLLLEK+ MRAFEE
Subjt:  GEDWPLLLEKVFMRAFEE

A0A1S4E5S7 uncharacterized protein LOC1035037572.0e-22092.82Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSP
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFP+QNQS P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+ED SSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PL FL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSC ELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWPLLLEK+ MRAFEE
Subjt:  GEDWPLLLEKVFMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein2.0e-22092.82Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL  S NSP
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFP+QNQS P+W NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+ED SSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG
        NDID AVQRPSEKPRIHPTEAAILEEGEEVEQS+PL FL GPLLPPLGIPFCSASVGGARKALPVSSSG  DFLSCYDSIGLSDSETVRKRMEQIA+AQG
Subjt:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG--DFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSC ELVRARSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWPLLLEK+ MRAFEE
Subjt:  GEDWPLLLEKVFMRAFEE

A0A6J1CPD1 uncharacterized protein LOC1110128833.4e-21589.47Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQP+HSSRIDLGDLKAQIVKKL NDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQ SNNSP
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFP+QNQ++P+WSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTED  SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        ENDI+ AVQRPSEKPRIHPTEAAILE+GEEVEQS+PLSFL GPLLPPLGIPFCSASVGGAR+ALP+ +SGDF SCYDSIGLSD+ETVRKRMEQIATAQGL
Subjt:  ENDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN-INGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        EGVSMEC NILN+TLD+YLKQLIKSC ELVR+RST EHTGHPIQKQQNQGKVINGMWP+NHLRVQN  NGR EVLQEKSL+CSVSLLDFKVAMELNPKQL
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN-INGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWPLLLEK+ MR FEE
Subjt:  GEDWPLLLEKVFMRAFEE

A0A6J1IIZ9 uncharacterized protein LOC1114767152.0e-21591.17Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQ + SSRIDLGDLKAQIVKKL NDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQ SNN+P
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CRED PEQTGSAFP+QNQSIP+W+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDID +VQRPS KPRI PTEA+ILEEGEEVEQS+PLSFL GPLLPPLGIPFCSASVGGARKALPVSSSG   DFLSCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDEAVQRPSEKPRIHPTEAAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSG---DFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSC ELVR RSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQN NGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKVFMRAFEE
        LGEDWPLLLEK+ MRAFEE
Subjt:  LGEDWPLLLEKVFMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.6e-3931.55Show/hide
Query:  SRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGP
        SR++  ++KA I +K+ + ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGP

Query:  EQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDEA
              +P ++    L+ + V P SPRK RS    KFRDRPSPLGP GK   L    T T D S                                    
Subjt:  EQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDEA

Query:  VQRPSEKPRIHPTEAAILEEGEEVEQ--SNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSM
              K +  P E   +E+GEEVEQ   +P      PL  PLG+ F   S    +      +  +  +C  S  L D  T+R R+E+    +G++ +SM
Subjt:  VQRPSEKPRIHPTEAAILEEGEEVEQ--SNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSM

Query:  ECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPL
        +  N+LN  L+ Y+++LI+ C  L                                             Q+K    +VS+LDF  AME+NP+ LGE+WP+
Subjt:  ECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPL

Query:  LLEKVFMRAFEE
         LEK+  RA EE
Subjt:  LLEKVFMRAFEE

AT2G24530.1 unknown protein1.4e-11252.49Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQ     RI L +LK  IVKK   ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELP
         R DG EQ+G+  P+ +Q  P+WSNGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED    V  ENG     DYQR  +++       
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDEAVQRPSEKPRIHPTE---AAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATA
         ++ D    RP EKPRI   E   A  + + +  E+   ++    PL+ PLGIPFCSASVGG+ + +PVS++ + +SCYDS GL D E +RKRME IA A
Subjt:  ENDIDEAVQRPSEKPRIHPTE---AAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNP
        QGLEGVSMEC   LNN LDVYLK+LI SCF+LV ARST    G   I KQQ+Q K++NG+WPTN L++Q  NG S++ Q+     SVS+LDF+ AMELNP
Subjt:  QGLEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKVFMRAFEE
        +QLGEDWP L E++ +R+FEE
Subjt:  KQLGEDWPLLLEKVFMRAFEE

AT4G31440.1 unknown protein1.8e-8345.58Show/hide
Query:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP
        MQ     RIDL +LK  IVKK+  ++S RYF+YL RFL QKL+K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSP

Query:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE
         +EDGPE++ S  P   ++    SNGVL    R G    R   RD+P PLG NGK+                       +    Y RP ++         
Subjt:  CREDGPEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDEAVQRPSEKPRIHPTE--AAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG
        ++ D A   P+E+  +   +  AA +   +E  Q   LS  + P++ PLGIPFCSASVGG R+ +PVS+S   +SCYDS GLSD+E +RKRME IA  QG
Subjt:  NDIDEAVQRPSEKPRIHPTE--AAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        L GVS EC  +LNN LD+YLK+L+KSC +L  ARS     G H ++KQQ++ +++NG+   N   +Q  N  S++ +E+    SVSLLDF+VAMELNP Q
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKVFMRAFEE
        LGEDWPLL E++ +  FEE
Subjt:  LGEDWPLLLEKVFMRAFEE

AT4G33890.1 unknown protein6.8e-4333.01Show/hide
Query:  SSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDG
        SSR+D  ++KA I +++ N +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N    +   
Subjt:  SSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDG

Query:  PEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDE
                   +Q  PL  +     S RK RS    K RDRPSPLGP GK   L   +T  E+  SK                    Q+  EL       
Subjt:  PEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDE

Query:  AVQRPSEKPRIHPTEAAILEEGEEVEQ---SNPLSFLSGPLLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG
           RP       P E   +EEGEEVEQ    +P      PL  PLG+   S   G  RK++      S S +  +C ++  L D+ T+R R+E+    +G
Subjt:  AVQRPSEKPRIHPTEAAILEEGEEVEQ---SNPLSFLSGPLLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        L+ ++M+  ++LN+ LDV++++LI+ C  L   R   +                         RV+ +N   +  Q+      VS+ DF+  MELN + L
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWP+ +EK+  RA ++
Subjt:  GEDWPLLLEKVFMRAFEE

AT4G33890.2 unknown protein6.8e-4333.01Show/hide
Query:  SSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDG
        SSR+D  ++KA I +++ N +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N    +   
Subjt:  SSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDG

Query:  PEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDE
                   +Q  PL  +     S RK RS    K RDRPSPLGP GK   L   +T  E+  SK                    Q+  EL       
Subjt:  PEQTGSAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDE

Query:  AVQRPSEKPRIHPTEAAILEEGEEVEQ---SNPLSFLSGPLLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG
           RP       P E   +EEGEEVEQ    +P      PL  PLG+   S   G  RK++      S S +  +C ++  L D+ T+R R+E+    +G
Subjt:  AVQRPSEKPRIHPTEAAILEEGEEVEQ---SNPLSFLSGPLLPPLGIPFCSASVGGARKALP----VSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        L+ ++M+  ++LN+ LDV++++LI+ C  L   R   +                         RV+ +N   +  Q+      VS+ DF+  MELN + L
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCFELVRARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKVFMRAFEE
        GEDWP+ +EK+  RA ++
Subjt:  GEDWPLLLEKVFMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTGAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTAGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATT
TTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGA
TCTGCCTTCCCACATCAGAATCAGAGTATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCGCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGAGATAGGCC
AAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATGGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTG
ACTATCAGAGACCGGTTCAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGAAGCAGTTCAGCGGCCATCCGAAAAACCAAGGATACATCCAACAGAA
GCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCAAATCCGTTAAGCTTCCTGAGTGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGG
TGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTCCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTG
CAACTGCACAAGGACTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATGTATACCTGAAGCAATTGATAAAGTCTTGTTTTGAGTTGGTGAGA
GCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATATCAA
TGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTT
TGCTGTTGGAGAAAGTTTTTATGCGTGCCTTCGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTGAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTAGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATT
TTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTTTCAGGACACGCTCAATCTGTTCTACAACCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGA
TCTGCCTTCCCACATCAGAATCAGAGTATACCACTTTGGTCAAATGGAGTTCTTCCAGTATCCCCGCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGAGATAGGCC
AAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATGGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTG
ACTATCAGAGACCGGTTCAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGATATAGATGAAGCAGTTCAGCGGCCATCCGAAAAACCAAGGATACATCCAACAGAA
GCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCAAATCCGTTAAGCTTCCTGAGTGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGG
TGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGTGATTTCCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTG
CAACTGCACAAGGACTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATGTATACCTGAAGCAATTGATAAAGTCTTGTTTTGAGTTGGTGAGA
GCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGTGTACAGAATATCAA
TGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAGATTGGCCTT
TGCTGTTGGAGAAAGTTTTTATGCGTGCCTTCGAGGAATAA
Protein sequenceShow/hide protein sequence
MQPEHSSRIDLGDLKAQIVKKLRNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQPSNNSPCREDGPEQTG
SAFPHQNQSIPLWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDGSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDEAVQRPSEKPRIHPTE
AAILEEGEEVEQSNPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSMECPNILNNTLDVYLKQLIKSCFELVR
ARSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNINGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKVFMRAFEE