CuGenDBv2

CSPI03G44650 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G44650
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: SAGA-Tad1 domain-containing protein
Genome location: Chr3:38213849..38216706
RNA-Seq Expression: CSPI03G44650
Synteny: CSPI03G44650
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domains: NA
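
The genome location above uses a chromosome:start..end notation. As a minimal sketch, it can be split into its parts and the locus span computed as follows. The function name parse_location is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr3:38213849..38216706")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus covers 2858 bp, which is longer than the mRNA sequence listed further down and would be consistent with the gene containing introns.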


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]; e value: 6.5e-237; % identity: 99.76
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CREDGPEQTGSAFPNQ QSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE
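
The % identity reported for each hit can be checked against the middle line of the alignment, where a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below assumes the "Query:" and "Subjt:" labels and the leading indentation have already been removed from each segment; the function name percent_identity is hypothetical.

```python
def percent_identity(blocks):
    """Compute % identity from (query_segment, midline_segment) pairs.

    Identity is taken as identical columns divided by all aligned columns,
    gap columns included.
    """
    aligned = 0
    identical = 0
    for query, midline in blocks:
        aligned += len(query)
        # Pad the midline in case trailing spaces were lost in copying.
        midline = midline.ljust(len(query))
        # A letter on the midline marks an identical column.
        identical += sum(1 for ch in midline[:len(query)] if ch.isalpha())
    return 100.0 * identical / aligned


# Start of the second alignment block of the first hit (one mismatch).
example = [("CREDGPEQTGSAFPNQTQSKPIWPNGVLPV",
            "CREDGPEQTGSAFPNQ QSKPIWPNGVLPV")]
print(round(percent_identity(example), 2))
```

For the first GenBank hit, the midline appears to contain a single non-identical column across 418 aligned positions, and 417/418 rounds to the listed 99.76.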

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]; e value: 7.0e-231; % identity: 97.85
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CREDGPEQTGSAFPNQ QSKPIWPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTGSEDSSSKVITENGNVTLCDYQRPV+YLQSVAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

XP_022976270.1 uncharacterized protein LOC111476715 [Cucurbita maxima]; e value: 1.2e-214; % identity: 90.93
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CRED PEQTGSAFPNQ QS PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK+ CLSYQSTG+ED   KVITENGNVT+CDYQRPV+ LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECP+ILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPLLLEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

XP_023536522.1 uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo]; e value: 7.8e-214; % identity: 90.45
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQ Q  SRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CRED PEQTGSAFPNQ QS PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK+ CLSYQSTG+ED   KVITENGNVT+CDYQRPV+ LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GL+GVS+ECP+ILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPLLLEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]; e value: 1.6e-222; % identity: 93.54
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP INASGHAQSVLQ SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CR+DGPEQTGSAFPNQ QS PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDS+SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAV RPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPV+SSGSSDFLSCYDSIGLSDS TVRKRMEQIA+AQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKI MRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LGS9 Uncharacterized protein; e value: 3.2e-237; % identity: 99.76
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CREDGPEQTGSAFPNQ QSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

A0A1S4E5S7 uncharacterized protein LOC103503757; e value: 3.4e-231; % identity: 97.85
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CREDGPEQTGSAFPNQ QSKPIWPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTGSEDSSSKVITENGNVTLCDYQRPV+YLQSVAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein; e value: 3.4e-231; % identity: 97.85
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CREDGPEQTGSAFPNQ QSKPIWPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTGSEDSSSKVITENGNVTLCDYQRPV+YLQSVAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

A0A6J1FAS2 uncharacterized protein LOC111443621; e value: 3.2e-213; % identity: 90.21
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQ Q SSRIDL DLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CRED PEQTGSAFPNQ Q  PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK+ CLSYQSTG+ED   KVITENGNVT+CDYQRPV+ LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFL CYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECP+ILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPLLLEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

A0A6J1IIZ9 uncharacterized protein LOC111476715; e value: 5.8e-215; % identity: 90.93
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
        CRED PEQTGSAFPNQ QS PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK+ CLSYQSTG+ED   KVITENGNVT+CDYQRPV+ LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECP+ILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPLLLEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G14850.1 unknown protein; e value: 8.4e-41; % identity: 32.13
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGP

Query:  EQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPENDIDGA
                 +   K ++ + V P SPRK RS    KFRDRPSPLGP GK   L   +T +++S SK                                  
Subjt:  EQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPENDIDGA

Query:  VQRPSEKPRIHPTEAAILEEGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGV
         QR        P E   +E+GEEVEQ    P    R PL  PLG+ F        +     S+    +  +C  S  L D  T+R R+E+    +G++ +
Subjt:  VQRPSEKPRIHPTEAAILEEGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGV

Query:  SMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDW
        SM+  ++LN  L+ Y+++LI+ CL L                                             Q+K    +VS+LDF  AME+NP+ LGE+W
Subjt:  SMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDW

Query:  PLLLEKISMRAFEE
        P+ LEKI  RA EE
Subjt:  PLLLEKISMRAFEE

AT2G24530.1 unknown protein; e value: 1.3e-113; % identity: 53.19
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQ     RI L +LK  IVKK G ++S+RYF++L RFL QK++K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRG-KFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELP
         R DG EQ+G+  PN +Q +P+W NGVLP+SPRK RSG++  K RDRPSPLG NGK   + +Q    ED+   V  ENG     DYQR  RY+       
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRG-KFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELP

Query:  ENDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIA
         ++ DG   RP EKPRI   E   A  + + +  E+   ++    PL+ PLGIPFCSASVGG+ + +PVS+  +++ +SCYDS GL D E +RKRME IA
Subjt:  ENDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIA

Query:  SAQGLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMEL
         AQGLEGVSMEC   LNN LDVYLK+LI SC +LV ARST    G   I KQQ+Q K++NG+WPTN L++Q  NG S++ Q+     SVS+LDF+ AMEL
Subjt:  SAQGLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMEL

Query:  NPKQLGEDWPLLLEKISMRAFEE
        NP+QLGEDWP L E+IS+R+FEE
Subjt:  NPKQLGEDWPLLLEKISMRAFEE

AT4G31440.1 unknown protein; e value: 3.6e-84; % identity: 45.52
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP
        MQ     RIDL +LK  IVKK+G ++S RYF++L RFL QK++K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE
         +EDGPE++ S  P+  ++     NGVL    R G    R   RD+P PLG NGK                        +    Y RP RY         
Subjt:  CREDGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTE--AAILEEGEEVE---QSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQI
        ++ D A   P+E+  +   +  AA +   +E +    S P      P++ PLGIPFCSASVGG R+ +PVS+S ++  +SCYDS GLSD+E +RKRME I
Subjt:  NDIDGAVQRPSEKPRIHPTE--AAILEEGEEVE---QSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQI

Query:  ASAQGLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAME
        A  QGL GVS EC  +LNN LD+YLK+L+KSC++L  ARS     G H ++KQQ++ +++NG+   N   +Q SN  S++ +E+    SVSLLDF+VAME
Subjt:  ASAQGLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAME

Query:  LNPKQLGEDWPLLLEKISMRAFEE
        LNP QLGEDWPLL E+IS+  FEE
Subjt:  LNPKQLGEDWPLLLEKISMRAFEE

AT4G33890.1 unknown protein; e value: 1.3e-46; % identity: 34.76
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPENDI
                     +Q +P+  +     S RK RS    K RDRPSPLGP GK   L   +T +E+S SK                    QS  EL     
Subjt:  DGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPENDI

Query:  DGAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASA
             RP       P E   +EEGEEVEQ     P    R PL  PLG+   S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    
Subjt:  DGAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASA

Query:  QGLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ ++M+  S+LN+ LDV++++LI+ CL L   R                         T+ +R  N     +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKISMRAFEE
         LGEDWP+ +EKI  RA ++
Subjt:  QLGEDWPLLLEKISMRAFEE

AT4G33890.2 unknown protein; e value: 1.3e-46; % identity: 34.76
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPENDI
                     +Q +P+  +     S RK RS    K RDRPSPLGP GK   L   +T +E+S SK                    QS  EL     
Subjt:  DGPEQTGSAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPENDI

Query:  DGAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASA
             RP       P E   +EEGEEVEQ     P    R PL  PLG+   S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    
Subjt:  DGAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASA

Query:  QGLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ ++M+  S+LN+ LDV++++LI+ CL L   R                         T+ +R  N     +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKISMRAFEE
         LGEDWP+ +EKI  RA ++
Subjt:  QLGEDWPLLLEKISMRAFEE


Sequences
CDS sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGTGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGATAAGTCCAAGCGGTACTTCTTCTTCTTGAGCAGATT
CTTGGGTCAGAAGATGAGCAAGGTTGAATTTGATAAGGTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CTTGTGTAGCCAAGACCCCACCACCAATAAATGCTTCAGGACATGCACAATCTGTGTTACAAGCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCCGAACAAACTGGA
TCTGCCTTTCCAAATCAGACTCAGAGTAAACCAATTTGGCCAAATGGAGTTCTTCCGGTATCCCCACGGAAGGGTAGATCTGGCTTACGTGGAAAGTTTAGGGATAGGCC
AAGTCCGCTTGGTCCAAATGGAAAAAGCACATGTCTTTCATATCAATCAACTGGCTCTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCTTGTGTG
ACTATCAGAGACCAGTACGGTATCTCCAATCAGTAGCTGAGCTACCTGAAAATGACATAGATGGTGCAGTTCAACGGCCATCAGAAAAACCAAGGATACATCCAACAGAA
GCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCGGATCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGG
TGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTAGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGC
AAATTGCATCTGCACAAGGACTTGAAGGTGTTTCTATGGAATGTCCTAGCATCTTGAATAATACTCTGGATGTGTACCTGAAGCAATTGATAAAGTCTTGTCTTGAGTTG
GTGAGAGCAAGGTCTACATTTGAACATTCAGGGCACCCTATCCAGAAGCAACAAAATCAAGGGAAGGTTTTAAATGGCATGTGGCCTACTAACCACCTACGTGTACAGAA
CAGCAATGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGACT
GGCCTTTGCTGTTGGAGAAAATTTCTATGCGTGCCTTTGAGGAATAA
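
The CDS above can be translated to check it against the protein sequence listed for this gene. The following is a minimal sketch using the standard genetic code (NCBI translation table 1), with the codon table built from the conventional TCAG ordering; that this annotation used the standard table is an assumption, and the function name translate is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGCAACCTCAGCACAGCTCCAGAATTGAT"))
```

Passing the full CDS (line breaks are tolerated) should reproduce the protein sequence given at the end of this record if the standard code applies.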
mRNA sequence
TCGCGATGATCAATACAGCTTTCTTTCCTCGATTTTTCTCGTTTTCTTTTTTTTTTTTCTTTCTGGGGGTTTAGTGATTTTAGCTTCTTTGATCTCTAGGGTTTTGGTTT
TTGCGACTATAGATCTGGGTTTGTGTTTTATTGCTTGTTGCCTTTGTTTTTTTTTATTGAGGAATCCTTATCCAATTAAATGTTACTTAGTCATTGCTTGAATTATGATT
TCAGGCTGCTGTAAAACCCCCACTTTTTTGCTTGGTGCGCGTATTATGTATATGTGTGTGTGTATATATAATATATTTATATATATATATATATTCCTTCAAATGGTGCG
GCGTTTTAGACTTGATTTGTTCTGAATGACGTGGAAGTACAAATTCTGGGGGATTTTTGTATTGGTACAATACCACAATTCAGCTTCAATTTTAGTGACTGGGGTTTGGA
GCTACAGTTTATAGTGTATGAAACCAACCGGCCTCTGGAGAAATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGTGACTTGAAAGCTCAGATAGTTAAAAAACTTGG
AAATGATAAGTCCAAGCGGTACTTCTTCTTCTTGAGCAGATTCTTGGGTCAGAAGATGAGCAAGGTTGAATTTGATAAGGTGTGCGTTCGTGTGCTTGGAAGGGAGAATA
TTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCTTGTGTAGCCAAGACCCCACCACCAATAAATGCTTCAGGACATGCACAATCTGTGTTACAAGCTTCA
AACAACTCTCCTTGCAGGGAAGATGGCCCCGAACAAACTGGATCTGCCTTTCCAAATCAGACTCAGAGTAAACCAATTTGGCCAAATGGAGTTCTTCCGGTATCCCCACG
GAAGGGTAGATCTGGCTTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGTCCAAATGGAAAAAGCACATGTCTTTCATATCAATCAACTGGCTCTGAAGATAGCA
GCAGCAAAGTTATTACAGAGAATGGTAATGTAACCTTGTGTGACTATCAGAGACCAGTACGGTATCTCCAATCAGTAGCTGAGCTACCTGAAAATGACATAGATGGTGCA
GTTCAACGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCGGATCCCTTAAGCTTCCTGAGAGGTCCTCT
ACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTAGTGATTTTCTGAGTTGTTATGACAGTA
TTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCATCTGCACAAGGACTTGAAGGTGTTTCTATGGAATGTCCTAGCATCTTGAATAATACTCTG
GATGTGTACCTGAAGCAATTGATAAAGTCTTGTCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATTCAGGGCACCCTATCCAGAAGCAACAAAATCAAGGGAAGGT
TTTAAATGGCATGTGGCCTACTAACCACCTACGTGTACAGAACAGCAATGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCA
AAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGACTGGCCTTTGCTGTTGGAGAAAATTTCTATGCGTGCCTTTGAGGAATAAGCAGTCGCAAAGGTTTTATTT
ATTATTCTACAGCCATCCAAGGCTGGTCGTTCTGATCGTTGGGGGTTTAAAATTTACCTGGGATATCATCCCGTCCTTACCTTCTTGCTGGGGTATACTTGATCGCTCCC
ATTTGGTTCAGGCTCCAAGTTTCCATGCTAATATATTGTAACTTTATTGCCTGCCCCATTGAGGAAGTTGAATTAACTCGACTAAGAAAGAGGGAGCTTTCAGCCAGAAT
GCCCATAAAAGGTTGTTCTGTTGCATTTTTAGCATTCTTCTTCACCCCATTTGCCACTACCATGTAACTTAGCTTTTCGTGTAACTTATTTTTGAGAGAATCAAAAAAGT
TTTGCAGGGATACTGTCATTTAGTTTGGTGAAAGATGAATCGCCTGGGTGAGAGCTTCAGGTGCTCTGTATGTATCTTTAGAATCTGAGAATTTATTGGAATTTCAATAT
CAATAACAAATTAACATCCATTTCAACGCACTTCGTGGTTTTTTCCCTTGCTCTTCTATATGATTAATCAATATATAATGATTTTGGTCTCATCTTCAAACTGAAATTAG
TTATTCTTAATCCCACCTTCACGATATGAATAATGTCAGCTGCCCCTCTAGATAAATGTTAATGTGTATGTGTGGGAAGTTCAAGTCTCAGGAGCAGCACAGCCTGACTT
CAAAGTGTCTGCGACCAGATATCTTGCTTGCTACTGAACAAATCAATGCTCTACGAGTGCCATGGCCAATTGGATGCAGATAATTCTTCTTAAAGGTTAACGG
Protein sequence
MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQTG
SAFPNQTQSKPIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVRYLQSVAELPENDIDGAVQRPSEKPRIHPTE
AAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPSILNNTLDVYLKQLIKSCLEL
VRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE