; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh09G005260 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh09G005260
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationCma_Chr09:2335465..2339727
RNA-Seq ExpressionCmaCh09G005260
SyntenyCmaCh09G005260
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591626.1 hypothetical protein SDJN03_13972, partial [Cucurbita argyrosperma subsp. sororia]4.5e-23398.55Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQSQQSSRIDL DLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
        CREDAPEQTGSAFPNQNQ IPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND

Query:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
Subjt:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
        EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKIRILSF
        EDWPLLLEKI + +F
Subjt:  EDWPLLLEKIRILSF

XP_022937292.1 uncharacterized protein LOC111443621 [Cucurbita moschata]1.7e-23298.31Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQSQQSSRIDL DLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
        CREDAPEQTGSAFPNQNQ IPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND

Query:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFL CYDSIGLSDSETVRKRMEQIATAQGL
Subjt:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
        EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKIRILSF
        EDWPLLLEKI + +F
Subjt:  EDWPLLLEKIRILSF

XP_022976270.1 uncharacterized protein LOC111476715 [Cucurbita maxima]3.1e-23499.04Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
        CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND

Query:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
Subjt:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
        EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKIRILSF
        EDWPLLLEKI + +F
Subjt:  EDWPLLLEKIRILSF

XP_023536522.1 uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo]2.0e-23398.55Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQSQQ SRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
        CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND

Query:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
Subjt:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
        +GVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKIRILSF
        EDWPLLLEKI + +F
Subjt:  EDWPLLLEKIRILSF

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]1.8e-21390.89Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQ Q SSRIDLGDLKAQIVKKLGND+SKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN +P
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNVTMCDYQRPVQQLQAVAELPE
        CR+D PEQTGSAFPNQNQSIPIW+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNVTMCDYQRPVQQLQAVAELPE

Query:  NDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDG+V RPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPV+SSGS  DFLSCYDSIGLSDS TVRKRMEQIATAQ
Subjt:  NDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKIRILSF
        LGEDWPLLLEKI + +F
Subjt:  LGEDWPLLLEKIRILSF

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein4.2e-21390.17Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+P
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNVTMCDYQRPVQQLQAVAELPE
        CRED PEQTGSAFPNQNQS PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK+ CLSYQSTG+ED   KVITENGNVT+CDYQRPV+ LQ+VAELPE
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNVTMCDYQRPVQQLQAVAELPE

Query:  NDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ
        GLEGVS+ECP+ILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKIRILSF
        LGEDWPLLLEKI + +F
Subjt:  LGEDWPLLLEKIRILSF

A0A1S4E5S7 uncharacterized protein LOC1035037573.0e-21189.93Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVL AS N+P
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNVTMCDYQRPVQQLQAVAELPE
        CRED PEQTGSAFPNQNQS PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNVT+CDYQRPVQ LQ+VAELPE
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNVTMCDYQRPVQQLQAVAELPE

Query:  NDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN+NGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKIRILSF
        LGEDWPLLLEKI + +F
Subjt:  LGEDWPLLLEKIRILSF

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein3.0e-21189.93Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVL AS N+P
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNVTMCDYQRPVQQLQAVAELPE
        CRED PEQTGSAFPNQNQS PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNVT+CDYQRPVQ LQ+VAELPE
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTED--RKVITENGNVTMCDYQRPVQQLQAVAELPE

Query:  NDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN+NGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKIRILSF
        LGEDWPLLLEKI + +F
Subjt:  LGEDWPLLLEKIRILSF

A0A6J1FAS2 uncharacterized protein LOC1114436218.2e-23398.31Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQSQQSSRIDL DLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
        CREDAPEQTGSAFPNQNQ IPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND

Query:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFL CYDSIGLSDSETVRKRMEQIATAQGL
Subjt:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
        EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKIRILSF
        EDWPLLLEKI + +F
Subjt:  EDWPLLLEKIRILSF

A0A6J1IIZ9 uncharacterized protein LOC1114767151.5e-23499.04Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
        CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEND

Query:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
        IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
Subjt:  IDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
        EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKIRILSF
        EDWPLLLEKI + +F
Subjt:  EDWPLLLEKIRILSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein2.8e-3931.53Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAP
        SR++  ++KA I +K+G+ ++  YF  L KFL  ++SK EFDKLC + +GRENI LHN+L+RSILKNA VAK+PP                         
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAP

Query:  EQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPENDIDGSVQ
              +P ++    ++ + V P SPRK RS    KFRDRPSPLGP GK   L    T T D  +                                   
Subjt:  EQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPENDIDGSVQ

Query:  RPSGKPRIRPTEASILEEGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVS
            K +  P E   +E+GEEVEQ    P    R PL  PLG+ F         K+    S+ + ++  +C  S  L D  T+R R+E+    +G++ +S
Subjt:  RPSGKPRIRPTEASILEEGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVS

Query:  LECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWP
        ++  N+LN  L+ Y+++LI+ CL L                                             ++K    +VS+LDF  AME+NP+ LGE+WP
Subjt:  LECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWP

Query:  LLLEKI
        + LEKI
Subjt:  LLLEKI

AT2G24530.1 unknown protein7.6e-10650.95Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQ  Q  RI L +LK  IVKK G ++S+RYF+YL +FL QKL+K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PP  + +GH+      +N   
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKTACLSYQSTGTEDRK--VITENGNVTMCDYQRPVQQLQAVAELP
         R D  EQ+G+  PN +Q  P+W+NGVLP+SPRK RS ++  K RDRPSPLG NGK   + +Q    ED +  V  ENG     DYQR  + +       
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKTACLSYQSTGTEDRK--VITENGNVTMCDYQRPVQQLQAVAELP

Query:  ENDIDGSVQRPSGKPRIRPTE---ASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQI
         ++ DG   RP  KPRI   E   A  + + +  E+   ++    PL+ PLGIPFCSASVGG+ + +PVS++    + +SCYDS GL D E +RKRME I
Subjt:  ENDIDGSVQRPSGKPRIRPTE---ASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQI

Query:  ATAQGLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAME
        A AQGLEGVS+EC   LNN LDVYLK+LI SC +LV  RST    G   I KQQ+Q K++NG+WPTN L++Q  NG S++ ++     SVS+LDF+ AME
Subjt:  ATAQGLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAME

Query:  LNPKQLGEDWPLLLEKIRILSF
        LNP+QLGEDWP L E+I + SF
Subjt:  LNPKQLGEDWPLLLEKIRILSF

AT4G31440.1 unknown protein7.7e-8245.56Show/hide
Query:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP
        MQ  Q  RIDL +LK  IVKK+G ++S RYF+YL +FL QKL+K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTP

Query:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKT-ACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEN
         +ED PE++ S  P+  ++    +NGVL    R G    R   RD+P PLG NGK     +Y   G    +   E  +  +C    P +Q          
Subjt:  CREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKT-ACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPEN

Query:  DIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQG
              +  SGK ++     S  +E +    S P      P++ PLGIPFCSASVGG R+ +PVS+S + +   SCYDS GLSD+E +RKRME IA  QG
Subjt:  DIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQG

Query:  LEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ
        L GVS EC  +LNN LD+YLK+L+KSC++L   RS     G H ++KQQ++ +++NG+   N   +Q SN  S++  E+    SVSLLDF+VAMELNP Q
Subjt:  LEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTG-HPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKIRILSF
        LGEDWPLL E+I I  F
Subjt:  LGEDWPLLLEKIRILSF

AT4G33890.1 unknown protein3.7e-4433.74Show/hide
Query:  SQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCR
        +Q SSR+D  ++KA I +++GN +++ YF  L +F   K++K EFDKLC++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N     
Subjt:  SQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCR

Query:  EDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPENDID
               G +  N +Q  P+  +     S RK RS    K RDRPSPLGP GK   L+            T N        +  + + Q+  EL      
Subjt:  EDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPENDID

Query:  GSVQRPSGKPRIRPTEASILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALP-VSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ
            RP       P E   +EEGEEVEQ     P    R PL  PLG+   S   G  RK++  VS      +  +C ++  L D+ T+R R+E+    +
Subjt:  GSVQRPSGKPRIRPTEASILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALP-VSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ
        GL+ ++++  ++LN+ LDV++++LI+ CL L  TR                         T+ +R  N     +  ++      VS+ DF+  MELN + 
Subjt:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKI
        LGEDWP+ +EKI
Subjt:  LGEDWPLLLEKI

AT4G33890.2 unknown protein3.7e-4433.74Show/hide
Query:  SQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCR
        +Q SSR+D  ++KA I +++GN +++ YF  L +F   K++K EFDKLC++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N     
Subjt:  SQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCR

Query:  EDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPENDID
               G +  N +Q  P+  +     S RK RS    K RDRPSPLGP GK   L+            T N        +  + + Q+  EL      
Subjt:  EDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPENDID

Query:  GSVQRPSGKPRIRPTEASILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALP-VSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ
            RP       P E   +EEGEEVEQ     P    R PL  PLG+   S   G  RK++  VS      +  +C ++  L D+ T+R R+E+    +
Subjt:  GSVQRPSGKPRIRPTEASILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALP-VSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ
        GL+ ++++  ++LN+ LDV++++LI+ CL L  TR                         T+ +R  N     +  ++      VS+ DF+  MELN + 
Subjt:  GLEGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKI
        LGEDWP+ +EKI
Subjt:  LGEDWPLLLEKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCTCAGCAAAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGT
AAATTCTTGGGTCAGAAGCTGAGCAAGGTTGAGTTTGATAAGCTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATT
TTGAAGAATGCATGTGTAGCCAAGACCCCACCACTTATAAATGTATCAGGACATGCACAATCTGTGCTACAAGCTTCGAACAATACTCCTTGCAGAGAAGATGCC
CCTGAACAAACTGGATCTGCCTTTCCAAATCAGAATCAGAGTATACCAATTTGGACAAATGGGGTTCTGCCTGTGTCCCCACGGAAGGGTAGATCTGTCTTACGC
GGGAAGTTTAGGGATAGGCCGAGTCCGCTTGGTCCAAATGGAAAAACAGCATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGGAAAGTTATTACAGAGAAT
GGTAATGTAACCATGTGTGACTATCAGAGGCCTGTACAGCAACTTCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGATCAGTTCAGCGGCCGTCAGGA
AAACCAAGAATACGTCCAACAGAAGCATCCATTCTTGAAGAAGGAGAGGAGGTGGAACAATCAGATCCCTTAAGCTTCCTTAGAGGTCCTCTACTTCCACCTCTT
GGTATTCCATTTTGTTCAGCGAGTGTGGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTTGTGTCGATTTTCTGAGTTGTTATGACAGTATTGGA
TTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAACAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTTTGGAATGTCCTAACATCCTGAATAATACTCTT
GATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAACAAGGTCTACATTTGAACATACAGGGCACCCGATCCAGAAGCAACAAAATCAAGGG
AAGGTTATAAATGGTATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAACGGGCGATCTGAAGTTTTGGAGGAAAAGAGTTTTGAATGCTCAGTGTCATTG
CTGGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAATTAGAATACTGTCGTTTAGCTTGGTGAAAGAT
GAATCACCTGGAAACATCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAATCTCAGCAAAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGT
AAATTCTTGGGTCAGAAGCTGAGCAAGGTTGAGTTTGATAAGCTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATT
TTGAAGAATGCATGTGTAGCCAAGACCCCACCACTTATAAATGTATCAGGACATGCACAATCTGTGCTACAAGCTTCGAACAATACTCCTTGCAGAGAAGATGCC
CCTGAACAAACTGGATCTGCCTTTCCAAATCAGAATCAGAGTATACCAATTTGGACAAATGGGGTTCTGCCTGTGTCCCCACGGAAGGGTAGATCTGTCTTACGC
GGGAAGTTTAGGGATAGGCCGAGTCCGCTTGGTCCAAATGGAAAAACAGCATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGGAAAGTTATTACAGAGAAT
GGTAATGTAACCATGTGTGACTATCAGAGGCCTGTACAGCAACTTCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGATCAGTTCAGCGGCCGTCAGGA
AAACCAAGAATACGTCCAACAGAAGCATCCATTCTTGAAGAAGGAGAGGAGGTGGAACAATCAGATCCCTTAAGCTTCCTTAGAGGTCCTCTACTTCCACCTCTT
GGTATTCCATTTTGTTCAGCGAGTGTGGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTTGTGTCGATTTTCTGAGTTGTTATGACAGTATTGGA
TTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAACAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTTTGGAATGTCCTAACATCCTGAATAATACTCTT
GATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAACAAGGTCTACATTTGAACATACAGGGCACCCGATCCAGAAGCAACAAAATCAAGGG
AAGGTTATAAATGGTATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAACGGGCGATCTGAAGTTTTGGAGGAAAAGAGTTTTGAATGCTCAGTGTCATTG
CTGGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAATTAGAATACTGTCGTTTAGCTTGGTGAAAGAT
GAATCACCTGGAAACATCTAA
Protein sequenceShow/hide protein sequence
MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDA
PEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPENDIDGSVQRPSG
KPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSLECPNILNNTL
DVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKIRILSFSLVKD
ESPGNI