CuGenDBv2

HG10013523 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10013523
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SAGA-Tad1 domain-containing protein
Genome location: Chr02:2325795..2327054
RNA-Seq Expression: HG10013523
Synteny: HG10013523
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domains: IPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology
GenBank top hits (e value, % identity, alignment)
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus] (e value: 1.5e-225, % identity: 94.51)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQS PIWPNGVLP SPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDSSSKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAA LEEGEEVEQSDPLSFL GPLLPPLGIPFCSASVGGARKALPVSSSGSS DF+SCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVSMECP+ILNNTLD+YLKQLIKSCLELVRARSTFEH GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] (e value: 1.7e-224, % identity: 94.51)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQS PIWPNGVLP SPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAA LEEGEEVEQSDPL FL GPLLPPLGIPFCSASVGGARKALPVSSSGSS DF+SCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVSMECPNILNNTLD+YLKQLIKSCLELVRARSTFEH GHPIQKQQNQGKV+NGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

XP_022142878.1 uncharacterized protein LOC111012883 [Momordica charantia] (e value: 2.1e-219, % identity: 91.69)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQ++PIW NGVLPASPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATA
        ENDI+GAVQRPSEKPRIHPTEAA LE+GEEVEQSDPLSFL GPLLPPLGIPFCSASVGGAR+ALP+   G+SGDF SCYDSIGLSD+ETVRKRMEQIATA
Subjt:  ENDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNP
        QGLEGVSMEC NILN+TLDLYLKQLIKSCLELVR+RST EH GHPIQKQQNQGKVINGMWP+NHLRVQN SNGR EVLQEKSL+CSVSLLDFKVAMELNP
Subjt:  QGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRAFEE
        KQLGEDWPLLLEKICMR FEE
Subjt:  KQLGEDWPLLLEKICMRAFEE

XP_023524221.1 uncharacterized protein LOC111788192 [Cucurbita pepo subsp. pepo] (e value: 1.4e-218, % identity: 91.17)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRID+GDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSK EFDKMCVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+LQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPE  GS FPNQNQ++PIWPNGVLP SPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQS+GTEDSSSKVITENGNV MCDYQRPVQHL+AVAELPE
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAV RPSEKPRIHPTEAA LE+ +EVEQSDPLS L GPLLPPLGIPFCSASVGGARKALPV SSGSS DF+SCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLD+YLKQLIKSCLELVR+RST EH GHPIQKQQNQGKVINGM P+NH  VQNSNGRSEVLQEKSLECS SLLDFKVAME+NPKQ
Subjt:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPL+LEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida] (e value: 4.0e-226, % identity: 94.99)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFFYLSRFLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFPNQNQSIPIW NGVLP SPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDS+SKVITENGNVTMCDYQRPVQHLQAVAELPE
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAV RPSEKPRIHPTEAA LEEGEEVEQSDPLSFL GPLLPPLGIPFCSASVGGARKALPV+SSGSS DF+SCYDSIGLSDS TVRKRMEQIATAQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVSMECPNILNNTLD+YLKQLIKSCLELVRARSTFEH GHPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKICMRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LGS9 Uncharacterized protein (e value: 7.3e-226, % identity: 94.51)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQS PIWPNGVLP SPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDSSSKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAA LEEGEEVEQSDPLSFL GPLLPPLGIPFCSASVGGARKALPVSSSGSS DF+SCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVSMECP+ILNNTLD+YLKQLIKSCLELVRARSTFEH GHPIQKQQNQGKV+NGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

A0A1S4E5S7 uncharacterized protein LOC103503757 (e value: 8.1e-225, % identity: 94.51)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQS PIWPNGVLP SPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAA LEEGEEVEQSDPL FL GPLLPPLGIPFCSASVGGARKALPVSSSGSS DF+SCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVSMECPNILNNTLD+YLKQLIKSCLELVRARSTFEH GHPIQKQQNQGKV+NGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein (e value: 8.1e-225, % identity: 94.51)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQS PIWPNGVLP SPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDSSSKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAVQRPSEKPRIHPTEAA LEEGEEVEQSDPL FL GPLLPPLGIPFCSASVGGARKALPVSSSGSS DF+SCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVSMECPNILNNTLD+YLKQLIKSCLELVRARSTFEH GHPIQKQQNQGKV+NGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

A0A6J1CPD1 uncharacterized protein LOC111012883 (e value: 1.0e-219, % identity: 91.69)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQ++PIW NGVLPASPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATA
        ENDI+GAVQRPSEKPRIHPTEAA LE+GEEVEQSDPLSFL GPLLPPLGIPFCSASVGGAR+ALP+   G+SGDF SCYDSIGLSD+ETVRKRMEQIATA
Subjt:  ENDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNP
        QGLEGVSMEC NILN+TLDLYLKQLIKSCLELVR+RST EH GHPIQKQQNQGKVINGMWP+NHLRVQN SNGR EVLQEKSL+CSVSLLDFKVAMELNP
Subjt:  QGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRAFEE
        KQLGEDWPLLLEKICMR FEE
Subjt:  KQLGEDWPLLLEKICMRAFEE

A0A6J1IIZ9 uncharacterized protein LOC111476715 (e value: 6.6e-219, % identity: 92.6)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQASNN+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CRED PEQTGSAFPNQNQSIPIW NGVLP SPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDG+VQRPS KPRI PTEA+ LEEGEEVEQSDPLSFL GPLLPPLGIPFCSASVGGARKALPVSSSGS  DF+SCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLD+YLKQLIKSCLELVR RSTFEH GHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G14850.1 unknown protein (e value: 3.4e-42, % identity: 33.01)
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGP

Query:  EQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGA
              +P ++    ++ + V P SPRK RS    KFRDRPSPLGP GK   L   +T  ++S SK                 Q L              
Subjt:  EQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGA

Query:  VQRPSEKPRIHPTEAANLEEGEEVEQ--SDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQGLEG
                   P E  ++E+GEEVEQ    P      PL  PLG+ F   S     KA   + +G + +  +C  S  L D  T+R R+E+    +G++ 
Subjt:  VQRPSEKPRIHPTEAANLEEGEEVEQ--SDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQGLEG

Query:  VSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGED
        +SM+  N+LN  L+ Y+++LI+ CL L                                             Q+K    +VS+LDF  AME+NP+ LGE+
Subjt:  VSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGED

Query:  WPLLLEKICMRAFEE
        WP+ LEKIC RA EE
Subjt:  WPLLLEKICMRAFEE

AT2G24530.1 unknown protein (e value: 3.5e-111, % identity: 52.36)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ     RI L +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP
         R DG EQ+G+  PN +Q  P+W NGVLP SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +++       
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVQRPSEKPRIHPTE---AANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQI
         ++ DG   RP EKPRI   E   A ++ + +  E+   ++    PL+ PLGIPFCSASVGG+ + +PVS   ++ + +SCYDS GL D E +RKRME I
Subjt:  ENDIDGAVQRPSEKPRIHPTE---AANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQI

Query:  ATAQGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMG-HPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAME
        A AQGLEGVSMEC   LNN LD+YLK+LI SC +LV ARST    G   I KQQ+Q K++NG+WPTN L++Q  NG S++ Q+     SVS+LDF+ AME
Subjt:  ATAQGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMG-HPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAME

Query:  LNPKQLGEDWPLLLEKICMRAFEE
        LNP+QLGEDWP L E+I +R+FEE
Subjt:  LNPKQLGEDWPLLLEKICMRAFEE

AT4G31440.1 unknown protein (e value: 1.6e-84, % identity: 45.86)
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ     RIDL +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE
         +EDGPE++ S  P+  ++     NGVL A  R G    R   RD+P PLG NGK+                       +    Y RP ++         
Subjt:  CREDGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVQRPSEKPRIHPTE--AANLEEGEEVEQSDPLSFLS-GPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIA
        ++ D A   P+E+  +   +  AA +   +E +    +  LS  P++ PLGIPFCSASVGG R+ +PVS+S ++   +SCYDS GLSD+E +RKRME IA
Subjt:  NDIDGAVQRPSEKPRIHPTE--AANLEEGEEVEQSDPLSFLS-GPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIA

Query:  TAQGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMG-HPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMEL
          QGL GVS EC  +LNN LDLYLK+L+KSC++L  ARS     G H ++KQQ++ +++NG+   N   +Q SN  S++ +E+    SVSLLDF+VAMEL
Subjt:  TAQGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMG-HPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMEL

Query:  NPKQLGEDWPLLLEKICMRAFEE
        NP QLGEDWPLL E+I +  FEE
Subjt:  NPKQLGEDWPLLLEKICMRAFEE

AT4G33890.1 unknown protein (e value: 5.1e-46, % identity: 34.05)
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDI
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL     
Subjt:  DGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDI

Query:  DGAVQRPSEKPRIHPTEAANLEEGEEVEQ---SDPLSFLSGPLLPPLGIPFCSASVGGARKALP-VSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATA
             RP       P E  ++EEGEEVEQ     P      PL  PLG+   S   G  RK++  VS    S +  +C ++  L D+ T+R R+E+    
Subjt:  DGAVQRPSEKPRIHPTEAANLEEGEEVEQ---SDPLSFLSGPLLPPLGIPFCSASVGGARKALP-VSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ ++M+  ++LN+ LD+++++LI+ CL L   R                         T+ +R  N     +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRAFEE
         LGEDWP+ +EKIC RA ++
Subjt:  QLGEDWPLLLEKICMRAFEE

AT4G33890.2 unknown protein (e value: 5.1e-46, % identity: 34.05)
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDI
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL     
Subjt:  DGPEQTGSAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDI

Query:  DGAVQRPSEKPRIHPTEAANLEEGEEVEQ---SDPLSFLSGPLLPPLGIPFCSASVGGARKALP-VSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATA
             RP       P E  ++EEGEEVEQ     P      PL  PLG+   S   G  RK++  VS    S +  +C ++  L D+ T+R R+E+    
Subjt:  DGAVQRPSEKPRIHPTEAANLEEGEEVEQ---SDPLSFLSGPLLPPLGIPFCSASVGGARKALP-VSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ ++M+  ++LN+ LD+++++LI+ CL L   R                         T+ +R  N     +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGVSMECPNILNNTLDLYLKQLIKSCLELVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRAFEE
         LGEDWP+ +EKIC RA ++
Subjt:  QLGEDWPLLLEKICMRAFEE


Sequences
CDS sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATT
TTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGATGTGCGTTCGTGTACTTGGAAGGGAAAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATTAATGTGTCAGGACACGCTCAATCTGTTCTACAAGCTTCGAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGA
TCTGCCTTCCCAAATCAGAATCAGAGTATACCAATTTGGCCAAATGGAGTTCTTCCAGCATCCCCGCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGGGATAGGCC
AAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTG
ACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAA
GCAGCTAATCTTGAAGAAGGAGAGGAGGTGGAACAGTCGGATCCCTTAAGCTTCCTTAGTGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGG
TGGGGCGCGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTAGTGGTGATTTTATGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGG
AGCAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATTTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAG
TTGGTGAGAGCAAGGTCTACATTTGAACATATGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGCGTACA
GAATAGCAATGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAG
ATTGGCCTTTGCTGTTGGAGAAAATTTGTATGCGTGCCTTTGAGGAATAA
mRNA sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGTAGATT
TTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGATGTGCGTTCGTGTACTTGGAAGGGAAAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATTAATGTGTCAGGACACGCTCAATCTGTTCTACAAGCTTCGAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGA
TCTGCCTTCCCAAATCAGAATCAGAGTATACCAATTTGGCCAAATGGAGTTCTTCCAGCATCCCCGCGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGGGATAGGCC
AAGTCCACTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTG
ACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACAGAA
GCAGCTAATCTTGAAGAAGGAGAGGAGGTGGAACAGTCGGATCCCTTAAGCTTCCTTAGTGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGG
TGGGGCGCGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTAGTGGTGATTTTATGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGG
AGCAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACTCTGGATTTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAG
TTGGTGAGAGCAAGGTCTACATTTGAACATATGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGGATGTGGCCTACTAACCACCTACGCGTACA
GAATAGCAATGGGCGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTTAATCCAAAGCAGCTTGGGGAAG
ATTGGCCTTTGCTGTTGGAGAAAATTTGTATGCGTGCCTTTGAGGAATAA
Protein sequence
MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKMCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTG
SAFPNQNQSIPIWPNGVLPASPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSSSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGAVQRPSEKPRIHPTE
AANLEEGEEVEQSDPLSFLSGPLLPPLGIPFCSASVGGARKALPVSSSGSSGDFMSCYDSIGLSDSETVRKRMEQIATAQGLEGVSMECPNILNNTLDLYLKQLIKSCLE
LVRARSTFEHMGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRAFEE
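
As a quick consistency check on this record, the genome location can be compared with the listed sequences. The sketch below is illustrative only: it assumes the `Chr:start..end` string uses 1-based, inclusive coordinates (the record does not state this), and the helper names `parse_location` and `span_length` are hypothetical, not part of any CuGenDB tooling. The protein length of 419 residues is counted from the protein sequence listed above.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr02:2325795..2327054")
genomic_span = span_length(start, end)

# 419 residues counted from the protein sequence in this record,
# plus one stop codon, three bases per codon.
protein_length = 419
expected_cds_length = 3 * (protein_length + 1)

print(chrom, genomic_span, genomic_span == expected_cds_length)
```

Under the inclusive-coordinate assumption, the genomic span equals the length implied by the protein plus a stop codon. That would be consistent with a gene model without introns in the annotated span, although this is an inference from the lengths and not something the record states.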