; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg016216 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg016216
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationscaffold9:42923619..42925527
RNA-Seq ExpressionSpg016216
SyntenySpg016216
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608203.1 hypothetical protein SDJN03_01545, partial [Cucurbita argyrosperma subsp. sororia]3.9e-22888.42Show/hide
Query:  PVPQFSFNWSDWGLE-LQFTVYEPPSGEMQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILK
        P P   FN SDWGLE LQFTVYEPPSGEMQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSK EFDK+CVRVLGRENIQLHN+LIRSILK
Subjt:  PVPQFSFNWSDWGLE-LQFTVYEPPSGEMQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILK

Query:  NACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTGSAFP--NQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSN
        NACVAKTPPPIN SGHAQS+LQASNNSPCREDGPE  GS FP  NQNQ +PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKI CLSYQS+GTEDS+
Subjt:  NACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTGSAFP--NQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSN

Query:  SKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGS
        SKVITENGNV MCDYQRPVQHL+AVAELPENDIDG V RPSEKPRIHPT AA+LED +EVEQSDPLS LRGPLLPPLGIPFCSASVGGARK LPV SSGS
Subjt:  SKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGS

Query:  SGDFLSCYDSIGLSDSETVRKRMEQIANAQGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTN
          DFLSCYDSIGLSD ETVRKRMEQIA AQGLEG+S+ECPNILNNTLDVYLKQLIKSCLELVR+RST EHTGHPIQKQQNQGK+INGM PSNH  VQN+N
Subjt:  SGDFLSCYDSIGLSDSETVRKRMEQIANAQGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTN

Query:  GRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRSFEE
        GRSEVLQEKSLECS SLLDFKVAME+NPKQLGEDWPL+LEKI MR+FE+
Subjt:  GRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRSFEE

XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]1.5e-22492.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK  CLSYQSTG+EDS+SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG VQRPSEKPRIHPT AAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+SMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]9.9e-22493.08Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKI CLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG VQRPSEKPRIHPT AAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+SMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

XP_022142878.1 uncharacterized protein LOC111012883 [Momordica charantia]1.3e-22090.97Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQT+PIWSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+ CLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA
        ENDI+G VQRPSEKPRIHPT AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ LP+   G+SGDF SCYDSIGLSD+ETVRKRMEQIA A
Subjt:  ENDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA

Query:  QGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQN-TNGRSEVLQEKSLECSVSLLDFKVAMELNP
        QGLEG+SMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGK+INGMWPSNHLRVQN +NGR EVLQEKSL+CSVSLLDFKVAMELNP
Subjt:  QGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQN-TNGRSEVLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRSFEE
        KQLGEDWPLLLEKICMR+FEE
Subjt:  KQLGEDWPLLLEKICMRSFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]1.9e-22794.27Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFPNQNQ+IPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKI CLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG V RPSEKPRIHPT AAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPV+SSGSS DFLSCYDSIGLSDS TVRKRMEQIA AQ
Subjt:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+SMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGK++N MWP+NHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKICMR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein7.4e-22592.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK  CLSYQSTG+EDS+SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG VQRPSEKPRIHPT AAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+SMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

A0A1S4E5S7 uncharacterized protein LOC1035037574.8e-22493.08Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKI CLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG VQRPSEKPRIHPT AAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+SMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein4.8e-22493.08Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKI CLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG VQRPSEKPRIHPT AAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+SMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQN NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

A0A6J1CPD1 uncharacterized protein LOC1110128836.5e-22190.97Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQT+PIWSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+ CLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA
        ENDI+G VQRPSEKPRIHPT AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ LP+   G+SGDF SCYDSIGLSD+ETVRKRMEQIA A
Subjt:  ENDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA

Query:  QGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQN-TNGRSEVLQEKSLECSVSLLDFKVAMELNP
        QGLEG+SMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGK+INGMWPSNHLRVQN +NGR EVLQEKSL+CSVSLLDFKVAMELNP
Subjt:  QGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQN-TNGRSEVLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRSFEE
        KQLGEDWPLLLEKICMR+FEE
Subjt:  KQLGEDWPLLLEKICMRSFEE

A0A6J1IIZ9 uncharacterized protein LOC1114767151.9e-22091.89Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQASNN+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CRED PEQTGSAFPNQNQ+IPIW+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK ACLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG+VQRPS KPRI PT A+ILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA AQ
Subjt:  NDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGK+INGMWP+NHLRVQN+NGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein8.2e-4332.77Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGP

Query:  EQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGT
              +P ++    ++ + V P SPRK RS    KFRDRPSPLGP GK   L   +T  ++S SK                 Q L              
Subjt:  EQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGT

Query:  VQRPSEKPRIHPTAAAILEDGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQGLEG
                   P     +EDGEEVEQ    P    R PL  PLG+ F   S     K    + +G + +  +C  S  L D  T+R R+E+    +G++ 
Subjt:  VQRPSEKPRIHPTAAAILEDGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQGLEG

Query:  ISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGED
        +SM+  N+LN  L+ Y+++LI+ CL L                                             Q+K    +VS+LDF  AME+NP+ LGE+
Subjt:  ISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGED

Query:  WPLLLEKICMRSFEE
        WP+ LEKIC R+ EE
Subjt:  WPLLLEKICMRSFEE

AT2G24530.1 unknown protein8.1e-11553.3Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ     RI L +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP
         R DG EQ+G+  PN +Q  P+WSNGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +++       
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGTVQRPSEKPRI---HPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQI
         ++ DG   RP EKPRI      AA  + D +  E+   ++    PL+ PLGIPFCSASVGG+ +T+PVS   ++ + +SCYDS GL D E +RKRME I
Subjt:  ENDIDGTVQRPSEKPRI---HPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQI

Query:  ANAQGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAME
        A AQGLEG+SMEC   LNN LDVYLK+LI SC +LV ARST    G   I KQQ+Q KI+NG+WP+N L++Q  NG S++ Q+     SVS+LDF+ AME
Subjt:  ANAQGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAME

Query:  LNPKQLGEDWPLLLEKICMRSFEE
        LNP+QLGEDWP L E+I +RSFEE
Subjt:  LNPKQLGEDWPLLLEKICMRSFEE

AT4G31440.1 unknown protein8.7e-8545.86Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ     RIDL +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
         +EDGPE++ S  P+  +     SNGVL    R G    R   RD+P PLG NGK+                       +    Y RP ++         
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVQRPSEKPRI---HPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIA
        ++ D     P+E+  +      AA I  D E   Q   LS    P++ PLGIPFCSASVGG R+T+PVS+S ++   +SCYDS GLSD+E +RKRME IA
Subjt:  NDIDGTVQRPSEKPRI---HPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIA

Query:  NAQGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMEL
          QGL G+S EC  +LNN LD+YLK+L+KSC++L  ARS     G H ++KQQ++ +++NG+  +N   +Q +N  S++ +E+    SVSLLDF+VAMEL
Subjt:  NAQGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMEL

Query:  NPKQLGEDWPLLLEKICMRSFEE
        NP QLGEDWPLL E+I +  FEE
Subjt:  NPKQLGEDWPLLLEKICMRSFEE

AT4G33890.1 unknown protein1.2e-4634.05Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL     
Subjt:  DGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI

Query:  DGTVQRPSEKPRIHPTAAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA
             RP       P     +E+GEEVEQ     P    R PL  PLG+   S   G  RK++  VS    S +  +C ++  L D+ T+R R+E+    
Subjt:  DGTVQRPSEKPRIHPTAAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA

Query:  QGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ I+M+  ++LN+ LDV++++LI+ CL L   R   +                         RV+  N   +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRSFEE
         LGEDWP+ +EKIC R+ ++
Subjt:  QLGEDWPLLLEKICMRSFEE

AT4G33890.2 unknown protein1.2e-4634.05Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL     
Subjt:  DGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI

Query:  DGTVQRPSEKPRIHPTAAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA
             RP       P     +E+GEEVEQ     P    R PL  PLG+   S   G  RK++  VS    S +  +C ++  L D+ T+R R+E+    
Subjt:  DGTVQRPSEKPRIHPTAAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA

Query:  QGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ I+M+  ++LN+ LDV++++LI+ CL L   R   +                         RV+  N   +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRSFEE
         LGEDWP+ +EKIC R+ ++
Subjt:  QLGEDWPLLLEKICMRSFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCTTCTTCTTCTTCATCTTCAATCCCTTCTTCTTCCAACCCCCAGCTCACTGATCTCCATCATTCTACTTCCAATTTCCTACTTACAACTTCCCAATTGCGCCTAAGGGA
CGAGGGTTTCCTCTCTAGCTGCTCTGCTGCAATTCGCGCCTCTTCCGTTCGGCTTCCAGTTCCACAATTCAGCTTCAATTGGAGTGACTGGGGTTTGGAGCTACAGTTTA
CAGTGTATGAACCGCCCTCTGGAGAAATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAG
AGGTACTTCTTCTACTTGAGTAGATTCTTGGGTCAGAAGCTAAGCAAGGTTGAATTTGATAAGGTGTGCGTCCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCA
ATTGATAAGGTCGATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGTGTCAGGACATGCACAATCTGTGCTACAAGCTTCAAACAACTCTCCTTGCA
GGGAAGATGGCCCTGAACAAACTGGATCTGCTTTTCCAAATCAGAATCAAACTATACCAATATGGTCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGGAGATCTGTC
TTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGTCCAAATGGAAAAATCGCATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAATAGCAAAGTTATTAC
AGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGACGGAACAGTTCAGCGGCCATCAG
AAAAACCAAGGATACACCCTACAGCAGCAGCTATTCTTGAAGATGGAGAAGAGGTGGAACAGTCCGACCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGT
ATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGACCTTGCCAGTGAGCAGTAGTGGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGA
TTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCAAATGCACAAGGGCTTGAAGGTATTTCTATGGAATGTCCTAACATTCTGAATAATACTCTGGATGTGTACCTAA
AGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGATTATAAATGGTATG
TGGCCTAGTAACCACCTACGTGTACAGAATACCAATGGGCGGTCTGAAGTTCTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGA
GCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAATTTGTATGCGCTCCTTTGAAGAATAA
mRNA sequenceShow/hide mRNA sequence
TCTTCTTCTTCTTCATCTTCAATCCCTTCTTCTTCCAACCCCCAGCTCACTGATCTCCATCATTCTACTTCCAATTTCCTACTTACAACTTCCCAATTGCGCCTAAGGGA
CGAGGGTTTCCTCTCTAGCTGCTCTGCTGCAATTCGCGCCTCTTCCGTTCGGCTTCCAGTTCCACAATTCAGCTTCAATTGGAGTGACTGGGGTTTGGAGCTACAGTTTA
CAGTGTATGAACCGCCCTCTGGAGAAATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAG
AGGTACTTCTTCTACTTGAGTAGATTCTTGGGTCAGAAGCTAAGCAAGGTTGAATTTGATAAGGTGTGCGTCCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCA
ATTGATAAGGTCGATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGTGTCAGGACATGCACAATCTGTGCTACAAGCTTCAAACAACTCTCCTTGCA
GGGAAGATGGCCCTGAACAAACTGGATCTGCTTTTCCAAATCAGAATCAAACTATACCAATATGGTCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGGAGATCTGTC
TTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGTCCAAATGGAAAAATCGCATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAATAGCAAAGTTATTAC
AGAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGACGGAACAGTTCAGCGGCCATCAG
AAAAACCAAGGATACACCCTACAGCAGCAGCTATTCTTGAAGATGGAGAAGAGGTGGAACAGTCCGACCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGT
ATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGACCTTGCCAGTGAGCAGTAGTGGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGA
TTCAGAGACAGTGAGAAAACGCATGGAGCAAATTGCAAATGCACAAGGGCTTGAAGGTATTTCTATGGAATGTCCTAACATTCTGAATAATACTCTGGATGTGTACCTAA
AGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGATTATAAATGGTATG
TGGCCTAGTAACCACCTACGTGTACAGAATACCAATGGGCGGTCTGAAGTTCTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGA
GCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAATTTGTATGCGCTCCTTTGAAGAATAA
Protein sequenceShow/hide protein sequence
SSSSSSSIPSSSNPQLTDLHHSTSNFLLTTSQLRLRDEGFLSSCSAAIRASSVRLPVPQFSFNWSDWGLELQFTVYEPPSGEMQPQHSSRIDLGDLKAQIVKKLGNDKSK
RYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSV
LRGKFRDRPSPLGPNGKIACLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGTVQRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLG
IPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQGLEGISMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGM
WPSNHLRVQNTNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRSFEE