; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037319 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037319
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationchr2:5118261..5119520
RNA-Seq ExpressionLag0037319
SyntenyLag0037319
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]6.3e-22492.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDS+SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG V RPSEKPRIHPT AAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S ECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]5.4e-22392.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG V RPSEKPRIHPT AAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S ECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

XP_022142878.1 uncharacterized protein LOC111012883 [Momordica charantia]4.3e-22090.97Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQT+PIWSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA
        ENDI+G V RPSEKPRIHPT AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ LP+   G+SGDF SCYDSIGLSD+ETVRKRMEQIA A
Subjt:  ENDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA

Query:  QGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNP
        QGLEG+S EC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGK+INGMWPSNHLRVQN SNGR EVLQEKSL+CSVSLLDFKVAMELNP
Subjt:  QGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRSFEE
        KQLGEDWPLLLEKICMR+FEE
Subjt:  KQLGEDWPLLLEKICMRSFEE

XP_023524221.1 uncharacterized protein LOC111788192 [Cucurbita pepo subsp. pepo]7.3e-22090.69Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRID+GDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+LQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPE  GS FPNQNQ +PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQS+GTEDS+SKVITENGNV MCDYQRPVQHL+AVAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG VLRPSEKPRIHPT AA+LED +EVEQSDPLS LRGPLLPPLGIPFCSASVGGARK LPV SSGSS DFLSCYDSIGLSDSETVRKRMEQIA AQ
Subjt:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S ECPNILNNTLDVYLKQLIKSCLELVR+RST EHTGHPIQKQQNQGK+INGM PSNH  VQNSNGRSEVLQEKSLECS SLLDFKVAME+NPKQ
Subjt:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPL+LEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]2.8e-22794.51Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFPNQNQ+IPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG V RPSEKPRIHPT AAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPV+SSGSS DFLSCYDSIGLSDS TVRKRMEQIA AQ
Subjt:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S ECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGK++N MWP+NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKICMR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein3.1e-22492.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDS+SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG V RPSEKPRIHPT AAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S ECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

A0A1S4E5S7 uncharacterized protein LOC1035037572.6e-22392.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG V RPSEKPRIHPT AAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S ECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein2.6e-22392.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CREDGPEQTGSAFPNQNQ+ PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG V RPSEKPRIHPT AAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARK LPVSSSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S ECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGK++NGMWP+NHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

A0A6J1CPD1 uncharacterized protein LOC1110128832.1e-22090.97Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQT+PIWSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA
        ENDI+G V RPSEKPRIHPT AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ LP+   G+SGDF SCYDSIGLSD+ETVRKRMEQIA A
Subjt:  ENDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA

Query:  QGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNP
        QGLEG+S EC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGK+INGMWPSNHLRVQN SNGR EVLQEKSL+CSVSLLDFKVAMELNP
Subjt:  QGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRSFEE
        KQLGEDWPLLLEKICMR+FEE
Subjt:  KQLGEDWPLLLEKICMRSFEE

A0A6J1IIZ9 uncharacterized protein LOC1114767153.9e-21991.65Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQASNN+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CRED PEQTGSAFPNQNQ+IPIW+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ
        NDIDG+V RPS KPRI PT A+ILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARK LPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA AQ
Subjt:  NDIDGTVLRPSEKPRIHPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQ

Query:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEG+S ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGK+INGMWP+NHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRSFEE
        LGEDWPLLLEKI MR+FEE
Subjt:  LGEDWPLLLEKICMRSFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein4.5e-4232.53Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGP

Query:  EQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGT
              +P ++    ++ + V P SPRK RS    KFRDRPSPLGP GK   L   +T  ++S SK                 Q L              
Subjt:  EQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGT

Query:  VLRPSEKPRIHPTAAAILEDGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQGLEG
                   P     +EDGEEVEQ    P    R PL  PLG+ F   S     K    + +G + +  +C  S  L D  T+R R+E+    +G++ 
Subjt:  VLRPSEKPRIHPTAAAILEDGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQGLEG

Query:  ISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGED
        +S +  N+LN  L+ Y+++LI+ CL L                                             Q+K    +VS+LDF  AME+NP+ LGE+
Subjt:  ISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGED

Query:  WPLLLEKICMRSFEE
        WP+ LEKIC R+ EE
Subjt:  WPLLLEKICMRSFEE

AT2G24530.1 unknown protein6.8e-11553.3Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ     RI L +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP
         R DG EQ+G+  PN +Q  P+WSNGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +++       
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGTVLRPSEKPRI---HPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQI
         ++ DG  LRP EKPRI      AA  + D +  E+   ++    PL+ PLGIPFCSASVGG+ +T+PVS   ++ + +SCYDS GL D E +RKRME I
Subjt:  ENDIDGTVLRPSEKPRI---HPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQI

Query:  ANAQGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAME
        A AQGLEG+S EC   LNN LDVYLK+LI SC +LV ARST    G   I KQQ+Q KI+NG+WP+N L++Q  NG S++ Q+     SVS+LDF+ AME
Subjt:  ANAQGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAME

Query:  LNPKQLGEDWPLLLEKICMRSFEE
        LNP+QLGEDWP L E+I +RSFEE
Subjt:  LNPKQLGEDWPLLLEKICMRSFEE

AT4G31440.1 unknown protein5.0e-8646.34Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ     RIDL +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
         +EDGPE++ S  P+  +     SNGVL    R G    R   RD+P PLG NGK+                       +    Y RP ++         
Subjt:  CREDGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGTVLRPSEKPRI---HPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIA
        ++ D   L P+E+  +      AA I  D E   Q   LS    P++ PLGIPFCSASVGG R+T+PVS+S ++   +SCYDS GLSD+E +RKRME IA
Subjt:  NDIDGTVLRPSEKPRI---HPTAAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIA

Query:  NAQGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMEL
          QGL G+S EC  +LNN LD+YLK+L+KSC++L  ARS     G H ++KQQ++ +++NG+  +N   +Q SN  S++ +E+    SVSLLDF+VAMEL
Subjt:  NAQGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMEL

Query:  NPKQLGEDWPLLLEKICMRSFEE
        NP QLGEDWPLL E+I +  FEE
Subjt:  NPKQLGEDWPLLLEKICMRSFEE

AT4G33890.1 unknown protein3.0e-4633.81Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL     
Subjt:  DGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI

Query:  DGTVLRPSEKPRIHPTAAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA
            L    +P   P     +E+GEEVEQ     P    R PL  PLG+   S   G  RK++  VS    S +  +C ++  L D+ T+R R+E+    
Subjt:  DGTVLRPSEKPRIHPTAAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA

Query:  QGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ I+ +  ++LN+ LDV++++LI+ CL L   R   +                         RV+  N   +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRSFEE
         LGEDWP+ +EKIC R+ ++
Subjt:  QLGEDWPLLLEKICMRSFEE

AT4G33890.2 unknown protein3.0e-4633.81Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL     
Subjt:  DGPEQTGSAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI

Query:  DGTVLRPSEKPRIHPTAAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA
            L    +P   P     +E+GEEVEQ     P    R PL  PLG+   S   G  RK++  VS    S +  +C ++  L D+ T+R R+E+    
Subjt:  DGTVLRPSEKPRIHPTAAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKTLP-VSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANA

Query:  QGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ I+ +  ++LN+ LDV++++LI+ CL L   R   +                         RV+  N   +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGISTECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRSFEE
         LGEDWP+ +EKIC R+ ++
Subjt:  QLGEDWPLLLEKICMRSFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAGAGGTACTTCTTCTACTTGAGTAGATT
CTTGGGTCAGAAGCTAAGCAAGGTTGAATTTGATAAGGTGTGCGTCCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCGATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTGTCAGGACATGCACAATCTGTGCTACAAGCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGA
TCTGCTTTTCCAAATCAGAATCAAACTATACCAATTTGGTCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGGAGATCTGTCTTACGTGGAAAGTTTAGGGATAGGCC
AAGTCCGCTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAATAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTG
ACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGACGGAACAGTTCTGCGGCCATCAGAAAAACCAAGGATACACCCTACAGCA
GCAGCTATTCTTGAAGATGGAGAAGAGGTGGAACAGTCCGACCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGG
TGGGGCACGCAAGACCTTGCCAGTGAGCAGTAGTGGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGG
AGCAAATTGCAAATGCACAAGGGCTTGAAGGTATTTCTACGGAATGTCCTAACATTCTGAATAATACTCTGGATGTGTACCTAAAGCAATTGATAAAGTCTTGCCTTGAG
TTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGATTATAAATGGTATGTGGCCTAGTAACCACCTACGTGTACA
GAATAGCAATGGGCGGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAG
ATTGGCCTTTGCTGTTGGAGAAAATTTGTATGCGTTCCTTCGAAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAGAGGTACTTCTTCTACTTGAGTAGATT
CTTGGGTCAGAAGCTAAGCAAGGTTGAATTTGATAAGGTGTGCGTCCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCGATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTGTCAGGACATGCACAATCTGTGCTACAAGCTTCAAACAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGA
TCTGCTTTTCCAAATCAGAATCAAACTATACCAATTTGGTCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGGAGATCTGTCTTACGTGGAAAGTTTAGGGATAGGCC
AAGTCCGCTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAATAGCAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTG
ACTATCAGAGACCGGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGACGGAACAGTTCTGCGGCCATCAGAAAAACCAAGGATACACCCTACAGCA
GCAGCTATTCTTGAAGATGGAGAAGAGGTGGAACAGTCCGACCCCTTAAGCTTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGG
TGGGGCACGCAAGACCTTGCCAGTGAGCAGTAGTGGCAGTAGTGGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGG
AGCAAATTGCAAATGCACAAGGGCTTGAAGGTATTTCTACGGAATGTCCTAACATTCTGAATAATACTCTGGATGTGTACCTAAAGCAATTGATAAAGTCTTGCCTTGAG
TTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGATTATAAATGGTATGTGGCCTAGTAACCACCTACGTGTACA
GAATAGCAATGGGCGGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAG
ATTGGCCTTTGCTGTTGGAGAAAATTTGTATGCGTTCCTTCGAAGAATAA
Protein sequenceShow/hide protein sequence
MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTG
SAFPNQNQTIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGTVLRPSEKPRIHPTA
AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKTLPVSSSGSSGDFLSCYDSIGLSDSETVRKRMEQIANAQGLEGISTECPNILNNTLDVYLKQLIKSCLE
LVRARSTFEHTGHPIQKQQNQGKIINGMWPSNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRSFEE