; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008435 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008435
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationscaffold4:1403221..1404474
RNA-Seq ExpressionMS008435
SyntenyMS008435
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]1.7e-21389.29Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+L+RFLGQK+ KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQ+ PIW NGVLP SPRKGRS LR  KFRDRPSPLGPNGK TCLSY STG+EDS SKVITENGNVTLCDYQRPV++LQ+VAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ
        ENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ALP+ +SG  DF SCYDSIGLSD+ETVRKRMEQIA+AQ
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ

Query:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK
        GLEGVSMEC +ILN+TLD+YLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGMWP+NHLRVQN SNGR EVLQEKSL+CSVSLLDFKVAMELNPK
Subjt:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRTFEE
        QLGEDWPLLLEKI MR FEE
Subjt:  QLGEDWPLLLEKICMRTFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]3.3e-21289.05Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+L+RFLGQK+ KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQ+ PIW NGVLP SPRKGRS+LR  KFRDRPSPLGPNGKITCLSY STG+EDS SKVITENGNVTLCDYQRPVQ+LQ+VAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ
        ENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGAR+ALP+ +SG  DF SCYDSIGLSD+ETVRKRMEQIA+AQ
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ

Query:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK
        GLEGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGMWP+NHLRVQN +NGR EVLQEKSL+CSVSLLDFKVAMELNPK
Subjt:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRTFEE
        QLGEDWPLLLEKI MR FEE
Subjt:  QLGEDWPLLLEKICMRTFEE

XP_022142878.1 uncharacterized protein LOC111012883 [Momordica charantia]1.3e-24099.76Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGK+TCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL
        ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL

Query:  EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQL
        EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQL
Subjt:  EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRTFEE
        GEDWPLLLEKICMRTFEE
Subjt:  GEDWPLLLEKICMRTFEE

XP_023524221.1 uncharacterized protein LOC111788192 [Cucurbita pepo subsp. pepo]4.4e-20987.17Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRID+GDLKAQIVKKLGNDKSKRYFFYL+RFLGQKL K EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+LQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CREDGPE  GS FPNQNQ +PIW NGVLP SPRKGRS+LR  KFRDRPSPLGPNGKITCLSY S+GTEDS SKVITENGNV +CDYQRPVQHL+AVAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG---DFSSCYDSIGLSDTETVRKRMEQIATA
        ENDI+GAV RPSEKPRIHPTEAA+LED +EVEQSDPLS LRGPLLPPLGIPFCSASVGGAR+ALP+G+SG   DF SCYDSIGLSD+ETVRKRMEQIATA
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG---DFSSCYDSIGLSDTETVRKRMEQIATA

Query:  QGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNP
        QGLEGVS+EC NILN+TLD+YLKQLIKSCLELVRSRST+EHTGHPIQKQQNQGKVINGM PSNH  VQN SNGR EVLQEKSL+CS SLLDFKVAME+NP
Subjt:  QGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRTFEE
        KQLGEDWPL+LEKI MR FEE
Subjt:  KQLGEDWPLLLEKICMRTFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]1.1e-21590Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CR+DGPEQTGSAFPNQNQ++PIWSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGKITCLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ
        ENDI+GAV RPSEKPRIHPTEAAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ALP+ +SG  DF SCYDSIGLSD+ TVRKRMEQIATAQ
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ

Query:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK
        GLEGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGKV+N MWP+NHLRVQN SNGR EVLQEKSL+CSVSLLDFKVAMELNPK
Subjt:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRTFEE
        QLGEDWPLLLEKICMR FEE
Subjt:  QLGEDWPLLLEKICMRTFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein8.4e-21489.29Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+L+RFLGQK+ KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQ+ PIW NGVLP SPRKGRS LR  KFRDRPSPLGPNGK TCLSY STG+EDS SKVITENGNVTLCDYQRPV++LQ+VAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ
        ENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ALP+ +SG  DF SCYDSIGLSD+ETVRKRMEQIA+AQ
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ

Query:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK
        GLEGVSMEC +ILN+TLD+YLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGMWP+NHLRVQN SNGR EVLQEKSL+CSVSLLDFKVAMELNPK
Subjt:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRTFEE
        QLGEDWPLLLEKI MR FEE
Subjt:  QLGEDWPLLLEKICMRTFEE

A0A1S4E5S7 uncharacterized protein LOC1035037571.6e-21289.05Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+L+RFLGQK+ KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQ+ PIW NGVLP SPRKGRS+LR  KFRDRPSPLGPNGKITCLSY STG+EDS SKVITENGNVTLCDYQRPVQ+LQ+VAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ
        ENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGAR+ALP+ +SG  DF SCYDSIGLSD+ETVRKRMEQIA+AQ
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ

Query:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK
        GLEGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGMWP+NHLRVQN +NGR EVLQEKSL+CSVSLLDFKVAMELNPK
Subjt:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRTFEE
        QLGEDWPLLLEKI MR FEE
Subjt:  QLGEDWPLLLEKICMRTFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein1.6e-21289.05Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+L+RFLGQK+ KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPPPIN SGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQ+ PIW NGVLP SPRKGRS+LR  KFRDRPSPLGPNGKITCLSY STG+EDS SKVITENGNVTLCDYQRPVQ+LQ+VAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ
        ENDI+GAVQRPSEKPRIHPTEAAILE+GEEVEQSDPL FLRGPLLPPLGIPFCSASVGGAR+ALP+ +SG  DF SCYDSIGLSD+ETVRKRMEQIA+AQ
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG--DFSSCYDSIGLSDTETVRKRMEQIATAQ

Query:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK
        GLEGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGMWP+NHLRVQN +NGR EVLQEKSL+CSVSLLDFKVAMELNPK
Subjt:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRTFEE
        QLGEDWPLLLEKI MR FEE
Subjt:  QLGEDWPLLLEKICMRTFEE

A0A6J1CPD1 uncharacterized protein LOC1110128836.2e-24199.76Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGK+TCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL
        ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGL

Query:  EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQL
        EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQL
Subjt:  EGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRTFEE
        GEDWPLLLEKICMRTFEE
Subjt:  GEDWPLLLEKICMRTFEE

A0A6J1IIZ9 uncharacterized protein LOC1114767154.0e-20887.65Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFFYL++FLGQKL KVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPP INVSGHAQSVLQASNN+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
        CRED PEQTGSAFPNQNQ++PIW+NGVLP SPRKGRS+LR  KFRDRPSPLGPNGK  CLSY STGTED   KVITENGNVT+CDYQRPVQ LQAVAELP
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG---DFSSCYDSIGLSDTETVRKRMEQIATA
        ENDI+G+VQRPS KPRI PTEA+ILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ALP+ +SG   DF SCYDSIGLSD+ETVRKRMEQIATA
Subjt:  ENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSG---DFSSCYDSIGLSDTETVRKRMEQIATA

Query:  QGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNP
        QGLEGVS+EC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGKVINGMWP+NHLRVQN SNGR EVL+EKS +CSVSLLDFKVAMELNP
Subjt:  QGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRTFEE
        KQLGEDWPLLLEKI MR FEE
Subjt:  KQLGEDWPLLLEKICMRTFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.6e-4433.09Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++ K EFDKLC + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGP

Query:  EQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELPENDIEG
              +P ++    ++ + V P SPRK RS    RKFRDRPSPLGP GK   L   +T  ++S SK                 Q L             
Subjt:  EQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELPENDIEG

Query:  AVQRPSEKPRIHPTEAAILEDGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGLEGVS
                    P E   +EDGEEVEQ    P    R PL  PLG+ F   S    +      N  +  +C  S  L D  T+R R+E+    +G++ +S
Subjt:  AVQRPSEKPRIHPTEAAILEDGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGLEGVS

Query:  MECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQLGEDW
        M+ +N+LN  L+ Y+++LI+ CL L                                              Q+K    +VS+LDF  AME+NP+ LGE+W
Subjt:  MECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQLGEDW

Query:  PLLLEKICMRTFEE
        P+ LEKIC R  EE
Subjt:  PLLLEKICMRTFEE

AT2G24530.1 unknown protein2.0e-11151.66Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ     RI L +LK  IVKK G ++S+RYF+YL RFL QKL K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP
         R DG EQ+G+  PN +Q  P+WSNGVLP SPRK RS +++RK RDRPSPLG NGK+  + +     ED+   V  ENG     DYQR  +++       
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELP

Query:  ENDIEGAVQRPSEKPRIHPTE---AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATA
         ++ +G   RP EKPRI   E   A  + D +  E+   ++    PL+ PLGIPFCSASVGG+ R +P+  + +  SCYDS GL D E +RKRME IA A
Subjt:  ENDIEGAVQRPSEKPRIHPTE---AAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATA

Query:  QGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTG-HPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELN
        QGLEGVSMEC+  LN+ LD+YLK+LI SC +LV +RST    G   I KQQ+Q K++NG+WP+N L++Q + NG  ++ Q+     SVS+LDF+ AMELN
Subjt:  QGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTG-HPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELN

Query:  PKQLGEDWPLLLEKICMRTFEE
        P+QLGEDWP L E+I +R+FEE
Subjt:  PKQLGEDWPLLLEKICMRTFEE

AT4G31440.1 unknown protein4.3e-8546.79Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP
        MQ     RIDL +LK  IVKK+G ++S RYF+YL RFL QKL K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSP

Query:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKIT-CLSYPSTGTEDSGSKVITENGNVTLCD-YQRPVQHLQAVAE
         +EDGPE++ S  P+  +     SNGVL A  R G     DR  RD+P PLG NGK+    +Y   G      +   E  +  LC   Q+ V     VA 
Subjt:  CREDGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKIT-CLSYPSTGTEDSGSKVITENGNVTLCD-YQRPVQHLQAVAE

Query:  LPENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQ
            D E  V       RI  T                      P++ PLGIPFCSASVGG RR +P+  S    SCYDS GLSDTE +RKRME IA  Q
Subjt:  LPENDIEGAVQRPSEKPRIHPTEAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQ

Query:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTG-HPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNP
        GL GVS ECS +LN+ LDLYLK+L+KSC++L  +RS     G H ++KQQ++ +++NG+  +N   +Q +SN   ++ +E+    SVSLLDF+VAMELNP
Subjt:  GLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTG-HPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKICMRTFEE
         QLGEDWPLL E+I +  FEE
Subjt:  KQLGEDWPLLLEKICMRTFEE

AT4G33890.1 unknown protein1.0e-4633.18Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K+ K EFDKLC++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELPEND
                     +Q  P+  +     S RK RS    RK RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL    
Subjt:  DGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELPEND

Query:  IEGAVQRPSEKPRIHPTEAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARRALP----IGNSGDFSSCYDSIGLSDTETVRKRMEQIAT
              RP       P E   +E+GEEVEQ     P    R PL  PLG+   S   G  R+++        S +  +C ++  L DT T+R R+E+   
Subjt:  IEGAVQRPSEKPRIHPTEAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARRALP----IGNSGDFSSCYDSIGLSDTETVRKRMEQIAT

Query:  AQGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELN
         +GL+ ++M+  ++LNS LD+++++LI+ CL L  +R   +       +   Q + ++                             VS+ DF+  MELN
Subjt:  AQGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELN

Query:  PKQLGEDWPLLLEKICMRTFEE
         + LGEDWP+ +EKIC R  ++
Subjt:  PKQLGEDWPLLLEKICMRTFEE

AT4G33890.2 unknown protein1.0e-4633.18Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K+ K EFDKLC++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCRE

Query:  DGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELPEND
                     +Q  P+  +     S RK RS    RK RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL    
Subjt:  DGPEQTGSAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELPEND

Query:  IEGAVQRPSEKPRIHPTEAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARRALP----IGNSGDFSSCYDSIGLSDTETVRKRMEQIAT
              RP       P E   +E+GEEVEQ     P    R PL  PLG+   S   G  R+++        S +  +C ++  L DT T+R R+E+   
Subjt:  IEGAVQRPSEKPRIHPTEAAILEDGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARRALP----IGNSGDFSSCYDSIGLSDTETVRKRMEQIAT

Query:  AQGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELN
         +GL+ ++M+  ++LNS LD+++++LI+ CL L  +R   +       +   Q + ++                             VS+ DF+  MELN
Subjt:  AQGLEGVSMECSNILNSTLDLYLKQLIKSCLELVRSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELN

Query:  PKQLGEDWPLLLEKICMRTFEE
         + LGEDWP+ +EKIC R  ++
Subjt:  PKQLGEDWPLLLEKICMRTFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCAAAGCGCTACTTCTTCTACTTGAATAGATT
CTTGGGTCAGAAGCTGGGCAAGGTTGAATTTGATAAGTTGTGCGTCCGTGTGCTTGGAAGGGAAAATATTCAGCTCCACAATCAATTAATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTGTCAGGACATGCCCAATCTGTGCTACAAGCTTCAAACAATTCTCCTTGCAGGGAAGATGGTCCTGAACAAACTGGA
TCTGCTTTTCCAAATCAGAATCAGACTGTACCAATTTGGTCAAATGGAGTTCTGCCAGCATCCCCACGGAAGGGTAGGTCTCTCTTGCGTGATCGAAAGTTTAGGGATAG
GCCAAGTCCACTTGGACCAAATGGAAAGATCACATGTCTTTCATATCCATCAACTGGTACCGAAGATAGTGGCAGCAAAGTTATTACAGAGAATGGTAACGTAACCTTGT
GTGACTATCAGAGACCAGTACAGCATCTCCAAGCAGTAGCCGAGCTACCTGAGAATGACATAGAGGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACA
GAAGCAGCTATTCTTGAAGATGGAGAAGAGGTGGAACAGTCGGATCCCTTAAGCTTCCTGAGAGGTCCTCTGCTTCCACCTCTTGGCATTCCATTTTGTTCGGCTAGTGT
AGGTGGGGCACGCAGAGCCTTACCAATTGGCAATAGTGGTGATTTTTCAAGTTGTTATGACAGTATTGGACTGTCTGATACAGAGACAGTGAGAAAGCGCATGGAGCAAA
TTGCAACTGCGCAAGGGCTTGAAGGCGTCTCCATGGAATGTTCTAACATCCTGAATAGTACTCTGGATTTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTAGTG
AGATCAAGGTCTACACTTGAGCATACGGGCCACCCAATACAGAAGCAACAGAATCAAGGGAAGGTAATAAATGGTATGTGGCCTAGTAACCACCTACGTGTACAGAATAG
CAGCAATGGACGACCCGAAGTTTTGCAGGAAAAGAGTTTAGACTGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCGAAGCAGCTTGGGGAAGATT
GGCCTTTGCTGTTGGAGAAAATTTGTATGCGTACCTTTGAGGAA
mRNA sequenceShow/hide mRNA sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCAAAGCGCTACTTCTTCTACTTGAATAGATT
CTTGGGTCAGAAGCTGGGCAAGGTTGAATTTGATAAGTTGTGCGTCCGTGTGCTTGGAAGGGAAAATATTCAGCTCCACAATCAATTAATAAGGTCAATTTTGAAGAATG
CATGTGTAGCCAAGACCCCACCACCAATAAATGTGTCAGGACATGCCCAATCTGTGCTACAAGCTTCAAACAATTCTCCTTGCAGGGAAGATGGTCCTGAACAAACTGGA
TCTGCTTTTCCAAATCAGAATCAGACTGTACCAATTTGGTCAAATGGAGTTCTGCCAGCATCCCCACGGAAGGGTAGGTCTCTCTTGCGTGATCGAAAGTTTAGGGATAG
GCCAAGTCCACTTGGACCAAATGGAAAGATCACATGTCTTTCATATCCATCAACTGGTACCGAAGATAGTGGCAGCAAAGTTATTACAGAGAATGGTAACGTAACCTTGT
GTGACTATCAGAGACCAGTACAGCATCTCCAAGCAGTAGCCGAGCTACCTGAGAATGACATAGAGGGAGCAGTTCAGCGGCCATCAGAAAAACCAAGGATACATCCAACA
GAAGCAGCTATTCTTGAAGATGGAGAAGAGGTGGAACAGTCGGATCCCTTAAGCTTCCTGAGAGGTCCTCTGCTTCCACCTCTTGGCATTCCATTTTGTTCGGCTAGTGT
AGGTGGGGCACGCAGAGCCTTACCAATTGGCAATAGTGGTGATTTTTCAAGTTGTTATGACAGTATTGGACTGTCTGATACAGAGACAGTGAGAAAGCGCATGGAGCAAA
TTGCAACTGCGCAAGGGCTTGAAGGCGTCTCCATGGAATGTTCTAACATCCTGAATAGTACTCTGGATTTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTAGTG
AGATCAAGGTCTACACTTGAGCATACGGGCCACCCAATACAGAAGCAACAGAATCAAGGGAAGGTAATAAATGGTATGTGGCCTAGTAACCACCTACGTGTACAGAATAG
CAGCAATGGACGACCCGAAGTTTTGCAGGAAAAGAGTTTAGACTGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCGAAGCAGCTTGGGGAAGATT
GGCCTTTGCTGTTGGAGAAAATTTGTATGCGTACCTTTGAGGAA
Protein sequenceShow/hide protein sequence
MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLNRFLGQKLGKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINVSGHAQSVLQASNNSPCREDGPEQTG
SAFPNQNQTVPIWSNGVLPASPRKGRSLLRDRKFRDRPSPLGPNGKITCLSYPSTGTEDSGSKVITENGNVTLCDYQRPVQHLQAVAELPENDIEGAVQRPSEKPRIHPT
EAAILEDGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARRALPIGNSGDFSSCYDSIGLSDTETVRKRMEQIATAQGLEGVSMECSNILNSTLDLYLKQLIKSCLELV
RSRSTLEHTGHPIQKQQNQGKVINGMWPSNHLRVQNSSNGRPEVLQEKSLDCSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRTFEE