CuGenDBv2

Bhi09G002009 (gene) of Wax gourd (B227) v1 genome

Gene ID: Bhi09G002009
Organism: Benincasa hispida cv. B227 (Wax gourd (B227) v1)
Description: SAGA-Tad1 domain-containing protein
Genome location: chr9:64694016..64695272
RNA-Seq Expression: Bhi09G002009
Synteny: Bhi09G002009
Gene Ontology terms:
  GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
  GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
  GO:0000124 - SAGA complex (cellular component)
  GO:0003713 - transcription coactivator activity (molecular function)
InterPro domains: IPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1
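
The genome location above is written as `sequence:start..end`. As a minimal sketch, the string can be split into its parts and the span length computed. The helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chr9:64694016..64695272' style string into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


name, start, end = parse_location("chr9:64694016..64695272")
print(name, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene is 1257 bp.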


Homology
GenBank top hits | e value | % identity | Alignment
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus] | 1.7e-224 | 93.78
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP INASGHAQSVLQ SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFPNQNQS PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDS+SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG
        NDIDGAV RPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPV+SSGSSDFLSCYDSIGLSDS TVRKRMEQIA+AQG
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWPLLLEKI MRAFEE
Subjt:  GEDWPLLLEKICMRAFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo] | 2.2e-224 | 94.02
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP INASGHAQSVL  SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFPNQNQS PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG
        NDIDGAV RPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPV+SSGSSDFLSCYDSIGLSDS TVRKRMEQIA+AQG
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+N MWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWPLLLEKI MRAFEE
Subjt:  GEDWPLLLEKICMRAFEE

XP_022976270.1 uncharacterized protein LOC111476715 [Cucurbita maxima] | 5.7e-217 | 91.65
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQ Q SSRIDLGDLKAQIVKKLGND+SKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN +P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+D PEQTGSAFPNQNQSIPIW+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSS-DFLSCYDSIGLSDSGTVRKRMEQIATAQ
        NDIDG+V RPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPV+SSGS  DFLSCYDSIGLSDS TVRKRMEQIATAQ
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSS-DFLSCYDSIGLSDSGTVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

XP_023536522.1 uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo] | 3.7e-216 | 91.17
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQ Q  SRIDLGDLKAQIVKKLGND+SKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN +P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+D PEQTGSAFPNQNQSIPIW+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSS-DFLSCYDSIGLSDSGTVRKRMEQIATAQ
        NDIDG+V RPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPV+SSGS  DFLSCYDSIGLSDS TVRKRMEQIATAQ
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSS-DFLSCYDSIGLSDSGTVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GL+GVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida] | 2.2e-240 | 100
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG
        NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWPLLLEKICMRAFEE
Subjt:  GEDWPLLLEKICMRAFEE

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LGS9 Uncharacterized protein | 8.1e-225 | 93.78
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP INASGHAQSVLQ SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFPNQNQS PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTG+EDS+SKVITENGNVT+CDYQRPV++LQ+VAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG
        NDIDGAV RPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPV+SSGSSDFLSCYDSIGLSDS TVRKRMEQIA+AQG
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWPLLLEKI MRAFEE
Subjt:  GEDWPLLLEKICMRAFEE

A0A1S4E5S7 uncharacterized protein LOC103503757 | 1.1e-224 | 94.02
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP INASGHAQSVL  SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFPNQNQS PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG
        NDIDGAV RPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPV+SSGSSDFLSCYDSIGLSDS TVRKRMEQIA+AQG
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+N MWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWPLLLEKI MRAFEE
Subjt:  GEDWPLLLEKICMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein | 1.1e-224 | 94.02
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP INASGHAQSVL  SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+DGPEQTGSAFPNQNQS PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG
        NDIDGAV RPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPV+SSGSSDFLSCYDSIGLSDS TVRKRMEQIA+AQG
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+N MWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKICMRAFEE
        GEDWPLLLEKI MRAFEE
Subjt:  GEDWPLLLEKICMRAFEE

A0A6J1CPD1 uncharacterized protein LOC111012883 | 5.2e-216 | 89.76
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFFYL+RFLGQKL KVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP
        CR+DGPEQTGSAFPNQNQ++PIWSNGVLP SPRKGRS+LR  KFRDRPSPLGPNGK+TCLSY STGTEDS SKVITENGNVT+CDYQRPVQHLQAVAELP
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQ
        ENDI+GAV RPSEKPRIHPTEAAILE+GEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGAR+ALP+ +SG  DF SCYDSIGLSD+ TVRKRMEQIATAQ
Subjt:  ENDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        GLEGVSMEC NILN+TLD+YLKQLIKSCLELVR+RST EHTGHPIQKQQNQGKV+N MWP+NHLRVQN SNGR EVLQEKSL+CSVSLLDFKVAMELNPK
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQN-SNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRAFEE
        QLGEDWPLLLEKICMR FEE
Subjt:  QLGEDWPLLLEKICMRAFEE

A0A6J1IIZ9 uncharacterized protein LOC111476715 | 2.8e-217 | 91.65
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQ Q SSRIDLGDLKAQIVKKLGND+SKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN +P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
        CR+D PEQTGSAFPNQNQSIPIW+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNVTMCDYQRPVQ LQAVAELPE
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSS-DFLSCYDSIGLSDSGTVRKRMEQIATAQ
        NDIDG+V RPS KPRI PTEA+ILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPV+SSGS  DFLSCYDSIGLSDS TVRKRMEQIATAQ
Subjt:  NDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSS-DFLSCYDSIGLSDSGTVRKRMEQIATAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKV+N MWPTNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKICMRAFEE
        LGEDWPLLLEKI MRAFEE
Subjt:  LGEDWPLLLEKICMRAFEE

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT2G14850.1 unknown protein | 5.3e-43 | 32.85
Query:  SRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRDDGP
        SR++  ++KA I +K+G+ R+  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PP                         
Subjt:  SRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRDDGP

Query:  EQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGA
              +P ++    ++ + V P SPRK RS    KFRDRPSPLGP GK   L   +T  ++S SK                 Q L              
Subjt:  EQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGA

Query:  VHRPSEKPRIHPTEAAILEEGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQGLEGV
                   P E   +E+GEEVEQ    P    R PL  PLG+ F   S         +N        +C  S  L D  T+R R+E+    +G++ +
Subjt:  VHRPSEKPRIHPTEAAILEEGEEVEQ--SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQGLEGV

Query:  SMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDW
        SM+  N+LN  L+ Y+++LI+ CL L                                             Q+K    +VS+LDF  AME+NP+ LGE+W
Subjt:  SMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDW

Query:  PLLLEKICMRAFEE
        P+ LEKIC RA EE
Subjt:  PLLLEKICMRAFEE

AT2G24530.1 unknown protein | 7.0e-112 | 52.72
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQ     RI L +LK  IVKK G +RS+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PP  + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP
         R DG EQ+G+  PN +Q  P+WSNGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +++       
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELP

Query:  ENDIDGAVHRPSEKPRIHPTE---AAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIA
         ++ DG   RP EKPRI   E   A  + + +  E+   ++    PL+ PLGIPFCSASVGG+ + +PV  S +++ +SCYDS GL D   +RKRME IA
Subjt:  ENDIDGAVHRPSEKPRIHPTE---AAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIA

Query:  TAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMEL
         AQGLEGVSMEC   LNN LDVYLK+LI SC +LV ARST    G   I KQQ+Q K+VN +WPTN L++Q  NG S++ Q+     SVS+LDF+ AMEL
Subjt:  TAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMEL

Query:  NPKQLGEDWPLLLEKICMRAFEE
        NP+QLGEDWP L E+I +R+FEE
Subjt:  NPKQLGEDWPLLLEKICMRAFEE

AT4G31440.1 unknown protein | 1.8e-83 | 45.28
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP
        MQ     RIDL +LK  IVKK+G +RS RYF+YL RFL QKL+K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PPS++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISP

Query:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE
         ++DGPE++ S  P+  ++    SNGVL    R G    R   RD+P PLG NGK+                       +    Y RP ++         
Subjt:  CRDDGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPE

Query:  NDIDGAVHRPSEKPRIHPTE--AAILEEGEEVE---QSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQI
        ++ D A   P+E+  +   +  AA +   +E +    S P      P++ PLGIPFCSASVGG R+ +PV++S ++  +SCYDS GLSD+  +RKRME I
Subjt:  NDIDGAVHRPSEKPRIHPTE--AAILEEGEEVE---QSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQI

Query:  ATAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAME
        A  QGL GVS EC  +LNN LD+YLK+L+KSC++L  ARS     G H ++KQQ++ ++VN +   N   +Q SN  S++ +E+    SVSLLDF+VAME
Subjt:  ATAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTG-HPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAME

Query:  LNPKQLGEDWPLLLEKICMRAFEE
        LNP QLGEDWPLL E+I +  FEE
Subjt:  LNPKQLGEDWPLLLEKICMRAFEE

AT4G33890.1 unknown protein | 9.3e-48 | 35.24
Query:  QHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRD
        Q SSR+D  ++KA I +++GN R++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRD

Query:  DGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI
              G +  N +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL     
Subjt:  DGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI

Query:  DGAVHRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDF--LSCYDSIGLSDSGTVRKRMEQIATA
             RP       P E   +EEGEEVEQ     P    R PL  PLG+   S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    
Subjt:  DGAVHRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDF--LSCYDSIGLSDSGTVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ ++M+  ++LN+ LDV++++LI+ CL L   R                         T+ +R  N     +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRAFEE
         LGEDWP+ +EKIC RA ++
Subjt:  QLGEDWPLLLEKICMRAFEE

AT4G33890.2 unknown protein | 9.3e-48 | 35.24
Query:  QHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRD
        Q SSR+D  ++KA I +++GN R++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRD

Query:  DGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI
              G +  N +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   +T  E+S SK                    Q+  EL     
Subjt:  DGPEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDI

Query:  DGAVHRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDF--LSCYDSIGLSDSGTVRKRMEQIATA
             RP       P E   +EEGEEVEQ     P    R PL  PLG+   S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    
Subjt:  DGAVHRPSEKPRIHPTEAAILEEGEEVEQ---SDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDF--LSCYDSIGLSDSGTVRKRMEQIATA

Query:  QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK
        +GL+ ++M+  ++LN+ LDV++++LI+ CL L   R                         T+ +R  N     +  Q+      VS+ DF+  MELN +
Subjt:  QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPK

Query:  QLGEDWPLLLEKICMRAFEE
         LGEDWP+ +EKIC RA ++
Subjt:  QLGEDWPLLLEKICMRAFEE
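
In each alignment block above, the unlabeled middle line appears to follow the usual BLAST-style notation: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. That reading is an assumption, since the page does not define the notation. Under it, the identities in one block can be counted with a short sketch like the one below. The function names are hypothetical, and a per-block figure is not the same as the reported % identity, which covers the whole alignment.

```python
def count_identities(midline: str) -> int:
    """Count identity positions in an alignment midline.

    ASSUMES BLAST-style notation: a letter marks an identical residue,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    return sum(1 for ch in midline if ch.isalpha())


def percent_identity(midline: str, aligned_length: int) -> float:
    """Identities as a percentage of the aligned length of one block."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * count_identities(midline) / aligned_length


# Final block of the XP_004136450.1 alignment, copied from this record.
query = "GEDWPLLLEKICMRAFEE"
midline = "GEDWPLLLEKI MRAFEE"
print(count_identities(midline), "identities over", len(query), "positions")
print(round(percent_identity(midline, len(query)), 2))
```

The aligned length is taken from the query line instead of the midline, because trailing spaces in a midline are easily lost when a page is copied as text.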


Sequences
CDS sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGTGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAGGTCCAAGCGGTACTTCTTTTACTTGAGC
AGATTCTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGGAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATC
TTGAAGAATGCATGTGTAGCCAAGACCCCACCATCAATAAATGCTTCAGGACATGCACAATCTGTCCTACAACCTTCAAACATCTCGCCTTGCAGGGATGATGGC
CCTGAACAAACTGGATCTGCCTTTCCAAATCAGAATCAGAGTATACCAATTTGGTCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGTAGATCTGTCTTACGT
GGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAACAGCAAAGTTATTACA
GAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCAGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAAAATGACATAGATGGAGCAGTTCACCGGCCA
TCAGAAAAACCGAGGATACATCCAACGGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCAGATCCCTTAAGCTTCCTGCGAGGTCCTCTACTTCCA
CCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTCAACAGTAGTGGCAGTAGTGATTTTCTGAGTTGTTATGACAGTATT
GGATTATCTGATTCAGGGACAGTGAGAAAACGCATGGAGCAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACT
CTGGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCATCCGATCCAGAAGCAACAGAATCAA
GGGAAGGTTGTAAATGATATGTGGCCTACTAACCACCTACGTGTACAGAACAGCAATGGGAGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCC
TTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGTTGTTGGAGAAAATTTGTATGCGTGCCTTTGAGGAATAA
mRNA sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGTGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGACAGGTCCAAGCGGTACTTCTTTTACTTGAGC
AGATTCTTGGGTCAGAAGCTGAGCAAGGTTGAATTTGATAAGGTGTGTGTTCGTGTGCTTGGGAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATC
TTGAAGAATGCATGTGTAGCCAAGACCCCACCATCAATAAATGCTTCAGGACATGCACAATCTGTCCTACAACCTTCAAACATCTCGCCTTGCAGGGATGATGGC
CCTGAACAAACTGGATCTGCCTTTCCAAATCAGAATCAGAGTATACCAATTTGGTCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGTAGATCTGTCTTACGT
GGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGTACTGAAGATAGCAACAGCAAAGTTATTACA
GAGAATGGTAATGTAACCATGTGTGACTATCAGAGACCAGTACAGCATCTCCAAGCAGTAGCTGAGCTACCTGAAAATGACATAGATGGAGCAGTTCACCGGCCA
TCAGAAAAACCGAGGATACATCCAACGGAAGCAGCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCAGATCCCTTAAGCTTCCTGCGAGGTCCTCTACTTCCA
CCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTCAACAGTAGTGGCAGTAGTGATTTTCTGAGTTGTTATGACAGTATT
GGATTATCTGATTCAGGGACAGTGAGAAAACGCATGGAGCAAATTGCAACTGCACAAGGGCTTGAAGGTGTTTCTATGGAATGTCCTAACATCCTGAATAATACT
CTGGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAGCAAGGTCTACATTTGAACATACGGGGCATCCGATCCAGAAGCAACAGAATCAA
GGGAAGGTTGTAAATGATATGTGGCCTACTAACCACCTACGTGTACAGAACAGCAATGGGAGATCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGTGTCC
TTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGTTGTTGGAGAAAATTTGTATGCGTGCCTTTGAGGAATAA
Protein sequence
MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRDDG
PEQTGSAFPNQNQSIPIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNVTMCDYQRPVQHLQAVAELPENDIDGAVHRP
SEKPRIHPTEAAILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVNSSGSSDFLSCYDSIGLSDSGTVRKRMEQIATAQGLEGVSMECPNILNNT
LDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWPTNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRAFEE
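
The protein sequence above should correspond to a translation of the CDS. The following is a minimal sketch that checks this on the first 30 and last 12 nucleotides of the CDS, assuming the standard genetic code applies to this nuclear gene. `translate` is a hypothetical helper written for illustration, not a function provided by CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) raise KeyError in this sketch.
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCAACCTCAGCACAGCTCCAGAATTGAT"  # first 30 nt of the CDS above
cds_end = "TTTGAGGAATAA"                      # last 12 nt of the CDS above
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above. To check the whole record, join the CDS lines into one string before calling `translate`.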