; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0008433 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0008433
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationchr04:2580832..2585152
RNA-Seq ExpressionIVF0008433
SyntenyIVF0008433
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136450.1 uncharacterized protein LOC101212293 [Cucumis sativus]7.39e-29798.09Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASN-SP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVL ASN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASN-SP

Query:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE
        CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTGSEDSSSKVITENGNVTLCDYQRPV+YLQSVAELPE
Subjt:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

XP_008466308.1 PREDICTED: uncharacterized protein LOC103503757 [Cucumis melo]5.93e-305100Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC

Query:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN
        REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN
Subjt:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN

Query:  DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL
        DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL
Subjt:  DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG
        EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKISMRAFEE
        EDWPLLLEKISMRAFEE
Subjt:  EDWPLLLEKISMRAFEE

XP_022976270.1 uncharacterized protein LOC111476715 [Cucurbita maxima]5.99e-27390.93Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNS-P
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVL ASN+ P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNS-P

Query:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE
        CRED PEQTGSAFPNQNQS PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNVT+CDYQRPVQ LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN+NGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPLLLEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

XP_023524221.1 uncharacterized protein LOC111788192 [Cucurbita pepo subsp. pepo]3.20e-27389.26Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASN-SP
        MQPQHSSRID+GDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+L ASN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASN-SP

Query:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE
        CREDGPE  GS FPNQNQ+ PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQS+G+EDSSSKVITENGNV +CDYQRPVQ+L++VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ
        NDIDGAV RPSEKPRIHPTEAA+LE+ +EVEQSDPL  LRGPLLPPLGIPFCSASVGGARKALPV SSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGM P+NH  VQN+NGRSEVLQEKSLECS SLLDFKVAME+NPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPL+LEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

XP_038899147.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida]4.89e-28694.02Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASN-SP
        MQPQHSSRIDLGDLKAQIVKKLGND+SKRYFF+LSRFLGQK+SKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPP INASGHAQSVL  SN SP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASN-SP

Query:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE
        CR+DGPEQTGSAFPNQNQS PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTG+EDS+SKVITENGNVT+CDYQRPVQ+LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAV RPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPV+SSGSSDFLSCYDSIGLSDS TVRKRMEQIA+AQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEH+GHPIQKQQNQGKV+N MWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKI MRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein2.3e-23298.09Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVL AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSP

Query:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE
        CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQSTGSEDSSSKVITENGNVTLCDYQRPV+YLQSVAELPE
Subjt:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
        NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQG

Query:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
        LEGVSMECP+ILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQN+NGRSEVLQEKSLECSVSLLDFKVAMELNPKQL
Subjt:  LEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQL

Query:  GEDWPLLLEKISMRAFEE
        GEDWPLLLEKISMRAFEE
Subjt:  GEDWPLLLEKISMRAFEE

A0A1S4E5S7 uncharacterized protein LOC1035037571.7e-238100Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC

Query:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN
        REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN
Subjt:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN

Query:  DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL
        DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL
Subjt:  DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG
        EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKISMRAFEE
        EDWPLLLEKISMRAFEE
Subjt:  EDWPLLLEKISMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein1.7e-238100Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC

Query:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN
        REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN
Subjt:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN

Query:  DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL
        DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL
Subjt:  DIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGL

Query:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG
        EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG
Subjt:  EGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLG

Query:  EDWPLLLEKISMRAFEE
        EDWPLLLEKISMRAFEE
Subjt:  EDWPLLLEKISMRAFEE

A0A6J1IIZ9 uncharacterized protein LOC1114767153.7e-21490.93Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSP
        MQ Q SSRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENIQLHNQLIRSILKNACVAKTPP IN SGHAQSVL AS N+P
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSP

Query:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE
        CRED PEQTGSAFPNQNQS PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNVT+CDYQRPVQ LQ+VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ
        NDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPL FLRGPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWPTNHLRVQN+NGRSEVL+EKS ECSVSLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPLLLEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

A0A6J1IVJ1 uncharacterized protein LOC1114809999.2e-21388.54Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+L AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHAS-NSP

Query:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE
        CREDGPE  GS FPNQNQ+ P+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQS+G+EDSSSKVITENGNV +CDYQRPVQ+L++VAELPE
Subjt:  CREDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ
        NDIDGAV RPSEKPRIHPTEAA+LE+ +EVEQS+PL  LRGPLLPPLGIPFCSASVGGA KALPV SSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR+RST EH+GHPIQKQQNQGK++NGMWP+NH  VQN+NG+SEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWPL+LEKISMRAFEE
Subjt:  LGEDWPLLLEKISMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein1.7e-4132.45Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPCREDGPE
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPCREDGPE

Query:  QTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPENDIDGAV
                +   K ++ + V P SPRK RS    KFRDRPSPLGP GK   L   +T +++S SK                                   
Subjt:  QTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPENDIDGAV

Query:  QRPSEKPRIHPTEAAILEEGEEVEQ--SDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVS
        QR        P E   +E+GEEVEQ    P    R PL  PLG+ F        +     S+    +  +C  S  L D  T+R R+E+    +G++ +S
Subjt:  QRPSEKPRIHPTEAAILEEGEEVEQ--SDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVS

Query:  MECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWP
        M+  N+LN  L+ Y+++LI+ CL L                                             Q+K    +VS+LDF  AME+NP+ LGE+WP
Subjt:  MECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWP

Query:  LLLEKISMRAFEE
        + LEKI  RA EE
Subjt:  LLLEKISMRAFEE

AT2G24530.1 unknown protein8.2e-11352.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC
        MQ     RI L +LK  IVKK G ++S+RYF++L RFL QK++K EFDK C+R+LGREN+ LHNQLIRSIL+NA VAK+PPP + +GH+     A+    
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC

Query:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE
        R DG EQ+G+  PN +Q +P+W NGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +Y+        
Subjt:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPE

Query:  NDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIAS
        ++ DG   RP EKPRI   E   A  + + +  E+   +     PL+ PLGIPFCSASVGG+ + +PVS+  +++ +SCYDS GL D E +RKRME IA 
Subjt:  NDIDGAVQRPSEKPRIHPTE---AAILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIAS

Query:  AQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELN
        AQGLEGVSMEC   LNN LDVYLK+LI SC +LV ARST    G   I KQQ+Q K++NG+WPTN L++Q  NG S++ Q+     SVS+LDF+ AMELN
Subjt:  AQGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELN

Query:  PKQLGEDWPLLLEKISMRAFEE
        P+QLGEDWP L E+IS+R+FEE
Subjt:  PKQLGEDWPLLLEKISMRAFEE

AT4G31440.1 unknown protein3.6e-8445.37Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC
        MQ     RIDL +LK  IVKK+G ++S RYF++L RFL QK++K EFDK C R+LGREN+ LHN+LIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPC

Query:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN
        +EDGPE++ S  P+  ++     NGVL    R G    R   RD+P PLG NGK+                       +    Y RP +Y         +
Subjt:  REDGPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPEN

Query:  DIDGAVQRPSEKPRIHPTE--AAILEEGEEVEQSDPLRFL-RGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASA
        + D A   P+E+  +   +  AA +   +E +    +R L   P++ PLGIPFCSASVGG R+ +PVS+S ++  +SCYDS GLSD+E +RKRME IA  
Subjt:  DIDGAVQRPSEKPRIHPTE--AAILEEGEEVEQSDPLRFL-RGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASA

Query:  QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNP
        QGL GVS EC  +LNN LD+YLK+L+KSC++L  ARS     G H ++KQQ++ +++NG+   N   +Q +N  S++ +E+    SVSLLDF+VAMELNP
Subjt:  QGLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSG-HPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNP

Query:  KQLGEDWPLLLEKISMRAFEE
         QLGEDWPLL E+IS+  FEE
Subjt:  KQLGEDWPLLLEKISMRAFEE

AT4G33890.1 unknown protein4.6e-4734.61Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPCRED
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S +   N   +++
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPCRED

Query:  GPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPENDID
                    +Q +P+  +     S RK RS    K RDRPSPLGP GK   L   +T +E+S SK                    QS  EL      
Subjt:  GPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPENDID

Query:  GAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASAQ
            RP       P E   +EEGEEVEQ     P    R PL  PLG+   S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    +
Subjt:  GAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GL+ ++M+  ++LN+ LDV++++LI+ CL L   R                         T+ +R  N     +  Q+      VS+ DF+  MELN + 
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWP+ +EKI  RA ++
Subjt:  LGEDWPLLLEKISMRAFEE

AT4G33890.2 unknown protein4.6e-4734.61Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPCRED
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S +   N   +++
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPCRED

Query:  GPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPENDID
                    +Q +P+  +     S RK RS    K RDRPSPLGP GK   L   +T +E+S SK                    QS  EL      
Subjt:  GPEQTGSAFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPENDID

Query:  GAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASAQ
            RP       P E   +EEGEEVEQ     P    R PL  PLG+   S   G  RK++   S  S  F   +C ++  L D+ T+R R+E+    +
Subjt:  GAVQRPSEKPRIHPTEAAILEEGEEVEQ---SDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDF--LSCYDSIGLSDSETVRKRMEQIASAQ

Query:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ
        GL+ ++M+  ++LN+ LDV++++LI+ CL L   R                         T+ +R  N     +  Q+      VS+ DF+  MELN + 
Subjt:  GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQ

Query:  LGEDWPLLLEKISMRAFEE
        LGEDWP+ +EKI  RA ++
Subjt:  LGEDWPLLLEKISMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGTGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGATAAGTCCAAGCGGTACTTCTTCTTCTTGAGCAGATT
CTTGGGTCAGAAGATGAGCAAGGTTGAATTTGATAAGGTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATG
CTTGTGTAGCCAAGACCCCACCACCAATAAATGCTTCAGGACATGCACAATCTGTGTTACACGCTTCCAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGATCT
GCCTTTCCAAATCAGAATCAGAGTAAACCAATTTGGCCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGGGATAGGCCAAG
TCCGCTTGGTCCAAATGGAAAAATCACATGTCTTTCGTATCAATCAACTGGCTCTGAAGATAGCAGCAGCAAAGTCATTACAGAGAATGGTAATGTAACCTTGTGTGACT
ATCAGAGACCAGTACAGTATCTCCAATCAGTAGCTGAGCTACCTGAAAATGACATAGATGGAGCAGTTCAAAGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCA
GCTATTCTTGAAGAAGGAGAGGAGGTGGAACAGTCGGATCCCTTAAGATTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGG
GGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTAGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACGGTGAGAAAACGCATGGAGCAAA
TTGCATCTGCACAAGGACTTGAAGGCGTTTCTATGGAATGTCCTAACATATTGAATAATACTTTGGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTAGTG
AGAGCAAGGTCTACATTTGAACATTCAGGGCACCCTATCCAGAAGCAACAAAATCAAGGGAAGGTCTTAAATGGCATGTGGCCTACTAACCACCTACGTGTACAGAACAA
CAATGGGCGATCCGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGACTGGC
CTTTGCTGTTGGAGAAAATTTCTATGCGCGCCTTTGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
TCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCATTTACATAACTTCATTCTTCTTCTCCTTTTCTTCTTCTTCTTCAATTAATCCCCCTTTTGT
TTTCCAACTTTCCTCATCATCCACTTTACTGATCTTCCCATCATTTTTACTTCCAATTTCCTACTTCCAACTTCAACCCACATCCCCAATTCCCTTTTATAAACAAGGGT
TTCTTCTATTTCTTACTCTCCTTCTGCAATTCGCGTCCTTTGCCTTTCTTCTTCCAGGCTGCTGTAAAACCCCCACTTTTTTGCTTGGTGCGCGTATGATGTATATTTGT
GTGTGTATATAATATATTTATATATATATTCCTTCAAATGGTGCGGCGTTTTAGACTTGATTTGTTCTGAATGACGTGGAAGTATAAATTCTGGGGGATTTTTGTATTGG
TACAATACCACAATTCAGCTTCAATTTTAGTGACTGGGGTTTGGAGCTACAGTTTATAGTGTATGAAACGAACCGGCCTCTGGAGAAATGCAACCTCAGCACAGCTCCAG
AATTGATTTAGGTGACTTGAAAGCTCAGATAGTTAAAAAACTTGGAAATGATAAGTCCAAGCGGTACTTCTTCTTCTTGAGCAGATTCTTGGGTCAGAAGATGAGCAAGG
TTGAATTTGATAAGGTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCTTGTGTAGCCAAGACCCCACCA
CCAATAAATGCTTCAGGACATGCACAATCTGTGTTACACGCTTCCAACTCTCCTTGCAGGGAAGATGGCCCTGAACAAACTGGATCTGCCTTTCCAAATCAGAATCAGAG
TAAACCAATTTGGCCAAATGGAGTTCTTCCAGTATCCCCACGGAAGGGTAGATCTGTCTTACGTGGAAAGTTTAGGGATAGGCCAAGTCCGCTTGGTCCAAATGGAAAAA
TCACATGTCTTTCGTATCAATCAACTGGCTCTGAAGATAGCAGCAGCAAAGTCATTACAGAGAATGGTAATGTAACCTTGTGTGACTATCAGAGACCAGTACAGTATCTC
CAATCAGTAGCTGAGCTACCTGAAAATGACATAGATGGAGCAGTTCAAAGGCCATCAGAAAAACCAAGGATACATCCAACAGAAGCAGCTATTCTTGAAGAAGGAGAGGA
GGTGGAACAGTCGGATCCCTTAAGATTCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTCA
GCAGTAGTGGCAGTAGTGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACGGTGAGAAAACGCATGGAGCAAATTGCATCTGCACAAGGACTTGAA
GGCGTTTCTATGGAATGTCCTAACATATTGAATAATACTTTGGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTAGTGAGAGCAAGGTCTACATTTGAACA
TTCAGGGCACCCTATCCAGAAGCAACAAAATCAAGGGAAGGTCTTAAATGGCATGTGGCCTACTAACCACCTACGTGTACAGAACAACAATGGGCGATCCGAAGTTTTGC
AGGAAAAGAGTTTAGAATGCTCGGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGACTGGCCTTTGCTGTTGGAGAAAATTTCT
ATGCGCGCCTTTGAGGAATAAGCAGTTTGCAAAGGTTTTATTTATTATTCTACAGCCATTCAAGGCTGGTCGTTCTGATCGTTGGGGGTTTAAAATTTACCTGGGATATC
ATCCCGTTCTTACCTTCTTGCTGGGGTATACTTGATCGCTCCCATTTGGTTCAGGCTCCAAGTTTCCATCCTAAAATATTGTGACTTTATTGCCTGCTCCATTGAGGAAG
TTGAATTAACCGACTAAGAAAGAGGGAGCTTTCAGCCAGAATGCCCATAAAAGGTTGTTCTGTTGCATTTTTAGCATTCTTCTTCACCCCATTTGCCACTACCATGTAAC
TTAGCTTTTCGTGTAACTTATTTTTGAGAGAATCAAAAAAGTTTTGCAGGGATACTGTCATTTAGTTTGGTGAAAGATGAATCACCTGGGTGAGAGCTTCAGGTGCTCTG
TATGTATCTTTAGAATCTGAGAATTTATTGGAATTTCAATATTAATAACAAATTAACATCCATTTCAACGCACTTTGTGGTTTTTCCCCTGTTCATCTCCATGATTAATC
AATATATAATCATTTTGGTCTTGTCTTCAAACTGAAATTAATTATTCTTAATCCCACCTTCACGATGTGAATAATGCCAGCTATAAATGTTAATGTGTATGTGTCGGAAG
TTCAAGTCTCAGGAGCAGCACAGCCTGACTTCAAAGTGAATGCAACCAGATATCTTGCGTGCTACTGAACAAATCAATGTTCTACAAGTGCCATGGCCAATTGGATGCAG
ATAATTCTTCTTAAAGGTTAACGGCATGATTGAAGTTTGAAGTAGACAAGAGTCAGAAGGCCATAATAACTCAAAAGTCTTTGTCCTTGCTGTTCATACCAACTGTGGCA
TGCTTGTTTTAAAGGATGATCATGTTGCTTTTTGCAGCAAATTCTGGTTTCATTTTACTCACTCAGCTATGTCTAAACTTTCTGTTCGTTTTCAGACTTTGATTCCTCAA
TGAGATATTCAAATGAGGTAGAAGCCACACTACTAAAGCACTTAAAAGAAAAAAAAGAACACTTCTCAGGATTCGATAACTGTGAGGTAGATGATCAAACCATCAACCTT
TGGAATGATAATTGATATCTATATTTACCTAGCTATGCACGTTGTAAAATATTATCTCGGATATTAATGTGCTTCTAGTTGTATAGGCTTTGTACATGCCCCAAGTGTTC
TATTTCTGTAACCATGTGTAAATCAATGCCCGTATAGGAGATGGAGACCATTCCGTTCGTTATCCAAAATGTGCATAGTTTGGGAG
Protein sequenceShow/hide protein sequence
MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENIQLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASNSPCREDGPEQTGS
AFPNQNQSKPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNVTLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEA
AILEEGEEVEQSDPLRFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSSDFLSCYDSIGLSDSETVRKRMEQIASAQGLEGVSMECPNILNNTLDVYLKQLIKSCLELV
RARSTFEHSGHPIQKQQNQGKVLNGMWPTNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE