; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G016140 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G016140
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationCmo_Chr01:12316960..12322318
RNA-Seq ExpressionCmoCh01G016140
SyntenyCmoCh01G016140
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608203.1 hypothetical protein SDJN03_01545, partial [Cucurbita argyrosperma subsp. sororia]6.8e-28198.2Show/hide
Query:  MMALQSTTRFLLFFPALLQFASLPFFFQAAINKPTLYASCTYCSSPTPSPLKWLPQFSFNLSDWGLELLQFTVYEPPSGEMQPQHSSRIDLGDLKAQIVK
        MMALQSTTRFLLFFPALLQFASLPFFFQAAINKPTLYASCTYCSSPTPSPLK      FNLSDWGLELLQFTVYEPPSGEMQPQHSSRIDLGDLKAQIVK
Subjt:  MMALQSTTRFLLFFPALLQFASLPFFFQAAINKPTLYASCTYCSSPTPSPLKWLPQFSFNLSDWGLELLQFTVYEPPSGEMQPQHSSRIDLGDLKAQIVK

Query:  KLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGPEHNGSTFPNQNQNQ
        KLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGPEHNGSTFPNQNQNQ
Subjt:  KLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGPEHNGSTFPNQNQNQ

Query:  AMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPENDIDGAVLRPSEKPRIHP
        AMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPENDIDGAVLRPSEKPRIHP
Subjt:  AMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPENDIDGAVLRPSEKPRIHP

Query:  TEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSIECPNILNNTLD
        TEAAVLEDRDEVEQSDPLSILRGPLLPPLG PFCSASVGGARKALPVGSSGSG DFLSCYDSIGLSD ETVRKRMEQIATAQGLEGVSIECPNILNNTLD
Subjt:  TEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSIECPNILNNTLD

Query:  VYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINPKQLGEDWPLMLEKISMRAFE
        VYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINPKQLGEDWPLMLEKISMRAFE
Subjt:  VYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINPKQLGEDWPLMLEKISMRAFE

Query:  K
        K
Subjt:  K

KAG7037562.1 hypothetical protein SDJN02_01190, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-23899.29Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLG PFCSASVGGARKALPVGSSGSG DFLSCYDSIGLSD ETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPLMLEKISMRAFEK
Subjt:  KQLGEDWPLMLEKISMRAFEK

XP_022940550.1 uncharacterized protein LOC111446116 [Cucurbita moschata]1.5e-240100Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPLMLEKISMRAFEK
Subjt:  KQLGEDWPLMLEKISMRAFEK

XP_022982022.1 uncharacterized protein LOC111480999 [Cucurbita maxima]4.2e-23096.44Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPEHNGSTFP  NQNQAMP+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQSSGTEDSSSKVITENGNVN+CDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQS+PLSILRGPLLPPLG PFCSASVGGA KALPVGSSGS  DFLSCYDSIGLSDSETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGK+INGM PSNHQHVQNSNG+SEVLQEKSLECSASLLDFKVAME+NP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPLMLEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEK

XP_023524221.1 uncharacterized protein LOC111788192 [Cucurbita pepo subsp. pepo]8.6e-23698.57Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRID+GDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPEHNGSTFP  NQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLG PFCSASVGGARKALPVGSSGS RDFLSCYDSIGLSDSETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPLMLEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEK

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein1.2e-21188.36Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+LQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPE  GS FPNQNQ++  PIWPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK TCLSYQS+G+EDSSSKVITENGNV +CDYQRPV++L++VAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAV RPSEKPRIHPTEAA+LE+ +EVEQSDPLS LRGPLLPPLG PFCSASVGGARKALPV SSGS  DFLSCYDSIGLSDSETVRKRMEQIA+
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVS+ECP+ILNNTLDVYLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGM P+NH  VQNSNGRSEVLQEKSLECS SLLDFKVAME+NP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPL+LEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEK

A0A1S4E5S7 uncharacterized protein LOC1035037571.0e-21088.36Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+L AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPE  GS FPNQNQ++  PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQS+G+EDSSSKVITENGNV +CDYQRPVQ+L++VAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAV RPSEKPRIHPTEAA+LE+ +EVEQSDPL  LRGPLLPPLG PFCSASVGGARKALPV SSGS  DFLSCYDSIGLSDSETVRKRMEQIA+
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGM P+NH  VQN+NGRSEVLQEKSLECS SLLDFKVAME+NP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPL+LEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEK

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein1.0e-21088.36Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+L AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPE  GS FPNQNQ++  PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQS+G+EDSSSKVITENGNV +CDYQRPVQ+L++VAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAV RPSEKPRIHPTEAA+LE+ +EVEQSDPL  LRGPLLPPLG PFCSASVGGARKALPV SSGS  DFLSCYDSIGLSDSETVRKRMEQIA+
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR+RST EH+GHPIQKQQNQGKV+NGM P+NH  VQN+NGRSEVLQEKSLECS SLLDFKVAME+NP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPL+LEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEK

A0A6J1FQY5 uncharacterized protein LOC1114461167.4e-241100Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPLMLEKISMRAFEK
Subjt:  KQLGEDWPLMLEKISMRAFEK

A0A6J1IVJ1 uncharacterized protein LOC1114809992.0e-23096.44Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
        CREDGPEHNGSTFP  NQNQAMP+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQSSGTEDSSSKVITENGNVN+CDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQS+PLSILRGPLLPPLG PFCSASVGGA KALPVGSSGS  DFLSCYDSIGLSDSETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGK+INGM PSNHQHVQNSNG+SEVLQEKSLECSASLLDFKVAME+NP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINP

Query:  KQLGEDWPLMLEKISMRAFEK
        KQLGEDWPLMLEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein2.5e-3931.89Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGP

Query:  EHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPENDID
              +P ++      ++ + V P SPRK RS    KFRDRPSPLGP GK   L   ++  ++S SK                 Q L            
Subjt:  EHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPENDID

Query:  GAVLRPSEKPRIHPTEAAVLEDRDEVEQ--SDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIATAQGL
                     P E   +ED +EVEQ    P    R PL  PLG  F   S     KA     +G  R+  +C  S  L D  T+R R+E+    +G+
Subjt:  GAVLRPSEKPRIHPTEAAVLEDRDEVEQ--SDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIATAQGL

Query:  EGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINPKQLG
        + +S++  N+LN  L+ Y+++LI+ CL L                                             Q+K    + S+LDF  AME+NP+ LG
Subjt:  EGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINPKQLG

Query:  EDWPLMLEKISMRAFEK
        E+WP+ LEKI  RA E+
Subjt:  EDWPLMLEKISMRAFEK

AT2G24530.1 unknown protein2.2e-10450Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQ     RI L +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHN+LIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAE
         R DG E +G+  PN +Q++  P+W NGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +++     
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAE

Query:  LPENDIDGAVLRPSEKPRIHPTE---AAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRME
           ++ DG  LRP EKPRI   E   A  + D    E+   +++   PL+ PLG PFCSASVGG+ + +PV ++    + +SCYDS GL D E +RKRME
Subjt:  LPENDIDGAVLRPSEKPRIHPTE---AAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRME

Query:  QIATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTG-HPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVA
         IA AQGLEGVS+EC   LNN LDVYLK+LI SC +LV +RST    G   I KQQ+Q K++NG+ P+N   +Q  NG S++ Q+     S S+LDF+ A
Subjt:  QIATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTG-HPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVA

Query:  MEINPKQLGEDWPLMLEKISMRAFEK
        ME+NP+QLGEDWP + E+IS+R+FE+
Subjt:  MEINPKQLGEDWPLMLEKISMRAFEK

AT4G31440.1 unknown protein3.3e-8445.39Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQ     RIDL +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK C R+LGREN+ LHNKLIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL
         +EDGPE + S  P+  +N       NGVL    R G    R   RD+P PLG NGK+                       +    Y RP ++       
Subjt:  CREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIH-PTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIA
          ++ D A L P+E+  +    + A    RD+  Q   LS    P++ PLG PFCSASVGG R+ +PV +S +    +SCYDS GLSD+E +RKRME IA
Subjt:  PENDIDGAVLRPSEKPRIH-PTEAAVLEDRDEVEQSDPLSILRGPLLPPLGFPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIA

Query:  TAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTG-HPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEI
          QGL GVS EC  +LNN LD+YLK+L+KSC++L  +RS     G H ++KQQ++ +++NG+R +N  H+Q SN  S++ +E+    S SLLDF+VAME+
Subjt:  TAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTG-HPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEI

Query:  NPKQLGEDWPLMLEKISMRAFEK
        NP QLGEDWPL+ E+IS+  FE+
Subjt:  NPKQLGEDWPLMLEKISMRAFEK

AT4G33890.1 unknown protein1.2e-4432.7Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCRE

Query:  DGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPEN
                   ++  +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   ++  E+S SK                    ++  EL   
Subjt:  DGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPEN

Query:  DIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQ---SDPLSILRGPLLPPLGFPFCSASVGGARKALP-VGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIA
              L    +P   P E   +E+ +EVEQ     P    R PL  PLG    S   G  RK++  V       +  +C ++  L D+ T+R R+E+  
Subjt:  DIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQ---SDPLSILRGPLLPPLGFPFCSASVGGARKALP-VGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIA

Query:  TAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEIN
          +GL+ ++++  ++LN+ LDV++++LI+ CL L  +R   +                  +R  N+Q+ Q S   S V          S+ DF+  ME+N
Subjt:  TAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEIN

Query:  PKQLGEDWPLMLEKISMRAFEK
         + LGEDWP+ +EKI  RA +K
Subjt:  PKQLGEDWPLMLEKISMRAFEK

AT4G33890.2 unknown protein1.2e-4432.7Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCRE

Query:  DGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPEN
                   ++  +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   ++  E+S SK                    ++  EL   
Subjt:  DGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPEN

Query:  DIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQ---SDPLSILRGPLLPPLGFPFCSASVGGARKALP-VGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIA
              L    +P   P E   +E+ +EVEQ     P    R PL  PLG    S   G  RK++  V       +  +C ++  L D+ T+R R+E+  
Subjt:  DIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQ---SDPLSILRGPLLPPLGFPFCSASVGGARKALP-VGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIA

Query:  TAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEIN
          +GL+ ++++  ++LN+ LDV++++LI+ CL L  +R   +                  +R  N+Q+ Q S   S V          S+ DF+  ME+N
Subjt:  TAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGMRPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEIN

Query:  PKQLGEDWPLMLEKISMRAFEK
         + LGEDWP+ +EKI  RA +K
Subjt:  PKQLGEDWPLMLEKISMRAFEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGCTTTGCAGTCTACTACAAGGTTTCTTTTGTTCTTTCCTGCTCTGCTGCAATTCGCGTCACTTCCGTTCTTCTTCCAGGCTGCTATAAATAAACCTACCCTTTA
TGCTTCGTGCACGTACTGTTCCTCCCCTACCCCCTCCCCACTCAAATGGTTACCGCAATTCAGCTTCAATTTGAGTGACTGGGGTTTGGAGCTGCTACAGTTTACAGTGT
ATGAACCGCCCTCTGGAGAAATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAGCGGTAC
TTCTTCTACTTGAGCAGATTTCTGGGTCAGAAACTGAGCAAGTTTGAGTTTGATAAGATGTGCGTCCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATAAATTGAT
AAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGGGTCAGGACATGCACAGTCTATGCTACAAGCTTCAAACAACTCACCTTGCAGGGAAG
ATGGCCCTGAACATAATGGATCTACTTTTCCGAATCAGAATCAGAATCAGGCTATGCCAATTTGGCCAAATGGAGTTCTTCCAGTTTCCCCACGGAAAGGGAGATCTGTC
TTACGTGGAAAGTTTAGGGATAGGCCGAGTCCGCTTGGTCCGAATGGAAAAATAACATGTCTTTCGTATCAATCATCTGGTACTGAAGATAGCAGCAGCAAAGTTATTAC
AGAGAATGGTAACGTAAACATGTGTGACTATCAGAGACCGGTACAGCATCTTGAAGCAGTAGCTGAGCTACCTGAGAATGACATAGACGGAGCAGTTCTGCGGCCATCAG
AAAAACCAAGAATACATCCAACAGAAGCAGCTGTTCTTGAAGATAGAGACGAGGTGGAACAGTCAGATCCCTTAAGCATCCTCAGAGGTCCTCTACTTCCACCTCTTGGT
TTTCCCTTTTGCTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTTGGTAGTAGTGGCAGTGGCCGTGATTTTCTGAGTTGTTACGACAGTATTGGATTGTCTGA
TTCAGAGACGGTGAGAAAACGAATGGAGCAAATCGCAACTGCTCAAGGGCTTGAAGGTGTTTCTATAGAATGTCCCAACATCTTGAATAACACTCTGGATGTGTACCTGA
AGCAATTGATCAAGTCTTGCCTTGAGTTGGTGAGATCAAGGTCTACAATTGAACATACAGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGTATG
CGGCCCAGCAACCACCAACACGTACAGAATAGCAACGGGCGGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGCGTCATTGCTCGATTTCAAAGTTGCTATGGA
GATCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGATGTTGGAGAAAATTTCTATGCGTGCCTTTGAGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGATGGCTTTGCAGTCTACTACAAGGTTTCTTTTGTTCTTTCCTGCTCTGCTGCAATTCGCGTCACTTCCGTTCTTCTTCCAGGCTGCTATAAATAAACCTACCCTTTA
TGCTTCGTGCACGTACTGTTCCTCCCCTACCCCCTCCCCACTCAAATGGTTACCGCAATTCAGCTTCAATTTGAGTGACTGGGGTTTGGAGCTGCTACAGTTTACAGTGT
ATGAACCGCCCTCTGGAGAAATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTGAAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAGCGGTAC
TTCTTCTACTTGAGCAGATTTCTGGGTCAGAAACTGAGCAAGTTTGAGTTTGATAAGATGTGCGTCCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATAAATTGAT
AAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAATGGGTCAGGACATGCACAGTCTATGCTACAAGCTTCAAACAACTCACCTTGCAGGGAAG
ATGGCCCTGAACATAATGGATCTACTTTTCCGAATCAGAATCAGAATCAGGCTATGCCAATTTGGCCAAATGGAGTTCTTCCAGTTTCCCCACGGAAAGGGAGATCTGTC
TTACGTGGAAAGTTTAGGGATAGGCCGAGTCCGCTTGGTCCGAATGGAAAAATAACATGTCTTTCGTATCAATCATCTGGTACTGAAGATAGCAGCAGCAAAGTTATTAC
AGAGAATGGTAACGTAAACATGTGTGACTATCAGAGACCGGTACAGCATCTTGAAGCAGTAGCTGAGCTACCTGAGAATGACATAGACGGAGCAGTTCTGCGGCCATCAG
AAAAACCAAGAATACATCCAACAGAAGCAGCTGTTCTTGAAGATAGAGACGAGGTGGAACAGTCAGATCCCTTAAGCATCCTCAGAGGTCCTCTACTTCCACCTCTTGGT
TTTCCCTTTTGCTCAGCTAGTGTAGGTGGGGCACGCAAGGCCTTGCCAGTTGGTAGTAGTGGCAGTGGCCGTGATTTTCTGAGTTGTTACGACAGTATTGGATTGTCTGA
TTCAGAGACGGTGAGAAAACGAATGGAGCAAATCGCAACTGCTCAAGGGCTTGAAGGTGTTTCTATAGAATGTCCCAACATCTTGAATAACACTCTGGATGTGTACCTGA
AGCAATTGATCAAGTCTTGCCTTGAGTTGGTGAGATCAAGGTCTACAATTGAACATACAGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGGTTATAAATGGTATG
CGGCCCAGCAACCACCAACACGTACAGAATAGCAACGGGCGGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGCGTCATTGCTCGATTTCAAAGTTGCTATGGA
GATCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGATGTTGGAGAAAATTTCTATGCGTGCCTTTGAGAAATAA
Protein sequenceShow/hide protein sequence
MMALQSTTRFLLFFPALLQFASLPFFFQAAINKPTLYASCTYCSSPTPSPLKWLPQFSFNLSDWGLELLQFTVYEPPSGEMQPQHSSRIDLGDLKAQIVKKLGNDKSKRY
FFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGPEHNGSTFPNQNQNQAMPIWPNGVLPVSPRKGRSV
LRGKFRDRPSPLGPNGKITCLSYQSSGTEDSSSKVITENGNVNMCDYQRPVQHLEAVAELPENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSDPLSILRGPLLPPLG
FPFCSASVGGARKALPVGSSGSGRDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKVINGM
RPSNHQHVQNSNGRSEVLQEKSLECSASLLDFKVAMEINPKQLGEDWPLMLEKISMRAFEK