; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh01G015690 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh01G015690
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionSAGA-Tad1 domain-containing protein
Genome locationCma_Chr01:10864752..10866945
RNA-Seq ExpressionCmaCh01G015690
SyntenyCmaCh01G015690
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0000124 - SAGA complex (cellular component)
GO:0003713 - transcription coactivator activity (molecular function)
InterPro domainsIPR024738 - Transcriptional coactivator Hfi1/Transcriptional adapter 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608203.1 hypothetical protein SDJN03_01545, partial [Cucurbita argyrosperma subsp. sororia]2.2e-24490.7Show/hide
Query:  FGSSLQFFEASFILVQAAINLPFMLRAL---------PQFSFNLSDWNLELLQFTVYEPLSGEMQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRF
        F + LQF    F   QAAIN P +  +          P   FNLSDW LELLQFTVYEP SGEMQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRF
Subjt:  FGSSLQFFEASFILVQAAINLPFMLRAL---------PQFSFNLSDWNLELLQFTVYEPLSGEMQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRF

Query:  LGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGPEHNGSTFP--NQNQAMPMWPNGVLPVSPRKG
        LGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGPEHNGSTFP  NQNQAMP+WPNGVLPVSPRKG
Subjt:  LGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGPEHNGSTFP--NQNQAMPMWPNGVLPVSPRKG

Query:  RSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNP
        RSVLRGKFRDRPSPLGPNGKIT LSYQSSGTEDSSSKVITENGNVN+CDYQRPVQHLEAVAELPENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQS+P
Subjt:  RSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNP

Query:  LSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSR
        LSILRGPLLPPLGIPFCSASVGGA KALPVGSSGS HDFLSCYDSIGLSD ETVRKRMEQIATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSR
Subjt:  LSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSR

Query:  STIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQLGEDWPLMLEKISMRAFEE
        STIEHTGHPIQKQQNQGK+INGM PSNHQHVQNSNG+SEVLQEKSLECSASLLDFKVAME+NPKQLGEDWPLMLEKISMRAFE+
Subjt:  STIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQLGEDWPLMLEKISMRAFEE

KAG7037562.1 hypothetical protein SDJN02_01190, partial [Cucurbita argyrosperma subsp. argyrosperma]7.9e-23196.67Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFP--NQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAEL
        CREDGPEHNGSTFP  NQNQAMP+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQSSGTEDSSSKVITENGNVN+CDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFP--NQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQS+PLSILRGPLLPPLGIPFCSASVGGA KALPVGSSGS HDFLSCYDSIGLSD ETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGK+INGM PSNHQHVQNSNG+SEVLQEKSLECSASLLDFKVAME+NP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNP

Query:  KQLGEDWPLMLEKISMRAFEE
        KQLGEDWPLMLEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEE

XP_022940550.1 uncharacterized protein LOC111446116 [Cucurbita moschata]5.1e-23096.44Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFP--NQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAEL
        CREDGPEHNGSTFP  NQNQAMP+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQSSGTEDSSSKVITENGNVN+CDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFP--NQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQS+PLSILRGPLLPPLG PFCSASVGGA KALPVGSSGS  DFLSCYDSIGLSDSETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGK+INGM PSNHQHVQNSNG+SEVLQEKSLECSASLLDFKVAME+NP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNP

Query:  KQLGEDWPLMLEKISMRAFEE
        KQLGEDWPLMLEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEE

XP_022982022.1 uncharacterized protein LOC111480999 [Cucurbita maxima]2.1e-239100Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
        CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
Subjt:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE

Query:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
Subjt:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLMLEKISMRAFEE
        LGEDWPLMLEKISMRAFEE
Subjt:  LGEDWPLMLEKISMRAFEE

XP_023524221.1 uncharacterized protein LOC111788192 [Cucurbita pepo subsp. pepo]1.1e-23297.37Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRID+GDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
        CREDGPEHNGSTFPNQNQAMP+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQSSGTEDSSSKVITENGNVN+CDYQRPVQHLEAVAELPE
Subjt:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE

Query:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQS+PLSILRGPLLPPLGIPFCSASVGGA KALPVGSSGSS DFLSCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGK+INGM PSNHQHVQNSNG+SEVLQEKSLECSASLLDFKVAME+NPKQ
Subjt:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLMLEKISMRAFEE
        LGEDWPLMLEKISMRAFEE
Subjt:  LGEDWPLMLEKISMRAFEE

TrEMBL top hitse value%identityAlignment
A0A0A0LGS9 Uncharacterized protein1.8e-21288.54Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+LQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
        CREDGPE  GS FPNQNQ+ P+WPNGVLPVSPRKGRS LRGKFRDRPSPLGPNGK T LSYQS+G+EDSSSKVITENGNV +CDYQRPV++L++VAELPE
Subjt:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE

Query:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAV RPSEKPRIHPTEAA+LE+ +EVEQS+PLS LRGPLLPPLGIPFCSASVGGA KALPV SSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVS+ECP+ILNNTLDVYLKQLIKSCLELVR+RST EH+GHPIQKQQNQGK++NGMWP+NH  VQNSNG+SEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLMLEKISMRAFEE
        LGEDWPL+LEKISMRAFEE
Subjt:  LGEDWPLMLEKISMRAFEE

A0A1S4E5S7 uncharacterized protein LOC1035037571.5e-21188.54Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+L AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
        CREDGPE  GS FPNQNQ+ P+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQS+G+EDSSSKVITENGNV +CDYQRPVQ+L++VAELPE
Subjt:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE

Query:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAV RPSEKPRIHPTEAA+LE+ +EVEQS+PL  LRGPLLPPLGIPFCSASVGGA KALPV SSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR+RST EH+GHPIQKQQNQGK++NGMWP+NH  VQN+NG+SEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLMLEKISMRAFEE
        LGEDWPL+LEKISMRAFEE
Subjt:  LGEDWPLMLEKISMRAFEE

A0A5A7TBJ9 SAGA-Tad1 domain-containing protein1.5e-21188.54Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFF+LSRFLGQK+SK EFDK+CVRVLGRENIQLHN+LIRSILKNACVAKTPPPIN SGHAQS+L AS NSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
        CREDGPE  GS FPNQNQ+ P+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQS+G+EDSSSKVITENGNV +CDYQRPVQ+L++VAELPE
Subjt:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE

Query:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAV RPSEKPRIHPTEAA+LE+ +EVEQS+PL  LRGPLLPPLGIPFCSASVGGA KALPV SSGSS DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Subjt:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVS+ECPNILNNTLDVYLKQLIKSCLELVR+RST EH+GHPIQKQQNQGK++NGMWP+NH  VQN+NG+SEVLQEKSLECS SLLDFKVAMELNPKQ
Subjt:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLMLEKISMRAFEE
        LGEDWPL+LEKISMRAFEE
Subjt:  LGEDWPLMLEKISMRAFEE

A0A6J1FQY5 uncharacterized protein LOC1114461162.5e-23096.44Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFP--NQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAEL
        CREDGPEHNGSTFP  NQNQAMP+WPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKIT LSYQSSGTEDSSSKVITENGNVN+CDYQRPVQHLEAVAEL
Subjt:  CREDGPEHNGSTFP--NQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAEL

Query:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIAT
        PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQS+PLSILRGPLLPPLG PFCSASVGGA KALPVGSSGS  DFLSCYDSIGLSDSETVRKRMEQIAT
Subjt:  PENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIAT

Query:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNP
        AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGK+INGM PSNHQHVQNSNG+SEVLQEKSLECSASLLDFKVAME+NP
Subjt:  AQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNP

Query:  KQLGEDWPLMLEKISMRAFEE
        KQLGEDWPLMLEKISMRAFE+
Subjt:  KQLGEDWPLMLEKISMRAFEE

A0A6J1IVJ1 uncharacterized protein LOC1114809991.0e-239100Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
        CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
Subjt:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE

Query:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
        NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
Subjt:  NDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
        GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
Subjt:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLMLEKISMRAFEE
        LGEDWPLMLEKISMRAFEE
Subjt:  LGEDWPLMLEKISMRAFEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14850.1 unknown protein6.2e-4031.81Show/hide
Query:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGP
        SR++  ++KA I +K+G+ ++  YF  L +FL  ++SK EFDK+C + +GRENI LHN+L+RSILKNA VAK+PPP                        
Subjt:  SRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGP

Query:  EHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDIDGA
              +P ++    ++ + V P SPRK RS    KFRDRPSPLGP GK   L   ++  ++S SK                 Q L              
Subjt:  EHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDIDGA

Query:  VLRPSEKPRIHPTEAAVLEDRDEVEQ--SNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQGLEG
                   P E   +ED +EVEQ   +P    R PL  PLG+ F        LK+    S+ +  +  +C  S  L D  T+R R+E+    +G++ 
Subjt:  VLRPSEKPRIHPTEAAVLEDRDEVEQ--SNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQGLEG

Query:  VSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQLGED
        +S++  N+LN  L+ Y+++LI+ CL L                                             Q+K    + S+LDF  AME+NP+ LGE+
Subjt:  VSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQLGED

Query:  WPLMLEKISMRAFEE
        WP+ LEKI  RA EE
Subjt:  WPLMLEKISMRAFEE

AT2G24530.1 unknown protein6.9e-10851.42Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQ     RI L +LK  IVKK G ++S+RYF+YL RFL QKL+K EFDK C+R+LGREN+ LHN+LIRSIL+NA VAK+PPP + +GH+      +N   
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELP
         R DG E +G+  PN +Q  P+W NGVLP+SPRK RS ++  K RDRPSPLG NGK+  + +Q    ED+   V  ENG     DYQR  +++       
Subjt:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELP

Query:  ENDIDGAVLRPSEKPRIHPTE---AAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQI
         ++ DG  LRP EKPRI   E   A  + D    E+   +++   PL+ PLGIPFCSASVGG+ + +PV ++    + +SCYDS GL D E +RKRME I
Subjt:  ENDIDGAVLRPSEKPRIHPTE---AAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQI

Query:  ATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTG-HPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAME
        A AQGLEGVS+EC   LNN LDVYLK+LI SC +LV +RST    G   I KQQ+Q KI+NG+WP+N   +Q  NG S++ Q+     S S+LDF+ AME
Subjt:  ATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTG-HPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAME

Query:  LNPKQLGEDWPLMLEKISMRAFEE
        LNP+QLGEDWP + E+IS+R+FEE
Subjt:  LNPKQLGEDWPLMLEKISMRAFEE

AT4G31440.1 unknown protein1.1e-8445.84Show/hide
Query:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP
        MQ     RIDL +LK  IVKK+G ++S RYF+YL RFL QKL+K EFDK C R+LGREN+ LHNKLIRSIL+NA +AK+PP ++ SGH    L       
Subjt:  MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSP

Query:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE
         +EDGPE + S  P+  +      NGVL    R G    R   RD+P PLG NGK+ G                          Y RP ++         
Subjt:  CREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPE

Query:  NDIDGAVLRPSEKPRIH-PTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATA
        ++ D A L P+E+  +    + A    RD+  Q   LS    P++ PLGIPFCSASVGG  + +PV +S ++   +SCYDS GLSD+E +RKRME IA  
Subjt:  NDIDGAVLRPSEKPRIH-PTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATA

Query:  QGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTG-HPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNP
        QGL GVS EC  +LNN LD+YLK+L+KSC++L  +RS     G H ++KQQ++ +++NG+  +N  H+Q SN  S++ +E+    S SLLDF+VAMELNP
Subjt:  QGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTG-HPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNP

Query:  KQLGEDWPLMLEKISMRAFEE
         QLGEDWPL+ E+IS+  FEE
Subjt:  KQLGEDWPLMLEKISMRAFEE

AT4G33890.1 unknown protein1.6e-4331.98Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCRE

Query:  DGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDI
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   ++  E+S SK                    ++  EL     
Subjt:  DGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDI

Query:  DGAVLRPSEKPRIHPTEAAVLEDRDEVEQ---SNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
            L    +P   P E   +E+ +EVEQ    +P    R PL  PLG+     +         V     S +  +C ++  L D+ T+R R+E+    +
Subjt:  DGAVLRPSEKPRIHPTEAAVLEDRDEVEQ---SNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
        GL+ ++++  ++LN+ LDV++++LI+ CL L  +R   +                      N+Q+ Q S   S V          S+ DF+  MELN + 
Subjt:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLMLEKISMRAFEE
        LGEDWP+ +EKI  RA ++
Subjt:  LGEDWPLMLEKISMRAFEE

AT4G33890.2 unknown protein1.6e-4331.98Show/hide
Query:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCRE
        Q SSR+D  ++KA I +++GN +++ YF  L RF   K++K EFDK+C++ +GR+NI LHN+LIRSI+KNAC+AK+PP I   G   S ++  N      
Subjt:  QHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDKMCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCRE

Query:  DGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDI
                     +Q  P+  +     S RK RS    K RDRPSPLGP GK   L   ++  E+S SK                    ++  EL     
Subjt:  DGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDI

Query:  DGAVLRPSEKPRIHPTEAAVLEDRDEVEQ---SNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ
            L    +P   P E   +E+ +EVEQ    +P    R PL  PLG+     +         V     S +  +C ++  L D+ T+R R+E+    +
Subjt:  DGAVLRPSEKPRIHPTEAAVLEDRDEVEQ---SNPLSILRGPLLPPLGIPFCSASVGGALKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQ

Query:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ
        GL+ ++++  ++LN+ LDV++++LI+ CL L  +R   +                      N+Q+ Q S   S V          S+ DF+  MELN + 
Subjt:  GLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQHVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQ

Query:  LGEDWPLMLEKISMRAFEE
        LGEDWP+ +EKI  RA ++
Subjt:  LGEDWPLMLEKISMRAFEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTTGGTTCTTCGCTGCAATTCTTCGAAGCAAGTTTTATTTTGGTTCAGGCTGCTATAAACCTACCCTTTATGCTTCGTGCACTACCGCAATTCAGCTTC
AATTTGAGTGACTGGAATTTGGAGCTGCTACAGTTTACAGTTTATGAACCGCTCTCTGGAGAAATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTG
AAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAGCGGTACTTCTTCTACTTGAGCAGATTTTTGGGTCAGAAACTGAGCAAGTTTGAATTTGATAAG
ATGTGCGTCCGTGTGCTTGGAAGGGAGAACATTCAGCTGCACAATAAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAAT
GGGTCAGGACATGCACAGTCTATGCTACAAGCTTCAAACAACTCACCTTGCAGGGAAGATGGCCCTGAACATAATGGATCTACTTTTCCGAATCAGAATCAGGCT
ATGCCAATGTGGCCAAATGGAGTTCTTCCAGTATCCCCACGGAAAGGGAGATCTGTCTTACGAGGAAAGTTCAGGGATAGACCGAGTCCGCTTGGTCCGAATGGA
AAAATCACAGGTCTTTCATATCAATCATCTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAACGTAAACATATGTGACTATCAGAGACCGGTA
CAGCATCTTGAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGAGCAGTTCTGCGGCCATCAGAAAAACCAAGAATACATCCAACAGAAGCAGCTGTTCTT
GAAGATAGAGACGAGGTGGAACAGTCAAATCCCTTAAGCATCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCCTTTTGCTCAGCTAGTGTAGGTGGGGCA
CTCAAGGCGTTGCCAGTTGGTAGTAGTGGCAGTAGTCATGATTTTCTGAGTTGTTACGACAGTATTGGATTGTCTGATTCAGAGACGGTGAGAAAACGAATGGAG
CAAATCGCAACTGCTCAAGGGCTTGAAGGTGTTTCTATAGAATGTCCCAACATCTTGAATAACACTCTGGATGTGTACCTGAAGCAATTGATCAAGTCTTGCCTT
GAGTTGGTGAGATCAAGGTCTACAATTGAACATACAGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGATTATAAATGGTATGTGGCCCAGTAACCACCAA
CACGTACAGAATAGCAACGGGCAGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGCGTCATTGCTCGATTTCAAAGTTGCTATGGAGCTCAATCCAAAG
CAGCTTGGGGAAGATTGGCCTTTGATGTTGGAGAAAATTTCTATGCGTGCCTTTGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTTGGTTCTTCGCTGCAATTCTTCGAAGCAAGTTTTATTTTGGTTCAGGCTGCTATAAACCTACCCTTTATGCTTCGTGCACTACCGCAATTCAGCTTC
AATTTGAGTGACTGGAATTTGGAGCTGCTACAGTTTACAGTTTATGAACCGCTCTCTGGAGAAATGCAACCTCAGCACAGCTCCAGAATTGATTTAGGCGACTTG
AAAGCTCAGATAGTTAAGAAACTTGGAAATGACAAGTCAAAGCGGTACTTCTTCTACTTGAGCAGATTTTTGGGTCAGAAACTGAGCAAGTTTGAATTTGATAAG
ATGTGCGTCCGTGTGCTTGGAAGGGAGAACATTCAGCTGCACAATAAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACCAATAAAT
GGGTCAGGACATGCACAGTCTATGCTACAAGCTTCAAACAACTCACCTTGCAGGGAAGATGGCCCTGAACATAATGGATCTACTTTTCCGAATCAGAATCAGGCT
ATGCCAATGTGGCCAAATGGAGTTCTTCCAGTATCCCCACGGAAAGGGAGATCTGTCTTACGAGGAAAGTTCAGGGATAGACCGAGTCCGCTTGGTCCGAATGGA
AAAATCACAGGTCTTTCATATCAATCATCTGGTACTGAAGATAGCAGCAGCAAAGTTATTACAGAGAATGGTAACGTAAACATATGTGACTATCAGAGACCGGTA
CAGCATCTTGAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGAGCAGTTCTGCGGCCATCAGAAAAACCAAGAATACATCCAACAGAAGCAGCTGTTCTT
GAAGATAGAGACGAGGTGGAACAGTCAAATCCCTTAAGCATCCTGAGAGGTCCTCTACTTCCACCTCTTGGTATTCCCTTTTGCTCAGCTAGTGTAGGTGGGGCA
CTCAAGGCGTTGCCAGTTGGTAGTAGTGGCAGTAGTCATGATTTTCTGAGTTGTTACGACAGTATTGGATTGTCTGATTCAGAGACGGTGAGAAAACGAATGGAG
CAAATCGCAACTGCTCAAGGGCTTGAAGGTGTTTCTATAGAATGTCCCAACATCTTGAATAACACTCTGGATGTGTACCTGAAGCAATTGATCAAGTCTTGCCTT
GAGTTGGTGAGATCAAGGTCTACAATTGAACATACAGGGCACCCTATCCAGAAGCAACAGAATCAAGGGAAGATTATAAATGGTATGTGGCCCAGTAACCACCAA
CACGTACAGAATAGCAACGGGCAGTCTGAAGTTTTGCAGGAAAAGAGTTTAGAATGCTCAGCGTCATTGCTCGATTTCAAAGTTGCTATGGAGCTCAATCCAAAG
CAGCTTGGGGAAGATTGGCCTTTGATGTTGGAGAAAATTTCTATGCGTGCCTTTGAGGAATAA
Protein sequenceShow/hide protein sequence
MKFGSSLQFFEASFILVQAAINLPFMLRALPQFSFNLSDWNLELLQFTVYEPLSGEMQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSRFLGQKLSKFEFDK
MCVRVLGRENIQLHNKLIRSILKNACVAKTPPPINGSGHAQSMLQASNNSPCREDGPEHNGSTFPNQNQAMPMWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNG
KITGLSYQSSGTEDSSSKVITENGNVNICDYQRPVQHLEAVAELPENDIDGAVLRPSEKPRIHPTEAAVLEDRDEVEQSNPLSILRGPLLPPLGIPFCSASVGGA
LKALPVGSSGSSHDFLSCYDSIGLSDSETVRKRMEQIATAQGLEGVSIECPNILNNTLDVYLKQLIKSCLELVRSRSTIEHTGHPIQKQQNQGKIINGMWPSNHQ
HVQNSNGQSEVLQEKSLECSASLLDFKVAMELNPKQLGEDWPLMLEKISMRAFEE