CuGenDBv2

MC03g0745 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0745
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: SAP30-binding protein-like isoform X2
Genome location: MC03:14216439..14222401
RNA-Seq Expression: MC03g0745
Synteny: MC03g0745
Gene Ontology terms: GO:0006355 - regulation of transcription, DNA-templated (biological process)
                     GO:0016874 - ligase activity (molecular function)
InterPro domains: IPR012479 - SAP30-binding protein
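
The genome location in this record is written as chromosome:start..end. A minimal sketch of splitting that string and computing the gene span is given here, assuming the coordinates are 1-based and inclusive (the page does not state this). The function name `parse_location` is hypothetical and is not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end


chrom, start, end = parse_location("MC03:14216439..14222401")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # MC03 14216439 14222401 5963
```

Under that assumption the gene spans 5963 bp on MC03, which is longer than the mRNA sequence listed further down, as expected if the gene contains introns.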


Homology
GenBank top hits: e value, % identity, alignment
KAE8653213.1 hypothetical protein Csa_019629 [Cucumis sativus] (e value: 8.97e-257; % identity: 87.02)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
        MASKKK+SEGIALLSMYNDEDDEMEDVE+ +EE D EL  QQ +EEGGEEDY GVRV EEE VANSDRMI+SDSANDSTPPVA ENLTPDKLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ

Query:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
        PPQV+VSSSPM+LQ GQLDN GRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGD DR+SPG   +ST NNL+T QISESPHSGSMNN
Subjt:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN

Query:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
         +PESET KVEETVEEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYK+AGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGYD
Subjt:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLISG
        KSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQP   VVTAPKINIPFSGVSA         APASDAIPR DGRQNKKSKWDKVDGDRRNP+ISG
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLISG

Query:  GSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSE
        GSD  +AHAA+LSAANVGSGY+AFAQQRRREAEEKRS +
Subjt:  GSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSE

XP_004150215.1 uncharacterized protein LOC101206323 [Cucumis sativus] (e value: 5.84e-262; % identity: 87.44)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
        MASKKK+SEGIALLSMYNDEDDEMEDVE+ +EE D EL  QQ +EEGGEEDY GVRV EEE VANSDRMI+SDSANDSTPPVA ENLTPDKLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ

Query:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
        PPQV+VSSSPM+LQ GQLDN GRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGD DR+SPG   +ST NNL+T QISESPHSGSMNN
Subjt:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN

Query:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
         +PESET KVEETVEEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYK+AGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGYD
Subjt:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLISG
        KSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQP   VVTAPKINIPFSGVSA         APASDAIPR DGRQNKKSKWDKVDGDRRNP+ISG
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLISG

Query:  GSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        GSD  +AHAA+LSAANVGSGY+AFAQQRRREAEEKRS ERKLDRRS
Subjt:  GSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

XP_008443368.1 PREDICTED: uncharacterized protein LOC103486971 [Cucumis melo] (e value: 1.65e-259; % identity: 87.25)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEE-QQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVE+ ++EE D EL  QQ QE GGEEDY GVRV EEE VANSDRMI+SDSANDSTPPVA ENLTPDKLK+GSSTP
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEE-QQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTP

Query:  QPPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMN
        QPP V+VSSSPM+LQ GQLDN GRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGD DRTSPG   +ST NNL+T QISESPHSGSMN
Subjt:  QPPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMN

Query:  NAIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        N +PESET KVEETVEEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYK+AGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NAIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLIS
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ    VVTAPKINIPFSGVSA         APASDAIPR DGRQNKKSKWDKVDGDRRNP+IS
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLIS

Query:  GGSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        GGSD  +AHAA+LSAANVGSGY+AFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

XP_022140005.1 uncharacterized protein LOC111010773 isoform X1 [Momordica charantia] (e value: 7.17e-300; % identity: 99.77)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
        MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ

Query:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
        PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
Subjt:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN

Query:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
Subjt:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEI-EADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHA
        KSDYYTEI EADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHA
Subjt:  KSDYYTEI-EADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHA

Query:  AMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        AMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
Subjt:  AMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

XP_022140007.1 uncharacterized protein LOC111010773 isoform X2 [Momordica charantia] (e value: 1.02e-301; % identity: 100)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
        MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ

Query:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
        PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
Subjt:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN

Query:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
Subjt:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHAA
        KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHAA
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHAA

Query:  MLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        MLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
Subjt:  MLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

TrEMBL top hits: e value, % identity, alignment
A0A0A0LX73 Uncharacterized protein (e value: 3.08e-254; % identity: 87.02)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
        MASKKK+SEGIALLSMYNDEDDEMEDVE+ +EE D EL  QQ +EEGGEEDY GVRV EEE VANSDRMI+SDSANDSTPPVA ENLTPDKLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ

Query:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
        PPQV+VSSSPM+LQ GQLDN GRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGD DR+SPG   +ST NNL+T QISESPHSGSMNN
Subjt:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN

Query:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
         +PESET KVEETVEEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYK+AGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGYD
Subjt:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLISG
        KSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQP   VVTAPKINIPFSGVSA         APASDAIPR DGRQNKKSKWDKVDGDRRNP+ISG
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLISG

Query:  GSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSE
        GSD  +AHAA+LSAANVGSGY+AFAQQRRREAEEKRS +
Subjt:  GSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSE

A0A1S3B7X1 uncharacterized protein LOC103486971 (e value: 8.01e-260; % identity: 87.25)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEE-QQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVE+ ++EE D EL  QQ QE GGEEDY GVRV EEE VANSDRMI+SDSANDSTPPVA ENLTPDKLK+GSSTP
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEE-QQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTP

Query:  QPPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMN
        QPP V+VSSSPM+LQ GQLDN GRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGD DRTSPG   +ST NNL+T QISESPHSGSMN
Subjt:  QPPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMN

Query:  NAIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        N +PESET KVEETVEEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYK+AGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NAIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLIS
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ    VVTAPKINIPFSGVSA         APASDAIPR DGRQNKKSKWDKVDGDRRNP+IS
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLIS

Query:  GGSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        GGSD  +AHAA+LSAANVGSGY+AFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

A0A5A7UPK6 SAP30-binding protein-like (e value: 8.01e-260; % identity: 87.25)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEE-QQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVE+ ++EE D EL  QQ QE GGEEDY GVRV EEE VANSDRMI+SDSANDSTPPVA ENLTPDKLK+GSSTP
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEE-QQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTP

Query:  QPPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMN
        QPP V+VSSSPM+LQ GQLDN GRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGD DRTSPG   +ST NNL+T QISESPHSGSMN
Subjt:  QPPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMN

Query:  NAIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        N +PESET KVEETVEEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYK+AGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NAIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLIS
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ    VVTAPKINIPFSGVSA         APASDAIPR DGRQNKKSKWDKVDGDRRNP+IS
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSA-VVTAPKINIPFSGVSA---------APASDAIPRGDGRQNKKSKWDKVDGDRRNPLIS

Query:  GGSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        GGSD  +AHAA+LSAANVGSGY+AFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

A0A6J1CDV1 uncharacterized protein LOC111010773 isoform X1 (e value: 3.47e-300; % identity: 99.77)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
        MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ

Query:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
        PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
Subjt:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN

Query:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
Subjt:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEI-EADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHA
        KSDYYTEI EADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHA
Subjt:  KSDYYTEI-EADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHA

Query:  AMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        AMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
Subjt:  AMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

A0A6J1CFK8 uncharacterized protein LOC111010773 isoform X2 (e value: 4.96e-302; % identity: 100)
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
        MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQ

Query:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
        PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN
Subjt:  PPQVLVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNN

Query:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
Subjt:  AIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHAA
        KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHAA
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHAA

Query:  MLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        MLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
Subjt:  MLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

SwissProt top hits: e value, % identity, alignment
Q02614 SAP30-binding protein (e value: 3.6e-15; % identity: 29.46)
Query:  EAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNNAIPESETAKVE-----ETVEEEKKD--------------IDP
        +A  G  EE G +     + D  G+ D + PG                E     S  +   +SET K E     +  E EK+D              + P
Subjt:  EAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNNAIPESETAKVE-----ETVEEEKKD--------------IDP

Query:  LDKFLPPPPKEKCSEELQRKINKFLEYK-RAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDYYTEIEADMKREMERKELE
         +  +PP P  +CS  LQ KI K  E K + G   N  ++ +K++RNP      +++  ID++G+ + KD+FDPHG+ +  YY  +    K EM++ E  
Subjt:  LDKFLPPPPKEKCSEELQRKINKFLEYK-RAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDYYTEIEADMKREMERKELE

Query:  RKKSPKMEFVSG---GTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWD
        +K+  K+EFV+G   GT  +A  T           S + AS A+      Q +KSKWD
Subjt:  RKKSPKMEFVSG---GTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWD

Q9UHR5 SAP30-binding protein (e value: 3.6e-15; % identity: 29.44)
Query:  NNAIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYK-RAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH
        N    + +  ++  +  E  +++ P +  +PP P  +CS  LQ KI K  E K + G   N  ++ +K++RNP      +++  ID++G+ + KD+FDPH
Subjt:  NNAIPESETAKVEETVEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYK-RAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH

Query:  GYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWD
        G+ +  YY  +    K EM++ E  +K+  K+EFV+ GT+      A       +  + A A          Q +KSKWD
Subjt:  GYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWD

Arabidopsis top hits: e value, % identity, alignment
AT1G29220.1 transcriptional regulator family protein (e value: 6.9e-70; % identity: 44.47)
Query:  KESEGIALLSMYNDEDD-EMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQPPQV
        K+SEGIALLS+Y+DEDD EMED EE++EE +    +Q+ QEE         ++ EE+ V  ++ M                                   
Subjt:  KESEGIALLSMYNDEDD-EMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQPPQV

Query:  LVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNNAIPE
                                       DE            EE GR   G E   T   LD     ++   TP +L   + S    S   N  I E
Subjt:  LVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNNAIPE

Query:  SETAKVEETVEEEKKDIDP-LDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSD
        S  A  E   +   +  D  LD+FLPP P+E+CSEELQRKI+KFL  K+ GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSKDVFDP GYD SD
Subjt:  SETAKVEETVEEEKKDIDP-LDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSD

Query:  YYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPAS------DAIPRGDGRQNKKSKWDKVDGDRRNPLISGGS----D
        +   IE DMK E ERKE E KK+ K++FVS GTQP AV  A K NIP  G+ A   S        I   DGR NKKSKWDKVDGD +NP ++ G+     
Subjt:  YYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPAS------DAIPRGDGRQNKKSKWDKVDGDRRNPLISGGS----D

Query:  PVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
         + ++AA++SA + GSGY AFAQQRRRE E +RSSERKL+RRS
Subjt:  PVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS

AT1G29220.2 transcriptional regulator family protein (e value: 3.2e-67; % identity: 43.3)
Query:  KESEGIALLSMYNDEDD-EMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQPPQV
        K+SEGIALLS+Y+DEDD EMED EE++EE +    +Q+ QEE         ++ EE+ V  ++ M                                   
Subjt:  KESEGIALLSMYNDEDD-EMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQPPQV

Query:  LVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNNAIPE
                                       DE            EE GR   G E   T   LD     ++   TP +L   + S    S   N  I E
Subjt:  LVSSSPMLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNNAIPE

Query:  SETAKVEETVEEEKKDIDP-LDKFLPPPPKEKCSEELQ------------RKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK
        S  A  E   +   +  D  LD+FLPP P+E+CSEELQ            RKI+KFL  K+ GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSK
Subjt:  SETAKVEETVEEEKKDIDP-LDKFLPPPPKEKCSEELQ------------RKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK

Query:  DVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPAS------DAIPRGDGRQNKKSKWDKVDGDRRN
        DVFDP GYD SD+   IE DMK E ERKE E KK+ K++FVS GTQP AV  A K NIP  G+ A   S        I   DGR NKKSKWDKVDGD +N
Subjt:  DVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPSAVVTAPKINIPFSGVSAAPAS------DAIPRGDGRQNKKSKWDKVDGDRRN

Query:  PLISGGS----DPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
        P ++ G+      + ++AA++SA + GSGY AFAQQRRRE E +RSSERKL+RRS
Subjt:  PLISGGS----DPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS


Sequences
CDS sequence
ATGGCATCGAAGAAGAAAGAATCAGAAGGTATAGCTTTGCTTTCGATGTATAATGATGAGGACGATGAGATGGAAGACGTCGAAGAACAACAAGAAGAAGGGGATACTGA
ACTGCCGCAGCAGCAGGGCCAAGAAGAGGGAGGAGAAGAAGATTATGGAGGAGTTAGGGTTACAGAAGAAGAGTCAGTTGCGAACAGTGACAGAATGATTCTCAGTGATT
CTGCCAATGATTCGACGCCGCCGGTTGCTGATGAAAATTTGACGCCAGATAAGCTCAAATTTGGGTCATCCACACCGCAACCGCCTCAGGTTTTGGTTTCATCGTCGCCG
ATGCTATTACAAGCTGGGCAACTAGATAATCCTGGTAGGAGAAGGGGGACGCTTGCGATAGTTGATTACGGTCACGATGAAGCCGCAATGTCTCCTGAGGCTGAGGATGG
AGAAATTGAAGAATCTGGTCGTGTCACTTTTGGCGATGAGCTTTTAGACACTAATGGTGATCTTGATAGAACATCTCCAGGAGCTGCAAGGGTTTCAACACCGAACAATC
TTGCCACTTCTCAAATTTCTGAATCACCACATTCTGGTTCAATGAACAATGCGATACCGGAATCTGAAACTGCAAAAGTTGAGGAAACCGTTGAAGAAGAGAAAAAAGAC
ATTGATCCCTTGGATAAGTTTCTTCCTCCACCACCAAAAGAAAAATGCTCAGAGGAGTTGCAAAGGAAAATCAATAAGTTTCTCGAGTATAAGAGAGCTGGAAAAAGCTT
CAATGCAGAAGTACGCAATAGGAAGGACTACCGGAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGATGTGT
TTGACCCCCATGGATATGATAAAAGTGACTACTATACTGAAATAGAGGCTGACATGAAACGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATGGAG
TTTGTTTCAGGAGGAACCCAACCTAGTGCAGTTGTGACTGCTCCTAAAATAAATATACCATTTTCAGGTGTTTCAGCAGCTCCTGCATCTGATGCCATTCCGAGGGGGGA
TGGAAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGTGACAGAAGAAACCCATTAATTTCTGGTGGGTCAGATCCAGTTACTGCTCATGCAGCTATGCTATCTG
CTGCTAATGTTGGCTCTGGATACGTGGCTTTTGCGCAACAGAGACGGCGAGAGGCTGAAGAAAAAAGATCCAGTGAGAGGAAGTTGGATAGAAGATCGTAA
mRNA sequence
CGTGTTTCCTTCAAGTTTCTTCGTTTTCCCTTTCACTGGTTTTGCCCACCGGGGAAACTGAAAACCCTTCTCCCTAGTATCGACATTGATTCAAATTTCAAATCTTGTGC
ATCCAATTTGGATCCGTCTTTCGATTTCTGAAGTCCCATTCCGTTGCTGTTGAACTTATTTCTTCCGGGTGCCGAGAATTTTAGCTCTCATGGCATCGAAGAAGAAAGAA
TCAGAAGGTATAGCTTTGCTTTCGATGTATAATGATGAGGACGATGAGATGGAAGACGTCGAAGAACAACAAGAAGAAGGGGATACTGAACTGCCGCAGCAGCAGGGCCA
AGAAGAGGGAGGAGAAGAAGATTATGGAGGAGTTAGGGTTACAGAAGAAGAGTCAGTTGCGAACAGTGACAGAATGATTCTCAGTGATTCTGCCAATGATTCGACGCCGC
CGGTTGCTGATGAAAATTTGACGCCAGATAAGCTCAAATTTGGGTCATCCACACCGCAACCGCCTCAGGTTTTGGTTTCATCGTCGCCGATGCTATTACAAGCTGGGCAA
CTAGATAATCCTGGTAGGAGAAGGGGGACGCTTGCGATAGTTGATTACGGTCACGATGAAGCCGCAATGTCTCCTGAGGCTGAGGATGGAGAAATTGAAGAATCTGGTCG
TGTCACTTTTGGCGATGAGCTTTTAGACACTAATGGTGATCTTGATAGAACATCTCCAGGAGCTGCAAGGGTTTCAACACCGAACAATCTTGCCACTTCTCAAATTTCTG
AATCACCACATTCTGGTTCAATGAACAATGCGATACCGGAATCTGAAACTGCAAAAGTTGAGGAAACCGTTGAAGAAGAGAAAAAAGACATTGATCCCTTGGATAAGTTT
CTTCCTCCACCACCAAAAGAAAAATGCTCAGAGGAGTTGCAAAGGAAAATCAATAAGTTTCTCGAGTATAAGAGAGCTGGAAAAAGCTTCAATGCAGAAGTACGCAATAG
GAAGGACTACCGGAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGATGTGTTTGACCCCCATGGATATGATA
AAAGTGACTACTATACTGAAATAGAGGCTGACATGAAACGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATGGAGTTTGTTTCAGGAGGAACCCAA
CCTAGTGCAGTTGTGACTGCTCCTAAAATAAATATACCATTTTCAGGTGTTTCAGCAGCTCCTGCATCTGATGCCATTCCGAGGGGGGATGGAAGACAAAACAAAAAATC
AAAATGGGATAAGGTAGATGGTGACAGAAGAAACCCATTAATTTCTGGTGGGTCAGATCCAGTTACTGCTCATGCAGCTATGCTATCTGCTGCTAATGTTGGCTCTGGAT
ACGTGGCTTTTGCGCAACAGAGACGGCGAGAGGCTGAAGAAAAAAGATCCAGTGAGAGGAAGTTGGATAGAAGATCGTAAGATCAATGCATTCTGTTCCATAGTATTAAG
TTTCGAACCATTATTAAATTAGTGGATTCCCCTCCTATGAGTAAACTTGAGGAAGTCGGAACCCTTTGAGAGGTACGTAGATATACAAAAATCCATGTCTGATAACCATC
AAGTCAGTGAAAAATTGACAAATGATCATGTTAGATGGGTTGGCCAAAATGAGACAGCGATGTATTTGCCATCCAGCTTCTGCCCTGTCCCTTTCCCCACGCCCCTCGAT
CTTCAACCCTGCAAACAGCCAAAATTCCAAGCAATTATAAGCCCCCACACAGGCCACACTCAGATTTGATTTTGGGTGTGTCCTCACTTCATCTTCCACGCTCTGGTTTT
CTCTCACACACTCGGCACTACCAAACACCACCGAATGCTTTAAAGTCAATGCCACCAGCCTTTAATTCTTCTCGATGCATGTTTGCCGAAAGTTACCTCTTTCCTAAATA
ACCGCAAAAGAGAGAAGAAAATAAAACCACAGACCCCACAACTTTGTTTGCCATTTTGCTCTAAATTAACCCATTTCCATTCGTTTAATACAAGTTGTTTATGTTTTTGA
AGCAAAATTAACCAAAAACTGTTGGGTTTGTGAAAAGAGTGCAAATAAGGTTCCACGTTGGTTAGGAAAGTGAGTGATTCATGGTTTATAAGGGAAGACAATTATTTCTA
TTGGTACAGAGGCATTTTGGGTGAAACTAAAGCAAAGTCGTGACAATATCATACTATTGTAGAGATATTTGGGAGAACGAAGATACAGTTGTCCTCACCAAAATCAATGA
TGAAAAGGGTATGTTCTTCCATATTTGTAACATCGGCAGGAAGGAGATCCGGAAAGCATCAGTCATAGAGGTGGTTGTGGTTTGTAATTATAGCGTTGAGTCTCTAATGT
TTTGTTGTTGCAGATTTTCAACACAAGTACGTGGGTCAGGTTGAAAAGGTAGGCATTGGCACCCGAGAATTGGTGCTTTTCCCTCCACTCGAGGATATGGGATTCCTTTG
GATTGGAAGATGAAGAAAACTGTTTCTTCTCCTAACTCTCATCCCCTGATTTGTGAGACCAAAATACCCATCTGGTGAAACTGGTTTTATATGTTTTTGCACCCTTTGCT
TTTTCTGCACACAGTTGCTTTTGGAATACAACACAACTTATCTTCTTTTCAGATGCATGTGAGCCTAAACACTCGTCTCATGGATGAATGTGAAAAGACTAGCTTTCGCC
TAAGTTGATACTTGGGAGAGGGTGATGGAGTTTTCGGCACTTCCATAAAAACTTGTATTAATTTTTCCTTTGTATGGATAAAAAAAGTGATCAAATCACCATTTTAGTCT
CTTTGGATTTGATCTCAATTTCAATTTCGTTTCTATGATTCTAAAATAGTCATTAAACTCTCTCAAAATGTTTG
Protein sequence
MASKKKESEGIALLSMYNDEDDEMEDVEEQQEEGDTELPQQQGQEEGGEEDYGGVRVTEEESVANSDRMILSDSANDSTPPVADENLTPDKLKFGSSTPQPPQVLVSSSP
MLLQAGQLDNPGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDLDRTSPGAARVSTPNNLATSQISESPHSGSMNNAIPESETAKVEETVEEEKKD
IDPLDKFLPPPPKEKCSEELQRKINKFLEYKRAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKME
FVSGGTQPSAVVTAPKINIPFSGVSAAPASDAIPRGDGRQNKKSKWDKVDGDRRNPLISGGSDPVTAHAAMLSAANVGSGYVAFAQQRRREAEEKRSSERKLDRRS
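
The protein sequence above should correspond to the translation of the CDS sequence listed earlier. A minimal sketch of that check follows, assuming the standard nuclear genetic code and using only the first 30 and last 15 nucleotides of the CDS shown on this page. The helper name `translate` is hypothetical, and a full check would paste in the entire CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the CDS sequence on this page.
cds_start = "ATGGCATCGAAGAAGAAAGAATCAGAAGGT"  # first 30 nt
cds_end = "GATAGAAGATCGTAA"                   # last 15 nt, includes the stop codon

print(translate(cds_start))  # MASKKKESEG
print(translate(cds_end))    # DRRS
```

Both fragments agree with the start (MASKKKESEG) and end (DRRS) of the protein sequence above.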