; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc03G08060 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc03G08060
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionSAP30-binding protein-like
Genome locationClcChr03:8253039..8258013
RNA-Seq ExpressionClc03G08060
SyntenyClc03G08060
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016874 - ligase activity (molecular function)
InterPro domainsIPR012479 - SAP30-binding protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8653213.1 hypothetical protein Csa_019629 [Cucumis sativus]1.1e-22193.88Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED  EEEED E+HPQQM+EEGG+EDYAGVRV EEELVANSDRMII+DSANDSTPPVAGEN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV +STSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        NV+ ESETEKVE+T+EEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI E DMKREMERKELERKKSPKMEFV+GGTQPGGTVVTAPKINIPFSGVSAIT SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER
        GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ++
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER

XP_004150215.1 uncharacterized protein LOC101206323 [Cucumis sativus]2.8e-22594.41Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED  EEEED E+HPQQM+EEGG+EDYAGVRV EEELVANSDRMII+DSANDSTPPVAGEN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV +STSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        NV+ ESETEKVE+T+EEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI E DMKREMERKELERKKSPKMEFV+GGTQPGGTVVTAPKINIPFSGVSAIT SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ERKLDRRS
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_008443368.1 PREDICTED: uncharacterized protein LOC103486971 [Cucumis melo]5.0e-22795.08Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED EEEEED E+HPQQMQE GG+EDYAGVRV EEELVANSDRMII+DSANDSTPPVAGEN TPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVT+STSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        N + ESETEKVE+T+EEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI E DMKREMERKELERKKSPKMEFVSGGTQ GGTVVTAPKINIPFSGVSAIT SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_038894985.1 uncharacterized protein LOC120083338 isoform X1 [Benincasa hispida]3.5e-22895.08Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED+  EEEDSE+HPQQMQEEGG+EDYAGVRV EEELV NSDRMII+DSAN STPPVA ENSTPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPM LQAGQ DNSGRRRGT+ IVDYGHDE AMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        NVILESETEKVE T+EEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEIVE DMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPK+NIPFSGVSAITGSGLHSAAPASD IPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GG DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_038894986.1 uncharacterized protein LOC120083338 isoform X2 [Benincasa hispida]3.2e-22694.85Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED+  EEEDSE+HPQQMQEEGG+EDYAGVRV EEELV NSDRMII+DSAN STPPVA ENSTPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPM LQAGQ DNSGRRRGT+ IVDYGHDE AMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        NVILESETEKVE T+EEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI E DMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPK+NIPFSGVSAITGSGLHSAAPASD IPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GG DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

TrEMBL top hitse value%identityAlignment
A0A0A0LX73 Uncharacterized protein5.2e-22293.88Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED  EEEED E+HPQQM+EEGG+EDYAGVRV EEELVANSDRMII+DSANDSTPPVAGEN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV +STSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        NV+ ESETEKVE+T+EEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI E DMKREMERKELERKKSPKMEFV+GGTQPGGTVVTAPKINIPFSGVSAIT SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER
        GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ++
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER

A0A1S3B7X1 uncharacterized protein LOC1034869712.4e-22795.08Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED EEEEED E+HPQQMQE GG+EDYAGVRV EEELVANSDRMII+DSANDSTPPVAGEN TPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVT+STSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        N + ESETEKVE+T+EEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI E DMKREMERKELERKKSPKMEFVSGGTQ GGTVVTAPKINIPFSGVSAIT SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A5A7UPK6 SAP30-binding protein-like2.4e-22795.08Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED EEEEED E+HPQQMQE GG+EDYAGVRV EEELVANSDRMII+DSANDSTPPVAGEN TPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVT+STSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        N + ESETEKVE+T+EEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI E DMKREMERKELERKKSPKMEFVSGGTQ GGTVVTAPKINIPFSGVSAIT SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A6J1GT35 DNA ligase 1-like isoform X11.4e-21190.16Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVED EEEEEDSE+  QQ QEEGG +DY GVRV EEE   NSDRMI+++SANDSTPPV  EN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTP

Query:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN
        QPPQ VVS+SPMLLQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV V T NNL+TPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMN

Query:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        N+ILESETEKVE+T+EEEKKDIDPLDKFLPPPPK+KCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
Subjt:  NVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYY EI E DMKREMERKELERKKSPKMEFVSGGTQPGGTVV APK+NIPFSGVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GGSDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A6J1K652 DNA ligase 1 isoform X13.0e-20987.91Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVED--------QEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDK
        MASKKK+SEGIALLSMYNDEDD+MEDVED        +EEEEEDSE+H QQ Q+EGG++DY GVRV EEE   NSDRMI+++SANDSTPPV  EN TP+K
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVED--------QEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDK

Query:  LKYGSSTPQPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISE
        LK+GSSTPQPPQ VVS SPMLLQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV V T NNL+TPQISE
Subjt:  LKYGSSTPQPPQVVVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISE

Query:  SPHSGSMNNVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK
        SPHSGSMNN+ILESETEKVE+T+EEEKKDI+PLDKFLPPPPK+KCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK
Subjt:  SPHSGSMNNVILESETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK

Query:  DVFDPHGYDKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDG
        DVFDPHGYDKSDYY EI E DMKREMERKELERKKSPKMEFVSGGTQPGGTVV APK+NIPFSGVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDG
Subjt:  DVFDPHGYDKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDG

Query:  DRRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        DRRNPVISGGSDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  DRRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

SwissProt top hitse value%identityAlignment
Q02614 SAP30-binding protein3.7e-1531.15Show/hide
Query:  ETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDY
        + +++  +  E  +++ P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP      +++  ID++G+ + KD+FDPHG+ +  Y
Subjt:  ETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDY

Query:  YTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWD
        Y  + +   K EM++ E  +K+  K+EFV+ GT+ G T                 T +   S + AS A+     Q +KSKWD
Subjt:  YTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWD

Q9UHR5 SAP30-binding protein4.1e-1432.64Show/hide
Query:  ETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDY
        + +++  +  E  +++ P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP      +++  ID++G+ + KD+FDPHG+ +  Y
Subjt:  ETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDY

Query:  YTEIVEVDMKREMERKELERKKSPKMEFVSG---GTQPGGTVVT
        Y  + +   K EM++ E  +K+  K+EFV+G   GT    T  T
Subjt:  YTEIVEVDMKREMERKELERKKSPKMEFVSG---GTQPGGTVVT

Arabidopsis top hitse value%identityAlignment
AT1G29220.1 transcriptional regulator family protein2.2e-7143.95Show/hide
Query:  KQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTPQPPQV
        K+SEGIALLS+Y+DEDD  E++ED EEEEE+ E    Q + E         ++ EE+ V  ++ M      ++      GE+S   +L  G        V
Subjt:  KQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTPQPPQV

Query:  VVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMNNVILE
          SSS                            A  +P + D   +ES R  + + ++G +G  D       +  +S+ L                    
Subjt:  VVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMNNVILE

Query:  SETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDY
                           LD+FLPP P+E+CSE+LQRKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSKDVFDP GYD SD 
Subjt:  SETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDY

Query:  YTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGS--
        + + +E+DMK E ERKE E KK+ K++FVS GTQP G V  A K NIP  G+ A+  SGL S    ++   RDGR NKKSKWDKVDGD +NP ++ G+  
Subjt:  YTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGS--

Query:  --DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
           +  ++AAL+SA + GSGY AFAQQRRRE E +RSSERKL+RRS
Subjt:  --DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

AT1G29220.2 transcriptional regulator family protein1.0e-6842.79Show/hide
Query:  KQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTPQPPQV
        K+SEGIALLS+Y+DEDD  E++ED EEEEE+ E    Q + E         ++ EE+ V  ++ M      ++      GE+S   +L  G        V
Subjt:  KQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTPQPPQV

Query:  VVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMNNVILE
          SSS                            A  +P + D   +ES R  + + ++G +G  D       +  +S+ L                    
Subjt:  VVSSSPMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMNNVILE

Query:  SETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKD
                           LD+FLPP P+E+CSE+LQ            RKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSKD
Subjt:  SETEKVEKTLEEEKKDIDPLDKFLPPPPKEKCSEDLQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKD

Query:  VFDPHGYDKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGD
        VFDP GYD SD + + +E+DMK E ERKE E KK+ K++FVS GTQP G V  A K NIP  G+ A+  SGL S    ++   RDGR NKKSKWDKVDGD
Subjt:  VFDPHGYDKSDYYTEIVEVDMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGD

Query:  RRNPVISGGS----DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
         +NP ++ G+     +  ++AAL+SA + GSGY AFAQQRRRE E +RSSERKL+RRS
Subjt:  RRNPVISGGS----DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCGAAGAAGAAACAATCTGAAGGTATAGCTTTACTCTCGATGTACAATGACGAGGACGATGAGATGGAAGACGTTGAAGACCAAGAAGAAGAAGAAGAAGACAG
TGAAATGCATCCGCAGCAGATGCAAGAAGAGGGAGGACAGGAAGATTATGCTGGAGTTAGGGTTGATGAAGAAGAGTTGGTTGCGAACAGTGATAGAATGATTATCACTG
ATTCTGCCAATGATTCGACGCCGCCGGTTGCTGGTGAAAATTCGACTCCAGATAAGCTCAAATACGGTTCATCCACACCACAGCCGCCCCAGGTTGTGGTTTCATCGTCG
CCAATGCTATTACAAGCTGGGCAATTAGATAATTCTGGTAGGAGAAGGGGGACACTTGCGATAGTTGATTACGGTCATGATGAAGCCGCAATGTCTCCTGAGGCTGAGGA
TGGAGAAATTGAAGAATCTGGTCGTGTCACATTTGGCGATGAGCTTTTAGGCACTAATGGTGATTTTGATAGAACATCTCCAGGAACTGTAACGGTCTCAACATCAAACA
ATCTATCCACTCCTCAAATTTCTGAATCGCCACATTCTGGTTCAATGAACAATGTGATACTGGAATCTGAAACTGAAAAAGTTGAGAAAACTCTTGAAGAAGAGAAAAAA
GACATTGACCCCTTGGACAAGTTTCTTCCTCCTCCACCAAAAGAAAAATGCTCAGAGGACCTGCAAAGGAAAATCAATAAGTTTCTTGAGTACAAAAAAGCCGGAAAGAG
CTTCAATGCAGAAGTACGCAATAGGAAAGATTACCGAAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGATG
TGTTTGATCCTCATGGATATGATAAAAGTGACTACTATACTGAAATAGTAGAGGTTGACATGAAACGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAG
ATGGAGTTTGTTTCAGGAGGAACACAACCCGGTGGTACAGTTGTGACTGCTCCTAAAATAAATATACCTTTTTCAGGTGTTTCAGCCATCACTGGTAGTGGATTACATTC
AGCAGCTCCTGCATCTGATGCCATTCCTAGGGATGGAAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGAGATAGAAGAAATCCAGTAATTTCCGGTGGGTCAG
ATGCAGCTAGTGCCCATGCAGCTTTACTATCTGCTGCTAATGTTGGCTCTGGATACATGGCTTTTGCGCAACAGAGGCGGCGAGAGGCTGAAGAAAAAAGATCAAGTGAG
AGGAAATTGGATAGAAGATCCTAA
mRNA sequenceShow/hide mRNA sequence
CACTTTTCTTCCTTTTCATTCACTGGCTCCGACCGCCCGGGAAACTGAAAACCCTTCTCCCAGTACGCCATTGAGTCAAAATTCCCAAGCTTTCTGCATCCAATTGAAAT
CCAACTTTCTATATCTAAAGTCCTGTTTCTTTTTTGTTGAACTATTTCCTTCCGGGTGCCGAGGATCGAAGCTCTCATGGCATCGAAGAAGAAACAATCTGAAGGTATAG
CTTTACTCTCGATGTACAATGACGAGGACGATGAGATGGAAGACGTTGAAGACCAAGAAGAAGAAGAAGAAGACAGTGAAATGCATCCGCAGCAGATGCAAGAAGAGGGA
GGACAGGAAGATTATGCTGGAGTTAGGGTTGATGAAGAAGAGTTGGTTGCGAACAGTGATAGAATGATTATCACTGATTCTGCCAATGATTCGACGCCGCCGGTTGCTGG
TGAAAATTCGACTCCAGATAAGCTCAAATACGGTTCATCCACACCACAGCCGCCCCAGGTTGTGGTTTCATCGTCGCCAATGCTATTACAAGCTGGGCAATTAGATAATT
CTGGTAGGAGAAGGGGGACACTTGCGATAGTTGATTACGGTCATGATGAAGCCGCAATGTCTCCTGAGGCTGAGGATGGAGAAATTGAAGAATCTGGTCGTGTCACATTT
GGCGATGAGCTTTTAGGCACTAATGGTGATTTTGATAGAACATCTCCAGGAACTGTAACGGTCTCAACATCAAACAATCTATCCACTCCTCAAATTTCTGAATCGCCACA
TTCTGGTTCAATGAACAATGTGATACTGGAATCTGAAACTGAAAAAGTTGAGAAAACTCTTGAAGAAGAGAAAAAAGACATTGACCCCTTGGACAAGTTTCTTCCTCCTC
CACCAAAAGAAAAATGCTCAGAGGACCTGCAAAGGAAAATCAATAAGTTTCTTGAGTACAAAAAAGCCGGAAAGAGCTTCAATGCAGAAGTACGCAATAGGAAAGATTAC
CGAAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGATGTGTTTGATCCTCATGGATATGATAAAAGTGACTA
CTATACTGAAATAGTAGAGGTTGACATGAAACGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATGGAGTTTGTTTCAGGAGGAACACAACCCGGTG
GTACAGTTGTGACTGCTCCTAAAATAAATATACCTTTTTCAGGTGTTTCAGCCATCACTGGTAGTGGATTACATTCAGCAGCTCCTGCATCTGATGCCATTCCTAGGGAT
GGAAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGAGATAGAAGAAATCCAGTAATTTCCGGTGGGTCAGATGCAGCTAGTGCCCATGCAGCTTTACTATCTGC
TGCTAATGTTGGCTCTGGATACATGGCTTTTGCGCAACAGAGGCGGCGAGAGGCTGAAGAAAAAAGATCAAGTGAGAGGAAATTGGATAGAAGATCCTAAGAGCAATGAA
TTCTGTTCCATAGTATTAAGTATTGAACCATTTTGAAAAGCAATGAAAATGGCTTGTAGCTTCGTATCTGTGACTAACCATGTATACGGTCAGAATGAAAATGTAATTCT
TCAGTATTAGTTCCCTCTTGAAAGTGTATTATTTATTGCCCATAAACTCATTTTTTTTTCCCATTAAATTTCTTGACTTGTAAAGATGATCCAGGGGGATAACAAATTAC
AGCAGAGCGAAGGAGGAGGCTCCTTTTTCTGTAAGTTTCATGAGAGTAAGAAAATTGACTAC
Protein sequenceShow/hide protein sequence
MASKKKQSEGIALLSMYNDEDDEMEDVEDQEEEEEDSEMHPQQMQEEGGQEDYAGVRVDEEELVANSDRMIITDSANDSTPPVAGENSTPDKLKYGSSTPQPPQVVVSSS
PMLLQAGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTVSTSNNLSTPQISESPHSGSMNNVILESETEKVEKTLEEEKK
DIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDYYTEIVEVDMKREMERKELERKKSPK
MEFVSGGTQPGGTVVTAPKINIPFSGVSAITGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSE
RKLDRRS