; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc12g0330331 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc12g0330331
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionSAP30-binding protein-like
Genome locationCMiso1.1chr12:21336621..21343148
RNA-Seq ExpressionCmc12g0330331
SyntenyCmc12g0330331
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
InterPro domainsIPR012479 - SAP30-binding protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8653213.1 hypothetical protein Csa_019629 [Cucumis sativus]1.8e-22997.05Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDL EEEEDGELHPQQM+EEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVA ENLTPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPMVLQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV ISTSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N MPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQPGGTVVTAPKINIPFSGVSAIT+SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ++
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER

XP_004150215.1 uncharacterized protein LOC101206323 [Cucumis sativus]4.7e-23397.53Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDL EEEEDGELHPQQM+EEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVA ENLTPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPMVLQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV ISTSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N MPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQPGGTVVTAPKINIPFSGVSAIT+SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_008443368.1 PREDICTED: uncharacterized protein LOC103486971 [Cucumis melo]2.4e-23799.1Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQE GGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVA ENLTPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ GGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_038894985.1 uncharacterized protein LOC120083338 isoform X1 [Benincasa hispida]2.3e-22494.41Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED   EEED ELHPQQMQEEGGEEDYAGVRVAEEELV NSDRMIISDSAN STPPVASEN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPM LQ GQ DNSGRRRGT+ IVDYGHDE AMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVT+STSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N + ESETEKVE+TVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEI-EADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI EADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPK+NIPFSGVSAIT SGLHSAAPASD IPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEI-EADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GG DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_038894986.1 uncharacterized protein LOC120083338 isoform X2 [Benincasa hispida]9.4e-22694.62Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED   EEED ELHPQQMQEEGGEEDYAGVRVAEEELV NSDRMIISDSAN STPPVASEN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPM LQ GQ DNSGRRRGT+ IVDYGHDE AMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVT+STSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N + ESETEKVE+TVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPK+NIPFSGVSAIT SGLHSAAPASD IPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        G DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

TrEMBL top hitse value%identityAlignment
A0A0A0LX73 Uncharacterized protein8.9e-23097.05Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDL EEEEDGELHPQQM+EEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVA ENLTPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPQVVVSSSPMVLQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV ISTSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N MPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQPGGTVVTAPKINIPFSGVSAIT+SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ++
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER

A0A1S3B7X1 uncharacterized protein LOC1034869711.2e-23799.1Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQE GGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVA ENLTPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ GGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A5A7UPK6 SAP30-binding protein-like1.2e-23799.1Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQE GGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVA ENLTPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ GGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A6J1GT35 DNA ligase 1-like isoform X17.6e-21390.13Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVED+EEEEED EL  QQ QEEGG++DY GVRVAEEE   NSDRMI+S+SANDSTPPV  EN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTP

Query:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPQ VVS+SPM+LQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV + T NNL+TPQISESPHSGSMN
Subjt:  QPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N + ESETEKVEETVEEEKKDIDPLDKFLPPPPK+KCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVV APK+NIPFSGVSAI  SGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A6J1K652 DNA ligase 1 isoform X17.8e-21088.11Show/hide
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDL--------EEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDK
        MASKKK+SEGIALLSMYNDEDD+MEDVED+        EEEEED ELH QQ Q+EGGE+DY GVRVAEEE   NSDRMI+S+SANDSTPPV  EN TP+K
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDL--------EEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDK

Query:  LKYGSSTPQPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISE
        LK+GSSTPQPPQ VVS SPM+LQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV + T NNL+TPQISE
Subjt:  LKYGSSTPQPPQVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISE

Query:  SPHSGSMNNGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK
        SPHSGSMNN + ESETEKVEETVEEEKKDI+PLDKFLPPPPK+KCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK
Subjt:  SPHSGSMNNGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK

Query:  EVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGD
        +VFDPHGYDKSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVV APK+NIPFSGVSAI  SGLHSAA ASDAIPRDGRQNKKSKWDKVDGD
Subjt:  EVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGD

Query:  RRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        RRNPVISGGSDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  RRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

SwissProt top hitse value%identityAlignment
Q02614 SAP30-binding protein6.3e-1531.02Show/hide
Query:  ESPHSGSMNNGMPESETEKVE-----ETVEEEKKD--------------IDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNP
        E     S  +   +SETEK E     +  E EK+D              + P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP
Subjt:  ESPHSGSMNNGMPESETEKVE-----ETVEEEKKD--------------IDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNP

Query:  DFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPAS
              +++  ID++G+ + K++FDPHG+ +  YY  +    K EM++ E  +K+  K+EFV+ GT+ G T                 T++   S + AS
Subjt:  DFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPAS

Query:  DAIPRDGRQNKKSKWD
         A+     Q +KSKWD
Subjt:  DAIPRDGRQNKKSKWD

Q9UHR5 SAP30-binding protein1.4e-1430.56Show/hide
Query:  ESPHSGSMNNGMPESETEKVE-----ETVEEEKKD--------------IDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNP
        E     S  +   +SETEK E     +  E EK+D              + P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP
Subjt:  ESPHSGSMNNGMPESETEKVE-----ETVEEEKKD--------------IDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNP

Query:  DFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPAS
              +++  ID++G+ + K++FDPHG+ +  YY  +    K EM++ E  +K+  K+EFV+ GT+ G T           +  S  T++   + A A 
Subjt:  DFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPAS

Query:  DAIPRDGRQNKKSKWD
                Q +KSKWD
Subjt:  DAIPRDGRQNKKSKWD

Arabidopsis top hitse value%identityAlignment
AT1G29220.1 transcriptional regulator family protein1.3e-7143.37Show/hide
Query:  KQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTPQPPQV
        K+SEGIALLS+Y+DEDD  E++ED EEEEE+ E    Q + E         ++ EE+ V  ++ M   +                      S TP+    
Subjt:  KQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTPQPPQV

Query:  VVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPE
        V +SS        LDN                             +ES R  + + ++G +G  D       +  +S+ L                    
Subjt:  VVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPE

Query:  SETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDY
                           LD+FLPP P+E+CSE+LQRKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSK+VFDP GYD SD+
Subjt:  SETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDY

Query:  YTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGS---
           IE DMK E ERKE E KK+ K++FVS GTQP G V  A K NIP  G+ A+ +SGL S    ++   RDGR NKKSKWDKVDGD +NP ++ G+   
Subjt:  YTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGS---

Query:  -DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
          +  ++AAL+SA + GSGY AFAQQRRRE E +RSSERKL+RRS
Subjt:  -DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

AT1G29220.2 transcriptional regulator family protein6.0e-6942.23Show/hide
Query:  KQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTPQPPQV
        K+SEGIALLS+Y+DEDD  E++ED EEEEE+ E    Q + E         ++ EE+ V  ++ M   +                      S TP+    
Subjt:  KQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTPQPPQV

Query:  VVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPE
        V +SS        LDN                             +ES R  + + ++G +G  D       +  +S+ L                    
Subjt:  VVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPE

Query:  SETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKE
                           LD+FLPP P+E+CSE+LQ            RKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSK+
Subjt:  SETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKE

Query:  VFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDR
        VFDP GYD SD+   IE DMK E ERKE E KK+ K++FVS GTQP G V  A K NIP  G+ A+ +SGL S    ++   RDGR NKKSKWDKVDGD 
Subjt:  VFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDR

Query:  RNPVISGGS----DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        +NP ++ G+     +  ++AAL+SA + GSGY AFAQQRRRE E +RSSERKL+RRS
Subjt:  RNPVISGGS----DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCCAAGAAGAAACAATCTGAAGGAATAGCTTTACTCTCGATGTACAATGATGAGGATGATGAAATGGAAGACGTTGAAGACCTAGAAGAAGAAGAAGAAGATGG
TGAGCTCCATCCGCAACAGATGCAAGAAGAGGGAGGAGAGGAAGATTATGCTGGAGTTAGGGTTGCAGAAGAAGAGTTGGTTGCAAACAGTGATAGAATGATTATCAGTG
ATTCTGCTAATGATTCGACGCCACCGGTTGCTAGTGAAAATTTGACTCCAGATAAGCTCAAATACGGGTCATCCACACCGCAGCCACCCCAGGTTGTGGTTTCATCGTCG
CCAATGGTATTACAGACTGGGCAATTAGATAATTCTGGTAGGAGAAGGGGGACACTTGCGATAGTTGATTACGGTCATGATGAAGCCGCAATGTCTCCTGAGGCTGAGGA
TGGAGAAATTGAGGAATCTGGTCGTGTCACATTTGGTGACGAGCTTTTAGGCACTAATGGTGATTTTGATAGAACATCTCCCGGAACTGTAACGATCTCAACATCAAACA
ATCTATCCACTCCTCAAATTTCTGAATCGCCACATTCTGGTTCAATGAACAATGGGATGCCAGAATCTGAAACTGAAAAAGTTGAGGAAACTGTTGAAGAAGAGAAAAAA
GATATTGATCCCTTGGACAAGTTTCTTCCTCCTCCACCAAAAGAAAAATGCTCAGAGGATCTGCAAAGGAAAATCAATAAGTTTCTCGAGTATAAGAAAGCTGGAAAAAG
CTTCAATGCAGAAGTACGGAATAGGAAGGACTACCGGAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGAAG
TGTTTGACCCTCATGGATATGATAAAAGTGACTACTATACGGAAATAGAGGCCGACATGAAACGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATG
GAGTTTGTTTCAGGAGGAACCCAACCTGGTGGTACAGTTGTTACTGCTCCTAAAATAAATATACCTTTTTCAGGTGTTTCAGCTATCACAAGTAGTGGATTGCATTCAGC
AGCTCCTGCATCTGATGCCATTCCTAGGGATGGAAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGAGATAGAAGGAATCCAGTAATTTCTGGCGGGTCAGATG
CAGCTAGTGCCCATGCAGCTTTACTATCTGCTGCTAATGTTGGCTCTGGATACATGGCTTTTGCGCAACAAAGACGGCGAGAGGCTGAAGAAAAAAGATCCAGCGAGAGG
AAATTGGATAGAAGATCATAA
mRNA sequenceShow/hide mRNA sequence
GAAACAGTTCAAACAGTTGAAAGCCTCATTTCAATATCATCCGACTGTGGAGCTTCGTCGGGGCGGTTGAGTTTTCTCCAATTTTCTTCAATTTTCTTCGTTTTAATTCA
ATTGGAGACTGAAAACCCTTCTTCCCACTTCCCAGTATCGCCATTGATTCAAAATCCCAATCGTTCTGCATTCAATTGGCATCCAATTTTCGATATCTGAAGTATCGTTT
CTTTCTTATTCAACTGTTTCCTTTCGGGTGCTGAGGATCGAAGCTCTTATGGCATCCAAGAAGAAACAATCTGAAGGAATAGCTTTACTCTCGATGTACAATGATGAGGA
TGATGAAATGGAAGACGTTGAAGACCTAGAAGAAGAAGAAGAAGATGGTGAGCTCCATCCGCAACAGATGCAAGAAGAGGGAGGAGAGGAAGATTATGCTGGAGTTAGGG
TTGCAGAAGAAGAGTTGGTTGCAAACAGTGATAGAATGATTATCAGTGATTCTGCTAATGATTCGACGCCACCGGTTGCTAGTGAAAATTTGACTCCAGATAAGCTCAAA
TACGGGTCATCCACACCGCAGCCACCCCAGGTTGTGGTTTCATCGTCGCCAATGGTATTACAGACTGGGCAATTAGATAATTCTGGTAGGAGAAGGGGGACACTTGCGAT
AGTTGATTACGGTCATGATGAAGCCGCAATGTCTCCTGAGGCTGAGGATGGAGAAATTGAGGAATCTGGTCGTGTCACATTTGGTGACGAGCTTTTAGGCACTAATGGTG
ATTTTGATAGAACATCTCCCGGAACTGTAACGATCTCAACATCAAACAATCTATCCACTCCTCAAATTTCTGAATCGCCACATTCTGGTTCAATGAACAATGGGATGCCA
GAATCTGAAACTGAAAAAGTTGAGGAAACTGTTGAAGAAGAGAAAAAAGATATTGATCCCTTGGACAAGTTTCTTCCTCCTCCACCAAAAGAAAAATGCTCAGAGGATCT
GCAAAGGAAAATCAATAAGTTTCTCGAGTATAAGAAAGCTGGAAAAAGCTTCAATGCAGAAGTACGGAATAGGAAGGACTACCGGAATCCAGATTTCTTGTTACATGCTG
TGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGAAGTGTTTGACCCTCATGGATATGATAAAAGTGACTACTATACGGAAATAGAGGCCGACATGAAA
CGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATGGAGTTTGTTTCAGGAGGAACCCAACCTGGTGGTACAGTTGTTACTGCTCCTAAAATAAATAT
ACCTTTTTCAGGTGTTTCAGCTATCACAAGTAGTGGATTGCATTCAGCAGCTCCTGCATCTGATGCCATTCCTAGGGATGGAAGACAAAACAAAAAATCAAAATGGGATA
AGGTAGATGGAGATAGAAGGAATCCAGTAATTTCTGGCGGGTCAGATGCAGCTAGTGCCCATGCAGCTTTACTATCTGCTGCTAATGTTGGCTCTGGATACATGGCTTTT
GCGCAACAAAGACGGCGAGAGGCTGAAGAAAAAAGATCCAGCGAGAGGAAATTGGATAGAAGATCATAAAAGCAAATAAGTTCTGTTCCATAGTTTTAAGTATCAAACGA
TTTTGAAAAGGAAGGGAAATGGCTTGTAGCTTCGTATCTTTGACTAACCATGTATACGGTCAGAACAAAAATATGATCAGGGGGAAAAGTGGAAAACAAATTGCACCAGA
CCGAAGGAGGAGGCTCCTTTTTCTGTGGTGATTGCATATTTTCAACACAAGTACGTGTGGGTCAGCTTGAAAAGGTAAGTCATTGGCACCCGAAAAATGGTTTGAAGTTC
CTTTTTTTTTTCTTTTCCCTCCACTCGAGAGAATATGCAATGGGATTCTTTGGATTGGAAGGCCAAAGACTGTTTATTCTCCTCTCATTTTCTAACCAACAATACTTTGG
TGCTTCTCATGAGTCCATATTCGTGCATTCTACTTTAATAAAAGAAAAGAAGTATTTTCAAAGAAAAGTAAGAAGAGTTGACATGGAAAAACTACGAATAATTAGGTAGA
TGCATGAAAATGAATGCAGCTTAGGATGCACCTAAAACTTATTTTTTATGTGGGTAGACCAATATATCCTTACAAAACAAAAAAGTTTGACATCGATGGTAGAGATTAAT
TCAGTATTTTTGACATAAAAAATTATCATCCATTAATCTCAATATAAATATGATTGACAAATAAAAATAACTGAATATACTACTTCATCAAGTACATAGATGTTTTAAAT
CTTTGGATGCTCTAGGATGTTTCAGTTTTGTGAGTGTGTAGTAAAAGACCAAAATACACACCTGCTTTGTCGATGGTGGAAATGGATTTATATGTTTTTGCACCCTTTAA
TTTGCTTTTTGGTTAAAACTGCTTTTGTTGGTAGTTGCTTTTGGAATAGAACACAAGTTTTCTTCTTTTGAGAGAGAGAAAGGAAGAGATGCATGTGAAGGTAAGGTAAA
AGAGTTGTCCCATTGCCATGGTTGCCTTTGCATGGATAAATTAATGCACGCTTTCACCTCAGTTGAGACCAGAGAAAAGGGTGACGGAGTTTCCGGTGCTTCTTATTATT
TTTATGCATCACCTTTCACCTTTGTGTAGTTATACATCATCTATCATCCTTTTATATTGGCCACATAAAAATAACCCCTCAAAGTTTCATCTTTTATGGTCACTATCCTT
TTTTATAAATTGTTATAACAGTCTCTCTTGATCCCTCCCGATCAAAATCGACCGTGAGATTTGATTAAAGTATTTAAATTTCTTTCCTTATAACCTCAATGCAATCTCCA
ACTTTCCAAAGAACACTTTCCAAGAATTTGTAATTATTTGGATAAAAAATTAAACTCTACATGGGCAAAGTAATAGAAAAATACAATATTTTCCCACATCTCCT
Protein sequenceShow/hide protein sequence
MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEEGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVASENLTPDKLKYGSSTPQPPQVVVSSS
PMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPESETEKVEETVEEEKK
DIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKM
EFVSGGTQPGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER
KLDRRS