CuGenDBv2

IVF0008463 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0008463
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: SAP30-binding protein-like
Genome location: chr12:20634288..20639413
RNA-Seq Expression: IVF0008463
Synteny: IVF0008463
Gene Ontology terms: GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016874 - ligase activity (molecular function)
InterPro domains: IPR012479 - SAP30-binding protein
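
The genome location above uses a `chrom:start..end` layout. A minimal sketch of how such a string could be split into its parts follows; the function names are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr12:20634288..20639413")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 5126 bp on chr12.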


Homology
GenBank top hits    e value    %identity    Alignment
KAE8653213.1 hypothetical protein Csa_019629 [Cucumis sativus]    5.58e-304    96.72
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEE DGELHPQQM+E GGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPMVLQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV ISTSNNLSTPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N MPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQ GGTVVTAPKINIPFSGVSAIT+SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQGEKWKTNCTRPKEEAPFS
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS+DQGEKWK NCTR KEEAPFS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQGEKWKTNCTRPKEEAPFS

XP_004150215.1 uncharacterized protein LOC101206323 [Cucumis sativus]    3.76e-290    96.81
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEE DGELHPQQM+E GGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPMVLQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV ISTSNNLSTPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N MPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQ GGTVVTAPKINIPFSGVSAIT+SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS +
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND

XP_008443368.1 PREDICTED: uncharacterized protein LOC103486971 [Cucumis melo]    1.51e-301    99.54
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS++
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND

XP_038894985.1 uncharacterized protein LOC120083338 isoform X1 [Benincasa hispida]    6.48e-277    92.95
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED EEE  D ELHPQQMQE GGEEDYAGVRVAEEELV NSDRMIISDSAN STPPVA EN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPM LQ GQ DNSGRRRGT+ IVDYGHDE AMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVT+STSNNLSTPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N + ESETEKVE+TVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEI-EADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS
        DKSDYYTEI EADMKREMERKELERKKSPKMEFVSGGTQ GGTVVTAPK+NIPFSGVSAIT SGLHSAAPASD IPRDGRQNKKSKWDKVDGDRRNPVIS
Subjt:  DKSDYYTEI-EADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS

Query:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND
        GG DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS++
Subjt:  GGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND

XP_038894986.1 uncharacterized protein LOC120083338 isoform X2 [Benincasa hispida]    9.33e-279    93.17
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVED EEE  D ELHPQQMQE GGEEDYAGVRVAEEELV NSDRMIISDSAN STPPVA EN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPM LQ GQ DNSGRRRGT+ IVDYGHDE AMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVT+STSNNLSTPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N + ESETEKVE+TVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ GGTVVTAPK+NIPFSGVSAIT SGLHSAAPASD IPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND
        G DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS++
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND

TrEMBL top hits    e value    %identity    Alignment
A0A0A0LX73 Uncharacterized protein    0.0e+00    89.8
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDL EEEEDGELHPQQM+E GGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP VVVSSSPMVLQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV ISTSNNLSTPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N MPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQ GGTVVTAPKINIPFSGVSAIT+SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQGEKWKTNCTRPKEEAPFSYSGFPPMSKKGKSEPLKEIRRCTKSMSYNHQVGHIETVTFLP
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS+DQGEKWK NCTR KEEAPFSYSGF PMSKKGKSEPLKEIRRCTKSMSYNHQVGHIETVTFLP
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQGEKWKTNCTRPKEEAPFSYSGFPPMSKKGKSEPLKEIRRCTKSMSYNHQVGHIETVTFLP

Query:  YSFMPCPCPFPHASRSSSTLQTAKSPSNYEPHNQIKLLDFRLSSLPLCFL------------------------------------------QNYDVDLE
        YSFMPCPCPFPHAS SSSTLQTAK+PSNYEPHNQIKLLDFRLSSL LCFL                                          QNYDVDLE
Subjt:  YSFMPCPCPFPHASRSSSTLQTAKSPSNYEPHNQIKLLDFRLSSLPLCFL------------------------------------------QNYDVDLE

Query:  KGILCFTSKQRRSKGSNEKDGSVYNYNVVESDQSLLFFLQVVIAYFQHKYVWVSLKR
        KGILCFTSKQR  +GSN+KDG VYNYNVVESDQSLLFFLQVV+AYFQHKYVWVSLKR
Subjt:  KGILCFTSKQRRSKGSNEKDGSVYNYNVVESDQSLLFFLQVVIAYFQHKYVWVSLKR

A0A1S3B7X1 uncharacterized protein LOC103486971    5.5e-238    99.32
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS+++
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ

A0A5A7UPK6 SAP30-binding protein-like    5.5e-238    99.32
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS+++
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ

A0A6J1GT35 DNA ligase 1-like isoform X1    1.3e-207    88.64
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVED+EEEEED EL  QQ QE GG++DY GVRVAEEE   NSDRMI+S+SANDSTPPV  EN TPDKLK+GSSTP
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTP

Query:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN
        QPP  VVS+SPM+LQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV + T NNL+TPQISESPHSGSMN
Subjt:  QPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMN

Query:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY
        N + ESETEKVEETVEEEKKDIDPLDKFLPPPPK+KCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  NGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
        DKSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQ GGTVV APK+NIPFSGVSAI  SGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVISG
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISG

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ
        GSDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRS+++
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ

A0A6J1K652 DNA ligase 1 isoform X1    1.4e-204    86.61
Query:  MASKKKQSEGIALLSMYNDEDDEMEDVEDL--------EEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDK
        MASKKK+SEGIALLSMYNDEDD+MEDVED+        EEEEED ELH QQ Q+ GGE+DY GVRVAEEE   NSDRMI+S+SANDSTPPV  EN TP+K
Subjt:  MASKKKQSEGIALLSMYNDEDDEMEDVEDL--------EEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDK

Query:  LKYGSSTPQPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISE
        LK+GSSTPQPP  VVS SPM+LQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV + T NNL+TPQISE
Subjt:  LKYGSSTPQPPHVVVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISE

Query:  SPHSGSMNNGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK
        SPHSGSMNN + ESETEKVEETVEEEKKDI+PLDKFLPPPPK+KCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK
Subjt:  SPHSGSMNNGMPESETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK

Query:  EVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGD
        +VFDPHGYDKSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQ GGTVV APK+NIPFSGVSAI  SGLHSAA ASDAIPRDGRQNKKSKWDKVDGD
Subjt:  EVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGD

Query:  RRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ
        RRNPVISGGSDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRS+++
Subjt:  RRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ

SwissProt top hits    e value    %identity    Alignment
Q02614 SAP30-binding protein    8.7e-15    31.02
Query:  ESPHSGSMNNGMPESETEKVE-----ETVEEEKKD--------------IDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNP
        E     S  +   +SETEK E     +  E EK+D              + P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP
Subjt:  ESPHSGSMNNGMPESETEKVE-----ETVEEEKKD--------------IDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNP

Query:  DFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPAS
              +++  ID++G+ + K++FDPHG+ +  YY  +    K EM++ E  +K+  K+EFV+ GT+ G T                 T++   S + AS
Subjt:  DFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPAS

Query:  DAIPRDGRQNKKSKWD
         A+     Q +KSKWD
Subjt:  DAIPRDGRQNKKSKWD

Q9UHR5 SAP30-binding protein    1.9e-14    30.56
Query:  ESPHSGSMNNGMPESETEKVE-----ETVEEEKKD--------------IDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNP
        E     S  +   +SETEK E     +  E EK+D              + P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP
Subjt:  ESPHSGSMNNGMPESETEKVE-----ETVEEEKKD--------------IDPLDKFLPPPPKEKCSEDLQRKINKFLEYK-KAGKSFNAEVRNRKDYRNP

Query:  DFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPAS
              +++  ID++G+ + K++FDPHG+ +  YY  +    K EM++ E  +K+  K+EFV+ GT+ G T           +  S  T++   + A A 
Subjt:  DFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPAS

Query:  DAIPRDGRQNKKSKWD
                Q +KSKWD
Subjt:  DAIPRDGRQNKKSKWD

Arabidopsis top hits    e value    %identity    Alignment
AT1G29220.1 transcriptional regulator family protein    7.0e-68    42.37
Query:  KQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTPQPPHV
        K+SEGIALLS+Y+DEDD  E++ED EEEEE+ E   Q+ QE             E E +   D++  ++  ++      GE+         S TP+    
Subjt:  KQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTPQPPHV

Query:  VVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPE
        V +SS        LDN                             +ES R  + + ++G +G  D       +  +S+ L                    
Subjt:  VVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPE

Query:  SETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDY
                           LD+FLPP P+E+CSE+LQRKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSK+VFDP GYD SD+
Subjt:  SETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDY

Query:  YTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGS---
           IE DMK E ERKE E KK+ K++FVS GTQ  G V  A K NIP  G+ A+ +SGL S    ++   RDGR NKKSKWDKVDGD +NP ++ G+   
Subjt:  YTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGS---

Query:  -DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ
          +  ++AAL+SA + GSGY AFAQQRRRE E +RS+++
Subjt:  -DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ

AT1G29220.2 transcriptional regulator family protein    3.2e-65    41.24
Query:  KQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTPQPPHV
        K+SEGIALLS+Y+DEDD  E++ED EEEEE+ E   Q+ QE             E E +   D++  ++  ++      GE+         S TP+    
Subjt:  KQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTPQPPHV

Query:  VVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPE
        V +SS        LDN                             +ES R  + + ++G +G  D       +  +S+ L                    
Subjt:  VVSSSPMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPE

Query:  SETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKE
                           LD+FLPP P+E+CSE+LQ            RKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSK+
Subjt:  SETEKVEETVEEEKKDIDPLDKFLPPPPKEKCSEDLQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKE

Query:  VFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDR
        VFDP GYD SD+   IE DMK E ERKE E KK+ K++FVS GTQ  G V  A K NIP  G+ A+ +SGL S    ++   RDGR NKKSKWDKVDGD 
Subjt:  VFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDR

Query:  RNPVISGGS----DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ
        +NP ++ G+     +  ++AAL+SA + GSGY AFAQQRRRE E +RS+++
Subjt:  RNPVISGGS----DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ
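
The alignment blocks above follow the usual BLAST pairwise layout: a Query row, a middle row, and a Subjt row. In that layout the middle row carries a letter where residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below estimates percent identity for a single block from the middle row alone. The Subjt rows in this listing appear to repeat the query residues, so they are not used. The function name is hypothetical, and the figure for one block will generally differ from the whole-alignment %identity given with each hit.

```python
def identity_from_midline(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue.

    Assumes the standard BLAST midline convention: letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if not query:
        raise ValueError("empty query row")
    # Trailing spaces are often stripped from the midline; pad it back out.
    midline = midline.ljust(len(query))[:len(query)]
    matches = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * matches / len(query)

# Last block of the XP_008443368.1 hit, copied from the listing above.
query_row = "GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSND"
mid_row   = "GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS++"
print(round(identity_from_midline(query_row, mid_row), 2))
```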


Sequences
CDS sequence
ATGGCATCCAAGAAGAAACAATCTGAAGGAATAGCTTTACTCTCGATGTACAATGATGAGGATGATGAAATGGAAGACGTTGAAGACCTAGAAGAAGAAGAAGAAGATGG
TGAGCTCCATCCGCAACAGATGCAAGAAGGGGGAGGAGAGGAAGATTATGCTGGAGTTAGGGTTGCAGAAGAAGAGTTGGTTGCAAACAGTGATAGAATGATTATCAGTG
ATTCTGCTAATGATTCGACGCCACCGGTTGCTGGTGAAAATTTGACTCCAGATAAGCTCAAATACGGGTCATCCACACCGCAGCCACCCCATGTTGTGGTTTCATCGTCG
CCAATGGTATTACAGACTGGGCAATTAGATAATTCTGGTAGGAGAAGGGGGACACTTGCGATAGTTGATTATGGTCATGATGAAGCCGCAATGTCTCCTGAGGCTGAGGA
TGGAGAAATTGAGGAATCTGGTCGTGTCACATTTGGTGACGAGCTTTTAGGCACTAATGGTGATTTTGATAGAACATCTCCCGGAACTGTAACGATCTCAACATCAAACA
ATCTATCCACTCCTCAAATTTCTGAATCGCCACATTCTGGTTCAATGAACAATGGGATGCCAGAATCTGAAACTGAAAAAGTTGAGGAAACTGTTGAAGAAGAGAAAAAA
GATATTGATCCCTTGGACAAGTTTCTTCCTCCTCCACCAAAAGAAAAATGCTCAGAGGATCTGCAAAGGAAAATCAATAAGTTTCTCGAGTATAAGAAAGCTGGAAAAAG
CTTCAATGCAGAAGTACGGAATAGGAAGGACTACCGGAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGAAG
TGTTTGACCCTCATGGATATGATAAAAGTGACTACTATACGGAAATAGAGGCCGACATGAAACGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATG
GAGTTTGTTTCAGGAGGAACCCAATCTGGTGGTACAGTTGTTACTGCTCCTAAAATAAATATACCTTTTTCAGGTGTTTCAGCTATCACAAGTAGTGGATTGCATTCAGC
AGCTCCTGCATCTGATGCCATTCCTAGGGATGGAAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGAGATAGAAGGAATCCAGTAATTTCTGGCGGGTCAGATG
CAGCTAGTGCCCATGCAGCTTTACTATCTGCTGCTAATGTTGGCTCTGGATACATGGCTTTTGCGCAACAAAGACGGCGAGAGGCTGAAGAAAAAAGATCCAATGATCAG
GGGGAAAAGTGGAAAACAAATTGCACCAGACCGAAGGAGGAGGCTCCTTTTTCTTATAGTGGATTCCCTCCTATGAGTAAAAAGGGTAAATCAGAACCCCTAAAGGAGAT
ACGTAGATGTACAAAATCCATGTCTTATAACCATCAAGTTGGCCATATTGAGACAGTGACGTTTTTGCCTTACAGCTTCATGCCCTGTCCCTGTCCTTTTCCCCACGCCT
CTCGATCTTCTTCAACCCTACAAACAGCCAAAAGCCCAAGCAATTATGAACCCCACAATCAAATCAAGCTGCTTGATTTTAGGCTGTCTTCACTTCCTCTTTGTTTTCTA
CAGAATTATGACGTAGATTTGGAAAAAGGAATCTTGTGTTTTACATCGAAACAAAGAAGATCCAAAGGCAGTAATGAGAAAGATGGGTCGGTTTACAATTATAATGTTGT
TGAGTCCGATCAATCCCTCCTTTTCTTCTTGCAGGTGGTGATTGCATATTTTCAACACAAGTACGTGTGGGTCAGCTTGAAAAGGTAA
mRNA sequence
ATGGCATCCAAGAAGAAACAATCTGAAGGAATAGCTTTACTCTCGATGTACAATGATGAGGATGATGAAATGGAAGACGTTGAAGACCTAGAAGAAGAAGAAGAAGATGG
TGAGCTCCATCCGCAACAGATGCAAGAAGGGGGAGGAGAGGAAGATTATGCTGGAGTTAGGGTTGCAGAAGAAGAGTTGGTTGCAAACAGTGATAGAATGATTATCAGTG
ATTCTGCTAATGATTCGACGCCACCGGTTGCTGGTGAAAATTTGACTCCAGATAAGCTCAAATACGGGTCATCCACACCGCAGCCACCCCATGTTGTGGTTTCATCGTCG
CCAATGGTATTACAGACTGGGCAATTAGATAATTCTGGTAGGAGAAGGGGGACACTTGCGATAGTTGATTATGGTCATGATGAAGCCGCAATGTCTCCTGAGGCTGAGGA
TGGAGAAATTGAGGAATCTGGTCGTGTCACATTTGGTGACGAGCTTTTAGGCACTAATGGTGATTTTGATAGAACATCTCCCGGAACTGTAACGATCTCAACATCAAACA
ATCTATCCACTCCTCAAATTTCTGAATCGCCACATTCTGGTTCAATGAACAATGGGATGCCAGAATCTGAAACTGAAAAAGTTGAGGAAACTGTTGAAGAAGAGAAAAAA
GATATTGATCCCTTGGACAAGTTTCTTCCTCCTCCACCAAAAGAAAAATGCTCAGAGGATCTGCAAAGGAAAATCAATAAGTTTCTCGAGTATAAGAAAGCTGGAAAAAG
CTTCAATGCAGAAGTACGGAATAGGAAGGACTACCGGAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGAAG
TGTTTGACCCTCATGGATATGATAAAAGTGACTACTATACGGAAATAGAGGCCGACATGAAACGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATG
GAGTTTGTTTCAGGAGGAACCCAATCTGGTGGTACAGTTGTTACTGCTCCTAAAATAAATATACCTTTTTCAGGTGTTTCAGCTATCACAAGTAGTGGATTGCATTCAGC
AGCTCCTGCATCTGATGCCATTCCTAGGGATGGAAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGAGATAGAAGGAATCCAGTAATTTCTGGCGGGTCAGATG
CAGCTAGTGCCCATGCAGCTTTACTATCTGCTGCTAATGTTGGCTCTGGATACATGGCTTTTGCGCAACAAAGACGGCGAGAGGCTGAAGAAAAAAGATCCAATGATCAG
GGGGAAAAGTGGAAAACAAATTGCACCAGACCGAAGGAGGAGGCTCCTTTTTCTTATAGTGGATTCCCTCCTATGAGTAAAAAGGGTAAATCAGAACCCCTAAAGGAGAT
ACGTAGATGTACAAAATCCATGTCTTATAACCATCAAGTTGGCCATATTGAGACAGTGACGTTTTTGCCTTACAGCTTCATGCCCTGTCCCTGTCCTTTTCCCCACGCCT
CTCGATCTTCTTCAACCCTACAAACAGCCAAAAGCCCAAGCAATTATGAACCCCACAATCAAATCAAGCTGCTTGATTTTAGGCTGTCTTCACTTCCTCTTTGTTTTCTA
CAGAATTATGACGTAGATTTGGAAAAAGGAATCTTGTGTTTTACATCGAAACAAAGAAGATCCAAAGGCAGTAATGAGAAAGATGGGTCGGTTTACAATTATAATGTTGT
TGAGTCCGATCAATCCCTCCTTTTCTTCTTGCAGGTGGTGATTGCATATTTTCAACACAAGTACGTGTGGGTCAGCTTGAAAAGGTAA
Protein sequence
MASKKKQSEGIALLSMYNDEDDEMEDVEDLEEEEEDGELHPQQMQEGGGEEDYAGVRVAEEELVANSDRMIISDSANDSTPPVAGENLTPDKLKYGSSTPQPPHVVVSSS
PMVLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVTISTSNNLSTPQISESPHSGSMNNGMPESETEKVEETVEEEKK
DIDPLDKFLPPPPKEKCSEDLQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKEVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKM
EFVSGGTQSGGTVVTAPKINIPFSGVSAITSSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSNDQ
GEKWKTNCTRPKEEAPFSYSGFPPMSKKGKSEPLKEIRRCTKSMSYNHQVGHIETVTFLPYSFMPCPCPFPHASRSSSTLQTAKSPSNYEPHNQIKLLDFRLSSLPLCFL
QNYDVDLEKGILCFTSKQRRSKGSNEKDGSVYNYNVVESDQSLLFFLQVVIAYFQHKYVWVSLKR
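
The protein sequence above should correspond to the translation of the CDS. A minimal sketch for checking that relationship is shown below. It assumes the standard genetic code (NCBI translation table 1) and frame 0, neither of which this record states. The helper `translate` is hypothetical and does not come from any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
print(translate("ATGGCATCCAAGAAGAAACAATCTGAAGGA"))
```

For those first 30 bases the result is `MASKKKQSEG`, which matches the start of the listed protein. Passing the full CDS, with its line breaks intact, allows the same comparison for the whole sequence.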