; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020237 (gene) of Snake gourd v1 genome

Gene IDTan0020237
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSAP30-binding protein-like isoform X2
Genome locationLG01:105283056..105284390
RNA-Seq ExpressionTan0020237
SyntenyTan0020237
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016874 - ligase activity (molecular function)
InterPro domainsIPR012479 - SAP30-binding protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573316.1 SAP30-binding protein, partial [Cucurbita argyrosperma subsp. sororia]2.2e-20688.39Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSS
        MASKKKESEGIALLSMYNDEDDEMEDVE   D E EEEDSEL QQQRQEEGG++DYGVRVAEEES  NSDRMI+S++ NDSTPPV DEN TPDKL FGSS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSS

Query:  TLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGS
        T Q     VS+SPMLLQ    DNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRV TPNNL+TPQISESPHSGS
Subjt:  TLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGS

Query:  MNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH
        MNNMILESET KVEETVEEEKKDI+PLDKFLPPPP +KCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH
Subjt:  MNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH

Query:  GYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVI
        GYDKSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPG TVV+APK+NIPF+GVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVI
Subjt:  GYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVI

Query:  SGGSDAA----LLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        SGGSDAA     L ++ANVGSGYMAFAQQRRREAEEK S+ERKLDRRS
Subjt:  SGGSDAA----LLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

XP_004150215.1 uncharacterized protein LOC101206323 [Cucumis sativus]1.2e-20788.42Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS
        MASKKK+SEGIALLSMYNDEDDEMEDVE+L+    EEED ELH QQ +EEGGEEDY GVRVAEEE VANSDRMIISD+ NDSTPPVA ENLTPDKL FGS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS

Query:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG
        ST Q   V VSSSPM+LQ  QLDNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV +ST NNLSTPQISESPHSG
Subjt:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG

Query:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP
        SMNN++ ESET KVEETVEEEKKDIDPLDKFLPPPP EKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDP
Subjt:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP

Query:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
        HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQPG TVV+APKINIPF+GVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
Subjt:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV

Query:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        ISGGSDA    A L +AANVGSGYMAFAQQRRREAEEK S ERKLDRRS
Subjt:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

XP_008443368.1 PREDICTED: uncharacterized protein LOC103486971 [Cucumis melo]5.2e-20888.86Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS
        MASKKK+SEGIALLSMYNDEDDEMEDVE+L   E EEED ELH QQ QE GGEEDY GVRVAEEE VANSDRMIISD+ NDSTPPVA ENLTPDKL +GS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS

Query:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG
        ST Q  HV VSSSPM+LQ  QLDNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV +ST NNLSTPQISESPHSG
Subjt:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG

Query:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP
        SMNN + ESET KVEETVEEEKKDIDPLDKFLPPPP EKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDP
Subjt:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP

Query:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
        HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ G TVV+APKINIPF+GVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
Subjt:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV

Query:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        ISGGSDA    A L +AANVGSGYMAFAQQRRREAEEK S+ERKLDRRS
Subjt:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

XP_022955191.1 DNA ligase 1-like isoform X1 [Cucurbita moschata]5.7e-20788.62Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSS
        MASKKKESEGIALLSMYNDEDDEMEDVE   D E EEEDSEL QQQRQEEGG++DYGVRVAEEES  NSDRMI+S++ NDSTPPV DEN TPDKL FGSS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSS

Query:  TLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGS
        T Q     VS+SPMLLQ    DNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRV TPNNL+TPQISESPHSGS
Subjt:  TLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGS

Query:  MNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH
        MNNMILESET KVEETVEEEKKDIDPLDKFLPPPP +KCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH
Subjt:  MNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH

Query:  GYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVI
        GYDKSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPG TVV+APK+NIPF+GVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVI
Subjt:  GYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVI

Query:  SGGSDAA----LLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        SGGSDAA     L ++ANVGSGYMAFAQQRRREAEEK S+ERKLDRRS
Subjt:  SGGSDAA----LLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

XP_038894986.1 uncharacterized protein LOC120083338 isoform X2 [Benincasa hispida]3.7e-20687.97Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS
        MASKKK+SEGIALLSMYNDEDDEMEDV     E+ EEEDSELH QQ QEEGGEEDY GVRVAEEE V NSDRMIISD+ N STPPVA EN TPDKL FGS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS

Query:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG
        ST Q   V VSSSPM LQA Q DNS RRRGT+ IVDYGHDE AMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV VST NNLSTPQISESPHSG
Subjt:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG

Query:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP
        SMNN+ILESET KVE+TVEEEKKDIDPLDKFLPPPP EKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP
Subjt:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP

Query:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
        HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPG TVV+APK+NIPF+GVSAI GSGLHSAAPASD IPRDGRQNKKSKWDKVDGDRRNPV
Subjt:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV

Query:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        ISGG DA    A L +AANVGSGYMAFAQQRRREAEEK S+ERKLDRRS
Subjt:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

TrEMBL top hitse value%identityAlignment
A0A0A0LX73 Uncharacterized protein2.2e-20487.81Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS
        MASKKK+SEGIALLSMYNDEDDEMEDVE+L+    EEED ELH QQ +EEGGEEDY GVRVAEEE VANSDRMIISD+ NDSTPPVA ENLTPDKL FGS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS

Query:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG
        ST Q   V VSSSPM+LQ  QLDNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDR+SPGTV +ST NNLSTPQISESPHSG
Subjt:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG

Query:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP
        SMNN++ ESET KVEETVEEEKKDIDPLDKFLPPPP EKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDP
Subjt:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP

Query:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
        HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQPG TVV+APKINIPF+GVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
Subjt:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV

Query:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTER
        ISGGSDA    A L +AANVGSGYMAFAQQRRREAEEK S ++
Subjt:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTER

A0A1S3B7X1 uncharacterized protein LOC1034869712.5e-20888.86Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS
        MASKKK+SEGIALLSMYNDEDDEMEDVE+L   E EEED ELH QQ QE GGEEDY GVRVAEEE VANSDRMIISD+ NDSTPPVA ENLTPDKL +GS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS

Query:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG
        ST Q  HV VSSSPM+LQ  QLDNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV +ST NNLSTPQISESPHSG
Subjt:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG

Query:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP
        SMNN + ESET KVEETVEEEKKDIDPLDKFLPPPP EKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDP
Subjt:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP

Query:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
        HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ G TVV+APKINIPF+GVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
Subjt:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV

Query:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        ISGGSDA    A L +AANVGSGYMAFAQQRRREAEEK S+ERKLDRRS
Subjt:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

A0A5A7UPK6 SAP30-binding protein-like2.5e-20888.86Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS
        MASKKK+SEGIALLSMYNDEDDEMEDVE+L   E EEED ELH QQ QE GGEEDY GVRVAEEE VANSDRMIISD+ NDSTPPVA ENLTPDKL +GS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDY-GVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGS

Query:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG
        ST Q  HV VSSSPM+LQ  QLDNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTV +ST NNLSTPQISESPHSG
Subjt:  STLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSG

Query:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP
        SMNN + ESET KVEETVEEEKKDIDPLDKFLPPPP EKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDP
Subjt:  SMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDP

Query:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
        HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ G TVV+APKINIPF+GVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV
Subjt:  HGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPV

Query:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        ISGGSDA    A L +AANVGSGYMAFAQQRRREAEEK S+ERKLDRRS
Subjt:  ISGGSDA----ALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

A0A6J1GT35 DNA ligase 1-like isoform X12.8e-20788.62Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSS
        MASKKKESEGIALLSMYNDEDDEMEDVE   D E EEEDSEL QQQRQEEGG++DYGVRVAEEES  NSDRMI+S++ NDSTPPV DEN TPDKL FGSS
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSS

Query:  TLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGS
        T Q     VS+SPMLLQ    DNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRV TPNNL+TPQISESPHSGS
Subjt:  TLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGS

Query:  MNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH
        MNNMILESET KVEETVEEEKKDIDPLDKFLPPPP +KCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH
Subjt:  MNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPH

Query:  GYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVI
        GYDKSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPG TVV+APK+NIPF+GVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVI
Subjt:  GYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVI

Query:  SGGSDAA----LLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        SGGSDAA     L ++ANVGSGYMAFAQQRRREAEEK S+ERKLDRRS
Subjt:  SGGSDAA----LLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

A0A6J1K652 DNA ligase 1 isoform X15.2e-20686.75Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEEL-----KDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKL
        MASKKKESEGIALLSMYNDEDD+MEDVE++     ++EE EEEDSELH QQRQ+EGGE+DYGVRVAEEES  NSDRMI+S++ NDSTPPV DEN TP+KL
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEEL-----KDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKL

Query:  NFGSSTLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISES
         FGSST Q     VS SPMLLQ    DNS RRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRV TPNNL+TPQISES
Subjt:  NFGSSTLQQLHVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISES

Query:  PHSGSMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKD
        PHSGSMNN+ILESET KVEETVEEEKKDI+PLDKFLPPPP +KCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKD
Subjt:  PHSGSMNNMILESETAKVEETVEEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKD

Query:  VFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDR
        VFDPHGYDKSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPG TVV+APK+NIPF+GVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDGDR
Subjt:  VFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDR

Query:  RNPVISGGSDAA----LLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        RNPVISGGSDAA     L ++ANVGSGYMAFAQQRRREAEEK S+ERKLDRRS
Subjt:  RNPVISGGSDAA----LLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

SwissProt top hitse value%identityAlignment
Q02614 SAP30-binding protein9.7e-1632.62Show/hide
Query:  ESETAKVEETV---EEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        E+E    +E V    E  +++ P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP      +++  ID++G+ + KD+FDPHG+
Subjt:  ESETAKVEETV---EEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWD
         +  YY  +    K EM++ E  +K+  K+EFV+ GT+ G T              +A A S   ++   +DA      Q +KSKWD
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWD

Q9UHR5 SAP30-binding protein3.7e-1531.55Show/hide
Query:  ESETAKVEETV---EEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        E+E    +E V    E  +++ P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP      +++  ID++G+ + KD+FDPHG+
Subjt:  ESETAKVEETV---EEEKKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWD
         +  YY  +    K EM++ E  +K+  K+EFV+ GT+ G T              +A + +   ++   +DA      Q +KSKWD
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWD

Arabidopsis top hitse value%identityAlignment
AT1G29220.1 transcriptional regulator family protein5.4e-7044.77Show/hide
Query:  KESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSSTLQQL
        K+SEGIALLS+Y+DEDD     EE++D E EEE+ E  + Q + E        ++ EE+ V  ++ M              DE    +K   G       
Subjt:  KESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSSTLQQL

Query:  HVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGSMNNMI
            S +P LL      +SA                                                         TP +L   + S    S   N MI
Subjt:  HVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGSMNNMI

Query:  LESETAKVEETVEEEKKDIDP-LDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDK
         ES  A  E   +   +  D  LD+FLPP P E+CSEELQRKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSKDVFDP GYD 
Subjt:  LESETAKVEETVEEEKKDIDP-LDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDK

Query:  SDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGG-
        SD+   IE DMK E ERKE E KK+ K++FVS GTQPGA V +A K NIP  G+ A+A SGL S    ++   RDGR NKKSKWDKVDGD +NP ++ G 
Subjt:  SDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGG-

Query:  --------SDAALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
                S+AAL+S A + GSGY AFAQQRRRE E + S+ERKL+RRS
Subjt:  --------SDAALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS

AT1G29220.2 transcriptional regulator family protein2.5e-6743.6Show/hide
Query:  KESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSSTLQQL
        K+SEGIALLS+Y+DEDD     EE++D E EEE+ E  + Q + E        ++ EE+ V  ++ M              DE    +K   G       
Subjt:  KESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSSTLQQL

Query:  HVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGSMNNMI
            S +P LL      +SA                                                         TP +L   + S    S   N MI
Subjt:  HVAVSSSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGSMNNMI

Query:  LESETAKVEETVEEEKKDIDP-LDKFLPPPPNEKCSEELQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCF
         ES  A  E   +   +  D  LD+FLPP P E+CSEELQ            RKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCF
Subjt:  LESETAKVEETVEEEKKDIDP-LDKFLPPPPNEKCSEELQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCF

Query:  SKDVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVD
        SKDVFDP GYD SD+   IE DMK E ERKE E KK+ K++FVS GTQPGA V +A K NIP  G+ A+A SGL S    ++   RDGR NKKSKWDKVD
Subjt:  SKDVFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVD

Query:  GDRRNPVISGG---------SDAALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS
        GD +NP ++ G         S+AAL+S A + GSGY AFAQQRRRE E + S+ERKL+RRS
Subjt:  GDRRNPVISGG---------SDAALLSAAANVGSGYMAFAQQRRREAEEKSSTERKLDRRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCGAAGAAGAAAGAATCCGAAGGTATTGCTTTGCTCTCGATGTACAATGATGAGGACGATGAGATGGAAGACGTTGAAGAGCTAAAAGATGAAGAAGGAGAAGA
AGAAGATAGTGAACTGCACCAGCAACAAAGGCAAGAAGAGGGAGGAGAGGAAGATTATGGAGTTAGGGTTGCAGAAGAAGAGTCAGTTGCGAATAGTGATAGAATGATTA
TCAGTGATAATGTTAATGATTCAACACCGCCTGTTGCTGATGAAAATTTGACTCCAGATAAGCTCAATTTCGGGTCGTCCACACTGCAGCAGCTCCATGTTGCGGTTTCA
TCGTCGCCAATGCTATTACAAGCTGCGCAACTAGATAATTCTGCTAGGAGAAGGGGGACACTTGCGATAGTTGATTACGGTCACGATGAAGCCGCAATGTCTCCTGAGGC
CGAGGATGGAGAAATTGAAGAATCTGGTCGCGTCACATTTGGCGATGAGCTCTTGGGCACTAATGGTGATTTTGATAGAACATCTCCCGGAACTGTAAGAGTTTCAACTC
CAAACAATTTATCCACTCCTCAAATTTCTGAATCACCACATTCTGGTTCAATGAACAACATGATACTGGAATCTGAAACTGCAAAAGTTGAGGAAACTGTTGAAGAGGAG
AAAAAAGATATTGATCCCTTGGACAAGTTTCTTCCTCCACCACCAAACGAAAAATGCTCAGAGGAGCTGCAAAGGAAAATTAATAAGTTTCTTGAGTATAAGAAAGCTGG
AAAAAGCTTCAATGCAGAAGTCCGTAATAGGAAGGACTACCGGAATCCAGATTTCTTATTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTA
AGGATGTGTTTGACCCGCATGGATATGATAAAAGTGACTACTATACTGAAATAGAGGCTGACATGAAACGTGAGATGGAGAGAAAGGAGCTGGAAAGGAAGAAAAGTCCG
AAGATGGAGTTTGTTTCAGGAGGAACACAACCCGGTGCTACAGTTGTGTCTGCTCCTAAAATAAATATACCTTTTACAGGTGTTTCAGCCATTGCTGGTAGTGGACTGCA
TTCAGCAGCTCCTGCATCTGATGCCATTCCTAGGGATGGAAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGTGATAGAAGAAATCCAGTAATTTCAGGTGGGT
CAGATGCAGCTTTACTATCTGCTGCTGCCAATGTTGGCTCTGGATACATGGCTTTTGCGCAACAGAGAAGGCGAGAGGCTGAAGAAAAAAGTTCCACTGAGAGAAAGTTA
GATAGAAGATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCGAAGAAGAAAGAATCCGAAGGTATTGCTTTGCTCTCGATGTACAATGATGAGGACGATGAGATGGAAGACGTTGAAGAGCTAAAAGATGAAGAAGGAGAAGA
AGAAGATAGTGAACTGCACCAGCAACAAAGGCAAGAAGAGGGAGGAGAGGAAGATTATGGAGTTAGGGTTGCAGAAGAAGAGTCAGTTGCGAATAGTGATAGAATGATTA
TCAGTGATAATGTTAATGATTCAACACCGCCTGTTGCTGATGAAAATTTGACTCCAGATAAGCTCAATTTCGGGTCGTCCACACTGCAGCAGCTCCATGTTGCGGTTTCA
TCGTCGCCAATGCTATTACAAGCTGCGCAACTAGATAATTCTGCTAGGAGAAGGGGGACACTTGCGATAGTTGATTACGGTCACGATGAAGCCGCAATGTCTCCTGAGGC
CGAGGATGGAGAAATTGAAGAATCTGGTCGCGTCACATTTGGCGATGAGCTCTTGGGCACTAATGGTGATTTTGATAGAACATCTCCCGGAACTGTAAGAGTTTCAACTC
CAAACAATTTATCCACTCCTCAAATTTCTGAATCACCACATTCTGGTTCAATGAACAACATGATACTGGAATCTGAAACTGCAAAAGTTGAGGAAACTGTTGAAGAGGAG
AAAAAAGATATTGATCCCTTGGACAAGTTTCTTCCTCCACCACCAAACGAAAAATGCTCAGAGGAGCTGCAAAGGAAAATTAATAAGTTTCTTGAGTATAAGAAAGCTGG
AAAAAGCTTCAATGCAGAAGTCCGTAATAGGAAGGACTACCGGAATCCAGATTTCTTATTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTA
AGGATGTGTTTGACCCGCATGGATATGATAAAAGTGACTACTATACTGAAATAGAGGCTGACATGAAACGTGAGATGGAGAGAAAGGAGCTGGAAAGGAAGAAAAGTCCG
AAGATGGAGTTTGTTTCAGGAGGAACACAACCCGGTGCTACAGTTGTGTCTGCTCCTAAAATAAATATACCTTTTACAGGTGTTTCAGCCATTGCTGGTAGTGGACTGCA
TTCAGCAGCTCCTGCATCTGATGCCATTCCTAGGGATGGAAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGTGATAGAAGAAATCCAGTAATTTCAGGTGGGT
CAGATGCAGCTTTACTATCTGCTGCTGCCAATGTTGGCTCTGGATACATGGCTTTTGCGCAACAGAGAAGGCGAGAGGCTGAAGAAAAAAGTTCCACTGAGAGAAAGTTA
GATAGAAGATCCTAA
Protein sequenceShow/hide protein sequence
MASKKKESEGIALLSMYNDEDDEMEDVEELKDEEGEEEDSELHQQQRQEEGGEEDYGVRVAEEESVANSDRMIISDNVNDSTPPVADENLTPDKLNFGSSTLQQLHVAVS
SSPMLLQAAQLDNSARRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLGTNGDFDRTSPGTVRVSTPNNLSTPQISESPHSGSMNNMILESETAKVEETVEEE
KKDIDPLDKFLPPPPNEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDYYTEIEADMKREMERKELERKKSP
KMEFVSGGTQPGATVVSAPKINIPFTGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISGGSDAALLSAAANVGSGYMAFAQQRRREAEEKSSTERKL
DRRS