; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g008060 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g008060
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionSAP30-binding protein-like isoform X2
Genome locationChr06:7876458..7882184
RNA-Seq ExpressionLcy06g008060
SyntenyLcy06g008060
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016874 - ligase activity (molecular function)
InterPro domainsIPR012479 - SAP30-binding protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8653213.1 hypothetical protein Csa_019629 [Cucumis sativus]5.4e-21390.21Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ
        MASKKK+SEGIALLSMYNDEDDEMEDVE+LEEEED  L  QQ +EE GEEDYAGVRV EEE VANSDRMIIS+SANDSTPPVA E LTP+KLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ

Query:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND
        PPQ VVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+R+SPG V +ST NNL+TPQISESPHSGSMN+
Subjt:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND

Query:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        ++PE ET K EET+EEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGYD
Subjt:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG
        KSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQPG TVVTAPKINIPFSGVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS G
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG

Query:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER
        SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ++
Subjt:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER

KAG6573316.1 SAP30-binding protein, partial [Cucurbita argyrosperma subsp. sororia]2.3e-21189.89Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ
        MASKKKESEGIALLSMYNDEDDEMEDVE++EEEE+    QQQRQEE G++DY GVRV EEES  NSDRMI+SESANDSTPPV DE  TP+KLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ

Query:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND
        PPQAVVS+SPMLLQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+RTSPG VRV TPNNLATPQISESPHSGSMN+
Subjt:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND

Query:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        MI E ET K EET+EEEKKDI+PLDKFLPPPPK+KCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
Subjt:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG
        KSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPG TVV APK+NIPFSGVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVIS G
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG

Query:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        SDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_004150215.1 uncharacterized protein LOC101206323 [Cucumis sativus]1.8e-21690.79Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ
        MASKKK+SEGIALLSMYNDEDDEMEDVE+LEEEED  L  QQ +EE GEEDYAGVRV EEE VANSDRMIIS+SANDSTPPVA E LTP+KLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ

Query:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND
        PPQ VVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+R+SPG V +ST NNL+TPQISESPHSGSMN+
Subjt:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND

Query:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        ++PE ET K EET+EEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGYD
Subjt:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG
        KSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQPG TVVTAPKINIPFSGVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS G
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG

Query:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ERKLDRRS
Subjt:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_008443368.1 PREDICTED: uncharacterized protein LOC103486971 [Cucumis melo]5.7e-21590.81Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSG-LKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVE+LEEEE+ G L  QQ QE  GEEDYAGVRV EEE VANSDRMIIS+SANDSTPPVA E LTP+KLK+GSSTP
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSG-LKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTP

Query:  QPPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMN
        QPP  VVSSSPM+LQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+RTSPG V +ST NNL+TPQISESPHSGSMN
Subjt:  QPPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMN

Query:  DMIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        + +PE ET K EET+EEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  DMIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIST
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ G TVVTAPKINIPFSGVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS 
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIST

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

XP_022955191.1 DNA ligase 1-like isoform X1 [Cucurbita moschata]5.9e-21290.11Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ
        MASKKKESEGIALLSMYNDEDDEMEDVE++EEEE+    QQQRQEE G++DY GVRV EEES  NSDRMI+SESANDSTPPV DE  TP+KLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ

Query:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND
        PPQAVVS+SPMLLQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+RTSPG VRV TPNNLATPQISESPHSGSMN+
Subjt:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND

Query:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        MI E ET K EET+EEEKKDIDPLDKFLPPPPK+KCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
Subjt:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG
        KSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPG TVV APK+NIPFSGVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVIS G
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG

Query:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        SDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

TrEMBL top hitse value%identityAlignment
A0A0A0LX73 Uncharacterized protein2.6e-21390.21Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ
        MASKKK+SEGIALLSMYNDEDDEMEDVE+LEEEED  L  QQ +EE GEEDYAGVRV EEE VANSDRMIIS+SANDSTPPVA E LTP+KLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ

Query:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND
        PPQ VVSSSPM+LQ GQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+R+SPG V +ST NNL+TPQISESPHSGSMN+
Subjt:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND

Query:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        ++PE ET K EET+EEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGYD
Subjt:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG
        KSDYYTEIEADMKREMERKELERKKSPKMEFV+GGTQPG TVVTAPKINIPFSGVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS G
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG

Query:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER
        SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRS ++
Subjt:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSER

A0A1S3B7X1 uncharacterized protein LOC1034869712.8e-21590.81Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSG-LKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVE+LEEEE+ G L  QQ QE  GEEDYAGVRV EEE VANSDRMIIS+SANDSTPPVA E LTP+KLK+GSSTP
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSG-LKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTP

Query:  QPPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMN
        QPP  VVSSSPM+LQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+RTSPG V +ST NNL+TPQISESPHSGSMN
Subjt:  QPPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMN

Query:  DMIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        + +PE ET K EET+EEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  DMIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIST
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ G TVVTAPKINIPFSGVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS 
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIST

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A5A7UPK6 SAP30-binding protein-like2.8e-21590.81Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSG-LKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTP
        MASKKK+SEGIALLSMYNDEDDEMEDVE+LEEEE+ G L  QQ QE  GEEDYAGVRV EEE VANSDRMIIS+SANDSTPPVA E LTP+KLK+GSSTP
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSG-LKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTP

Query:  QPPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMN
        QPP  VVSSSPM+LQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+RTSPG V +ST NNL+TPQISESPHSGSMN
Subjt:  QPPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMN

Query:  DMIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY
        + +PE ET K EET+EEEKKDIDPLDKFLPPPPKEKCSE+LQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSK+VFDPHGY
Subjt:  DMIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGY

Query:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIST
        DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQ G TVVTAPKINIPFSGVSAI  SGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIS 
Subjt:  DKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVIST

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A6J1CFK8 uncharacterized protein LOC111010773 isoform X22.5e-20889.01Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ
        MASKKKESEGIALLSMYNDEDDEMEDVEE +EE D+ L QQQ QEE GEEDY GVRV EEESVANSDRMI+S+SANDSTPPVADE LTP+KLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ

Query:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND
        PPQ +VSSSPMLLQ GQLDN GRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGD +RTSPG  RVSTPNNLAT QISESPHSGSMN+
Subjt:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND

Query:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
         IPE ETAK EET+EEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYK+AGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
Subjt:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPR-DGRQNKKSKWDKVDGDRRNPVIST
        KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQP   VVTAPKINIPFSGV         SAAPASDAIPR DGRQNKKSKWDKVDGDRRNP+IS 
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPR-DGRQNKKSKWDKVDGDRRNPVIST

Query:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        GSD  +AHAA+LSAANVGSGY+AFAQQRRREAEEKRSSERKLDRRS
Subjt:  GSDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

A0A6J1GT35 DNA ligase 1-like isoform X12.9e-21290.11Show/hide
Query:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ
        MASKKKESEGIALLSMYNDEDDEMEDVE++EEEE+    QQQRQEE G++DY GVRV EEES  NSDRMI+SESANDSTPPV DE  TP+KLKFGSSTPQ
Subjt:  MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQ

Query:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND
        PPQAVVS+SPMLLQ    DNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELL TNGDF+RTSPG VRV TPNNLATPQISESPHSGSMN+
Subjt:  PPQAVVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMND

Query:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
        MI E ET K EET+EEEKKDIDPLDKFLPPPPK+KCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD
Subjt:  MIPEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYD

Query:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG
        KSDYY EIEADMKREMERKELERKKSPKMEFVSGGTQPG TVV APK+NIPFSGVSAI GSGLHSAA ASDAIPRDGRQNKKSKWDKVDGDRRNPVIS G
Subjt:  KSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTG

Query:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        SDAASAH ALLS+ANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
Subjt:  SDAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

SwissProt top hitse value%identityAlignment
Q02614 SAP30-binding protein3.3e-1632.43Show/hide
Query:  PEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDK
        P+   A F E +    +++ P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP      +++  ID++G+ + KD+FDPHG+ +
Subjt:  PEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDK

Query:  SDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWD
          YY  +    K EM++ E  +K+  K+EFV+ GT+ G T              +A A S   ++   +DA      Q +KSKWD
Subjt:  SDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWD

Q9UHR5 SAP30-binding protein1.7e-1535Show/hide
Query:  PEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDK
        P+   A F E +    +++ P +  +PP P  +CS  LQ KI K  E K K G   N  ++ +K++RNP      +++  ID++G+ + KD+FDPHG+ +
Subjt:  PEFETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYK-KAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDK

Query:  SDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVT
          YY  +    K EM++ E  +K+  K+EFV+ GT+ G T
Subjt:  SDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVT

Arabidopsis top hitse value%identityAlignment
AT1G29220.1 transcriptional regulator family protein1.4e-7346.29Show/hide
Query:  KESEGIALLSMYNDEDD-EMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQPPQA
        K+SEGIALLS+Y+DEDD EMED EE EEEED    ++QR +EE E      +++EE+ V  ++ M   E                      S TP+    
Subjt:  KESEGIALLSMYNDEDD-EMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQPPQA

Query:  VVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMNDMIPE
        V +SS        LDN               DE++  P+  +  I ESG V  G+   D +G+ + T                                 
Subjt:  VVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMNDMIPE

Query:  FETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDY
                           LD+FLPP P+E+CSEELQRKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSKDVFDP GYD SD+
Subjt:  FETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDY

Query:  YTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTGS---
           IE DMK E ERKE E KK+ K++FVS GTQPG  V  A K NIP  G+ A+A SGL S    ++   RDGR NKKSKWDKVDGD +NP ++ G+   
Subjt:  YTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTGS---

Query:  -DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
          +  ++AAL+SA + GSGY AFAQQRRRE E +RSSERKL+RRS
Subjt:  -DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS

AT1G29220.2 transcriptional regulator family protein6.4e-7145.08Show/hide
Query:  KESEGIALLSMYNDEDD-EMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQPPQA
        K+SEGIALLS+Y+DEDD EMED EE EEEED    ++QR +EE E      +++EE+ V  ++ M   E                      S TP+    
Subjt:  KESEGIALLSMYNDEDD-EMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQPPQA

Query:  VVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMNDMIPE
        V +SS        LDN               DE++  P+  +  I ESG V  G+   D +G+ + T                                 
Subjt:  VVSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMNDMIPE

Query:  FETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKD
                           LD+FLPP P+E+CSEELQ            RKI+KFL  KK GKSFN+EVRNRK+YRNPDFLLHAV YQDIDQIGSCFSKD
Subjt:  FETAKFEETIEEEKKDIDPLDKFLPPPPKEKCSEELQ------------RKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKD

Query:  VFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDR
        VFDP GYD SD+   IE DMK E ERKE E KK+ K++FVS GTQPG  V  A K NIP  G+ A+A SGL S    ++   RDGR NKKSKWDKVDGD 
Subjt:  VFDPHGYDKSDYYTEIEADMKREMERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDR

Query:  RNPVISTGS----DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS
        +NP ++ G+     +  ++AAL+SA + GSGY AFAQQRRRE E +RSSERKL+RRS
Subjt:  RNPVISTGS----DAASAHAALLSAANVGSGYMAFAQQRRREAEEKRSSERKLDRRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCGAAGAAGAAAGAATCTGAAGGTATAGCTTTACTTTCGATGTACAATGACGAGGACGATGAGATGGAAGATGTTGAAGAGCTGGAAGAAGAAGAAGAT
AGTGGATTGAAGCAGCAGCAGAGGCAAGAAGAGGAAGGAGAGGAAGATTATGCAGGAGTTAGGGTTGTAGAAGAAGAGTCAGTTGCGAACAGTGATAGAATGATT
ATCAGTGAATCTGCTAATGATTCCACGCCGCCGGTTGCTGATGAATATCTGACTCCGAATAAGCTGAAATTCGGGTCTTCCACACCGCAGCCGCCCCAGGCTGTG
GTTTCATCGTCGCCAATGCTATTACAAACTGGGCAACTAGATAATTCTGGTAGGAGAAGGGGGACGCTTGCGATAGTTGATTACGGTCACGATGAAGCCGCAATG
TCTCCTGAGGCTGAGGATGGAGAAATCGAAGAATCTGGTCGTGTCACATTTGGCGATGAGCTTTTAGACACTAATGGTGATTTTAATAGAACGTCTCCTGGAATT
GTAAGAGTTTCAACACCAAACAATCTAGCCACTCCTCAAATTTCTGAATCACCACATTCTGGTTCAATGAACGACATGATTCCAGAATTTGAAACTGCAAAATTC
GAGGAAACCATTGAAGAAGAGAAAAAAGATATTGATCCCTTGGACAAGTTTCTTCCTCCACCACCAAAAGAGAAATGCTCAGAGGAGCTGCAAAGGAAAATCAAC
AAGTTTCTTGAGTATAAGAAAGCTGGAAAAAGCTTCAATGCAGAAGTACGCAATAGGAAGGACTACCGGAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAA
GATATTGACCAGATTGGGTCTTGCTTCAGTAAGGATGTGTTTGACCCGCATGGATATGATAAAAGTGACTACTATACTGAAATAGAGGCTGACATGAAGCGTGAG
ATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATGGAGTTTGTTTCAGGAGGAACACAACCTGGTGTTACAGTTGTGACTGCTCCTAAAATAAATATA
CCTTTTTCAGGTGTTTCAGCTATCGCTGGTAGTGGACTACATTCAGCAGCTCCTGCATCTGATGCCATTCCTAGGGATGGCAGACAAAACAAAAAATCAAAATGG
GATAAGGTAGATGGTGATAGAAGAAATCCAGTAATTTCTACTGGGTCAGATGCAGCTAGTGCTCATGCAGCTTTACTATCTGCTGCTAATGTTGGCTCCGGATAC
ATGGCTTTTGCGCAACAGAGACGGCGAGAGGCTGAAGAAAAAAGATCCAGCGAGAGAAAGTTGGATAGAAGATCCTAA
mRNA sequenceShow/hide mRNA sequence
GGGAAACTGAAAACCCTTCTCCCTAGTATCGACATTGATTCAAAATCCCAAGCTTTCTGCATTCAATTGAGATCCAACTTTCGATTTCGAAAGTCCAATTCTTCG
TTGTTGAACTATTTCCTTCCGGGTGCCGAGAATTGAAGCTCTCATGGCATCGAAGAAGAAAGAATCTGAAGGTATAGCTTTACTTTCGATGTACAATGACGAGGA
CGATGAGATGGAAGATGTTGAAGAGCTGGAAGAAGAAGAAGATAGTGGATTGAAGCAGCAGCAGAGGCAAGAAGAGGAAGGAGAGGAAGATTATGCAGGAGTTAG
GGTTGTAGAAGAAGAGTCAGTTGCGAACAGTGATAGAATGATTATCAGTGAATCTGCTAATGATTCCACGCCGCCGGTTGCTGATGAATATCTGACTCCGAATAA
GCTGAAATTCGGGTCTTCCACACCGCAGCCGCCCCAGGCTGTGGTTTCATCGTCGCCAATGCTATTACAAACTGGGCAACTAGATAATTCTGGTAGGAGAAGGGG
GACGCTTGCGATAGTTGATTACGGTCACGATGAAGCCGCAATGTCTCCTGAGGCTGAGGATGGAGAAATCGAAGAATCTGGTCGTGTCACATTTGGCGATGAGCT
TTTAGACACTAATGGTGATTTTAATAGAACGTCTCCTGGAATTGTAAGAGTTTCAACACCAAACAATCTAGCCACTCCTCAAATTTCTGAATCACCACATTCTGG
TTCAATGAACGACATGATTCCAGAATTTGAAACTGCAAAATTCGAGGAAACCATTGAAGAAGAGAAAAAAGATATTGATCCCTTGGACAAGTTTCTTCCTCCACC
ACCAAAAGAGAAATGCTCAGAGGAGCTGCAAAGGAAAATCAACAAGTTTCTTGAGTATAAGAAAGCTGGAAAAAGCTTCAATGCAGAAGTACGCAATAGGAAGGA
CTACCGGAATCCAGATTTCTTGTTACATGCTGTGAGGTATCAAGATATTGACCAGATTGGGTCTTGCTTCAGTAAGGATGTGTTTGACCCGCATGGATATGATAA
AAGTGACTACTATACTGAAATAGAGGCTGACATGAAGCGTGAGATGGAGAGGAAGGAGCTGGAAAGGAAGAAAAGTCCGAAGATGGAGTTTGTTTCAGGAGGAAC
ACAACCTGGTGTTACAGTTGTGACTGCTCCTAAAATAAATATACCTTTTTCAGGTGTTTCAGCTATCGCTGGTAGTGGACTACATTCAGCAGCTCCTGCATCTGA
TGCCATTCCTAGGGATGGCAGACAAAACAAAAAATCAAAATGGGATAAGGTAGATGGTGATAGAAGAAATCCAGTAATTTCTACTGGGTCAGATGCAGCTAGTGC
TCATGCAGCTTTACTATCTGCTGCTAATGTTGGCTCCGGATACATGGCTTTTGCGCAACAGAGACGGCGAGAGGCTGAAGAAAAAAGATCCAGCGAGAGAAAGTT
GGATAGAAGATCCTAAGAGAAATGAATTCTGTTCCATATAATTTAATTTCTGAACCATTTTGAAACGTAAGGGAAATGGCCACATTGTAGCTTTGTATCTTGTGA
CTAACCATGTATACGGTCAGATTGAAAATGCAATACGTCAGTTACCATTAACTTCCTCTTCTGAAAGTTTAATATTGCTTGTAACTCAATTTTTTCATCAAATTT
CTTGACTTGGAAAGATTATCCAGGGGAAAACAAATTACACCAAAACGAGGAAGGAGGCTCCTTTTTGTATGAGATTCATGAGGGAAAAAAAAATTGACTACACCT
CAAGTACATGGTAACTTATAAATAGATTTGCTCTCTCTACTGCATAGTATTAAAGAGATCTTTATTGATTGCTCACTTGTCAGAGGTTTAGTCAGCTTAAGGTTT
TAATTACCAATCAAGTAATTCACTTGCTACTTCCCTAGGATATGAAGCTGAGGAGGGATTTTTGTTCACAATATGCAGCTTTGATCTATCTGTATTCTGTTATCC
TTTTGTCTTGATAGTGTTGGATTCCCTCCTATGAGTAAAAAAAAAGGAGGAAATCAGAACCCCTAGAGCTACGTAGATGTACAAAATCCATGTCTGATAACCATC
AAGTCAGTGAAAAAATTGACAAATGATCCTCTAATATGGGTTGGCCATATGGAGACAGTGACGTCTTTGCCTTACAGCTTCTGCCCTGTCCTTTTCCCCACGCCC
CTCGATCTTCAACCCTACAAACAGCCAAAATCCCGAGCAAGTATAAGCCCCACAGGCCACAATCAAGCTTGATTCTACTAGGTGTCTTCACTAAAATCATCTTCC
ACCACACTCTGGTTTTCTCACTACCAAACATCACCGAATGCTTTAAAGTCAATACCATCAGCCTTTCCTTTAATTCTTCTCGATGCATGTTTGCCGAAAGTTACC
CTCTTCCCTAAATAATCGCAAAAGAGTAAACCACAGACCCCACCCTCTGCCCCCACCCCCATTGCCTTTTGTTCTAAATTAACAAAACAACTTGTTTATGTTTCA
ACTTAGGAACCGGGCTTGAAGCAAAATTATGAAGAAAAGGAATCTTCTTCCATATTTCAACATCGAAACAAAGGAGATCCGAAGGCATTAATGAGAAGATGGTTG
CTTTGTAATTATAATGTTTGAGTCCCTTCCTTTTTCTTGCAGGTGGTTGCATATTTTCAACACAAGTACGTGGGTCAGCTTGAAAAGGCAAGCCATTGGCACCCG
AGAATTGGTGTGTTCTTTCTTTCTTTTTATTTTGTTTCCCTCCACTCGACTCTCGAGGATATGTATGGGATTCCTTGGAATTGGAAGGCGAAGAAGACTGTTTGT
TTATTCTCCCATCTCATTTCCTGATTGCTGATAAAGACCAAAATGCCCATATGCTTGCTAGCTGGTGGTGCTGGTGGAAGTGGATTTATATGTTTTTGAACCCTT
TGCTTTTTCCCTTAAAACTGCTTTTCTGCACAGCTGCTTTTGGAATAGAACACAAGTTGTCTTCTTTTGAGAGAGAGAAGAAAGAAAGAGATGTGCATGTGATTG
TGAGTGAAGCTAAAGACTTGTCCCATGGATGCATGTAAAAGACTGCCCTTTCGCCTTAGTTGACAGAGAGAGTGACGGAGTTTCCGGTGCTTCTCATTAAAACTT
GCATAAATTTCC
Protein sequenceShow/hide protein sequence
MASKKKESEGIALLSMYNDEDDEMEDVEELEEEEDSGLKQQQRQEEEGEEDYAGVRVVEEESVANSDRMIISESANDSTPPVADEYLTPNKLKFGSSTPQPPQAV
VSSSPMLLQTGQLDNSGRRRGTLAIVDYGHDEAAMSPEAEDGEIEESGRVTFGDELLDTNGDFNRTSPGIVRVSTPNNLATPQISESPHSGSMNDMIPEFETAKF
EETIEEEKKDIDPLDKFLPPPPKEKCSEELQRKINKFLEYKKAGKSFNAEVRNRKDYRNPDFLLHAVRYQDIDQIGSCFSKDVFDPHGYDKSDYYTEIEADMKRE
MERKELERKKSPKMEFVSGGTQPGVTVVTAPKINIPFSGVSAIAGSGLHSAAPASDAIPRDGRQNKKSKWDKVDGDRRNPVISTGSDAASAHAALLSAANVGSGY
MAFAQQRRREAEEKRSSERKLDRRS