; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS013492 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS013492
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSET and MYND domain-containing protein 4
Genome locationscaffold402:1934082..1938342
RNA-Seq ExpressionMS013492
SyntenyMS013492
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001214 - SET domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019525.1 SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma]1.0e-21983.22Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
        AWYRRGKANA++GNF DA+RDF I++NVE+SFNGKKQ++DELK+IQ QHKRSN V EHS NKLD+     DEPIQVKLHVTTSNKGRGMVSP EIPPSS+
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM

Query:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC
        VH+EEPYA VILKH RETHCHYC NELPADK+PCPSC+IPLYCSQ CQIQAGGQMLQ  PDN++ILK+LSDDL+KYVQEIT   FAD+R +DVPEHKHEC
Subjt:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC

Query:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
        DGVHWPAILPSEIVLAGRIV K+V Q G FADAS++VDMLNLSHHFS+M  DSKLECIIYSIIL SCL+QFFPSQLP+N N+ISQIVILISQIRTNSISI
Subjt:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI

Query:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
        VR+KSFDAPGS DQ G LSS  PFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTT FV+VGCPLELSYGPQVGQLDCKDRLKLLEDEYS
Subjt:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS

Query:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD
        FKCQCSGCS+V+I DLVLNAFCCIN  C GVVLDRS+F+CENKK +D
Subjt:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD

XP_022139922.1 SET and MYND domain-containing protein 4 [Momordica charantia]4.2e-25898.44Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
        AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLD      DEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM

Query:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC
        VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIR EDVPEHKHEC
Subjt:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC

Query:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
        DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
Subjt:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI

Query:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
        VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
Subjt:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS

Query:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF
        FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF
Subjt:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF

XP_022927244.1 SET and MYND domain-containing protein 4 isoform X1 [Cucurbita moschata]3.9e-21983Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
        AWYRRGKANA++GNF DA+ DF I++NVE+SFNGKKQ++DELK+IQ QHKRSN V EHS NKLD+     DEPIQVKLHVTTSNKGRGMVSP EIPPSS+
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM

Query:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC
        VH+EEPYA VILKH RETHCHYC NELPADK+PCPSC+IPLYCSQ CQIQAGGQMLQ  PDN++ILK+LSDDL+KYVQEIT   FAD+R +DVPEHKHEC
Subjt:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC

Query:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
        DGVHWPAILPSEIVLAGRIV K+V Q G FADAS++VDMLNLSHHFS+M  DSKLECIIYSIIL SCL+QFFPSQLP+N N+ISQIVILISQIRTNSISI
Subjt:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI

Query:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
        VR+KSFDAPGS DQ G LSS  PFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTT FV+VGCPLELSYGPQVGQLDCKDRLKLLEDEYS
Subjt:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS

Query:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD
        FKCQCSGCS+V+I DLVLNAFCCIN  C GVVLDRS+F+CENKK +D
Subjt:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD

XP_022927246.1 SET and MYND domain-containing protein 4 isoform X2 [Cucurbita moschata]3.9e-21983Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
        AWYRRGKANA++GNF DA+ DF I++NVE+SFNGKKQ++DELK+IQ QHKRSN V EHS NKLD+     DEPIQVKLHVTTSNKGRGMVSP EIPPSS+
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM

Query:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC
        VH+EEPYA VILKH RETHCHYC NELPADK+PCPSC+IPLYCSQ CQIQAGGQMLQ  PDN++ILK+LSDDL+KYVQEIT   FAD+R +DVPEHKHEC
Subjt:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC

Query:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
        DGVHWPAILPSEIVLAGRIV K+V Q G FADAS++VDMLNLSHHFS+M  DSKLECIIYSIIL SCL+QFFPSQLP+N N+ISQIVILISQIRTNSISI
Subjt:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI

Query:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
        VR+KSFDAPGS DQ G LSS  PFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTT FV+VGCPLELSYGPQVGQLDCKDRLKLLEDEYS
Subjt:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS

Query:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD
        FKCQCSGCS+V+I DLVLNAFCCIN  C GVVLDRS+F+CENKK +D
Subjt:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD

XP_038895099.1 SET and MYND domain-containing protein 4 isoform X5 [Benincasa hispida]7.3e-21882.81Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
        AWYRRGKAN ++ NFDDA+ DF I+++VE+SFNGKKQI+DELKVIQ  H RSN V EHSK+KLD+   + DEPIQVKLHVTTS+KGRGMVSPTE+PPSS+
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM

Query:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC
        VH+EEPYA VILKH RETHCHYC NELPADK+PCPSC+IPLYCSQHCQIQAGG+MLQ   DNQDI KNLSD L+KYVQEITS  F+D+R EDVPEHKHEC
Subjt:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC

Query:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
        DGVHWPAILPSEIVLAGRIV K+V Q+  FADAS++VDMLNLSHHFS+M TDSKLECIIYSIIL SCLQQFFP QL INGN+ISQI ILISQIRTNSISI
Subjt:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI

Query:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
        VR+KSFDAPGSPDQRG LSS VPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIR T F SVGCPLELSYGPQVGQLDCK RLKLLEDEYS
Subjt:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS

Query:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF
        F+CQCSGCSLV+ISDLVLNAFCCIN +C GVVLDRS+F+CEN K +DF
Subjt:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF

TrEMBL top hitse value%identityAlignment
A0A6J1CF99 SET and MYND domain-containing protein 42.0e-25898.44Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
        AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLD      DEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM

Query:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC
        VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIR EDVPEHKHEC
Subjt:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC

Query:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
        DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
Subjt:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI

Query:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
        VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
Subjt:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS

Query:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF
        FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF
Subjt:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF

A0A6J1EGM3 SET and MYND domain-containing protein 4 isoform X21.9e-21983Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
        AWYRRGKANA++GNF DA+ DF I++NVE+SFNGKKQ++DELK+IQ QHKRSN V EHS NKLD+     DEPIQVKLHVTTSNKGRGMVSP EIPPSS+
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM

Query:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC
        VH+EEPYA VILKH RETHCHYC NELPADK+PCPSC+IPLYCSQ CQIQAGGQMLQ  PDN++ILK+LSDDL+KYVQEIT   FAD+R +DVPEHKHEC
Subjt:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC

Query:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
        DGVHWPAILPSEIVLAGRIV K+V Q G FADAS++VDMLNLSHHFS+M  DSKLECIIYSIIL SCL+QFFPSQLP+N N+ISQIVILISQIRTNSISI
Subjt:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI

Query:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
        VR+KSFDAPGS DQ G LSS  PFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTT FV+VGCPLELSYGPQVGQLDCKDRLKLLEDEYS
Subjt:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS

Query:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD
        FKCQCSGCS+V+I DLVLNAFCCIN  C GVVLDRS+F+CENKK +D
Subjt:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD

A0A6J1EHH4 SET and MYND domain-containing protein 4 isoform X11.9e-21983Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM
        AWYRRGKANA++GNF DA+ DF I++NVE+SFNGKKQ++DELK+IQ QHKRSN V EHS NKLD+     DEPIQVKLHVTTSNKGRGMVSP EIPPSS+
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSM

Query:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC
        VH+EEPYA VILKH RETHCHYC NELPADK+PCPSC+IPLYCSQ CQIQAGGQMLQ  PDN++ILK+LSDDL+KYVQEIT   FAD+R +DVPEHKHEC
Subjt:  VHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHEC

Query:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI
        DGVHWPAILPSEIVLAGRIV K+V Q G FADAS++VDMLNLSHHFS+M  DSKLECIIYSIIL SCL+QFFPSQLP+N N+ISQIVILISQIRTNSISI
Subjt:  DGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISI

Query:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS
        VR+KSFDAPGS DQ G LSS  PFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTT FV+VGCPLELSYGPQVGQLDCKDRLKLLEDEYS
Subjt:  VRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYS

Query:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD
        FKCQCSGCS+V+I DLVLNAFCCIN  C GVVLDRS+F+CENKK +D
Subjt:  FKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD

A0A6J1KIH9 SET and MYND domain-containing protein 4 isoform X23.0e-21782.59Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHS-KNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSS
        AWYRRGKANA++GNF DA+RDF I++NVE+SFNGKKQ++DELK+IQ Q+KRSN V EHS  NKLD+     DEPIQVKLHVTTSNKGRGMVSP EIPPSS
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHS-KNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSS

Query:  MVHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHE
        +VH+EEPYA VILKH RETHCHYC NELPADK+PCPSC+IPLYCSQ CQIQAGG+MLQ  PDN++ILK+LSDDL+KYVQEITS  FAD+R +DVPEHKHE
Subjt:  MVHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHE

Query:  CDGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSIS
        CDGVHWPAILPSEIVLAGRI+ K+V Q G FADAS++VDMLNLSHHFS+M  DSKLECIIYSIIL SCL+QFFPSQLP+N N+ISQIVILISQIRTNSIS
Subjt:  CDGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSIS

Query:  IVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEY
        IVR+KSFDAPGS DQ G LSS  PFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTT  V+VGCPLELSYGPQVGQLDCKDRLKLLEDEY
Subjt:  IVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEY

Query:  SFKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD
        SFKCQCSGCSLV+ISDLVL+AFCCIN  C GVVLDRS+F+CENKK +D
Subjt:  SFKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD

A0A6J1KMM0 SET and MYND domain-containing protein 4 isoform X43.0e-21782.59Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHS-KNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSS
        AWYRRGKANA++GNF DA+RDF I++NVE+SFNGKKQ++DELK+IQ Q+KRSN V EHS  NKLD+     DEPIQVKLHVTTSNKGRGMVSP EIPPSS
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHS-KNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSS

Query:  MVHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHE
        +VH+EEPYA VILKH RETHCHYC NELPADK+PCPSC+IPLYCSQ CQIQAGG+MLQ  PDN++ILK+LSDDL+KYVQEITS  FAD+R +DVPEHKHE
Subjt:  MVHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHE

Query:  CDGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSIS
        CDGVHWPAILPSEIVLAGRI+ K+V Q G FADAS++VDMLNLSHHFS+M  DSKLECIIYSIIL SCL+QFFPSQLP+N N+ISQIVILISQIRTNSIS
Subjt:  CDGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSIS

Query:  IVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEY
        IVR+KSFDAPGS DQ G LSS  PFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTT  V+VGCPLELSYGPQVGQLDCKDRLKLLEDEY
Subjt:  IVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEY

Query:  SFKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD
        SFKCQCSGCSLV+ISDLVL+AFCCIN  C GVVLDRS+F+CENKK +D
Subjt:  SFKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQD

SwissProt top hitse value%identityAlignment
Q54Q80 SET and MYND domain-containing protein DDB_G02840592.2e-0736.25Show/hide
Query:  QVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYSFKCQCSGCS
        Q ++G A+Y   SL NHSC  N H  ++  +L I++   +  G  +   YGP       KDRL  L +E+ F C+C  CS
Subjt:  QVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYSFKCQCSGCS

Q5F3V0 SET and MYND domain-containing protein 46.8e-0922.19Show/hide
Query:  TSNKGRGMVSPTEIPPSSMVHIEEPYASV---------ILKHYRET-----------HCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPD
        ++ +GR +V+  +I P   +  E+ + SV         +L+   ET           +CH+C  +L A  IPC  C+   YCSQ+C   A  Q  +    
Subjt:  TSNKGRGMVSPTEIPPSSMVHIEEPYASV---------ILKHYRET-----------HCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPD

Query:  NQDILKNLSDDLQKYVQEITSHGFADIR--IEDVPEHKH------ECDGVHWPAILPSEIV--LAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPT
           +L  L       ++ +   GF+++   +E   +  +      E  G H     PSE +   AGR V+      G +   SS   + NL  H  +   
Subjt:  NQDILKNLSDDLQKYVQEITSHGFADIR--IEDVPEHKH------ECDGVHWPAILPSEIV--LAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPT

Query:  DSKLECIIYSIILLSCLQQFFPSQLPINGNSIS-----------------QIVIL-------ISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCN
        + K  C++  + +   LQ+       +NG S +                 +++I+       + Q++ N+ +I  ++             L S      N
Subjt:  DSKLECIIYSIILLSCLQQFFPSQLPINGNSIS-----------------QIVIL-------ISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCN

Query:  MEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQV
         + VR+  A +   SL NHSC PNI   F+     +R ++ +  G  +   YG ++
Subjt:  MEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQV

Q8BTK5 SET and MYND domain-containing protein 42.1e-1020.82Show/hide
Query:  KGRGMVSPTEIPPSSMVHIEEPYASVIL------KHY------------RETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQA-----------GGQM
        KGR +V+  +I P  ++  E+ + SV++       H+             + +CH C     A  +PC SC+   YCSQ C  QA           GG +
Subjt:  KGRGMVSPTEIPPSSMVHIEEPYASVIL------KHY------------RETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQA-----------GGQM

Query:  LQAPPDNQDILKNL----SDDLQKYVQEITSH-GFADIRIEDVPEHKHECDGVHWPAILPSE--------IVLAGRIVVKYVAQKGGFADASSVVDMLNL
        L         L+       +D+ + V+ +    G  D  +   PE K+      + +   SE         +    +  KY +            +  + 
Subjt:  LQAPPDNQDILKNL----SDDLQKYVQEITSH-GFADIRIEDVPEHKHECDGVHWPAILPSE--------IVLAGRIVVKYVAQKGGFADASSVVDMLNL

Query:  SHHF------SQMPTDSKLECIIYSIILLSCLQQFFP---SQLPINGNSISQIVILISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRV
         H F      S +    K + +    +    L+   P   + L + G ++ + ++   Q++ N+ +I  I                S      N  Q+R+
Subjt:  SHHF------SQMPTDSKLECIIYSIILLSCLQQFFP---SQLPINGNSISQIVILISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRV

Query:  GQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYSFKCQCSGCSLVNISDLVL---NAFCCINADCLG
           I+   SL NHSC+PN    F      +R  + ++ G  +   YGP   ++   +R + L  +Y F C+C  C    +         AFCC    C  
Subjt:  GQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYSFKCQCSGCSLVNISDLVL---NAFCCINADCLG

Query:  VVLDRSVFDCENK
        ++    V  C N+
Subjt:  VVLDRSVFDCENK

Q9CWR2 Histone-lysine N-methyltransferase SMYD36.1e-1024.2Show/hide
Query:  PEHKHECDGVH--WPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFF--PSQLPINGNSISQIVIL
        P+H+ EC  +    P   P  + L GR++VK + +K   +++  +    +L  + S++  D K      ++     +++     SQLP +      +   
Subjt:  PEHKHECDGVH--WPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFF--PSQLPINGNSISQIVIL

Query:  ISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCK
         +++  NS +I                         CN E   VG  +Y + SL NHSC PN    FN   L +R    +  G  L + Y   +  +  +
Subjt:  ISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCK

Query:  DRLKLLEDEYSFKCQCSGC
        +R K L D+Y F+C C  C
Subjt:  DRLKLLEDEYSFKCQCSGC

Q9H7B4 Histone-lysine N-methyltransferase SMYD31.4e-0925.11Show/hide
Query:  PEHKHECDGVH--WPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSK--LECIIYSIILLSCLQQFFPSQLPINGNSISQIVIL
        P+HK EC  +    P   P  + L GR+V K +   G  +++  +    +L  + +++  D K  L  ++ +       +    SQLP        +   
Subjt:  PEHKHECDGVH--WPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSK--LECIIYSIILLSCLQQFFPSQLPINGNSISQIVIL

Query:  ISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCK
         +++  NS +I                         CN E   VG  +Y + SL NHSC PN    FN   L +R    + VG  L + Y   +  +  +
Subjt:  ISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCK

Query:  DRLKLLEDEYSFKCQCSGC
        +R K L D+Y F+C C  C
Subjt:  DRLKLLEDEYSFKCQCSGC

Arabidopsis top hitse value%identityAlignment
AT1G33400.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-12548.23Show/hide
Query:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEP---IQVKLH-VTTSNKGRGMVSPTEIP
        AWYRRGK N  +GN+ DA RD  ++ ++E S  GKKQ+++ELK I D   ++N  +EH + +  N   V   P   ++VKL  V+T  KGRGMVS  +I 
Subjt:  AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEP---IQVKLH-VTTSNKGRGMVSPTEIP

Query:  PSSMVHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEH
         +S++H+EEP++ VI K  RETHCH+C NELPAD +PCPSC+IP+YCS+ CQIQ+GG +     D   I + L DD+ ++++ +TS        + + EH
Subjt:  PSSMVHIEEPYASVILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEH

Query:  KHECDGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTN
        +HEC G +WPA+LPS+ VLAGRI++K + Q     D S++ ++L LSH +S+M  ++KLE  + SI+L+ CL +     L +   S++Q +IL+SQI+ N
Subjt:  KHECDGVHWPAILPSEIVLAGRIVVKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTN

Query:  SISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLE
        SI++ R+KS          G +S+  P   ++EQ+RVGQA+Y TGSLFNHSCKPNIH YF SR L ++TTEFV  GCPLELSYGP+VG+ DCK+R++ LE
Subjt:  SISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQVRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLE

Query:  DEYSFKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF
        +EY F C+C GC+ +NISDLV+N + C+N +C GVVLD +V  CE++K+  F
Subjt:  DEYSFKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDCENKKIQDF

AT2G17900.1 SET domain group 374.5e-0826.87Show/hide
Query:  SPDQRGVLSSTVPFTCNMEQV------RVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYSFKCQ
        S D R +  +   F+CN   +        G  ++   S+ NHSC PN    F  +   +R  + +S    + +SY    G      R K L+++Y F CQ
Subjt:  SPDQRGVLSSTVPFTCNMEQV------RVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYSFKCQ

Query:  CSGCSLVN-----ISDLVLNAFCCINADCLGVVL
        C+ CS            +L  + C N  C G +L
Subjt:  CSGCSLVN-----ISDLVLNAFCCINADCLGVVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCATGGTACAGAAGAGGTAAAGCAAATGCTACTGTGGGAAATTTTGATGATGCTGTCCGTGACTTTCATATTGCTGAGAATGTGGAGATGTCATTCAATGGAAAGAAGCA
GATAGAAGATGAGCTGAAGGTCATTCAAGATCAGCACAAGAGGTCAAATATAGTAATTGAACATAGCAAGAACAAATTGGATAACTTGTTATGTGTAACAGATGAGCCAA
TCCAAGTAAAATTACATGTCACCACATCGAATAAGGGGAGAGGGATGGTTTCACCCACTGAGATACCTCCATCTTCCATGGTCCATATTGAAGAACCTTATGCCTCGGTA
ATATTGAAGCATTATAGAGAAACTCATTGCCATTACTGCTTTAATGAACTACCAGCAGATAAAATACCCTGTCCATCATGCACAATTCCTCTGTACTGCTCACAACATTG
CCAAATACAAGCAGGGGGGCAAATGTTACAAGCTCCTCCAGATAATCAAGATATTTTAAAAAATCTATCTGATGACCTCCAGAAGTATGTTCAAGAAATAACTTCACACG
GTTTTGCTGACATTAGGATTGAAGATGTTCCTGAACATAAACATGAATGTGATGGTGTGCACTGGCCTGCTATATTGCCATCTGAGATTGTTTTGGCTGGGCGAATAGTG
GTCAAATATGTAGCACAAAAAGGTGGCTTTGCAGATGCTTCTAGCGTTGTGGATATGTTGAATCTTTCACACCATTTTTCACAAATGCCTACTGATAGCAAGCTGGAGTG
TATCATATATTCCATAATATTGTTAAGTTGTCTTCAGCAATTTTTCCCTTCTCAACTTCCAATTAATGGGAACTCTATCTCGCAGATTGTCATACTTATATCCCAAATTA
GGACAAACTCTATATCTATTGTCCGTATTAAATCCTTTGATGCACCCGGATCACCAGATCAGCGTGGAGTATTATCTAGCACGGTTCCTTTTACTTGTAATATGGAACAA
GTCAGAGTAGGTCAAGCTATTTATACCACTGGAAGCTTGTTTAACCATTCCTGCAAACCAAACATCCATGCATATTTCAATTCACGTACTCTCTTTATACGGACAACCGA
GTTCGTGTCAGTCGGGTGTCCCCTAGAGTTGTCGTATGGTCCCCAGGTTGGTCAATTGGACTGTAAAGACCGGCTTAAGTTGCTAGAGGATGAGTACTCTTTTAAATGCC
AGTGTAGTGGTTGCTCATTGGTGAACATATCTGACCTTGTCCTTAATGCGTTTTGTTGCATTAATGCTGATTGCCTCGGAGTAGTCCTGGATAGATCCGTCTTCGATTGT
GAAAATAAGAAAATCCAGGACTTC
mRNA sequenceShow/hide mRNA sequence
GCATGGTACAGAAGAGGTAAAGCAAATGCTACTGTGGGAAATTTTGATGATGCTGTCCGTGACTTTCATATTGCTGAGAATGTGGAGATGTCATTCAATGGAAAGAAGCA
GATAGAAGATGAGCTGAAGGTCATTCAAGATCAGCACAAGAGGTCAAATATAGTAATTGAACATAGCAAGAACAAATTGGATAACTTGTTATGTGTAACAGATGAGCCAA
TCCAAGTAAAATTACATGTCACCACATCGAATAAGGGGAGAGGGATGGTTTCACCCACTGAGATACCTCCATCTTCCATGGTCCATATTGAAGAACCTTATGCCTCGGTA
ATATTGAAGCATTATAGAGAAACTCATTGCCATTACTGCTTTAATGAACTACCAGCAGATAAAATACCCTGTCCATCATGCACAATTCCTCTGTACTGCTCACAACATTG
CCAAATACAAGCAGGGGGGCAAATGTTACAAGCTCCTCCAGATAATCAAGATATTTTAAAAAATCTATCTGATGACCTCCAGAAGTATGTTCAAGAAATAACTTCACACG
GTTTTGCTGACATTAGGATTGAAGATGTTCCTGAACATAAACATGAATGTGATGGTGTGCACTGGCCTGCTATATTGCCATCTGAGATTGTTTTGGCTGGGCGAATAGTG
GTCAAATATGTAGCACAAAAAGGTGGCTTTGCAGATGCTTCTAGCGTTGTGGATATGTTGAATCTTTCACACCATTTTTCACAAATGCCTACTGATAGCAAGCTGGAGTG
TATCATATATTCCATAATATTGTTAAGTTGTCTTCAGCAATTTTTCCCTTCTCAACTTCCAATTAATGGGAACTCTATCTCGCAGATTGTCATACTTATATCCCAAATTA
GGACAAACTCTATATCTATTGTCCGTATTAAATCCTTTGATGCACCCGGATCACCAGATCAGCGTGGAGTATTATCTAGCACGGTTCCTTTTACTTGTAATATGGAACAA
GTCAGAGTAGGTCAAGCTATTTATACCACTGGAAGCTTGTTTAACCATTCCTGCAAACCAAACATCCATGCATATTTCAATTCACGTACTCTCTTTATACGGACAACCGA
GTTCGTGTCAGTCGGGTGTCCCCTAGAGTTGTCGTATGGTCCCCAGGTTGGTCAATTGGACTGTAAAGACCGGCTTAAGTTGCTAGAGGATGAGTACTCTTTTAAATGCC
AGTGTAGTGGTTGCTCATTGGTGAACATATCTGACCTTGTCCTTAATGCGTTTTGTTGCATTAATGCTGATTGCCTCGGAGTAGTCCTGGATAGATCCGTCTTCGATTGT
GAAAATAAGAAAATCCAGGACTTC
Protein sequenceShow/hide protein sequence
AWYRRGKANATVGNFDDAVRDFHIAENVEMSFNGKKQIEDELKVIQDQHKRSNIVIEHSKNKLDNLLCVTDEPIQVKLHVTTSNKGRGMVSPTEIPPSSMVHIEEPYASV
ILKHYRETHCHYCFNELPADKIPCPSCTIPLYCSQHCQIQAGGQMLQAPPDNQDILKNLSDDLQKYVQEITSHGFADIRIEDVPEHKHECDGVHWPAILPSEIVLAGRIV
VKYVAQKGGFADASSVVDMLNLSHHFSQMPTDSKLECIIYSIILLSCLQQFFPSQLPINGNSISQIVILISQIRTNSISIVRIKSFDAPGSPDQRGVLSSTVPFTCNMEQ
VRVGQAIYTTGSLFNHSCKPNIHAYFNSRTLFIRTTEFVSVGCPLELSYGPQVGQLDCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINADCLGVVLDRSVFDC
ENKKIQDF