CuGenDBv2

Tan0016382 (gene) of Snake gourd v1 genome

Gene ID: Tan0016382
Organism: Trichosanthes anguina (Snake gourd v1)
Description: SET and MYND domain-containing protein 4
Genome location: LG05:77349048..77358095
RNA-Seq Expression: Tan0016382
Synteny: Tan0016382
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR001214 - SET domain
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR019734 - Tetratricopeptide repeat


Homology
GenBank top hits (e value, % identity, alignment)
KAG6583908.1 SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 85.57)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVPENLKQ VGSST DDLP SCS LLRLFQQSQLFFQVIGDLAM+PEN LCGKKKDAALELKRQGNQCF+KGDY  AL+YYSQALQVAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKAN ++GNF DAIRDFQISKNVE S NGKKQ++DELK IQ Q KRS TV+EH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  ---------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRD
                 +EP QVKLHVTTS+KGRGMVSP EIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQ CQIQAGGQMLQN  D
Subjt:  ---------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRD

Query:  NEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYS
        N++ILK+LS +LRKYVQEITLP F++LRT+DVPEHKHECDGVHWPAIL SEIVLAGRIVAK V Q   F DASNLVDMLNLSHHF +MH DSKLECIIYS
Subjt:  NEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYS

Query:  IILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTL
        IILSSCLQQFFPSQLP+N NT+SQIV+LISQIRTNSISIVRMKSFDAPGS  Q GRLSS  PFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSRTL
Subjt:  IILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTL

Query:  FIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFM
        FIRTT FVTVGCPLELSYGPQVGQLDCK+RLKLLEDEYSFKCQCSGCSLV+I DLVLNAFCCINP+C GVVLDRSIFNCENKKTKD LTV++QSRLEPFM
Subjt:  FIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFM

Query:  QSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKS
         +DSFLH G SHCLKCGSYR+IKSS STVDEA +HFTRLQQE+  N VSETT SDAL+AL SLKSTLHAYN+RIAEAEDNLSQAFCLLGKLE AA HCK+
Subjt:  QSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKS

Query:  SILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        SI ILEKLYGENHIAIGNELLKLSSILLSVGD N V+CIKRLSEIFRC+YG HA  MFPF N LEEETHK  STD+
Subjt:  SILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

KAG7019525.1 SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; % identity: 85.68)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVPENLKQ VGSST DDLP SCS LLRLFQQSQLFFQVIGDLAM+PEN LCGKKKDAALELKRQGNQCF+KGDY  AL+YYSQALQVAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKAN ++GNF DAIRDFQISKNVE SFNGKKQ++DELK IQ Q KRS TV EH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN
                +EP QVKLHVTTS+KGRGMVSP EIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQ CQIQAGGQMLQN  DN
Subjt:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN

Query:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI
        ++ILK+LS +LRKYVQEITLP F++LRT+DVPEHKHECDGVHWPAIL SEIVLAGRIVAK V Q G F DASNLVDMLNLSHHF +MH DSKLECIIYSI
Subjt:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI

Query:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF
        ILSSCL+QFFPSQLP+N NT+SQIV+LISQIRTNSISIVRMKSFDAPGS  Q GRLSS  PFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSRTLF
Subjt:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF

Query:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ
        IRTT FVTVGCPLELSYGPQVGQLDCK+RLKLLEDEYSFKCQCSGCS+V+I DLVLNAFCCINP+C GVVLDRSIFNCENKKTKD LTV++QSRLEPFM 
Subjt:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ

Query:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS
        +DSFLH G SHCLKCGSYR+IKSS STVDEA +HFTRLQQE+  N VSETT SDAL+AL SLKSTLHAYN+RIAEAEDNLSQAFCLLGKLE AA HCK+S
Subjt:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS

Query:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        I ILEKLYGENHIAIGNELLKLSSILLSVGD N V+CIKRLSEIFRC+YG HA  MFPF N LEEETHK  STD+
Subjt:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

XP_022927244.1 SET and MYND domain-containing protein 4 isoform X1 [Cucurbita moschata] (e value: 0.0e+00; % identity: 85.68)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVPENLKQ VGSST DDLP SCS LLRLFQQSQLFFQVIGDLAM+PEN LCGKKKDAALELKRQGNQCF+KGDY  AL+YYSQALQVAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKAN ++GNF DAI DFQISKNVE SFNGKKQ++DELK IQ Q KRS TV EH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN
                +EP QVKLHVTTS+KGRGMVSP EIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQ CQIQAGGQMLQN  DN
Subjt:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN

Query:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI
        ++ILK+LS +LRKYVQEITLP F++LRT+DVPEHKHECDGVHWPAIL SEIVLAGRIVAK V Q G F DASNLVDMLNLSHHF +MH DSKLECIIYSI
Subjt:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI

Query:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF
        ILSSCL+QFFPSQLP+N NT+SQIV+LISQIRTNSISIVRMKSFDAPGS  Q GRLSS  PFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSRTLF
Subjt:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF

Query:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ
        IRTT FVTVGCPLELSYGPQVGQLDCK+RLKLLEDEYSFKCQCSGCS+V+I DLVLNAFCCINP+C GVVLDRSIFNCENKKTKD LTV++QSRLEPFM 
Subjt:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ

Query:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS
        +DSFLH G SHCLKCGSYR+IKSSRSTVDEA +HFTRLQQE+  N VSETT SDAL+AL SLKSTLHAYN+RIAEAEDNLSQAFCLLGKLE AA HCK+S
Subjt:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS

Query:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        I ILEKLYGENHIAIGNELLKLSSILLSVGD N V+CIKRLSEIFRC+YG HA  MFPF N LEEETHK  STD+
Subjt:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

XP_038895094.1 SET and MYND domain-containing protein 4 isoform X1 [Benincasa hispida] (e value: 0.0e+00; % identity: 85.73)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVPENLKQMVGS+TADDL  SCS LLRLFQQSQLFFQVI D+A++PEN LCGKK DAALELKRQGNQCF+KGDY NAL+YYSQALQVAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVLHKMD+QLEC+RDCNRALQISSTYAKAWYRRGKAN ++ NFDDAI DFQISK+VE SFNGKKQI+DELK IQH   RS  VNEH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  -----------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNS
                   +EP QVKLHVTTS KGRGMVSPTE+PPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGG+MLQN 
Subjt:  -----------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNS

Query:  RDNEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECII
         DN+DI KNLS  LRKYVQEIT   FS+LRTEDVPEHKHECDGVHWPAIL SEIVLAGRIVAK V QR  F DASNLVDMLNLSHHF +MHTDSKLECII
Subjt:  RDNEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECII

Query:  YSIILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSR
        YSIILSSCLQQFFP QL INGNT+SQI +LISQIRTNSISIVRMKSFDAPGS  Q GRLSS VPFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSR
Subjt:  YSIILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSR

Query:  TLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEP
        TLFIR T F +VGCPLELSYGPQVGQLDCK RLKLLEDEYSF+CQCSGCSLV+ISDLVLNAFCCINPNCHGVVLDRSIFNCEN KTKDFLTV+ QS+LEP
Subjt:  TLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEP

Query:  FMQSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHC
         MQ+DSFLH G SHCLKCGSYRDIKSS STVDEAG+HFTRLQ EI  NRVSETT SDAL+AL+SLKSTLH YNRRIAEAEDNLSQAFCLLGKLELAA HC
Subjt:  FMQSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHC

Query:  KSSILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        K+SI ILEKLYGENHIAIGNELLKLSSIL+SVGDHNAVDCIKRLS+IFRCYYGSH   MFPF N L+EET K  STDL
Subjt:  KSSILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

XP_038895095.1 uncharacterized protein LOC120083413 isoform X2 [Benincasa hispida] (e value: 0.0e+00; % identity: 86.06)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVPENLKQMVGS+TADDL  SCS LLRLFQQSQLFFQVI D+A++PEN LCGKK DAALELKRQGNQCF+KGDY NAL+YYSQALQVAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVLHKMD+QLEC+RDCNRALQISSTYAKAWYRRGKAN ++ NFDDAI DFQISK+VE SFNGKKQI+DELK IQH   RS  VNEH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN
                +EP QVKLHVTTS KGRGMVSPTE+PPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGG+MLQN  DN
Subjt:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN

Query:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI
        +DI KNLS  LRKYVQEIT   FS+LRTEDVPEHKHECDGVHWPAIL SEIVLAGRIVAK V QR  F DASNLVDMLNLSHHF +MHTDSKLECIIYSI
Subjt:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI

Query:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF
        ILSSCLQQFFP QL INGNT+SQI +LISQIRTNSISIVRMKSFDAPGS  Q GRLSS VPFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSRTLF
Subjt:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF

Query:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ
        IR T F +VGCPLELSYGPQVGQLDCK RLKLLEDEYSF+CQCSGCSLV+ISDLVLNAFCCINPNCHGVVLDRSIFNCEN KTKDFLTV+ QS+LEP MQ
Subjt:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ

Query:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS
        +DSFLH G SHCLKCGSYRDIKSS STVDEAG+HFTRLQ EI  NRVSETT SDAL+AL+SLKSTLH YNRRIAEAEDNLSQAFCLLGKLELAA HCK+S
Subjt:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS

Query:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        I ILEKLYGENHIAIGNELLKLSSIL+SVGDHNAVDCIKRLS+IFRCYYGSH   MFPF N L+EET K  STDL
Subjt:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LUY2 Uncharacterized protein (e value: 0.0e+00; % identity: 84.13)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVPENLKQMVGS+TADDLP S S LLRLFQQSQLFFQ+IGDLAM+PEN LCGKKKDAALELKRQGNQCF+ GDY NAL+YYS+ALQVAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVLHKMDLQLEC+RDCNRALQISSTYAKAWYRRGKAN ++  FDDAIRDF+ISK+VE SFNGKK I+DELK +QHQ  RS T NEH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN
                ++P QVKLHVTTS KGRGMVSPTEIPPSSLVHVEEPYA+VILKHCRETHCHYCLNELP DKVPCPSCSIPLYCSQHCQIQAGG+MLQN  D 
Subjt:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN

Query:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI
        +DI KNLS +LRKYVQEITL  FS LRTEDVPEHKHECDGVHWPAIL SEIVLAGRIVAK +AQRG FTDASN+VDMLNLSHHFP+MH DSKLECIIYSI
Subjt:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI

Query:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF
        +L SCLQQFFPS++ INGNT SQI +LISQIRTNSISIVRMKSFDAPGS  +   LSS VPFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSRTLF
Subjt:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF

Query:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ
        IR T F+ VGCPLELSYGPQVGQLDCK+RL+LL+DEYSF CQCSGCS V+ISDLV+NAFCCINPNC GVVLDRSIF+CEN KTKDFLTVN Q  LEPFMQ
Subjt:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ

Query:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS
        +DSFLH G SHCLKCGSY DIKSSR TVD+AG+HFTRLQQEI  NRVSETT SDAL AL+SLKSTLH YNRRIAEAEDNLSQAF LLGKLELAA HCK+S
Subjt:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS

Query:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        I ILEKLYGENHIAIGNEL KLSSIL+SVGDHNAVDCIKRLS+IFRCYYGS+   MFPF N LEEETHK  ST L
Subjt:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

A0A5A7UI72 SET and MYND domain-containing protein 4 isoform X1 (e value: 0.0e+00; % identity: 83.74)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVPENLKQMVGS+TADDLP S S LLRLFQQSQLFFQVIGDL M+PEN LCGKKKDAALELKRQGNQCF+ GDY NAL+YYS+AL VAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVLHKMDLQLE +RDCNRALQISS YAKAWYRRGKAN ++  FDDAIRDFQISK+VE SFNGKKQI+DELK IQHQ  RS TVNEH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN
                ++  QVKLHVTTS+KGRGMVSPTEIPPSSL+HVEEPYA+VILKHCRETHCHYCLNELP DKVPCPSCSIPLYCSQHCQIQAGG ML+N  D 
Subjt:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN

Query:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI
        +DI KNLS +LR Y+QEITL  FS LRTE+V EHKHECDGVHWPAIL SEIVLAGRIVAK +AQRG F DASNLVDMLNLSHHFP+MHTDSKLECIIYSI
Subjt:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI

Query:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF
        IL SCLQQFFPSQ+ INGNT SQI +LISQIRTNSISIVRMKSFDAPGS  +  RLSS +PFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSRTLF
Subjt:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF

Query:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ
        IR T F  VGCPLELSYGPQVGQLDCK+RLKLL+DEYSF CQCSGCS+V+ISDLV+NAFCCINPNC GVVLDRSIFNCEN KTKDFLTV+ Q  LEP MQ
Subjt:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ

Query:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS
        +DSFLH G SHCLKCGSY DIKSSR TVD+AG+HFTRLQQEI  NRVSETT SDAL AL+SLKSTLH YNRRIAEAEDNLSQAFCLLGKLELAA HCK+S
Subjt:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS

Query:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        I ILEKLYG NHIAIGNELLKLSSIL+SVGDHNA DCIKR S+IFRCYYGS+A  MFPF N LEEETHK  ST L
Subjt:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

A0A6J1EHH4 SET and MYND domain-containing protein 4 isoform X1 (e value: 0.0e+00; % identity: 85.68)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVPENLKQ VGSST DDLP SCS LLRLFQQSQLFFQVIGDLAM+PEN LCGKKKDAALELKRQGNQCF+KGDY  AL+YYSQALQVAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKAN ++GNF DAI DFQISKNVE SFNGKKQ++DELK IQ Q KRS TV EH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN
                +EP QVKLHVTTS+KGRGMVSP EIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQ CQIQAGGQMLQN  DN
Subjt:  --------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDN

Query:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI
        ++ILK+LS +LRKYVQEITLP F++LRT+DVPEHKHECDGVHWPAIL SEIVLAGRIVAK V Q G F DASNLVDMLNLSHHF +MH DSKLECIIYSI
Subjt:  EDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSI

Query:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF
        ILSSCL+QFFPSQLP+N NT+SQIV+LISQIRTNSISIVRMKSFDAPGS  Q GRLSS  PFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSRTLF
Subjt:  ILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLF

Query:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ
        IRTT FVTVGCPLELSYGPQVGQLDCK+RLKLLEDEYSFKCQCSGCS+V+I DLVLNAFCCINP+C GVVLDRSIFNCENKKTKD LTV++QSRLEPFM 
Subjt:  IRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQ

Query:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS
        +DSFLH G SHCLKCGSYR+IKSSRSTVDEA +HFTRLQQE+  N VSETT SDAL+AL SLKSTLHAYN+RIAEAEDNLSQAFCLLGKLE AA HCK+S
Subjt:  SDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSS

Query:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        I ILEKLYGENHIAIGNELLKLSSILLSVGD N V+CIKRLSEIFRC+YG HA  MFPF N LEEETHK  STD+
Subjt:  ILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

A0A6J1KIH9 SET and MYND domain-containing protein 4 isoform X2 (e value: 0.0e+00; % identity: 84.92)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSLVP+NL+Q VGSST DDLP SCS LLRLFQQSQLFFQ+IGDL M+PEN LCGKKKDAALELKRQGNQCF+KGDY  AL+YYSQALQVAPMNAVD
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
        MDKNLVATLYVNRASVL KMDLQLEC+RDCNR LQISS YAKAWYRRGKAN ++GNF DAIRDFQISKNVE SFNGKKQ++DELK IQ Q KRS TV EH
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  ---------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRD
                 +EP QVKLHVTTS+KGRGMVSP EIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQ CQIQAGG+MLQN  D
Subjt:  ---------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRD

Query:  NEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYS
        N++ILK+LS +LRKYVQEIT P F++LRT+DVPEHKHECDGVHWPAIL SEIVLAGRI+AK V Q G F DASNLVDMLNLSHHF +MH DSKLECIIYS
Subjt:  NEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYS

Query:  IILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTL
        IILSSCL+QFFPSQLP+N NT+SQIV+LISQIRTNSISIVRMKSFDAPGS  Q GRLSS  PFTCNMEQVRVGQAIYTTGS+FNHSCKPNIHAYFNSRTL
Subjt:  IILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTL

Query:  FIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFM
        FIRTT  VTVGCPLELSYGPQVGQLDCK+RLKLLEDEYSFKCQCSGCSLV+ISDLVL+AFCCINP+C GVVLDRSIFNCENKKTKD LTV++QSRLEPFM
Subjt:  FIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFM

Query:  QSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKS
         +DSFLH G SHCLKCGSYR+IKSS STVDEA +HFTRLQQEI  NRVSETT SDAL+AL SLKSTLHAYN+RIAEAEDNLSQAFCLLGKLELAA HCK+
Subjt:  QSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKS

Query:  SILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        SI ILEKLYGENHIAIGNELLKLSSILLSVGD N V+CIKRLSEIFRC+YG HA  MFPF N LEEETHK  STD+
Subjt:  SILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

A0A6J1KL29 SET and MYND domain-containing protein 4 isoform X1 (e value: 0.0e+00; % identity: 83.12)
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQV------------------IGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDN
        MEKLKSLVP+NL+Q VGSST DDLP SCS LLRLFQQSQLFFQV                  IGDL M+PEN LCGKKKDAALELKRQGNQCF+KGDY  
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQV------------------IGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDN

Query:  ALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIED
        AL+YYSQALQVAPMNAVDMDKNLVATLYVNRASVL KMDLQLEC+RDCNR LQISS YAKAWYRRGKAN ++GNF DAIRDFQISKNVE SFNGKKQ++D
Subjt:  ALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIED

Query:  ELKAIQHQRKRSKTVNEH---------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYC
        ELK IQ Q KRS TV EH         +EP QVKLHVTTS+KGRGMVSP EIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYC
Subjt:  ELKAIQHQRKRSKTVNEH---------NEPFQVKLHVTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYC

Query:  SQHCQIQAGGQMLQNSRDNEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLS
        SQ CQIQAGG+MLQN  DN++ILK+LS +LRKYVQEIT P F++LRT+DVPEHKHECDGVHWPAIL SEIVLAGRI+AK V Q G F DASNLVDMLNLS
Subjt:  SQHCQIQAGGQMLQNSRDNEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLS

Query:  HHFPQMHTDSKLECIIYSIILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSM
        HHF +MH DSKLECIIYSIILSSCL+QFFPSQLP+N NT+SQIV+LISQIRTNSISIVRMKSFDAPGS  Q GRLSS  PFTCNMEQVRVGQAIYTTGS+
Subjt:  HHFPQMHTDSKLECIIYSIILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSM

Query:  FNHSCKPNIHAYFNSRTLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENK
        FNHSCKPNIHAYFNSRTLFIRTT  VTVGCPLELSYGPQVGQLDCK+RLKLLEDEYSFKCQCSGCSLV+ISDLVL+AFCCINP+C GVVLDRSIFNCENK
Subjt:  FNHSCKPNIHAYFNSRTLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENK

Query:  KTKDFLTVNKQSRLEPFMQSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLS
        KTKD LTV++QSRLEPFM +DSFLH G SHCLKCGSYR+IKSS STVDEA +HFTRLQQEI  NRVSETT SDAL+AL SLKSTLHAYN+RIAEAEDNLS
Subjt:  KTKDFLTVNKQSRLEPFMQSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLS

Query:  QAFCLLGKLELAAGHCKSSILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL
        QAFCLLGKLELAA HCK+SI ILEKLYGENHIAIGNELLKLSSILLSVGD N V+CIKRLSEIFRC+YG HA  MFPF N LEEETHK  STD+
Subjt:  QAFCLLGKLELAAGHCKSSILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL

SwissProt top hits (e value, % identity, alignment)
Q84JR9 TPR repeat-containing thioredoxin TTL4 (e value: 2.8e-10; % identity: 30.88)
Query:  KRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQIS
        + +GN+ F  G Y  A + Y   L++   N+V         LY NRA+   K+ +  + V DCN+AL+I  +Y KA  RR  + G +G ++DA+RD+++ 
Subjt:  KRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQIS

Query:  KNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNEP
          +     G  ++ + L     QR R+   N+  EP
Subjt:  KNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNEP

Q8BTK5 SET and MYND domain-containing protein 4 (e value: 6.8e-25; % identity: 20.88)
Query:  VPENLKQMVGSS-TADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLV
        +P++++  + ++ T  D+ L  SSLL+   + ++F + +        +    K  DA L  + +GN+ F + +Y +A + YS+ +  +  N  D     +
Subjt:  VPENLKQMVGSS-TADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLV

Query:  ATLYVNRASVLHKMDLQLECVRDCNRA--------LQISSTYAKA-----WYRRGKANGTIGNFDDA--------IRDFQI-SKNVE---ASFNGKKQIE
        +  Y NR++ L  +     C++D   A        LQ      K        R  +A  TI + + +        +  +QI  +NV+        K+ + 
Subjt:  ATLYVNRASVLHKMDLQLECVRDCNRA--------LQISSTYAKA-----WYRRGKANGTIGNFDDA--------IRDFQI-SKNVE---ASFNGKKQIE

Query:  DELKAIQHQRKRSKTVNEHNEPFQ-VKLHV---TTSSKGRGMVSPTEIPPSSLVHVEEPYALVIL-------KHCRET-----------HCHYCLNELPA
        + + A          + E N       L V   T   KGR +V+  +I P  L+  E+ +  V++        HC E            +CH CL    A
Subjt:  DELKAIQHQRKRSKTVNEHNEPFQ-VKLHV---TTSSKGRGMVSPTEIPPSSLVHVEEPYALVIL-------KHCRET-----------HCHYCLNELPA

Query:  DKVPCPSCSIPLYCSQHCQIQA-----------GGQMLQNSRDNEDILKNLSVELRKYVQEITLPCFSNLRTED--VPEHKHECDGVHWPAI-LSSEIVL
          VPC SCS   YCSQ C  QA           GG +L         L+   +   + V  +       + + D  +PE K+      + +   S E   
Subjt:  DKVPCPSCSIPLYCSQHCQIQA-----------GGQMLQNSRDNEDILKNLSVELRKYVQEITLPCFSNLRTED--VPEHKHECDGVHWPAI-LSSEIVL

Query:  AGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSIILSSCLQQFFPSQLPINGNTVSQIVM-LISQIRTNSISIVR-----MKSFDAP
         G          G +   SN   + +L  H  +   + +  C I    L   L+        +    +  +   L + +     +++R       +  A 
Subjt:  AGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSIILSSCLQQFFPSQLPINGNTVSQIVM-LISQIRTNSISIVR-----MKSFDAP

Query:  GSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCS
         S+   G   S +    N  Q+R+   I+   S+ NHSC+PN    F      +R  + +  G  +   YGP   ++   ER + L  +Y F C+C  C 
Subjt:  GSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCS

Query:  LVNISDLVL---NAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNK-QSRLEPFMQSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIK
           +         AFCC    C  ++    + +C N+   + ++ ++  SRL+   Q              C + + +++ +                  
Subjt:  LVNISDLVL---NAFCCINPNCHGVVLDRSIFNCENKKTKDFLTVNK-QSRLEPFMQSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIK

Query:  FNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSSILILEKLYGENHIAIGNELLKLSSILLS-VGDHNAVDCIKRLS
             E      L+   + +S L A +  + E ED L+QA   LG    +A H + S+ ++E  +G + + IG+EL KL+ +L + +    A+  I +  
Subjt:  FNRVSETTFSDALKALLSLKSTLHAYNRRIAEAEDNLSQAFCLLGKLELAAGHCKSSILILEKLYGENHIAIGNELLKLSSILLS-VGDHNAVDCIKRLS

Query:  EIFRCYYGSHAKKM
         I   + G  ++++
Subjt:  EIFRCYYGSHAKKM

Q8CGY6 Protein unc-45 homolog B (e value: 6.2e-10; % identity: 39.81)
Query:  ALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRD
        A +LK +GN+ F   DY  A   YSQAL++        DK L+ATLY NRA+   KM+   +   D +RA+ I+S   KA YRR +A   +G  D A +D
Subjt:  ALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRD

Query:  FQISKNVE
         Q    +E
Subjt:  FQISKNVE

Q91Z38 Tetratricopeptide repeat protein 1 (e value: 1.2e-10; % identity: 27.17)
Query:  MVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRA
        + GS  +DD              S+L  + + +L  N       K+++ + +LK +GN+ F +GDY  A   YSQALQ+ P      D+++   L+ NRA
Subjt:  MVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRA

Query:  SVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQ--------ISKNVEASFNGKKQIEDELKAIQHQ
        +   K D +   + DC++A+Q++ TY +A  RR +        D+A+ D++        + +  EA     KQIE+  + ++ +
Subjt:  SVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQ--------ISKNVEASFNGKKQIEDELKAIQHQ

Q9HGM9 DnaJ homolog subfamily C member 7 homolog (e value: 2.3e-12; % identity: 37.5)
Query:  KRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQIS
        K QGN  F +G+Y +A   YS+ALQ+ P N     K  VA LY+NRA+VL ++    E + D + AL I S+Y K    R KA+  +  +++A+RD Q +
Subjt:  KRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQIS

Query:  KNVEASFNGKKQIEDELKAIQHQRKRSK
          ++AS      +  EL+ +Q + K+SK
Subjt:  KNVEASFNGKKQIEDELKAIQHQRKRSK

Arabidopsis top hits | e value | % identity | Alignment
AT1G33400.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 1.6e-207 | 47.04
Query:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD
        MEKLKSL+PE+L Q V SS+ DDL  + SSLLRLF     F Q + +LA NPE G CGK ++ +L+LKR+GN CF   D+D AL  YS+AL+VAP++A+D
Subjt:  MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVD

Query:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH
         DK+L+A+L++NRA+VLH + L  E +RDC+RAL+I   YAKAWYRRGK N  +GN+ DA RD  +S ++E+S  GKKQ+++ELKAI   +      ++ 
Subjt:  MDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEH

Query:  NEP-------------FQVKLH-VTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQML
          P              +VKL  V+T  KGRGMVS  +I  +S++HVEEP+++VI K CRETHCH+CLNELPAD VPCPSCSIP+YCS+ CQIQ+GG + 
Subjt:  NEP-------------FQVKLH-VTTSSKGRGMVSPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQML

Query:  QNSRDNEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLE
         N  D   I + L  ++ ++++ +T        T+ + EH+HEC G +WPA+L S+ VLAGRI+ K + Q    TD SNL ++L LSH + +M+ ++KLE
Subjt:  QNSRDNEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHECDGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLE

Query:  CIIYSIILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYF
          + SI+L  CL +     L +   +V+Q ++L+SQI+ NSI++ RMKS          G +S+  P   ++EQ+RVGQA+Y TGS+FNHSCKPNIH YF
Subjt:  CIIYSIILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPGSLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYF

Query:  NSRTLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLT----VN
         SR L ++TTEFV  GCPLELSYGP+VG+ DCK R++ LE+EY F C+C GC+ +NISDLV+N + C+N NC GVVLD ++  CE++K   F T    V+
Subjt:  NSRTLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNAFCCINPNCHGVVLDRSIFNCENKKTKDFLT----VN

Query:  KQSRLEPFMQSD-------------SFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAE
        +Q ++   + +D               LH+    CLKCGS  DI++S + V++A  H  R+++ +   R + +  SD  +++  L++ LH YN+ IA+AE
Subjt:  KQSRLEPFMQSD-------------SFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHAYNRRIAEAE

Query:  DNLSQAFCLLGKLELAAGHCKSSILILEKLYGENHIAIGNELLKLSSILLSVGDHN-AVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHK
        D ++QA  L G+L  A  HC++SI IL++LY + H+ IGNE++KL+SI L+ GD + A D  KR S+IF  YYGSHA+ +F +   L++ET K
Subjt:  DNLSQAFCLLGKLELAAGHCKSSILILEKLYGENHIAIGNELLKLSSILLSVGDHN-AVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHK

AT2G42580.1 tetratricopetide-repeat thioredoxin-like 3 | 1.1e-09 | 26.14
Query:  NPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKA
        NPE           +  + +GN+ F  G +  A + Y   L+    N+V         LY NRA+  +K+ L  + V DCN AL+   +Y KA  RR  +
Subjt:  NPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKA

Query:  NGTIGNFDDAIRDFQ-ISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNE
         G +G ++DA++D++ + + +       + +E     + ++ + SK++  +NE
Subjt:  NGTIGNFDDAIRDFQ-ISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNE

AT2G42810.2 protein phosphatase 5.2 | 3.7e-10 | 27.33
Query:  ALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRD
        A E K Q N+ F    Y +A+  Y++A+++   NAV          + NRA    K++     ++D ++A+++ S Y+K +YRRG A   +G F DA++D
Subjt:  ALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRD

Query:  FQ----ISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNEPFQVKLHVTTSSKGRGMVSPTEIPPSSLV
        FQ    +S N   +    K+ E  +  ++ +   S  V+E     +     T  +K R    PT+   +++V
Subjt:  FQ----ISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNEPFQVKLHVTTSSKGRGMVSPTEIPPSSLV

AT3G58620.1 tetratricopetide-repeat thioredoxin-like 4 | 2.0e-11 | 30.88
Query:  KRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQIS
        + +GN+ F  G Y  A + Y   L++   N+V         LY NRA+   K+ +  + V DCN+AL+I  +Y KA  RR  + G +G ++DA+RD+++ 
Subjt:  KRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQIS

Query:  KNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNEP
          +     G  ++ + L     QR R+   N+  EP
Subjt:  KNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNEP

AT4G32070.1 Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide repeat (TPR)-containing protein | 4.4e-11 | 34.13
Query:  ALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDL--QLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAI
        ALELK +GN+ F K D++ A+L + +AL++ P + +D     VA L  + AS   +M L      + +CN AL+ S  Y+KA  RR +    +   D A 
Subjt:  ALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLYVNRASVLHKMDL--QLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAI

Query:  RDFQISKNVEASFNGKKQIEDELKAI
        RD +I  N+E       +I D +K +
Subjt:  RDFQISKNVEASFNGKKQIEDELKAI


Sequences
CDS sequence
ATGGAGAAGCTGAAGTCACTGGTGCCGGAAAACTTGAAGCAGATGGTGGGTTCAAGCACCGCCGACGATCTTCCTTTGTCGTGTTCTTCCTTATTACGCCTTTTTCAGCA
ATCGCAGCTCTTCTTCCAAGTCATTGGGGACTTGGCAATGAATCCTGAAAATGGTCTCTGTGGTAAGAAAAAGGACGCTGCTCTGGAGTTGAAGCGCCAAGGAAATCAAT
GCTTCGTAAAGGGGGATTATGATAATGCGTTGCTTTATTATTCCCAGGCACTGCAAGTTGCTCCAATGAATGCTGTTGACATGGACAAGAATTTGGTCGCTACTTTATAT
GTGAATCGAGCATCAGTTTTGCATAAAATGGATCTGCAATTGGAGTGTGTACGAGATTGCAATAGAGCACTTCAAATTTCATCAACCTATGCAAAGGCATGGTACAGAAG
AGGTAAAGCAAATGGTACTATAGGAAATTTTGATGATGCTATCCGTGACTTTCAAATTTCTAAGAATGTGGAGGCATCATTCAATGGAAAGAAGCAGATAGAAGACGAGC
TGAAGGCCATCCAACATCAGCGTAAGAGGTCAAAGACAGTAAATGAACACAATGAGCCATTCCAAGTAAAATTACATGTCACCACATCGAGTAAGGGGAGAGGAATGGTT
TCACCCACTGAGATACCTCCATCTTCCTTGGTCCATGTTGAAGAGCCTTATGCCTTGGTAATATTGAAGCATTGTAGAGAAACTCATTGCCATTACTGCTTGAATGAACT
ACCAGCTGATAAAGTACCTTGTCCATCATGCTCGATTCCTCTGTACTGCTCACAACACTGCCAGATACAAGCAGGGGGGCAAATGTTACAAAATTCTCGAGATAATGAAG
ATATTCTAAAAAATCTCTCGGTTGAACTCAGAAAGTATGTTCAAGAAATAACTTTGCCCTGTTTTTCCAACTTAAGGACTGAAGATGTTCCTGAACATAAACATGAATGT
GATGGTGTACACTGGCCTGCAATATTGTCATCTGAAATAGTTTTGGCTGGCCGAATAGTGGCTAAATGTGTAGCACAGAGAGGTGGCTTTACGGATGCTTCTAACCTTGT
GGATATGTTGAATCTTTCACACCATTTTCCACAGATGCACACTGATAGCAAGCTGGAGTGTATTATATATTCCATTATATTATCAAGTTGTCTTCAGCAATTTTTCCCCT
CTCAACTTCCAATCAATGGGAACACGGTCTCACAGATTGTCATGCTTATATCCCAGATTAGGACAAACTCTATATCTATAGTCCGTATGAAATCCTTCGATGCACCCGGA
TCACTAGGTCAGTGTGGAAGATTGTCTAGTGCGGTTCCTTTTACTTGTAATATGGAACAAGTCAGAGTAGGTCAGGCTATTTATACAACTGGAAGCATGTTTAACCACTC
ATGCAAACCAAACATCCATGCGTATTTCAATTCACGTACACTCTTTATACGGACAACTGAGTTCGTGACAGTTGGGTGTCCCCTAGAGTTATCATATGGTCCACAGGTTG
GTCAATTGGACTGTAAAGAGCGTCTTAAGTTGCTGGAGGATGAGTATTCTTTTAAATGTCAGTGTAGTGGTTGCTCATTGGTGAATATATCTGACCTTGTCCTCAACGCA
TTTTGTTGCATTAATCCAAATTGCCATGGCGTAGTGTTGGATAGATCCATCTTCAACTGTGAAAACAAGAAAACCAAGGACTTTCTTACTGTCAACAAGCAAAGTAGGCT
GGAGCCTTTCATGCAGAGTGACAGCTTCCTTCATGTTGGTCATAGCCATTGTTTGAAATGTGGGTCTTATCGTGATATAAAATCATCTCGTTCGACAGTGGATGAGGCCG
GGGTTCACTTTACAAGGTTGCAGCAGGAGATAAAATTTAATAGGGTGTCAGAAACTACATTCTCAGATGCTTTGAAAGCTTTGTTGTCACTGAAATCTACTTTGCATGCA
TATAATAGGCGCATAGCAGAAGCTGAAGACAATCTGTCGCAGGCCTTCTGTTTGCTTGGAAAACTAGAGCTTGCAGCAGGCCATTGTAAATCATCAATTCTGATTCTGGA
GAAGTTGTATGGAGAAAACCATATCGCCATTGGCAATGAACTCTTGAAGCTTTCTTCCATTCTGTTATCTGTGGGTGATCACAATGCTGTGGACTGCATTAAGCGATTGA
GTGAAATTTTCAGGTGTTATTATGGATCTCATGCCAAGAAAATGTTCCCATTTTTTAACTTCTTGGAGGAAGAAACTCACAAAATTTTCAGCACAGATCTTTGA
mRNA sequence
GGGGCTATAATTTTTTAAACCCCACCCAAAAAAAATCTCCAAAGATAAATGGAGGTAGTCATCAACAACAACAAAATGCTGGCTCCGTGACAGTGTTCAATCTTTTGGAG
CTTGTACTTGGCTAGGTACGTTTCCCAAAATTATGGAGAAGCTGAAGTCACTGGTGCCGGAAAACTTGAAGCAGATGGTGGGTTCAAGCACCGCCGACGATCTTCCTTTG
TCGTGTTCTTCCTTATTACGCCTTTTTCAGCAATCGCAGCTCTTCTTCCAAGTCATTGGGGACTTGGCAATGAATCCTGAAAATGGTCTCTGTGGTAAGAAAAAGGACGC
TGCTCTGGAGTTGAAGCGCCAAGGAAATCAATGCTTCGTAAAGGGGGATTATGATAATGCGTTGCTTTATTATTCCCAGGCACTGCAAGTTGCTCCAATGAATGCTGTTG
ACATGGACAAGAATTTGGTCGCTACTTTATATGTGAATCGAGCATCAGTTTTGCATAAAATGGATCTGCAATTGGAGTGTGTACGAGATTGCAATAGAGCACTTCAAATT
TCATCAACCTATGCAAAGGCATGGTACAGAAGAGGTAAAGCAAATGGTACTATAGGAAATTTTGATGATGCTATCCGTGACTTTCAAATTTCTAAGAATGTGGAGGCATC
ATTCAATGGAAAGAAGCAGATAGAAGACGAGCTGAAGGCCATCCAACATCAGCGTAAGAGGTCAAAGACAGTAAATGAACACAATGAGCCATTCCAAGTAAAATTACATG
TCACCACATCGAGTAAGGGGAGAGGAATGGTTTCACCCACTGAGATACCTCCATCTTCCTTGGTCCATGTTGAAGAGCCTTATGCCTTGGTAATATTGAAGCATTGTAGA
GAAACTCATTGCCATTACTGCTTGAATGAACTACCAGCTGATAAAGTACCTTGTCCATCATGCTCGATTCCTCTGTACTGCTCACAACACTGCCAGATACAAGCAGGGGG
GCAAATGTTACAAAATTCTCGAGATAATGAAGATATTCTAAAAAATCTCTCGGTTGAACTCAGAAAGTATGTTCAAGAAATAACTTTGCCCTGTTTTTCCAACTTAAGGA
CTGAAGATGTTCCTGAACATAAACATGAATGTGATGGTGTACACTGGCCTGCAATATTGTCATCTGAAATAGTTTTGGCTGGCCGAATAGTGGCTAAATGTGTAGCACAG
AGAGGTGGCTTTACGGATGCTTCTAACCTTGTGGATATGTTGAATCTTTCACACCATTTTCCACAGATGCACACTGATAGCAAGCTGGAGTGTATTATATATTCCATTAT
ATTATCAAGTTGTCTTCAGCAATTTTTCCCCTCTCAACTTCCAATCAATGGGAACACGGTCTCACAGATTGTCATGCTTATATCCCAGATTAGGACAAACTCTATATCTA
TAGTCCGTATGAAATCCTTCGATGCACCCGGATCACTAGGTCAGTGTGGAAGATTGTCTAGTGCGGTTCCTTTTACTTGTAATATGGAACAAGTCAGAGTAGGTCAGGCT
ATTTATACAACTGGAAGCATGTTTAACCACTCATGCAAACCAAACATCCATGCGTATTTCAATTCACGTACACTCTTTATACGGACAACTGAGTTCGTGACAGTTGGGTG
TCCCCTAGAGTTATCATATGGTCCACAGGTTGGTCAATTGGACTGTAAAGAGCGTCTTAAGTTGCTGGAGGATGAGTATTCTTTTAAATGTCAGTGTAGTGGTTGCTCAT
TGGTGAATATATCTGACCTTGTCCTCAACGCATTTTGTTGCATTAATCCAAATTGCCATGGCGTAGTGTTGGATAGATCCATCTTCAACTGTGAAAACAAGAAAACCAAG
GACTTTCTTACTGTCAACAAGCAAAGTAGGCTGGAGCCTTTCATGCAGAGTGACAGCTTCCTTCATGTTGGTCATAGCCATTGTTTGAAATGTGGGTCTTATCGTGATAT
AAAATCATCTCGTTCGACAGTGGATGAGGCCGGGGTTCACTTTACAAGGTTGCAGCAGGAGATAAAATTTAATAGGGTGTCAGAAACTACATTCTCAGATGCTTTGAAAG
CTTTGTTGTCACTGAAATCTACTTTGCATGCATATAATAGGCGCATAGCAGAAGCTGAAGACAATCTGTCGCAGGCCTTCTGTTTGCTTGGAAAACTAGAGCTTGCAGCA
GGCCATTGTAAATCATCAATTCTGATTCTGGAGAAGTTGTATGGAGAAAACCATATCGCCATTGGCAATGAACTCTTGAAGCTTTCTTCCATTCTGTTATCTGTGGGTGA
TCACAATGCTGTGGACTGCATTAAGCGATTGAGTGAAATTTTCAGGTGTTATTATGGATCTCATGCCAAGAAAATGTTCCCATTTTTTAACTTCTTGGAGGAAGAAACTC
ACAAAATTTTCAGCACAGATCTTTGA
Protein sequence
MEKLKSLVPENLKQMVGSSTADDLPLSCSSLLRLFQQSQLFFQVIGDLAMNPENGLCGKKKDAALELKRQGNQCFVKGDYDNALLYYSQALQVAPMNAVDMDKNLVATLY
VNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANGTIGNFDDAIRDFQISKNVEASFNGKKQIEDELKAIQHQRKRSKTVNEHNEPFQVKLHVTTSSKGRGMV
SPTEIPPSSLVHVEEPYALVILKHCRETHCHYCLNELPADKVPCPSCSIPLYCSQHCQIQAGGQMLQNSRDNEDILKNLSVELRKYVQEITLPCFSNLRTEDVPEHKHEC
DGVHWPAILSSEIVLAGRIVAKCVAQRGGFTDASNLVDMLNLSHHFPQMHTDSKLECIIYSIILSSCLQQFFPSQLPINGNTVSQIVMLISQIRTNSISIVRMKSFDAPG
SLGQCGRLSSAVPFTCNMEQVRVGQAIYTTGSMFNHSCKPNIHAYFNSRTLFIRTTEFVTVGCPLELSYGPQVGQLDCKERLKLLEDEYSFKCQCSGCSLVNISDLVLNA
FCCINPNCHGVVLDRSIFNCENKKTKDFLTVNKQSRLEPFMQSDSFLHVGHSHCLKCGSYRDIKSSRSTVDEAGVHFTRLQQEIKFNRVSETTFSDALKALLSLKSTLHA
YNRRIAEAEDNLSQAFCLLGKLELAAGHCKSSILILEKLYGENHIAIGNELLKLSSILLSVGDHNAVDCIKRLSEIFRCYYGSHAKKMFPFFNFLEEETHKIFSTDL