; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019451 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019451
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSET and MYND domain-containing protein 4
Genome locationtig00153347:720243..737648
RNA-Seq ExpressionSgr019451
SyntenySgr019451
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR019734 - Tetratricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583908.1 SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp. sororia]1.2e-28671.22Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        MEKLKSLVPENLKQ VG+ST DDLPSSCS LLRLFQQS+LFFQV+GDLAMDPE ALCGKKKDAALELKRQGNQCFLKGDYA ALV+YSQALQ+AP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL
        +DKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKANAS GNF DA+RDF I+K+VE+S NGKKQ++DEL +IQ QHK S TV   
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL

Query:  QPPELVD--------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPD
             +D           V+TS       S I+I    L H+   ++       RETHCHYCLNELPADKVPCPSC+IPLYCSQRCQIQAGGQ+ Q  PD
Subjt:  QPPELVD--------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPD

Query:  NQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML---------------------
        N++I ++LSDDLRKYVQEIT+ SF DLRT+DVPEHKHECD                  VAK V Q    ADASN+VDML                     
Subjt:  NQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML---------------------

Query:  ------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTL
                                IVILISQIRTNSISIVRMKSFDAPGS  Q+GRL+S  P TCNMEQVRVGQAIYTTGSLFNHSCKPN+H YFNSRTL
Subjt:  ------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTL

Query:  FIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFM
        FIRTT FVTVGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCSLV+I DLVLNAF CIN  C G+VLDRS+FNCENKKTKDS +VD+ +RLE FM
Subjt:  FIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFM

Query:  QSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKA
         +DSFLH G SHCLKCGSY +IKSS STVDEAW +FTRLQQEMN N +SETT+SDAL+AL SL+STLHAYN+RIAEAEDNLSQAFCLLGKLE AADHCKA
Subjt:  QSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKA

Query:  SIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        SIRILEKLY +NHIAIGNEL+KLSSIL SVGD    V+CI RLSEIF
Subjt:  SIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

KAG7019525.1 SET and MYND domain-containing protein 4, partial [Cucurbita argyrosperma subsp. argyrosperma]4.9e-28871.58Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        MEKLKSLVPENLKQ VG+ST DDLPSSCS LLRLFQQS+LFFQV+GDLAMDPE ALCGKKKDAALELKRQGNQCFLKGDYA ALV+YSQALQ+AP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL
        +DKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKANAS GNF DA+RDF I+K+VE+SFNGKKQ++DEL +IQ QHK S TV+  
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL

Query:  QPPELVD-------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPDN
           +L D          V+TS       S I+I    L H+   ++       RETHCHYCLNELPADKVPCPSC+IPLYCSQRCQIQAGGQ+ Q  PDN
Subjt:  QPPELVD-------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPDN

Query:  QDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------------------
        ++I ++LSDDLRKYVQEIT+ SF DLRT+DVPEHKHECD                  VAK V Q G  ADASN+VDML                      
Subjt:  QDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------------------

Query:  -----------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLF
                               IVILISQIRTNSISIVRMKSFDAPGS  Q GRL+S  P TCNMEQVRVGQAIYTTGSLFNHSCKPN+H YFNSRTLF
Subjt:  -----------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLF

Query:  IRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFMQ
        IRTT FVTVGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCS+V+I DLVLNAF CIN  C G+VLDRS+FNCENKKTKDS +VD+ +RLE FM 
Subjt:  IRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFMQ

Query:  SDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKAS
        +DSFLH G SHCLKCGSY +IKSS STVDEAW +FTRLQQEMN N +SETT+SDAL+AL SL+STLHAYN+RIAEAEDNLSQAFCLLGKLE AADHCKAS
Subjt:  SDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKAS

Query:  IRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        IRILEKLY +NHIAIGNEL+KLSSIL SVGD    V+CI RLSEIF
Subjt:  IRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

XP_022139922.1 SET and MYND domain-containing protein 4 [Momordica charantia]7.3e-30071.64Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        ME+LKSLVPENLKQ VG+STA+DLPSSCSSLLRLFQ+ ELFFQV+GDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYA+ALVHYS+ALQLAP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHKSGTV----
        ++KNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANA+ GNFDDAVRDFHIA++VE+SFNGKKQIEDEL VIQ QHK   +    
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHKSGTV----

Query:  -------------------------KALQPPELVDSSLVSTSCSFIDISRFLSHLTNDFSSSCTVRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQ
                                   + P E+  SS+V     +  +   L H           RETHCHYC NELPADK+PCPSCTIPLYCSQ CQIQ
Subjt:  -------------------------KALQPPELVDSSLVSTSCSFIDISRFLSHLTNDFSSSCTVRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQ

Query:  AGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------
        AGGQ+ Q  PDNQDI +NLSDDL+KYVQEIT   F D+RTEDVPEHKHECD                  V K VAQ+G  ADAS+VVDML          
Subjt:  AGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------

Query:  -----------------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKP
                                           IVILISQIRTNSISIVR+KSFDAPGSP Q G L+S VP TCNMEQVRVGQAIYTTGSLFNHSCKP
Subjt:  -----------------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKP

Query:  NVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHS
        N+H YFNSRTLFIRTT+FV+VGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAF CINADC G+VLDRSVF+CENKK +D HS
Subjt:  NVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHS

Query:  VDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLG
        VDK++RLE F+QS SFLH+GHSHCLKCG Y DIKSS+STVDEAW YFTRLQQE+NLNR+SETTLSDAL+ALFSL+STLHAYNRRIAEAED+LSQAFCL+G
Subjt:  VDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLG

Query:  KLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        KLELAADHCKASIRILEKLY ++HIAIGNEL+KLSSI  S+GD TTA DCINRL+ IF
Subjt:  KLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

XP_022927244.1 SET and MYND domain-containing protein 4 isoform X1 [Cucurbita moschata]2.2e-28871.58Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        MEKLKSLVPENLKQ VG+ST DDLPSSCS LLRLFQQS+LFFQV+GDLAMDPE ALCGKKKDAALELKRQGNQCFLKGDYA ALV+YSQALQ+AP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL
        +DKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKANAS GNF DA+ DF I+K+VE+SFNGKKQ++DEL +IQ QHK S TV+  
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL

Query:  QPPELVD-------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPDN
           +L D          V+TS       S I+I    L H+   ++       RETHCHYCLNELPADKVPCPSC+IPLYCSQRCQIQAGGQ+ Q  PDN
Subjt:  QPPELVD-------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPDN

Query:  QDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------------------
        ++I ++LSDDLRKYVQEIT+ SF DLRT+DVPEHKHECD                  VAK V Q G  ADASN+VDML                      
Subjt:  QDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------------------

Query:  -----------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLF
                               IVILISQIRTNSISIVRMKSFDAPGS  Q GRL+S  P TCNMEQVRVGQAIYTTGSLFNHSCKPN+H YFNSRTLF
Subjt:  -----------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLF

Query:  IRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFMQ
        IRTT FVTVGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCS+V+I DLVLNAF CIN  C G+VLDRS+FNCENKKTKDS +VD+ +RLE FM 
Subjt:  IRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFMQ

Query:  SDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKAS
        +DSFLH G SHCLKCGSY +IKSSRSTVDEAW +FTRLQQEMN N +SETT+SDAL+AL SL+STLHAYN+RIAEAEDNLSQAFCLLGKLE AADHCKAS
Subjt:  SDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKAS

Query:  IRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        IRILEKLY +NHIAIGNEL+KLSSIL SVGD    V+CI RLSEIF
Subjt:  IRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

XP_023520329.1 SET and MYND domain-containing protein 4 isoform X1 [Cucurbita pepo subsp. pepo]1.9e-28771.49Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        MEKLKSLVPENLKQ VG+ST DDLPSSCS LLRLFQQS+LFFQV+GDLAMDPE ALCGKKKDAALELKRQGNQCFLKGDYA ALV+YSQALQ+AP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTV---
        +DKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKANAS GNF DA+RDF ++KSVE+SFNGKKQ++DEL +IQ QHK S TV   
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTV---

Query:  ----KALQPPELVDSSL-VSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPD
            K     E +   L V+TS       S I+I    L H+   ++       RETHCHYCLNELPADKVPCPSC+IPLYCSQRCQIQAGG++ Q  PD
Subjt:  ----KALQPPELVDSSL-VSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPD

Query:  NQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML---------------------
        N++I ++LSDDLRKYVQEIT+ SF DLRT+DVPEHKHECD                  VAK V Q G  ADASN+VDML                     
Subjt:  NQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML---------------------

Query:  ------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTL
                                IVILISQIRTNSISIVRMKSFDAPGS  Q GRL+S VP TCNMEQVRVGQAIYTTGSLFNHSCKPN+H YFNSRTL
Subjt:  ------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTL

Query:  FIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFM
        FIRTT  VTVGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCS+V+I DLVLNAF CIN+ C G+VLDRS+FNCENKKTKD  +VD+ +RLE FM
Subjt:  FIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFM

Query:  QSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKA
         +DSFLH G SHCLKCGSY +IKSSRSTVDEAW +FTRLQQE+N NR+SETT+SDAL+AL SL+STLHAYN+RIAEAEDNLSQAFCLLGKLELAADHCKA
Subjt:  QSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKA

Query:  SIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        SIRILEKLY +NHI IGNEL+KLSSIL SVGD    V+CI RLSEIF
Subjt:  SIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

TrEMBL top hitse value%identityAlignment
A0A0A0LUY2 Uncharacterized protein4.8e-28168.21Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        MEKLKSLVPENLKQMVG++TADDLPSS S LLRLFQQS+LFFQ++GDLAMDPE ALCGKKKDAALELKRQGNQCFL GDY +ALV+YS+ALQ+AP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQH-KSGTVK--
        +DKNLVATLYVNRASVLHKMDLQLEC+RDCNRALQISSTYAKAWYRRGKAN S   FDDA+RDF I+K VE+SFNGKK I+DEL V+QHQH +S T    
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQH-KSGTVK--

Query:  --------------------------ALQPPELVDSSLVSTSCSFIDISRFLSHLTNDFSSSCTVRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQ
                                   + P E+  SSLV     +  +   L H           RETHCHYCLNELP DKVPCPSC+IPLYCSQ CQIQ
Subjt:  --------------------------ALQPPELVDSSLVSTSCSFIDISRFLSHLTNDFSSSCTVRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQ

Query:  AGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------
        AGG++ Q  PD QDIF+NLSDDLRKYVQEIT+CSF++LRTEDVPEHKHECD                  VAK +AQRG   DASN+VDML          
Subjt:  AGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------

Query:  -----------------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKP
                                           I ILISQIRTNSISIVRMKSFDAPGSP +   L+S VP TCNMEQVRVGQAIYTTGSLFNHSCKP
Subjt:  -----------------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKP

Query:  NVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHS
        N+H YFNSRTLFIR T F+ VGCPLELSYGPQVGQL CKDRL+LL+DEYSF CQCSGCS V+ISDLV+NAF CIN +C G+VLDRS+F+CEN KTKD  +
Subjt:  NVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHS

Query:  VDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLG
        V+    LE FMQ+DSFLH G SHCLKCGSYCDIKSSR TVD+A  +FTRLQQE+NLNR+SETT+SDAL AL SL+STLH YNRRIAEAEDNLSQAF LLG
Subjt:  VDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLG

Query:  KLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        KLELAA+HCKASIRILEKLY +NHIAIGNEL KLSSIL SVGD   AVDCI RLS+IF
Subjt:  KLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

A0A6J1CF99 SET and MYND domain-containing protein 43.5e-30071.64Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        ME+LKSLVPENLKQ VG+STA+DLPSSCSSLLRLFQ+ ELFFQV+GDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYA+ALVHYS+ALQLAP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHKSGTV----
        ++KNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANA+ GNFDDAVRDFHIA++VE+SFNGKKQIEDEL VIQ QHK   +    
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHKSGTV----

Query:  -------------------------KALQPPELVDSSLVSTSCSFIDISRFLSHLTNDFSSSCTVRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQ
                                   + P E+  SS+V     +  +   L H           RETHCHYC NELPADK+PCPSCTIPLYCSQ CQIQ
Subjt:  -------------------------KALQPPELVDSSLVSTSCSFIDISRFLSHLTNDFSSSCTVRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQ

Query:  AGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------
        AGGQ+ Q  PDNQDI +NLSDDL+KYVQEIT   F D+RTEDVPEHKHECD                  V K VAQ+G  ADAS+VVDML          
Subjt:  AGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------

Query:  -----------------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKP
                                           IVILISQIRTNSISIVR+KSFDAPGSP Q G L+S VP TCNMEQVRVGQAIYTTGSLFNHSCKP
Subjt:  -----------------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKP

Query:  NVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHS
        N+H YFNSRTLFIRTT+FV+VGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAF CINADC G+VLDRSVF+CENKK +D HS
Subjt:  NVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHS

Query:  VDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLG
        VDK++RLE F+QS SFLH+GHSHCLKCG Y DIKSS+STVDEAW YFTRLQQE+NLNR+SETTLSDAL+ALFSL+STLHAYNRRIAEAED+LSQAFCL+G
Subjt:  VDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLG

Query:  KLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        KLELAADHCKASIRILEKLY ++HIAIGNEL+KLSSI  S+GD TTA DCINRL+ IF
Subjt:  KLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

A0A6J1EHH4 SET and MYND domain-containing protein 4 isoform X11.1e-28871.58Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        MEKLKSLVPENLKQ VG+ST DDLPSSCS LLRLFQQS+LFFQV+GDLAMDPE ALCGKKKDAALELKRQGNQCFLKGDYA ALV+YSQALQ+AP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL
        +DKNLVATLYVNRASVL KMDLQLEC+RDCNRALQISS YAKAWYRRGKANAS GNF DA+ DF I+K+VE+SFNGKKQ++DEL +IQ QHK S TV+  
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL

Query:  QPPELVD-------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPDN
           +L D          V+TS       S I+I    L H+   ++       RETHCHYCLNELPADKVPCPSC+IPLYCSQRCQIQAGGQ+ Q  PDN
Subjt:  QPPELVD-------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPDN

Query:  QDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------------------
        ++I ++LSDDLRKYVQEIT+ SF DLRT+DVPEHKHECD                  VAK V Q G  ADASN+VDML                      
Subjt:  QDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------------------

Query:  -----------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLF
                               IVILISQIRTNSISIVRMKSFDAPGS  Q GRL+S  P TCNMEQVRVGQAIYTTGSLFNHSCKPN+H YFNSRTLF
Subjt:  -----------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLF

Query:  IRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFMQ
        IRTT FVTVGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCS+V+I DLVLNAF CIN  C G+VLDRS+FNCENKKTKDS +VD+ +RLE FM 
Subjt:  IRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFMQ

Query:  SDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKAS
        +DSFLH G SHCLKCGSY +IKSSRSTVDEAW +FTRLQQEMN N +SETT+SDAL+AL SL+STLHAYN+RIAEAEDNLSQAFCLLGKLE AADHCKAS
Subjt:  SDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKAS

Query:  IRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        IRILEKLY +NHIAIGNEL+KLSSIL SVGD    V+CI RLSEIF
Subjt:  IRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

A0A6J1KIH9 SET and MYND domain-containing protein 4 isoform X22.7e-28470.41Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        MEKLKSLVP+NL+Q VG+ST DDLPSSCS LLRLFQQS+LFFQ++GDL MDPE ALCGKKKDAALELKRQGNQCFLKGDYA+ALV+YSQALQ+AP+NAVD
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL
        +DKNLVATLYVNRASVL KMDLQLEC+RDCNR LQISS YAKAWYRRGKANAS GNF DA+RDF I+K+VE+SFNGKKQ++DEL +IQ Q+K S TV+  
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHK-SGTVKAL

Query:  QPPELVD--------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPD
             +D           V+TS       S I+I    L H+   ++       RETHCHYCLNELPADKVPCPSC+IPLYCSQRCQIQAGG++ Q  PD
Subjt:  QPPELVD--------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIFQKFPD

Query:  NQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML---------------------
        N++I ++LSDDLRKYVQEIT  SF DLRT+DVPEHKHECD                  +AK V Q G  ADASN+VDML                     
Subjt:  NQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML---------------------

Query:  ------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTL
                                IVILISQIRTNSISIVRMKSFDAPGS  Q GRL+S  P TCNMEQVRVGQAIYTTGSLFNHSCKPN+H YFNSRTL
Subjt:  ------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTL

Query:  FIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFM
        FIRTT  VTVGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCSLV+ISDLVL+AF CIN  C G+VLDRS+FNCENKKTKDS +VD+ +RLE FM
Subjt:  FIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFM

Query:  QSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKA
         +DSFLH G SHCLKCGSY +IKSS STVDEAW +FTRLQQE+N NR+SETT+SDAL+AL SL+STLHAYN+RIAEAEDNLSQAFCLLGKLELAADHCKA
Subjt:  QSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKA

Query:  SIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        SIRILEKLY +NHIAIGNEL+KLSSIL SVGD    V+CI RLSEIF
Subjt:  SIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

A0A6J1KL29 SET and MYND domain-containing protein 4 isoform X12.8e-28168.89Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQV------------------VGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYAS
        MEKLKSLVP+NL+Q VG+ST DDLPSSCS LLRLFQQS+LFFQV                  +GDL MDPE ALCGKKKDAALELKRQGNQCFLKGDYA+
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQV------------------VGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYAS

Query:  ALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIED
        ALV+YSQALQ+AP+NAVD+DKNLVATLYVNRASVL KMDLQLEC+RDCNR LQISS YAKAWYRRGKANAS GNF DA+RDF I+K+VE+SFNGKKQ++D
Subjt:  ALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIED

Query:  ELNVIQHQHK-SGTVKALQPPELVD--------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYC
        EL +IQ Q+K S TV+       +D           V+TS       S I+I    L H+   ++       RETHCHYCLNELPADKVPCPSC+IPLYC
Subjt:  ELNVIQHQHK-SGTVKALQPPELVD--------SSLVSTS------CSFIDI-SRFLSHLTNDFSSSCT--VRETHCHYCLNELPADKVPCPSCTIPLYC

Query:  SQRCQIQAGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML---
        SQRCQIQAGG++ Q  PDN++I ++LSDDLRKYVQEIT  SF DLRT+DVPEHKHECD                  +AK V Q G  ADASN+VDML   
Subjt:  SQRCQIQAGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML---

Query:  ------------------------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSL
                                                  IVILISQIRTNSISIVRMKSFDAPGS  Q GRL+S  P TCNMEQVRVGQAIYTTGSL
Subjt:  ------------------------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSL

Query:  FNHSCKPNVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENK
        FNHSCKPN+H YFNSRTLFIRTT  VTVGCPLELSYGPQVGQL CKDRLKLLEDEYSFKCQCSGCSLV+ISDLVL+AF CIN  C G+VLDRS+FNCENK
Subjt:  FNHSCKPNVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENK

Query:  KTKDSHSVDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLS
        KTKDS +VD+ +RLE FM +DSFLH G SHCLKCGSY +IKSS STVDEAW +FTRLQQE+N NR+SETT+SDAL+AL SL+STLHAYN+RIAEAEDNLS
Subjt:  KTKDSHSVDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAEDNLS

Query:  QAFCLLGKLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        QAFCLLGKLELAADHCKASIRILEKLY +NHIAIGNEL+KLSSIL SVGD    V+CI RLSEIF
Subjt:  QAFCLLGKLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

SwissProt top hitse value%identityAlignment
Q8BTK5 SET and MYND domain-containing protein 42.5e-1624.06Show/hide
Query:  NMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVL---NAFRC
        N  Q+R+   I+   SL NHSC+PN    F      +R    +  G  +   YGP   ++G  +R + L  +Y F C+C  C    +         AF C
Subjt:  NMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVL---NAFRC

Query:  INADCPGIVLDRSVFNCENKKTKDSHSVDKM-NRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALF
            C  ++    V +C N+   +S S D++ +RL+   Q                                     + Q++      E  +   L+   
Subjt:  INADCPGIVLDRSVFNCENKKTKDSHSVDKM-NRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALF

Query:  SLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSIL
        +  S L A +  + E ED L+QA   LG    +A H + S++++E  +  + + IG+EL KL+ +L
Subjt:  SLRSTLHAYNRRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSIL

Q8CGY6 Protein unc-45 homolog B1.0e-0939.81Show/hide
Query:  ALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRD
        A +LK +GN+ F   DY +A   YSQAL+L        DK L+ATLY NRA+   KM+   +   D +RA+ I+S   KA YRR +A    G  D A +D
Subjt:  ALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRD

Query:  FHIAKSVE
             ++E
Subjt:  FHIAKSVE

Q8IWX7 Protein unc-45 homolog B1.7e-0938.89Show/hide
Query:  ALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRD
        A++LK +GN+ F   DY +A   YSQAL+L        DK L+ATLY NRA+   K +  ++   D +RA+ I+S+  KA YRR +A    G  D A +D
Subjt:  ALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRD

Query:  FHIAKSVE
             ++E
Subjt:  FHIAKSVE

Q91Z38 Tetratricopeptide repeat protein 17.7e-1033.96Show/hide
Query:  KKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFD
        K+++ + +LK +GN+ F +GDY  A   YSQALQ+ P      D+++   L+ NRA+   K D +   + DC++A+Q++ TY +A  RR +        D
Subjt:  KKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFD

Query:  DAVRDF
        +A+ D+
Subjt:  DAVRDF

Q9HGM9 DnaJ homolog subfamily C member 7 homolog4.5e-1036.15Show/hide
Query:  KRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIA
        K QGN  F +G+Y  A   YS+ALQ+ P N     K  VA LY+NRA+VL ++    E + D + AL I S+Y K    R KA+ +   +++AVRD   A
Subjt:  KRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIA

Query:  KSVEISFNGKKQIEDELNVIQHQHKSGTVK
          ++ S      +  EL  +Q + K    K
Subjt:  KSVEISFNGKKQIEDELNVIQHQHKSGTVK

Arabidopsis top hitse value%identityAlignment
AT1G33400.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-15941.35Show/hide
Query:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD
        MEKLKSL+PE+L Q V +S+ DDL S+ SSLLRLF     F Q V +LA +PE   CGK ++ +L+LKR+GN CF   D+  AL  YS+AL++AP++A+D
Subjt:  MEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQSELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVD

Query:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHKSGTVKALQ
         DK+L+A+L++NRA+VLH + L  E +RDC+RAL+I   YAKAWYRRGK N   GN+ DA RD  ++ S+E S  GKKQ+++EL  I     + T++  +
Subjt:  IDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHKSGTVKALQ

Query:  PPELVDSSL--------------VSTS------CSFIDISR-FLSHLTNDFS--SSCTVRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIF
             D+ +              VST        S  DI    + H+   FS   S + RETHCH+CLNELPAD VPCPSC+IP+YCS+ CQIQ+GG + 
Subjt:  PPELVDSSL--------------VSTS------CSFIDISR-FLSHLTNDFS--SSCTVRETHCHYCLNELPADKVPCPSCTIPLYCSQRCQIQAGGQIF

Query:  QKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------------
            D   IF+ L DD+ ++++ +T        T+ + EH+HEC                   + KL+ Q     D SN+ ++L                
Subjt:  QKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECD------------------VAKLVAQRGDVADASNVVDML----------------

Query:  -----------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYF
                                      +IL+SQI+ NSI++ RMKS          G +++  P+  ++EQ+RVGQA+Y TGSLFNHSCKPN+H YF
Subjt:  -----------------------------IVILISQIRTNSISIVRMKSFDAPGSPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYF

Query:  NSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKD----SHSVD
         SR L ++TT+FV  GCPLELSYGP+VG+  CK+R++ LE+EY F C+C GC+ +NISDLV+N + C+N +C G+VLD +V  CE++K         +VD
Subjt:  NSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNAFRCINADCPGIVLDRSVFNCENKKTKD----SHSVD

Query:  KMNRLERFMQSD-------------SFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAE
        +  ++   + +D               LH+    CLKCGS CDI++S + V++AW +  R+++ MN  R + + LSD  +++  LR+ LH YN+ IA+AE
Subjt:  KMNRLERFMQSD-------------SFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHAYNRRIAEAE

Query:  DNLSQAFCLLGKLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF
        D ++QA  L G+L  A  HC+ASI+IL++LY   H+ IGNE+VKL+SI  + GD + A D   R S+IF
Subjt:  DNLSQAFCLLGKLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIF

AT3G58620.1 tetratricopetide-repeat thioredoxin-like 42.7e-1035.64Show/hide
Query:  KRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIA
        + +GN+ F  G Y+ A V Y   L+L   N+V         LY NRA+   K+ +  + V DCN+AL+I  +Y KA  RR  +    G ++DAVRD+ + 
Subjt:  KRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRGKANASKGNFDDAVRDFHIA

Query:  K
        +
Subjt:  K

AT3G59090.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457)7.1e-3537.27Show/hide
Query:  FELTENSCFPRVLVGVHVGLALVDAIIAVLAFYQIKNL----------LQLAPHSLLKGKNQ-----------KKCPR-------SSILLL---HILSVA
        FE T++ C+    + V++ LA +DA +A +AF Q+              Q   H ++   N              C R          LL+    IL +A
Subjt:  FELTENSCFPRVLVGVHVGLALVDAIIAVLAFYQIKNL----------LQLAPHSLLKGKNQ-----------KKCPR-------SSILLL---HILSVA

Query:  YVTVESFYRVDLCHQ--PDDEDDEDEERSFEEGLLEKISSEPSSSNTDWSKRWLPVRLPHVGSRQNLVILVIMIIFVLTLGFAVILWIGMGNKSIDSLVV
           +   + VD+CHQ   +++DD+DEE S ++ LLEK  S+P SSN    ++       HVG+RQ  V+  I+++F+L + FA+++WI  G   ++S ++
Subjt:  YVTVESFYRVDLCHQ--PDDEDDEDEERSFEEGLLEKISSEPSSSNTDWSKRWLPVRLPHVGSRQNLVILVIMIIFVLTLGFAVILWIGMGNKSIDSLVV

Query:  LQVVYVDLFAAAMLVLGGALACYGLLLFLKMRKVRSERASSEILKVAGLAAVSVVCFTSSALVALLTNIPV
         + VYVD+FAA +L+ GG +  YGL L   +RKVRSE+ SSE+ KV+GLA VSVVCFT S+L+ALLT+IP+
Subjt:  LQVVYVDLFAAAMLVLGGALACYGLLLFLKMRKVRSERASSEILKVAGLAAVSVVCFTSSALVALLTNIPV

AT3G59090.2 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1084 (InterPro:IPR009457)7.1e-3537.27Show/hide
Query:  FELTENSCFPRVLVGVHVGLALVDAIIAVLAFYQIKNL----------LQLAPHSLLKGKNQ-----------KKCPR-------SSILLL---HILSVA
        FE T++ C+    + V++ LA +DA +A +AF Q+              Q   H ++   N              C R          LL+    IL +A
Subjt:  FELTENSCFPRVLVGVHVGLALVDAIIAVLAFYQIKNL----------LQLAPHSLLKGKNQ-----------KKCPR-------SSILLL---HILSVA

Query:  YVTVESFYRVDLCHQ--PDDEDDEDEERSFEEGLLEKISSEPSSSNTDWSKRWLPVRLPHVGSRQNLVILVIMIIFVLTLGFAVILWIGMGNKSIDSLVV
           +   + VD+CHQ   +++DD+DEE S ++ LLEK  S+P SSN    ++       HVG+RQ  V+  I+++F+L + FA+++WI  G   ++S ++
Subjt:  YVTVESFYRVDLCHQ--PDDEDDEDEERSFEEGLLEKISSEPSSSNTDWSKRWLPVRLPHVGSRQNLVILVIMIIFVLTLGFAVILWIGMGNKSIDSLVV

Query:  LQVVYVDLFAAAMLVLGGALACYGLLLFLKMRKVRSERASSEILKVAGLAAVSVVCFTSSALVALLTNIPV
         + VYVD+FAA +L+ GG +  YGL L   +RKVRSE+ SSE+ KV+GLA VSVVCFT S+L+ALLT+IP+
Subjt:  LQVVYVDLFAAAMLVLGGALACYGLLLFLKMRKVRSERASSEILKVAGLAAVSVVCFTSSALVALLTNIPV

AT3G59090.3 LOCATED IN: endomembrane system1.9e-3246.02Show/hide
Query:  ILSVAYVTVESFYRVDLCHQ--PDDEDDEDEERSFEEGLLEKISSEPSSSNTDWSKRWLPVRLPHVGSRQNLVILVIMIIFVLTLGFAVILWIGMGNKSI
        IL +A   +   + VD+CHQ   +++DD+DEE S ++ LLEK  S+P SSN    ++       HVG+RQ  V+  I+++F+L + FA+++WI  G   +
Subjt:  ILSVAYVTVESFYRVDLCHQ--PDDEDDEDEERSFEEGLLEKISSEPSSSNTDWSKRWLPVRLPHVGSRQNLVILVIMIIFVLTLGFAVILWIGMGNKSI

Query:  DSLVVLQVVYVDLFAAAMLVLGGALACYGLLLFLKMRKVRSERASSEILKVAGLAAVSVVCFTSSALVALLTNIPV
        +S ++ + VYVD+FAA +L+ GG +  YGL L   +RKVRSE+ SSE+ KV+GLA VSVVCFT S+L+ALLT+IP+
Subjt:  DSLVVLQVVYVDLFAAAMLVLGGALACYGLLLFLKMRKVRSERASSEILKVAGLAAVSVVCFTSSALVALLTNIPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTTTGAATTAACTGAAAACTCTTGCTTTCCACGGGTCTTGGTAGGTGTACATGTGGGTCTTGCTCTCGTTGATGCCATTATTGCAGTTCTTGCATTTTATCAGAT
CAAAAACTTGTTGCAGCTTGCACCACACTCATTACTGAAAGGGAAAAACCAAAAGAAATGTCCTCGATCCTCTATACTGCTCTTACACATTCTGTCTGTTGCTTATGTCA
CTGTGGAAAGCTTTTATAGGGTTGACCTCTGCCATCAGCCAGATGATGAAGATGATGAGGACGAAGAGAGAAGTTTCGAGGAAGGCTTGTTGGAGAAGATCTCTAGTGAA
CCAAGTTCATCAAATACAGATTGGAGCAAGAGGTGGCTCCCGGTACGGCTGCCTCATGTTGGAAGCCGCCAAAATTTAGTAATACTGGTGATCATGATTATCTTTGTTTT
AACATTGGGATTTGCTGTGATTCTCTGGATTGGAATGGGAAACAAATCGATTGATTCTTTAGTAGTTCTTCAGGTAGTATATGTAGATCTTTTTGCCGCAGCGATGCTCG
TATTAGGAGGAGCATTAGCTTGCTATGGTCTGCTATTGTTCTTGAAAATGAGAAAGGTTAGATCTGAGAGGGCATCATCTGAAATATTAAAGGTTGCGGGTTTGGCTGCT
GTTTCTGTTGTATGTTTCACTTCGAGTGCACTTGTAGCTCTTTTAACCAATATACCGGTCAAGGACATAGCAGAGTATTGCATTCCAATACAGTCGCATTATGTTGCGGT
ATCGGAGACGTTTAGAACTGTAGTTCTCCAAATCGGACCCTCCTTGCAGCTTCAACACATCAAATTTGATACCCCTAAGATGCTATTGCACTCATTTCCCAACTCTATGG
AGAAGCTGAAATCATTGGTGCCGGAAAACTTGAAGCAGATGGTGGGTGCAAGCACCGCCGATGATCTTCCTTCGTCGTGTTCTTCCCTACTACGCCTATTTCAGCAGTCG
GAGCTCTTCTTCCAAGTGGTCGGAGACTTGGCCATGGACCCTGAAAAGGCTCTCTGTGGTAAGAAAAAGGACGCTGCTTTGGAGTTGAAGCGCCAGGGCAATCAATGCTT
CTTAAAGGGGGATTATGCTAGTGCGCTGGTTCATTATTCCCAGGCACTGCAACTTGCTCCAATAAATGCCGTTGACATTGACAAGAATTTGGTTGCAACCTTATATGTGA
ATCGAGCATCAGTTTTGCACAAAATGGATCTGCAGTTGGAGTGTGTACGAGATTGCAATAGAGCACTTCAAATTTCATCAACCTATGCAAAGGCATGGTACAGAAGAGGT
AAAGCAAATGCGTCTAAGGGAAATTTTGATGATGCAGTCCGTGACTTTCATATTGCTAAGAGCGTGGAGATATCATTCAATGGAAAGAAGCAAATAGAAGACGAGCTGAA
TGTCATTCAACATCAGCATAAGAGCGGCACTGTAAAGGCTTTGCAACCTCCAGAACTGGTTGATAGCAGTCTAGTCAGTACCTCTTGCTCCTTTATAGACATCTCTCGGT
TCTTGTCGCATCTGACTAATGACTTCAGCAGCTCTTGTACTGTTCGAGAAACTCATTGCCATTACTGCTTGAATGAACTACCAGCAGATAAAGTACCTTGTCCATCATGC
ACAATTCCTCTGTACTGCTCACAACGTTGCCAAATACAAGCAGGGGGGCAGATATTTCAAAAATTTCCAGATAATCAAGATATTTTCGAAAATCTATCTGATGACCTCAG
AAAGTATGTTCAAGAGATAACTATGTGCAGTTTTACCGACTTAAGGACTGAAGATGTTCCTGAACATAAACATGAATGTGATGTTGCTAAACTTGTAGCACAAAGAGGTG
ACGTTGCAGATGCCTCTAACGTTGTGGATATGCTGATTGTCATTCTTATATCCCAAATTAGGACGAACTCTATATCAATTGTTCGTATGAAATCCTTTGATGCACCTGGA
TCACCACATCAGTATGGAAGATTAACTAGTGCGGTTCCTTTAACGTGTAATATGGAACAAGTCAGAGTAGGTCAAGCTATTTATACGACGGGAAGCTTGTTTAACCATTC
CTGCAAACCAAACGTCCATCAATATTTCAATTCACGTACTCTCTTTATACGGACAACTGACTTCGTGACAGTCGGGTGTCCCCTAGAATTGTCATACGGTCCACAGGTTG
GTCAATTGGGCTGTAAAGACCGGCTTAAGTTGCTAGAGGATGAGTACTCTTTTAAATGTCAGTGTAGTGGTTGCTCATTGGTGAATATATCTGACCTCGTCCTCAATGCG
TTTCGTTGCATTAATGCTGATTGCCCCGGCATAGTCTTGGATAGATCTGTTTTCAACTGTGAAAATAAGAAAACCAAGGACTCTCATTCAGTCGACAAGATGAATAGGTT
GGAGCGTTTTATGCAGAGTGACAGCTTCCTTCATGTTGGTCATAGCCATTGTTTGAAATGTGGATCTTATTGCGATATAAAATCATCTCGTTCCACAGTGGATGAGGCCT
GGTTTTACTTTACAAGGTTGCAGCAGGAGATGAATTTAAATAGGTTGTCAGAAACTACACTCTCAGATGCTTTGAAAGCTCTGTTCTCACTGAGATCTACATTGCATGCA
TATAATAGGCGCATAGCAGAAGCAGAAGACAATTTGTCGCAGGCCTTCTGTTTGCTTGGAAAACTAGAGCTTGCAGCAGACCATTGTAAAGCATCGATTCGAATCCTGGA
GAAGTTGTACAGCAAAAACCATATCGCCATTGGCAACGAACTCGTGAAACTTTCATCCATTCTGTCATCTGTGGGCGACCAGACTACTGCGGTGGACTGCATTAACCGAT
TGAGTGAAATTTTCAG
mRNA sequenceShow/hide mRNA sequence
ATGACTTTTGAATTAACTGAAAACTCTTGCTTTCCACGGGTCTTGGTAGGTGTACATGTGGGTCTTGCTCTCGTTGATGCCATTATTGCAGTTCTTGCATTTTATCAGAT
CAAAAACTTGTTGCAGCTTGCACCACACTCATTACTGAAAGGGAAAAACCAAAAGAAATGTCCTCGATCCTCTATACTGCTCTTACACATTCTGTCTGTTGCTTATGTCA
CTGTGGAAAGCTTTTATAGGGTTGACCTCTGCCATCAGCCAGATGATGAAGATGATGAGGACGAAGAGAGAAGTTTCGAGGAAGGCTTGTTGGAGAAGATCTCTAGTGAA
CCAAGTTCATCAAATACAGATTGGAGCAAGAGGTGGCTCCCGGTACGGCTGCCTCATGTTGGAAGCCGCCAAAATTTAGTAATACTGGTGATCATGATTATCTTTGTTTT
AACATTGGGATTTGCTGTGATTCTCTGGATTGGAATGGGAAACAAATCGATTGATTCTTTAGTAGTTCTTCAGGTAGTATATGTAGATCTTTTTGCCGCAGCGATGCTCG
TATTAGGAGGAGCATTAGCTTGCTATGGTCTGCTATTGTTCTTGAAAATGAGAAAGGTTAGATCTGAGAGGGCATCATCTGAAATATTAAAGGTTGCGGGTTTGGCTGCT
GTTTCTGTTGTATGTTTCACTTCGAGTGCACTTGTAGCTCTTTTAACCAATATACCGGTCAAGGACATAGCAGAGTATTGCATTCCAATACAGTCGCATTATGTTGCGGT
ATCGGAGACGTTTAGAACTGTAGTTCTCCAAATCGGACCCTCCTTGCAGCTTCAACACATCAAATTTGATACCCCTAAGATGCTATTGCACTCATTTCCCAACTCTATGG
AGAAGCTGAAATCATTGGTGCCGGAAAACTTGAAGCAGATGGTGGGTGCAAGCACCGCCGATGATCTTCCTTCGTCGTGTTCTTCCCTACTACGCCTATTTCAGCAGTCG
GAGCTCTTCTTCCAAGTGGTCGGAGACTTGGCCATGGACCCTGAAAAGGCTCTCTGTGGTAAGAAAAAGGACGCTGCTTTGGAGTTGAAGCGCCAGGGCAATCAATGCTT
CTTAAAGGGGGATTATGCTAGTGCGCTGGTTCATTATTCCCAGGCACTGCAACTTGCTCCAATAAATGCCGTTGACATTGACAAGAATTTGGTTGCAACCTTATATGTGA
ATCGAGCATCAGTTTTGCACAAAATGGATCTGCAGTTGGAGTGTGTACGAGATTGCAATAGAGCACTTCAAATTTCATCAACCTATGCAAAGGCATGGTACAGAAGAGGT
AAAGCAAATGCGTCTAAGGGAAATTTTGATGATGCAGTCCGTGACTTTCATATTGCTAAGAGCGTGGAGATATCATTCAATGGAAAGAAGCAAATAGAAGACGAGCTGAA
TGTCATTCAACATCAGCATAAGAGCGGCACTGTAAAGGCTTTGCAACCTCCAGAACTGGTTGATAGCAGTCTAGTCAGTACCTCTTGCTCCTTTATAGACATCTCTCGGT
TCTTGTCGCATCTGACTAATGACTTCAGCAGCTCTTGTACTGTTCGAGAAACTCATTGCCATTACTGCTTGAATGAACTACCAGCAGATAAAGTACCTTGTCCATCATGC
ACAATTCCTCTGTACTGCTCACAACGTTGCCAAATACAAGCAGGGGGGCAGATATTTCAAAAATTTCCAGATAATCAAGATATTTTCGAAAATCTATCTGATGACCTCAG
AAAGTATGTTCAAGAGATAACTATGTGCAGTTTTACCGACTTAAGGACTGAAGATGTTCCTGAACATAAACATGAATGTGATGTTGCTAAACTTGTAGCACAAAGAGGTG
ACGTTGCAGATGCCTCTAACGTTGTGGATATGCTGATTGTCATTCTTATATCCCAAATTAGGACGAACTCTATATCAATTGTTCGTATGAAATCCTTTGATGCACCTGGA
TCACCACATCAGTATGGAAGATTAACTAGTGCGGTTCCTTTAACGTGTAATATGGAACAAGTCAGAGTAGGTCAAGCTATTTATACGACGGGAAGCTTGTTTAACCATTC
CTGCAAACCAAACGTCCATCAATATTTCAATTCACGTACTCTCTTTATACGGACAACTGACTTCGTGACAGTCGGGTGTCCCCTAGAATTGTCATACGGTCCACAGGTTG
GTCAATTGGGCTGTAAAGACCGGCTTAAGTTGCTAGAGGATGAGTACTCTTTTAAATGTCAGTGTAGTGGTTGCTCATTGGTGAATATATCTGACCTCGTCCTCAATGCG
TTTCGTTGCATTAATGCTGATTGCCCCGGCATAGTCTTGGATAGATCTGTTTTCAACTGTGAAAATAAGAAAACCAAGGACTCTCATTCAGTCGACAAGATGAATAGGTT
GGAGCGTTTTATGCAGAGTGACAGCTTCCTTCATGTTGGTCATAGCCATTGTTTGAAATGTGGATCTTATTGCGATATAAAATCATCTCGTTCCACAGTGGATGAGGCCT
GGTTTTACTTTACAAGGTTGCAGCAGGAGATGAATTTAAATAGGTTGTCAGAAACTACACTCTCAGATGCTTTGAAAGCTCTGTTCTCACTGAGATCTACATTGCATGCA
TATAATAGGCGCATAGCAGAAGCAGAAGACAATTTGTCGCAGGCCTTCTGTTTGCTTGGAAAACTAGAGCTTGCAGCAGACCATTGTAAAGCATCGATTCGAATCCTGGA
GAAGTTGTACAGCAAAAACCATATCGCCATTGGCAACGAACTCGTGAAACTTTCATCCATTCTGTCATCTGTGGGCGACCAGACTACTGCGGTGGACTGCATTAACCGAT
TGAGTGAAATTTTCAG
Protein sequenceShow/hide protein sequence
MTFELTENSCFPRVLVGVHVGLALVDAIIAVLAFYQIKNLLQLAPHSLLKGKNQKKCPRSSILLLHILSVAYVTVESFYRVDLCHQPDDEDDEDEERSFEEGLLEKISSE
PSSSNTDWSKRWLPVRLPHVGSRQNLVILVIMIIFVLTLGFAVILWIGMGNKSIDSLVVLQVVYVDLFAAAMLVLGGALACYGLLLFLKMRKVRSERASSEILKVAGLAA
VSVVCFTSSALVALLTNIPVKDIAEYCIPIQSHYVAVSETFRTVVLQIGPSLQLQHIKFDTPKMLLHSFPNSMEKLKSLVPENLKQMVGASTADDLPSSCSSLLRLFQQS
ELFFQVVGDLAMDPEKALCGKKKDAALELKRQGNQCFLKGDYASALVHYSQALQLAPINAVDIDKNLVATLYVNRASVLHKMDLQLECVRDCNRALQISSTYAKAWYRRG
KANASKGNFDDAVRDFHIAKSVEISFNGKKQIEDELNVIQHQHKSGTVKALQPPELVDSSLVSTSCSFIDISRFLSHLTNDFSSSCTVRETHCHYCLNELPADKVPCPSC
TIPLYCSQRCQIQAGGQIFQKFPDNQDIFENLSDDLRKYVQEITMCSFTDLRTEDVPEHKHECDVAKLVAQRGDVADASNVVDMLIVILISQIRTNSISIVRMKSFDAPG
SPHQYGRLTSAVPLTCNMEQVRVGQAIYTTGSLFNHSCKPNVHQYFNSRTLFIRTTDFVTVGCPLELSYGPQVGQLGCKDRLKLLEDEYSFKCQCSGCSLVNISDLVLNA
FRCINADCPGIVLDRSVFNCENKKTKDSHSVDKMNRLERFMQSDSFLHVGHSHCLKCGSYCDIKSSRSTVDEAWFYFTRLQQEMNLNRLSETTLSDALKALFSLRSTLHA
YNRRIAEAEDNLSQAFCLLGKLELAADHCKASIRILEKLYSKNHIAIGNELVKLSSILSSVGDQTTAVDCINRLSEIFX