; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G22800 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G22800
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDNA-dependent metalloprotease WSS1 isoform X2
Genome locationClcChr09:36181144..36184178
RNA-Seq ExpressionClc09G22800
SyntenyClc09G22800
Gene Ontology termsGO:0008233 - peptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001876 - Zinc finger, RanBP2-type
IPR013536 - WLM domain
IPR036443 - Zinc finger, RanBP2-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004150300.2 uncharacterized protein LOC101209563 [Cucumis sativus]2.3e-21080.65Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDVGDLNKVWEIKALKKAGEKEAK++LERIAKQVQPIM +HKWRVK+LS+  PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR
        HGPHNANFYKLWDELRK                                              ECEEL+AKG+SGTAQGFDLPGRRLGGN RQP LSSLR
Subjt:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR

Query:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKR
        KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS   MPVDEDCCP  PSEAAHSSQAGKSGPF NLSK VDALHQKR
Subjt:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKR

Query:  SRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLA
         RESERS NK+S G L+PDFVDLSKD+AIP  SADY AESNKRHK+PD + FP+SSAETSSID SCSSSNLM  +DGTIHP ELSMWECGNCTLLNPPLA
Subjt:  SRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLA

Query:  PMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT
        P+CELCFSQKP D+DTRYKFWSCKFCTLENSVKLEKC+ACDQWRYSHGQPVSTRGPNLGT
Subjt:  PMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT

XP_022976236.1 uncharacterized protein LOC111476695 isoform X1 [Cucurbita maxima]7.1e-20473.13Show/hide
Query:  SRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMH
        S Q  I VK ISSSKFQCRFEFQMI NLL+  F+INRFRF                    MDV D+NKVWEIKAL KKAGEKEAKE+LERIAKQVQPIM 
Subjt:  SRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMH

Query:  RHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIY
        RHKWRVKILS+  PKNPALLG+NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRK                      
Subjt:  RHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIY

Query:  IHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQA
                                ECEELMAKGISGTAQGFD+PGRRLGGN+ QPPLSSLRKSSLAAAEGRRRL SLLPSGP RLGGDS+IMVALSPVQA
Subjt:  IHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQA

Query:  AAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAE
        AAMAAERRLQDDIWCASS  MPVDEDCC D  SE A   QAG+S  FSN S+G+DA H KR RE+ERSS K+S GHLKPDFV       IP  SADY AE
Subjt:  AAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAE

Query:  SNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSA
        SNKRHKM   VPFP+S AETSSID  CSSSNLMP HDGT HP ELSMWECGNCTLLNPPLAP+CELCFS K K ADT+YKFWSCKFCTLENSVKLEKCSA
Subjt:  SNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSA

Query:  CDQWRYSHGQPVSTRGPNLGT
        C QWRYSHGQPVSTRGPN+GT
Subjt:  CDQWRYSHGQPVSTRGPNLGT

XP_022976238.1 uncharacterized protein LOC111476695 isoform X2 [Cucurbita maxima]1.2e-20373.13Show/hide
Query:  SRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMH
        S Q  I VK ISSSKFQCRFEFQMI NLL+  F+INRFRF                    MDV D+NKVWEIKAL KKAGEKEAKE+LERIAKQVQPIM 
Subjt:  SRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMH

Query:  RHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIY
        RHKWRVKILS+  PKNPALLG+NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRK                      
Subjt:  RHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIY

Query:  IHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQA
                                ECEELMAKGISGTAQGFD+PGRRLGGN+ QPPLSSLRKSSLAAAEGRRRL SLLPSGP RLGGDS+IMVALSPVQA
Subjt:  IHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQA

Query:  AAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAE
        AAMAAERRLQDDIWCASS  MPVDEDCC D  SE A   QAG+S  FSN S+G+DA H KR RE+ERSS K+S GHLKPDFV       IP  SADY AE
Subjt:  AAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAE

Query:  SNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSA
        SNKRHKM   VPFP+S AETSSID  CSSSNLMP HDGT HP ELSMWECGNCTLLNPPLAP+CELCFS K K ADT+YKFWSCKFCTLENSVKLEKCSA
Subjt:  SNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSA

Query:  CDQWRYSHGQPVSTRGPNLGT
        C QWRYSHGQPVSTRGPN+GT
Subjt:  CDQWRYSHGQPVSTRGPNLGT

XP_023536323.1 uncharacterized protein LOC111797532 isoform X1 [Cucurbita pepo subsp. pepo]2.7e-21175.1Show/hide
Query:  VKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMHRHKWRVK
        VK ISSSKFQCRFEFQMI NLL+  F+INRFRF                    MDV D+NKVWEIKAL KKAGEKEAKE+LERIAKQVQPIM RHKWRVK
Subjt:  VKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMHRHKWRVK

Query:  ILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGIS
        +LS+  PKNPALLG+NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRK                             
Subjt:  ILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGIS

Query:  SLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAER
                         ECEELMA+GISGTAQGFD+PGRRLGGN  QPPLSSLRKSSL AAEGRRRL SLLPSGP RLGGDS+IMVALSPVQAAAMAAER
Subjt:  SLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAER

Query:  RLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKM
        RLQDDIWCASS  MPVDEDCC D PSEAA   QAG+SGPFSN S+GVDALH KR RESERSS K+S GHLKPDFVDLSKD+ IP  SA Y AESNKRHKM
Subjt:  RLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKM

Query:  PDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYS
           VPFP+S AET+SID  CSSSNLMP HDGT HP ELSMWECGNCTLLNPPLAP+CELCFS KPK ADT+YKFWSCKFCTLENSVKLEKCSAC QWRYS
Subjt:  PDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYS

Query:  HGQPVSTRGPNLGT
        HGQPVSTRGPN+GT
Subjt:  HGQPVSTRGPNLGT

XP_038898345.1 uncharacterized protein LOC120086023 [Benincasa hispida]1.3e-21884.35Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIM RHKWRVK+LS+  PKNPALLGLNVGRGIHVKLRLRRPNRDGDF+PFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR
        HGPHNANFYKLWDELRK                                              ECEELMAKGISGTAQGFDLPGRRLGGN RQPPLSSL 
Subjt:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR

Query:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKR
        KSSLAAAEGRR LGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASS EMPVDEDCCPD PSEAAH SQAGKSGPFSNLS G+DAL QKR
Subjt:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKR

Query:  SRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLA
        SRESERSSNK+S GHLKPDFVDLSKDD IP  SADYGAESNKRHKMPD V FPKSSAETSSID S SSSNLMPSHDGTIHP ELSMWECGNCTLLNPPLA
Subjt:  SRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLA

Query:  PMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT
        PMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVST GPNLGT
Subjt:  PMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT

TrEMBL top hitse value%identityAlignment
A0A0A0K3J6 Uncharacterized protein1.1e-21080.65Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDVGDLNKVWEIKALKKAGEKEAK++LERIAKQVQPIM +HKWRVK+LS+  PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR
        HGPHNANFYKLWDELRK                                              ECEEL+AKG+SGTAQGFDLPGRRLGGN RQP LSSLR
Subjt:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR

Query:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKR
        KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS   MPVDEDCCP  PSEAAHSSQAGKSGPF NLSK VDALHQKR
Subjt:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKR

Query:  SRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLA
         RESERS NK+S G L+PDFVDLSKD+AIP  SADY AESNKRHK+PD + FP+SSAETSSID SCSSSNLM  +DGTIHP ELSMWECGNCTLLNPPLA
Subjt:  SRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLA

Query:  PMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT
        P+CELCFSQKP D+DTRYKFWSCKFCTLENSVKLEKC+ACDQWRYSHGQPVSTRGPNLGT
Subjt:  PMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT

A0A1S4E403 uncharacterized protein LOC1035012347.7e-20478.8Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDV DLNKVWEIKALKKAGEKEAK+ILERIAKQVQPIM +HKWRVK+LS+  PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR
        HGPHNANFYKLWDELRK                                              ECEEL+AKGISGTAQGFDLPGRRLGGN RQPPLSSLR
Subjt:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR

Query:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEM-------PVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGV
        KSSLAAAEGRRRL SLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS   M       PVDEDCCP  PSE AHSS+ G     +NLSKGV
Subjt:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEM-------PVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGV

Query:  DALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCT
        DALHQKRSRESERSSNK+S GHL  DFVDLSKDDAIP  SA+Y AESNKRHK+PD + FP+SSAE SSID SCSSSNLMP HDGTIHP ELSMWECGNCT
Subjt:  DALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCT

Query:  LLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT
        LLNPPLAP+CELCFSQKPKD+DTRYKFWSCKFCTLENSVKLEKC+AC QWRYSHGQPVSTRGPNLGT
Subjt:  LLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT

A0A5D3C9S9 DNA damage response protein WSS11.7e-20378.59Show/hide
Query:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
        MDV DLNKVWEIKALKKAGEKEAK+ILERIAKQVQPIM +HKWRVK+LS+  PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL
Subjt:  MDVGDLNKVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNL

Query:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR
        HGPHNANFYKLWDELRK                                              ECEEL+AKGISGTAQGFDLPGRRLGGN RQPPLSSLR
Subjt:  HGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLR

Query:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEM-------PVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGV
        KSSLAAAEGRRRL SLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS   M       PVDEDCCP  PSE AHSS+ G     +NLSKGV
Subjt:  KSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASSLEM-------PVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGV

Query:  DALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCT
        DALHQKRSRESERSSNK+S GH+  DFVDLSKDDAIP  SA+Y AESNKRHK+PD + FP+SSAE SSID SCSSSNLMP HDGTIHP ELSMWECGNCT
Subjt:  DALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCT

Query:  LLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT
        LLNPPLAP+CELCFSQKPKD+DTRYKFWSCKFCTLENSVKLEKC+AC QWRYSHGQPVSTRGPNLGT
Subjt:  LLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT

A0A6J1IF83 uncharacterized protein LOC111476695 isoform X13.4e-20473.13Show/hide
Query:  SRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMH
        S Q  I VK ISSSKFQCRFEFQMI NLL+  F+INRFRF                    MDV D+NKVWEIKAL KKAGEKEAKE+LERIAKQVQPIM 
Subjt:  SRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMH

Query:  RHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIY
        RHKWRVKILS+  PKNPALLG+NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRK                      
Subjt:  RHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIY

Query:  IHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQA
                                ECEELMAKGISGTAQGFD+PGRRLGGN+ QPPLSSLRKSSLAAAEGRRRL SLLPSGP RLGGDS+IMVALSPVQA
Subjt:  IHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQA

Query:  AAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAE
        AAMAAERRLQDDIWCASS  MPVDEDCC D  SE A   QAG+S  FSN S+G+DA H KR RE+ERSS K+S GHLKPDFV       IP  SADY AE
Subjt:  AAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAE

Query:  SNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSA
        SNKRHKM   VPFP+S AETSSID  CSSSNLMP HDGT HP ELSMWECGNCTLLNPPLAP+CELCFS K K ADT+YKFWSCKFCTLENSVKLEKCSA
Subjt:  SNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSA

Query:  CDQWRYSHGQPVSTRGPNLGT
        C QWRYSHGQPVSTRGPN+GT
Subjt:  CDQWRYSHGQPVSTRGPNLGT

A0A6J1ILI5 uncharacterized protein LOC111476695 isoform X25.9e-20473.13Show/hide
Query:  SRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMH
        S Q  I VK ISSSKFQCRFEFQMI NLL+  F+INRFRF                    MDV D+NKVWEIKAL KKAGEKEAKE+LERIAKQVQPIM 
Subjt:  SRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKAL-KKAGEKEAKEILERIAKQVQPIMH

Query:  RHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIY
        RHKWRVKILS+  PKNPALLG+NVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRK                      
Subjt:  RHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIY

Query:  IHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQA
                                ECEELMAKGISGTAQGFD+PGRRLGGN+ QPPLSSLRKSSLAAAEGRRRL SLLPSGP RLGGDS+IMVALSPVQA
Subjt:  IHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQA

Query:  AAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAE
        AAMAAERRLQDDIWCASS  MPVDEDCC D  SE A   QAG+S  FSN S+G+DA H KR RE+ERSS K+S GHLKPDFV       IP  SADY AE
Subjt:  AAMAAERRLQDDIWCASSLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAE

Query:  SNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSA
        SNKRHKM   VPFP+S AETSSID  CSSSNLMP HDGT HP ELSMWECGNCTLLNPPLAP+CELCFS K K ADT+YKFWSCKFCTLENSVKLEKCSA
Subjt:  SNKRHKMPDGVPFPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSA

Query:  CDQWRYSHGQPVSTRGPNLGT
        C QWRYSHGQPVSTRGPN+GT
Subjt:  CDQWRYSHGQPVSTRGPNLGT

SwissProt top hitse value%identityAlignment
O94580 DNA-dependent metalloprotease WSS1 homolog 27.8e-1235.48Show/hide
Query:  EIKALKKAGEKEAKEILERIAKQ--VQPIMHRHKWRVKILS--DP-----KNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGP
        E+  L    +  A   LER+     ++ IM  H+W V +LS  DP      +   LGLN  +G H++LRLR    DG F  +  V  T++HEL HN+HG 
Subjt:  EIKALKKAGEKEAKEILERIAKQ--VQPIMHRHKWRVKILS--DP-----KNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGP

Query:  HNANFYKLWDELRKTQNDIHVLSK
        H+++F++L+ +L K  +   +L K
Subjt:  HNANFYKLWDELRKTQNDIHVLSK

P38838 DNA-dependent metalloprotease WSS19.9e-1529.96Show/hide
Query:  KAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELR
        K  +++A  +++ IA +V  +M  + ++V  L +  P++  LLG+NV  G  + LRLR    +  F P   ++ TMLHEL HNL GPH+  FY   DEL 
Subjt:  KAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELR

Query:  KTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNA-----RQPPLSSLRKSSLAAAEGRR
                                    I    VI                   +G+  T  G    G+RLGG A     R P       + +    G+ 
Subjt:  KTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNA-----RQPPLSSLRKSSLAAAEGRR

Query:  -RLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASS
         +LGSL P       G S+I    SP + AA AAERR +DD WC  +
Subjt:  -RLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCASS

Q9P7B5 DNA-dependent metalloprotease WSS1 homolog7.6e-0734.26Show/hide
Query:  KVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD-PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANF
        K+  I A+K      + + L+RIA    PIM  H + V  L +   N    G N  +G  ++L LR  +    + PF  V+D  LHELCH   GPH+  F
Subjt:  KVWEIKALKKAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD-PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANF

Query:  YKLWDELR
        +     LR
Subjt:  YKLWDELR

Arabidopsis top hitse value%identityAlignment
AT1G55915.1 zinc ion binding2.9e-11049.26Show/hide
Query:  SMDVGDLNKVWEIKALK-KAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCH
        S ++ DLNKVWEIKALK K  E EA++ILE++A QVQPIM R KWRVK+LS+  P NP LLG+NV RG+ VKLRLRR N D DF  ++++LDTMLHELCH
Subjt:  SMDVGDLNKVWEIKALK-KAGEKEAKEILERIAKQVQPIMHRHKWRVKILSD--PKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCH

Query:  NLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSS
        N HGPHNA+FYKLWDELRK                                              ECEELM+KGI+GT QGFD+PG+RLGG +RQP LS 
Subjt:  NLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGISSLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSS

Query:  LRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS-SLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALH
        LR ++  AAE R R G+LLPSGP RLGGDS+IM  LSP+QAAAMAAERRL DDIWC S S +   DE+   D   E     +   S         V+   
Subjt:  LRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS-SLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALH

Query:  QKR--SRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMP-DGVP-----FPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWEC
         KR  S  +  S   +S      D +DL+++         +     KR++ P D  P      P +    SSI    +S N   S       EE +MWEC
Subjt:  QKR--SRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMP-DGVP-----FPKSSAETSSIDFSCSSSNLMPSHDGTIHPEELSMWEC

Query:  GNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT
          CTLLNP LAP+CELC + KPK+ + ++K WSCKFCTLEN VKLEKC AC QWRYS+G P+ST  PN+GT
Subjt:  GNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGCGCTCCTCATCAGGATTTCTTGCCGTGGAACGAACTCAAAACTCCCGCCAACTCCTGATCAAGGTGAAATATATCTCTTCTTCCAAATTTCAATGCCGATTCGA
GTTCCAAATGATCCCTAATCTGCTTCAAACATCCTTTCAAATTAACAGATTTCGATTCCAGCCCCGTCTTTTCGTGACCTTATTCGCTTGGGGGATCCCGAATTTTCCCT
CTCATAGTATGGATGTGGGTGATCTTAACAAAGTTTGGGAAATTAAAGCCCTGAAGAAGGCTGGAGAGAAAGAAGCAAAGGAGATTCTGGAGAGAATTGCTAAACAAGTC
CAACCAATTATGCATAGACATAAATGGCGAGTCAAGATTCTTTCGGACCCAAAAAATCCAGCACTCTTAGGGTTAAATGTGGGACGTGGAATTCACGTGAAGTTGAGGCT
TCGAAGGCCAAATAGGGATGGAGATTTCTTCCCCTTCAATCAAGTTTTGGATACAATGCTACATGAGCTTTGCCACAATCTTCATGGTCCTCACAATGCCAATTTCTACA
AGCTTTGGGATGAGCTTAGAAAGACACAGAATGATATTCATGTATTGTCAAAGGACTATGGTTTTCTTATTTGCATCCTTTCCATTTACATTCACCTTCCCGGTATAAGT
TCTCTGCATGTCATAAATGTTCATCCTGTTTTCCTGACTGACTGCAAATTTGAATGTGAGGAGTTGATGGCTAAGGGAATTAGTGGTACAGCCCAGGGATTTGATCTCCC
GGGGAGGCGTTTGGGTGGTAATGCACGTCAACCTCCTCTTTCTTCCCTCCGCAAATCTTCCCTAGCTGCTGCAGAAGGGAGAAGACGTTTGGGATCTCTACTTCCATCTG
GACCTAATCGGCTTGGTGGTGATAGCAACATCATGGTCGCACTAAGTCCTGTACAAGCAGCTGCAATGGCTGCAGAAAGGAGGCTTCAGGATGATATTTGGTGTGCTTCA
TCTCTAGAAATGCCTGTGGATGAGGATTGTTGCCCTGATCTTCCATCAGAAGCTGCGCATTCCTCCCAAGCAGGTAAATCTGGGCCATTTAGCAATTTAAGTAAGGGTGT
GGATGCATTACACCAGAAAAGAAGTCGTGAGTCAGAAAGGAGTTCTAACAAGGCTTCCTATGGTCATCTGAAACCTGATTTTGTTGATTTGTCCAAAGATGATGCCATCC
CTTGTTTTTCTGCCGACTATGGTGCTGAATCAAATAAGCGTCATAAAATGCCAGATGGAGTTCCATTTCCAAAATCTTCTGCAGAAACTAGCTCAATAGATTTCTCCTGT
TCATCCTCTAATTTGATGCCAAGTCATGATGGAACTATTCATCCAGAAGAACTTTCCATGTGGGAATGTGGAAATTGCACCTTACTGAATCCACCACTAGCTCCAATGTG
TGAGCTCTGTTTCTCACAAAAGCCAAAAGATGCTGATACCCGATACAAATTCTGGTCATGTAAATTCTGCACCTTAGAAAACAGTGTGAAGTTGGAGAAATGCTCAGCAT
GTGATCAATGGAGATATTCTCATGGCCAGCCAGTGTCGACTAGAGGACCAAATCTTGGCACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGGCGCTCCTCATCAGGATTTCTTGCCGTGGAACGAACTCAAAACTCCCGCCAACTCCTGATCAAGGTGAAATATATCTCTTCTTCCAAATTTCAATGCCGATTCGA
GTTCCAAATGATCCCTAATCTGCTTCAAACATCCTTTCAAATTAACAGATTTCGATTCCAGCCCCGTCTTTTCGTGACCTTATTCGCTTGGGGGATCCCGAATTTTCCCT
CTCATAGTATGGATGTGGGTGATCTTAACAAAGTTTGGGAAATTAAAGCCCTGAAGAAGGCTGGAGAGAAAGAAGCAAAGGAGATTCTGGAGAGAATTGCTAAACAAGTC
CAACCAATTATGCATAGACATAAATGGCGAGTCAAGATTCTTTCGGACCCAAAAAATCCAGCACTCTTAGGGTTAAATGTGGGACGTGGAATTCACGTGAAGTTGAGGCT
TCGAAGGCCAAATAGGGATGGAGATTTCTTCCCCTTCAATCAAGTTTTGGATACAATGCTACATGAGCTTTGCCACAATCTTCATGGTCCTCACAATGCCAATTTCTACA
AGCTTTGGGATGAGCTTAGAAAGACACAGAATGATATTCATGTATTGTCAAAGGACTATGGTTTTCTTATTTGCATCCTTTCCATTTACATTCACCTTCCCGGTATAAGT
TCTCTGCATGTCATAAATGTTCATCCTGTTTTCCTGACTGACTGCAAATTTGAATGTGAGGAGTTGATGGCTAAGGGAATTAGTGGTACAGCCCAGGGATTTGATCTCCC
GGGGAGGCGTTTGGGTGGTAATGCACGTCAACCTCCTCTTTCTTCCCTCCGCAAATCTTCCCTAGCTGCTGCAGAAGGGAGAAGACGTTTGGGATCTCTACTTCCATCTG
GACCTAATCGGCTTGGTGGTGATAGCAACATCATGGTCGCACTAAGTCCTGTACAAGCAGCTGCAATGGCTGCAGAAAGGAGGCTTCAGGATGATATTTGGTGTGCTTCA
TCTCTAGAAATGCCTGTGGATGAGGATTGTTGCCCTGATCTTCCATCAGAAGCTGCGCATTCCTCCCAAGCAGGTAAATCTGGGCCATTTAGCAATTTAAGTAAGGGTGT
GGATGCATTACACCAGAAAAGAAGTCGTGAGTCAGAAAGGAGTTCTAACAAGGCTTCCTATGGTCATCTGAAACCTGATTTTGTTGATTTGTCCAAAGATGATGCCATCC
CTTGTTTTTCTGCCGACTATGGTGCTGAATCAAATAAGCGTCATAAAATGCCAGATGGAGTTCCATTTCCAAAATCTTCTGCAGAAACTAGCTCAATAGATTTCTCCTGT
TCATCCTCTAATTTGATGCCAAGTCATGATGGAACTATTCATCCAGAAGAACTTTCCATGTGGGAATGTGGAAATTGCACCTTACTGAATCCACCACTAGCTCCAATGTG
TGAGCTCTGTTTCTCACAAAAGCCAAAAGATGCTGATACCCGATACAAATTCTGGTCATGTAAATTCTGCACCTTAGAAAACAGTGTGAAGTTGGAGAAATGCTCAGCAT
GTGATCAATGGAGATATTCTCATGGCCAGCCAGTGTCGACTAGAGGACCAAATCTTGGCACTTGAGAGGCTTCTGTTATATCCTTGTAAGGCTTCTAAACTACATGATAA
GTTAGCTGATTGCTAAGAAAATTAAGCTGCATCTTGCCTTTTGAAATTTGGAAGGCTTCTGTCGTTTTCCTCCCTAGAGACTAGCCATTTCTTTCAAGCATGCTTTTTTT
GACGTGTGGAAACACTTGCACATACTTATGTAAATTGTCCTCTCATGATGTAGTGAAGTTTCTTTATTCCTTTGGTTCAACTTAATATTTAATCAGAATATGGTAATGCC
TTAGAAGAGCCTGAGGAGTCTTACGGTTGAATCTTAGCAATTTTTTGTCATCCACTATGCTCATTTTTTTGAGCTCATAGAGAATTGTTCTATTGTTCGAACATATTATC
TTGCATTGTATTAAAAAG
Protein sequenceShow/hide protein sequence
MRRSSSGFLAVERTQNSRQLLIKVKYISSSKFQCRFEFQMIPNLLQTSFQINRFRFQPRLFVTLFAWGIPNFPSHSMDVGDLNKVWEIKALKKAGEKEAKEILERIAKQV
QPIMHRHKWRVKILSDPKNPALLGLNVGRGIHVKLRLRRPNRDGDFFPFNQVLDTMLHELCHNLHGPHNANFYKLWDELRKTQNDIHVLSKDYGFLICILSIYIHLPGIS
SLHVINVHPVFLTDCKFECEELMAKGISGTAQGFDLPGRRLGGNARQPPLSSLRKSSLAAAEGRRRLGSLLPSGPNRLGGDSNIMVALSPVQAAAMAAERRLQDDIWCAS
SLEMPVDEDCCPDLPSEAAHSSQAGKSGPFSNLSKGVDALHQKRSRESERSSNKASYGHLKPDFVDLSKDDAIPCFSADYGAESNKRHKMPDGVPFPKSSAETSSIDFSC
SSSNLMPSHDGTIHPEELSMWECGNCTLLNPPLAPMCELCFSQKPKDADTRYKFWSCKFCTLENSVKLEKCSACDQWRYSHGQPVSTRGPNLGT