; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014982 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014982
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGTD-binding domain-containing protein
Genome locationtig00002486:836178..841212
RNA-Seq ExpressionSgr014982
SyntenySgr014982
Gene Ontology termsGO:0006511 - ubiquitin-dependent protein catabolic process (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0080115 - myosin XI tail binding (molecular function)
InterPro domainsIPR007656 - GTD-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147055.1 probable myosin-binding protein 5 [Momordica charantia]0.0e+0077.82Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MAC+AIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSS+KIS+VL S REKFPLDSMWD EPK 
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSF-------------GYGRH-----------
        CFKSMLV  RNV+EA+VELEGE SGSSFSKTRP+G IYGDF GV + DA AGL   KEEGKFYNMGNGVSF             G G             
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSF-------------GYGRH-----------

Query:  -------FQSSSSFSRFGDNNTEDSFFSVDSGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEE
                QS SSFS FGD++TEDSFFSVDS DG EAS DNSDQ KVFP LELDDS DEKIC EMYESSV +AGNN   EL LDGNESDTIKLLE+ALEE
Subjt:  -------FQSSSSFSRFGDNNTEDSFFSVDSGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEE

Query:  EQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSI----FDGVDV
        EQ ARATL LELEKERSAAATAADEAMAMILRLQEEKASIEM+ARQ+QRMIEEKT YDAEEMSILKEILVRREREMHFLEKE+EAYRKS     +DG+DV
Subjt:  EQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSI----FDGVDV

Query:  DMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDN
        DMLDLE TPQRTPSFPYSTE+PSH+LQC +RS+S+KQD N+   SLQHETPS+ES NLTFEFGEES F Q DELAD A+AGGMPL  VAD+ KD EEIDN
Subjt:  DMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDN

Query:  ELQEKGMVEDENSYILQGEVNELEPYLQSNESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLH
        ELQEKGMVEDENSYILQGE NELEPYLQSN+SNG G+VEKS E  ADE EKVD VS DELAS+KTILP  +YNLEKNGDHQ  Q+T  LNS+T  D H H
Subjt:  ELQEKGMVEDENSYILQGEVNELEPYLQSNESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLH

Query:  DIHVIDDEASADASKKPVVNGTSSIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQE
        DIHVIDDEASA+ASK+PV NGTSS PVKCD  SFS L+SELDITRSSSDAT RFPPMA SRSNCLRSELRRNSMSAVDYER KIGNEVEWLRERLK+VQE
Subjt:  DIHVIDDEASADASKKPVVNGTSSIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQE

Query:  GREKLKLTAECKEK-NNQLQLLDNITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
        GREKLK T E +EK NNQLQLL++ITNHL EI+QLTDPGKATLQAPLPPSSK         + SLS+  SS
Subjt:  GREKLKLTAECKEK-NNQLQLLDNITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

XP_022942015.1 probable myosin-binding protein 5 [Cucurbita moschata]3.7e-27373.26Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MACEAIQ WTFNGLVAAFLDLGIAYL+LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCF K+LVDRSS+K+SSV+ SAREK PL+SM DQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD
        CFKS L+HERN  EAHVE EGE SG SF KTR PQ MIYGDF  VKES     +   K      N  +           S SSFS FGDNNTED FFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD

Query:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM
        SGD R E+SSD+SD+ KVFP        DEKI AE + +S +EAGNNCRGEL LDGNESDTIKLLEQALEEEQ ARATL LELEKERSAAATA DEAMAM
Subjt:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR
        ILRLQEEKASIEM+ARQ+QRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEA++KS F  DGVDVDMLD E TPQ  PSF YS++ PSH+LQC +R
Subjt:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR

Query:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE
        S+ DKQDAN+  RS Q+E PSIES  LTFEF +ESPFI +DE ADAAEAGGM LH  ADNF   EE DNELQE+GMVEDEN YILQGEVNELEPYLQSN 
Subjt:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE

Query:  SNGPGLVEKS--MESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS
        SNG   VEKS  +E  ADE EKVD VS D LASAKTI P VEYNLEKNGDHQK QQ +DL+ +T+ DP  HDIHVID+EA      SADASK+PV+NGTS
Subjt:  SNGPGLVEKS--MESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS

Query:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD
        S P  C S SFS LQSELDITRS+SDA+ RFPP ARSRSNCLRS+LRRNSMSAVDYER KIG+EVE LRERLK+VQE REKL+ + E K+K NNQLQLL+
Subjt:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD

Query:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
        +ITNHL+EI+ LTDPGKA LQAP PPSSK         + SLS+  SS
Subjt:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

XP_022974988.1 uncharacterized protein LOC111473854 [Cucurbita maxima]1.2e-26872.33Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MACEAIQ WTFNGLV AFLDLGIAYLLLCASSLVFFTSKFLALFGL LPCPCDGLFGNLSSDHCFQK+L DRSS+K+SSV+ SA EK PL+SMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD
        C K+ L+HERN  EAHVE EGE SG SF KTR PQ  IYGDF  VKES     +   K      N  +           S SSFS FGDNNTED FFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD

Query:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM
        SGD R E+SSDNSDQ KVFP        DEKI AE + +S +EAGNNCRGEL LD NESDTIKLLEQALEEEQ ARATL LELEKERSAAATA DEAMAM
Subjt:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR
        ILRLQEEKASIEM+ARQ+QRMIEEKTAYDAEEMSILKEILVRRERE+HFLE EVEA++KS F  DGVDVDMLD E TPQ  PSF YS++ PSH+L+C +R
Subjt:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR

Query:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE
        S+ DKQDAN+  RS Q+E PSIES   TFEF +ESPFI +DE ADAAE G M LH  ADNF   EE DNELQEKGMVEDEN YILQ EVNELEPYLQSN 
Subjt:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE

Query:  SNGPGLVEKSM--ESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS
        SNG   VEKS   E  ADE EKV  VS D LASAKTI+P VEYNLEKNGDHQK QQT+DL+S+T+ DP  HDIH+ID+EA      SADASK+PV+NGTS
Subjt:  SNGPGLVEKSM--ESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS

Query:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD
        S P  C S SFS LQSELDITRS+SDA+ RFPP ARSRSNCLRSELRRNSMSAVDYER KIG+EVE LRERLK+VQE REKL+ + E K+K NNQLQLL+
Subjt:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD

Query:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
        +ITNHL+EI+QLTDPGKA LQAP PPSSK         + SLS+  SS
Subjt:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

XP_023539873.1 uncharacterized protein LOC111800419 [Cucurbita pepo subsp. pepo]4.0e-27573.66Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MACEAIQ WTFNGLVAAFLDLGIAYL+LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQK+LVDRSS+K+SSV+ SAREK PL+SMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRP-QGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD
        CFKS L+HERN  +AHVE EGE S  SFSKTR  Q MIYGDF  VKES     +   K      N  +           S SSFS FGDNNTED FFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRP-QGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD

Query:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM
        SGD R E+SSDNSDQ KVFP        DEKI AE + +S +EAG+ CRGEL LDGNESDTIKLLEQALEEEQ ARATL LELEKERSAAATA DEAMAM
Subjt:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR
        ILRLQEEKASIEM+ARQ+QRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEA++KS F  DGVDVDMLD E TPQ  PSF YS++ PSH+LQC +R
Subjt:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR

Query:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE
        S+ DKQDAN+  RS Q+E PSIE   LTFEF +ESPFI +DE ADAAEAGGM LH  ADNF   EE DNELQEKGMVEDEN YILQGEVNELEPYLQSN 
Subjt:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE

Query:  SNGPGLVEKS--MESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS
        SNG   VEKS  +E  ADE EKVD VS D LASAKTI+P VEYNLEKNGDHQK QQ +DL+S+T+ DP  HDIHVID+EA      SADASK+PV+NGTS
Subjt:  SNGPGLVEKS--MESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS

Query:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD
        S P  C S SFS LQSELDITRS+SDA+ RFPP ARSRSNCLRSELRRNSMSAVDYER KIG+EVE LRERLK+VQE REKL+ + E K+K NNQLQLL+
Subjt:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD

Query:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
        +ITNHL+EI+QLTDPGKA LQAP PPSSK         + SLS+  SS
Subjt:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

XP_038893600.1 uncharacterized protein LOC120082482 isoform X1 [Benincasa hispida]1.3e-28675.44Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MACEAI+LWTFNGLVAAFLDLGIA+LLLCAS+LVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVL S R+KFPLDS+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSG-SSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD
        CFKS+LVHERNVKEAHVELEGE SG SSF    PQGM+YGDFP V +     GL   K      N  +           S SSFS FGDNNTED FFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGEVSG-SSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD

Query:  SGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMI
        SGD REAS DNSDQ KVFP LELDDS+D KICAEMYE+SV+EAGN CRGEL LDGNESDTIKLLEQALEEEQTARATL LELEKERSAAATAADEAMAMI
Subjt:  SGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRS
        LRLQEEKASIEM+ARQ+QRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEA+RKS F  DG  VDMLD E TP RTP  PY TE+PSH+LQC NRS
Subjt:  LRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRS

Query:  VSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNES
          D++ AN+K  SLQ+  PS+ES NLTFEFGEES  I ADE+A AA+A GM LH VADNFK SEEID ELQ KGM+EDE  YI+ GEVNEL+PYLQSNES
Subjt:  VSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNES

Query:  NGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEAS------ADASKKPVVNGTSSIP
        NG G VEK  E  ADE EKVD VS D+LA  KTI P +EYNLEKNGDHQK  +TRD++S+   DPH HDIHV++DEA       A+AS++P VNGTSSIP
Subjt:  NGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEAS------ADASKKPVVNGTSSIP

Query:  VKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLDNIT
        VKCDS SF  LQSEL+I R+SSDAT RFPPMARSRSN LRSELRRNSMSAVDYER KIGNEVEWLR RLK+VQEGREKLK + E KEK NNQLQLL+NIT
Subjt:  VKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLDNIT

Query:  NHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
        NHL+EI QL DPGK  LQAPLPPSSK         + SLS+  SS
Subjt:  NHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

TrEMBL top hitse value%identityAlignment
A0A1S4DSI6 Ubiquitinyl hydrolase 13.0e-26072.63Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MACEAI+LWTFNGLVAAFLDLGIA+LLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLS DHCFQKLLVD SSRKISSV+ S REKFPLDS+ D EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTE-DSFFSV
        CFKSMLVHER+VKEA V+LEGE SGSS  K R PQ MIYGD+P V E     G    K      N+            +S SSFS FGD+NTE D FFSV
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTE-DSFFSV

Query:  DSGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM
        DSGD REASSDNSDQ KVFP LELDDS DEKICAEM E+SV+EAGN+CR +L LDGNESDTIK LEQALEEEQ+ RATL +ELEKERSAAATAADEAMAM
Subjt:  DSGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR
        ILRLQEEKASIEM+ARQ+QRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEA R S F  DGV VDMLD E TP R PSF Y TE+P     CTN 
Subjt:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR

Query:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE
        S         +  SLQHE PS+ES  LTFEF EESP I+ADE+ADAA+A  + L  V DNFK SEEID ELQ KGM+EDEN YI+ G+V EL PYLQSNE
Subjt:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE

Query:  SNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEASA------DASKKPVVNGTSSI
        SNG G V+K  E  ADE EKVD VS D LA AKTILP      EKNGDHQ+   TRDL S+ + DPH HDIHV++DEA        + S++P+VNGTS+I
Subjt:  SNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEASA------DASKKPVVNGTSSI

Query:  PVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLDNI
        PVKCDS SFS LQSELDITRSSSDAT RFPP+ARSRS+ ++S+LRRNSMSAVDYER KIGNEVEWLR RLK+VQEGREKLK + E KEK +NQ QLL+NI
Subjt:  PVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLDNI

Query:  TNHLQEIQQLTDPGKATLQAPLPPSSK
        TNHL+EI+QLTDPGKA+L APLPPSSK
Subjt:  TNHLQEIQQLTDPGKATLQAPLPPSSK

A0A6J1CZ28 probable myosin-binding protein 50.0e+0077.82Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MAC+AIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSS+KIS+VL S REKFPLDSMWD EPK 
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSF-------------GYGRH-----------
        CFKSMLV  RNV+EA+VELEGE SGSSFSKTRP+G IYGDF GV + DA AGL   KEEGKFYNMGNGVSF             G G             
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSF-------------GYGRH-----------

Query:  -------FQSSSSFSRFGDNNTEDSFFSVDSGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEE
                QS SSFS FGD++TEDSFFSVDS DG EAS DNSDQ KVFP LELDDS DEKIC EMYESSV +AGNN   EL LDGNESDTIKLLE+ALEE
Subjt:  -------FQSSSSFSRFGDNNTEDSFFSVDSGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEE

Query:  EQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSI----FDGVDV
        EQ ARATL LELEKERSAAATAADEAMAMILRLQEEKASIEM+ARQ+QRMIEEKT YDAEEMSILKEILVRREREMHFLEKE+EAYRKS     +DG+DV
Subjt:  EQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSI----FDGVDV

Query:  DMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDN
        DMLDLE TPQRTPSFPYSTE+PSH+LQC +RS+S+KQD N+   SLQHETPS+ES NLTFEFGEES F Q DELAD A+AGGMPL  VAD+ KD EEIDN
Subjt:  DMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDN

Query:  ELQEKGMVEDENSYILQGEVNELEPYLQSNESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLH
        ELQEKGMVEDENSYILQGE NELEPYLQSN+SNG G+VEKS E  ADE EKVD VS DELAS+KTILP  +YNLEKNGDHQ  Q+T  LNS+T  D H H
Subjt:  ELQEKGMVEDENSYILQGEVNELEPYLQSNESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLH

Query:  DIHVIDDEASADASKKPVVNGTSSIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQE
        DIHVIDDEASA+ASK+PV NGTSS PVKCD  SFS L+SELDITRSSSDAT RFPPMA SRSNCLRSELRRNSMSAVDYER KIGNEVEWLRERLK+VQE
Subjt:  DIHVIDDEASADASKKPVVNGTSSIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQE

Query:  GREKLKLTAECKEK-NNQLQLLDNITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
        GREKLK T E +EK NNQLQLL++ITNHL EI+QLTDPGKATLQAPLPPSSK         + SLS+  SS
Subjt:  GREKLKLTAECKEK-NNQLQLLDNITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

A0A6J1FVE4 probable myosin-binding protein 51.8e-27373.26Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MACEAIQ WTFNGLVAAFLDLGIAYL+LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCF K+LVDRSS+K+SSV+ SAREK PL+SM DQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD
        CFKS L+HERN  EAHVE EGE SG SF KTR PQ MIYGDF  VKES     +   K      N  +           S SSFS FGDNNTED FFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD

Query:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM
        SGD R E+SSD+SD+ KVFP        DEKI AE + +S +EAGNNCRGEL LDGNESDTIKLLEQALEEEQ ARATL LELEKERSAAATA DEAMAM
Subjt:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR
        ILRLQEEKASIEM+ARQ+QRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEA++KS F  DGVDVDMLD E TPQ  PSF YS++ PSH+LQC +R
Subjt:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR

Query:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE
        S+ DKQDAN+  RS Q+E PSIES  LTFEF +ESPFI +DE ADAAEAGGM LH  ADNF   EE DNELQE+GMVEDEN YILQGEVNELEPYLQSN 
Subjt:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE

Query:  SNGPGLVEKS--MESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS
        SNG   VEKS  +E  ADE EKVD VS D LASAKTI P VEYNLEKNGDHQK QQ +DL+ +T+ DP  HDIHVID+EA      SADASK+PV+NGTS
Subjt:  SNGPGLVEKS--MESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS

Query:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD
        S P  C S SFS LQSELDITRS+SDA+ RFPP ARSRSNCLRS+LRRNSMSAVDYER KIG+EVE LRERLK+VQE REKL+ + E K+K NNQLQLL+
Subjt:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD

Query:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
        +ITNHL+EI+ LTDPGKA LQAP PPSSK         + SLS+  SS
Subjt:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

A0A6J1IHY7 uncharacterized protein LOC1114738546.0e-26972.33Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MACEAIQ WTFNGLV AFLDLGIAYLLLCASSLVFFTSKFLALFGL LPCPCDGLFGNLSSDHCFQK+L DRSS+K+SSV+ SA EK PL+SMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD
        C K+ L+HERN  EAHVE EGE SG SF KTR PQ  IYGDF  VKES     +   K      N  +           S SSFS FGDNNTED FFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD

Query:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM
        SGD R E+SSDNSDQ KVFP        DEKI AE + +S +EAGNNCRGEL LD NESDTIKLLEQALEEEQ ARATL LELEKERSAAATA DEAMAM
Subjt:  SGDGR-EASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR
        ILRLQEEKASIEM+ARQ+QRMIEEKTAYDAEEMSILKEILVRRERE+HFLE EVEA++KS F  DGVDVDMLD E TPQ  PSF YS++ PSH+L+C +R
Subjt:  ILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIF--DGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNR

Query:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE
        S+ DKQDAN+  RS Q+E PSIES   TFEF +ESPFI +DE ADAAE G M LH  ADNF   EE DNELQEKGMVEDEN YILQ EVNELEPYLQSN 
Subjt:  SVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNE

Query:  SNGPGLVEKSM--ESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS
        SNG   VEKS   E  ADE EKV  VS D LASAKTI+P VEYNLEKNGDHQK QQT+DL+S+T+ DP  HDIH+ID+EA      SADASK+PV+NGTS
Subjt:  SNGPGLVEKSM--ESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEA------SADASKKPVVNGTS

Query:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD
        S P  C S SFS LQSELDITRS+SDA+ RFPP ARSRSNCLRSELRRNSMSAVDYER KIG+EVE LRERLK+VQE REKL+ + E K+K NNQLQLL+
Subjt:  SIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLD

Query:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
        +ITNHL+EI+QLTDPGKA LQAP PPSSK         + SLS+  SS
Subjt:  NITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

A0A6J1IVQ3 uncharacterized protein LOC111479667 isoform X14.4e-25670.47Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        MACEAIQLWTFNGLVAAFLDLGIA+LLLCA+SLVFFTSKFLALFG CLPCPCDGLFG+L SDHCFQKLLVD SS+KISSVL S REKFPLDSMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD
        CFKSM VH+RNVKEA VE + E SG S+ KTR P+GMIYGD   + ES    G+   K      N              S SSF  FGDNN ED FFSVD
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTR-PQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVD

Query:  SGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMI
        SGD  EAS DNS+Q KVFP+LELDDS+DEKICAEMY +S +EA NNCRGE  LDGNESD IKLLEQ+LEEEQ ARATL LELEKERSAAATAADEAMAMI
Subjt:  SGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMI

Query:  LRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIFD--GVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRS
        LRLQEEKASIEM ARQ+QRMIEEKTAYDAEEMSILKEILVRRE+EMHFL+KEV A+R+S FD  GV VDMLD E TP   PS PY TE+PSH+LQC N S
Subjt:  LRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIFD--GVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRS

Query:  VSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNES
        ++DKQ          HE PS+ES NL FEFGEESP IQA E ADAA+A GM LH VADNF+  EEID ELQ KGMVED+N YI+ GEVNELEPY +SN S
Subjt:  VSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNES

Query:  NGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDD------EASADASKKPVVNGTSSIP
        N  G VE+  E   DE EKV   S D L SA+T LP VEYNLEK  DHQK Q TRD  S+ D D   HDIHVI+D      EASA+A ++ +VNG+SSIP
Subjt:  NGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDD------EASADASKKPVVNGTSSIP

Query:  VKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLDNIT
        V CDS+SFS LQ+ELDITRSSSDAT RFPPMARSRSN LRSELRRNSMSAVDYER KIGNEVEWLR RLK+VQE REK KL+ E KEK NNQLQLL+NIT
Subjt:  VKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEK-NNQLQLLDNIT

Query:  NHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS
              + L+DPGKA LQAPLPPSSK         + SL +  SS
Subjt:  NHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSS

SwissProt top hitse value%identityAlignment
F4HVS6 Probable myosin-binding protein 61.2e-1142.86Show/hide
Query:  DGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEV
        D      +  L++ +  ++ +   L +EL++ERSA+A AA+EAMAMI RLQ EKA+++M A Q+QRM++E+  YD E +  +   L +RE EM  LE E 
Subjt:  DGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEV

Query:  EAYRK
        E YR+
Subjt:  EAYRK

F4HXQ7 Myosin-binding protein 19.9e-1146.81Show/hide
Query:  LEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYR
        L++ ++ ++     L  ELE+ERSA+A A ++AMAMI RLQEEKAS +M A Q+ RM+EE+  YD E +  L ++LV RE+ +  LE E+E +R
Subjt:  LEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYR

Q0WNW4 Myosin-binding protein 33.9e-1525.35Show/hide
Query:  SSVDEAGNNCRGELRLDGNES-DTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILK
        ++ ++AG+       +DG +   TI+ L + +  EQ A   L  ELE+ERSA+A +A++ MAMI RLQEEKA ++M A Q+QRM+EE+  YD E + +L 
Subjt:  SSVDEAGNNCRGELRLDGNES-DTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILK

Query:  EILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETP-SIESHNLTFEFGEESPFI
         ++V+RE+E   L++E+E YR  + +                    Y ++  + I+   N   +D  D   +NR   + +   ++   +T +  +    +
Subjt:  EILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETP-SIESHNLTFEFGEESPFI

Query:  QADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVED----ENSY----------ILQGEVNELEPYLQS----NESNGPGLVEKSMESFADEHEK
          + L++  E       ++ D  K  E+    +Q+K   ED     NSY           +      L P L +    +E    GL E   ++F  + EK
Subjt:  QADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVED----ENSY----------ILQGEVNELEPYLQS----NESNGPGLVEKSMESFADEHEK

Query:  VDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVID
        ++ +   +    +      +    KN     ++  +  + L D   HL D+  I+
Subjt:  VDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVID

Q9CAC4 Myosin-binding protein 21.1e-1444.96Show/hide
Query:  EMYESSVDEAGNNCRGELRLDGNESD----TIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDA
        E   SS+  A  +    + +DG E      T+  L+  L+EE+ A   L  ELE ER+A+A AA E MAMI RL EEKA+++M A Q+QRM+EE+  +D 
Subjt:  EMYESSVDEAGNNCRGELRLDGNESD----TIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDA

Query:  EEMSILKEILVRREREMHFLEKEVEAYRK
        E + +L E++V RE+E   LEKE+E YRK
Subjt:  EEMSILKEILVRREREMHFLEKEVEAYRK

Q9LMC8 Probable myosin-binding protein 57.6e-1138.04Show/hide
Query:  SVDSGDGREASSDNSDQGKVFPELELDDSHDEK---ICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAAD
        S+D+ D R  S      G  F  + L DS           M +S VD+  N    E+ LDG+    ++ L + +  ++ +   L +EL++ERSA+A AA+
Subjt:  SVDSGDGREASSDNSDQGKVFPELELDDSHDEK---ICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAAD

Query:  EAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYR
         AMAMI RLQ EKA+++M A Q+QRM++E+  YD E +  +  +LV+RE EM  LE  +E YR
Subjt:  EAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYR

Arabidopsis top hitse value%identityAlignment
AT1G04890.1 Protein of unknown function, DUF5932.0e-2731.66Show/hide
Query:  RHFQSSSSFSRFGDNNTEDSFFSVDSGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGN-ESDTIKLLEQALEEEQTA
        R   SS +F      N    +    S D      D  D GK      L  SH  +   E   S   +  +N   E     N E  +++ LE+ L+EE+ A
Subjt:  RHFQSSSSFSRFGDNNTEDSFFSVDSGDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGN-ESDTIKLLEQALEEEQTA

Query:  RATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGT
        RAT+ +EL+KERSAAA+AADEAMAMI RLQ+EKA+IEM ARQ QR++EE++ +DAEEM ILK+IL+RRERE HFLEKEVEAYR+ +      +  +LE +
Subjt:  RATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGT

Query:  PQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMV
          +  + P    EP H           KQ+                                                      KD +E       + ++
Subjt:  PQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMV

Query:  EDENSYILQGEVNELEPYLQSNESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDE
          E    L G V ++ PY +                                          E N +KN D  K            A   + D++++ DE
Subjt:  EDENSYILQGEVNELEPYLQSNESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDE

Query:  ASADASKKPVVNGTSSIPVK-CDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSE-LRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKL
            + KK +   +   P +  D  S              S    + PP+ R R   L S   RR SMSAVDYERLKI NEVE LRERLK VQE RE+L
Subjt:  ASADASKKPVVNGTSSIPVK-CDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSE-LRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKL

AT4G13160.1 Protein of unknown function, DUF5935.9e-2765.66Show/hide
Query:  DTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYR
        D ++LLE A+E+E+ A+A L +ELE+ER+A+A+AADEAMAMILRLQ +KAS+EM  +Q++RMI+EK AYD EEM+ILKEIL +RERE HFLEKE+E Y+
Subjt:  DTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYR

AT4G13160.1 Protein of unknown function, DUF5934.4e-0643.75Show/hide
Query:  TFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFP
        TF G++ AF++L  AY LLC S+ VF TSK L    L +PC      G  +SD C QKLL D   R I  V + A    P
Subjt:  TFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFP

AT4G13630.1 Protein of unknown function, DUF5931.9e-6533.85Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        M C+ ++ WTF GLVAAF+DL +A+ LLCAS +V+ TSKFL LFGL LPCPCDGL+       CFQ+ L +   +KISSV +S + + P DS+     K 
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVDS
                +R  +   V+LE EVS ++ S  + +    G F  +       G   VK +   ++      +G+  H+QS      F  +  E+    V+S
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVDS

Query:  GDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNC---RGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMA
         D  +A  D S                          SV  +   C    G     G    T+++ EQ L EE+ ARA+L LELEKER+AAA+AADEA+ 
Subjt:  GDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNC---RGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMA

Query:  MILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHI--LQCTN
        MILRLQEEKASIEM ARQ+QRMIEEK+A+DAEEMSILKEIL+RRERE HFLEKEV+ YR+   +            P  TP       +P+ I  LQ   
Subjt:  MILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHI--LQCTN

Query:  RSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSN
        +      D    N S   E  + +  N   +     PF   D L+D                        EL+E+           +GE   L   L S 
Subjt:  RSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSN

Query:  ESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEASADASKKPVVNGTSSIPVKCD
         S+    +  S +   D H                                            D D H+HDIHV+ DE +      P  + T  +     
Subjt:  ESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEASADASKKPVVNGTSSIPVKCD

Query:  STSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEKNNQLQLLDNITNHLQE
                 +LD ++S SD +  FP   + +SN + + +RRNSMSA+DYERLKI +EV  LR RL+ VQ+GREK+  +++ + K +Q+Q   + T+   E
Subjt:  STSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEKNNQLQLLDNITNHLQE

Query:  IQQ
         ++
Subjt:  IQQ

AT4G13630.2 Protein of unknown function, DUF5931.9e-6533.85Show/hide
Query:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC
        M C+ ++ WTF GLVAAF+DL +A+ LLCAS +V+ TSKFL LFGL LPCPCDGL+       CFQ+ L +   +KISSV +S + + P DS+     K 
Subjt:  MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKC

Query:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVDS
                +R  +   V+LE EVS ++ S  + +    G F  +       G   VK +   ++      +G+  H+QS      F  +  E+    V+S
Subjt:  CFKSMLVHERNVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVDS

Query:  GDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNC---RGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMA
         D  +A  D S                          SV  +   C    G     G    T+++ EQ L EE+ ARA+L LELEKER+AAA+AADEA+ 
Subjt:  GDGREASSDNSDQGKVFPELELDDSHDEKICAEMYESSVDEAGNNC---RGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMA

Query:  MILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHI--LQCTN
        MILRLQEEKASIEM ARQ+QRMIEEK+A+DAEEMSILKEIL+RRERE HFLEKEV+ YR+   +            P  TP       +P+ I  LQ   
Subjt:  MILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHI--LQCTN

Query:  RSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSN
        +      D    N S   E  + +  N   +     PF   D L+D                        EL+E+           +GE   L   L S 
Subjt:  RSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSN

Query:  ESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEASADASKKPVVNGTSSIPVKCD
         S+    +  S +   D H                                            D D H+HDIHV+ DE +      P  + T  +     
Subjt:  ESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVIDDEASADASKKPVVNGTSSIPVKCD

Query:  STSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEKNNQLQLLDNITNHLQE
                 +LD ++S SD +  FP   + +SN + + +RRNSMSA+DYERLKI +EV  LR RL+ VQ+GREK+  +++ + K +Q+Q   + T+   E
Subjt:  STSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERLKVVQEGREKLKLTAECKEKNNQLQLLDNITNHLQE

Query:  IQQ
         ++
Subjt:  IQQ

AT5G16720.1 Protein of unknown function, DUF5932.8e-1625.35Show/hide
Query:  SSVDEAGNNCRGELRLDGNES-DTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILK
        ++ ++AG+       +DG +   TI+ L + +  EQ A   L  ELE+ERSA+A +A++ MAMI RLQEEKA ++M A Q+QRM+EE+  YD E + +L 
Subjt:  SSVDEAGNNCRGELRLDGNES-DTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEEMSILK

Query:  EILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETP-SIESHNLTFEFGEESPFI
         ++V+RE+E   L++E+E YR  + +                    Y ++  + I+   N   +D  D   +NR   + +   ++   +T +  +    +
Subjt:  EILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETP-SIESHNLTFEFGEESPFI

Query:  QADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVED----ENSY----------ILQGEVNELEPYLQS----NESNGPGLVEKSMESFADEHEK
          + L++  E       ++ D  K  E+    +Q+K   ED     NSY           +      L P L +    +E    GL E   ++F  + EK
Subjt:  QADELADAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVED----ENSY----------ILQGEVNELEPYLQS----NESNGPGLVEKSMESFADEHEK

Query:  VDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVID
        ++ +   +    +      +    KN     ++  +  + L D   HL D+  I+
Subjt:  VDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQTRDLNSLTDADPHLHDIHVID


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTGCGAAGCTATACAACTGTGGACATTTAACGGCTTAGTGGCTGCATTTCTTGATCTTGGCATAGCTTATCTTTTATTATGTGCATCGAGTCTTGTTTTCTTTAC
CTCCAAATTTCTTGCACTGTTTGGATTATGTCTGCCTTGCCCTTGTGATGGGCTGTTTGGGAACCTTAGTAGTGATCACTGCTTCCAGAAGTTACTAGTGGATCGTTCAT
CCAGAAAAATATCTTCAGTCCTACAGTCCGCTAGGGAAAAGTTCCCATTGGATTCCATGTGGGATCAAGAGCCAAAATGTTGTTTTAAGTCAATGTTGGTGCACGAGAGG
AATGTCAAGGAGGCACACGTTGAATTGGAAGGTGAAGTATCAGGCAGCTCATTTTCTAAAACCAGACCACAAGGTATGATTTATGGAGACTTTCCCGGTGTAAAAGAGTC
GGATGCCACGGCGGGCTTGACTTCCGTGAAGGAAGAAGGAAAGTTTTATAATATGGGGAATGGAGTTTCCTTTGGATATGGAAGGCATTTTCAGTCTTCTTCAAGCTTCA
GTAGATTTGGAGATAATAATACTGAGGACAGCTTCTTTTCTGTTGATTCTGGAGATGGAAGAGAGGCTTCGTCAGACAATAGTGATCAAGGTAAAGTATTTCCCGAACTT
GAACTGGATGATTCTCATGATGAGAAAATATGCGCAGAGATGTATGAATCATCTGTTGACGAGGCTGGGAACAACTGCAGAGGGGAGTTACGCTTGGATGGTAATGAGAG
TGATACAATCAAACTATTGGAACAAGCACTTGAAGAAGAGCAGACGGCTCGTGCTACCCTGGACCTGGAGCTTGAGAAAGAGAGAAGTGCTGCTGCCACTGCTGCTGATG
AGGCCATGGCTATGATATTACGTCTTCAAGAGGAGAAGGCATCAATAGAAATGAATGCTAGGCAACACCAGAGGATGATAGAGGAGAAAACAGCTTATGATGCCGAGGAA
ATGAGTATTCTTAAAGAAATTTTAGTGAGGAGAGAGCGGGAAATGCATTTCCTGGAGAAGGAAGTTGAGGCTTATCGGAAAAGTATCTTTGATGGAGTGGATGTTGATAT
GCTTGACTTGGAAGGTACACCCCAAAGGACGCCTTCTTTCCCCTATTCAACTGAAGAGCCATCTCATATTCTTCAATGTACTAATAGATCAGTTAGTGATAAGCAAGATG
CAAATTTCAAAAATCGTTCTTTGCAACATGAGACCCCATCGATTGAGTCACACAACTTAACTTTTGAGTTTGGGGAAGAGTCGCCATTTATTCAAGCAGATGAACTTGCT
GATGCTGCAGAAGCTGGGGGCATGCCGTTGCATCTAGTTGCTGATAACTTCAAGGATAGTGAAGAGATTGACAATGAGTTACAAGAGAAGGGCATGGTAGAAGATGAGAA
CTCATACATTCTACAAGGAGAAGTAAATGAACTGGAGCCATATCTGCAAAGCAACGAGTCAAATGGTCCGGGTCTGGTTGAGAAATCCATGGAATCGTTTGCAGATGAAC
ATGAAAAAGTTGACGGAGTTTCAGATGATGAGTTGGCATCGGCTAAAACAATTCTTCCCTGGGTTGAGTACAATTTGGAAAAGAATGGTGACCATCAAAAGCAGCAGCAG
ACAAGAGATCTCAACTCTTTGACTGATGCAGATCCCCATCTTCACGATATTCATGTCATTGATGATGAAGCAAGTGCTGATGCAAGCAAAAAACCGGTGGTCAATGGTAC
CTCGAGTATTCCTGTAAAATGTGATAGTACATCCTTCAGCTGGTTGCAGAGTGAACTAGACATCACAAGAAGCAGCTCAGATGCCACTGATAGATTTCCACCAATGGCTC
GTTCTCGAAGCAACTGCTTGCGTTCTGAATTGCGGAGAAACTCGATGTCTGCTGTTGATTATGAAAGGTTGAAAATTGGCAATGAAGTCGAATGGCTCAGAGAAAGGTTA
AAGGTTGTTCAGGAGGGAAGGGAAAAACTCAAGCTCACCGCAGAATGCAAAGAGAAGAACAATCAGCTGCAACTTTTAGACAATATAACAAATCATCTTCAGGAAATTCA
ACAACTGACAGATCCCGGAAAGGCAACACTGCAGGCTCCATTGCCTCCATCCTCTAAGCCCTTTATTACTCTTACTACTGCTGAAACTCTCTCTCTCTCTCTCTCTCTCT
CTTCTCTCTCTCCTCATCAGAAGCCCTCAGGCCTCACTAATCTACCTCTGAGCATCTCCATCTTCCGGATTGAGAAGAGTTTCAGAAGCAGCAGCGGGAATGCGCACGCC
TCGCTTTCCCCTTCTCCATTTCCATCTCAACCTCTTGAGATCGTAAAGCTTAAATCTTCAATATTCAATCTTCCTTCCCCCATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTGCGAAGCTATACAACTGTGGACATTTAACGGCTTAGTGGCTGCATTTCTTGATCTTGGCATAGCTTATCTTTTATTATGTGCATCGAGTCTTGTTTTCTTTAC
CTCCAAATTTCTTGCACTGTTTGGATTATGTCTGCCTTGCCCTTGTGATGGGCTGTTTGGGAACCTTAGTAGTGATCACTGCTTCCAGAAGTTACTAGTGGATCGTTCAT
CCAGAAAAATATCTTCAGTCCTACAGTCCGCTAGGGAAAAGTTCCCATTGGATTCCATGTGGGATCAAGAGCCAAAATGTTGTTTTAAGTCAATGTTGGTGCACGAGAGG
AATGTCAAGGAGGCACACGTTGAATTGGAAGGTGAAGTATCAGGCAGCTCATTTTCTAAAACCAGACCACAAGGTATGATTTATGGAGACTTTCCCGGTGTAAAAGAGTC
GGATGCCACGGCGGGCTTGACTTCCGTGAAGGAAGAAGGAAAGTTTTATAATATGGGGAATGGAGTTTCCTTTGGATATGGAAGGCATTTTCAGTCTTCTTCAAGCTTCA
GTAGATTTGGAGATAATAATACTGAGGACAGCTTCTTTTCTGTTGATTCTGGAGATGGAAGAGAGGCTTCGTCAGACAATAGTGATCAAGGTAAAGTATTTCCCGAACTT
GAACTGGATGATTCTCATGATGAGAAAATATGCGCAGAGATGTATGAATCATCTGTTGACGAGGCTGGGAACAACTGCAGAGGGGAGTTACGCTTGGATGGTAATGAGAG
TGATACAATCAAACTATTGGAACAAGCACTTGAAGAAGAGCAGACGGCTCGTGCTACCCTGGACCTGGAGCTTGAGAAAGAGAGAAGTGCTGCTGCCACTGCTGCTGATG
AGGCCATGGCTATGATATTACGTCTTCAAGAGGAGAAGGCATCAATAGAAATGAATGCTAGGCAACACCAGAGGATGATAGAGGAGAAAACAGCTTATGATGCCGAGGAA
ATGAGTATTCTTAAAGAAATTTTAGTGAGGAGAGAGCGGGAAATGCATTTCCTGGAGAAGGAAGTTGAGGCTTATCGGAAAAGTATCTTTGATGGAGTGGATGTTGATAT
GCTTGACTTGGAAGGTACACCCCAAAGGACGCCTTCTTTCCCCTATTCAACTGAAGAGCCATCTCATATTCTTCAATGTACTAATAGATCAGTTAGTGATAAGCAAGATG
CAAATTTCAAAAATCGTTCTTTGCAACATGAGACCCCATCGATTGAGTCACACAACTTAACTTTTGAGTTTGGGGAAGAGTCGCCATTTATTCAAGCAGATGAACTTGCT
GATGCTGCAGAAGCTGGGGGCATGCCGTTGCATCTAGTTGCTGATAACTTCAAGGATAGTGAAGAGATTGACAATGAGTTACAAGAGAAGGGCATGGTAGAAGATGAGAA
CTCATACATTCTACAAGGAGAAGTAAATGAACTGGAGCCATATCTGCAAAGCAACGAGTCAAATGGTCCGGGTCTGGTTGAGAAATCCATGGAATCGTTTGCAGATGAAC
ATGAAAAAGTTGACGGAGTTTCAGATGATGAGTTGGCATCGGCTAAAACAATTCTTCCCTGGGTTGAGTACAATTTGGAAAAGAATGGTGACCATCAAAAGCAGCAGCAG
ACAAGAGATCTCAACTCTTTGACTGATGCAGATCCCCATCTTCACGATATTCATGTCATTGATGATGAAGCAAGTGCTGATGCAAGCAAAAAACCGGTGGTCAATGGTAC
CTCGAGTATTCCTGTAAAATGTGATAGTACATCCTTCAGCTGGTTGCAGAGTGAACTAGACATCACAAGAAGCAGCTCAGATGCCACTGATAGATTTCCACCAATGGCTC
GTTCTCGAAGCAACTGCTTGCGTTCTGAATTGCGGAGAAACTCGATGTCTGCTGTTGATTATGAAAGGTTGAAAATTGGCAATGAAGTCGAATGGCTCAGAGAAAGGTTA
AAGGTTGTTCAGGAGGGAAGGGAAAAACTCAAGCTCACCGCAGAATGCAAAGAGAAGAACAATCAGCTGCAACTTTTAGACAATATAACAAATCATCTTCAGGAAATTCA
ACAACTGACAGATCCCGGAAAGGCAACACTGCAGGCTCCATTGCCTCCATCCTCTAAGCCCTTTATTACTCTTACTACTGCTGAAACTCTCTCTCTCTCTCTCTCTCTCT
CTTCTCTCTCTCCTCATCAGAAGCCCTCAGGCCTCACTAATCTACCTCTGAGCATCTCCATCTTCCGGATTGAGAAGAGTTTCAGAAGCAGCAGCGGGAATGCGCACGCC
TCGCTTTCCCCTTCTCCATTTCCATCTCAACCTCTTGAGATCGTAAAGCTTAAATCTTCAATATTCAATCTTCCTTCCCCCATGTGA
Protein sequenceShow/hide protein sequence
MACEAIQLWTFNGLVAAFLDLGIAYLLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSRKISSVLQSAREKFPLDSMWDQEPKCCFKSMLVHER
NVKEAHVELEGEVSGSSFSKTRPQGMIYGDFPGVKESDATAGLTSVKEEGKFYNMGNGVSFGYGRHFQSSSSFSRFGDNNTEDSFFSVDSGDGREASSDNSDQGKVFPEL
ELDDSHDEKICAEMYESSVDEAGNNCRGELRLDGNESDTIKLLEQALEEEQTARATLDLELEKERSAAATAADEAMAMILRLQEEKASIEMNARQHQRMIEEKTAYDAEE
MSILKEILVRREREMHFLEKEVEAYRKSIFDGVDVDMLDLEGTPQRTPSFPYSTEEPSHILQCTNRSVSDKQDANFKNRSLQHETPSIESHNLTFEFGEESPFIQADELA
DAAEAGGMPLHLVADNFKDSEEIDNELQEKGMVEDENSYILQGEVNELEPYLQSNESNGPGLVEKSMESFADEHEKVDGVSDDELASAKTILPWVEYNLEKNGDHQKQQQ
TRDLNSLTDADPHLHDIHVIDDEASADASKKPVVNGTSSIPVKCDSTSFSWLQSELDITRSSSDATDRFPPMARSRSNCLRSELRRNSMSAVDYERLKIGNEVEWLRERL
KVVQEGREKLKLTAECKEKNNQLQLLDNITNHLQEIQQLTDPGKATLQAPLPPSSKPFITLTTAETLSLSLSLSSLSPHQKPSGLTNLPLSISIFRIEKSFRSSSGNAHA
SLSPSPFPSQPLEIVKLKSSIFNLPSPM