; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0015687 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0015687
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionGTD-binding domain-containing protein
Genome locationchr01:20780148..20784216
RNA-Seq ExpressionPI0015687
SyntenyPI0015687
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0080115 - myosin XI tail binding (molecular function)
InterPro domainsIPR007656 - GTD-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK30690.1 putative myosin-binding protein 5 isoform X2 [Cucumis melo var. makuwa]5.0e-24191.91Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGF LPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDL  MDD+Q+
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
        GNSNRNLL ENGISRLQSEVCCST PRLQN+VDK  EYDGK KK+MYQKPRTKIRRRRRAAVDNGKLSKGI EG ETRKEREFVALVERQDFITDDGNES
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
        NH + GQRIWQGFESSGSLGENNY  KGSSTVGQ  SNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
Subjt:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS

Query:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK
        FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQ+D AEKEQLKGNRD+I DEHKELSATAH SNGDPPI +PIGNA+SL R  K
Subjt:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK

Query:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN
        LNELRDNSLL D  AIEAAPHCGGFEKSFLSRGALQNL+HITHAV+DLGGSI+DMEIDVQDIHVIDEKLHMEDTK ERKVDN
Subjt:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN

XP_004139633.1 probable myosin-binding protein 5 [Cucumis sativus]1.1e-24090.89Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        MSFHEIHSWTLSG+VRAFLDLAVVYFLLCVSATMFIPSKILKVVGF LPCPCTGFYGNHNTNLC HKLVVNWPKRKIYLVLDLVKN FPFDLILMDD+++
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
        GNSNRNLL ENGISRLQSEVCCSTAPRLQNLVD D E+DGKGKKIMYQKPRTKIRRRRRA +DNGKLSKG+ E  ETRK REFVALVERQDFITDDGNES
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
        NHL+ GQRIWQGFESSGSLGEN+Y  KGSSTVGQGTS AEER IIRNEASTIRLLELALEEER ARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
Subjt:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS

Query:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK
        FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQ+D AE+E LKGN+DFI DEH ELSATAHYSNGDPPIV+PIGNA+SLSR AK
Subjt:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK

Query:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTK-EERKVDN
        LNELRDNSLL DHIAI+AAPHCGGFEKSFLSRGALQNLEHITHAV+DLG SILDMEIDVQDIHVIDEKLHME TK E RK DN
Subjt:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTK-EERKVDN

XP_008458006.1 PREDICTED: uncharacterized protein LOC103497547 isoform X1 [Cucumis melo]9.2e-24391.91Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGF LPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDL  MD +Q+
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
        GNSNRN+L ENGISRLQSEVCCST PRLQN+VDK  EYDGKGKK+MYQKPRTKIRRRRRAAVDNGKLSKGI EG ETRKEREFVALVERQDFITDDGNES
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
        NH + GQRIWQGFESSGSLGENNY  KGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
Subjt:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS

Query:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK
        FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQ+D AEKEQLKGNRD+I DEHKELS TAH SNGDPPI +PIGNA+SL R  K
Subjt:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK

Query:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN
        LNELRDNSLL D  AIEAAPHCGGFEKSFLSRGALQNL+HITHAV+DLGGSI+DMEIDVQDIHVIDEKLHMEDTK ERKVDN
Subjt:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN

XP_038877576.1 uncharacterized protein LOC120069830 isoform X1 [Benincasa hispida]1.1e-21980.23Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        MSFHEIHSWTLSGL+RAFLDL VVYFLLCVSAT+FIPSKILK+VGF LPCPC+GFYGN+N NLC H+L+VNWPKRKIYLVLDLVKNRFPFDLILM D+QM
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTEN----GISRLQSEVCCST--APRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFIT
         NSNRNLL EN    GIS+ QSEVCCST  APRLQNLVDKD EYDGKGK+IMYQ+P+TKIRRRRRAA+DNGKLSKGI EG ET KEREFVALVERQDFIT
Subjt:  GNSNRNLLTEN----GISRLQSEVCCST--APRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFIT

Query:  DDGNESNHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRL
        DD NESNH++ GQR WQGFESSGS GENN+  K SST+GQGTSNAEERDIIRNEAS+IRLLE ALEEERAARASLFVELEEERAAAATAADEAIAMITRL
Subjt:  DDGNESNHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRL

Query:  QNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAIS
        QNEKAS EMEARQYHR +EEKF+YDEE++NILREILVKRDIDYHVLEKEIEAYRQ+D +EKEQLK N DF+ DEH E S TAHYSNGDPPIVH IGNAIS
Subjt:  QNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAIS

Query:  LSRMAKL---------------------------NELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQ-NLEHITHAVDDLGGSILDMEIDVQDIHVID
         SR AKL                           NEL+DNSLLCDHIAIEAAP CGGF+KSFLSRGALQ +LEH+ HAV+DL  SILDMEIDVQDIHVID
Subjt:  LSRMAKL---------------------------NELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQ-NLEHITHAVDDLGGSILDMEIDVQDIHVID

Query:  EKLHMEDTKEERKVDN
        EKLHMEDTK+E+KVDN
Subjt:  EKLHMEDTKEERKVDN

XP_038877577.1 uncharacterized protein LOC120069830 isoform X2 [Benincasa hispida]5.2e-21479.03Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        MSFHEIHSWTLSGL+RAFLDL VVYFLLCVSAT+FIPSKILK+VGF LPCPC+GFYGN+N NLC H+L+VNWPKRKIYLVLDLVKNRFPFDLILM D+QM
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTEN----GISRLQSEVCCST--APRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFIT
         NSNRNLL EN    GIS+ QSEVCCST  APRLQNLVDKD EYDGKGK+IMYQ+P+TKIRRRRRAA+DNGKLSKGI EG ET KEREFVALVERQDFIT
Subjt:  GNSNRNLLTEN----GISRLQSEVCCST--APRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFIT

Query:  DDGNESNHLEGQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQ
                  GQR WQGFESSGS GENN+  K SST+GQGTSNAEERDIIRNEAS+IRLLE ALEEERAARASLFVELEEERAAAATAADEAIAMITRLQ
Subjt:  DDGNESNHLEGQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQ

Query:  NEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISL
        NEKAS EMEARQYHR +EEKF+YDEE++NILREILVKRDIDYHVLEKEIEAYRQ+D +EKEQLK N DF+ DEH E S TAHYSNGDPPIVH IGNAIS 
Subjt:  NEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISL

Query:  SRMAKL---------------------------NELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQ-NLEHITHAVDDLGGSILDMEIDVQDIHVIDE
        SR AKL                           NEL+DNSLLCDHIAIEAAP CGGF+KSFLSRGALQ +LEH+ HAV+DL  SILDMEIDVQDIHVIDE
Subjt:  SRMAKL---------------------------NELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQ-NLEHITHAVDDLGGSILDMEIDVQDIHVIDE

Query:  KLHMEDTKEERKVDN
        KLHMEDTK+E+KVDN
Subjt:  KLHMEDTKEERKVDN

TrEMBL top hitse value%identityAlignment
A0A0A0K8C0 GTD-binding domain-containing protein5.4e-24190.89Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        MSFHEIHSWTLSG+VRAFLDLAVVYFLLCVSATMFIPSKILKVVGF LPCPCTGFYGNHNTNLC HKLVVNWPKRKIYLVLDLVKN FPFDLILMDD+++
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
        GNSNRNLL ENGISRLQSEVCCSTAPRLQNLVD D E+DGKGKKIMYQKPRTKIRRRRRA +DNGKLSKG+ E  ETRK REFVALVERQDFITDDGNES
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
        NHL+ GQRIWQGFESSGSLGEN+Y  KGSSTVGQGTS AEER IIRNEASTIRLLELALEEER ARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
Subjt:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS

Query:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK
        FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQ+D AE+E LKGN+DFI DEH ELSATAHYSNGDPPIV+PIGNA+SLSR AK
Subjt:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK

Query:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTK-EERKVDN
        LNELRDNSLL DHIAI+AAPHCGGFEKSFLSRGALQNLEHITHAV+DLG SILDMEIDVQDIHVIDEKLHME TK E RK DN
Subjt:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTK-EERKVDN

A0A1S3C7F2 uncharacterized protein LOC103497547 isoform X14.4e-24391.91Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGF LPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDL  MD +Q+
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
        GNSNRN+L ENGISRLQSEVCCST PRLQN+VDK  EYDGKGKK+MYQKPRTKIRRRRRAAVDNGKLSKGI EG ETRKEREFVALVERQDFITDDGNES
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
        NH + GQRIWQGFESSGSLGENNY  KGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
Subjt:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS

Query:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK
        FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQ+D AEKEQLKGNRD+I DEHKELS TAH SNGDPPI +PIGNA+SL R  K
Subjt:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK

Query:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN
        LNELRDNSLL D  AIEAAPHCGGFEKSFLSRGALQNL+HITHAV+DLGGSI+DMEIDVQDIHVIDEKLHMEDTK ERKVDN
Subjt:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN

A0A1S4E1X3 probable myosin-binding protein 5 isoform X28.8e-18391.05Show/hide
Query:  SNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNESNH
        S RN+L ENGISRLQSEVCCST PRLQN+VDK  EYDGKGKK+MYQKPRTKIRRRRRAAVDNGKLSKGI EG ETRKEREFVALVERQDFITDDGNESNH
Subjt:  SNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNESNH

Query:  LE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFE
         + GQRIWQGFESSGSLGENNY  KGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFE
Subjt:  LE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFE

Query:  MEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAKLN
        MEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQ+D AEKEQLKGNRD+I DEHKELS TAH SNGDPPI +PIGNA+SL R  KLN
Subjt:  MEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAKLN

Query:  ELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN
        ELRDNSLL D  AIEAAPHCGGFEKSFLSRGALQNL+HITHAV+DLGGSI+DMEIDVQDIHVIDEKLHMEDTK ERKVDN
Subjt:  ELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN

A0A5A7SKD5 Putative myosin-binding protein 5 isoform X25.5e-18590.72Show/hide
Query:  MDDDQMGNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFIT
        MDD+Q+GNSNRNLL ENGISRLQSEVCCST PRLQN+VDK  EYDGK KK+MYQKPRTKIRRRRRAAVDNGKLSKGI EG ETRKEREFVALVERQDFIT
Subjt:  MDDDQMGNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFIT

Query:  DDGNESNHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRL
        DDGNESNH + GQRIWQGFESSGSLGENNY  KGSSTVGQ  SNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRL
Subjt:  DDGNESNHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRL

Query:  QNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAIS
        QNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQ+D AEKEQLKGNRD+I DEHKELSATAH SNGDPPI +PIGNA+S
Subjt:  QNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAIS

Query:  LSRMAKLNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN
        L R  KLNELRDNSLL D  AIEAAPHCGGFEKSFLSRGALQNL+HITHAV+DLGGSI+DMEIDVQDIHVIDEKLHMEDTK ERKVDN
Subjt:  LSRMAKLNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN

A0A5D3E523 Putative myosin-binding protein 5 isoform X22.4e-24191.91Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGF LPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDL  MDD+Q+
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
        GNSNRNLL ENGISRLQSEVCCST PRLQN+VDK  EYDGK KK+MYQKPRTKIRRRRRAAVDNGKLSKGI EG ETRKEREFVALVERQDFITDDGNES
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
        NH + GQRIWQGFESSGSLGENNY  KGSSTVGQ  SNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS
Subjt:  NHLE-GQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKAS

Query:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK
        FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQ+D AEKEQLKGNRD+I DEHKELSATAH SNGDPPI +PIGNA+SL R  K
Subjt:  FEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAK

Query:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN
        LNELRDNSLL D  AIEAAPHCGGFEKSFLSRGALQNL+HITHAV+DLGGSI+DMEIDVQDIHVIDEKLHMEDTK ERKVDN
Subjt:  LNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHITHAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN

SwissProt top hitse value%identityAlignment
F4HVS6 Probable myosin-binding protein 67.0e-1230.52Show/hide
Query:  TSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDI
        T NA +      E S +  L+  +  ++ +   L++EL+EER+A+A AA+EA+AMITRLQ EKA+ +MEA QY R ++E+  YD+E +  +   L KR+ 
Subjt:  TSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDI

Query:  DYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGN-AISLSRMAKLNELRD---NSLLCDHIAIEAAPHCGGFEKSFLS
        +   LE E E YR+      +Q     +F    HK+    + Y   D     P+ + A+S S   +  E  D    S   +    E        + S   
Subjt:  DYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGN-AISLSRMAKLNELRD---NSLLCDHIAIEAAPHCGGFEKSFLS

Query:  RGALQNLEHITHAVDDL--GGSILDMEIDVQDIH-------VIDEKLHM
         G ++ L  IT  +  L   G +L    DV D+         I + LHM
Subjt:  RGALQNLEHITHAVDDL--GGSILDMEIDVQDIH-------VIDEKLHM

F4HXQ7 Myosin-binding protein 17.8e-1144.95Show/hide
Query:  LELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYR------
        L+  ++ +R     L+ ELEEER+A+A A ++A+AMITRLQ EKASF+MEA Q  R +EE+  YD E +  L ++LV+R+     LE EIE +R      
Subjt:  LELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYR------

Query:  --QIDLAEK
          ++D+AEK
Subjt:  --QIDLAEK

Q0WNW4 Myosin-binding protein 34.9e-1340.65Show/hide
Query:  YKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNI
        Y  +   G G     E D   +   TI  L   +  E+ A   L+ ELEEER+A+A +A++ +AMITRLQ EKA  +MEA QY R +EE+  YD+E + +
Subjt:  YKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNI

Query:  LREILVKRDIDYHVLEKEIEAYR
        L  ++VKR+ +   L++E+E YR
Subjt:  LREILVKRDIDYHVLEKEIEAYR

Q9CAC4 Myosin-binding protein 21.5e-1440.15Show/hide
Query:  TIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQI
        T+  L+  L+EER A  +L+ ELE ER A+A AA E +AMI RL  EKA+ +MEA QY R +EE+  +D+E + +L E++V R+ +   LEKE+E YR+ 
Subjt:  TIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQI

Query:  DLAEKEQLKGNRDFIFDEHKELSATAHYSNGD
           E+ + K     +    ++ S  ++ +NGD
Subjt:  DLAEKEQLKGNRDFIFDEHKELSATAHYSNGD

Q9LMC8 Probable myosin-binding protein 51.6e-1137.5Show/hide
Query:  NAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDY
        N  E +++  + S ++ L   +  +R +   L++EL+EER+A+A AA+ A+AMITRLQ EKA+ +MEA QY R ++E+  YD+E +  +  +LVKR+ + 
Subjt:  NAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDY

Query:  HVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELS
          LE  IE YR      +E+     +F+ +E K +S
Subjt:  HVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELS

Arabidopsis top hitse value%identityAlignment
AT1G04890.1 Protein of unknown function, DUF5937.4e-2539.73Show/hide
Query:  KGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQD--FITDDGNESNHL-EGQRI--WQGFESSGSLGENNYTYKGSSTVGQGT
        KGK+ + ++ R  ++  RR+   N    + I    E   E     L++  D     DD  +     EG  +  W  FE + S+ +              +
Subjt:  KGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQD--FITDDGNESNHL-EGQRI--WQGFESSGSLGENNYTYKGSSTVGQGT

Query:  SNAEERDIIRN-EASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDI
        +++ E+  +RN E  ++R LE  L+EERAARA++ VEL++ER+AAA+AADEA+AMI RLQ+EKA+ EMEARQ+ R VEE+ ++D E+M IL++IL++R+ 
Subjt:  SNAEERDIIRN-EASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDI

Query:  DYHVLEKEIEAYRQIDLAEKEQLK
        + H LEKE+EAYRQ+ L E E+L+
Subjt:  DYHVLEKEIEAYRQIDLAEKEQLK

AT1G70750.1 Protein of unknown function, DUF5931.1e-1540.15Show/hide
Query:  TIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQI
        T+  L+  L+EER A  +L+ ELE ER A+A AA E +AMI RL  EKA+ +MEA QY R +EE+  +D+E + +L E++V R+ +   LEKE+E YR+ 
Subjt:  TIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQI

Query:  DLAEKEQLKGNRDFIFDEHKELSATAHYSNGD
           E+ + K     +    ++ S  ++ +NGD
Subjt:  DLAEKEQLKGNRDFIFDEHKELSATAHYSNGD

AT4G13160.1 Protein of unknown function, DUF5932.9e-2928.61Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        M + E +  T  G++ AF++LA  Y LLCVSA +FI SK+L     F+PC      G  N++LC+ KL+ +WP R I  V  L     P  L   + +Q 
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
                                                                                   E  +E+E   +V++           
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLEGQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASF
                                            N+E  D        +RLLE+A+E+E+ A+A+L VELE+ERAA+A+AADEA+AMI RLQ +KAS 
Subjt:  NHLEGQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASF

Query:  EMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPI--VHPIGN----AISL
        EME +QY R ++EKF+YDEE+MNIL+EIL KR+ + H LEKE+E Y+ ID  + ++ + N     DE           +G+P +  VH I +     +  
Subjt:  EMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPI--VHPIGN----AISL

Query:  SRMAKLNELRDNSLLCDHIAIEA
          +A+  E+++  ++ DH ++ +
Subjt:  SRMAKLNELRDNSLLCDHIAIEA

AT4G13630.1 Protein of unknown function, DUF5937.4e-4136.19Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        M   E+ SWT  GLV AF+DL+V + LLC S  +++ SK L + G  LPCPC G Y       C  + + N P +KI  V   VKNR PFD IL      
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
         N  +    E    +L+ EV  ST P +    +K   +D                                L   ++ K+  F    +R  F        
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLEGQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEA-----------------STIRLLELALEEERAARASLFVELEEERAAAATAA
        NH +     + FE  GS  EN+     S+  G+   +   R  +   +                  T+ + E  L EERAARASL +ELE+ER AAA+AA
Subjt:  NHLEGQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEA-----------------STIRLLELALEEERAARASLFVELEEERAAAATAA

Query:  DEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQ
        DEA+ MI RLQ EKAS EMEARQY R +EEK ++D E+M+IL+EIL++R+ + H LEKE++ YRQ+ L E EQ
Subjt:  DEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQ

AT4G13630.2 Protein of unknown function, DUF5937.4e-4136.19Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM
        M   E+ SWT  GLV AF+DL+V + LLC S  +++ SK L + G  LPCPC G Y       C  + + N P +KI  V   VKNR PFD IL      
Subjt:  MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQM

Query:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES
         N  +    E    +L+ EV  ST P +    +K   +D                                L   ++ K+  F    +R  F        
Subjt:  GNSNRNLLTENGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNES

Query:  NHLEGQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEA-----------------STIRLLELALEEERAARASLFVELEEERAAAATAA
        NH +     + FE  GS  EN+     S+  G+   +   R  +   +                  T+ + E  L EERAARASL +ELE+ER AAA+AA
Subjt:  NHLEGQRIWQGFESSGSLGENNYTYKGSSTVGQGTSNAEERDIIRNEA-----------------STIRLLELALEEERAARASLFVELEEERAAAATAA

Query:  DEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQ
        DEA+ MI RLQ EKAS EMEARQY R +EEK ++D E+M+IL+EIL++R+ + H LEKE++ YRQ+ L E EQ
Subjt:  DEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILVKRDIDYHVLEKEIEAYRQIDLAEKEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTCCACGAGATTCATTCGTGGACTTTATCTGGACTAGTTAGAGCTTTTCTTGACCTAGCTGTTGTTTATTTTCTTTTGTGTGTGTCGGCCACTATGTTTATTCC
TTCCAAGATTTTGAAAGTTGTTGGATTTTTCTTGCCTTGTCCCTGTACTGGATTTTATGGGAATCACAACACGAATTTGTGTTTGCATAAACTCGTTGTTAATTGGCCTA
AGAGGAAGATCTATTTGGTGCTCGATTTGGTTAAGAATAGGTTTCCTTTTGATTTGATTTTGATGGATGATGACCAAATGGGGAATTCGAATAGGAATTTGTTAACGGAG
AATGGAATTTCTCGGTTGCAATCAGAAGTATGCTGTAGTACTGCCCCAAGGTTACAAAATCTGGTTGATAAAGATGATGAATATGATGGTAAGGGTAAGAAGATTATGTA
CCAGAAGCCGAGGACTAAAATCCGACGGCGGAGGAGAGCTGCGGTTGACAATGGGAAATTGTCCAAGGGAATCTTGGAGGGGAAAGAAACTAGAAAGGAAAGGGAATTTG
TGGCATTGGTTGAGAGACAAGACTTTATTACAGATGATGGTAATGAATCAAATCACCTGGAGGGTCAAAGAATCTGGCAAGGCTTTGAATCAAGTGGTTCACTTGGCGAA
AATAATTATACGTATAAAGGTTCTTCCACTGTAGGACAAGGTACCAGTAATGCTGAAGAGAGAGACATTATTAGAAATGAAGCAAGTACTATTAGATTGTTGGAGCTGGC
CCTTGAAGAAGAGAGAGCTGCACGGGCATCTCTTTTTGTGGAACTAGAGGAGGAGAGAGCTGCCGCTGCTACTGCTGCTGATGAAGCAATAGCCATGATAACACGTTTGC
AAAATGAGAAGGCGTCTTTTGAAATGGAAGCAAGACAATATCATAGGGAAGTAGAAGAAAAATTTTCTTATGATGAAGAAAAGATGAATATCCTTCGAGAGATCCTTGTC
AAAAGGGACATAGACTATCATGTTCTGGAGAAGGAAATAGAAGCGTATAGACAGATAGATCTTGCAGAAAAGGAACAGTTAAAAGGAAACCGGGATTTCATTTTTGATGA
ACATAAAGAACTGTCTGCCACAGCTCATTACTCAAATGGAGATCCACCCATTGTTCATCCAATTGGTAATGCTATTTCACTCTCAAGGATGGCAAAGTTGAACGAACTGA
GGGATAATAGCCTGTTGTGTGATCATATCGCTATTGAGGCAGCTCCACACTGTGGTGGTTTTGAGAAAAGCTTTCTTTCCCGTGGGGCACTTCAAAATTTGGAGCACATA
ACTCATGCAGTCGATGATTTAGGAGGTTCTATCCTTGATATGGAAATAGATGTTCAAGATATTCATGTGATTGATGAAAAACTCCACATGGAGGACACTAAAGAGGAGAG
AAAAGTTGATAATTGA
mRNA sequenceShow/hide mRNA sequence
CACGTTCTGCTTTTTTAGCGGCGACACTGTGTCAACCATATCACTGCCGCATGTGTTTCATCTTCTTCTTTTTCTTTTCTTTTTTCTTTTCTATTTTCCCTTTTAGTTCC
TCAATCAAATCCTAACCATGATCCAGAACACGACACAGCTTTAATCCTTGTCTTCTTCCTCTTCCTCTTCCTCTTCCTCTTCCAACCTACTCTGCCCCTTGAAATGGACA
CCTGATTCATTTCCCCTCTTTCCAGCCCAACCCATCATCAATTTCCCTCGATTCTTACATCGGTTGTTCCAGTTTTGCAAATTTCTTCACTTTTTAATCTTGCCCCTCTG
TGTTATTCAGGCTTTTTTTTTTTTTTCTTTTGCTCCTTTTTCGCCTTTTTTTACCTACTCTGTTTCTGTTCTTTCAGCTTCAGCTCTTTTCCTTCGGTTTTTGGAAAGCT
TTCTGGACGGTATTTGTTTCTCAACTCTCTTAGTCTTTTGTTGGTTGTCTGATTTTTCTTAGTTGCTAAGTTTTAGAAAGAGTTTGGGATTTTGTTAATTGAAATCGATG
TCTTTCCACGAGATTCATTCGTGGACTTTATCTGGACTAGTTAGAGCTTTTCTTGACCTAGCTGTTGTTTATTTTCTTTTGTGTGTGTCGGCCACTATGTTTATTCCTTC
CAAGATTTTGAAAGTTGTTGGATTTTTCTTGCCTTGTCCCTGTACTGGATTTTATGGGAATCACAACACGAATTTGTGTTTGCATAAACTCGTTGTTAATTGGCCTAAGA
GGAAGATCTATTTGGTGCTCGATTTGGTTAAGAATAGGTTTCCTTTTGATTTGATTTTGATGGATGATGACCAAATGGGGAATTCGAATAGGAATTTGTTAACGGAGAAT
GGAATTTCTCGGTTGCAATCAGAAGTATGCTGTAGTACTGCCCCAAGGTTACAAAATCTGGTTGATAAAGATGATGAATATGATGGTAAGGGTAAGAAGATTATGTACCA
GAAGCCGAGGACTAAAATCCGACGGCGGAGGAGAGCTGCGGTTGACAATGGGAAATTGTCCAAGGGAATCTTGGAGGGGAAAGAAACTAGAAAGGAAAGGGAATTTGTGG
CATTGGTTGAGAGACAAGACTTTATTACAGATGATGGTAATGAATCAAATCACCTGGAGGGTCAAAGAATCTGGCAAGGCTTTGAATCAAGTGGTTCACTTGGCGAAAAT
AATTATACGTATAAAGGTTCTTCCACTGTAGGACAAGGTACCAGTAATGCTGAAGAGAGAGACATTATTAGAAATGAAGCAAGTACTATTAGATTGTTGGAGCTGGCCCT
TGAAGAAGAGAGAGCTGCACGGGCATCTCTTTTTGTGGAACTAGAGGAGGAGAGAGCTGCCGCTGCTACTGCTGCTGATGAAGCAATAGCCATGATAACACGTTTGCAAA
ATGAGAAGGCGTCTTTTGAAATGGAAGCAAGACAATATCATAGGGAAGTAGAAGAAAAATTTTCTTATGATGAAGAAAAGATGAATATCCTTCGAGAGATCCTTGTCAAA
AGGGACATAGACTATCATGTTCTGGAGAAGGAAATAGAAGCGTATAGACAGATAGATCTTGCAGAAAAGGAACAGTTAAAAGGAAACCGGGATTTCATTTTTGATGAACA
TAAAGAACTGTCTGCCACAGCTCATTACTCAAATGGAGATCCACCCATTGTTCATCCAATTGGTAATGCTATTTCACTCTCAAGGATGGCAAAGTTGAACGAACTGAGGG
ATAATAGCCTGTTGTGTGATCATATCGCTATTGAGGCAGCTCCACACTGTGGTGGTTTTGAGAAAAGCTTTCTTTCCCGTGGGGCACTTCAAAATTTGGAGCACATAACT
CATGCAGTCGATGATTTAGGAGGTTCTATCCTTGATATGGAAATAGATGTTCAAGATATTCATGTGATTGATGAAAAACTCCACATGGAGGACACTAAAGAGGAGAGAAA
AGTTGATAATTGATTCAGCTGGCATCAAATAGTCCTACAACTATGGCAACCTTTGGAACGTCTAATGTCTAAAGTAGAGATTCATTGTATAAAAGTAGAATTCTAGGCTG
TTGCAACGTGCTCCAGAAGAGTTTATTTCTTCGGTATATAGTGGAAGGTTGTGTCATAGAGCTATGGGGTTACAAGGGTCAGAACGTCTCCATCTGTGAAGAAAAAGAAA
GAGCAAAATGGAATCTCCTATTGAAATATCAAACTCCAATATATTTGACGGCTTGGAGTTGCTTTTTAACAGGTTTTGTTAACCTCTTCTCTTTTAAGGTATGAATTATC
TGTACTTTACCTCTTGAAATTGAATTTGATTGATTTGGAAAATTGCAGATATTAATTCCATTTTTGTGAAGAAAAAAAACTTACCATAAATTATCATTAGTGCTTAAGAG
AGCAAATTATGAATCTAAAGATGTGGCAGAACATTGGAATGTGCAAGAAGAAAAATAGAGTAAAATCATATCAATATGGAGCTATGAAATATTATCCAAATAATTTCCGC
TAATTAATAGTTTCTCTTCCTGTGATTCCTTCTGCTTGTATGGTTGTTGCTGCCGCTTATCAGCATTCCTTGTTCTTTGTAATTTGCTTTATTCATTTACTCAACATATT
CAAATTTTTAAAATTAACTCATTAATTGGATATAAAATTGAACTTTTTGTTCAATTGTATCTTTCAATTTTGTGTGTACTCGTTGATTAGGTAATGGATTTATTACATAC
AAAATTGAAGATGTGGACTTATTAGATCCAAAATAGTAAGTTTAAGGACATTAATAAGACACTTTAAATGTTAGGGATTTATTTGACACACAAATGAAGTTTAGGGACTT
CTAAGATATTGTGTGTAGAAGATTCTCATTCATATATGGTTAACTTTCTTGGGTGCAAATATTTTTGAAAGTCCTCGTTTACATATTTTTGAAGGGTCTTCTATTAATCT
CTACTAGTTGGATACTAGATGTGTTGCATGTGTTTCATTTATGTCTATATATTCTTTTGATTACCTTTCATGTTCATGTACGTCTAGATATTCTCTGTGAAGGATGGAAC
TGATGGATTAGAATCTGGATTTTATCTTAGCACTTTATAGTATATCGATATTGGAAAGTATGTGTAGATCAAGAATTATCAAATCAAGGAACTCATGAAGATTATTTGAA
ACATAAATAAGCATGGAACATTATTTAGAGAAGTTAGTAAATGTTTGTTATTATTAGTATTTAGATTCCTATTAATAGGAATAGTTTTGTTGATAATTTTCAGGTCAGGA
GGCTAAGACTCATATATGGATTGTAATTGCCATTCTGTGAAGAAGAATTTTCTGTATACTCTTTTCTTTTCAGTTATATTATATCTGTAACAATTTCCTTTGTAAATTAA
TTATATCTCAAGCTGTAGTAGCTTCTTCACTTTGAGATAGGTGCTATGTAAAGATTTAAATAGTGTCTGTGAATCTGTTGTAAAGATTTACATATTGTATTGTTCTTTCA
AACAAAATAAAATTTTGTTAATATTCTCCAAAA
Protein sequenceShow/hide protein sequence
MSFHEIHSWTLSGLVRAFLDLAVVYFLLCVSATMFIPSKILKVVGFFLPCPCTGFYGNHNTNLCLHKLVVNWPKRKIYLVLDLVKNRFPFDLILMDDDQMGNSNRNLLTE
NGISRLQSEVCCSTAPRLQNLVDKDDEYDGKGKKIMYQKPRTKIRRRRRAAVDNGKLSKGILEGKETRKEREFVALVERQDFITDDGNESNHLEGQRIWQGFESSGSLGE
NNYTYKGSSTVGQGTSNAEERDIIRNEASTIRLLELALEEERAARASLFVELEEERAAAATAADEAIAMITRLQNEKASFEMEARQYHREVEEKFSYDEEKMNILREILV
KRDIDYHVLEKEIEAYRQIDLAEKEQLKGNRDFIFDEHKELSATAHYSNGDPPIVHPIGNAISLSRMAKLNELRDNSLLCDHIAIEAAPHCGGFEKSFLSRGALQNLEHI
THAVDDLGGSILDMEIDVQDIHVIDEKLHMEDTKEERKVDN