; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy05g009520 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy05g009520
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionGTD-binding domain-containing protein
Genome locationChr05:9896253..9901364
RNA-Seq ExpressionLcy05g009520
SyntenyLcy05g009520
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0080115 - myosin XI tail binding (molecular function)
InterPro domainsIPR007656 - GTD-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK30690.1 putative myosin-binding protein 5 isoform X2 [Cucumis melo var. makuwa]7.4e-17372.03Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDL-ILTDNPM
        MSFHEIHSWTLSGLVRAFLD+ VVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N  LC H+L+V+WPKRKIYLVLDLVK RFPFDL  + D  +
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDL-ILTDNPM

Query:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VDK GEYDGK K+++YQ+PRTKIRRRR+   +N +LSKG  EGNE R+ERE VALVE+Q FI D
Subjt:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG-TSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN
        D  ESN  DLG+R W GFESSGS+GENN MNKGS+T+G  SN EER II+NE S+IRLLE ALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQN
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG-TSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN

Query:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLS
        EKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+EKE+LK N D+ILDEHK  SAT H SNGDPPI + I NA+SL 
Subjt:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLS

Query:  R----------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
        R                +EAA   GGFEKSFL RGALQ +L+H  HAVNDLG SI+DMEIDVQDIHVIDEKL
Subjt:  R----------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

XP_004139633.1 probable myosin-binding protein 5 [Cucumis sativus]1.6e-17271.58Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDN-PM
        MSFHEIHSWTLSG+VRAFLD+ VVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N  LCFH+L+V+WPKRKIYLVLDLVK  FPFDLIL D+  +
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDN-PM

Query:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VD DGE+DGKGK+I+YQ+PRTKIRRRR+   +N +LSKG  E NE R+ RE VALVE+Q FI D
Subjt:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D  ESN +DLG+R W GFESSGS+GEN+ MNKGS+T+  GTS  EER II+NE S+IRLLE ALEEE+ ARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL
        NEKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+E+E LK N DFILDEH   SAT HYSNGDPPI + I NA+SL
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL

Query:  SR------------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
        SR                  ++AA   GGFEKSFL RGALQ +LEH  HAVNDLG SILDMEIDVQDIHVIDEKL
Subjt:  SR------------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

XP_008458006.1 PREDICTED: uncharacterized protein LOC103497547 isoform X1 [Cucumis melo]1.9e-17372.09Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTD-NPM
        MSFHEIHSWTLSGLVRAFLD+ VVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N  LC H+L+V+WPKRKIYLVLDLVK RFPFDL   D   +
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTD-NPM

Query:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VDK GEYDGKGK+++YQ+PRTKIRRRR+   +N +LSKG  EGNE R+ERE VALVE+Q FI D
Subjt:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D  ESN  DLG+R W GFESSGS+GENN MNKGS+T+  GTSN EER II+NE S+IRLLE ALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL
        NEKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+EKE+LK N D+ILDEHK  S T H SNGDPPI + I NA+SL
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL

Query:  SR----------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
         R                +EAA   GGFEKSFL RGALQ +L+H  HAVNDLG SI+DMEIDVQDIHVIDEKL
Subjt:  SR----------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

XP_038877576.1 uncharacterized protein LOC120069830 isoform X1 [Benincasa hispida]4.3e-18971.29Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG
        MSFHEIHSWTLSGL+RAFLD+ VVYFLLCVSATVF PSKILK+VG CLPCPC+GFYGN N  LCFHRL+V+WPKRKIYLVLDLVK RFPFDLIL D  M 
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQPDGIPQ-TAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD
        NSNRN+LREN   DGI Q  +E CC +FS  R QN VDKD EYDGKGKRI+YQRP+TKIRRRR+   +N +LSKG  EGNE  +ERE VALVE+Q FI D
Subjt:  NSNRNVLRENQQPDGIPQ-TAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D+ ESN +DLG+RTW GFESSGS GENN+MNK S+T+  GTSN EER II+NE SSIRLLEQALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL
        NEKASVEMEARQY RVIEEKFAYDEE++NILREILVK++IDYHVLEKEIEAYRQMDFSEKE+LK NWDF+LDEH   S T HYSNGDPPI HQI NAIS 
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL

Query:  SR---------------------------------------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDE
        SR                                             +EAA   GGF+KSFL RGALQESLEH DHAVNDL SSILDMEIDVQDIHVIDE
Subjt:  SR---------------------------------------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDE

Query:  KLSHGRNERGEK
        KL H  + + EK
Subjt:  KLSHGRNERGEK

XP_038877577.1 uncharacterized protein LOC120069830 isoform X2 [Benincasa hispida]6.7e-18269.92Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG
        MSFHEIHSWTLSGL+RAFLD+ VVYFLLCVSATVF PSKILK+VG CLPCPC+GFYGN N  LCFHRL+V+WPKRKIYLVLDLVK RFPFDLIL D  M 
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQPDGIPQ-TAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD
        NSNRN+LREN   DGI Q  +E CC +FS  R QN VDKD EYDGKGKRI+YQRP+TKIRRRR+   +N +LSKG  EGNE  +ERE VALVE+Q FI  
Subjt:  NSNRNVLRENQQPDGIPQ-TAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
                  G+RTW GFESSGS GENN+MNK S+T+  GTSN EER II+NE SSIRLLEQALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL
        NEKASVEMEARQY RVIEEKFAYDEE++NILREILVK++IDYHVLEKEIEAYRQMDFSEKE+LK NWDF+LDEH   S T HYSNGDPPI HQI NAIS 
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL

Query:  SR---------------------------------------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDE
        SR                                             +EAA   GGF+KSFL RGALQESLEH DHAVNDL SSILDMEIDVQDIHVIDE
Subjt:  SR---------------------------------------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDE

Query:  KLSHGRNERGEK
        KL H  + + EK
Subjt:  KLSHGRNERGEK

TrEMBL top hitse value%identityAlignment
A0A0A0K8C0 GTD-binding domain-containing protein8.0e-17371.58Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDN-PM
        MSFHEIHSWTLSG+VRAFLD+ VVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N  LCFH+L+V+WPKRKIYLVLDLVK  FPFDLIL D+  +
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDN-PM

Query:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VD DGE+DGKGK+I+YQ+PRTKIRRRR+   +N +LSKG  E NE R+ RE VALVE+Q FI D
Subjt:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D  ESN +DLG+R W GFESSGS+GEN+ MNKGS+T+  GTS  EER II+NE S+IRLLE ALEEE+ ARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL
        NEKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+E+E LK N DFILDEH   SAT HYSNGDPPI + I NA+SL
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL

Query:  SR------------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
        SR                  ++AA   GGFEKSFL RGALQ +LEH  HAVNDLG SILDMEIDVQDIHVIDEKL
Subjt:  SR------------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

A0A1S3C7F2 uncharacterized protein LOC103497547 isoform X19.4e-17472.09Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTD-NPM
        MSFHEIHSWTLSGLVRAFLD+ VVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N  LC H+L+V+WPKRKIYLVLDLVK RFPFDL   D   +
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTD-NPM

Query:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VDK GEYDGKGK+++YQ+PRTKIRRRR+   +N +LSKG  EGNE R+ERE VALVE+Q FI D
Subjt:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D  ESN  DLG+R W GFESSGS+GENN MNKGS+T+  GTSN EER II+NE S+IRLLE ALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL
        NEKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+EKE+LK N D+ILDEHK  S T H SNGDPPI + I NA+SL
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISL

Query:  SR----------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
         R                +EAA   GGFEKSFL RGALQ +L+H  HAVNDLG SI+DMEIDVQDIHVIDEKL
Subjt:  SR----------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

A0A5D3E523 Putative myosin-binding protein 5 isoform X23.6e-17372.03Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDL-ILTDNPM
        MSFHEIHSWTLSGLVRAFLD+ VVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N  LC H+L+V+WPKRKIYLVLDLVK RFPFDL  + D  +
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDL-ILTDNPM

Query:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VDK GEYDGK K+++YQ+PRTKIRRRR+   +N +LSKG  EGNE R+ERE VALVE+Q FI D
Subjt:  GNSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG-TSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN
        D  ESN  DLG+R W GFESSGS+GENN MNKGS+T+G  SN EER II+NE S+IRLLE ALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQN
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG-TSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN

Query:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLS
        EKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+EKE+LK N D+ILDEHK  SAT H SNGDPPI + I NA+SL 
Subjt:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLS

Query:  R----------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
        R                +EAA   GGFEKSFL RGALQ +L+H  HAVNDLG SI+DMEIDVQDIHVIDEKL
Subjt:  R----------------MEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

A0A6J1GQ95 probable myosin-binding protein 61.1e-15865.87Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG
        M+FHEIHSWT  GLVRAFLD+ VVYFLLCVSATVF PSKIL+VVG CLPCPCTGFYGNQN  LC HRLL SWPKRKIYLVLD VK RFPFDLIL D+ MG
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPDD
          NRN+LREN   DG+P      C S                  KG RI++QRPR   RRR    +           G   R+E E +AL +KQ FIPDD
Subjt:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPDD

Query:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN
        + ES  +DLG RTW GFESSGSVGEN+ +NKGS+TI  GT+N +ER I  NEV SIRLLEQALEEEKAARASLFLELEEERAAAA+AADEAIAMITRLQN
Subjt:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN

Query:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLS
        EKASVEMEARQY+RVIEEKFAYDEEEMNILREILV++EIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHK  S+T  YSNGDPP+ HQIENA+SLS
Subjt:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLS

Query:  ---------------------------------------------RMEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEK
                                                      +E A   GGFEKS L RGALQ  LEH DH +NDLGSSILDMEIDVQDIHVIDEK
Subjt:  ---------------------------------------------RMEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEK

Query:  L
        L
Subjt:  L

A0A6J1KX80 uncharacterized protein LOC111497962 isoform X11.9e-15869.32Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG
        M+FHEIHSWTLSGLVRAF+D+ +VY LLCVSATVF PSK+LKVVGLCLPCPCTGFYGN+N  LC HRLLVSWPKRKI LVL+LVKGRFPFDLIL D+ M 
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPDD
        N+N + L EN   DGIP+    CC + SG  FQNS+  D E+DGKGKRI+Y+RPRTKIRR+R T  E+ +LSKG  EG+E R+ERE VALVE+Q F+PDD
Subjt:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPDD

Query:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN
          ESN ++L ER W GFESSGSVGEN+ MNKGS+TI  GTSNT+ER I +NEVS IRLL+QA EE   ARASLFLELE+ERAAAA+A DEAIAMITRLQN
Subjt:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN

Query:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLS
        EKAS EMEARQY+R +EEK AYDEEEMNILREILVK+EIDYHVLEK+I+AYR MD SEKE+LKR WDFILDE K  SATTH +  D              
Subjt:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLS

Query:  RMEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVID
          E   C    +KSF+  G   ES+EH +HAVNDLGSSILDMEI VQDIH+ID
Subjt:  RMEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVID

SwissProt top hitse value%identityAlignment
F4HVS6 Probable myosin-binding protein 61.8e-1240.65Show/hide
Query:  MNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNIL
        +NK  N   T++     I+      +RL       +K +   L++EL+EER+A+A AA+EA+AMITRLQ EKA+V+MEA QY+R+++E+  YD+E +  +
Subjt:  MNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNIL

Query:  REILVKKEIDYHVLEKEIEAYRQ
           L K+E +   LE E E YR+
Subjt:  REILVKKEIDYHVLEKEIEAYRQ

Q0WNW4 Myosin-binding protein 37.3e-1445.92Show/hide
Query:  SIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR
        +I  L + +  E+ A   L+ ELEEER+A+A +A++ +AMITRLQ EKA V+MEA QY+R++EE+  YD+E + +L  ++VK+E +   L++E+E YR
Subjt:  SIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR

Q9CAC4 Myosin-binding protein 29.5e-1439.55Show/hide
Query:  VSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR
        V ++  L+  L+EE+ A  +L+ ELE ER A+A AA E +AMI RL  EKA+++MEA QY+R++EE+  +D+E + +L E++V +E +   LEKE+E YR
Subjt:  VSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR

Query:  QMDFSEKEELKRNWDFILDEHKGPSATTHYSNGD
        +    E+ E K     +    +  S  ++ +NGD
Subjt:  QMDFSEKEELKRNWDFILDEHKGPSATTHYSNGD

Q9FG14 Myosin-binding protein 75.8e-1136.42Show/hide
Query:  IKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEI
        I+NE   + LL + +  ++ +   L+ EL+EER AA++AA EA++MI RLQ +KA ++ME RQ++R  EEK  +D++E+  L +++ K+E     L  E 
Subjt:  IKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEI

Query:  EAY--RQMDF--------SEKEELKRNWDFILDEHKGPSATTHYSNGDPPI
        +AY  R M F        +EK  L RN   I ++++    T+ Y    PPI
Subjt:  EAY--RQMDF--------SEKEELKRNWDFILDEHKGPSATTHYSNGDPPI

Q9LMC8 Probable myosin-binding protein 51.2e-1140.5Show/hide
Query:  SSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQ
        S ++ L + +  ++ +   L++EL+EER+A+A AA+ A+AMITRLQ EKA+V+MEA QY+R+++E+  YD+E +  +  +LVK+E +   LE  IE YR 
Subjt:  SSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQ

Query:  MDFSEKEELKRNWDFILDEHK
             +EE     +F+ +E K
Subjt:  MDFSEKEELKRNWDFILDEHK

Arabidopsis top hitse value%identityAlignment
AT1G04890.1 Protein of unknown function, DUF5931.8e-2841.63Show/hide
Query:  KGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVE--KQAFIPDDIKESNQIDLGERT--WHGFESSGSVGENNNMNKGSNTIGTSN
        KGKR V +R R  ++  R++   N    +     +EA  E     L++   +    DD K+  +   G     W  FE + SV +  N N  +      N
Subjt:  KGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVE--KQAFIPDDIKESNQIDLGERT--WHGFESSGSVGENNNMNKGSNTIGTSN

Query:  TEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYH
         E+R        S+R LE+ L+EE+AARA++ +EL++ER+AAASAADEA+AMI RLQ+EKA++EMEARQ++R++EE+  +D EEM IL++IL+++E + H
Subjt:  TEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYH

Query:  VLEKEIEAYRQMDFSEKEELK
         LEKE+EAYRQ+   E EEL+
Subjt:  VLEKEIEAYRQMDFSEKEELK

AT4G13160.1 Protein of unknown function, DUF5931.3e-2931.48Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG
        M + E +  T  G++ AF+++   Y LLCVSA VF  SK+L    L +PC      G QN  LC  +LL  W                PF +IL    + 
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPDD
         +NR  +  +Q+ +                                                                 E  +E+E   +V+K       
Subjt:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPDD

Query:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEK
         K S  +D                                             +RLLE A+E+EK A+A+L +ELE+ERAA+ASAADEA+AMI RLQ +K
Subjt:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEK

Query:  ASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEE
        AS+EME +QYER+I+EKFAYDEEEMNIL+EIL K+E + H LEKE+E Y+ +D  ++ E
Subjt:  ASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEE

AT4G13630.1 Protein of unknown function, DUF5932.0e-4334.64Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG
        M   E+ SWT  GLV AF+D+ V + LLC S  V+  SK L + GL LPCPC G Y       CF   L + P +KI  V   VK R PFD IL +   G
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVD----------KDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVAL
           +   R  Q  D +  T        S  +F+N             K G +  K KR+ + R     +   ++     +  +G Y+ N+         L
Subjt:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVD----------KDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVAL

Query:  VEKQAFIPDDIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAI
        V       D  K    + L +       S  SVG           +G       G+++    ++ + EQ L EE+AARASL LELE+ER AAASAADEA+
Subjt:  VEKQAFIPDDIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAI

Query:  AMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDF------------ILDEHKGPSATT
         MI RLQ EKAS+EMEARQY+R+IEEK A+D EEM+IL+EIL+++E + H LEKE++ YRQM F E E+     D              + E      T 
Subjt:  AMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDF------------ILDEHKGPSATT

Query:  HYSNGDPPISHQIENA-ISLSRMEAAQCYGGFEKSFLPRG-ALQESLEHR--DHAVN-DLGSSILDMEIDVQDIHVIDEKLSHGR
        + S+G    ++Q++N     SR E        E      G  L   L  R  D AV+  L     D++  V DIHV+ ++ + G+
Subjt:  HYSNGDPPISHQIENA-ISLSRMEAAQCYGGFEKSFLPRG-ALQESLEHR--DHAVN-DLGSSILDMEIDVQDIHVIDEKLSHGR

AT4G13630.2 Protein of unknown function, DUF5932.0e-4334.64Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG
        M   E+ SWT  GLV AF+D+ V + LLC S  V+  SK L + GL LPCPC G Y       CF   L + P +KI  V   VK R PFD IL +   G
Subjt:  MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVD----------KDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVAL
           +   R  Q  D +  T        S  +F+N             K G +  K KR+ + R     +   ++     +  +G Y+ N+         L
Subjt:  NSNRNVLRENQQPDGIPQTAEECCGSFSGSRFQNSVD----------KDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVAL

Query:  VEKQAFIPDDIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAI
        V       D  K    + L +       S  SVG           +G       G+++    ++ + EQ L EE+AARASL LELE+ER AAASAADEA+
Subjt:  VEKQAFIPDDIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAI

Query:  AMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDF------------ILDEHKGPSATT
         MI RLQ EKAS+EMEARQY+R+IEEK A+D EEM+IL+EIL+++E + H LEKE++ YRQM F E E+     D              + E      T 
Subjt:  AMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDF------------ILDEHKGPSATT

Query:  HYSNGDPPISHQIENA-ISLSRMEAAQCYGGFEKSFLPRG-ALQESLEHR--DHAVN-DLGSSILDMEIDVQDIHVIDEKLSHGR
        + S+G    ++Q++N     SR E        E      G  L   L  R  D AV+  L     D++  V DIHV+ ++ + G+
Subjt:  HYSNGDPPISHQIENA-ISLSRMEAAQCYGGFEKSFLPRG-ALQESLEHR--DHAVN-DLGSSILDMEIDVQDIHVIDEKLSHGR

AT5G16720.1 Protein of unknown function, DUF5935.2e-1545.92Show/hide
Query:  SIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR
        +I  L + +  E+ A   L+ ELEEER+A+A +A++ +AMITRLQ EKA V+MEA QY+R++EE+  YD+E + +L  ++VK+E +   L++E+E YR
Subjt:  SIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCCACGAGATTCATTCTTGGACCTTGTCCGGACTTGTCAGAGCTTTTCTAGACATGGTTGTTGTTTATTTTCTTTTGTGTGTATCGGCGACCGTGTTTTTCCC
GTCCAAGATTTTGAAAGTGGTTGGGTTGTGTTTGCCTTGTCCTTGTACTGGATTTTATGGGAATCAGAACCGTATTTTGTGTTTCCATAGATTGCTTGTTAGTTGGCCGA
AGAGGAAGATTTATTTGGTGCTCGACTTGGTCAAGGGTAGGTTTCCTTTTGATTTGATTTTGACCGATAACCCAATGGGTAATTCGAATAGGAATGTGTTAAGGGAGAAT
CAGCAGCCGGATGGAATTCCTCAGACTGCTGAGGAATGTTGTGGTTCTTTTTCTGGCTCAAGATTTCAGAATTCGGTCGATAAAGATGGTGAATATGATGGTAAGGGTAA
GAGAATTGTGTATCAGAGGCCGAGGACCAAAATCCGACGACGGAGGAAAACTCCTTTTGAGAATTGGAGATTGTCCAAGGGATTCTATGAGGGGAATGAAGCTAGAAGGG
AAAGGGAATCTGTGGCATTGGTTGAGAAACAAGCATTTATTCCAGATGATATTAAAGAATCAAATCAAATTGATTTGGGTGAAAGAACCTGGCATGGCTTTGAATCAAGT
GGTTCAGTTGGTGAAAATAATAATATGAATAAAGGTTCTAACACTATAGGTACCAGTAATACAGAAGAGAGAGGCATTATCAAAAATGAAGTAAGCTCTATTAGATTGTT
GGAGCAAGCGCTTGAAGAAGAGAAAGCTGCTCGAGCATCTCTGTTTCTGGAACTGGAGGAGGAGAGAGCTGCCGCTGCTTCTGCTGCTGATGAAGCAATAGCCATGATAA
CACGTTTGCAAAATGAGAAGGCATCAGTTGAAATGGAAGCAAGACAGTATGAGAGGGTAATAGAAGAAAAATTTGCTTATGATGAAGAAGAGATGAATATACTTCGAGAG
ATCCTCGTCAAGAAGGAAATAGATTATCATGTTCTGGAGAAGGAAATTGAAGCATATAGGCAGATGGATTTTTCAGAAAAAGAAGAGTTAAAAAGAAACTGGGATTTCAT
ATTGGATGAACATAAAGGACCGTCTGCCACTACCCATTACTCAAATGGAGATCCACCCATTTCTCATCAAATTGAAAATGCTATTTCTCTTTCAAGGATGGAGGCAGCTC
AATGTTATGGTGGTTTTGAGAAAAGCTTTCTTCCCCGTGGGGCACTACAAGAAAGTTTGGAGCACAGAGATCACGCAGTTAATGATCTAGGAAGTTCCATTCTTGATATG
GAAATAGATGTTCAAGATATTCATGTGATTGATGAAAAACTCTCACATGGAAGAAATGAAAGAGGAGAGAAAAGTTGA
mRNA sequenceShow/hide mRNA sequence
CGGCGACACTGTGGCAACCAAATGAGTGCCGCATGTGTTTCAATGTTCTTCATTTACCATTTTTCCATTTCTTTCTTTATTACCCTTTTAAACCTTCAGATCAATAATGA
CGATGATCCACAAAATGCCATCGCTTTCCTGCCTTCCTCGTCTTCTTTCTTCTCCTCTGCCCCTTCAAATGGACCCCTGATTCCACTCCCTTCGTCGCCCCCCAACTTCC
CCTCTCTTTCCACGCCGAACCCATCACCAATTTCCCTCAATTCTGCGAAATTTTGCACTTCCCAATCCCGCTTCTCTGTGTATTCAGGTCTTTCGGCTTTGTTGGGTACT
CTGTTTCTGTAATTTCAGCTCTTTTTCTTGAAGTGTTTGGAAAGCTTGCTCTGTTTCGTAGTTTGTTGCCGATTCGTGATTTCTTGGCTGCGAAGTTAGGGGAAGATTTT
GGGTTTTCGATCTGGGGTGGTCCGATTTTTAGTTTTTTTGCTTTTATCGGGTGATTGAAATCGATGTCGTTCCACGAGATTCATTCTTGGACCTTGTCCGGACTTGTCAG
AGCTTTTCTAGACATGGTTGTTGTTTATTTTCTTTTGTGTGTATCGGCGACCGTGTTTTTCCCGTCCAAGATTTTGAAAGTGGTTGGGTTGTGTTTGCCTTGTCCTTGTA
CTGGATTTTATGGGAATCAGAACCGTATTTTGTGTTTCCATAGATTGCTTGTTAGTTGGCCGAAGAGGAAGATTTATTTGGTGCTCGACTTGGTCAAGGGTAGGTTTCCT
TTTGATTTGATTTTGACCGATAACCCAATGGGTAATTCGAATAGGAATGTGTTAAGGGAGAATCAGCAGCCGGATGGAATTCCTCAGACTGCTGAGGAATGTTGTGGTTC
TTTTTCTGGCTCAAGATTTCAGAATTCGGTCGATAAAGATGGTGAATATGATGGTAAGGGTAAGAGAATTGTGTATCAGAGGCCGAGGACCAAAATCCGACGACGGAGGA
AAACTCCTTTTGAGAATTGGAGATTGTCCAAGGGATTCTATGAGGGGAATGAAGCTAGAAGGGAAAGGGAATCTGTGGCATTGGTTGAGAAACAAGCATTTATTCCAGAT
GATATTAAAGAATCAAATCAAATTGATTTGGGTGAAAGAACCTGGCATGGCTTTGAATCAAGTGGTTCAGTTGGTGAAAATAATAATATGAATAAAGGTTCTAACACTAT
AGGTACCAGTAATACAGAAGAGAGAGGCATTATCAAAAATGAAGTAAGCTCTATTAGATTGTTGGAGCAAGCGCTTGAAGAAGAGAAAGCTGCTCGAGCATCTCTGTTTC
TGGAACTGGAGGAGGAGAGAGCTGCCGCTGCTTCTGCTGCTGATGAAGCAATAGCCATGATAACACGTTTGCAAAATGAGAAGGCATCAGTTGAAATGGAAGCAAGACAG
TATGAGAGGGTAATAGAAGAAAAATTTGCTTATGATGAAGAAGAGATGAATATACTTCGAGAGATCCTCGTCAAGAAGGAAATAGATTATCATGTTCTGGAGAAGGAAAT
TGAAGCATATAGGCAGATGGATTTTTCAGAAAAAGAAGAGTTAAAAAGAAACTGGGATTTCATATTGGATGAACATAAAGGACCGTCTGCCACTACCCATTACTCAAATG
GAGATCCACCCATTTCTCATCAAATTGAAAATGCTATTTCTCTTTCAAGGATGGAGGCAGCTCAATGTTATGGTGGTTTTGAGAAAAGCTTTCTTCCCCGTGGGGCACTA
CAAGAAAGTTTGGAGCACAGAGATCACGCAGTTAATGATCTAGGAAGTTCCATTCTTGATATGGAAATAGATGTTCAAGATATTCATGTGATTGATGAAAAACTCTCACA
TGGAAGAAATGAAAGAGGAGAGAAAAGTTGATAACTGGTTCATATGGCATCAAATGGTCCTAGAATTATGCTATCACCTTTGGAACATTTAATGCCTGAAGCATGGATTG
ACCGTCTAAATCAACCATAGAAGTAGAGTTCTGGGCTGTTGACTTTTTCGTAGTTCAATGTAATGTGCTCCAGAAGAATTTTGTCTTCAGTAAATATTGAAAGGTCGTTG
TAGTGTTGTGGGGTTAGAAGGGTCAGGAAGAAAAGCATAATGGAACTTCCTGTTCAAATATCAAACTCCAAAATATTTGACTGCTAGGAGATCCTTTTTAACAGGTTTTA
TTGACCTCCTCCTCTTTAAGGTCAGGAGGCCAAGACTCATATATGTATTATAATTGTCTGTCTGTGAAGAAGAATCTCTGTATAGTCTTTTCTTTTCCAGTCATATTTGT
AACAGTTTCCTTTATCTGTAGCTTCTTCTCTTTGAGATAGTACTATGTATTGATATTGTTTTGTGTTTGTAATCTGTTGTAAAGATTTACACATTTTCCTTCCAAATAAG
GAATTGTGAGTAAACTTGTCTTTTAATTGTTTTAATTTATTTTCTTGCTTCTTTTTTTT
Protein sequenceShow/hide protein sequence
MSFHEIHSWTLSGLVRAFLDMVVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRILCFHRLLVSWPKRKIYLVLDLVKGRFPFDLILTDNPMGNSNRNVLREN
QQPDGIPQTAEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNEARRERESVALVEKQAFIPDDIKESNQIDLGERTWHGFESS
GSVGENNNMNKGSNTIGTSNTEERGIIKNEVSSIRLLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILRE
ILVKKEIDYHVLEKEIEAYRQMDFSEKEELKRNWDFILDEHKGPSATTHYSNGDPPISHQIENAISLSRMEAAQCYGGFEKSFLPRGALQESLEHRDHAVNDLGSSILDM
EIDVQDIHVIDEKLSHGRNERGEKS