; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002118 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002118
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGTD-binding domain-containing protein
Genome locationchr4:39480114..39482063
RNA-Seq ExpressionLag0002118
SyntenyLag0002118
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0080115 - myosin XI tail binding (molecular function)
InterPro domainsIPR007656 - GTD-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK30690.1 putative myosin-binding protein 5 isoform X2 [Cucumis melo var. makuwa]1.3e-17272.03Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDL-ILTDNPM
        MSFHEIHSWTLSGLVRAFLD+AVVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N NLC ++L+V+WPKRK+YLVLDLVK RFPFDL  + D  +
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDL-ILTDNPM

Query:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VDK GEYDGK K+++YQ+PRTKIRRRR+   +N +LSKG  EGNETR+ERE VALVERQ FI D
Subjt:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG-ISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN
        D  ESN  DLG+R W GFESSGS+GENN MNKGS+T+G  SN EER IIRNE S+I LLE ALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQN
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG-ISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN

Query:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLS
        EKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+EKE++K N D+ILDEHK  SAT H SN DPPI + I NA+SL 
Subjt:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLS

Query:  R----------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
        R                +EAA   GGFEKSFL RGALQ +L+H  HAVNDLG SI+DMEIDVQDIHVIDEKL
Subjt:  R----------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

XP_004139633.1 probable myosin-binding protein 5 [Cucumis sativus]1.1e-17171.37Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDN-PM
        MSFHEIHSWTLSG+VRAFLD+AVVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N NLCF++L+V+WPKRK+YLVLDLVK  FPFDLIL D+  +
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDN-PM

Query:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VD DGE+DGKGK+I+YQ+PRTKIRRRR+   +N +LSKG  E NETR+ RE VALVERQ FI D
Subjt:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D  ESN +DLG+R W GFESSGS+GEN+ MNKGS+T+  G S  EER IIRNE S+I LLE ALEEE+ ARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL
        NEKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+E+E +K N DFILDEH   SAT HYSN DPPI + I NA+SL
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL

Query:  SR------------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
        SR                  ++AA   GGFEKSFL RGALQ +LEH  HAVNDLG SILDMEIDVQDIHVIDEKL
Subjt:  SR------------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

XP_008458006.1 PREDICTED: uncharacterized protein LOC103497547 isoform X1 [Cucumis melo]1.3e-17271.88Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTD-NPM
        MSFHEIHSWTLSGLVRAFLD+AVVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N NLC ++L+V+WPKRK+YLVLDLVK RFPFDL   D   +
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTD-NPM

Query:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VDK GEYDGKGK+++YQ+PRTKIRRRR+   +N +LSKG  EGNETR+ERE VALVERQ FI D
Subjt:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D  ESN  DLG+R W GFESSGS+GENN MNKGS+T+  G SN EER IIRNE S+I LLE ALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL
        NEKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+EKE++K N D+ILDEHK  S T H SN DPPI + I NA+SL
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL

Query:  SR----------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
         R                +EAA   GGFEKSFL RGALQ +L+H  HAVNDLG SI+DMEIDVQDIHVIDEKL
Subjt:  SR----------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

XP_038877576.1 uncharacterized protein LOC120069830 isoform X1 [Benincasa hispida]2.8e-18870.9Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG
        MSFHEIHSWTLSGL+RAFLD+ VVYFLLCVSATVF PSKILK+VG CLPCPC+GFYGN N NLCF+RL+V+WPKRK+YLVLDLVK RFPFDLIL D  M 
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQLDGIPQ-TTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
        NSNRN+LREN  +DGI Q  +E CC +FS  R QN VDKD EYDGKGKRI+YQRP+TKIRRRR+   +N +LSKG  EGNET +ERE VALVERQ FI D
Subjt:  NSNRNVLRENQQLDGIPQ-TTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D+ ESN +DLG+RTW GFESSGS GENN+MNK S+T+  G SN EER IIRNE SSI LLEQALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL
        NEKASVEMEARQY RVIEEKFAYDEE++NILREILVK++IDYHVLEKEIEAYRQMDFSEKE++K NWDF+LDEH   S T HYSN DPPI HQI NAIS 
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL

Query:  SR---------------------------------------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDE
        SR                                             +EAA   GGF+KSFL RGALQESLEH DHAVNDL SSILDMEIDVQDIHVIDE
Subjt:  SR---------------------------------------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDE

Query:  KLSHGRNERGEK
        KL H  + + EK
Subjt:  KLSHGRNERGEK

XP_038877577.1 uncharacterized protein LOC120069830 isoform X2 [Benincasa hispida]4.3e-18169.53Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG
        MSFHEIHSWTLSGL+RAFLD+ VVYFLLCVSATVF PSKILK+VG CLPCPC+GFYGN N NLCF+RL+V+WPKRK+YLVLDLVK RFPFDLIL D  M 
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQLDGIPQ-TTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
        NSNRN+LREN  +DGI Q  +E CC +FS  R QN VDKD EYDGKGKRI+YQRP+TKIRRRR+   +N +LSKG  EGNET +ERE VALVERQ FI  
Subjt:  NSNRNVLRENQQLDGIPQ-TTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
                  G+RTW GFESSGS GENN+MNK S+T+  G SN EER IIRNE SSI LLEQALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL
        NEKASVEMEARQY RVIEEKFAYDEE++NILREILVK++IDYHVLEKEIEAYRQMDFSEKE++K NWDF+LDEH   S T HYSN DPPI HQI NAIS 
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL

Query:  SR---------------------------------------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDE
        SR                                             +EAA   GGF+KSFL RGALQESLEH DHAVNDL SSILDMEIDVQDIHVIDE
Subjt:  SR---------------------------------------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDE

Query:  KLSHGRNERGEK
        KL H  + + EK
Subjt:  KLSHGRNERGEK

TrEMBL top hitse value%identityAlignment
A0A0A0K8C0 GTD-binding domain-containing protein5.2e-17271.37Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDN-PM
        MSFHEIHSWTLSG+VRAFLD+AVVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N NLCF++L+V+WPKRK+YLVLDLVK  FPFDLIL D+  +
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDN-PM

Query:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VD DGE+DGKGK+I+YQ+PRTKIRRRR+   +N +LSKG  E NETR+ RE VALVERQ FI D
Subjt:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D  ESN +DLG+R W GFESSGS+GEN+ MNKGS+T+  G S  EER IIRNE S+I LLE ALEEE+ ARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL
        NEKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+E+E +K N DFILDEH   SAT HYSN DPPI + I NA+SL
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL

Query:  SR------------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
        SR                  ++AA   GGFEKSFL RGALQ +LEH  HAVNDLG SILDMEIDVQDIHVIDEKL
Subjt:  SR------------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

A0A1S3C7F2 uncharacterized protein LOC103497547 isoform X16.1e-17371.88Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTD-NPM
        MSFHEIHSWTLSGLVRAFLD+AVVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N NLC ++L+V+WPKRK+YLVLDLVK RFPFDL   D   +
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTD-NPM

Query:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VDK GEYDGKGK+++YQ+PRTKIRRRR+   +N +LSKG  EGNETR+ERE VALVERQ FI D
Subjt:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ
        D  ESN  DLG+R W GFESSGS+GENN MNKGS+T+  G SN EER IIRNE S+I LLE ALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQ
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQ

Query:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL
        NEKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+EKE++K N D+ILDEHK  S T H SN DPPI + I NA+SL
Subjt:  NEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISL

Query:  SR----------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
         R                +EAA   GGFEKSFL RGALQ +L+H  HAVNDLG SI+DMEIDVQDIHVIDEKL
Subjt:  SR----------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

A0A5D3E523 Putative myosin-binding protein 5 isoform X26.1e-17372.03Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDL-ILTDNPM
        MSFHEIHSWTLSGLVRAFLD+AVVYFLLCVSAT+F PSKILKVVG CLPCPCTGFYGN N NLC ++L+V+WPKRK+YLVLDLVK RFPFDL  + D  +
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDL-ILTDNPM

Query:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
        GNSNRN+LREN    GI +   E C S +  R QN VDK GEYDGK K+++YQ+PRTKIRRRR+   +N +LSKG  EGNETR+ERE VALVERQ FI D
Subjt:  GNSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG-ISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN
        D  ESN  DLG+R W GFESSGS+GENN MNKGS+T+G  SN EER IIRNE S+I LLE ALEEE+AARASLF+ELEEERAAAA+AADEAIAMITRLQN
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG-ISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN

Query:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLS
        EKAS EMEARQY R +EEKF+YDEE+MNILREILVK++IDYHVLEKEIEAYRQMDF+EKE++K N D+ILDEHK  SAT H SN DPPI + I NA+SL 
Subjt:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLS

Query:  R----------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL
        R                +EAA   GGFEKSFL RGALQ +L+H  HAVNDLG SI+DMEIDVQDIHVIDEKL
Subjt:  R----------------MEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVIDEKL

A0A6J1H6X8 uncharacterized protein LOC111461056 isoform X13.6e-15769.09Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG
        M+FHEIHSWTLSGLVRAF+D+A+VY LLCVSATVF PSKILKVVGLCLPCPCTGFYGN+N NLC +RLLVSWPKRK+ LVL+LVKGRFPFDLIL D+ M 
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPDD
        N+N + L EN   DGIP   E CC + SG   QNS+  D E+DGKGKRI+Y+RPRTKIRRRR T  E+ +LSKG  EG+ETR+ER  VALVE Q F+PDD
Subjt:  NSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPDD

Query:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG--ISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN
          ESN ++L ER W GFESSGSVGEN++MNKGS+TIG   SN +ER I RNEVS I LL+QA EE   ARASLFLELE+ERAAAA+A DEAIAMITRLQN
Subjt:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIG--ISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN

Query:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLS
        EKAS EMEARQY+R +EEK AYDEEEMNILREILVK+EIDYHVLEK+I+AYR MD SEKE++KR WDFILDE +  SATTH +  D              
Subjt:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLS

Query:  RMEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVID
          E   C    +KSF+  G   ESLEH +HAV+DLGSSILDMEIDVQDIH+ID
Subjt:  RMEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVID

A0A6J1KX80 uncharacterized protein LOC111497962 isoform X11.7e-15969.32Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG
        M+FHEIHSWTLSGLVRAF+D+A+VY LLCVSATVF PSK+LKVVGLCLPCPCTGFYGN+N NLC +RLLVSWPKRK+ LVL+LVKGRFPFDLIL D+ M 
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPDD
        N+N + L EN   DGIP+    CC + SG  FQNS+  D E+DGKGKRI+Y+RPRTKIRR+R T  E+ +LSKG  EG+ETR+ERE VALVERQ F+PDD
Subjt:  NSNRNVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPDD

Query:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN
          ESN ++L ER W GFESSGSVGEN+ MNKGS+TI  G SNT+ER I RNEVS I LL+QA EE   ARASLFLELE+ERAAAA+A DEAIAMITRLQN
Subjt:  IKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTI--GISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQN

Query:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLS
        EKAS EMEARQY+R +EEK AYDEEEMNILREILVK+EIDYHVLEK+I+AYR MD SEKE++KR WDFILDE K  SATTH +  D              
Subjt:  EKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLS

Query:  RMEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVID
          E   C    +KSF+  G   ES+EH +HAVNDLGSSILDMEI VQDIH+ID
Subjt:  RMEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDMEIDVQDIHVID

SwissProt top hitse value%identityAlignment
F4HVS6 Probable myosin-binding protein 61.3e-1034.84Show/hide
Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEE---EKAARASLFLELEEERAAAASAADEAIAMITRL
        D+  +     G   + G   S S   +   +  S    + N  E      + +   +L Q  +E   +K +   L++EL+EER+A+A AA+EA+AMITRL
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEE---EKAARASLFLELEEERAAAASAADEAIAMITRL

Query:  QNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQ
        Q EKA+V+MEA QY+R+++E+  YD+E +  +   L K+E +   LE E E YR+
Subjt:  QNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQ

Q0WNW4 Myosin-binding protein 31.2e-1346.81Show/hide
Query:  LEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR
        L + +  E+ A   L+ ELEEER+A+A +A++ +AMITRLQ EKA V+MEA QY+R++EE+  YD+E + +L  ++VK+E +   L++E+E YR
Subjt:  LEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR

Q9CAC4 Myosin-binding protein 21.4e-1238.81Show/hide
Query:  VSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR
        V ++  L+  L+EE+ A  +L+ ELE ER A+A AA E +AMI RL  EKA+++MEA QY+R++EE+  +D+E + +L E++V +E +   LEKE+E YR
Subjt:  VSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR

Query:  QMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVD
        +    E+ E K     +    +  S  ++ +N D
Subjt:  QMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVD

Q9FG14 Myosin-binding protein 71.7e-1035.76Show/hide
Query:  IRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEI
        I NE+    LL + +  ++ +   L+ EL+EER AA++AA EA++MI RLQ +KA ++ME RQ++R  EEK  +D++E+  L +++ K+E     L  E 
Subjt:  IRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEI

Query:  EAY--RQMDF--------SEKEEIKRNWDFILDEHKGPSATTHYSNVDPPI
        +AY  R M F        +EK  + RN   I ++++    T+ Y    PPI
Subjt:  EAY--RQMDF--------SEKEEIKRNWDFILDEHKGPSATTHYSNVDPPI

Q9LMC8 Probable myosin-binding protein 51.5e-1141.38Show/hide
Query:  LEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSE
        L + +  ++ +   L++EL+EER+A+A AA+ A+AMITRLQ EKA+V+MEA QY+R+++E+  YD+E +  +  +LVK+E +   LE  IE YR      
Subjt:  LEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSE

Query:  KEEIKRNWDFILDEHK
        +EE     +F+ +E K
Subjt:  KEEIKRNWDFILDEHK

Arabidopsis top hitse value%identityAlignment
AT1G04890.1 Protein of unknown function, DUF5932.6e-2740.27Show/hide
Query:  KGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVE--RQAFIPDDIKESNQIDLGERT--WHGFESSGSVGENNNMNKGSNTIGISN
        KGKR V +R R  ++  R++   N    +     +E   E     L++   +    DD K+  +   G     W  FE + SV +  N N  +    + N
Subjt:  KGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVE--RQAFIPDDIKESNQIDLGERT--WHGFESSGSVGENNNMNKGSNTIGISN

Query:  TEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYH
         E+R        S+  LE+ L+EE+AARA++ +EL++ER+AAASAADEA+AMI RLQ+EKA++EMEARQ++R++EE+  +D EEM IL++IL+++E + H
Subjt:  TEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYH

Query:  VLEKEIEAYRQMDFSEKEEIK
         LEKE+EAYRQ+   E EE++
Subjt:  VLEKEIEAYRQMDFSEKEEIK

AT4G13160.1 Protein of unknown function, DUF5931.7e-2932.5Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG
        M + E +  T  G++ AF+++A  Y LLCVSA VF  SK+L    L +PC      G QN +LC  +LL  W                PF +IL    + 
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNR-NVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD
         +NR +VL   +Q     Q  EE          +  VDKD                                                            
Subjt:  NSNR-NVLRENQQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPD

Query:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNE
          K S  +D                                             + LLE A+E+EK A+A+L +ELE+ERAA+ASAADEA+AMI RLQ +
Subjt:  DIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNE

Query:  KASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEE
        KAS+EME +QYER+I+EKFAYDEEEMNIL+EIL K+E + H LEKE+E Y+ +D  ++ E
Subjt:  KASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEE

AT4G13630.1 Protein of unknown function, DUF5935.3e-4439.56Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG
        M   E+ SWT  GLV AF+D++V + LLC S  V+  SK L + GL LPCPC G Y       CF   L + P +K+  V   VK R PFD IL +   G
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQLDGIPQTTEECCGSF----SGSRFQNSVD-KDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQA
           R   R   QL+    +T    G F    SG     +   K G +  K KR+ + R     +   ++     +  +G Y+ N+         LV    
Subjt:  NSNRNVLRENQQLDGIPQTTEECCGSF----SGSRFQNSVD-KDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQA

Query:  FIPDDIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITR
           D  K    + L +       S  SVG           +G       G+++  V    + EQ L EE+AARASL LELE+ER AAASAADEA+ MI R
Subjt:  FIPDDIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITR

Query:  LQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEE
        LQ EKAS+EMEARQY+R+IEEK A+D EEM+IL+EIL+++E + H LEKE++ YRQM F E E+
Subjt:  LQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEE

AT4G13630.2 Protein of unknown function, DUF5935.3e-4439.56Show/hide
Query:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG
        M   E+ SWT  GLV AF+D++V + LLC S  V+  SK L + GL LPCPC G Y       CF   L + P +K+  V   VK R PFD IL +   G
Subjt:  MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMG

Query:  NSNRNVLRENQQLDGIPQTTEECCGSF----SGSRFQNSVD-KDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQA
           R   R   QL+    +T    G F    SG     +   K G +  K KR+ + R     +   ++     +  +G Y+ N+         LV    
Subjt:  NSNRNVLRENQQLDGIPQTTEECCGSF----SGSRFQNSVD-KDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQA

Query:  FIPDDIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITR
           D  K    + L +       S  SVG           +G       G+++  V    + EQ L EE+AARASL LELE+ER AAASAADEA+ MI R
Subjt:  FIPDDIKESNQIDLGERTWHGFESSGSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITR

Query:  LQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEE
        LQ EKAS+EMEARQY+R+IEEK A+D EEM+IL+EIL+++E + H LEKE++ YRQM F E E+
Subjt:  LQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYRQMDFSEKEE

AT5G16720.1 Protein of unknown function, DUF5938.8e-1546.81Show/hide
Query:  LEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR
        L + +  E+ A   L+ ELEEER+A+A +A++ +AMITRLQ EKA V+MEA QY+R++EE+  YD+E + +L  ++VK+E +   L++E+E YR
Subjt:  LEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILREILVKKEIDYHVLEKEIEAYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTTCCACGAGATTCATTCTTGGACCTTGTCCGGACTTGTCAGAGCTTTTCTCGACATGGCGGTTGTTTATTTTCTCTTGTGTGTATCGGCGACCGTGTTTTTCCC
GTCCAAGATTTTGAAAGTGGTTGGGTTGTGTTTGCCTTGTCCTTGTACTGGATTTTATGGGAATCAGAACCGTAATCTGTGTTTCTATAGATTGCTTGTTAGTTGGCCGA
AGAGGAAGGTTTATTTGGTGCTCGACTTGGTCAAGGGTAGGTTTCCTTTTGATTTGATTTTGACCGATAACCCAATGGGTAATTCGAATAGGAATGTGTTAAGGGAGAAT
CAGCAGCTGGATGGAATTCCTCAGACTACTGAGGAATGTTGTGGTTCTTTTTCTGGCTCAAGATTTCAGAATTCGGTCGATAAAGATGGTGAATATGATGGTAAGGGTAA
GAGAATTGTGTATCAGAGGCCGAGGACTAAAATCCGACGAAGGAGGAAAACTCCTTTTGAGAATTGGAGATTGTCCAAGGGATTCTATGAGGGGAATGAAACTAGAAGGG
AAAGGGAATCTGTGGCATTGGTTGAGAGACAAGCATTTATTCCAGATGATATTAAAGAATCAAATCAAATTGATTTGGGTGAAAGAACCTGGCATGGCTTTGAATCAAGT
GGTTCAGTAGGCGAAAATAATAATATGAATAAAGGTTCTAACACTATAGGTATCAGTAATACAGAAGAGAGAGGCATTATCAGAAATGAAGTGAGCTCTATTATATTGTT
GGAGCAAGCACTTGAAGAAGAGAAAGCTGCTCGAGCATCTCTGTTTCTGGAACTGGAGGAGGAGAGAGCTGCCGCTGCTTCTGCTGCTGATGAAGCAATAGCCATGATAA
CACGTTTGCAAAATGAGAAGGCATCAGTTGAAATGGAAGCAAGACAGTATGAGAGGGTAATAGAAGAAAAATTTGCTTATGATGAAGAAGAGATGAATATACTTCGAGAG
ATCCTCGTCAAGAAGGAAATAGATTATCATGTTCTGGAGAAGGAAATCGAAGCATATAGGCAGATGGATTTTTCAGAAAAAGAAGAGATAAAAAGAAACTGGGATTTCAT
ATTGGATGAACATAAAGGACCGTCTGCCACTACCCATTACTCAAATGTAGATCCACCCATTTCTCATCAAATTGAAAATGCTATTTCTCTTTCAAGGATGGAGGCAGCTC
AATGTTATGGTGGTTTTGAGAAAAGCTTTCTTCTCCGTGGGGCACTACAAGAAAGTTTGGAGCACAGAGATCACGCAGTTAATGATCTAGGAAGTTCCATTCTTGATATG
GAAATAGATGTTCAAGATATTCATGTGATTGATGAAAAACTCTCACATGGAAGAAATGAAAGAGGAGAGAAAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTTCCACGAGATTCATTCTTGGACCTTGTCCGGACTTGTCAGAGCTTTTCTCGACATGGCGGTTGTTTATTTTCTCTTGTGTGTATCGGCGACCGTGTTTTTCCC
GTCCAAGATTTTGAAAGTGGTTGGGTTGTGTTTGCCTTGTCCTTGTACTGGATTTTATGGGAATCAGAACCGTAATCTGTGTTTCTATAGATTGCTTGTTAGTTGGCCGA
AGAGGAAGGTTTATTTGGTGCTCGACTTGGTCAAGGGTAGGTTTCCTTTTGATTTGATTTTGACCGATAACCCAATGGGTAATTCGAATAGGAATGTGTTAAGGGAGAAT
CAGCAGCTGGATGGAATTCCTCAGACTACTGAGGAATGTTGTGGTTCTTTTTCTGGCTCAAGATTTCAGAATTCGGTCGATAAAGATGGTGAATATGATGGTAAGGGTAA
GAGAATTGTGTATCAGAGGCCGAGGACTAAAATCCGACGAAGGAGGAAAACTCCTTTTGAGAATTGGAGATTGTCCAAGGGATTCTATGAGGGGAATGAAACTAGAAGGG
AAAGGGAATCTGTGGCATTGGTTGAGAGACAAGCATTTATTCCAGATGATATTAAAGAATCAAATCAAATTGATTTGGGTGAAAGAACCTGGCATGGCTTTGAATCAAGT
GGTTCAGTAGGCGAAAATAATAATATGAATAAAGGTTCTAACACTATAGGTATCAGTAATACAGAAGAGAGAGGCATTATCAGAAATGAAGTGAGCTCTATTATATTGTT
GGAGCAAGCACTTGAAGAAGAGAAAGCTGCTCGAGCATCTCTGTTTCTGGAACTGGAGGAGGAGAGAGCTGCCGCTGCTTCTGCTGCTGATGAAGCAATAGCCATGATAA
CACGTTTGCAAAATGAGAAGGCATCAGTTGAAATGGAAGCAAGACAGTATGAGAGGGTAATAGAAGAAAAATTTGCTTATGATGAAGAAGAGATGAATATACTTCGAGAG
ATCCTCGTCAAGAAGGAAATAGATTATCATGTTCTGGAGAAGGAAATCGAAGCATATAGGCAGATGGATTTTTCAGAAAAAGAAGAGATAAAAAGAAACTGGGATTTCAT
ATTGGATGAACATAAAGGACCGTCTGCCACTACCCATTACTCAAATGTAGATCCACCCATTTCTCATCAAATTGAAAATGCTATTTCTCTTTCAAGGATGGAGGCAGCTC
AATGTTATGGTGGTTTTGAGAAAAGCTTTCTTCTCCGTGGGGCACTACAAGAAAGTTTGGAGCACAGAGATCACGCAGTTAATGATCTAGGAAGTTCCATTCTTGATATG
GAAATAGATGTTCAAGATATTCATGTGATTGATGAAAAACTCTCACATGGAAGAAATGAAAGAGGAGAGAAAAGTTGA
Protein sequenceShow/hide protein sequence
MSFHEIHSWTLSGLVRAFLDMAVVYFLLCVSATVFFPSKILKVVGLCLPCPCTGFYGNQNRNLCFYRLLVSWPKRKVYLVLDLVKGRFPFDLILTDNPMGNSNRNVLREN
QQLDGIPQTTEECCGSFSGSRFQNSVDKDGEYDGKGKRIVYQRPRTKIRRRRKTPFENWRLSKGFYEGNETRRERESVALVERQAFIPDDIKESNQIDLGERTWHGFESS
GSVGENNNMNKGSNTIGISNTEERGIIRNEVSSIILLEQALEEEKAARASLFLELEEERAAAASAADEAIAMITRLQNEKASVEMEARQYERVIEEKFAYDEEEMNILRE
ILVKKEIDYHVLEKEIEAYRQMDFSEKEEIKRNWDFILDEHKGPSATTHYSNVDPPISHQIENAISLSRMEAAQCYGGFEKSFLLRGALQESLEHRDHAVNDLGSSILDM
EIDVQDIHVIDEKLSHGRNERGEKS