CuGenDBv2

Sed0006189 (gene) of Chayote v1 genome

Gene ID: Sed0006189
Organism: Sechium edule (Chayote v1)
Description: GTD-binding domain-containing protein
Genome location: LG05:5447841..5454320
RNA-Seq Expression: Sed0006189
Synteny: Sed0006189
Gene Ontology terms: NA
InterPro domains: IPR007656 - GTD-binding domain
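
The genome location above has the form seqid:start..end. As a minimal sketch, the string can be split into its parts and the span of the gene computed. The helper names (Region, parse_location) are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of the database)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as 'LG05:5447841..5454320'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Region(seqid, start, end)


region = parse_location("LG05:5447841..5454320")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the locus spans 6480 bp on LG05; with half-open coordinates the figure would be one less.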


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008458006.1 PREDICTED: uncharacterized protein LOC103497547 isoform X1 [Cucumis melo] | e value: 1.8e-139 | % identity: 60.69
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMD-HQT
        MSFHEIHSWTLS L+RAFLDLA VYFLL VSAT+FIPSKIL+VVGFCLPC CTGFYGN N N C H L V WPK KIY V  LVK  FPFDL  MD  Q 
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMD-HQT

Query:  GNSNGSL---DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVN
        GNSN ++   +GI +LQSE CCS++   R +N VD   +Y+GKGK+ MYQ+PR KIRRRRR +VDN KL  GICEGN TRK+ E VALVERQ+ + DD N
Subjt:  GNSNGSL---DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVN

Query:  ESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEK
        ESNH DLG+R W GFESS  +G+N+Y NKGSS++ QG + A E DI R      R LE ALEE +AARASL VELE+ERAAAA+AADEAIAMITRLQNEK
Subjt:  ESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEK

Query:  ALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSRK
        A  EMEARQY+R +EEKF+YDEE+M+ILREILVK++IDYHVLEKEIEAY Q D                          C N + PI + I NA+SL R 
Subjt:  ALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSRK

Query:  AKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDC-DFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKHHMEDIKEGRKVD
         KLN            +  LL     +D  +    C  FEK FLS  ALQ + ++I HAVNDLG SI+DM+I+VQDIHVIDEK HMED K  RKVD
Subjt:  AKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDC-DFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKHHMEDIKEGRKVD

XP_022953644.1 probable myosin-binding protein 6 [Cucurbita moschata] | e value: 1.4e-139 | % identity: 59.72
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG
        M+FHEIHSWT   L+RAFLDLA VYFLL VSATVFIPSKILEVVGFCLPC CTGFYGNQNPN C H L   WPK KIY V   VK  FPFDLIL+D Q G
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG

Query:  NS------NGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADD
         +      N   DG+P LQS  CCS                   KG R M+QRP     RRRR SV+  KLF         RK+ E +AL ++Q+ + DD
Subjt:  NS------NGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADD

Query:  VNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDI-----NRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNE
        VNES HMDLG RTW GFESS  VG+NSY NKGSS+I  G N   E DI     + R LE+ALEE KAARASL +ELE+ERAAAA+AADEAIAMITRLQNE
Subjt:  VNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDI-----NRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNE

Query:  KALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSR
        KA VEMEARQY RVIEEKFAYDEEEM+ILREILV++EIDYHVLEKEIEAY Q D                            N + P+VH+IENA+SLS 
Subjt:  KALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSR

Query:  KAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCD----------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKH
        KAK N +N CNSQ HFNEE LLKQT WTDKD+EL D                  FEK  LS  ALQ   E+IDH +NDLGSSILDM+I+VQDIHVIDEK 
Subjt:  KAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCD----------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKH

Query:  HMEDIKEGR
        HM+D  E R
Subjt:  HMEDIKEGR

XP_023548420.1 probable myosin-binding protein 6 [Cucurbita pepo subsp. pepo] | e value: 3.1e-139 | % identity: 59.53
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG
        M+FHEIHSWT   L+RAFLDLA VYFLL VSATVFIPSKILEVVGFCLPC CTGFYGNQNPN C H L   WPK KIY V   VK  FPFDLIL+D Q G
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG

Query:  NS------NGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADD
         +      N   DG+P LQS  CCS                   KG R M+QRP     RRRR SV+  KLF         RK+ E +AL ++Q+ + DD
Subjt:  NS------NGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADD

Query:  VNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDI-----NRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNE
        VNES HMDLG RTW GF+SS  VG+NSY NKGSS+I  G N   E DI     + R LE+ALEE KAARASL +ELE+ERAAAA+AADEAIAMITRLQNE
Subjt:  VNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDI-----NRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNE

Query:  KALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSR
        KA VEMEARQY RVIEEKFAYDEEEM+ILREILV++EIDYHVLEKEIEAY Q D                            N + P+VH+IENA+SLS 
Subjt:  KALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSR

Query:  KAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCD----------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKH
        KAK N +N CNSQ HFNEE LLKQT WTDKD+EL D                  FEK FLS  ALQ   E+IDH +NDLGSSI DM+I+VQDIHVIDEK 
Subjt:  KAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCD----------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKH

Query:  HMEDIKEGR
        HM+D  E R
Subjt:  HMEDIKEGR

XP_038877576.1 uncharacterized protein LOC120069830 isoform X1 [Benincasa hispida] | e value: 4.0e-163 | % identity: 64.79
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG
        MSFHEIHSWTLS LIRAFLDL  VYFLL VSATVFIPSKIL++VGFCLPC C+GFYGN N N CFH L V WPK KIY V  LVK  FPFDLILMD Q  
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG

Query:  NSNGSL-------DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLAD
        NSN +L       DGI Q QSE CCS+ S  R +N VD D +Y+GKGKR MYQRP+ KIRRRRR ++DN KL  GICEGN T K+ E VALVERQ+ + D
Subjt:  NSNGSL-------DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLAD

Query:  DVNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQ
        DVNESNHMDLG+RTW GFESS   G+N++ NK SS++ QG + A E DI R      R LE+ALEE +AARASL VELE+ERAAAA+AADEAIAMITRLQ
Subjt:  DVNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQ

Query:  NEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISL
        NEKA VEMEARQY+RVIEEKFAYDEE+++ILREILVK++IDYHVLEKEIEAY Q D                            N + PIVH+I NAIS 
Subjt:  NEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISL

Query:  SRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRD----CD------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDE
        SRKAKLN TN C SQ HFNEE  LKQT W DK++EL+D    CD            F+K FLS  ALQES E++DHAVNDL SSILDM+I+VQDIHVIDE
Subjt:  SRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRD----CD------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDE

Query:  KHHMEDIKEGRKVD
        K HMED K+ +KVD
Subjt:  KHHMEDIKEGRKVD

XP_038877577.1 uncharacterized protein LOC120069830 isoform X2 [Benincasa hispida] | e value: 7.6e-154 | % identity: 62.65
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG
        MSFHEIHSWTLS LIRAFLDL  VYFLL VSATVFIPSKIL++VGFCLPC C+GFYGN N N CFH L V WPK KIY V  LVK  FPFDLILMD Q  
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG

Query:  NSNGSL-------DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLAD
        NSN +L       DGI Q QSE CCS+ S  R +N VD D +Y+GKGKR MYQRP+ KIRRRRR ++DN KL  GICEGN T K+ E VALVERQ+ +  
Subjt:  NSNGSL-------DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLAD

Query:  DVNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQ
                  G+RTW GFESS   G+N++ NK SS++ QG + A E DI R      R LE+ALEE +AARASL VELE+ERAAAA+AADEAIAMITRLQ
Subjt:  DVNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQ

Query:  NEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISL
        NEKA VEMEARQY+RVIEEKFAYDEE+++ILREILVK++IDYHVLEKEIEAY Q D                            N + PIVH+I NAIS 
Subjt:  NEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISL

Query:  SRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRD----CD------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDE
        SRKAKLN TN C SQ HFNEE  LKQT W DK++EL+D    CD            F+K FLS  ALQES E++DHAVNDL SSILDM+I+VQDIHVIDE
Subjt:  SRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRD----CD------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDE

Query:  KHHMEDIKEGRKVD
        K HMED K+ +KVD
Subjt:  KHHMEDIKEGRKVD

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0K8C0 GTD-binding domain-containing protein | e value: 2.0e-139 | % identity: 61.09
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILM-DHQT
        MSFHEIHSWTLS ++RAFLDLA VYFLL VSAT+FIPSKIL+VVGFCLPC CTGFYGN N N CFH L V WPK KIY V  LVK MFPFDLILM D + 
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILM-DHQT

Query:  GNSNGSL---DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVN
        GNSN +L   +GI +LQSE CCS  +  R +N VDND +++GKGK+ MYQ+PR KIRRRRR ++DN KL  G+CE N TRK  E VALVERQ+ + DD N
Subjt:  GNSNGSL---DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVN

Query:  ESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEK
        ESNH+DLG+R W GFESS  +G+NSY NKGSS++ QG + A E  I R      R LE ALEE + ARASL VELE+ERAAAA+AADEAIAMITRLQNEK
Subjt:  ESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEK

Query:  ALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSRK
        A  EMEARQY+R +EEKF+YDEE+M+ILREILVK++IDYHVLEKEIEAY Q D                            N + PIV+ I NA+SLSR+
Subjt:  ALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSRK

Query:  AKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCDFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKHHMEDIK-EGRKVD
        AKLN            +  LL      D         FEK FLS  ALQ + E+I HAVNDLG SILDM+I+VQDIHVIDEK HME  K E RK D
Subjt:  AKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCDFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKHHMEDIK-EGRKVD

A0A1S3C7F2 uncharacterized protein LOC103497547 isoform X1 | e value: 8.8e-140 | % identity: 60.69
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMD-HQT
        MSFHEIHSWTLS L+RAFLDLA VYFLL VSAT+FIPSKIL+VVGFCLPC CTGFYGN N N C H L V WPK KIY V  LVK  FPFDL  MD  Q 
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMD-HQT

Query:  GNSNGSL---DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVN
        GNSN ++   +GI +LQSE CCS++   R +N VD   +Y+GKGK+ MYQ+PR KIRRRRR +VDN KL  GICEGN TRK+ E VALVERQ+ + DD N
Subjt:  GNSNGSL---DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVN

Query:  ESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEK
        ESNH DLG+R W GFESS  +G+N+Y NKGSS++ QG + A E DI R      R LE ALEE +AARASL VELE+ERAAAA+AADEAIAMITRLQNEK
Subjt:  ESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEK

Query:  ALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSRK
        A  EMEARQY+R +EEKF+YDEE+M+ILREILVK++IDYHVLEKEIEAY Q D                          C N + PI + I NA+SL R 
Subjt:  ALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSRK

Query:  AKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDC-DFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKHHMEDIKEGRKVD
         KLN            +  LL     +D  +    C  FEK FLS  ALQ + ++I HAVNDLG SI+DM+I+VQDIHVIDEK HMED K  RKVD
Subjt:  AKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDC-DFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKHHMEDIKEGRKVD

A0A5D3E523 Putative myosin-binding protein 5 isoform X2 | e value: 1.1e-137 | % identity: 60.69
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILM-DHQT
        MSFHEIHSWTLS L+RAFLDLA VYFLL VSAT+FIPSKIL+VVGFCLPC CTGFYGN N N C H L V WPK KIY V  LVK  FPFDL  M D Q 
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILM-DHQT

Query:  GNSNGSL---DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVN
        GNSN +L   +GI +LQSE CCS++   R +N VD   +Y+GK K+ MYQ+PR KIRRRRR +VDN KL  GICEGN TRK+ E VALVERQ+ + DD N
Subjt:  GNSNGSL---DGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVN

Query:  ESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEK
        ESNH DLG+R W GFESS  +G+N+Y NKGSS++ Q  N A E DI R      R LE ALEE +AARASL VELE+ERAAAA+AADEAIAMITRLQNEK
Subjt:  ESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINR------RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEK

Query:  ALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSRK
        A  EMEARQY+R +EEKF+YDEE+M+ILREILVK++IDYHVLEKEIEAY Q D                          C N + PI + I NA+SL R 
Subjt:  ALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSRK

Query:  AKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDC-DFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKHHMEDIKEGRKVD
         KLN            +  LL     +D  +    C  FEK FLS  ALQ + ++I HAVNDLG SI+DM+I+VQDIHVIDEK HMED K  RKVD
Subjt:  AKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDC-DFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKHHMEDIKEGRKVD

A0A6J1GQ95 probable myosin-binding protein 6 | e value: 6.7e-140 | % identity: 59.72
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG
        M+FHEIHSWT   L+RAFLDLA VYFLL VSATVFIPSKILEVVGFCLPC CTGFYGNQNPN C H L   WPK KIY V   VK  FPFDLIL+D Q G
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG

Query:  NS------NGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADD
         +      N   DG+P LQS  CCS                   KG R M+QRP     RRRR SV+  KLF         RK+ E +AL ++Q+ + DD
Subjt:  NS------NGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADD

Query:  VNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDI-----NRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNE
        VNES HMDLG RTW GFESS  VG+NSY NKGSS+I  G N   E DI     + R LE+ALEE KAARASL +ELE+ERAAAA+AADEAIAMITRLQNE
Subjt:  VNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDI-----NRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNE

Query:  KALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSR
        KA VEMEARQY RVIEEKFAYDEEEM+ILREILV++EIDYHVLEKEIEAY Q D                            N + P+VH+IENA+SLS 
Subjt:  KALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSR

Query:  KAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCD----------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKH
        KAK N +N CNSQ HFNEE LLKQT WTDKD+EL D                  FEK  LS  ALQ   E+IDH +NDLGSSILDM+I+VQDIHVIDEK 
Subjt:  KAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCD----------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKH

Query:  HMEDIKEGR
        HM+D  E R
Subjt:  HMEDIKEGR

A0A6J1JSM2 probable myosin-binding protein 6 | e value: 2.2e-138 | % identity: 58.94
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG
        M+FHEIHSWT   L+RAFL+LA VYFLL VSATVFIPSKIL+VVGFCLPC CTGFYGNQNPN C H L   WPK KIY V   VK  FPFDLIL+D Q G
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG

Query:  NS------NGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADD
         +      N   DG+P LQS  CCS                   KG R M+QRP     RRRR SV+  KLF         RK+ E +AL ++Q+ + DD
Subjt:  NS------NGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADD

Query:  VNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDI-----NRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNE
        VNES HMDLG RTW GFESS LVG+NS  NKGSS+I  G N   E DI     + R LE+ALEE KAARASL +ELE+ERAAAA+AADEAIAMITRLQNE
Subjt:  VNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDI-----NRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNE

Query:  KALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSR
        KA VEMEARQY R+IEEKFAYDEEEM+ILREILV++EIDYHVLEKEIEAY Q D                            N + P+VH+IENA+SLS 
Subjt:  KALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD--------------------------CFNCNSPIVHKIENAISLSR

Query:  KAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCD----------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKH
        KAK N +N CNSQ HFNEE LLKQT WTDKD+EL D                  FEK FLS   +Q   E+IDH +NDLGSSILDM+I+VQDIHVIDEK 
Subjt:  KAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCD----------------FEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDIHVIDEKH

Query:  HMEDIKEGR
        HM+D +E R
Subjt:  HMEDIKEGR

SwissProt top hits (columns: e value, % identity, alignment)
F4HVS6 Probable myosin-binding protein 6 | e value: 1.3e-10 | % identity: 44.55
Query:  ESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEA
        ES +N+  L++ +   K +   L +EL++ER+A+A AA+EA+AMITRLQ EKA V+MEA QY R+++E+  YD+E +  +   L K+E +   LE E E 
Subjt:  ESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEA

Query:  Y
        Y
Subjt:  Y

F4HXQ7 Myosin-binding protein 1 | e value: 3.5e-08 | % identity: 36.8
Query:  LVGKNSYTNKGSSSIVQGINYAH-ESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEE
        L+ K       S+  ++G++    E +     L+R ++  +     L  ELE+ER+A+A A ++A+AMITRLQ EKA  +MEA Q  R++EE+  YD E 
Subjt:  LVGKNSYTNKGSSSIVQGINYAH-ESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEE

Query:  MSILREILVKKEIDYHVLEKEIEAY
        +  L ++LV++E     LE EIE +
Subjt:  MSILREILVKKEIDYHVLEKEIEAY

Q0WNW4 Myosin-binding protein 3 | e value: 2.3e-12 | % identity: 33.33
Query:  RFLERALEEGKA---ARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQ
        R +ER  E  +A   A   L  ELE+ER+A+A +A++ +AMITRLQ EKA V+MEA QY R++EE+  YD+E + +L  ++VK+E +   L++E+E Y  
Subjt:  RFLERALEEGKA---ARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQ

Query:  TDCFNCNSPIVHKIENAISLSRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELR----DCDFEKCFLSGRALQESSE
                    K+    S ++   +   N C +     EE   ++ N ++ D +L     DC  +   + G +L E  E
Subjt:  TDCFNCNSPIVHKIENAISLSRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELR----DCDFEKCFLSGRALQESSE

Q9CAC4 Myosin-binding protein 2 | e value: 5.7e-11 | % identity: 46.24
Query:  LERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAY
        L+  L+E + A  +L  ELE ER A+A AA E +AMI RL  EKA ++MEA QY R++EE+  +D+E + +L E++V +E +   LEKE+E Y
Subjt:  LERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAY

Q9LMC8 Probable myosin-binding protein 5 | e value: 2.2e-10 | % identity: 28.63
Query:  MDHQTGNSNGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKG-ELVALVERQESLAD
        ++++T NSN   D   + Q   CC  +   ++     N+  + G    +    PR     +R   + N+K      + +    KG  L A+ +R  S   
Subjt:  MDHQTGNSNGSLDGIPQLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKG-ELVALVERQESLAD

Query:  DVNESNHMDLGERTWHGFESSSLVGKNSYTNKG--SSSIVQGINYAHESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKA
          N+   + L +   +    S    K S+ ++    S ++ G       D   + L R +   + +   L +EL++ER+A+A AA+ A+AMITRLQ EKA
Subjt:  DVNESNHMDLGERTWHGFESSSLVGKNSYTNKG--SSSIVQGINYAHESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKA

Query:  LVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAY
         V+MEA QY R+++E+  YD+E +  +  +LVK+E +   LE  IE Y
Subjt:  LVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAY

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G04890.1 Protein of unknown function, DUF593 | e value: 5.8e-27 | % identity: 34.2
Query:  KGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLA-DDVNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYA
        KGKR + +R R  ++  RR+   N      I       +  +   +    E+ + DD  +      G    H  E    V  N   N  SS+    +   
Subjt:  KGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLA-DDVNESNHMDLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYA

Query:  HESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIE
         +  +  R LE  L+E +AARA++CVEL+KER+AAASAADEA+AMI RLQ+EKA +EMEARQ+ R++EE+  +D EEM IL++IL+++E + H LEKE+E
Subjt:  HESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIE

Query:  AYWQ----TDCFNC------NSPIVHKIENAISLSRKAKL-NATNGCNSQSHFNEERLLKQTNWTDKDDELRDCDFEKCFLSGR---ALQESSENIDHAV
        AY Q    T+   C      N P     +N     R+A L    +G      + EE         DK+ +L   D E  +   R    +++ +ENI    
Subjt:  AYWQ----TDCFNC------NSPIVHKIENAISLSRKAKL-NATNGCNSQSHFNEERLLKQTNWTDKDDELRDCDFEKCFLSGR---ALQESSENIDHAV

Query:  NDLGSSI
        N   SS+
Subjt:  NDLGSSI

AT4G13160.1 Protein of unknown function, DUF593 | e value: 3.2e-25 | % identity: 64.65
Query:  RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD
        R LE A+E+ K A+A+L VELE+ERAA+ASAADEA+AMI RLQ +KA +EME +QY R+I+EKFAYDEEEM+IL+EIL K+E + H LEKE+E Y   D
Subjt:  RFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQTD

AT4G13160.1 Protein of unknown function, DUF593 | e value: 1.1e-04 | % identity: 37.08
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFP
        M + E +  T   ++ AF++LAF Y LL VSA VFI SK+L      +PC      G QN + C   L   WP   I  V +L     P
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFP

AT4G13630.1 Protein of unknown function, DUF593 | e value: 4.2e-33 | % identity: 35.98
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG
        M   E+ SWT   L+ AF+DL+  + LL  S  V++ SK L + G  LPC C G Y       CF       P  KI SV + VK   PFD IL  +  G
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG

Query:  NSNGSLDGIPQLQSEDCCSSLSGSRNRNSVDN-DI---------KYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQES
                  QL+ E   ++ S  +  N     D+          ++ K KR  + R     +   ++ +  +K F G  + N               + 
Subjt:  NSNGSLDGIPQLQSEDCCSSLSGSRNRNSVDN-DI---------KYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQES

Query:  LADDVNESNHM--DLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQN
        L  + N+S     D+  R     +S SL        +G+ S    +    E        E+ L E +AARASL +ELEKER AAASAADEA+ MI RLQ 
Subjt:  LADDVNESNHM--DLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQN

Query:  EKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQ
        EKA +EMEARQY R+IEEK A+D EEMSIL+EIL+++E + H LEKE++ Y Q
Subjt:  EKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQ

AT4G13630.2 Protein of unknown function, DUF593 | e value: 4.2e-33 | % identity: 35.98
Query:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG
        M   E+ SWT   L+ AF+DL+  + LL  S  V++ SK L + G  LPC C G Y       CF       P  KI SV + VK   PFD IL  +  G
Subjt:  MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTG

Query:  NSNGSLDGIPQLQSEDCCSSLSGSRNRNSVDN-DI---------KYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQES
                  QL+ E   ++ S  +  N     D+          ++ K KR  + R     +   ++ +  +K F G  + N               + 
Subjt:  NSNGSLDGIPQLQSEDCCSSLSGSRNRNSVDN-DI---------KYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQES

Query:  LADDVNESNHM--DLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQN
        L  + N+S     D+  R     +S SL        +G+ S    +    E        E+ L E +AARASL +ELEKER AAASAADEA+ MI RLQ 
Subjt:  LADDVNESNHM--DLGERTWHGFESSSLVGKNSYTNKGSSSIVQGINYAHESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQN

Query:  EKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQ
        EKA +EMEARQY R+IEEK A+D EEMSIL+EIL+++E + H LEKE++ Y Q
Subjt:  EKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQ

AT5G16720.1 Protein of unknown function, DUF593 | e value: 1.6e-13 | % identity: 33.33
Query:  RFLERALEEGKA---ARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQ
        R +ER  E  +A   A   L  ELE+ER+A+A +A++ +AMITRLQ EKA V+MEA QY R++EE+  YD+E + +L  ++VK+E +   L++E+E Y  
Subjt:  RFLERALEEGKA---ARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYHVLEKEIEAYWQ

Query:  TDCFNCNSPIVHKIENAISLSRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELR----DCDFEKCFLSGRALQESSE
                    K+    S ++   +   N C +     EE   ++ N ++ D +L     DC  +   + G +L E  E
Subjt:  TDCFNCNSPIVHKIENAISLSRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELR----DCDFEKCFLSGRALQESSE


Sequences
CDS sequence
ATGTCGTTCCACGAGATTCATTCGTGGACCTTGTCTGCACTAATCAGGGCTTTTCTTGACCTAGCTTTTGTTTATTTTCTCTTATTCGTATCGGCCACTGTTTTTATCCC
ATCCAAGATTTTGGAAGTTGTTGGGTTTTGCTTGCCCTGTTCTTGTACAGGGTTTTATGGAAATCAGAACCCTAATTTCTGTTTCCATTCACTGTTTGTTATTTGGCCCA
AAGGGAAGATTTATTCGGTGTTCCAATTGGTTAAGGCTATGTTTCCTTTTGATTTGATTTTGATGGATCATCAAACGGGTAATTCTAATGGGAGTTTGGATGGAATTCCT
CAGTTGCAGTCTGAGGATTGCTGTAGTTCTTTATCTGGTTCGAGAAATCGGAATTCGGTTGATAACGATATTAAATATGAGGGTAAGGGTAAGAGGACTATGTATCAGAG
GCCGAGGGCTAAAATCCGACGGCGGAGGAGAACCTCTGTTGATAATGTGAAATTGTTTATGGGAATCTGTGAGGGGAATGGAACTAGAAAGAAAGGAGAATTGGTGGCAT
TGGTTGAGAGGCAAGAGTCTCTTGCAGATGATGTTAATGAATCAAATCACATGGACTTGGGTGAAAGAACCTGGCACGGCTTTGAATCAAGTAGTTTGGTTGGCAAAAAT
AGTTATACGAATAAAGGTTCTTCGTCTATAGTACAAGGTATCAATTATGCTCATGAGAGTGACATTAACAGACGATTTTTGGAGCGAGCACTTGAAGAAGGGAAAGCTGC
TCGAGCATCTCTTTGTGTGGAACTTGAGAAGGAGAGAGCTGCCGCTGCTTCTGCAGCAGACGAAGCAATAGCCATGATAACACGTTTGCAAAATGAGAAGGCCTTAGTTG
AAATGGAAGCAAGACAATATTATAGGGTAATAGAAGAAAAATTTGCTTATGATGAAGAAGAGATGAGTATCCTAAGAGAGATACTTGTCAAGAAGGAAATAGATTATCAT
GTTCTCGAGAAGGAAATTGAAGCATATTGGCAGACGGATTGCTTCAACTGTAATTCACCCATTGTTCATAAAATTGAAAATGCTATTTCTCTTTCAAGGAAAGCAAAGTT
GAATGCAACTAACGGTTGTAACTCTCAATCCCATTTTAATGAAGAAAGGTTGCTTAAGCAAACTAACTGGACGGATAAAGACGATGAACTGAGGGATTGTGATTTCGAGA
AATGTTTTCTTTCCGGTAGGGCACTGCAAGAAAGTTCGGAGAACATAGATCACGCAGTTAATGATCTAGGAAGTTCCATTCTTGATATGCAAATAAATGTTCAAGATATT
CATGTGATTGATGAAAAACACCACATGGAAGACATAAAAGAGGGAAGGAAAGTCGATTATTGA
mRNA sequence
ATTCCTTGTAAAAAGCTTTGTAGCAATAATGACGATGTGATCCTCAAATCATCATCTTCGTCTTCTTCCTCTGCCCCTTGAATTCACAATGAACACCTGATTTTTCAAAA
TTTTCTGCAAATTCATTCATACCCTTTTCTCCATCCTCTCTTTAATCCAGGTTTTTCTTCCCCCTTTTCCCCCTTTCTTGAATCAAACCCATTTGCGCCAATCCAGATGT
GTTTCTTCATTTCTTAGTTCCAAATTTAACGAAAAAGATTTCTGGGTTTTCTCTGGCGGGATTTTATGCTTTGAAATCCATGTCGTTCCACGAGATTCATTCGTGGACCT
TGTCTGCACTAATCAGGGCTTTTCTTGACCTAGCTTTTGTTTATTTTCTCTTATTCGTATCGGCCACTGTTTTTATCCCATCCAAGATTTTGGAAGTTGTTGGGTTTTGC
TTGCCCTGTTCTTGTACAGGGTTTTATGGAAATCAGAACCCTAATTTCTGTTTCCATTCACTGTTTGTTATTTGGCCCAAAGGGAAGATTTATTCGGTGTTCCAATTGGT
TAAGGCTATGTTTCCTTTTGATTTGATTTTGATGGATCATCAAACGGGTAATTCTAATGGGAGTTTGGATGGAATTCCTCAGTTGCAGTCTGAGGATTGCTGTAGTTCTT
TATCTGGTTCGAGAAATCGGAATTCGGTTGATAACGATATTAAATATGAGGGTAAGGGTAAGAGGACTATGTATCAGAGGCCGAGGGCTAAAATCCGACGGCGGAGGAGA
ACCTCTGTTGATAATGTGAAATTGTTTATGGGAATCTGTGAGGGGAATGGAACTAGAAAGAAAGGAGAATTGGTGGCATTGGTTGAGAGGCAAGAGTCTCTTGCAGATGA
TGTTAATGAATCAAATCACATGGACTTGGGTGAAAGAACCTGGCACGGCTTTGAATCAAGTAGTTTGGTTGGCAAAAATAGTTATACGAATAAAGGTTCTTCGTCTATAG
TACAAGGTATCAATTATGCTCATGAGAGTGACATTAACAGACGATTTTTGGAGCGAGCACTTGAAGAAGGGAAAGCTGCTCGAGCATCTCTTTGTGTGGAACTTGAGAAG
GAGAGAGCTGCCGCTGCTTCTGCAGCAGACGAAGCAATAGCCATGATAACACGTTTGCAAAATGAGAAGGCCTTAGTTGAAATGGAAGCAAGACAATATTATAGGGTAAT
AGAAGAAAAATTTGCTTATGATGAAGAAGAGATGAGTATCCTAAGAGAGATACTTGTCAAGAAGGAAATAGATTATCATGTTCTCGAGAAGGAAATTGAAGCATATTGGC
AGACGGATTGCTTCAACTGTAATTCACCCATTGTTCATAAAATTGAAAATGCTATTTCTCTTTCAAGGAAAGCAAAGTTGAATGCAACTAACGGTTGTAACTCTCAATCC
CATTTTAATGAAGAAAGGTTGCTTAAGCAAACTAACTGGACGGATAAAGACGATGAACTGAGGGATTGTGATTTCGAGAAATGTTTTCTTTCCGGTAGGGCACTGCAAGA
AAGTTCGGAGAACATAGATCACGCAGTTAATGATCTAGGAAGTTCCATTCTTGATATGCAAATAAATGTTCAAGATATTCATGTGATTGATGAAAAACACCACATGGAAG
ACATAAAAGAGGGAAGGAAAGTCGATTATTGATTCAACTGGCATGAAATGGTCCTACGGTTATGCTTTCACCTTTGGAACATTTGATGCATAAAGCATGGTTTGGAAGTC
TAAATCAACCATAGAAGTAGAATTTTGGGCTGTTGACTTTTTTCATAGTTTGGTGCAATCTTCCCCAGAAGAATTGACTTCAGTAAATATTGAAAGGTTAGTGTAGTGTT
GTCGGGTTAGAAGGGTCAGAACTTCTCCATCTGTGAAGAAAGAAAAGAGGATAATGGAAGTTCCTGTTGAAATATCAAACTCCTAAGGCTAAGATATTTGACTGCCGAGA
TCCTTTTTAACAGGTGAGGGGGCAAGATTCATATGTGGATTGTAGTTTCTTTGTAAAGATTTATTTCTTTTTAGTTTTTTCCTGTTTTTTTTTTCTTTTTCCAGCCAATT
TTATAAGATGAAAAGATGGGATATTCTCAACTTTTTCTTTCTTCACCCTTCTAACTTCCTTTGTAAATTTTCTTTTACAAAAAGCAAAAAAAAAAAAAAAAAAAAAAAAC
CTATATTTCTAGCTTCCCTATGAGATAGGTACCATGTCTTGATATTATTGTGTTTGTAATATGTTGTAAGAGTACATACTGTTCCTTCAAAGAAATACAATTGTAT
Protein sequence
MSFHEIHSWTLSALIRAFLDLAFVYFLLFVSATVFIPSKILEVVGFCLPCSCTGFYGNQNPNFCFHSLFVIWPKGKIYSVFQLVKAMFPFDLILMDHQTGNSNGSLDGIP
QLQSEDCCSSLSGSRNRNSVDNDIKYEGKGKRTMYQRPRAKIRRRRRTSVDNVKLFMGICEGNGTRKKGELVALVERQESLADDVNESNHMDLGERTWHGFESSSLVGKN
SYTNKGSSSIVQGINYAHESDINRRFLERALEEGKAARASLCVELEKERAAAASAADEAIAMITRLQNEKALVEMEARQYYRVIEEKFAYDEEEMSILREILVKKEIDYH
VLEKEIEAYWQTDCFNCNSPIVHKIENAISLSRKAKLNATNGCNSQSHFNEERLLKQTNWTDKDDELRDCDFEKCFLSGRALQESSENIDHAVNDLGSSILDMQINVQDI
HVIDEKHHMEDIKEGRKVDY
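
The CDS and protein sequences listed above should be related by codon-by-codon translation. The following is a minimal sketch of that check, assuming the standard nuclear genetic code (NCBI translation table 1); the record itself does not state which table was used, and the function name translate is hypothetical. It is demonstrated on the first 30 nucleotides of the CDS only.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Assumption: this is the table that applies to the nuclear gene shown here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed in this record.
print(translate("ATGTCGTTCCACGAGATTCATTCGTGGACC"))  # MSFHEIHSWT
```

Passing the full CDS (with its line breaks joined) to translate and comparing the result with the protein sequence is a quick consistency check on the record. The final codons GTC GAT TAT TGA give V, D, Y and a stop, in agreement with the protein ending in VDY.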