; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002456 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002456
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionGTD-binding domain-containing protein
Genome locationscaffold318:171628..173930
RNA-Seq ExpressionMS002456
SyntenyMS002456
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0080115 - myosin XI tail binding (molecular function)
InterPro domainsIPR007656 - GTD-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575980.1 Myosin-binding protein 3, partial [Cucurbita argyrosperma subsp. sororia]5.4e-15563.71Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        M+ HEIH WT  GL+ AFLDLA+VY LLC+SAT FIP KIL+++G  LPCPC+GFYG QN NLC  RLLF+WPKRKI  VL  VK RFPFDL+LVDDQ+G
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKLQ----TKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERT
         +   L+    T GVP  Q            KG R+M+QRPR      RRA+VE G+L          RKE E +AL ++ +FI D +NES H+ L  RT
Subjt:  NSNRKLQ----TKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERT

Query:  WQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQ
        WQG ESSGSVGEN+Y++KGSSTIG  T+N +ER I  NE  SI LLE+ALEEEKAARASL++ELEEERAAAATAADEAIAMI RLQNEKASVEMEARQYQ
Subjt:  WQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQ

Query:  RVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNS
        RVIEEKFAYDEEEMN+LREILV+REIDYHVLEKEIEAYR+MDFSE+E+  RNWDFILD+HKEQS+T  YSNGDPP+V QIENA+SLS KAK NE+ S NS
Subjt:  RVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNS

Query:  QCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKED
        QCHFNE GLL+QT WT++DNE+ DNSL+ E    + A                        H +  LGSS LDMEIDVQDIHVIDEKLH+ D  E+
Subjt:  QCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKED

XP_022150079.1 uncharacterized protein LOC111018343 [Momordica charantia]1.1e-25999.37Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKLQTKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGS
        NSNRKLQTKGVPQSQQNLAD+DGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGS
Subjt:  NSNRKLQTKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGS

Query:  ESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIE
        ESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIE
Subjt:  ESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIE

Query:  EKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNSQCHF
        EKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNSQCHF
Subjt:  EKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNSQCHF

Query:  NEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN
        NEGGLLEQTTWTNEDNELKDNSLVCEHF +KAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN
Subjt:  NEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN

XP_022953644.1 probable myosin-binding protein 6 [Cucurbita moschata]1.9e-15563.91Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        M+ HEIH WT  GL+ AFLDLA+VY LLC+SAT FIP KIL+++G  LPCPC+GFYG QN NLC  RLLF+WPKRKI  VL  VK RFPFDL+LVDDQ+G
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKLQ----TKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERT
         +   L+    T GVP  Q            KG R+M+QRPR      RRA+VE G+L          RKE E +AL ++ +FI D +NES H+ L  RT
Subjt:  NSNRKLQ----TKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERT

Query:  WQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQ
        WQG ESSGSVGEN+Y++KGSSTIG  T+N +ER I  NE  SI LLE+ALEEEKAARASL++ELEEERAAAATAADEAIAMI RLQNEKASVEMEARQYQ
Subjt:  WQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQ

Query:  RVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNS
        RVIEEKFAYDEEEMN+LREILV+REIDYHVLEKEIEAYR+MDFSE+E+  RNWDFILD+HKEQS+T  YSNGDPP+V QIENA+SLS KAK NE+ S NS
Subjt:  RVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNS

Query:  QCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKED
        QCHFNE GLL+QT WT++DNEL DNSL+ E    + A                        H +  LGSS LDMEIDVQDIHVIDEKLH+ D  E+
Subjt:  QCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKED

XP_038877576.1 uncharacterized protein LOC120069830 isoform X1 [Benincasa hispida]2.0e-18167.77Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        MS HEIH WTLSGL+ AFLDL +VY LLC+SAT FIP KILK++G  LPCPCSGFYG  NANLCF RL+ NWPKRKI  VL  VK RFPFDL+L+D+Q+ 
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKL-----QTKGVPQSQ-------------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHD
        NSNR L        G+ Q Q             QNL DKD +YD KGKR+MYQRP+TKIRR RRAA++ G+L++GIC+ NE  KERE VALVER +FI D
Subjt:  NSNRKL-----QTKGVPQSQ-------------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHD

Query:  VINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQ
         +NESNH+ L +RTWQG ESSGS GENN+++K SST+GQ TSNAEER IIRNEASSI LLE+ALEEE+AARASL++ELEEERAAAATAADEAIAMI RLQ
Subjt:  VINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQ

Query:  NEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISL
        NEKASVEMEARQY RVIEEKFAYDEE++N+LREILVKR+IDYHVLEKEIEAYR+MDFSE+EQ   NWDF+LD+H EQS T HYSNGDPPIV QI NAIS 
Subjt:  NEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISL

Query:  SRKAKLNETKSYNSQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDE
        SRKAKLNET S  SQCHFNE G L+QT W +++NELKDNSL+C+H   +AA                        H V  L SS LDMEIDVQDIHVIDE
Subjt:  SRKAKLNETKSYNSQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDE

Query:  KLHLGDKKEDIKVDN
        KLH+ D K++ KVDN
Subjt:  KLHLGDKKEDIKVDN

XP_038877577.1 uncharacterized protein LOC120069830 isoform X2 [Benincasa hispida]8.0e-17566.41Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        MS HEIH WTLSGL+ AFLDL +VY LLC+SAT FIP KILK++G  LPCPCSGFYG  NANLCF RL+ NWPKRKI  VL  VK RFPFDL+L+D+Q+ 
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKL-----QTKGVPQSQ-------------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHD
        NSNR L        G+ Q Q             QNL DKD +YD KGKR+MYQRP+TKIRR RRAA++ G+L++GIC+ NE  KERE VALVER +FI  
Subjt:  NSNRKL-----QTKGVPQSQ-------------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHD

Query:  VINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQ
                   +RTWQG ESSGS GENN+++K SST+GQ TSNAEER IIRNEASSI LLE+ALEEE+AARASL++ELEEERAAAATAADEAIAMI RLQ
Subjt:  VINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQ

Query:  NEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISL
        NEKASVEMEARQY RVIEEKFAYDEE++N+LREILVKR+IDYHVLEKEIEAYR+MDFSE+EQ   NWDF+LD+H EQS T HYSNGDPPIV QI NAIS 
Subjt:  NEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISL

Query:  SRKAKLNETKSYNSQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDE
        SRKAKLNET S  SQCHFNE G L+QT W +++NELKDNSL+C+H   +AA                        H V  L SS LDMEIDVQDIHVIDE
Subjt:  SRKAKLNETKSYNSQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDE

Query:  KLHLGDKKEDIKVDN
        KLH+ D K++ KVDN
Subjt:  KLHLGDKKEDIKVDN

TrEMBL top hitse value%identityAlignment
A0A0A0K8C0 GTD-binding domain-containing protein2.9e-15461.57Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQ-I
        MS HEIH WTLSG++ AFLDLA+VY LLC+SAT FIP KILK++G  LPCPC+GFYG  N NLCF +L+ NWPKRKI  VL  VK  FPFDL+L+DD+ +
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQ-I

Query:  GNSNRK-LQTKGVPQSQ-----------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINES
        GNSNR  L+  G+ + Q           QNL D DG++D KGK++MYQ+PRTKIRR RRA ++ G+L++G+C+ NE RK RE VALVER +FI D  NES
Subjt:  GNSNRK-LQTKGVPQSQ-----------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINES

Query:  NHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKAS
        NH+ L +R WQG ESSGS+GEN+Y++KGSST+GQ TS AEER IIRNEAS+I LLE ALEEE+ ARASL++ELEEERAAAATAADEAIAMI RLQNEKAS
Subjt:  NHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKAS

Query:  VEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAK
         EMEARQY R +EEKF+YDEE+MN+LREILVKR+IDYHVLEKEIEAYR+MDF+EEE    N DFILD+H E SAT HYSNGDPPIV  I NA+SLSR+AK
Subjt:  VEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAK

Query:  LNETKSYNSQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA-----------------------HVVRSLGSSELDMEIDVQDIHVIDEKLHL-G
        L                           NEL+DNSL+ +H    AA                       H V  LG S LDMEIDVQDIHVIDEKLH+ G
Subjt:  LNETKSYNSQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA-----------------------HVVRSLGSSELDMEIDVQDIHVIDEKLHL-G

Query:  DKKEDIKVDN
         K E  K DN
Subjt:  DKKEDIKVDN

A0A1S3C7F2 uncharacterized protein LOC103497547 isoform X11.4e-15363.64Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVD-DQI
        MS HEIH WTLSGL+ AFLDLA+VY LLC+SAT FIP KILK++G  LPCPC+GFYG  N NLC  +L+ NWPKRKI  VL  VK RFPFDL  +D +QI
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVD-DQI

Query:  GNSNRK-LQTKGVPQSQ-----------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINES
        GNSNR  L+  G+ + Q           QN+ DK G+YD KGK++MYQ+PRTKIRR RRAAV+ G+L++GIC+ NE RKERE VALVER +FI D  NES
Subjt:  GNSNRK-LQTKGVPQSQ-----------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINES

Query:  NHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKAS
        NH  L +R WQG ESSGS+GENNY++KGSST+GQ TSNAEER IIRNEAS+I LLE ALEEE+AARASL++ELEEERAAAATAADEAIAMI RLQNEKAS
Subjt:  NHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKAS

Query:  VEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAK
         EMEARQY R +EEKF+YDEE+MN+LREILVKR+IDYHVLEKEIEAYR+MDF+E+EQ   N D+ILD+HKE S T H SNGDPPI   I NA+SL R  K
Subjt:  VEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAK

Query:  LNETKSYN---------SQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN
        LNE +  +           C   E   L +    N D           H T    H V  LG S +DMEIDVQDIHVIDEKLH+ D K + KVDN
Subjt:  LNETKSYN---------SQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN

A0A5D3E523 Putative myosin-binding protein 5 isoform X26.0e-15263.64Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDD-QI
        MS HEIH WTLSGL+ AFLDLA+VY LLC+SAT FIP KILK++G  LPCPC+GFYG  N NLC  +L+ NWPKRKI  VL  VK RFPFDL  +DD QI
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDD-QI

Query:  GNSNRK-LQTKGVPQSQ-----------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINES
        GNSNR  L+  G+ + Q           QN+ DK G+YD K K++MYQ+PRTKIRR RRAAV+ G+L++GIC+ NE RKERE VALVER +FI D  NES
Subjt:  GNSNRK-LQTKGVPQSQ-----------QNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINES

Query:  NHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKAS
        NH  L +R WQG ESSGS+GENNY++KGSST+GQ  SNAEER IIRNEAS+I LLE ALEEE+AARASL++ELEEERAAAATAADEAIAMI RLQNEKAS
Subjt:  NHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKAS

Query:  VEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAK
         EMEARQY R +EEKF+YDEE+MN+LREILVKR+IDYHVLEKEIEAYR+MDF+E+EQ   N D+ILD+HKE SAT H SNGDPPI   I NA+SL R  K
Subjt:  VEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAK

Query:  LNETKSYN---------SQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN
        LNE +  +           C   E   L +    N D           H T    H V  LG S +DMEIDVQDIHVIDEKLH+ D K + KVDN
Subjt:  LNETKSYN---------SQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN

A0A6J1D8Y8 uncharacterized protein LOC1110183435.1e-26099.37Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKLQTKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGS
        NSNRKLQTKGVPQSQQNLAD+DGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGS
Subjt:  NSNRKLQTKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGS

Query:  ESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIE
        ESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIE
Subjt:  ESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIE

Query:  EKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNSQCHF
        EKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNSQCHF
Subjt:  EKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNSQCHF

Query:  NEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN
        NEGGLLEQTTWTNEDNELKDNSLVCEHF +KAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN
Subjt:  NEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN

A0A6J1GQ95 probable myosin-binding protein 69.0e-15663.91Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        M+ HEIH WT  GL+ AFLDLA+VY LLC+SAT FIP KIL+++G  LPCPC+GFYG QN NLC  RLLF+WPKRKI  VL  VK RFPFDL+LVDDQ+G
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKLQ----TKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERT
         +   L+    T GVP  Q            KG R+M+QRPR      RRA+VE G+L          RKE E +AL ++ +FI D +NES H+ L  RT
Subjt:  NSNRKLQ----TKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERT

Query:  WQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQ
        WQG ESSGSVGEN+Y++KGSSTIG  T+N +ER I  NE  SI LLE+ALEEEKAARASL++ELEEERAAAATAADEAIAMI RLQNEKASVEMEARQYQ
Subjt:  WQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQ

Query:  RVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNS
        RVIEEKFAYDEEEMN+LREILV+REIDYHVLEKEIEAYR+MDFSE+E+  RNWDFILD+HKEQS+T  YSNGDPP+V QIENA+SLS KAK NE+ S NS
Subjt:  RVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNS

Query:  QCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKED
        QCHFNE GLL+QT WT++DNEL DNSL+ E    + A                        H +  LGSS LDMEIDVQDIHVIDEKLH+ D  E+
Subjt:  QCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAA------------------------HVVRSLGSSELDMEIDVQDIHVIDEKLHLGDKKED

SwissProt top hitse value%identityAlignment
F4HVS6 Probable myosin-binding protein 61.6e-1335.56Show/hide
Query:  SSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYRE
        S +  L++ +  +K +   LYMEL+EER+A+A AA+EA+AMI RLQ EKA+V+MEA QYQR+++E+  YD+E +  +   L KRE +   LE E E YRE
Subjt:  SSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYRE

Query:  MDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIEN-AISLSRKAKLNETKSYNSQCHFNEGGLLEQTTWTNED
              +Q     +F    HK+    + Y   D    + + + A+S S + +  E    N Q   +E    E     +E+
Subjt:  MDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIEN-AISLSRKAKLNETKSYNSQCHFNEGGLLEQTTWTNED

Q0WNW4 Myosin-binding protein 31.9e-1443.41Show/hide
Query:  SIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREM
        +I  L E +  E+ A   LY ELEEER+A+A +A++ +AMI RLQ EKA V+MEA QYQR++EE+  YD+E + LL  ++VKRE +   L++E+E YR  
Subjt:  SIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREM

Query:  DFSEEEQSIRNWDFIL-------DDHKEQ
           E E   +N   ++       DD KE+
Subjt:  DFSEEEQSIRNWDFIL-------DDHKEQ

Q9CAC4 Myosin-binding protein 21.5e-1441.67Show/hide
Query:  SIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREM
        ++  L+  L+EE+ A  +LY ELE ER A+A AA E +AMI RL  EKA+++MEA QYQR++EE+  +D+E + LL E++V RE +   LEKE+E YR+ 
Subjt:  SIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREM

Query:  DFSEEEQSIRNWDFILDDHKEQSATTHYSNGD
           EE ++      +    ++ S  ++ +NGD
Subjt:  DFSEEEQSIRNWDFILDDHKEQSATTHYSNGD

Q9FG14 Myosin-binding protein 76.9e-1235.52Show/hide
Query:  TSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREI
        +SNA+ +  I NE   + LL E +  ++ +   LY EL+EER AA+TAA EA++MI+RLQ +KA ++ME RQ++R  EEK  +D++E+  L +++ KRE 
Subjt:  TSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREI

Query:  DYHVLEKEIEAY--REMDF--------SEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQI-ENAISLSRKAKLNETKSY
            L  E +AY  R M F        +E+    RN   I +D++    T+ Y    PPI   + EN   L     +++ + Y
Subjt:  DYHVLEKEIEAY--REMDF--------SEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQI-ENAISLSRKAKLNETKSY

Q9LMC8 Probable myosin-binding protein 51.8e-1248.94Show/hide
Query:  LEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYR
        L   +  ++ +   LYMEL+EER+A+A AA+ A+AMI RLQ EKA+V+MEA QYQR+++E+  YD+E +  +  +LVKRE +   LE  IE YR
Subjt:  LEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYR

Arabidopsis top hitse value%identityAlignment
AT1G04890.1 Protein of unknown function, DUF5933.3e-2541.89Show/hide
Query:  KGKRVMYQRPRTKIRRGRRAA-VECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERT----WQGSESSGSVGENNYIDKGSSTIGQD
        KGKR + +R R  ++  RR+  V C  L R I  S+EA  E     L++  +      ++ +     E T    W   E + SV  N  ++  SS     
Subjt:  KGKRVMYQRPRTKIRRGRRAA-VECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERT----WQGSESSGSVGENNYIDKGSSTIGQD

Query:  TSNAEERGIIRN-EASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKRE
             E+  +RN E  S+  LEE L+EE+AARA++ +EL++ER+AAA+AADEA+AMI RLQ+EKA++EMEARQ+QR++EE+  +D EEM +L++IL++RE
Subjt:  TSNAEERGIIRN-EASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKRE

Query:  IDYHVLEKEIEAYREMDFSEEE
         + H LEKE+EAYR++    EE
Subjt:  IDYHVLEKEIEAYREMDFSEEE

AT1G70750.1 Protein of unknown function, DUF5931.1e-1541.67Show/hide
Query:  SIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREM
        ++  L+  L+EE+ A  +LY ELE ER A+A AA E +AMI RL  EKA+++MEA QYQR++EE+  +D+E + LL E++V RE +   LEKE+E YR+ 
Subjt:  SIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREM

Query:  DFSEEEQSIRNWDFILDDHKEQSATTHYSNGD
           EE ++      +    ++ S  ++ +NGD
Subjt:  DFSEEEQSIRNWDFILDDHKEQSATTHYSNGD

AT4G13160.1 Protein of unknown function, DUF5933.6e-3233.43Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        M   E +  T  G+L AF++LA  Y LLC+SA  FI  K+L    LF+PC      G QN++LC ++LLF+WP R I  V                    
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKLQTKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGS
           +KL T   P                   V++ + +                        E  +E+E   +V++D+                      
Subjt:  NSNRKLQTKGVPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGS

Query:  ESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIE
                                N+E    +R       LLE A+E+EK A+A+L +ELE+ERAA+A+AADEA+AMI+RLQ +KAS+EME +QY+R+I+
Subjt:  ESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIE

Query:  EKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQ
        EKFAYDEEEMN+L+EIL KRE + H LEKE+E Y+ +D  +E +
Subjt:  EKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQ

AT4G13630.1 Protein of unknown function, DUF5931.1e-4137.16Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        M   E+  WT  GL+ AF+DL++ + LLC S   ++  K L + GL LPCPC G Y       CF+  L N P +KISSV   VK R PFD +L +   G
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKLQTKGVPQSQ------------QNLAD----------KDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDE
           RK + + V                +N A           K G +  K KR+ + R     +   ++ +           S E   +     LV    
Subjt:  NSNRKLQTKGVPQSQ------------QNLAD----------KDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDE

Query:  FIHDVINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMI
              N+S    L + + + S S  SVG            G         G+++    ++ + E+ L EE+AARASL +ELE+ER AAA+AADEA+ MI
Subjt:  FIHDVINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMI

Query:  IRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQ
        +RLQ EKAS+EMEARQYQR+IEEK A+D EEM++L+EIL++RE + H LEKE++ YR+M F E EQ
Subjt:  IRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQ

AT4G13630.2 Protein of unknown function, DUF5931.1e-4137.16Show/hide
Query:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG
        M   E+  WT  GL+ AF+DL++ + LLC S   ++  K L + GL LPCPC G Y       CF+  L N P +KISSV   VK R PFD +L +   G
Subjt:  MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIG

Query:  NSNRKLQTKGVPQSQ------------QNLAD----------KDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDE
           RK + + V                +N A           K G +  K KR+ + R     +   ++ +           S E   +     LV    
Subjt:  NSNRKLQTKGVPQSQ------------QNLAD----------KDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDE

Query:  FIHDVINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMI
              N+S    L + + + S S  SVG            G         G+++    ++ + E+ L EE+AARASL +ELE+ER AAA+AADEA+ MI
Subjt:  FIHDVINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIGQDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMI

Query:  IRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQ
        +RLQ EKAS+EMEARQYQR+IEEK A+D EEM++L+EIL++RE + H LEKE++ YR+M F E EQ
Subjt:  IRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKEIEAYREMDFSEEEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTCCACGAGATCCATCCATGGACCCTCTCCGGATTACTCGGAGCCTTTCTCGATCTTGCTCTCGTTTATGTTCTCCTCTGCATATCCGCCACTGCCTTCATCCC
CTGCAAGATTCTCAAGATGATTGGATTGTTCCTGCCTTGCCCCTGCAGTGGGTTCTACGGCCGTCAGAACGCTAATCTATGCTTCCGCAGACTGCTCTTCAATTGGCCCA
AGAGGAAGATCTCTTCCGTCCTCCACTTCGTCAAGACCAGGTTTCCTTTCGATTTGCTTTTGGTCGACGACCAAATTGGTAACTCCAATAGGAAGCTTCAGACCAAGGGG
GTTCCTCAATCGCAGCAGAATCTCGCTGATAAAGATGGCGACTATGATAGAAAGGGCAAGAGGGTTATGTACCAGAGGCCCAGGACTAAAATCCGGCGTGGGAGGAGAGC
TGCTGTTGAATGTGGGAGATTGGCCAGGGGAATCTGTGACAGCAATGAAGCTAGGAAGGAAAGGGAATCCGTGGCGTTGGTTGAGAGAGATGAATTTATTCACGATGTTA
TTAATGAATCAAATCACATTCACTTGAGCGAAAGAACCTGGCAAGGTTCCGAATCAAGTGGTTCGGTTGGCGAAAATAATTATATAGATAAAGGTTCTTCAACTATAGGA
CAAGATACCAGCAATGCCGAAGAGAGAGGCATTATCAGAAATGAAGCAAGCTCTATTGGATTGTTGGAGGAAGCGCTTGAAGAAGAGAAAGCTGCTCGGGCATCTCTGTA
TATGGAACTGGAGGAGGAGAGAGCAGCTGCTGCTACTGCTGCTGACGAAGCAATAGCCATGATAATACGTTTACAAAATGAGAAGGCATCAGTTGAAATGGAAGCAAGAC
AATATCAGAGGGTAATAGAAGAAAAATTTGCTTACGATGAAGAAGAGATGAATCTACTTAGAGAGATCCTTGTCAAGAGGGAAATAGATTATCACGTACTGGAGAAGGAA
ATTGAAGCATATAGGGAGATGGATTTTTCAGAAGAGGAACAATCAATAAGAAATTGGGATTTTATATTGGATGACCATAAAGAACAGTCTGCCACTACCCATTACTCAAA
TGGAGATCCACCCATTGTTCAGCAAATTGAGAATGCTATTTCTCTTTCAAGGAAAGCAAAGTTGAATGAAACTAAAAGTTATAACTCCCAATGCCATTTTAATGAAGGCG
GCTTGCTTGAGCAAACTACCTGGACTAATGAAGACAATGAACTGAAGGATAATAGCTTAGTATGTGAGCATTTCACTTCTAAGGCAGCTCATGTGGTTCGCAGTTTAGGA
AGTTCCGAACTTGATATGGAAATAGATGTTCAAGATATTCATGTGATTGATGAAAAACTCCACCTAGGAGACAAGAAGGAGGATATAAAAGTTGATAAT
mRNA sequenceShow/hide mRNA sequence
ATGTCACTCCACGAGATCCATCCATGGACCCTCTCCGGATTACTCGGAGCCTTTCTCGATCTTGCTCTCGTTTATGTTCTCCTCTGCATATCCGCCACTGCCTTCATCCC
CTGCAAGATTCTCAAGATGATTGGATTGTTCCTGCCTTGCCCCTGCAGTGGGTTCTACGGCCGTCAGAACGCTAATCTATGCTTCCGCAGACTGCTCTTCAATTGGCCCA
AGAGGAAGATCTCTTCCGTCCTCCACTTCGTCAAGACCAGGTTTCCTTTCGATTTGCTTTTGGTCGACGACCAAATTGGTAACTCCAATAGGAAGCTTCAGACCAAGGGG
GTTCCTCAATCGCAGCAGAATCTCGCTGATAAAGATGGCGACTATGATAGAAAGGGCAAGAGGGTTATGTACCAGAGGCCCAGGACTAAAATCCGGCGTGGGAGGAGAGC
TGCTGTTGAATGTGGGAGATTGGCCAGGGGAATCTGTGACAGCAATGAAGCTAGGAAGGAAAGGGAATCCGTGGCGTTGGTTGAGAGAGATGAATTTATTCACGATGTTA
TTAATGAATCAAATCACATTCACTTGAGCGAAAGAACCTGGCAAGGTTCCGAATCAAGTGGTTCGGTTGGCGAAAATAATTATATAGATAAAGGTTCTTCAACTATAGGA
CAAGATACCAGCAATGCCGAAGAGAGAGGCATTATCAGAAATGAAGCAAGCTCTATTGGATTGTTGGAGGAAGCGCTTGAAGAAGAGAAAGCTGCTCGGGCATCTCTGTA
TATGGAACTGGAGGAGGAGAGAGCAGCTGCTGCTACTGCTGCTGACGAAGCAATAGCCATGATAATACGTTTACAAAATGAGAAGGCATCAGTTGAAATGGAAGCAAGAC
AATATCAGAGGGTAATAGAAGAAAAATTTGCTTACGATGAAGAAGAGATGAATCTACTTAGAGAGATCCTTGTCAAGAGGGAAATAGATTATCACGTACTGGAGAAGGAA
ATTGAAGCATATAGGGAGATGGATTTTTCAGAAGAGGAACAATCAATAAGAAATTGGGATTTTATATTGGATGACCATAAAGAACAGTCTGCCACTACCCATTACTCAAA
TGGAGATCCACCCATTGTTCAGCAAATTGAGAATGCTATTTCTCTTTCAAGGAAAGCAAAGTTGAATGAAACTAAAAGTTATAACTCCCAATGCCATTTTAATGAAGGCG
GCTTGCTTGAGCAAACTACCTGGACTAATGAAGACAATGAACTGAAGGATAATAGCTTAGTATGTGAGCATTTCACTTCTAAGGCAGCTCATGTGGTTCGCAGTTTAGGA
AGTTCCGAACTTGATATGGAAATAGATGTTCAAGATATTCATGTGATTGATGAAAAACTCCACCTAGGAGACAAGAAGGAGGATATAAAAGTTGATAAT
Protein sequenceShow/hide protein sequence
MSLHEIHPWTLSGLLGAFLDLALVYVLLCISATAFIPCKILKMIGLFLPCPCSGFYGRQNANLCFRRLLFNWPKRKISSVLHFVKTRFPFDLLLVDDQIGNSNRKLQTKG
VPQSQQNLADKDGDYDRKGKRVMYQRPRTKIRRGRRAAVECGRLARGICDSNEARKERESVALVERDEFIHDVINESNHIHLSERTWQGSESSGSVGENNYIDKGSSTIG
QDTSNAEERGIIRNEASSIGLLEEALEEEKAARASLYMELEEERAAAATAADEAIAMIIRLQNEKASVEMEARQYQRVIEEKFAYDEEEMNLLREILVKREIDYHVLEKE
IEAYREMDFSEEEQSIRNWDFILDDHKEQSATTHYSNGDPPIVQQIENAISLSRKAKLNETKSYNSQCHFNEGGLLEQTTWTNEDNELKDNSLVCEHFTSKAAHVVRSLG
SSELDMEIDVQDIHVIDEKLHLGDKKEDIKVDN