CuGenDBv2

Lag0020975 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0020975
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: GTD-binding domain-containing protein
Genome location: chr7:3630444..3633705
RNA-Seq Expression: Lag0020975
Synteny: Lag0020975
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0044267 - cellular protein metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0080115 - myosin XI tail binding (molecular function)
InterPro domains: IPR007656 - GTD-binding domain
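
The genome location field above (chr7:3630444..3633705) can be split into its parts with a few lines of Python. This is only a minimal sketch: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr7:3630444..3633705")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span_bp = end - start + 1
print(chrom, start, end, span_bp)  # chr7 3630444 3633705 3262
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, one base pair less.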


Homology
GenBank top hits (e value, % identity, alignment)
XP_022147055.1 probable myosin-binding protein 5 [Momordica charantia]; e value: 9.5e-300; % identity: 73.88
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MAC+AIQLWTFNGLVAAFLDLG+AY LLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKIS+VLHS REKFPL+SMWD EPK 
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKE---------------------------------SHCKGGGVGRRKVISVSPND
        CFKSM +  RN ++A+VELEGEASGSSF +T   +G IYGDF  V +                                   CK GGVG RKVISV   D
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKE---------------------------------SHCKGGGVGRRKVISVSPND

Query:  ILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMYEED----GNNCRGELCLDGNESDTIKLLERAL
        IL+ D+ED+ QSPS+FS FGD++T D FFSVDS D  EAS DNSD+ K+FP LELD+S DEKIC EMYE      GNN   EL LDGNESDTIKLLERAL
Subjt:  ILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMYEED----GNNCRGELCLDGNESDTIKLLERAL

Query:  EEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFAD-----
        EEEQ ARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKT YDAEEMSILKEILVRREREMHFLEKE+EA++KS   D     
Subjt:  EEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFAD-----

Query:  --DGLDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDS
          D LD EVTPQR P F YSTEDPSHML+CI+RSI +KQD NY   TKHS ++E PS+ESR LT+EFGEES F Q D+LAD AKAGGM L QV D+ +D 
Subjt:  --DGLDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDS

Query:  EEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIE
        EEIDNELQEK MVEDEN+YILQGE NE EPYLQS +S+GL ++EKSTELIAD+CEKVD VSYD LA SKTI P  +YNLEKN D Q Q T  LNSVT  +
Subjt:  EEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIE

Query:  PHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEW
         HPH+IHV+ D      EASA+ASK+ V NGTSS P K D PSFSLL+S+LDITRSSSDATGRFPPMA SRSN L+SELRRNSMSAVDYERSKIGNEVEW
Subjt:  PHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEW

Query:  LRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        LRERLKIVQEGREKLKF+VEH+E+ENNQLQLLE+ITNHL EIRQLTDPGK+TLQAPLPPSSKA SKK+CWRSSSLSIHRSS
Subjt:  LRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

XP_022942015.1 probable myosin-binding protein 5 [Cucurbita moschata]; e value: 1.4e-306; % identity: 77.45
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MACEAIQ WTFNGLVAAFLDLG+AY +LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCF K+LVDRSSKK+SSV+HSAREK PLNSM DQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS
        CFKS  MHERN  +AHVE EGE SG SFF+T S Q MIYGDF SVKES C  G V  +KV SVSPNDI Q D+EDL  SPS+FSGFGDNNT DGFFSVDS
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS

Query:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ
        GDER E+SSD+SDR K+FPD   ++ + EK  A  +EE GNNCRGELCLDGNESDTIKLLE+ALEEEQ ARATLYLELEKERSAAATA DEAMAMILRLQ
Subjt:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ

Query:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK
        EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKS F DDG     LD E+TPQ  P F YS++ PSHML+CI+RSIRDK
Subjt:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK

Query:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD
        QDANY   TK SS+YEIPS+ESRKLT+EF +ESPFI +D+ ADAA+AGGM LHQ  DNF   EE DNELQE+ MVEDEN YILQGEVNE EPYLQS  S+
Subjt:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD

Query:  GLHLIEKSTEL--IADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP
        GL  +EKSTEL  IAD+ EKVD+VSYDGLA +KTI P VEYNLEKN D QKQ  +DL+ +T+++P  H+IHV+ +E S  ++ SADASKE V+NGTSS P
Subjt:  GLHLIEKSTEL--IADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP

Query:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT
        A   SPSFSLLQS+LDITRS+SDA+GRFPP ARSRSN L+S+LRRNSMSAVDYERSKIG+EVE LRERLKIVQE REKL+FS+EHK++ENNQLQLLE+IT
Subjt:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT

Query:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        NHLREIR LTDPGK+ LQAP PPSSK  SKKRCWRSSSLSIHRSS
Subjt:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

XP_022974988.1 uncharacterized protein LOC111473854 [Cucurbita maxima]; e value: 2.4e-303; % identity: 77.05
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MACEAIQ WTFNGLV AFLDLG+AY LLCASSLVFFTSKFLALFGL LPCPCDGLFGNLSSDHCFQK+L DRSSKK+SSV+HSA EK PLNSMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS
        C K+  MHERN  +AHVE EGE SG SFF+T S Q  IYGDF SVKESHCK G V  +KV SVSPNDI Q D+EDL  SPS+FSGFGDNNT DGFFSVDS
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS

Query:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ
        GDER E+SSDNSD+ K+FPD   ++ + EK  A  +EE GNNCRGELCLD NESDTIKLLE+ALEEEQ ARATLYLELEKERSAAATA DEAMAMILRLQ
Subjt:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ

Query:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK
        EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRRERE+HFLE EVEAFQKS F DDG     LD E+TPQ  P F YS++ PSHMLRCI+RSIRDK
Subjt:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK

Query:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD
        QDANY   TK SS+YEIPS+ESRK T+EF +ESPFI +D+ ADAA+ G M LHQ  DNF   EE DNELQEK MVEDEN YILQ EVNE EPYLQS  S+
Subjt:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD

Query:  GLHLIEKST--ELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP
        GL  +EKST  E IAD+ EKV +VSYDGLA +KTI P VEYNLEKN D QKQ T+DL+S+T+I+P  H+IH++ +E S  NE SADASKE V+NGTSS P
Subjt:  GLHLIEKST--ELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP

Query:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT
        A   SPSFSLLQS+LDITRS+SDA+GRFPP ARSRSN L+SELRRNSMSAVDYERSKIG+EVE LRERLKIVQE REKL+FSVEHK++ENNQLQLLE+IT
Subjt:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT

Query:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        NHLREIRQLTDPGK+ LQAP PPSSK  +KKRCWRSSSLSIHRSS
Subjt:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

XP_023539873.1 uncharacterized protein LOC111800419 [Cucurbita pepo subsp. pepo]; e value: 4.7e-307; % identity: 77.72
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MACEAIQ WTFNGLVAAFLDLG+AY +LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQK+LVDRSSKK+SSV+HSAREK PLNSMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS
        CFKS  MHERN   AHVE EGE S  SF +T S Q MIYGDF SVKES C  G V  +KV+SVSPNDI Q D+EDL  SPS+FSGFGDNNT DGFFSVDS
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS

Query:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ
        GDER E+SSDNSD+ K+FPD   ++ + EK  A  +EE G+ CRGELCLDGNESDTIKLLE+ALEEEQ ARATLYLELEKERSAAATA DEAMAMILRLQ
Subjt:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ

Query:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK
        EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKS F DDG     LD E+TPQ  P F YS++ PSHML+CI+RSIRDK
Subjt:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK

Query:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD
        QDANY   TK SS+YEIPS+E RKLT+EF +ESPFI +D+ ADAA+AGGM LHQ  DNF   EE DNELQEK MVEDEN YILQGEVNE EPYLQS  S+
Subjt:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD

Query:  GLHLIEKSTEL--IADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP
        GL  +EKSTEL  IAD+ EKVD+VSYDGLA +KTI P VEYNLEKN D QKQ  +DL+S+T+I+P  H+IHV+ +E S  ++ SADASKE V+NGTSS P
Subjt:  GLHLIEKSTEL--IADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP

Query:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT
        A   SPSFSLLQS+LDITRS+SDA+GRFPP ARSRSN L+SELRRNSMSAVDYERSKIG+EVE LRERLKIVQE REKL+FS+EHK++ENNQLQLLE+IT
Subjt:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT

Query:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        NHLREIRQLTDPGK+ LQAP PPSSK  SKKRCWRSSSLSIHRSS
Subjt:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

XP_038893600.1 uncharacterized protein LOC120082482 isoform X1 [Benincasa hispida]; e value: 5.7e-310; % identity: 78.07
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MACEAI+LWTFNGLVAAFLDLG+A+ LLCAS+LVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSS+KISSVL S R+KFPL+S+WD EPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDV--EDLCQSPSTFSGFGDNNTVDGFFSV
        CFKS+ +HERN K+AHVELEGEASG S F++ S QGM+YGDFPSV +   +GGG+  RKVIS S N+I QSDV  EDL  SPS+FSGFGDNNT DGFFSV
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDV--EDLCQSPSTFSGFGDNNTVDGFFSV

Query:  DSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMY----EEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAM
        DSGDEREAS DNSD+YK+FP+LELD+SYD KICAEMY    EE GN CRGELCLDGNESDTIKLLE+ALEEEQTARATLYLELEKERSAAATAADEAMAM
Subjt:  DSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMY----EEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAPFL-YSTEDPSHMLRCINR
        ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAF+KS F DDG     LDSE TP R P   Y TEDPSHML+CINR
Subjt:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAPFL-YSTEDPSHMLRCINR

Query:  SIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQ
        S RD++ ANY     HS +Y IPS+ESR LT+EFGEES  I AD++A AAKA GM LHQV DNF+ SEEID ELQ K M+EDE  YI+ GEVNE +PYLQ
Subjt:  SIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQ

Query:  SKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTS
        S ES+GL  +EK TE+IAD+ EKVD+VSYD LAL+KTI P +EYNLEKN D QK  TRD++SV   +PHPH+IHV+ DE  + NEA A+AS+E  VNGTS
Subjt:  SKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTS

Query:  SIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLE
        SIP K DSPSF LLQS+L+I R+SSDAT RFPPMARSRSN L+SELRRNSMSAVDYERSKIGNEVEWLR RLKIVQEGREKLKFSVEHKE+ENNQLQLLE
Subjt:  SIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLE

Query:  NITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        NITNHLREI QL DPGK  LQAPLPPSSK  SKKRCWRSSSLS+HRSS
Subjt:  NITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CZ28 probable myosin-binding protein 5; e value: 4.6e-300; % identity: 73.88
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MAC+AIQLWTFNGLVAAFLDLG+AY LLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKIS+VLHS REKFPL+SMWD EPK 
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKE---------------------------------SHCKGGGVGRRKVISVSPND
        CFKSM +  RN ++A+VELEGEASGSSF +T   +G IYGDF  V +                                   CK GGVG RKVISV   D
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKE---------------------------------SHCKGGGVGRRKVISVSPND

Query:  ILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMYEED----GNNCRGELCLDGNESDTIKLLERAL
        IL+ D+ED+ QSPS+FS FGD++T D FFSVDS D  EAS DNSD+ K+FP LELD+S DEKIC EMYE      GNN   EL LDGNESDTIKLLERAL
Subjt:  ILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMYEED----GNNCRGELCLDGNESDTIKLLERAL

Query:  EEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFAD-----
        EEEQ ARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKT YDAEEMSILKEILVRREREMHFLEKE+EA++KS   D     
Subjt:  EEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFAD-----

Query:  --DGLDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDS
          D LD EVTPQR P F YSTEDPSHML+CI+RSI +KQD NY   TKHS ++E PS+ESR LT+EFGEES F Q D+LAD AKAGGM L QV D+ +D 
Subjt:  --DGLDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDS

Query:  EEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIE
        EEIDNELQEK MVEDEN+YILQGE NE EPYLQS +S+GL ++EKSTELIAD+CEKVD VSYD LA SKTI P  +YNLEKN D Q Q T  LNSVT  +
Subjt:  EEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIE

Query:  PHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEW
         HPH+IHV+ D      EASA+ASK+ V NGTSS P K D PSFSLL+S+LDITRSSSDATGRFPPMA SRSN L+SELRRNSMSAVDYERSKIGNEVEW
Subjt:  PHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEW

Query:  LRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        LRERLKIVQEGREKLKF+VEH+E+ENNQLQLLE+ITNHL EIRQLTDPGK+TLQAPLPPSSKA SKK+CWRSSSLSIHRSS
Subjt:  LRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

A0A6J1EFQ3 uncharacterized protein LOC111432110 isoform X1; e value: 6.5e-286; % identity: 73.73
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MACEA+QLWTFNGLVAAFLDLG+A+ LLCA+SLVFFTSKFLALFG CLPCPCDGLFGNL SDHCFQKLLVD SSK+ISSVLHS REKFPL+SMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDV--EDLCQSPSTFSGFGDNNTVDGFFSV
        CFKSMS+H+RNAK+  VE +GEASG S+F+T S +GMIYGD  +V ES  K GGVG RK+ SVSPND+ QSDV  EDLC SPS+  GFGDNN  DGFFSV
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDV--EDLCQSPSTFSGFGDNNTVDGFFSV

Query:  DSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMY----EEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAM
        DSGDE EAS DNS++YK+FPDLELD+SYDEKICAEMY    EE  NNCRGELCLDGNESD IKLL ++LEEEQ ARATLYLELEKERSAAATAADEAMAM
Subjt:  DSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMY----EEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAPFL-YSTEDPSHMLRCINR
        ILRLQEEKA IEM+ARQYQRMIEEKTAYDAEEMSILKEILVRRE+E HFL+KEVEAF+KSLF +DG     LDSE TPQ AP     TEDPSH+L+CIN 
Subjt:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAPFL-YSTEDPSHMLRCINR

Query:  SIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQ
        SI DKQ             +E+PS+ESR L +EFGEESP IQA + ADAAKA G+ LHQV D FE SEEIDNELQ K MVED+N YI+ GEVNE EPY +
Subjt:  SIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQ

Query:  SKE--SDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNG
        S    S+GL  +E+ TEL AD+ EKV D S+DGLA ++   P VEYNLEK+ + QKQWTRD  SV D +  PH+IHV+ DE  M NEASA+A KE  VNG
Subjt:  SKE--SDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNG

Query:  TSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQL
         S IP   DS SFSLLQ+ LDITRSSSDATGRFPPM RSRSN L+ ELRRNSMSAVDYERSKIGNEVEWLR RLKIVQE REKLKFSVE KE+E NQLQL
Subjt:  TSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQL

Query:  LENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        LENIT      + L+DPGK+ LQAPLPPSSK  SKKRCWRSSSLSIHRSS
Subjt:  LENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

A0A6J1FVE4 probable myosin-binding protein 5; e value: 6.7e-307; % identity: 77.45
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MACEAIQ WTFNGLVAAFLDLG+AY +LCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCF K+LVDRSSKK+SSV+HSAREK PLNSM DQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS
        CFKS  MHERN  +AHVE EGE SG SFF+T S Q MIYGDF SVKES C  G V  +KV SVSPNDI Q D+EDL  SPS+FSGFGDNNT DGFFSVDS
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS

Query:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ
        GDER E+SSD+SDR K+FPD   ++ + EK  A  +EE GNNCRGELCLDGNESDTIKLLE+ALEEEQ ARATLYLELEKERSAAATA DEAMAMILRLQ
Subjt:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ

Query:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK
        EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKS F DDG     LD E+TPQ  P F YS++ PSHML+CI+RSIRDK
Subjt:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK

Query:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD
        QDANY   TK SS+YEIPS+ESRKLT+EF +ESPFI +D+ ADAA+AGGM LHQ  DNF   EE DNELQE+ MVEDEN YILQGEVNE EPYLQS  S+
Subjt:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD

Query:  GLHLIEKSTEL--IADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP
        GL  +EKSTEL  IAD+ EKVD+VSYDGLA +KTI P VEYNLEKN D QKQ  +DL+ +T+++P  H+IHV+ +E S  ++ SADASKE V+NGTSS P
Subjt:  GLHLIEKSTEL--IADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP

Query:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT
        A   SPSFSLLQS+LDITRS+SDA+GRFPP ARSRSN L+S+LRRNSMSAVDYERSKIG+EVE LRERLKIVQE REKL+FS+EHK++ENNQLQLLE+IT
Subjt:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT

Query:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        NHLREIR LTDPGK+ LQAP PPSSK  SKKRCWRSSSLSIHRSS
Subjt:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

A0A6J1IHY7 uncharacterized protein LOC111473854; e value: 1.2e-303; % identity: 77.05
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MACEAIQ WTFNGLV AFLDLG+AY LLCASSLVFFTSKFLALFGL LPCPCDGLFGNLSSDHCFQK+L DRSSKK+SSV+HSA EK PLNSMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS
        C K+  MHERN  +AHVE EGE SG SFF+T S Q  IYGDF SVKESHCK G V  +KV SVSPNDI Q D+EDL  SPS+FSGFGDNNT DGFFSVDS
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS

Query:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ
        GDER E+SSDNSD+ K+FPD   ++ + EK  A  +EE GNNCRGELCLD NESDTIKLLE+ALEEEQ ARATLYLELEKERSAAATA DEAMAMILRLQ
Subjt:  GDER-EASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQ

Query:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK
        EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRRERE+HFLE EVEAFQKS F DDG     LD E+TPQ  P F YS++ PSHMLRCI+RSIRDK
Subjt:  EEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAP-FLYSTEDPSHMLRCINRSIRDK

Query:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD
        QDANY   TK SS+YEIPS+ESRK T+EF +ESPFI +D+ ADAA+ G M LHQ  DNF   EE DNELQEK MVEDEN YILQ EVNE EPYLQS  S+
Subjt:  QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESD

Query:  GLHLIEKST--ELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP
        GL  +EKST  E IAD+ EKV +VSYDGLA +KTI P VEYNLEKN D QKQ T+DL+S+T+I+P  H+IH++ +E S  NE SADASKE V+NGTSS P
Subjt:  GLHLIEKST--ELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIP

Query:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT
        A   SPSFSLLQS+LDITRS+SDA+GRFPP ARSRSN L+SELRRNSMSAVDYERSKIG+EVE LRERLKIVQE REKL+FSVEHK++ENNQLQLLE+IT
Subjt:  AKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENIT

Query:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        NHLREIRQLTDPGK+ LQAP PPSSK  +KKRCWRSSSLSIHRSS
Subjt:  NHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

A0A6J1IVQ3 uncharacterized protein LOC111479667 isoform X1; e value: 4.2e-285; % identity: 73.26
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        MACEAIQLWTFNGLVAAFLDLG+A+ LLCA+SLVFFTSKFLALFG CLPCPCDGLFG+L SDHCFQKLLVD SSKKISSVLHS REKFPL+SMWDQEPKC
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDV--EDLCQSPSTFSGFGDNNTVDGFFSV
        CFKSMS+H+RN K+A VE + EASG S+F+T S +GMIYGD  ++ ES  K GGVG RK+ SVSPND+ QSDV  EDLC SPS+F GFGDNN  DGFFSV
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDV--EDLCQSPSTFSGFGDNNTVDGFFSV

Query:  DSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMY----EEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAM
        DSGDE EAS DNS++YK+FPDLELD+SYDEKICAEMY    EE  NNCRGE CLDGNESD IKLLE++LEEEQ ARATLYLELEKERSAAATAADEAMAM
Subjt:  DSGDEREASSDNSDRYKIFPDLELDESYDEKICAEMY----EEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAM

Query:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAPFL-YSTEDPSHMLRCINR
        ILRLQEEKASIEM+ARQYQRMIEEKTAYDAEEMSILKEILVRRE+EMHFL+KEV AF++S F + G     LD+E TP  AP   Y TEDPSHML+CIN 
Subjt:  ILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDG-----LDSEVTPQRAPFL-YSTEDPSHMLRCINR

Query:  SIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQ
        SI DKQ             +E+PS+ESR L +EFGEESP IQA + ADAAKA GM LHQV DNFE  EEID ELQ K MVED+N YI+ GEVNE EPY +
Subjt:  SIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQ

Query:  SKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTS
        S  S+ L  +E+ TE   D+ EKV   S DGL  ++T  P VEYNLEK  D QKQWTRD  SV D +  PH+IHV+ DE  M NEASA+A +E +VNG+S
Subjt:  SKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTS

Query:  SIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLE
        SIP   DS SFSLLQ++LDITRSSSDATGRFPPMARSRSN L+SELRRNSMSAVDYERSKIGNEVEWLR RLKIVQE REK K SVE KE+ENNQLQLLE
Subjt:  SIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLE

Query:  NITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS
        NIT      + L+DPGK+ LQAPLPPSSK  SKKRCWRSSSL IHRSS
Subjt:  NITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS

SwissProt top hits (e value, % identity, alignment)
F4HVS6 Probable myosin-binding protein 6; e value: 3.4e-13; % identity: 42.86
Query:  DGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEV
        D      +  L++ +  ++ +   LY+EL++ERSA+A AA+EAMAMI RLQ EKA+++M+A QYQRM++E+  YD E +  +   L +RE EM  LE E 
Subjt:  DGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEV

Query:  EAFQK
        E +++
Subjt:  EAFQK

Q0WNW4 Myosin-binding protein 3; e value: 1.9e-16; % identity: 46.09
Query:  DGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRR
        DGN    E+   G+   TI+ L   +  EQ A   LY ELE+ERSA+A +A++ MAMI RLQEEKA ++M+A QYQRM+EE+  YD E + +L  ++V+R
Subjt:  DGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRR

Query:  EREMHFLEKEVEAFQ
        E+E   L++E+E ++
Subjt:  EREMHFLEKEVEAFQ

Q9CAC4 Myosin-binding protein 2; e value: 1.6e-15; % identity: 30.53
Query:  TIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKS
        T+  L+  L+EE+ A   LY ELE ER+A+A AA E MAMI RL EEKA+++M+A QYQRM+EE+  +D E + +L E++V RE+E   LEKE+E ++K 
Subjt:  TIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKS

Query:  LFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDK----------QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMS
        +                  Y  ++   MLR   R +RD            D N     +  +   +   + R+   E       ++ D+  D      +S
Subjt:  LFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDK----------QDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMS

Query:  -LHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELI
         L ++    E   +++NE  ++   E+   +   G +N  E ++  KE++G H + KS  L+
Subjt:  -LHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELI

Q9FG14 Myosin-binding protein 7; e value: 9.8e-13; % identity: 32.89
Query:  PDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIE
        P+LELD S + K+  E                    + ++LL   +  +Q +   LY EL++ER+AA+TAA EAM+MILRLQ +KA ++M+ RQ++R  E
Subjt:  PDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIE

Query:  EKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQR
        EK  +D +E+  L++++ +RE+ +  L  E +A++  + +    ++EV  ++
Subjt:  EKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQR

Q9LMC8 Probable myosin-binding protein 5; e value: 6.8e-14; % identity: 39.86
Query:  DGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRR
        D N    E+ LDG+    ++ L R +  ++ +   LY+EL++ERSA+A AA+ AMAMI RLQ EKA+++M+A QYQRM++E+  YD E +  +  +LV+R
Subjt:  DGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRR

Query:  EREMHFLEKEVEAF--------QKSLFADDGLDSEVTP
        E EM  LE  +E +        ++   A++ LD E  P
Subjt:  EREMHFLEKEVEAF--------QKSLFADDGLDSEVTP

Arabidopsis top hits (e value, % identity, alignment)
AT1G04890.1 Protein of unknown function, DUF593; e value: 2.8e-31; % identity: 29.61
Query:  EKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEM
        EK  +   + + N+   +  +   E  +++ LE  L+EE+ ARAT+ +EL+KERSAAA+AADEAMAMI RLQ+EKA+IEM+ARQ+QR++EE++ +DAEEM
Subjt:  EKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEM

Query:  SILKEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESP
         ILK+IL+RRERE HFLEKEVEA+++ L   + L+  +  ++     +  +P H      +  +D Q+     V +         ++   L   + EE  
Subjt:  SILKEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESP

Query:  FIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIP
            D   D  K+             DSE   + +++  MV+DE   I                                                    
Subjt:  FIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIP

Query:  PSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRS
         S + NLE+++  + + + D NS+                               +V+G +                             + PP+ R R 
Subjt:  PSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRS

Query:  NFLQSE-LRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCW-
          L S   RR SMSAVDYER KI NEVE LRERLK VQE RE+L                           R+ + P       PLP   +A S+KR W 
Subjt:  NFLQSE-LRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCW-

Query:  RSSSLSIHRS
        RS S+ +H S
Subjt:  RSSSLSIHRS

AT4G13160.1 Protein of unknown function, DUF593; e value: 6.5e-28; % identity: 59.48
Query:  DTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQK
        D ++LLE A+E+E+ A+A L +ELE+ER+A+A+AADEAMAMILRLQ +KAS+EM+ +QY+RMI+EK AYD EEM+ILKEIL +RERE HFLEKE+E ++ 
Subjt:  DTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQK

Query:  SLFADDGLDSEVTPQR
            DD  ++E   +R
Subjt:  SLFADDGLDSEVTPQR

AT4G13160.1 Protein of unknown function, DUF593; e value: 1.5e-05; % identity: 48.39
Query:  TFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVD
        TF G++ AF++L  AY LLC S+ VF TSK L    L +PC      G  +SD C QKLL D
Subjt:  TFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVD
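
In each alignment block, the unlabelled middle line marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. Assuming the reported % identity is identities divided by alignment length (the usual BLAST convention), the figure for this second AT4G13160.1 hit can be checked from the query and middle lines alone. The function name `percent_identity` is hypothetical, and the two strings are copied from the block above.

```python
def percent_identity(query: str, midline: str) -> tuple[int, float]:
    """Return (identities, percent identity) for one alignment block.

    Identities are the letters on the midline; '+' and spaces are not counted.
    The alignment length is taken as the length of the query line, gaps included.
    """
    alignment_length = len(query)
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, 100.0 * identities / alignment_length


query = "TFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVD"
midline = "TF G++ AF++L  AY LLC S+ VF TSK L    L +PC      G  +SD C QKLL D"

identities, pct = percent_identity(query, midline)
print(identities, len(query), round(pct, 2))  # 30 62 48.39
```

The sketch reads only the query and middle lines because the `Subjt:` lines in this export repeat the query sequence, so they cannot be used to recount matches.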

AT4G13630.1 Protein of unknown function, DUF593 (e value: 1.8e-62, % identity: 34.05)
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        M C+ ++ WTF GLVAAF+DL VA+ LLCAS +V+ TSKFL LFGL LPCPCDGL+       CFQ+ L +   KKISSV  S + + P +S+     K 
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS
                +R  +   V+LE E S S+    G  +    G      +S  KG    + K +S   +     +    C    +F G  D N       V+S
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS

Query:  GDEREASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQE
         D  +A  D S R  +        S     C            G     G    T+++ E+ L EE+ ARA+L LELEKER+AAA+AADEA+ MILRLQE
Subjt:  GDEREASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQE

Query:  EKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDKQDANYTN
        EKASIEM+ARQYQRMIEEK+A+DAEEMSILKEIL+RRERE HFLEKEV+ +++                      TE P +                   
Subjt:  EKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDKQDANYTN

Query:  VTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEK
         T  S   +I  +++ +   E     P+   DD+     + G    ++  N  D+ E D                     +EP   L   E   L   E+
Subjt:  VTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEK

Query:  STELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFS
           L A+   +  D+     A+SK +            DP            DI+ H H+IHV+ DE +                G  ++P+   +    
Subjt:  STELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFS

Query:  LLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQ
             LD ++S SD +  FP   + +SN + + +RRNSMSA+DYER KI +EV  LR RL+ VQ+GREK+ FS   K++  +Q+Q   + T+   E R+
Subjt:  LLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQ

AT4G13630.2 Protein of unknown function, DUF593 (e value: 1.8e-62, % identity: 34.05)
Query:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC
        M C+ ++ WTF GLVAAF+DL VA+ LLCAS +V+ TSKFL LFGL LPCPCDGL+       CFQ+ L +   KKISSV  S + + P +S+     K 
Subjt:  MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKC

Query:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS
                +R  +   V+LE E S S+    G  +    G      +S  KG    + K +S   +     +    C    +F G  D N       V+S
Subjt:  CFKSMSMHERNAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDS

Query:  GDEREASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQE
         D  +A  D S R  +        S     C            G     G    T+++ E+ L EE+ ARA+L LELEKER+AAA+AADEA+ MILRLQE
Subjt:  GDEREASSDNSDRYKIFPDLELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQE

Query:  EKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDKQDANYTN
        EKASIEM+ARQYQRMIEEK+A+DAEEMSILKEIL+RRERE HFLEKEV+ +++                      TE P +                   
Subjt:  EKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDKQDANYTN

Query:  VTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEK
         T  S   +I  +++ +   E     P+   DD+     + G    ++  N  D+ E D                     +EP   L   E   L   E+
Subjt:  VTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKAGGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEK

Query:  STELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFS
           L A+   +  D+     A+SK +            DP            DI+ H H+IHV+ DE +                G  ++P+   +    
Subjt:  STELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNSVTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFS

Query:  LLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQ
             LD ++S SD +  FP   + +SN + + +RRNSMSA+DYER KI +EV  LR RL+ VQ+GREK+ FS   K++  +Q+Q   + T+   E R+
Subjt:  LLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERLKIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQ

AT5G16720.1 Protein of unknown function, DUF593 (e value: 1.3e-17, % identity: 46.09)
Query:  DGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRR
        DGN    E+   G+   TI+ L   +  EQ A   LY ELE+ERSA+A +A++ MAMI RLQEEKA ++M+A QYQRM+EE+  YD E + +L  ++V+R
Subjt:  DGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSILKEILVRR

Query:  EREMHFLEKEVEAFQ
        E+E   L++E+E ++
Subjt:  EREMHFLEKEVEAFQ


Sequences
CDS sequence
ATGGCTTGTGAAGCTATACAGCTGTGGACTTTTAATGGATTAGTGGCTGCGTTTCTTGATCTTGGTGTAGCTTATTTTTTATTATGTGCATCAAGTCTTGTTTTCTTTAC
ATCCAAATTTCTTGCACTGTTTGGATTATGTCTGCCTTGCCCTTGTGATGGGCTATTTGGGAACCTTAGTAGTGATCACTGCTTCCAGAAGTTACTCGTGGATCGTTCTT
CCAAGAAAATATCTTCAGTCCTTCATTCGGCTAGAGAAAAGTTCCCATTGAATTCCATGTGGGATCAAGAACCAAAATGTTGTTTTAAGTCAATGTCGATGCACGAGAGG
AATGCCAAGGATGCACATGTTGAATTAGAAGGTGAAGCATCGGGTAGTTCCTTTTTTAGAACCGGATCATCACAAGGTATGATTTATGGAGACTTTCCCAGTGTAAAAGA
ATCGCATTGTAAAGGCGGCGGGGTTGGTCGCAGGAAGGTCATATCCGTGTCTCCGAATGACATTTTACAGTCAGATGTGGAAGACCTTTGTCAGTCTCCTTCAACCTTCA
GTGGATTCGGGGATAATAATACTGTGGACGGCTTCTTTTCTGTTGATTCTGGAGATGAAAGGGAGGCTTCATCAGACAACAGTGATCGATATAAAATATTTCCGGATCTT
GAATTGGATGAATCTTATGATGAGAAAATATGCGCAGAGATGTATGAAGAGGATGGTAACAACTGCAGAGGAGAGTTATGCTTGGATGGTAATGAGAGTGATACAATCAA
ACTATTGGAACGAGCACTTGAAGAAGAGCAGACAGCTCGTGCTACCTTGTACCTGGAGCTTGAGAAAGAGAGAAGTGCAGCTGCCACTGCTGCTGATGAGGCCATGGCTA
TGATATTACGTCTCCAAGAGGAGAAGGCATCAATAGAAATGGATGCTAGGCAATACCAGAGGATGATAGAGGAGAAAACTGCTTATGATGCTGAGGAAATGAGTATTCTT
AAAGAAATTCTAGTGAGGAGAGAGCGGGAAATGCATTTCCTGGAGAAGGAAGTTGAAGCTTTTCAGAAAAGTTTATTTGCAGATGATGGGTTGGATTCAGAAGTTACACC
CCAAAGGGCCCCTTTCCTATATTCAACTGAAGATCCATCTCATATGCTTCGATGCATTAATAGATCCATCAGAGACAAGCAAGATGCAAATTACACAAATGTCACAAAAC
ATTCTTCAAAATATGAGATCCCATCGATGGAGTCACGAAAGTTAACTTATGAATTTGGGGAAGAGTCGCCATTCATTCAAGCAGATGATCTTGCTGATGCTGCAAAAGCT
GGGGGCATGTCGTTGCACCAAGTTACTGATAACTTCGAGGACAGTGAGGAGATTGACAATGAATTACAAGAAAAGGCCATGGTAGAAGATGAGAATGCGTACATTCTACA
AGGAGAAGTAAATGAACCGGAGCCATATCTGCAAAGCAAAGAATCTGATGGTCTGCATTTAATTGAGAAATCCACAGAATTAATTGCAGATGATTGTGAAAAAGTTGATG
ACGTTTCATATGATGGGTTGGCATTATCTAAAACAATTCCTCCCAGTGTCGAATACAATTTGGAAAAGAATGCTGACCCTCAAAAGCAGTGGACAAGAGATCTCAACTCT
GTGACTGATATAGAACCTCATCCTCATGAGATTCATGTCCTTGGTGATGAAGTTAGCATGCGCAATGAAGCAAGTGCTGATGCAAGTAAAGAACTGGTGGTCAATGGTAC
CTCGAGTATTCCAGCAAAATTTGATAGTCCATCCTTCAGCTTGTTGCAGAGTGACCTAGACATCACTAGAAGCAGCTCAGATGCCACTGGTAGATTTCCACCAATGGCTC
GGTCTCGAAGCAATTTTTTGCAGTCTGAGTTGCGGAGAAATTCGATGTCTGCTGTCGATTATGAAAGGTCAAAAATTGGCAATGAAGTTGAGTGGCTCAGAGAAAGATTG
AAGATTGTTCAGGAGGGAAGAGAAAAACTCAAGTTCTCTGTTGAACACAAAGAGAGGGAGAACAATCAGTTGCAACTTTTAGAAAATATAACAAATCACCTTCGTGAAAT
CCGACAACTGACGGATCCTGGAAAGTCAACTCTGCAAGCTCCACTGCCTCCATCCTCTAAGGCTGCGTCAAAGAAACGTTGCTGGAGAAGTTCATCCCTAAGCATTCATA
GAAGCAGCTAG
mRNA sequence
ATGGCTTGTGAAGCTATACAGCTGTGGACTTTTAATGGATTAGTGGCTGCGTTTCTTGATCTTGGTGTAGCTTATTTTTTATTATGTGCATCAAGTCTTGTTTTCTTTAC
ATCCAAATTTCTTGCACTGTTTGGATTATGTCTGCCTTGCCCTTGTGATGGGCTATTTGGGAACCTTAGTAGTGATCACTGCTTCCAGAAGTTACTCGTGGATCGTTCTT
CCAAGAAAATATCTTCAGTCCTTCATTCGGCTAGAGAAAAGTTCCCATTGAATTCCATGTGGGATCAAGAACCAAAATGTTGTTTTAAGTCAATGTCGATGCACGAGAGG
AATGCCAAGGATGCACATGTTGAATTAGAAGGTGAAGCATCGGGTAGTTCCTTTTTTAGAACCGGATCATCACAAGGTATGATTTATGGAGACTTTCCCAGTGTAAAAGA
ATCGCATTGTAAAGGCGGCGGGGTTGGTCGCAGGAAGGTCATATCCGTGTCTCCGAATGACATTTTACAGTCAGATGTGGAAGACCTTTGTCAGTCTCCTTCAACCTTCA
GTGGATTCGGGGATAATAATACTGTGGACGGCTTCTTTTCTGTTGATTCTGGAGATGAAAGGGAGGCTTCATCAGACAACAGTGATCGATATAAAATATTTCCGGATCTT
GAATTGGATGAATCTTATGATGAGAAAATATGCGCAGAGATGTATGAAGAGGATGGTAACAACTGCAGAGGAGAGTTATGCTTGGATGGTAATGAGAGTGATACAATCAA
ACTATTGGAACGAGCACTTGAAGAAGAGCAGACAGCTCGTGCTACCTTGTACCTGGAGCTTGAGAAAGAGAGAAGTGCAGCTGCCACTGCTGCTGATGAGGCCATGGCTA
TGATATTACGTCTCCAAGAGGAGAAGGCATCAATAGAAATGGATGCTAGGCAATACCAGAGGATGATAGAGGAGAAAACTGCTTATGATGCTGAGGAAATGAGTATTCTT
AAAGAAATTCTAGTGAGGAGAGAGCGGGAAATGCATTTCCTGGAGAAGGAAGTTGAAGCTTTTCAGAAAAGTTTATTTGCAGATGATGGGTTGGATTCAGAAGTTACACC
CCAAAGGGCCCCTTTCCTATATTCAACTGAAGATCCATCTCATATGCTTCGATGCATTAATAGATCCATCAGAGACAAGCAAGATGCAAATTACACAAATGTCACAAAAC
ATTCTTCAAAATATGAGATCCCATCGATGGAGTCACGAAAGTTAACTTATGAATTTGGGGAAGAGTCGCCATTCATTCAAGCAGATGATCTTGCTGATGCTGCAAAAGCT
GGGGGCATGTCGTTGCACCAAGTTACTGATAACTTCGAGGACAGTGAGGAGATTGACAATGAATTACAAGAAAAGGCCATGGTAGAAGATGAGAATGCGTACATTCTACA
AGGAGAAGTAAATGAACCGGAGCCATATCTGCAAAGCAAAGAATCTGATGGTCTGCATTTAATTGAGAAATCCACAGAATTAATTGCAGATGATTGTGAAAAAGTTGATG
ACGTTTCATATGATGGGTTGGCATTATCTAAAACAATTCCTCCCAGTGTCGAATACAATTTGGAAAAGAATGCTGACCCTCAAAAGCAGTGGACAAGAGATCTCAACTCT
GTGACTGATATAGAACCTCATCCTCATGAGATTCATGTCCTTGGTGATGAAGTTAGCATGCGCAATGAAGCAAGTGCTGATGCAAGTAAAGAACTGGTGGTCAATGGTAC
CTCGAGTATTCCAGCAAAATTTGATAGTCCATCCTTCAGCTTGTTGCAGAGTGACCTAGACATCACTAGAAGCAGCTCAGATGCCACTGGTAGATTTCCACCAATGGCTC
GGTCTCGAAGCAATTTTTTGCAGTCTGAGTTGCGGAGAAATTCGATGTCTGCTGTCGATTATGAAAGGTCAAAAATTGGCAATGAAGTTGAGTGGCTCAGAGAAAGATTG
AAGATTGTTCAGGAGGGAAGAGAAAAACTCAAGTTCTCTGTTGAACACAAAGAGAGGGAGAACAATCAGTTGCAACTTTTAGAAAATATAACAAATCACCTTCGTGAAAT
CCGACAACTGACGGATCCTGGAAAGTCAACTCTGCAAGCTCCACTGCCTCCATCCTCTAAGGCTGCGTCAAAGAAACGTTGCTGGAGAAGTTCATCCCTAAGCATTCATA
GAAGCAGCTAG
Protein sequence
MACEAIQLWTFNGLVAAFLDLGVAYFLLCASSLVFFTSKFLALFGLCLPCPCDGLFGNLSSDHCFQKLLVDRSSKKISSVLHSAREKFPLNSMWDQEPKCCFKSMSMHER
NAKDAHVELEGEASGSSFFRTGSSQGMIYGDFPSVKESHCKGGGVGRRKVISVSPNDILQSDVEDLCQSPSTFSGFGDNNTVDGFFSVDSGDEREASSDNSDRYKIFPDL
ELDESYDEKICAEMYEEDGNNCRGELCLDGNESDTIKLLERALEEEQTARATLYLELEKERSAAATAADEAMAMILRLQEEKASIEMDARQYQRMIEEKTAYDAEEMSIL
KEILVRREREMHFLEKEVEAFQKSLFADDGLDSEVTPQRAPFLYSTEDPSHMLRCINRSIRDKQDANYTNVTKHSSKYEIPSMESRKLTYEFGEESPFIQADDLADAAKA
GGMSLHQVTDNFEDSEEIDNELQEKAMVEDENAYILQGEVNEPEPYLQSKESDGLHLIEKSTELIADDCEKVDDVSYDGLALSKTIPPSVEYNLEKNADPQKQWTRDLNS
VTDIEPHPHEIHVLGDEVSMRNEASADASKELVVNGTSSIPAKFDSPSFSLLQSDLDITRSSSDATGRFPPMARSRSNFLQSELRRNSMSAVDYERSKIGNEVEWLRERL
KIVQEGREKLKFSVEHKERENNQLQLLENITNHLREIRQLTDPGKSTLQAPLPPSSKAASKKRCWRSSSLSIHRSS