; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016862 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016862
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGTD-binding domain-containing protein
Genome locationtig00153014:271377..273556
RNA-Seq ExpressionSgr016862
SyntenySgr016862
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR007656 - GTD-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139633.1 probable myosin-binding protein 5 [Cucumis sativus]9.7e-18168.49Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        MSFHEIHSWTL G+V AFLDLAVVYFLLCVSAT+FIPSKI +V+G CLPCPCTGFYG  N N C HKL+VNWPKRKI LVL+L K  FPFDLIL+DD+  
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE
          NSNRNL   NGI  LQSE CCST   PRLQNLVD DGE DGKGK+IMYQ+PRTKIRRRRR  ++NGKLSKG+CE NETRK RE VALVER+DF     
Subjt:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE

Query:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL
                           I+ D NESNH+DLG+R WQGFESSGS+GEN+YM+KGSST+GQGTS A+ER IIRNEAS+IRLLE ALEEE+ ARASL++EL
Subjt:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL

Query:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQ---S
        EEERAAAATAADEAIAMITRLQNEKAS EMEARQY R +EEKF+YDE +MNILREILVKR+IDYHVLEKEIEAYR+MDF+E+E LK N +FILDE    S
Subjt:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQ---S

Query:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV
        AT HYSNGDPPIV  I NA+SL R+AK+                           NEL++NSLL +HI I+AAP CGGFEK FLSRGALQ +LE + H V
Subjt:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV

Query:  NGIGSSILDME
        N +G SILDME
Subjt:  NGIGSSILDME

XP_022150079.1 uncharacterized protein LOC111018343 [Momordica charantia]2.7e-17868.49Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        MS HEIH WTL GL+GAFLDLA+VY LLC+SAT FIP KI ++IGL LPCPC+GFYG QN N C  +LL NWPKRKI  VL   KTRFPFDL+L+DDQ+G
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE
        NSN       T G+P  Q             QNL D+DG+ D KGKR+MYQRPRTKIRR RR  VE G+L++GIC+ NE RKERESVALVER++FI    
Subjt:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE

Query:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL
            H +                INESNHI L ERTWQG ESSGSVGENNY+DKGSSTIGQ TS+A+ERGIIRNEASSI LLE ALEEEKAARASLY+EL
Subjt:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL

Query:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---EQS
        EEERAAAATAADEAIAMI RLQNEKASVEMEARQYQRVIEEKFAYDE EMN+LREILVKREIDYHVLEKEIEAYREMDFSE+EQ  RNW+FILD   EQS
Subjt:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---EQS

Query:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV
        ATTHYSNGDPPIVQQIENAISL RKAK+NET SYNSQCHFNE  LL+QT WT +DNELK+NSL+CEH   +AA                        H V
Subjt:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV

Query:  NGIGSSILDME
          +GSS LDME
Subjt:  NGIGSSILDME

XP_022953644.1 probable myosin-binding protein 6 [Cucurbita moschata]1.0e-17769.14Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        M+FHEIHSWT CGLV AFLDLAVVYFLLCVSATVFIPSKI EV+G CLPCPCTGFYG QN N CLH+LL +WPKRKI LVL+  K RFPFDLIL+DDQMG
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  -NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGG
         N N  R  +HT+G+PLLQS ACCS                   KG RIM+QRP     RRRR  VE GKL          RKE E +AL +++DF    
Subjt:  -NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGG

Query:  EGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLE
                            I  D+NES H+DLG RTWQGFESSGSVGEN+Y++KGSSTIG GT++ QER I  NE  SIRLLE+ALEEEKAARASL+LE
Subjt:  EGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLE

Query:  LEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---EQ
        LEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDE EMNILREILV+REIDYHVLEKEIEAYR+MDFSEKE+LKRNW+FILD   EQ
Subjt:  LEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---EQ

Query:  SATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHT
        S+T  YSNGDPP+V QIENA+SL  KAK NE+NS NSQCHFNEE LLKQTIWT KDNEL +NSLL E   IE A   GGFEK  LSRGALQ  LE +DHT
Subjt:  SATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHT

Query:  VNGIGSSILDME
        +N +GSSILDME
Subjt:  VNGIGSSILDME

XP_038877576.1 uncharacterized protein LOC120069830 isoform X1 [Benincasa hispida]6.3e-21275.44Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        MSFHEIHSWTL GL+ AFLDL VVYFLLCVSATVFIPSKI +++G CLPCPC+GFYG  N N C H+L+VNWPKRKI LVL+L K RFPFDLIL+D+QM 
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSN--RNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILG
        NSN N  R  +H +GI   QSE CCSTFS PRLQNLVDKD E DGKGKRIMYQRP+TKIRRRRR  ++NGKLSKGICEGNET KERE VALVER+DF   
Subjt:  NSNSN--RNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILG

Query:  GEGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYL
                             I+ D+NESNH+DLG+RTWQGFESSGS GENN+M+K SST+GQGTS+A+ER IIRNEASSIRLLE+ALEEE+AARASL++
Subjt:  GEGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYL

Query:  ELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---E
        ELEEERAAAATAADEAIAMITRLQNEKASVEMEARQY RVIEEKFAYDE ++NILREILVKR+IDYHVLEKEIEAYR+MDFSEKEQLK NW+F+LD   E
Subjt:  ELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---E

Query:  QSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDH
        QS T HYSNGDPPIV QI NAIS  RKAK+NETNS  SQCHFNEE  LKQTIW  K+NELK+NSLLC+HI IEAAP CGGF+K FLSRGALQESLE +DH
Subjt:  QSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDH

Query:  TVNGIGSSILDME
         VN + SSILDME
Subjt:  TVNGIGSSILDME

XP_038877577.1 uncharacterized protein LOC120069830 isoform X2 [Benincasa hispida]9.7e-20573.88Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        MSFHEIHSWTL GL+ AFLDL VVYFLLCVSATVFIPSKI +++G CLPCPC+GFYG  N N C H+L+VNWPKRKI LVL+L K RFPFDLIL+D+QM 
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSN--RNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILG
        NSN N  R  +H +GI   QSE CCSTFS PRLQNLVDKD E DGKGKRIMYQRP+TKIRRRRR  ++NGKLSKGICEGNET KERE VALVER+DFI  
Subjt:  NSNSN--RNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILG

Query:  GEGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYL
                                          G+RTWQGFESSGS GENN+M+K SST+GQGTS+A+ER IIRNEASSIRLLE+ALEEE+AARASL++
Subjt:  GEGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYL

Query:  ELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---E
        ELEEERAAAATAADEAIAMITRLQNEKASVEMEARQY RVIEEKFAYDE ++NILREILVKR+IDYHVLEKEIEAYR+MDFSEKEQLK NW+F+LD   E
Subjt:  ELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---E

Query:  QSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDH
        QS T HYSNGDPPIV QI NAIS  RKAK+NETNS  SQCHFNEE  LKQTIW  K+NELK+NSLLC+HI IEAAP CGGF+K FLSRGALQESLE +DH
Subjt:  QSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDH

Query:  TVNGIGSSILDME
         VN + SSILDME
Subjt:  TVNGIGSSILDME

TrEMBL top hitse value%identityAlignment
A0A0A0K8C0 GTD-binding domain-containing protein4.7e-18168.49Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        MSFHEIHSWTL G+V AFLDLAVVYFLLCVSAT+FIPSKI +V+G CLPCPCTGFYG  N N C HKL+VNWPKRKI LVL+L K  FPFDLIL+DD+  
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE
          NSNRNL   NGI  LQSE CCST   PRLQNLVD DGE DGKGK+IMYQ+PRTKIRRRRR  ++NGKLSKG+CE NETRK RE VALVER+DF     
Subjt:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE

Query:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL
                           I+ D NESNH+DLG+R WQGFESSGS+GEN+YM+KGSST+GQGTS A+ER IIRNEAS+IRLLE ALEEE+ ARASL++EL
Subjt:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL

Query:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQ---S
        EEERAAAATAADEAIAMITRLQNEKAS EMEARQY R +EEKF+YDE +MNILREILVKR+IDYHVLEKEIEAYR+MDF+E+E LK N +FILDE    S
Subjt:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQ---S

Query:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV
        AT HYSNGDPPIV  I NA+SL R+AK+                           NEL++NSLL +HI I+AAP CGGFEK FLSRGALQ +LE + H V
Subjt:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV

Query:  NGIGSSILDME
        N +G SILDME
Subjt:  NGIGSSILDME

A0A1S3C7F2 uncharacterized protein LOC103497547 isoform X15.4e-17768.16Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILID-DQM
        MSFHEIHSWTL GLV AFLDLAVVYFLLCVSAT+FIPSKI +V+G CLPCPCTGFYG  N N CLHKL+VNWPKRKI LVL+L K RFPFDL  +D +Q+
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILID-DQM

Query:  GNSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGG
        G  NSNRN+   NGI  LQSE CCST   PRLQN+VDK GE DGKGK++MYQ+PRTKIRRRRR  V+NGKLSKGICEGNETRKERE VALVER+DF    
Subjt:  GNSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGG

Query:  EGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLE
                            I+ D NESNH DLG+R WQGFESSGS+GENNYM+KGSST+GQGTS+A+ER IIRNEAS+IRLLE ALEEE+AARASL++E
Subjt:  EGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLE

Query:  LEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQ---
        LEEERAAAATAADEAIAMITRLQNEKAS EMEARQY R +EEKF+YDE +MNILREILVKR+IDYHVLEKEIEAYR+MDF+EKEQLK N ++ILDE    
Subjt:  LEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQ---

Query:  SATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHT
        S T H SNGDPPI   I NA+SL R  K+                           NEL++NSLL +   IEAAP CGGFEK FLSRGALQ +L+ + H 
Subjt:  SATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHT

Query:  VNGIGSSILDME
        VN +G SI+DME
Subjt:  VNGIGSSILDME

A0A5D3E523 Putative myosin-binding protein 5 isoform X21.1e-17467.91Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        MSFHEIHSWTL GLV AFLDLAVVYFLLCVSAT+FIPSKI +V+G CLPCPCTGFYG  N N CLHKL+VNWPKRKI LVL+L K RFPFDL  +DD+  
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE
          NSNRNL   NGI  LQSE CCST   PRLQN+VDK GE DGK K++MYQ+PRTKIRRRRR  V+NGKLSKGICEGNETRKERE VALVER+DF     
Subjt:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE

Query:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL
                           I+ D NESNH DLG+R WQGFESSGS+GENNYM+KGSST+GQ  S+A+ER IIRNEAS+IRLLE ALEEE+AARASL++EL
Subjt:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL

Query:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQ---S
        EEERAAAATAADEAIAMITRLQNEKAS EMEARQY R +EEKF+YDE +MNILREILVKR+IDYHVLEKEIEAYR+MDF+EKEQLK N ++ILDE    S
Subjt:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQ---S

Query:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV
        AT H SNGDPPI   I NA+SL R  K+                           NEL++NSLL +   IEAAP CGGFEK FLSRGALQ +L+ + H V
Subjt:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV

Query:  NGIGSSILDME
        N +G SI+DME
Subjt:  NGIGSSILDME

A0A6J1D8Y8 uncharacterized protein LOC1110183431.3e-17868.49Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        MS HEIH WTL GL+GAFLDLA+VY LLC+SAT FIP KI ++IGL LPCPC+GFYG QN N C  +LL NWPKRKI  VL   KTRFPFDL+L+DDQ+G
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE
        NSN       T G+P  Q             QNL D+DG+ D KGKR+MYQRPRTKIRR RR  VE G+L++GIC+ NE RKERESVALVER++FI    
Subjt:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGE

Query:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL
            H +                INESNHI L ERTWQG ESSGSVGENNY+DKGSSTIGQ TS+A+ERGIIRNEASSI LLE ALEEEKAARASLY+EL
Subjt:  GGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLEL

Query:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---EQS
        EEERAAAATAADEAIAMI RLQNEKASVEMEARQYQRVIEEKFAYDE EMN+LREILVKREIDYHVLEKEIEAYREMDFSE+EQ  RNW+FILD   EQS
Subjt:  EEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---EQS

Query:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV
        ATTHYSNGDPPIVQQIENAISL RKAK+NET SYNSQCHFNE  LL+QT WT +DNELK+NSL+CEH   +AA                        H V
Subjt:  ATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTV

Query:  NGIGSSILDME
          +GSS LDME
Subjt:  NGIGSSILDME

A0A6J1GQ95 probable myosin-binding protein 64.9e-17869.14Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        M+FHEIHSWT CGLV AFLDLAVVYFLLCVSATVFIPSKI EV+G CLPCPCTGFYG QN N CLH+LL +WPKRKI LVL+  K RFPFDLIL+DDQMG
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  -NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGG
         N N  R  +HT+G+PLLQS ACCS                   KG RIM+QRP     RRRR  VE GKL          RKE E +AL +++DF    
Subjt:  -NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGG

Query:  EGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLE
                            I  D+NES H+DLG RTWQGFESSGSVGEN+Y++KGSSTIG GT++ QER I  NE  SIRLLE+ALEEEKAARASL+LE
Subjt:  EGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLE

Query:  LEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---EQ
        LEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDE EMNILREILV+REIDYHVLEKEIEAYR+MDFSEKE+LKRNW+FILD   EQ
Subjt:  LEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILD---EQ

Query:  SATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHT
        S+T  YSNGDPP+V QIENA+SL  KAK NE+NS NSQCHFNEE LLKQTIWT KDNEL +NSLL E   IE A   GGFEK  LSRGALQ  LE +DHT
Subjt:  SATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHT

Query:  VNGIGSSILDME
        +N +GSSILDME
Subjt:  VNGIGSSILDME

SwissProt top hitse value%identityAlignment
F4HVS6 Probable myosin-binding protein 62.1e-1334.43Show/hide
Query:  SSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYRE
        S +  L++ +  +K +   LY+EL+EER+A+A AA+EA+AMITRLQ EKA+V+MEA QYQR+++E+  YD+  +  +   L KRE +   LE E E YRE
Subjt:  SSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYRE

Query:  MDFSEKEQLKRNWNFILDEQSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTG--KDNELKE
              +Q      F     +A+ +    +   V  +  A+S   + +  E    N Q   +EE   +  +     K +E KE
Subjt:  MDFSEKEQLKRNWNFILDEQSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLKQTIWTG--KDNELKE

Q0WNW4 Myosin-binding protein 32.5e-1447.96Show/hide
Query:  SIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYR
        +I  L   +  E+ A   LY ELEEER+A+A +A++ +AMITRLQ EKA V+MEA QYQR++EE+  YD+  + +L  ++VKRE +   L++E+E YR
Subjt:  SIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYR

Q9CAC4 Myosin-binding protein 27.4e-1443.08Show/hide
Query:  LERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREM--DF
        L+  L+EE+ A  +LY ELE ER A+A AA E +AMI RL  EKA+++MEA QYQR++EE+  +D+  + +L E++V RE +   LEKE+E YR+   ++
Subjt:  LERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREM--DF

Query:  SEKEQ---LKRNWNFILDEQSATTHYSNGD
          KE+   L+R     L + S  ++ +NGD
Subjt:  SEKEQ---LKRNWNFILDEQSATTHYSNGD

Q9FG14 Myosin-binding protein 75.8e-1134.55Show/hide
Query:  SSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAY--
        + + LL   +  ++ +   LY EL+EER AA+TAA EA++MI RLQ +KA ++ME RQ++R  EEK  +D+ E+  L +++ KRE     L  E +AY  
Subjt:  SSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAY--

Query:  REMDF--------SEKEQLKRNWNFILDEQSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSY
        R M F        +EK  L RN + I ++       S+  P      EN   L     V++   Y
Subjt:  REMDF--------SEKEQLKRNWNFILDEQSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSY

Q9LMC8 Probable myosin-binding protein 58.1e-1347.47Show/hide
Query:  SSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYR
        S ++ L R +  ++ +   LY+EL+EER+A+A AA+ A+AMITRLQ EKA+V+MEA QYQR+++E+  YD+  +  +  +LVKRE +   LE  IE YR
Subjt:  SSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYR

Arabidopsis top hitse value%identityAlignment
AT1G04890.1 Protein of unknown function, DUF5937.0e-2834.14Show/hide
Query:  WQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRN-EASSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQY
        W  FE + SV  N  ++  SS          E+  +RN E  S+R LE  L+EE+AARA++ +EL++ER+AAA+AADEA+AMI RLQ+EKA++EMEARQ+
Subjt:  WQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRN-EASSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQY

Query:  QRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQSA--TTHYSNGD-----PPIVQQIENAI-SLPRKAKVNE
        QR++EE+  +D  EM IL++IL++RE + H LEKE+EAYR++   E E+L+ +   ++ E++     H  N D       +VQ+++  +  +P + + N 
Subjt:  QRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQSA--TTHYSNGD-----PPIVQQIENAI-SLPRKAKVNE

Query:  TNS---YNSQCHF-------------------NEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDH
          +   Y S                        +++L + ++   K++ L ENS++   I  +  P C   +K   S G+ ++S+  +D+
Subjt:  TNS---YNSQCHF-------------------NEEDLLKQTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDH

AT4G13160.1 Protein of unknown function, DUF5936.0e-2762.26Show/hide
Query:  IRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMD
        +RLLE A+E+EK A+A+L +ELE+ERAA+A+AADEA+AMI RLQ +KAS+EME +QY+R+I+EKFAYDE EMNIL+EIL KRE + H LEKE+E Y+ +D
Subjt:  IRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMD

Query:  FSEKEQ
          ++ +
Subjt:  FSEKEQ

AT4G13160.1 Protein of unknown function, DUF5933.9e-1043.82Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFP
        M + E +  T  G++ AF++LA  Y LLCVSA VFI SK+     L +PC      G QN + C+ KLL +WP R I  V +LA T  P
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFP

AT4G13630.1 Protein of unknown function, DUF5931.0e-3934.17Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        M   E+ SWT  GLV AF+DL+V + LLC S  V++ SK   + GL LPCPC G Y     + C  + L N P +KI  V    K R PFD IL +    
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVD----------KDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALV
             R +        L+ E   +T S  + +N             K G    K KR+ + R     +   +                            
Subjt:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVD----------KDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALV

Query:  EREDFILGGEGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSST----IGQGTSDAQERGIIRNEASSIRLLERAL
                              SC  + +  G  +E++ + +          SG   E+  + K  S      G G       G+++    ++ + E+ L
Subjt:  EREDFILGGEGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSST----IGQGTSDAQERGIIRNEASSIRLLERAL

Query:  EEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQ
         EE+AARASL LELE+ER AAA+AADEA+ MI RLQ EKAS+EMEARQYQR+IEEK A+D  EM+IL+EIL++RE + H LEKE++ YR+M F E EQ
Subjt:  EEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQ

AT4G13630.2 Protein of unknown function, DUF5931.0e-3934.17Show/hide
Query:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG
        M   E+ SWT  GLV AF+DL+V + LLC S  V++ SK   + GL LPCPC G Y     + C  + L N P +KI  V    K R PFD IL +    
Subjt:  MSFHEIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMG

Query:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVD----------KDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALV
             R +        L+ E   +T S  + +N             K G    K KR+ + R     +   +                            
Subjt:  NSNSNRNLYHTNGIPLLQSEACCSTFSGPRLQNLVD----------KDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALV

Query:  EREDFILGGEGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSST----IGQGTSDAQERGIIRNEASSIRLLERAL
                              SC  + +  G  +E++ + +          SG   E+  + K  S      G G       G+++    ++ + E+ L
Subjt:  EREDFILGGEGGVLHRIFAISCSCFCILAISGDINESNHIDLGERTWQGFESSGSVGENNYMDKGSST----IGQGTSDAQERGIIRNEASSIRLLERAL

Query:  EEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQ
         EE+AARASL LELE+ER AAA+AADEA+ MI RLQ EKAS+EMEARQYQR+IEEK A+D  EM+IL+EIL++RE + H LEKE++ YR+M F E EQ
Subjt:  EEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQ

AT5G16720.1 Protein of unknown function, DUF5931.8e-1547.96Show/hide
Query:  SIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYR
        +I  L   +  E+ A   LY ELEEER+A+A +A++ +AMITRLQ EKA V+MEA QYQR++EE+  YD+  + +L  ++VKRE +   L++E+E YR
Subjt:  SIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQYQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGACGAAGCTCAATCCTCCGCAAAACGACATCGCTTTCATCCTTGTCCTCTACCTTCCCTTCACCTTGCCGTCTCCTTTCTATTCTGGCTGTCCACCTCCTCCTCT
CCACTGCTCCTTGAAATGGACACCTGATTTGACTCGCTTCGTCATCGCTCCCTTCCCCCTCTCTTTCCTCGCCGAATCCAACACCAATTTTCCTCAGTTCTTCTTACCAA
GTTATAGAAAGAGTTCGGGTTTTCTCTGTGGGATTGTCGATTTTTTTTTTTTTTTTTTAAAACGTGGTGGTGGTGGTTCTGGAGATCGGAATCGGAGGATGTCATTCCAC
GAGATTCATTCATGGACCTTGTGTGGACTAGTTGGAGCATTTCTTGACCTAGCTGTAGTTTATTTTCTTTTGTGCGTATCGGCGACCGTATTTATCCCGTCGAAGATTTT
CGAAGTTATTGGATTGTGCTTGCCTTGTCCTTGTACTGGATTTTATGGGATTCAGAACTGTAATTTCTGCTTGCATAAACTGCTTGTCAATTGGCCAAAGAGGAAGATCT
GTTTGGTGCTTGAGTTGGCCAAGACTAGGTTTCCTTTTGATTTGATTTTAATCGATGACCAAATGGGTAATTCGAATTCGAATAGGAATTTATATCATACGAATGGAATT
CCTCTGTTGCAGTCTGAGGCATGTTGTAGTACTTTCTCTGGCCCAAGATTGCAGAATCTGGTTGATAAAGATGGCGAGTGTGATGGTAAGGGGAAGAGGATTATGTACCA
GAGGCCAAGGACTAAAATCCGACGGCGGAGGAGAACTGTTGTTGAAAATGGGAAATTGTCCAAGGGAATCTGTGAGGGCAATGAAACTAGAAAGGAAAGGGAATCTGTGG
CATTGGTTGAGAGAGAAGATTTTATTCTAGGGGGTGAGGGTGGGGTTCTACATCGAATTTTTGCTATATCCTGCTCATGTTTCTGTATCCTTGCTATTTCAGGTGATATT
AATGAATCAAATCACATTGACTTGGGTGAAAGAACCTGGCAAGGTTTTGAATCAAGTGGCTCGGTTGGCGAAAATAATTATATGGATAAAGGTTCTTCAACTATAGGACA
AGGTACCAGTGATGCCCAAGAGAGAGGCATTATCAGAAATGAAGCTAGTTCTATTAGATTGTTGGAGCGAGCACTTGAAGAAGAGAAAGCTGCTCGGGCATCTCTGTACC
TGGAACTGGAAGAGGAGAGAGCTGCTGCTGCTACTGCTGCTGATGAAGCAATAGCCATGATAACACGTCTGCAAAATGAGAAGGCGTCAGTTGAAATGGAAGCAAGACAA
TATCAGAGGGTAATAGAAGAAAAATTTGCTTATGATGAAGTAGAGATGAATATCCTTAGAGAGATCCTTGTCAAGAGGGAAATAGATTATCACGTTTTGGAGAAGGAAAT
TGAAGCGTATAGGGAGATGGATTTTTCAGAAAAGGAACAGTTAAAAAGAAATTGGAATTTTATATTGGATGAACAGTCTGCCACTACCCATTACTCAAATGGAGATCCCC
CCATTGTTCAGCAAATTGAGAATGCTATTTCTCTTCCACGAAAAGCAAAGGTGAATGAAACTAACAGTTATAACTCTCAATGCCATTTTAATGAGGAAGACTTGCTCAAG
CAAACTATCTGGACGGGTAAAGACAATGAACTGAAGGAAAATAGCTTATTATGTGAGCATATAACAATTGAGGCAGCTCCATTTTGTGGTGGTTTTGAGAAAGGCTTTCT
TTCCCGTGGTGCATTACAAGAAAGTTTGGAGCCCTTAGATCACACGGTTAATGGTATCGGAAGTTCCATACTTGATATGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGATGACGAAGCTCAATCCTCCGCAAAACGACATCGCTTTCATCCTTGTCCTCTACCTTCCCTTCACCTTGCCGTCTCCTTTCTATTCTGGCTGTCCACCTCCTCCTCT
CCACTGCTCCTTGAAATGGACACCTGATTTGACTCGCTTCGTCATCGCTCCCTTCCCCCTCTCTTTCCTCGCCGAATCCAACACCAATTTTCCTCAGTTCTTCTTACCAA
GTTATAGAAAGAGTTCGGGTTTTCTCTGTGGGATTGTCGATTTTTTTTTTTTTTTTTTAAAACGTGGTGGTGGTGGTTCTGGAGATCGGAATCGGAGGATGTCATTCCAC
GAGATTCATTCATGGACCTTGTGTGGACTAGTTGGAGCATTTCTTGACCTAGCTGTAGTTTATTTTCTTTTGTGCGTATCGGCGACCGTATTTATCCCGTCGAAGATTTT
CGAAGTTATTGGATTGTGCTTGCCTTGTCCTTGTACTGGATTTTATGGGATTCAGAACTGTAATTTCTGCTTGCATAAACTGCTTGTCAATTGGCCAAAGAGGAAGATCT
GTTTGGTGCTTGAGTTGGCCAAGACTAGGTTTCCTTTTGATTTGATTTTAATCGATGACCAAATGGGTAATTCGAATTCGAATAGGAATTTATATCATACGAATGGAATT
CCTCTGTTGCAGTCTGAGGCATGTTGTAGTACTTTCTCTGGCCCAAGATTGCAGAATCTGGTTGATAAAGATGGCGAGTGTGATGGTAAGGGGAAGAGGATTATGTACCA
GAGGCCAAGGACTAAAATCCGACGGCGGAGGAGAACTGTTGTTGAAAATGGGAAATTGTCCAAGGGAATCTGTGAGGGCAATGAAACTAGAAAGGAAAGGGAATCTGTGG
CATTGGTTGAGAGAGAAGATTTTATTCTAGGGGGTGAGGGTGGGGTTCTACATCGAATTTTTGCTATATCCTGCTCATGTTTCTGTATCCTTGCTATTTCAGGTGATATT
AATGAATCAAATCACATTGACTTGGGTGAAAGAACCTGGCAAGGTTTTGAATCAAGTGGCTCGGTTGGCGAAAATAATTATATGGATAAAGGTTCTTCAACTATAGGACA
AGGTACCAGTGATGCCCAAGAGAGAGGCATTATCAGAAATGAAGCTAGTTCTATTAGATTGTTGGAGCGAGCACTTGAAGAAGAGAAAGCTGCTCGGGCATCTCTGTACC
TGGAACTGGAAGAGGAGAGAGCTGCTGCTGCTACTGCTGCTGATGAAGCAATAGCCATGATAACACGTCTGCAAAATGAGAAGGCGTCAGTTGAAATGGAAGCAAGACAA
TATCAGAGGGTAATAGAAGAAAAATTTGCTTATGATGAAGTAGAGATGAATATCCTTAGAGAGATCCTTGTCAAGAGGGAAATAGATTATCACGTTTTGGAGAAGGAAAT
TGAAGCGTATAGGGAGATGGATTTTTCAGAAAAGGAACAGTTAAAAAGAAATTGGAATTTTATATTGGATGAACAGTCTGCCACTACCCATTACTCAAATGGAGATCCCC
CCATTGTTCAGCAAATTGAGAATGCTATTTCTCTTCCACGAAAAGCAAAGGTGAATGAAACTAACAGTTATAACTCTCAATGCCATTTTAATGAGGAAGACTTGCTCAAG
CAAACTATCTGGACGGGTAAAGACAATGAACTGAAGGAAAATAGCTTATTATGTGAGCATATAACAATTGAGGCAGCTCCATTTTGTGGTGGTTTTGAGAAAGGCTTTCT
TTCCCGTGGTGCATTACAAGAAAGTTTGGAGCCCTTAGATCACACGGTTAATGGTATCGGAAGTTCCATACTTGATATGGAATAG
Protein sequenceShow/hide protein sequence
MMTKLNPPQNDIAFILVLYLPFTLPSPFYSGCPPPPLHCSLKWTPDLTRFVIAPFPLSFLAESNTNFPQFFLPSYRKSSGFLCGIVDFFFFFLKRGGGGSGDRNRRMSFH
EIHSWTLCGLVGAFLDLAVVYFLLCVSATVFIPSKIFEVIGLCLPCPCTGFYGIQNCNFCLHKLLVNWPKRKICLVLELAKTRFPFDLILIDDQMGNSNSNRNLYHTNGI
PLLQSEACCSTFSGPRLQNLVDKDGECDGKGKRIMYQRPRTKIRRRRRTVVENGKLSKGICEGNETRKERESVALVEREDFILGGEGGVLHRIFAISCSCFCILAISGDI
NESNHIDLGERTWQGFESSGSVGENNYMDKGSSTIGQGTSDAQERGIIRNEASSIRLLERALEEEKAARASLYLELEEERAAAATAADEAIAMITRLQNEKASVEMEARQ
YQRVIEEKFAYDEVEMNILREILVKREIDYHVLEKEIEAYREMDFSEKEQLKRNWNFILDEQSATTHYSNGDPPIVQQIENAISLPRKAKVNETNSYNSQCHFNEEDLLK
QTIWTGKDNELKENSLLCEHITIEAAPFCGGFEKGFLSRGALQESLEPLDHTVNGIGSSILDME