; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029981 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029981
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPolyadenylate-binding protein 1-like
Genome locationtig00153554:1753799..1765022
RNA-Seq ExpressionSgr029981
SyntenySgr029981
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006418 - tRNA aminoacylation for protein translation (biological process)
GO:0016592 - mediator complex (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0003723 - RNA binding (molecular function)
GO:0004812 - aminoacyl-tRNA ligase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR002305 - Aminoacyl-tRNA synthetase, class Ic
IPR008831 - Mediator complex, subunit Med31
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
IPR035979 - RNA-binding domain superfamily
IPR038089 - Mediator of RNA polymerase II, subunit Med31 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF9668146.1 hypothetical protein SADUNF_Sadunf15G0098100 [Salix dunnii]2.7e-13462.31Show/hide
Query:  MDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHF
        MDADVDMSS RA+E+ Y+  +S          KRRLKEIEEEAGALREMQAKVEKEMGA+ +           K        ++  VDY+CTPEEVQQHF
Subjt:  MDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHF

Query:  QSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIPAATFYPSYGYGPPHVAFP
        QSCGTVNRVTILTDKFG PKGFAYVEF+EVDAVQNAL+LNESELHGRQLKVSAKRTNVPGMKQ+RGRR + +GFR +R F+P A FY  YGYG       
Subjt:  QSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIPAATFYPSYGYGPPHVAFP

Query:  VRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYI
             S  +  + +       Y     ++          SFE A                E   + PSSP  +YKDPDDGRQRFLLELEFVQCLANPTYI
Subjt:  VRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYI

Query:  HYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPPVSAP
        HYLAQNRYF+D+AFIGYLKYL YWQ+PEY+KFIMYPHCL+FLELLQNANFRNAMAHP NKELAHRQQF+FWKNYRNNRLKHILPR LPEPAA PP  +  
Subjt:  HYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPPVSAP

Query:  PQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK
        P  PVPA    M A+    A  SPM YG+PPGS   K+D++S G +RRKRK
Subjt:  PQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK

RXH94803.1 hypothetical protein DVH24_024487 [Malus domestica]8.4e-16068.65Show/hide
Query:  EEHEHEVYGGDIP-DDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSV
        E+HEHEVYG +IP DDGE   DVDMSS RAD    E  + + NSK+LEDMK+RLKEIEEEAGALR+MQAKVEKEMGAVQ D+SS SA+QAEKEEVDSRS+
Subjt:  EEHEHEVYGGDIP-DDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSV

Query:  YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIP
        YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQNALLLNESELHGRQLKVSAKRTNVPG+KQYRGRRPN F FR RRPFIP
Subjt:  YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIP

Query:  AATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEE--------VDNP------S
        AA FYPS+GYG        RR  SL    F++L           L L  E     G +          D   ++H  ++ E        V +P       
Subjt:  AATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEE--------VDNP------S

Query:  SPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQF
          +++YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQ+PEY +FIMYPHCLFFLE LQNANFR AMAHPGNKELAHRQQF
Subjt:  SPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQF

Query:  YFWKNYRNNRLKHILPRTLPEPA-ALPPPVSAPPQAPVPATAPAMAASPAGTA---ALSPMQYGIPPGSGLAKNDMKSGGIDRRKRKH
        YFWKNYRNNRLKHILPR LPEP  ALPPP  APPQ  VP   P  A + + TA   A SPMQY +PPGS   K + ++ G+DRRKRKH
Subjt:  YFWKNYRNNRLKHILPRTLPEPA-ALPPPVSAPPQAPVPATAPAMAASPAGTA---ALSPMQYGIPPGSGLAKNDMKSGGIDRRKRKH

TKR68723.1 polyadenylate-binding protein 1-like [Populus alba]1.8e-16266.95Show/hide
Query:  EHNFNYEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV
        +H  +  EHEHEVYGG+IPD+GEMDADVDMSS RA+E+    E+ + NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DS   SATQAEKEEV
Subjt:  EHNFNYEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV

Query:  DSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGR
        DSRS+YVGNVDY+CTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEVDA+QNALLLNESELHGRQLKVSAKRTNVPGMKQ+RGRRP+ +GFR R
Subjt:  DSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGR

Query:  RPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDP
        RPF+ AA FYP+YGYG                                                                            P  +YKDP
Subjt:  RPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDP

Query:  DDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNN
        DDGRQRFL+ELEFVQCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQ+PEY+KFIMYPHCL+FLELLQNANFRNAMAHPGNKELAHRQQF+FWKNYRNN
Subjt:  DDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNN

Query:  RLKHILPRTLPEPAALPPPVSAPPQAPV-PATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK
        RLKHILPR LPEP   PP  + PP  P  P +A  +  S A  A  SPM YG+PPGS   K D++S G DRRKRK
Subjt:  RLKHILPRTLPEPAALPPPVSAPPQAPV-PATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK

XP_025014092.1 polyadenylate-binding protein 2 isoform X1 [Ricinus communis]5.6e-16469.04Show/hide
Query:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAE----HSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD
        +EE EHEVYG +IPD+GEMDAD+DMSS R +EE  E E      N NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DSSS SATQAEKEEVD
Subjt:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAE----HSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD

Query:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSF-GFRGR
        SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTN+PGMKQYRGRRPN + GFR R
Subjt:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSF-GFRGR

Query:  RPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDP
        RPF+P A F P YGYG                                                                            P ++YKDP
Subjt:  RPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDP

Query:  DDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNN
        DDG+QRFLLELEFVQCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNA+FRNAMAHPGNKEL HRQQF+FWKNYRNN
Subjt:  DDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNN

Query:  RLKHILPRTLPEPAALPPPVSAP----PQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK
        RLKHILPR LPEP   PP  + P      +PVP T  AM A  A  +ALSPM YG+PPGS LAKNDM++ GIDRRKRK
Subjt:  RLKHILPRTLPEPAALPPPVSAP----PQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK

XP_025014094.1 polyadenylate-binding protein 2 isoform X2 [Ricinus communis]3.1e-16268.62Show/hide
Query:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAE----HSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD
        +EE EHEVYG +IPD+GEMDAD+DMSS R +EE  E E      N NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DSSS SATQAEKEEVD
Subjt:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAE----HSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD

Query:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSF-GFRGR
        SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTN+PGMKQYRGRRPN + GFR R
Subjt:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSF-GFRGR

Query:  RPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDP
        RPF+P A F P YG                                                                              P ++YKDP
Subjt:  RPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDP

Query:  DDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNN
        DDG+QRFLLELEFVQCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNA+FRNAMAHPGNKEL HRQQF+FWKNYRNN
Subjt:  DDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNN

Query:  RLKHILPRTLPEPAALPPPVSAP----PQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK
        RLKHILPR LPEP   PP  + P      +PVP T  AM A  A  +ALSPM YG+PPGS LAKNDM++ GIDRRKRK
Subjt:  RLKHILPRTLPEPAALPPPVSAP----PQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK

TrEMBL top hitse value%identityAlignment
A0A498JLW5 RRM domain-containing protein4.1e-16068.65Show/hide
Query:  EEHEHEVYGGDIP-DDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSV
        E+HEHEVYG +IP DDGE   DVDMSS RAD    E  + + NSK+LEDMK+RLKEIEEEAGALR+MQAKVEKEMGAVQ D+SS SA+QAEKEEVDSRS+
Subjt:  EEHEHEVYGGDIP-DDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSV

Query:  YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIP
        YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQNALLLNESELHGRQLKVSAKRTNVPG+KQYRGRRPN F FR RRPFIP
Subjt:  YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIP

Query:  AATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEE--------VDNP------S
        AA FYPS+GYG        RR  SL    F++L           L L  E     G +          D   ++H  ++ E        V +P       
Subjt:  AATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEE--------VDNP------S

Query:  SPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQF
          +++YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQ+PEY +FIMYPHCLFFLE LQNANFR AMAHPGNKELAHRQQF
Subjt:  SPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQF

Query:  YFWKNYRNNRLKHILPRTLPEPA-ALPPPVSAPPQAPVPATAPAMAASPAGTA---ALSPMQYGIPPGSGLAKNDMKSGGIDRRKRKH
        YFWKNYRNNRLKHILPR LPEP  ALPPP  APPQ  VP   P  A + + TA   A SPMQY +PPGS   K + ++ G+DRRKRKH
Subjt:  YFWKNYRNNRLKHILPRTLPEPA-ALPPPVSAPPQAPVPATAPAMAASPAGTA---ALSPMQYGIPPGSGLAKNDMKSGGIDRRKRKH

A0A4U5MIM5 Polyadenylate-binding protein 1-like8.8e-16366.95Show/hide
Query:  EHNFNYEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV
        +H  +  EHEHEVYGG+IPD+GEMDADVDMSS RA+E+    E+ + NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DS   SATQAEKEEV
Subjt:  EHNFNYEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV

Query:  DSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGR
        DSRS+YVGNVDY+CTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEVDA+QNALLLNESELHGRQLKVSAKRTNVPGMKQ+RGRRP+ +GFR R
Subjt:  DSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGR

Query:  RPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDP
        RPF+ AA FYP+YGYG                                                                            P  +YKDP
Subjt:  RPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDP

Query:  DDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNN
        DDGRQRFL+ELEFVQCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQ+PEY+KFIMYPHCL+FLELLQNANFRNAMAHPGNKELAHRQQF+FWKNYRNN
Subjt:  DDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNN

Query:  RLKHILPRTLPEPAALPPPVSAPPQAPV-PATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK
        RLKHILPR LPEP   PP  + PP  P  P +A  +  S A  A  SPM YG+PPGS   K D++S G DRRKRK
Subjt:  RLKHILPRTLPEPAALPPPVSAPPQAPV-PATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK

A0A6A6MFI3 Mediator of RNA polymerase II transcription subunit 311.6e-11666.76Show/hide
Query:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSV
        +EE EH+VYG +IPD+GE+DAD+DMSS R +E+    E    NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DSSS SATQAEKEEVDSRS+
Subjt:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSV

Query:  YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIP
        YVGNVDY CTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVE DAVQNALLLNESELHGRQLKVSAKRTN+PGMKQYRGRRPN +GFR RRPF+P
Subjt:  YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIP

Query:  AATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDPDDGRQ
        A  FYP YGYG                                                                            P ++YKDPDDGRQ
Subjt:  AATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDPDDGRQ

Query:  RFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIM
        RFLLELEFVQCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQ+PEYIKFIM
Subjt:  RFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIM

A0A6N2MJL7 RRM domain-containing protein3.8e-14261.43Show/hide
Query:  EHNFNYEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV
        +H  +  EHEHEVYGG+IPD+GEMDAD+DMSS RA+E+    E+ + NSKDLEDMK+RLKEIEEEA ALREMQAKVEKEMGAVQ DS   SATQAEKEEV
Subjt:  EHNFNYEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV

Query:  DSRSVYVGN--VDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFR
        DSRS+YVGN  VDY+CTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVE DA+QNALLLNESELHGRQLKVSAKRTNVPGMK +RGRRP+++GFR
Subjt:  DSRSVYVGN--VDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFR

Query:  GRRPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYK
         RRPF+P    YP+YGYG                              F    + C         F+G+        +      +E++ +  +  T+   
Subjt:  GRRPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYK

Query:  DPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYR
            G+    L+L             YLAQNRYFDD+AFIGYLKYLQYWQ+PEY+KFIMYPHCL+FLELLQN NFRNAMAHPGNKELAHRQQF+FWKNYR
Subjt:  DPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYR

Query:  NNRLKHILPRTLPEPAALPPPVSAPPQAPVPATAPAMAASPAGTAAL-SPMQYGIPPGSGLAKNDMKSGGIDRRKRK
        NNRLKHILPR LPEP   PP  + PPQ P    + A    PA + A+ SPM YG+PPGS   K+D++S G DRRKRK
Subjt:  NNRLKHILPRTLPEPAALPPPVSAPPQAPVPATAPAMAASPAGTAAL-SPMQYGIPPGSGLAKNDMKSGGIDRRKRK

A0A803P1V0 Uncharacterized protein5.5e-14968.28Show/hide
Query:  EHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHP
        EH     ++LEDMK+RLKE+EEEAGALREMQAKVEKEMGAVQ DSSSTSATQAEKEEVDSRS+YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG P
Subjt:  EHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHP

Query:  KGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFL
        KGFAYVEFVEV+AVQNALLLNESELHGRQLKVSAKRTNVPGMKQ+RGRR     FR RRPF PA+ FYPSYGYG                         +
Subjt:  KGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIPAATFYPSYGYGPPHVAFPVRRGTSLARAYFIILCQFL

Query:  PQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLK
        P++R                                           P    ++YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYF+D+AFIGYLK
Subjt:  PQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLK

Query:  YLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPPVSAPPQ---APVPATAPAMAASP
        YLQYWQQPEY+KFIMYPHCL+FLELLQNANFRNAMAHPG+KELAHRQQFYFWKNYRNNRLKHILPR+LPEP   P P   PPQ    PV A+   MAA+P
Subjt:  YLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPPVSAPPQ---APVPATAPAMAASP

Query:  AGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK
             LSPMQY  P GS L K DM++ G DRRKRK
Subjt:  AGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRK

SwissProt top hitse value%identityAlignment
O14327 Polyadenylate-binding protein 21.8e-3552.66Show/hide
Query:  NANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKF-GHPKG
        +   K+L +MK R+ E+E EA  LR MQ +++ E          T A + +KE +D++SVYVGNVDY+ TPEE+Q HF SCG+VNRVTIL DKF GHPKG
Subjt:  NANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKF-GHPKG

Query:  FAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIPAATFYPSY
        FAY+EF E   V NALLLN S LH R LKV+ KRTNVPGM + RGR       RGR  +   A  +  Y
Subjt:  FAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIPAATFYPSY

Q8VYB1 Mediator of RNA polymerase II transcription subunit 313.4e-7170.79Show/hide
Query:  MASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRN
        MAS   + ++  + PS P N YKDPD GRQRFLLELEF+QCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQ+PEYIKFIMYPHCL+FLELLQN NFR 
Subjt:  MASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRN

Query:  AMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPPVSAPPQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKND---MKSGGIDRRK
        AMAHP NKELAHRQQFY+WKNYRNNRLKHILPR LPEP  +PP    PP AP  +  PA +A+ A + ALSPMQY     + L+KND   M + GIDRRK
Subjt:  AMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPPVSAPPQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKND---MKSGGIDRRK

Query:  RK
        RK
Subjt:  RK

Q93VI4 Polyadenylate-binding protein 16.9e-7267.7Show/hide
Query:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGY----EAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD
        ++E EHEVYGG+IP++ E + D +       EEG     E     ++S+DLEDMK+R+KEIEEEAGALREMQAK EK+MGA Q+ S   SA  AEKEEVD
Subjt:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGY----EAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD

Query:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGR-RPNSFGFRGR
        SRS+YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQN+L+LNESELHGRQ+KVSAKRTNVPGM+Q+RGR RP    FR  
Subjt:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGR-RPNSFGFRGR

Query:  RPFIPAATFYPSYGYG-PPHVAFPVR
        R F+P   FYP Y YG  P    P+R
Subjt:  RPFIPAATFYPSYGYG-PPHVAFPVR

Q9FJN9 Polyadenylate-binding protein 22.7e-6865.93Show/hide
Query:  EEHEHEVYGGDIPDDGEMDADV-----DMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD
        EE EHEVYGG+IPD GEMD D+     D+    AD++           K+L++MK+RLKE+E+EA ALREMQAKVEKEMGA  +D +S +A QA KEEVD
Subjt:  EEHEHEVYGGDIPDDGEMDADV-----DMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD

Query:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSF-GFRGR
        +RSV+VGNVDYACTPEEVQQHFQ+CGTV+RVTILTDKFG PKGFAYVEFVEV+AVQ AL LNESELHGRQLKV  KRTNVPG+KQ+RGRR N + G+R R
Subjt:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSF-GFRGR

Query:  RPFIPAATFYPSYGYG-PPHVAFPVR
        RPF+ +   Y  YGYG  P    P+R
Subjt:  RPFIPAATFYPSYGYG-PPHVAFPVR

Q9LX90 Polyadenylate-binding protein 31.7e-6768.3Show/hide
Query:  EEHEHEVYGGDIPDDGEMDA---DVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSR
        EE EHEVYGG+IP+ G+ D    D+DMS+  ADE+            +L +MKRRLKE+EEEA ALREMQAKVEKEMGA Q D +S +A Q  KEEVD+R
Subjt:  EEHEHEVYGGDIPDDGEMDA---DVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSR

Query:  SVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYR-GRRPNSFGFRGRRP
        SVYVGNVDYACTPEEVQ HFQ+CGTVNRVTIL DKFG PKGFAYVEFVEV+AVQ AL LNESELHGRQLKVS KRTNVPGMKQY  GR   S G+R RRP
Subjt:  SVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYR-GRRPNSFGFRGRRP

Query:  FIPAATFYPSYGYG-PPHVAFPVR
        F+P   FY  YGYG  P    P+R
Subjt:  FIPAATFYPSYGYG-PPHVAFPVR

Arabidopsis top hitse value%identityAlignment
AT5G10350.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.2e-6868.3Show/hide
Query:  EEHEHEVYGGDIPDDGEMDA---DVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSR
        EE EHEVYGG+IP+ G+ D    D+DMS+  ADE+            +L +MKRRLKE+EEEA ALREMQAKVEKEMGA Q D +S +A Q  KEEVD+R
Subjt:  EEHEHEVYGGDIPDDGEMDA---DVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSR

Query:  SVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYR-GRRPNSFGFRGRRP
        SVYVGNVDYACTPEEVQ HFQ+CGTVNRVTIL DKFG PKGFAYVEFVEV+AVQ AL LNESELHGRQLKVS KRTNVPGMKQY  GR   S G+R RRP
Subjt:  SVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYR-GRRPNSFGFRGRRP

Query:  FIPAATFYPSYGYG-PPHVAFPVR
        F+P   FY  YGYG  P    P+R
Subjt:  FIPAATFYPSYGYG-PPHVAFPVR

AT5G10350.2 RNA-binding (RRM/RBD/RNP motifs) family protein1.6e-6870.09Show/hide
Query:  EEHEHEVYGGDIPDDGEMDA---DVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSR
        EE EHEVYGG+IP+ G+ D    D+DMS+  ADE+            +L +MKRRLKE+EEEA ALREMQAKVEKEMGA Q D +S +A Q  KEEVD+R
Subjt:  EEHEHEVYGGDIPDDGEMDA---DVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSR

Query:  SVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYR-GRRPNSFGFRGRRP
        SVYVGNVDYACTPEEVQ HFQ+CGTVNRVTIL DKFG PKGFAYVEFVEV+AVQ AL LNESELHGRQLKVS KRTNVPGMKQY  GR   S G+R RRP
Subjt:  SVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYR-GRRPNSFGFRGRRP

Query:  FIPAATFYPSYGYG
        F+P   FY  YGYG
Subjt:  FIPAATFYPSYGYG

AT5G19910.1 SOH1 family protein2.4e-7270.79Show/hide
Query:  MASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRN
        MAS   + ++  + PS P N YKDPD GRQRFLLELEF+QCLANPTYIHYLAQNRYF+D+AFIGYLKYLQYWQ+PEYIKFIMYPHCL+FLELLQN NFR 
Subjt:  MASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRN

Query:  AMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPPVSAPPQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKND---MKSGGIDRRK
        AMAHP NKELAHRQQFY+WKNYRNNRLKHILPR LPEP  +PP    PP AP  +  PA +A+ A + ALSPMQY     + L+KND   M + GIDRRK
Subjt:  AMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPPVSAPPQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKND---MKSGGIDRRK

Query:  RK
        RK
Subjt:  RK

AT5G51120.1 polyadenylate-binding protein 14.9e-7367.7Show/hide
Query:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGY----EAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD
        ++E EHEVYGG+IP++ E + D +       EEG     E     ++S+DLEDMK+R+KEIEEEAGALREMQAK EK+MGA Q+ S   SA  AEKEEVD
Subjt:  YEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGY----EAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD

Query:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGR-RPNSFGFRGR
        SRS+YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQN+L+LNESELHGRQ+KVSAKRTNVPGM+Q+RGR RP    FR  
Subjt:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGR-RPNSFGFRGR

Query:  RPFIPAATFYPSYGYG-PPHVAFPVR
        R F+P   FYP Y YG  P    P+R
Subjt:  RPFIPAATFYPSYGYG-PPHVAFPVR

AT5G65260.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.9e-6965.93Show/hide
Query:  EEHEHEVYGGDIPDDGEMDADV-----DMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD
        EE EHEVYGG+IPD GEMD D+     D+    AD++           K+L++MK+RLKE+E+EA ALREMQAKVEKEMGA  +D +S +A QA KEEVD
Subjt:  EEHEHEVYGGDIPDDGEMDADV-----DMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD

Query:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSF-GFRGR
        +RSV+VGNVDYACTPEEVQQHFQ+CGTV+RVTILTDKFG PKGFAYVEFVEV+AVQ AL LNESELHGRQLKV  KRTNVPG+KQ+RGRR N + G+R R
Subjt:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSF-GFRGR

Query:  RPFIPAATFYPSYGYG-PPHVAFPVR
        RPF+ +   Y  YGYG  P    P+R
Subjt:  RPFIPAATFYPSYGYG-PPHVAFPVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCACAATTTTAACTACGAGGAGCACGAACACGAAGTCTATGGAGGCGATATTCCCGACGACGGAGAGATGGATGCCGACGTCGACATGTCCTCTGGTAGG
GCCGACGAAGAAGGTTACGAGGCTGAGCATAGCAACGCCAACTCGAAGGACTTGGAGGACATGAAAAGGAGGCTCAAGGAGATCGAAGAGGAAGCTGGTGCCCTT
CGAGAAATGCAGGCCAAGGTTGAGAAGGAGATGGGTGCTGTTCAAGAAGATTCATCCAGTACCTCTGCAACTCAGGCTGAAAAGGAGGAAGTAGATTCTCGATCC
GTATATGTTGGTAATGTTGATTATGCATGCACTCCTGAGGAGGTCCAACAGCATTTTCAATCATGTGGAACAGTGAATAGAGTGACAATTTTGACAGACAAGTTC
GGTCACCCCAAGGGATTTGCTTATGTCGAGTTCGTGGAGGTTGATGCCGTACAGAATGCCTTGCTGTTGAATGAATCAGAGTTGCACGGTCGCCAACTAAAGGTC
TCTGCCAAAAGGACGAATGTCCCTGGCATGAAGCAATACCGAGGAAGGCGGCCTAATTCTTTTGGCTTTCGAGGTCGCCGGCCTTTCATACCTGCTGCAACTTTT
TATCCTTCGTATGGTTATGGTCCTCCCCATGTCGCCTTCCCTGTTCGTCGCGGAACGTCGTTAGCACGAGCATATTTCATTATTTTGTGTCAATTTCTTCCACAG
TATCGATTTCAGTCGCTTGCTTTGGGATGTGAATGTAGCCCTAGTTGGGGAACTTCGTTTGAAGGAGCTTTAAAGTTCGACTGGTTAGATTCGATGGCCTCTAAT
CATGGATTAAACGAAGAAGAAGTTGATAATCCATCATCGCCTACGAATGTATACAAGGATCCAGATGACGGGCGGCAGCGGTTCTTGCTTGAATTGGAATTTGTT
CAATGTCTTGCCAATCCGACTTACATTCACTATCTGGCTCAGAATCGTTACTTCGATGATGATGCTTTCATTGGTTACTTGAAGTATCTTCAATATTGGCAACAA
CCAGAGTATATAAAGTTTATAATGTATCCTCATTGCCTTTTTTTCCTTGAACTTCTACAAAATGCAAACTTCCGAAATGCAATGGCTCATCCTGGCAACAAGGAA
TTGGCACACAGGCAACAATTCTACTTCTGGAAGAACTATAGGAACAATCGATTGAAACACATTTTACCGAGAACTCTTCCTGAACCTGCAGCTTTACCACCTCCA
GTTTCTGCTCCACCTCAGGCGCCTGTGCCAGCAACAGCCCCTGCTATGGCAGCTTCTCCTGCTGGCACAGCTGCCCTTTCACCGATGCAGTATGGTATTCCCCCT
GGATCCGGACTTGCAAAAAATGACATGAAGAGTGGAGGAATTGATCGACGAAAGAGAAAACATAGCTGCGGCGACCAAGAGCTGAACTGGAAGTATGGCCATTGT
TGCATCTGCTTGTGGCTCCAGACCTTCCTTCTCTCCCAACGAAGCCTCTTCTTCTCCACCTCCATCATTTCCTCTAAGCCCACCTTCTCCGTCAAAAGCAGTTGC
CGCTGCTTGCTTCCTCTGCAAGTTCTCTACAACCTCCTTCCGTTGAACCCCACAGCGCTTCTCGCCGCAACGTCGTCGAAATTCTCGAAGAACGAGGCCTGCTCG
AGTCCATCACCAGCGACAATCTCCGCTCCGCTTGTCTCAGCCCTCTCAAGGTCTATTGCGGCTTCGACCCCACGCCGAGAGCTTACATTTGGGGAATCTCCTGGT
CTCATCGTCCTCTCTTGGTTCCGCCGATGCGGCCACACCACAGTCGCTCTCATCGGCGGCGCCACCGTGCGAGTCGGCGACCCCTCCGGCAAGAGCCTGGAGCGG
CCCGAGCTCGATCTCCGGACCTTGGAGGATAACACGCTCGGAATTACCAATTCCATATCTAGAATTTTGAGGAATTCGGTTTCCGATTCAAGTTTTAGCCCTAAT
TTCGTGATTCTCAACAATTATGATTGGTGGAAGGATTTCCGATTGCTGGATTTCTTGAAAGGAGTGGGTCGGTTTTCGAGAGTTGGGACGATGATGCTAAGGAGA
GCGTTAGGAGGAGGTTGGAATCGGAGCAAGGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCACAATTTTAACTACGAGGAGCACGAACACGAAGTCTATGGAGGCGATATTCCCGACGACGGAGAGATGGATGCCGACGTCGACATGTCCTCTGGTAGG
GCCGACGAAGAAGGTTACGAGGCTGAGCATAGCAACGCCAACTCGAAGGACTTGGAGGACATGAAAAGGAGGCTCAAGGAGATCGAAGAGGAAGCTGGTGCCCTT
CGAGAAATGCAGGCCAAGGTTGAGAAGGAGATGGGTGCTGTTCAAGAAGATTCATCCAGTACCTCTGCAACTCAGGCTGAAAAGGAGGAAGTAGATTCTCGATCC
GTATATGTTGGTAATGTTGATTATGCATGCACTCCTGAGGAGGTCCAACAGCATTTTCAATCATGTGGAACAGTGAATAGAGTGACAATTTTGACAGACAAGTTC
GGTCACCCCAAGGGATTTGCTTATGTCGAGTTCGTGGAGGTTGATGCCGTACAGAATGCCTTGCTGTTGAATGAATCAGAGTTGCACGGTCGCCAACTAAAGGTC
TCTGCCAAAAGGACGAATGTCCCTGGCATGAAGCAATACCGAGGAAGGCGGCCTAATTCTTTTGGCTTTCGAGGTCGCCGGCCTTTCATACCTGCTGCAACTTTT
TATCCTTCGTATGGTTATGGTCCTCCCCATGTCGCCTTCCCTGTTCGTCGCGGAACGTCGTTAGCACGAGCATATTTCATTATTTTGTGTCAATTTCTTCCACAG
TATCGATTTCAGTCGCTTGCTTTGGGATGTGAATGTAGCCCTAGTTGGGGAACTTCGTTTGAAGGAGCTTTAAAGTTCGACTGGTTAGATTCGATGGCCTCTAAT
CATGGATTAAACGAAGAAGAAGTTGATAATCCATCATCGCCTACGAATGTATACAAGGATCCAGATGACGGGCGGCAGCGGTTCTTGCTTGAATTGGAATTTGTT
CAATGTCTTGCCAATCCGACTTACATTCACTATCTGGCTCAGAATCGTTACTTCGATGATGATGCTTTCATTGGTTACTTGAAGTATCTTCAATATTGGCAACAA
CCAGAGTATATAAAGTTTATAATGTATCCTCATTGCCTTTTTTTCCTTGAACTTCTACAAAATGCAAACTTCCGAAATGCAATGGCTCATCCTGGCAACAAGGAA
TTGGCACACAGGCAACAATTCTACTTCTGGAAGAACTATAGGAACAATCGATTGAAACACATTTTACCGAGAACTCTTCCTGAACCTGCAGCTTTACCACCTCCA
GTTTCTGCTCCACCTCAGGCGCCTGTGCCAGCAACAGCCCCTGCTATGGCAGCTTCTCCTGCTGGCACAGCTGCCCTTTCACCGATGCAGTATGGTATTCCCCCT
GGATCCGGACTTGCAAAAAATGACATGAAGAGTGGAGGAATTGATCGACGAAAGAGAAAACATAGCTGCGGCGACCAAGAGCTGAACTGGAAGTATGGCCATTGT
TGCATCTGCTTGTGGCTCCAGACCTTCCTTCTCTCCCAACGAAGCCTCTTCTTCTCCACCTCCATCATTTCCTCTAAGCCCACCTTCTCCGTCAAAAGCAGTTGC
CGCTGCTTGCTTCCTCTGCAAGTTCTCTACAACCTCCTTCCGTTGAACCCCACAGCGCTTCTCGCCGCAACGTCGTCGAAATTCTCGAAGAACGAGGCCTGCTCG
AGTCCATCACCAGCGACAATCTCCGCTCCGCTTGTCTCAGCCCTCTCAAGGTCTATTGCGGCTTCGACCCCACGCCGAGAGCTTACATTTGGGGAATCTCCTGGT
CTCATCGTCCTCTCTTGGTTCCGCCGATGCGGCCACACCACAGTCGCTCTCATCGGCGGCGCCACCGTGCGAGTCGGCGACCCCTCCGGCAAGAGCCTGGAGCGG
CCCGAGCTCGATCTCCGGACCTTGGAGGATAACACGCTCGGAATTACCAATTCCATATCTAGAATTTTGAGGAATTCGGTTTCCGATTCAAGTTTTAGCCCTAAT
TTCGTGATTCTCAACAATTATGATTGGTGGAAGGATTTCCGATTGCTGGATTTCTTGAAAGGAGTGGGTCGGTTTTCGAGAGTTGGGACGATGATGCTAAGGAGA
GCGTTAGGAGGAGGTTGGAATCGGAGCAAGGAATGA
Protein sequenceShow/hide protein sequence
MEHNFNYEEHEHEVYGGDIPDDGEMDADVDMSSGRADEEGYEAEHSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRS
VYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNSFGFRGRRPFIPAATF
YPSYGYGPPHVAFPVRRGTSLARAYFIILCQFLPQYRFQSLALGCECSPSWGTSFEGALKFDWLDSMASNHGLNEEEVDNPSSPTNVYKDPDDGRQRFLLELEFV
QCLANPTYIHYLAQNRYFDDDAFIGYLKYLQYWQQPEYIKFIMYPHCLFFLELLQNANFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRTLPEPAALPPP
VSAPPQAPVPATAPAMAASPAGTAALSPMQYGIPPGSGLAKNDMKSGGIDRRKRKHSCGDQELNWKYGHCCICLWLQTFLLSQRSLFFSTSIISSKPTFSVKSSC
RCLLPLQVLYNLLPLNPTALLAATSSKFSKNEACSSPSPATISAPLVSALSRSIAASTPRRELTFGESPGLIVLSWFRRCGHTTVALIGGATVRVGDPSGKSLER
PELDLRTLEDNTLGITNSISRILRNSVSDSSFSPNFVILNNYDWWKDFRLLDFLKGVGRFSRVGTMMLRRALGGGWNRSKE