; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001891 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001891
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPolyadenylate-binding protein 1-like
Genome locationChr11:1435434..1446804
RNA-Seq ExpressionHG10001891
SyntenyHG10001891
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016592 - mediator complex (cellular component)
GO:0003712 - transcription coregulator activity (molecular function)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR008831 - Mediator complex, subunit Med31
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily
IPR038089 - Mediator of RNA polymerase II, subunit Med31 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF9668146.1 hypothetical protein SADUNF_Sadunf15G0098100 [Salix dunnii]2.6e-13764.69Show/hide
Query:  MDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHF
        MDAD+DMSS RA+E+ Y  EP++         KRRLKEIEEEAGALREMQAKVEKEMGA+ +           K        ++  VDY+CTPEEVQQHF
Subjt:  MDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHF

Query:  QSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAAPYPSYGYGSVLNVAIV
        QSCGTVNRVTILTDKFG PKGFAYVEF+EVDAVQNAL+LNESELHGRQLKVSAKRTNVPGMKQ+RGRR +P+GFR +R FMPA  Y  YGYGS +     
Subjt:  QSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAAPYPSYGYGSVLNVAIV

Query:  ESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAF
           E Y       +  +L   +++           + T  ++    E   + PSSP  +YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAF
Subjt:  ESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAF

Query:  IGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAA
        IGYLKYL YWQRPEY+KFIMYPHCL+FLELLQN+NFRNAMAHP NKELAHRQQF+FWKNYRNNRLKHILPRPLPEPAA PP  +  P  PVPA    M A
Subjt:  IGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAA

Query:  SPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE
        +    P  SPM YG+PPGS   K+D++S+G +RRKRK E
Subjt:  SPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE

RXH94803.1 hypothetical protein DVH24_024487 [Malus domestica]8.7e-16268.74Show/hide
Query:  DEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVY
        ++HEHEVYG +IP+D   D D+   +  AD+E Y+ +P   NSK+LEDMK+RLKEIEEEAGALR+MQAKVEKEMGAVQ D+SS SA+QAEKEEVDSRS+Y
Subjt:  DEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVY

Query:  VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPA
        VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQNALLLNESELHGRQLKVSAKRTNVPG+KQYRGRRPNPF FR RRPF+PA
Subjt:  VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPA

Query:  AP-YPSYGYGSVLNVAIVESFERYHHIAPLR--ILLLLNFGFNLEFD-------CTILDFGWSDTMASNH--------GLNEEAADNP------SSPTNV
        AP YPS+GYG V          R+     LR  I LLL     +E +           D   +D   ++H        G  +    +P         +++
Subjt:  AP-YPSYGYGSVLNVAIVESFERYHHIAPLR--ILLLLNFGFNLEFD-------CTILDFGWSDTMASNH--------GLNEEAADNP------SSPTNV

Query:  YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKN
        YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAFIGYLKYLQYWQRPEY +FIMYPHCLFFLE LQN+NFR AMAHPGNKELAHRQQFYFWKN
Subjt:  YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKN

Query:  YRNNRLKHILPRPLPEPA-ALPPPVSAPPQAPVP--APAPAMAASPAGT-PALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKH
        YRNNRLKHILPRPLPEP  ALPPP  APPQ  VP   P PA   S   T PA SPMQY +PPGS  PK + ++ G+DRRKRKH
Subjt:  YRNNRLKHILPRPLPEPA-ALPPPVSAPPQAPVP--APAPAMAASPAGT-PALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKH

TKR68723.1 polyadenylate-binding protein 1-like [Populus alba]6.4e-16569.87Show/hide
Query:  EHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYV
        EHEHEVYGG+IPD+ EMDAD+DMSS RA+E+ Y     + NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DS   SATQAEKEEVDSRS+YV
Subjt:  EHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYV

Query:  GNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAA
        GNVDY+CTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEVDA+QNALLLNESELHGRQLKVSAKRTNVPGMKQ+RGRRP+P+GFR RRPFM A 
Subjt:  GNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAA

Query:  PYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPT
         YP+YGYG V                                                             P  +YKDPDDGRQRFL+ELEFVQCLANPT
Subjt:  PYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPT

Query:  YIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVS
        YIHYLAQNRY EDEAFIGYLKYLQYWQRPEY+KFIMYPHCL+FLELLQN+NFRNAMAHPGNKELAHRQQF+FWKNYRNNRLKHILPRPLPEP   PP  +
Subjt:  YIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVS

Query:  APPQAPVPAPAPA---MAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE
         PP  P P  + A   M+A+P   P  SPM YG+PPGS   K D++S+G DRRKRK E
Subjt:  APPQAPVPAPAPA---MAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE

XP_025014092.1 polyadenylate-binding protein 2 isoform X1 [Ricinus communis]5.5e-16470.04Show/hide
Query:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAE----PSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD
        ++E EHEVYG +IPD+ EMDAD+DMSS R +EE  + E      N NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DSSS SATQAEKEEVD
Subjt:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAE----PSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD

Query:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPF-GFRGR
        SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTN+PGMKQYRGRRPNP+ GFR R
Subjt:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPF-GFRGR

Query:  RPFMPAAPYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFV
        RPFMPA   P YGYG                                                               P ++YKDPDDG+QRFLLELEFV
Subjt:  RPFMPAAPYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFV

Query:  QCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPA
        QCLANPTYIHYLAQNRY EDEAFIGYLKYLQYWQ+PEYIKFIMYPHCLFFLELLQN++FRNAMAHPGNKEL HRQQF+FWKNYRNNRLKHILPRPLPEP 
Subjt:  QCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPA

Query:  ALPP--PVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE
          PP   +  PP    P P   +A   A   ALSPM YG+PPGS L KNDM++ GIDRRKRK E
Subjt:  ALPP--PVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE

XP_025014094.1 polyadenylate-binding protein 2 isoform X2 [Ricinus communis]2.3e-16269.61Show/hide
Query:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAE----PSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD
        ++E EHEVYG +IPD+ EMDAD+DMSS R +EE  + E      N NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DSSS SATQAEKEEVD
Subjt:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAE----PSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVD

Query:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPF-GFRGR
        SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTN+PGMKQYRGRRPNP+ GFR R
Subjt:  SRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPF-GFRGR

Query:  RPFMPAAPYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFV
        RPFMPA   P YG                                                                 P ++YKDPDDG+QRFLLELEFV
Subjt:  RPFMPAAPYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFV

Query:  QCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPA
        QCLANPTYIHYLAQNRY EDEAFIGYLKYLQYWQ+PEYIKFIMYPHCLFFLELLQN++FRNAMAHPGNKEL HRQQF+FWKNYRNNRLKHILPRPLPEP 
Subjt:  QCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPA

Query:  ALPP--PVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE
          PP   +  PP    P P   +A   A   ALSPM YG+PPGS L KNDM++ GIDRRKRK E
Subjt:  ALPP--PVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE

TrEMBL top hitse value%identityAlignment
A0A498JLW5 RRM domain-containing protein4.2e-16268.74Show/hide
Query:  DEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVY
        ++HEHEVYG +IP+D   D D+   +  AD+E Y+ +P   NSK+LEDMK+RLKEIEEEAGALR+MQAKVEKEMGAVQ D+SS SA+QAEKEEVDSRS+Y
Subjt:  DEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVY

Query:  VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPA
        VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQNALLLNESELHGRQLKVSAKRTNVPG+KQYRGRRPNPF FR RRPF+PA
Subjt:  VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPA

Query:  AP-YPSYGYGSVLNVAIVESFERYHHIAPLR--ILLLLNFGFNLEFD-------CTILDFGWSDTMASNH--------GLNEEAADNP------SSPTNV
        AP YPS+GYG V          R+     LR  I LLL     +E +           D   +D   ++H        G  +    +P         +++
Subjt:  AP-YPSYGYGSVLNVAIVESFERYHHIAPLR--ILLLLNFGFNLEFD-------CTILDFGWSDTMASNH--------GLNEEAADNP------SSPTNV

Query:  YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKN
        YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAFIGYLKYLQYWQRPEY +FIMYPHCLFFLE LQN+NFR AMAHPGNKELAHRQQFYFWKN
Subjt:  YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKN

Query:  YRNNRLKHILPRPLPEPA-ALPPPVSAPPQAPVP--APAPAMAASPAGT-PALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKH
        YRNNRLKHILPRPLPEP  ALPPP  APPQ  VP   P PA   S   T PA SPMQY +PPGS  PK + ++ G+DRRKRKH
Subjt:  YRNNRLKHILPRPLPEPA-ALPPPVSAPPQAPVP--APAPAMAASPAGT-PALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKH

A0A4U5MIM5 Polyadenylate-binding protein 1-like3.1e-16569.87Show/hide
Query:  EHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYV
        EHEHEVYGG+IPD+ EMDAD+DMSS RA+E+ Y     + NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DS   SATQAEKEEVDSRS+YV
Subjt:  EHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYV

Query:  GNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAA
        GNVDY+CTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEVDA+QNALLLNESELHGRQLKVSAKRTNVPGMKQ+RGRRP+P+GFR RRPFM A 
Subjt:  GNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAA

Query:  PYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPT
         YP+YGYG V                                                             P  +YKDPDDGRQRFL+ELEFVQCLANPT
Subjt:  PYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPT

Query:  YIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVS
        YIHYLAQNRY EDEAFIGYLKYLQYWQRPEY+KFIMYPHCL+FLELLQN+NFRNAMAHPGNKELAHRQQF+FWKNYRNNRLKHILPRPLPEP   PP  +
Subjt:  YIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVS

Query:  APPQAPVPAPAPA---MAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE
         PP  P P  + A   M+A+P   P  SPM YG+PPGS   K D++S+G DRRKRK E
Subjt:  APPQAPVPAPAPA---MAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE

A0A6A6MFI3 Mediator of RNA polymerase II transcription subunit 318.3e-11869.91Show/hide
Query:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSV
        ++E EH+VYG +IPD+ E+DAD+DMSS R +E+    E    NSKDLEDMK+RLKEIEEEAGALREMQAKVEKEMGAVQ DSSS SATQAEKEEVDSRS+
Subjt:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSV

Query:  YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMP
        YVGNVDY CTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVE DAVQNALLLNESELHGRQLKVSAKRTN+PGMKQYRGRRPNP+GFR RRPFMP
Subjt:  YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMP

Query:  AAP-YPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLA
        A P YP YGYG                                                               P ++YKDPDDGRQRFLLELEFVQCLA
Subjt:  AAP-YPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLA

Query:  NPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIM
        NPTYIHYLAQNRY EDEAFIGYLKYLQYWQRPEYIKFIM
Subjt:  NPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIM

A0A6N2MJL7 RRM domain-containing protein6.8e-14464.71Show/hide
Query:  EHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYV
        EHEHEVYGG+IPD+ EMDAD+DMSS RA+E+ Y     + NSKDLEDMK+RLKEIEEEA ALREMQAKVEKEMGAVQ DS   SATQAEKEEVDSRS+YV
Subjt:  EHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYV

Query:  GN--VDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMP
        GN  VDY+CTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVE DA+QNALLLNESELHGRQLKVSAKRTNVPGMK +RGRRP+ +GFR RRPFM 
Subjt:  GN--VDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMP

Query:  AAPYPSYGYGSVLN-VAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLA
          PYP+YGYG  LN ++  E F+                             G  D   + H  +E+  +  +  T+       G+    L+L       
Subjt:  AAPYPSYGYGSVLN-VAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLA

Query:  NPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPP
              YLAQNRY +DEAFIGYLKYLQYWQRPEY+KFIMYPHCL+FLELLQN NFRNAMAHPGNKELAHRQQF+FWKNYRNNRLKHILPRPLPEP   PP
Subjt:  NPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPP

Query:  PVSAPPQAPVPAPAPAMAASPAGTPAL-SPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE
          + PPQ P    + A    PA + A+ SPM YG+PPGS   K+D++S+G DRRKRK E
Subjt:  PVSAPPQAPVPAPAPAMAASPAGTPAL-SPMQYGIPPGSGLPKNDMKSAGIDRRKRKHE

A0A803P1V0 Uncharacterized protein2.0e-14870.85Show/hide
Query:  EPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHP
        E      ++LEDMK+RLKE+EEEAGALREMQAKVEKEMGAVQ DSSSTSATQAEKEEVDSRS+YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG P
Subjt:  EPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHP

Query:  KGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAAP-YPSYGYGSVLNVAIVESFERYHHIAPLRILLLL
        KGFAYVEFVEV+AVQNALLLNESELHGRQLKVSAKRTNVPGMKQ+RGRR  P  FR RRPF PA+P YPSYGYG       V  F R             
Subjt:  KGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAAP-YPSYGYGSVLNVAIVESFERYHHIAPLRILLLL

Query:  NFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKF
                                          P    ++YKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRY EDEAFIGYLKYLQYWQ+PEY+KF
Subjt:  NFGFNLEFDCTILDFGWSDTMASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKF

Query:  IMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQ---APVPAPAPAMAASPAGTPALSPMQYGI
        IMYPHCL+FLELLQN+NFRNAMAHPG+KELAHRQQFYFWKNYRNNRLKHILPR LPEP   P P   PPQ    PV A    MAA+P   P LSPMQY  
Subjt:  IMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQ---APVPAPAPAMAASPAGTPALSPMQYGI

Query:  PPGSGLPKNDMKSAGIDRRKRK
        P GS LPK DM++ G DRRKRK
Subjt:  PPGSGLPKNDMKSAGIDRRKRK

SwissProt top hitse value%identityAlignment
O14327 Polyadenylate-binding protein 21.3e-3554.66Show/hide
Query:  DAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKF-
        D +  +   K+L +MK R+ E+E EA  LR MQ +++ E          T A + +KE +D++SVYVGNVDY+ TPEE+Q HF SCG+VNRVTIL DKF 
Subjt:  DAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKF-

Query:  GHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGR
        GHPKGFAY+EF E   V NALLLN S LH R LKV+ KRTNVPGM + RGR       RGR
Subjt:  GHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGR

Q8VYB1 Mediator of RNA polymerase II transcription subunit 313.5e-7373.27Show/hide
Query:  MASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRN
        MAS   + ++A++ PS P N YKDPD GRQRFLLELEF+QCLANPTYIHYLAQNRY EDEAFIGYLKYLQYWQRPEYIKFIMYPHCL+FLELLQN NFR 
Subjt:  MASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRN

Query:  AMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKND---MKSAGIDRRK
        AMAHP NKELAHRQQFY+WKNYRNNRLKHILPRPLPEP  +PP    PP AP  +  PA +A+ A +PALSPMQY     + L KND   M + GIDRRK
Subjt:  AMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKND---MKSAGIDRRK

Query:  RK
        RK
Subjt:  RK

Q93VI4 Polyadenylate-binding protein 11.3e-7270.78Show/hide
Query:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDA-----EPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV
        +DE EHEVYGG+IP++ E + D +       EEG  A     EP  ++S+DLEDMK+R+KEIEEEAGALREMQAK EK+MGA Q+ S   SA  AEKEEV
Subjt:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDA-----EPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV

Query:  DSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGR-RPNPFGFRG
        DSRS+YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQN+L+LNESELHGRQ+KVSAKRTNVPGM+Q+RGR RP    FR 
Subjt:  DSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGR-RPNPFGFRG

Query:  RRPFMPAAP-YPSYGYGSV
         R FMP  P YP Y YG V
Subjt:  RRPFMPAAP-YPSYGYGSV

Q9FJN9 Polyadenylate-binding protein 21.8e-6969.05Show/hide
Query:  DEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVY
        +E EHEVYGG+IPD  EMD D++  +   D    D +      K+L++MK+RLKE+E+EA ALREMQAKVEKEMGA  +D +S +A QA KEEVD+RSV+
Subjt:  DEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVY

Query:  VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPF-GFRGRRPFMP
        VGNVDYACTPEEVQQHFQ+CGTV+RVTILTDKFG PKGFAYVEFVEV+AVQ AL LNESELHGRQLKV  KRTNVPG+KQ+RGRR NP+ G+R RRPFM 
Subjt:  VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPF-GFRGRRPFMP

Query:  AAPYPSYGYG
           Y  YGYG
Subjt:  AAPYPSYGYG

Q9LX90 Polyadenylate-binding protein 31.4e-6669.01Show/hide
Query:  DEHEHEVYGGDIPDDAEMDA---DLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSR
        +E EHEVYGG+IP+  + D    D+DMS+  ADE+            +L +MKRRLKE+EEEA ALREMQAKVEKEMGA Q D +S +A Q  KEEVD+R
Subjt:  DEHEHEVYGGDIPDDAEMDA---DLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSR

Query:  SVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNP-FGFRGRRP
        SVYVGNVDYACTPEEVQ HFQ+CGTVNRVTIL DKFG PKGFAYVEFVEV+AVQ AL LNESELHGRQLKVS KRTNVPGMKQY   R NP  G+R RRP
Subjt:  SVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNP-FGFRGRRP

Query:  FMPAAPYPSYGYG
        F+P   Y  YGYG
Subjt:  FMPAAPYPSYGYG

Arabidopsis top hitse value%identityAlignment
AT5G19910.1 SOH1 family protein2.5e-7473.27Show/hide
Query:  MASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRN
        MAS   + ++A++ PS P N YKDPD GRQRFLLELEF+QCLANPTYIHYLAQNRY EDEAFIGYLKYLQYWQRPEYIKFIMYPHCL+FLELLQN NFR 
Subjt:  MASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRN

Query:  AMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKND---MKSAGIDRRK
        AMAHP NKELAHRQQFY+WKNYRNNRLKHILPRPLPEP  +PP    PP AP  +  PA +A+ A +PALSPMQY     + L KND   M + GIDRRK
Subjt:  AMAHPGNKELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKND---MKSAGIDRRK

Query:  RK
        RK
Subjt:  RK

AT5G19910.2 SOH1 family protein1.4e-6963.79Show/hide
Query:  MASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRN
        MAS   + ++A++ PS P N YKDPD GRQRFLLELEF+QCLANPTYIHYLAQNRY EDEAFIGYLKYLQYWQRPEYIKFIMYPHCL+FLELLQN NFR 
Subjt:  MASNHGLNEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRN

Query:  AMAHPGNK------------------------------ELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAASPAGTPAL
        AMAHP NK                              ELAHRQQFY+WKNYRNNRLKHILPRPLPEP  +PP    PP AP  +  PA +A+ A +PAL
Subjt:  AMAHPGNK------------------------------ELAHRQQFYFWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAASPAGTPAL

Query:  SPMQYGIPPGSGLPKND---MKSAGIDRRKRK
        SPMQY     + L KND   M + GIDRRKRK
Subjt:  SPMQYGIPPGSGLPKND---MKSAGIDRRKRK

AT5G51120.1 polyadenylate-binding protein 19.5e-7470.78Show/hide
Query:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDA-----EPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV
        +DE EHEVYGG+IP++ E + D +       EEG  A     EP  ++S+DLEDMK+R+KEIEEEAGALREMQAK EK+MGA Q+ S   SA  AEKEEV
Subjt:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDA-----EPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEV

Query:  DSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGR-RPNPFGFRG
        DSRS+YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQN+L+LNESELHGRQ+KVSAKRTNVPGM+Q+RGR RP    FR 
Subjt:  DSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGR-RPNPFGFRG

Query:  RRPFMPAAP-YPSYGYGSV
         R FMP  P YP Y YG V
Subjt:  RRPFMPAAP-YPSYGYGSV

AT5G51120.2 polyadenylate-binding protein 12.7e-6859.61Show/hide
Query:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDA-----EPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQ----------------
        +DE EHEVYGG+IP++ E + D +       EEG  A     EP  ++S+DLEDMK+R+KEIEEEAGALREMQAK EK+MGA Q                
Subjt:  YDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDA-----EPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQ----------------

Query:  --------------------EDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNE
                            ++  ++  + AEKEEVDSRS+YVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFG PKGFAYVEFVEV+AVQN+L+LNE
Subjt:  --------------------EDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNE

Query:  SELHGRQLKVSAKRTNVPGMKQYRGR-RPNPFGFRGRRPFMPAAP-YPSYGYGSV
        SELHGRQ+KVSAKRTNVPGM+Q+RGR RP    FR  R FMP  P YP Y YG V
Subjt:  SELHGRQLKVSAKRTNVPGMKQYRGR-RPNPFGFRGRRPFMPAAP-YPSYGYGSV

AT5G65260.1 RNA-binding (RRM/RBD/RNP motifs) family protein1.3e-7069.05Show/hide
Query:  DEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVY
        +E EHEVYGG+IPD  EMD D++  +   D    D +      K+L++MK+RLKE+E+EA ALREMQAKVEKEMGA  +D +S +A QA KEEVD+RSV+
Subjt:  DEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNANSKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVY

Query:  VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPF-GFRGRRPFMP
        VGNVDYACTPEEVQQHFQ+CGTV+RVTILTDKFG PKGFAYVEFVEV+AVQ AL LNESELHGRQLKV  KRTNVPG+KQ+RGRR NP+ G+R RRPFM 
Subjt:  VGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQNALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPF-GFRGRRPFMP

Query:  AAPYPSYGYG
           Y  YGYG
Subjt:  AAPYPSYGYG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATTCCCCATCCTCATCCTTATTTACCTCACGGGGAATTTTGTCTCACCACTTCCTCCTCATTCCTCGTCCAGATAGGGATTCTCCACCAGCCCTGTCTCCATGGGG
AAAACGAACATATACAAAATACACATACCAACCAGCGATCGAGGGGCCACCGGAGACGGAGACGGAGACGGAGATGGAGCGCAATTTTAACTACGACGAGCACGAACACG
AAGTCTACGGTGGCGATATTCCCGACGACGCCGAGATGGATGCCGACCTCGACATGTCCTCTGGTAGGGCCGACGAAGAAGGCTACGACGCCGAGCCTAGTAACGCCAAC
TCCAAGGACTTGGAGGACATGAAGAGGAGGCTTAAGGAGATCGAAGAGGAGGCCGGTGCCCTTCGGGAAATGCAGGCCAAAGTTGAGAAGGAGATGGGTGCTGTTCAAGA
AGATTCATCCAGTACCTCTGCAACTCAGGCTGAAAAGGAGGAAGTAGATTCTCGATCGGTATATGTTGGTAATGTCGACTATGCATGCACTCCTGAAGAGGTCCAACAGC
ATTTTCAATCATGTGGAACAGTGAATAGAGTGACAATTTTGACGGACAAATTTGGTCACCCCAAGGGATTTGCTTATGTTGAGTTCGTGGAGGTTGATGCTGTCCAGAAT
GCTTTGCTGTTAAATGAGTCGGAGTTGCATGGTCGTCAACTAAAGGTCTCTGCTAAAAGGACGAACGTGCCTGGCATGAAGCAATACCGAGGAAGGAGGCCAAACCCTTT
TGGCTTTCGAGGTCGCCGGCCTTTCATGCCTGCTGCACCTTATCCTTCGTATGGTTATGGCTCTGTGTTGAATGTAGCCATTGTTGAGAGCTTCGAAAGATATCACCATA
TCGCTCCACTACGAATCTTGTTGCTTCTTAATTTTGGGTTCAATTTGGAATTCGATTGCACGATATTAGACTTCGGCTGGTCAGATACAATGGCTTCTAATCATGGATTA
AACGAAGAAGCAGCTGATAACCCGTCATCGCCTACGAATGTTTACAAAGATCCTGATGACGGACGGCAGCGGTTCTTGCTCGAATTGGAATTTGTTCAATGTCTTGCCAA
TCCAACCTACATTCATTATCTGGCTCAGAATCGTTACCTCGAGGATGAAGCTTTTATTGGTTACTTGAAGTACCTTCAATATTGGCAACGGCCAGAGTATATTAAGTTTA
TAATGTACCCTCATTGTCTTTTTTTCCTTGAACTTCTACAAAATTCAAACTTCCGAAATGCAATGGCTCATCCTGGCAACAAGGAATTGGCACACAGGCAACAATTTTAC
TTCTGGAAGAACTATAGGAACAATCGTTTGAAACACATTTTACCGCGACCTCTTCCCGAACCTGCAGCATTACCACCCCCAGTTTCTGCTCCACCTCAAGCACCTGTGCC
GGCCCCAGCCCCCGCTATGGCAGCTTCACCTGCTGGTACACCTGCCCTTTCTCCGATGCAGTATGGTATTCCTCCTGGTTCTGGACTTCCAAAGAACGACATGAAGAGTG
CAGGAATTGATCGACGAAAGAGAAAACACGAAAGAAGTATGACGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATTCCCCATCCTCATCCTTATTTACCTCACGGGGAATTTTGTCTCACCACTTCCTCCTCATTCCTCGTCCAGATAGGGATTCTCCACCAGCCCTGTCTCCATGGGG
AAAACGAACATATACAAAATACACATACCAACCAGCGATCGAGGGGCCACCGGAGACGGAGACGGAGACGGAGATGGAGCGCAATTTTAACTACGACGAGCACGAACACG
AAGTCTACGGTGGCGATATTCCCGACGACGCCGAGATGGATGCCGACCTCGACATGTCCTCTGGTAGGGCCGACGAAGAAGGCTACGACGCCGAGCCTAGTAACGCCAAC
TCCAAGGACTTGGAGGACATGAAGAGGAGGCTTAAGGAGATCGAAGAGGAGGCCGGTGCCCTTCGGGAAATGCAGGCCAAAGTTGAGAAGGAGATGGGTGCTGTTCAAGA
AGATTCATCCAGTACCTCTGCAACTCAGGCTGAAAAGGAGGAAGTAGATTCTCGATCGGTATATGTTGGTAATGTCGACTATGCATGCACTCCTGAAGAGGTCCAACAGC
ATTTTCAATCATGTGGAACAGTGAATAGAGTGACAATTTTGACGGACAAATTTGGTCACCCCAAGGGATTTGCTTATGTTGAGTTCGTGGAGGTTGATGCTGTCCAGAAT
GCTTTGCTGTTAAATGAGTCGGAGTTGCATGGTCGTCAACTAAAGGTCTCTGCTAAAAGGACGAACGTGCCTGGCATGAAGCAATACCGAGGAAGGAGGCCAAACCCTTT
TGGCTTTCGAGGTCGCCGGCCTTTCATGCCTGCTGCACCTTATCCTTCGTATGGTTATGGCTCTGTGTTGAATGTAGCCATTGTTGAGAGCTTCGAAAGATATCACCATA
TCGCTCCACTACGAATCTTGTTGCTTCTTAATTTTGGGTTCAATTTGGAATTCGATTGCACGATATTAGACTTCGGCTGGTCAGATACAATGGCTTCTAATCATGGATTA
AACGAAGAAGCAGCTGATAACCCGTCATCGCCTACGAATGTTTACAAAGATCCTGATGACGGACGGCAGCGGTTCTTGCTCGAATTGGAATTTGTTCAATGTCTTGCCAA
TCCAACCTACATTCATTATCTGGCTCAGAATCGTTACCTCGAGGATGAAGCTTTTATTGGTTACTTGAAGTACCTTCAATATTGGCAACGGCCAGAGTATATTAAGTTTA
TAATGTACCCTCATTGTCTTTTTTTCCTTGAACTTCTACAAAATTCAAACTTCCGAAATGCAATGGCTCATCCTGGCAACAAGGAATTGGCACACAGGCAACAATTTTAC
TTCTGGAAGAACTATAGGAACAATCGTTTGAAACACATTTTACCGCGACCTCTTCCCGAACCTGCAGCATTACCACCCCCAGTTTCTGCTCCACCTCAAGCACCTGTGCC
GGCCCCAGCCCCCGCTATGGCAGCTTCACCTGCTGGTACACCTGCCCTTTCTCCGATGCAGTATGGTATTCCTCCTGGTTCTGGACTTCCAAAGAACGACATGAAGAGTG
CAGGAATTGATCGACGAAAGAGAAAACACGAAAGAAGTATGACGTAG
Protein sequenceShow/hide protein sequence
MHSPSSSLFTSRGILSHHFLLIPRPDRDSPPALSPWGKRTYTKYTYQPAIEGPPETETETEMERNFNYDEHEHEVYGGDIPDDAEMDADLDMSSGRADEEGYDAEPSNAN
SKDLEDMKRRLKEIEEEAGALREMQAKVEKEMGAVQEDSSSTSATQAEKEEVDSRSVYVGNVDYACTPEEVQQHFQSCGTVNRVTILTDKFGHPKGFAYVEFVEVDAVQN
ALLLNESELHGRQLKVSAKRTNVPGMKQYRGRRPNPFGFRGRRPFMPAAPYPSYGYGSVLNVAIVESFERYHHIAPLRILLLLNFGFNLEFDCTILDFGWSDTMASNHGL
NEEAADNPSSPTNVYKDPDDGRQRFLLELEFVQCLANPTYIHYLAQNRYLEDEAFIGYLKYLQYWQRPEYIKFIMYPHCLFFLELLQNSNFRNAMAHPGNKELAHRQQFY
FWKNYRNNRLKHILPRPLPEPAALPPPVSAPPQAPVPAPAPAMAASPAGTPALSPMQYGIPPGSGLPKNDMKSAGIDRRKRKHERSMT