CuGenDBv2

CmoCh09G002050 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh09G002050
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: polyadenylate-binding protein RBP47C-like
Genome location: Cmo_Chr09:942757..954981
RNA-Seq Expression: CmoCh09G002050
Synteny: CmoCh09G002050
Gene Ontology terms: GO:0003723 - RNA binding (molecular function)
InterPro domains:
    IPR000504 - RNA recognition motif domain
    IPR006869 - Domain of unknown function DUF547
    IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
    IPR025757 - Ternary complex factor MIP1, leucine-zipper
    IPR035979 - RNA-binding domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAG6591362.1 Polyadenylate-binding protein RBP47C, partial [Cucurbita argyrosperma subsp. sororia] | 0.0e+00 | 78.04
Query:  MQSNGSDSQPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLH
        MQSNGSDSQPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLH
Subjt:  MQSNGSDSQPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLH

Query:  NCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDNDPDLSIFVGDLAADVTDALLYDTFSSKF
        NCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSD+DPDLSIFVGDLAADVTDALLYDTFSSKF
Subjt:  NCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDNDPDLSIFVGDLAADVTDALLYDTFSSKF

Query:  PSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICCPL
        PSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQ                                
Subjt:  PSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICCPL

Query:  FIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNAT-----------------
           GGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNAT                 
Subjt:  FIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNAT-----------------

Query:  --FRDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQDRLLLKMDLWVGWYPTLGFKIITGFVADLLNKPLFIAAAAA--KDNDKG
          FRDFGNQWNGAYYGGHIYDGYGYGLPPHDPS                                    FKIIT FVADLLNK LFIAAAAA  KDNDKG
Subjt:  --FRDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQDRLLLKMDLWVGWYPTLGFKIITGFVADLLNKPLFIAAAAA--KDNDKG

Query:  RKM-------------------------------------------------------------------------------------------------
        RKM                                                                                                 
Subjt:  RKM-------------------------------------------------------------------------------------------------

Query:  -------SFEEMDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLG
               SFEEMDRKGRTRVQSMRASANHE        KGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLG
Subjt:  -------SFEEMDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLG

Query:  ALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQK
        ALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQK
Subjt:  ALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQK

Query:  THTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEET
        THTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEET
Subjt:  THTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEET

Query:  DHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK
        DHQDPY ICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK
Subjt:  DHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK

Query:  VQIIID-----------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFG
          I +                  Y S   +  S                 +PLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFG
Subjt:  VQIIID-----------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFG

Query:  IPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
        IPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERR+NQPLSQFVKVIPYEFSFRYLLCT
Subjt:  IPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT

KAG7024238.1 hypothetical protein SDJN02_13052 [Cucurbita argyrosperma subsp. argyrosperma] | 5.4e-261 | 89.19
Query:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
        MDRKGRTRVQSMRASANHE        KGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
Subjt:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP

Query:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
        NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
Subjt:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS

Query:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
        PAR+KSSAKPNSPKLNFENRETSPENAEARQLR PDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPY ICSE
Subjt:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE

Query:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----
        FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMIN+FLQLGIPESPEMVAGLMQK  I +      
Subjt:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----

Query:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
                    Y S   +  S                 +PLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
Subjt:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD

Query:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
        FGKDLDSLVDWVCLQLPSELGKEAIKLIERR+NQPLSQFVKVIPYEFSFRYLLCT
Subjt:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT

XP_022936168.1 uncharacterized protein LOC111442849 isoform X1 [Cucurbita moschata] | 2.6e-263 | 90.09
Query:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
        MDRKGRTRVQSMRASANHE        KGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
Subjt:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP

Query:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
        NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
Subjt:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS

Query:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
        PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
Subjt:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE

Query:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----
        FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK  I +      
Subjt:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----

Query:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
                    Y S   +  S                 +PLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
Subjt:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD

Query:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
        FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
Subjt:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT

XP_022975185.1 uncharacterized protein LOC111474227 isoform X1 [Cucurbita maxima] | 3.1e-256 | 87.57
Query:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
        MDRKGRTR+QSMRASANHE        KGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
Subjt:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP

Query:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
        NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNME SLKNNSKQ+QSE SVQKTDNGLGKENESRMNSTSNH+VSSLQKTHTIKTP+KKS
Subjt:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS

Query:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
        PARYKSS KPNSPKLNFENRETSPENAEAR+LRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRG TESLHLFSMLTT QTEETDHQDPY ICSE
Subjt:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE

Query:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----
        FGMRDIGPYKNV IVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK  I +      
Subjt:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----

Query:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
                    Y S   +  S                 +PL+TFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
Subjt:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD

Query:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
        FGKDLDSLVDWVCLQLPSELGKEAIKLIERR+NQPLSQFVKVIPYEFSFRYLLCT
Subjt:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT

XP_023535391.1 uncharacterized protein LOC111796844 isoform X1 [Cucurbita pepo subsp. pepo] | 3.9e-259 | 89.01
Query:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
        MDRKGRTRVQSMRASANHE        KGSVDMPAANLPDAAKATTS RVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
Subjt:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP

Query:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
        NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSE SVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
Subjt:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS

Query:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
        PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLS ILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPY ICSE
Subjt:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE

Query:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----
        FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK  I +      
Subjt:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----

Query:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
                    Y S   +  S                 +PLVTFALSCGSWSSPAVRVYTASQVENQLELAKR+YLQAAVGIASEKFGIPKLLDWYLLD
Subjt:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD

Query:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
        FGKDLDSLVDWVCLQLPSELGKEAIKLIERR+NQPLSQFVKVIPYEFSFRYLLCT
Subjt:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L2A6 Uncharacterized protein | 3.3e-232 | 80.72
Query:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
        MDRKGRTR+QSMRASANHE        KG+VDMP AN  DAAKA+TSGRV+SR+RKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
Subjt:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP

Query:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
        NMLELLAEVAVLEEEVVRLEEQVV FRQDLYQEAVNISSSKK MELS KNNSKQ QS+ SVQKTDN +GKENESRMNSTSN+K SS++K HTIKTPVKK 
Subjt:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS

Query:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
        P R KSS KPNSPKLN ENR  +PENAEARQLRAPD+KVSGDDSPN+ISENILKCLSSILLRMSSIKNRGATESLHLFSM+TTMQTEETD  DPY ICSE
Subjt:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE

Query:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----
        FG RDIGPYKNV  VEA SINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWIN+YNSCMINAFL+ GIPESPEMV  LMQK  I +      
Subjt:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----

Query:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
                    Y S   +  S                 +PLVTFALSCGSWSSPAVRVYTASQVEN+LELAKR+YL+AAVGI+SEKFGIPKLLDWYLLD
Subjt:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD

Query:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
        F KDLDSLVDWVCLQLPSELGKEAIKL+E R+NQPLSQFVKVIPYEFSFRYLLCT
Subjt:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT

A0A6J1F6S3 uncharacterized protein LOC111442849 isoform X2 | 1.7e-252 | 91.01
Query:  MPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQ
        MPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQ
Subjt:  MPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQ

Query:  EAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQL
        EAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQL
Subjt:  EAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQL

Query:  RAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLF
        RAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLF
Subjt:  RAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLF

Query:  QRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----------------YPSALLYIASF---------
        QRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK  I +                  Y S   +  S          
Subjt:  QRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----------------YPSALLYIASF---------

Query:  -----FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQ
               +PLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQ
Subjt:  -----FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQ

Query:  NQPLSQFVKVIPYEFSFRYLLCT
        NQPLSQFVKVIPYEFSFRYLLCT
Subjt:  NQPLSQFVKVIPYEFSFRYLLCT

A0A6J1F7J5 uncharacterized protein LOC111442849 isoform X1 | 1.2e-263 | 90.09
Query:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
        MDRKGRTRVQSMRASANHE        KGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
Subjt:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP

Query:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
        NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
Subjt:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS

Query:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
        PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
Subjt:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE

Query:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----
        FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK  I +      
Subjt:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----

Query:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
                    Y S   +  S                 +PLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
Subjt:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD

Query:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
        FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
Subjt:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT

A0A6J1IIJ2 uncharacterized protein LOC111474227 isoform X1 | 1.5e-256 | 87.57
Query:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
        MDRKGRTR+QSMRASANHE        KGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP
Subjt:  MDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPP

Query:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS
        NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNME SLKNNSKQ+QSE SVQKTDNGLGKENESRMNSTSNH+VSSLQKTHTIKTP+KKS
Subjt:  NMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKS

Query:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE
        PARYKSS KPNSPKLNFENRETSPENAEAR+LRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRG TESLHLFSMLTT QTEETDHQDPY ICSE
Subjt:  PARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSE

Query:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----
        FGMRDIGPYKNV IVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK  I +      
Subjt:  FGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----

Query:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
                    Y S   +  S                 +PL+TFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD
Subjt:  ------------YPSALLYIASF--------------FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLD

Query:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT
        FGKDLDSLVDWVCLQLPSELGKEAIKLIERR+NQPLSQFVKVIPYEFSFRYLLCT
Subjt:  FGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQPLSQFVKVIPYEFSFRYLLCT

A0A6J1IJQ1 uncharacterized protein LOC111474227 isoform X2 | 9.0e-246 | 88.53
Query:  MPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQ
        MPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQ
Subjt:  MPAANLPDAAKATTSGRVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQ

Query:  EAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQL
        EAVNISSSKKNME SLKNNSKQ+QSE SVQKTDNGLGKENESRMNSTSNH+VSSLQKTHTIKTP+KKSPARYKSS KPNSPKLNFENRETSPENAEAR+L
Subjt:  EAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQL

Query:  RAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLF
        RAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRG TESLHLFSMLTT QTEETDHQDPY ICSEFGMRDIGPYKNV IVEASSINTKRTTNSLFLF
Subjt:  RAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLF

Query:  QRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----------------YPSALLYIASF---------
        QRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQK  I +                  Y S   +  S          
Subjt:  QRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIIID-----------------YPSALLYIASF---------

Query:  -----FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQ
               +PL+TFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERR+
Subjt:  -----FLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQ

Query:  NQPLSQFVKVIPYEFSFRYLLCT
        NQPLSQFVKVIPYEFSFRYLLCT
Subjt:  NQPLSQFVKVIPYEFSFRYLLCT

SwissProt top hits | e value | % identity | Alignment
F4I3B3 Polyadenylate-binding protein RBP47A | 2.4e-110 | 52.13
Query:  QEPNQRHQQPQSWMS--MQYPPPAMVM--PHHMMTPPHYMAPPL--------PPPPYMHYHPQYQHHHLPIQHSQPLQGSGGEN-KTIWVGDLHHWMDES
        Q+  Q+ QQ Q WM+   QYP  AM M     MM  PH    P         P   Y  Y  Q Q HH   Q  QP  GSGG++ KT+WVGDL HWMDE+
Subjt:  QEPNQRHQQPQSWMS--MQYPPPAMVM--PHHMMTPPHYMAPPL--------PPPPYMHYHPQYQHHHLPIQHSQPLQGSGGEN-KTIWVGDLHHWMDES

Query:  YLHNCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKR-SDNDPDLSIFVGDLAADVTDALLYDTF
        YLH CF+   E++S+K+IRNKQ+  SEGYGFVEF + + AE+ALQ+++GV MP+ EQPFRLNWA+FSTG+KR S+N PDLSIFVGDLA DV+DA+L +TF
Subjt:  YLHNCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKR-SDNDPDLSIFVGDLAADVTDALLYDTF

Query:  SSKFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCI
        + ++PSVK AKVVID+NTGRSKGYGFVRFGD+NERS+AM+EMNG +CSSR MR+G ATP++++ Y QQN SQ                            
Subjt:  SSKFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCI

Query:  CCPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF------------
           L + GG+  NG       S+G+S N+TIFVGGLD +VT+EDL QPFS +GE+VSVKIPVGKGCGFVQFANR  AEEA+  LN T             
Subjt:  CCPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF------------

Query:  -------RDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGA
                D GNQWNG Y  G    GY  G    D +MY  A  A
Subjt:  -------RDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGA

Q0WW84 Polyadenylate-binding protein RBP47B | 3.8e-116 | 51.95
Query:  PQEPNQRHQQPQSWM-SMQYPPPA---MVMPHHMMTPPHYMAPPLPPPPYMHYHPQ------YQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYL
        PQ+  Q+ QQ Q WM +MQYPP A   M+    M+  PH    P    PY  +HPQ      YQ H    QH    +GSG + KT+WVGDL HWMDE+YL
Subjt:  PQEPNQRHQQPQSWM-SMQYPPPA---MVMPHHMMTPPHYMAPPLPPPPYMHYHPQ------YQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYL

Query:  HNCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRS-DNDPDLSIFVGDLAADVTDALLYDTFSS
        H+CF+  GE++S+K+IRNK +  SEGYGFVEF + A AE+ LQNY+G +MP+++QPFR+NWA+FSTG+KR+ +N PDLS+FVGDL+ DVTD LL++TFS 
Subjt:  HNCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRS-DNDPDLSIFVGDLAADVTDALLYDTFSS

Query:  KFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICC
        ++PSVK+AKVVID+NTGRSKGYGFVRFGD+NERS+A++EMNG YCS+R MR+G ATP+++   QQQ+SSQ +                            
Subjt:  KFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICC

Query:  PLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF--------------
           + GG+ +NG    G QS+G+S N TIFVGG+DP+V DEDLRQPFSQ+GE+VSVKIPVGKGCGFVQFA+R  AE+A++ LN T               
Subjt:  PLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF--------------

Query:  -----RDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMY
              D G QWNG Y  GH Y+  G     HD + Y
Subjt:  -----RDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMY

Q9LEB3 Polyadenylate-binding protein RBP47 | 7.4e-120 | 52.26
Query:  NGSD--SQPQEPNQRHQQ-----------PQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDL
        NG D   Q Q+  Q+HQQ            Q WM+MQYP  AM M   MM    YM       PY   H Q Q      Q    +Q S  +NKTIW+GDL
Subjt:  NGSD--SQPQEPNQRHQQ-----------PQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDL

Query:  HHWMDESYLHNCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDNDPDLSIFVGDLAADVTDA
          WMDESYLH+CF+  GE+ S+KIIRNKQ+G SE YGFVEF  HA AEK LQ+Y G +MP+TEQPFRLNWA FSTG+KR++   D SIFVGDLA+DVTD 
Subjt:  HHWMDESYLHNCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDNDPDLSIFVGDLAADVTDA

Query:  LLYDTFSSKFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPL
        +L DTF+S++PS+K AKVV+DANTG SKGYGFVRFGD++ERS+AM+EMNGVYCSSR MRIG ATP+K S ++Q +S                        
Subjt:  LLYDTFSSKFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPL

Query:  LDCSCICCPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF------
                 + + GGY++NG  + G QS+GDS+NTTIFVGGLD  VTDE+LRQ F+Q+GE+VSVKIP GKGCGFVQF++R+ A+EA+QKL+         
Subjt:  LDCSCICCPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF------

Query:  --------------RDFGNQWNGAYYGGHIYDGYGYGLPPH-DPSMYQ--AAYGAYT-VYGSHQQ
                       D G+QWNG Y G   Y GYGYG   + D  MY   AAYGA +  YG+HQQ
Subjt:  --------------RDFGNQWNGAYYGGHIYDGYGYGLPPH-DPSMYQ--AAYGAYT-VYGSHQQ

Q9SX79 Polyadenylate-binding protein RBP47C | 2.5e-120 | 54.14
Query:  QPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLHNCFAS--I
        +  +P      P  WM     PP +++PH MM    Y  PP PP    H +P + H H   + ++      GENKTIWVGDLHHWMDE+YL++ FAS   
Subjt:  QPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLHNCFAS--I

Query:  GEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDND-PDLSIFVGDLAADVTDALLYDTFSSKFPSVKA
         EI S+K+IRNK +GLSEGYGFVEF +H  A+K L+ + G  MP+T+QPFRLNWA+FSTG+KR +N+ PDLSIFVGDL+ DV+D LL++TFS K+PSVKA
Subjt:  GEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDND-PDLSIFVGDLAADVTDALLYDTFSSKFPSVKA

Query:  AKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICCPLFIIGG
        AKVV+DANTGRSKGYGFVRFGD+NER++AM+EMNGV CSSR MRIG ATPRK++GYQQQ                                       GG
Subjt:  AKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICCPLFIIGG

Query:  YSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF-------------------RD
        Y  NG  +   + EGD  NTTIFVGGLD +VTDEDL+QPF+++GEIVSVKIPVGKGCGFVQF NR +AEEAL+KLN T                    RD
Subjt:  YSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF-------------------RD

Query:  -FGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQ
         +GNQW   YYGG  Y+GYGY +P  DP MY AA   Y +YG HQQQ
Subjt:  -FGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQ

Q9SX80 Polyadenylate-binding protein RBP47C' | 3.7e-119 | 54.29
Query:  MQSNGSDSQPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLH
        ++ N   + P  P      P  W  M+YPP  ++MP  M  PP    PP+P  PY H +P + H H   + ++      GENKTIWVGDL +WMDE+YL+
Subjt:  MQSNGSDSQPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLH

Query:  NCFASI--GEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDND-PDLSIFVGDLAADVTDALLYDTFS
        + F S    EI S+K+IRNK +G SEGYGFVEF +H  A+K LQ + G  MP+T+QPFRLNWA+FSTG+KR +N+ PDLSIFVGDLA DV+DALL++TFS
Subjt:  NCFASI--GEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDND-PDLSIFVGDLAADVTDALLYDTFS

Query:  SKFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCIC
         K+PSVKAAKVV+DANTGRSKGYGFVRFGD+NER++AM+EMNGV CSSR MRIG ATPRK++GYQQQ                                 
Subjt:  SKFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCIC

Query:  CPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF-------------
              GGY  +G F+   +SEGD+ NTTIFVGGLD +VTDEDL+QPFS++GEIVSVKIPVGKGCGFVQF NR +AEEAL+KLN T              
Subjt:  CPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF-------------

Query:  ------RD-FGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQ
              RD +GNQW   YYGG  Y+GYGY +P  DP MY AA   Y +YG HQQQ
Subjt:  ------RD-FGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQ

Arabidopsis top hits | e value | % identity | Alignment
AT1G47490.1 RNA-binding protein 47C | 1.8e-121 | 54.14
Query:  QPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLHNCFAS--I
        +  +P      P  WM     PP +++PH MM    Y  PP PP    H +P + H H   + ++      GENKTIWVGDLHHWMDE+YL++ FAS   
Subjt:  QPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLHNCFAS--I

Query:  GEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDND-PDLSIFVGDLAADVTDALLYDTFSSKFPSVKA
         EI S+K+IRNK +GLSEGYGFVEF +H  A+K L+ + G  MP+T+QPFRLNWA+FSTG+KR +N+ PDLSIFVGDL+ DV+D LL++TFS K+PSVKA
Subjt:  GEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDND-PDLSIFVGDLAADVTDALLYDTFSSKFPSVKA

Query:  AKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICCPLFIIGG
        AKVV+DANTGRSKGYGFVRFGD+NER++AM+EMNGV CSSR MRIG ATPRK++GYQQQ                                       GG
Subjt:  AKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICCPLFIIGG

Query:  YSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF-------------------RD
        Y  NG  +   + EGD  NTTIFVGGLD +VTDEDL+QPF+++GEIVSVKIPVGKGCGFVQF NR +AEEAL+KLN T                    RD
Subjt:  YSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF-------------------RD

Query:  -FGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQ
         +GNQW   YYGG  Y+GYGY +P  DP MY AA   Y +YG HQQQ
Subjt:  -FGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQ

AT1G47500.1 RNA-binding protein 47C' | 2.6e-120 | 54.29
Query:  MQSNGSDSQPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLH
        ++ N   + P  P      P  W  M+YPP  ++MP  M  PP    PP+P  PY H +P + H H   + ++      GENKTIWVGDL +WMDE+YL+
Subjt:  MQSNGSDSQPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLH

Query:  NCFASI--GEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDND-PDLSIFVGDLAADVTDALLYDTFS
        + F S    EI S+K+IRNK +G SEGYGFVEF +H  A+K LQ + G  MP+T+QPFRLNWA+FSTG+KR +N+ PDLSIFVGDLA DV+DALL++TFS
Subjt:  NCFASI--GEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDND-PDLSIFVGDLAADVTDALLYDTFS

Query:  SKFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCIC
         K+PSVKAAKVV+DANTGRSKGYGFVRFGD+NER++AM+EMNGV CSSR MRIG ATPRK++GYQQQ                                 
Subjt:  SKFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCIC

Query:  CPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF-------------
              GGY  +G F+   +SEGD+ NTTIFVGGLD +VTDEDL+QPFS++GEIVSVKIPVGKGCGFVQF NR +AEEAL+KLN T              
Subjt:  CPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF-------------

Query:  ------RD-FGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQ
              RD +GNQW   YYGG  Y+GYGY +P  DP MY AA   Y +YG HQQQ
Subjt:  ------RD-FGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQ

AT3G19130.1 RNA-binding protein 47B (e-value: 2.7e-117; % identity: 51.95)
Query:  PQEPNQRHQQPQSWM-SMQYPPPA---MVMPHHMMTPPHYMAPPLPPPPYMHYHPQ------YQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYL
        PQ+  Q+ QQ Q WM +MQYPP A   M+    M+  PH    P    PY  +HPQ      YQ H    QH    +GSG + KT+WVGDL HWMDE+YL
Subjt:  PQEPNQRHQQPQSWM-SMQYPPPA---MVMPHHMMTPPHYMAPPLPPPPYMHYHPQ------YQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYL

Query:  HNCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRS-DNDPDLSIFVGDLAADVTDALLYDTFSS
        H+CF+  GE++S+K+IRNK +  SEGYGFVEF + A AE+ LQNY+G +MP+++QPFR+NWA+FSTG+KR+ +N PDLS+FVGDL+ DVTD LL++TFS 
Subjt:  HNCFASIGEIASIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRS-DNDPDLSIFVGDLAADVTDALLYDTFSS

Query:  KFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICC
        ++PSVK+AKVVID+NTGRSKGYGFVRFGD+NERS+A++EMNG YCS+R MR+G ATP+++   QQQ+SSQ +                            
Subjt:  KFPSVKAAKVVIDANTGRSKGYGFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICC

Query:  PLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF--------------
           + GG+ +NG    G QS+G+S N TIFVGG+DP+V DEDLRQPFSQ+GE+VSVKIPVGKGCGFVQFA+R  AE+A++ LN T               
Subjt:  PLFIIGGYSTNGPFSQGLQSEGDSANTTIFVGGLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATF--------------

Query:  -----RDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMY
              D G QWNG Y  GH Y+  G     HD + Y
Subjt:  -----RDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMY

AT5G42690.1 Protein of unknown function, DUF547 (e-value: 3.4e-112; % identity: 49.51)
Query:  RVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELS-
        +  +RE+ + LQ+DV+KL+KKLR EEN+ RA++RAF+RPLGALPRLPPFLPP++LELLAEVAVLEEE+VRLEE +VH RQ+LYQEAV  SSS +N++ S 
Subjt:  RVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELS-

Query:  ---LKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKP-NSPKLNFENRETSPENAEARQLRAPDNKVSGDD
               +K   +  S +++++ L +   S ++     K + L  T +IKTP+KK+   +    K   + KL   +R         R+  A  +   G D
Subjt:  ---LKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKP-NSPKLNFENRETSPENAEARQLRAPDNKVSGDD

Query:  SPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRT-TNSLFLFQRLKLLLGKL
         PN ISE+++KCLS+I +RMSSIK    T+S            ++T  +DPY ICS F  RDIG YKN   VE +S+N  RT ++SLFL ++LK LLG+L
Subjt:  SPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRT-TNSLFLFQRLKLLLGKL

Query:  ASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQI---------------IIDYPSALLYI-------------ASFFL---QPL
        + VN+Q+L  QEKLAFWIN+YNSCM+N FL+ GIPESP+MV  LMQK  I               I+  P    YI             + F L   +PL
Subjt:  ASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQI---------------IIDYPSALLYI-------------ASFFL---QPL

Query:  VTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQ-PLSQFVK
        VTFALSCGSWSSPAVRVYTAS+VE +LE+AKR+YL+A+VGI+  K GIPKL+DWY  DF KD++SL+DW+ LQLP+ELGK+A+  +E+  +Q P S  V 
Subjt:  VTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQ-PLSQFVK

Query:  VIPYEFSFRYL
        +IPY+F+FRYL
Subjt:  VIPYEFSFRYL

AT5G42690.2 Protein of unknown function, DUF547 (e-value: 5.2e-113; % identity: 49.22)
Query:  RVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELS-
        +  +RE+ + LQ+DV+KL+KKLR EEN+ RA++RAF+RPLGALPRLPPFLPP++LELLAEVAVLEEE+VRLEE +VH RQ+LYQEAV  SSS +N++ S 
Subjt:  RVASRERKVALQQDVDKLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELS-

Query:  ---LKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDS
               +K   +  S +++++ L +   S ++     K + L  T +IKTP+KK+   +    K      + E ++   ++   R+  A  +   G D 
Subjt:  ---LKNNSKQVQSEHSVQKTDNGLGKENESRMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDS

Query:  PNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRT-TNSLFLFQRLKLLLGKLA
        PN ISE+++KCLS+I +RMSSIK    T+S            ++T  +DPY ICS F  RDIG YKN   VE +S+N  RT ++SLFL ++LK LLG+L+
Subjt:  PNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTMQTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRT-TNSLFLFQRLKLLLGKLA

Query:  SVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQI---------------IIDYPSALLYI-------------ASFFL---QPLV
         VN+Q+L  QEKLAFWIN+YNSCM+N FL+ GIPESP+MV  LMQK  I               I+  P    YI             + F L   +PLV
Subjt:  SVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQI---------------IIDYPSALLYI-------------ASFFL---QPLV

Query:  TFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQ-PLSQFVKV
        TFALSCGSWSSPAVRVYTAS+VE +LE+AKR+YL+A+VGI+  K GIPKL+DWY  DF KD++SL+DW+ LQLP+ELGK+A+  +E+  +Q P S  V +
Subjt:  TFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQ-PLSQFVKV

Query:  IPYEFSFRYL
        IPY+F+FRYL
Subjt:  IPYEFSFRYL


Sequences
CDS sequence
ATGCAAAGCAACGGTTCTGATTCGCAGCCGCAAGAACCAAATCAGCGCCACCAGCAGCCGCAGTCGTGGATGTCAATGCAGTACCCGCCGCCGGCTATGGTAATGCCGCA
CCATATGATGACGCCTCCGCATTACATGGCGCCGCCGCTTCCGCCGCCGCCGTATATGCATTATCACCCCCAATATCAGCATCACCATCTCCCTATTCAGCATTCGCAGC
CGCTACAGGGATCTGGTGGCGAAAACAAAACGATTTGGGTGGGTGACTTGCATCATTGGATGGACGAGAGTTACCTCCATAACTGCTTTGCTTCCATTGGCGAGATTGCA
TCGATCAAGATCATTCGTAATAAGCAAAGTGGTTTGTCTGAGGGGTATGGGTTTGTAGAATTTTTTGCCCATGCAACAGCTGAAAAAGCTCTTCAGAACTATGCTGGAGT
GCTCATGCCAAGTACAGAGCAGCCTTTCCGCTTGAATTGGGCGACATTTAGCACCGGTGACAAACGATCAGATAATGATCCCGATCTTTCTATTTTTGTTGGTGACTTAG
CTGCAGATGTTACTGATGCTCTGTTGTACGATACATTTTCTAGTAAATTTCCTTCTGTTAAAGCAGCAAAAGTTGTCATTGATGCCAATACTGGTCGTTCAAAGGGTTAT
GGTTTTGTGCGGTTTGGAGATGACAATGAAAGGTCTCAGGCCATGTCTGAAATGAATGGTGTATATTGTTCGAGCAGACCTATGCGCATTGGTGCAGCAACGCCTAGAAA
GTCTTCTGGATATCAGCAACAGAATTCTTCACAAGGTATATTTAAATTGTATGGTTACAATGTAGTAGTTTACTCATGGTTGTACACTTCCCTACCCCTTTTGGACTGCT
CATGCATATGTTGTCCTCTGTTTATTATAGGAGGATATTCAACAAATGGTCCCTTTAGCCAAGGACTCCAATCTGAAGGAGATTCTGCAAATACAACTATATTTGTTGGA
GGGCTTGATCCTAATGTCACCGATGAAGATCTGAGGCAACCTTTCTCACAGTATGGTGAGATAGTCTCTGTTAAAATACCAGTTGGTAAAGGATGTGGGTTTGTTCAATT
TGCCAATAGAAATGATGCTGAAGAGGCCTTGCAGAAACTAAATGCAACTTTCAGAGATTTTGGTAACCAGTGGAATGGAGCTTACTACGGGGGGCACATATATGATGGCT
ATGGATATGGTCTGCCGCCTCACGATCCAAGTATGTATCAGGCAGCTTATGGAGCTTACACTGTATATGGAAGCCATCAACAACAAGATCGATTATTACTAAAAATGGAT
TTGTGGGTAGGTTGGTATCCTACATTAGGTTTCAAGATTATAACTGGATTTGTCGCTGATTTACTCAATAAACCCCTCTTCATTGCTGCTGCTGCTGCTAAGGACAACGA
CAAAGGAAGGAAGATGAGTTTTGAAGAGATGGATCGCAAAGGGAGGACAAGGGTGCAGTCAATGAGAGCATCTGCGAATCATGAAAAAGTTGAATTCCGAATTGCCTTGA
AGGGAAGCGTGGATATGCCGGCGGCGAACTTGCCTGATGCAGCCAAAGCAACCACGAGTGGACGGGTTGCAAGCAGAGAGAGAAAAGTGGCTTTACAACAAGACGTTGAC
AAGCTGAAGAAGAAGCTCAGACATGAAGAAAATGTGGGTCGTGCTTTGAAGAGAGCTTTCACCAGACCTCTGGGAGCTTTACCCCGCCTCCCTCCTTTTCTTCCTCCCAA
TATGCTAGAACTTCTTGCCGAAGTAGCTGTTCTTGAAGAGGAGGTGGTGCGGCTTGAAGAGCAGGTTGTGCATTTCAGGCAGGACTTGTATCAGGAAGCTGTCAACATTT
CCTCCTCCAAGAAGAACATGGAGCTTTCCCTCAAAAACAATTCCAAGCAAGTTCAATCCGAACATTCAGTTCAGAAGACTGATAATGGTCTGGGAAAGGAAAATGAATCA
CGTATGAATTCGACAAGTAACCACAAGGTTTCTTCTCTCCAGAAAACCCATACGATAAAGACCCCAGTCAAGAAATCTCCAGCTCGATACAAATCATCAGCGAAACCGAA
TTCTCCAAAACTGAATTTCGAAAACCGAGAGACGAGTCCAGAAAATGCTGAAGCTAGACAACTACGTGCTCCAGACAACAAGGTATCTGGTGATGATAGTCCGAACAATA
TCTCTGAAAATATTTTGAAATGCTTATCGAGCATTCTATTGAGGATGAGCTCAATCAAGAACAGAGGTGCCACTGAAAGTTTACATCTATTCTCAATGCTAACTACGATG
CAAACTGAAGAGACGGATCATCAGGACCCTTATACTATATGTTCAGAGTTTGGAATGCGAGATATTGGTCCATACAAGAACGTACGTATAGTTGAAGCCAGTTCAATTAA
TACAAAGCGGACGACAAACTCATTGTTTCTATTCCAAAGATTGAAACTCCTCTTGGGGAAACTTGCTTCTGTCAACTTGCAACGTCTTACACATCAGGAGAAACTTGCCT
TTTGGATCAACGTCTATAACTCCTGCATGATAAATGCATTTCTACAACTTGGAATACCAGAGAGTCCTGAGATGGTTGCTGGTTTGATGCAGAAGGTACAGATTATTATT
GATTATCCTTCTGCTCTACTCTATATAGCTTCGTTCTTTCTTCAACCATTGGTTACGTTCGCATTATCATGTGGAAGCTGGTCGTCCCCAGCTGTGAGAGTGTACACAGC
ATCCCAGGTCGAGAACCAGCTAGAATTAGCAAAAAGAGACTATTTACAAGCTGCAGTAGGAATTGCATCAGAGAAGTTTGGAATCCCAAAGCTGCTGGATTGGTATTTGC
TGGACTTTGGTAAAGACTTGGACTCCTTGGTTGACTGGGTCTGCCTTCAGCTACCAAGTGAATTAGGAAAAGAAGCAATTAAGTTGATAGAGAGGAGACAAAACCAGCCT
CTTTCTCAGTTTGTTAAAGTAATACCTTATGAATTTAGCTTTAGATACCTTCTCTGCACATAA
mRNA sequence
ATTCGGATTCTTTTTCCTCTCATTCCATTCAGATTCTTTTTCCTCTCATTCCATTCAGATTCTTTTACCCTTTTCAGTAAGGGTGTACATTAAAATCTGCTCCATTCTCC
TTGTTAATACGGAGGAAGATAAGATGCAAAGCAACGGTTCTGATTCGCAGCCGCAAGAACCAAATCAGCGCCACCAGCAGCCGCAGTCGTGGATGTCAATGCAGTACCCG
CCGCCGGCTATGGTAATGCCGCACCATATGATGACGCCTCCGCATTACATGGCGCCGCCGCTTCCGCCGCCGCCGTATATGCATTATCACCCCCAATATCAGCATCACCA
TCTCCCTATTCAGCATTCGCAGCCGCTACAGGGATCTGGTGGCGAAAACAAAACGATTTGGGTGGGTGACTTGCATCATTGGATGGACGAGAGTTACCTCCATAACTGCT
TTGCTTCCATTGGCGAGATTGCATCGATCAAGATCATTCGTAATAAGCAAAGTGGTTTGTCTGAGGGGTATGGGTTTGTAGAATTTTTTGCCCATGCAACAGCTGAAAAA
GCTCTTCAGAACTATGCTGGAGTGCTCATGCCAAGTACAGAGCAGCCTTTCCGCTTGAATTGGGCGACATTTAGCACCGGTGACAAACGATCAGATAATGATCCCGATCT
TTCTATTTTTGTTGGTGACTTAGCTGCAGATGTTACTGATGCTCTGTTGTACGATACATTTTCTAGTAAATTTCCTTCTGTTAAAGCAGCAAAAGTTGTCATTGATGCCA
ATACTGGTCGTTCAAAGGGTTATGGTTTTGTGCGGTTTGGAGATGACAATGAAAGGTCTCAGGCCATGTCTGAAATGAATGGTGTATATTGTTCGAGCAGACCTATGCGC
ATTGGTGCAGCAACGCCTAGAAAGTCTTCTGGATATCAGCAACAGAATTCTTCACAAGGTATATTTAAATTGTATGGTTACAATGTAGTAGTTTACTCATGGTTGTACAC
TTCCCTACCCCTTTTGGACTGCTCATGCATATGTTGTCCTCTGTTTATTATAGGAGGATATTCAACAAATGGTCCCTTTAGCCAAGGACTCCAATCTGAAGGAGATTCTG
CAAATACAACTATATTTGTTGGAGGGCTTGATCCTAATGTCACCGATGAAGATCTGAGGCAACCTTTCTCACAGTATGGTGAGATAGTCTCTGTTAAAATACCAGTTGGT
AAAGGATGTGGGTTTGTTCAATTTGCCAATAGAAATGATGCTGAAGAGGCCTTGCAGAAACTAAATGCAACTTTCAGAGATTTTGGTAACCAGTGGAATGGAGCTTACTA
CGGGGGGCACATATATGATGGCTATGGATATGGTCTGCCGCCTCACGATCCAAGTATGTATCAGGCAGCTTATGGAGCTTACACTGTATATGGAAGCCATCAACAACAAG
ATCGATTATTACTAAAAATGGATTTGTGGGTAGGTTGGTATCCTACATTAGGTTTCAAGATTATAACTGGATTTGTCGCTGATTTACTCAATAAACCCCTCTTCATTGCT
GCTGCTGCTGCTAAGGACAACGACAAAGGAAGGAAGATGAGTTTTGAAGAGATGGATCGCAAAGGGAGGACAAGGGTGCAGTCAATGAGAGCATCTGCGAATCATGAAAA
AGTTGAATTCCGAATTGCCTTGAAGGGAAGCGTGGATATGCCGGCGGCGAACTTGCCTGATGCAGCCAAAGCAACCACGAGTGGACGGGTTGCAAGCAGAGAGAGAAAAG
TGGCTTTACAACAAGACGTTGACAAGCTGAAGAAGAAGCTCAGACATGAAGAAAATGTGGGTCGTGCTTTGAAGAGAGCTTTCACCAGACCTCTGGGAGCTTTACCCCGC
CTCCCTCCTTTTCTTCCTCCCAATATGCTAGAACTTCTTGCCGAAGTAGCTGTTCTTGAAGAGGAGGTGGTGCGGCTTGAAGAGCAGGTTGTGCATTTCAGGCAGGACTT
GTATCAGGAAGCTGTCAACATTTCCTCCTCCAAGAAGAACATGGAGCTTTCCCTCAAAAACAATTCCAAGCAAGTTCAATCCGAACATTCAGTTCAGAAGACTGATAATG
GTCTGGGAAAGGAAAATGAATCACGTATGAATTCGACAAGTAACCACAAGGTTTCTTCTCTCCAGAAAACCCATACGATAAAGACCCCAGTCAAGAAATCTCCAGCTCGA
TACAAATCATCAGCGAAACCGAATTCTCCAAAACTGAATTTCGAAAACCGAGAGACGAGTCCAGAAAATGCTGAAGCTAGACAACTACGTGCTCCAGACAACAAGGTATC
TGGTGATGATAGTCCGAACAATATCTCTGAAAATATTTTGAAATGCTTATCGAGCATTCTATTGAGGATGAGCTCAATCAAGAACAGAGGTGCCACTGAAAGTTTACATC
TATTCTCAATGCTAACTACGATGCAAACTGAAGAGACGGATCATCAGGACCCTTATACTATATGTTCAGAGTTTGGAATGCGAGATATTGGTCCATACAAGAACGTACGT
ATAGTTGAAGCCAGTTCAATTAATACAAAGCGGACGACAAACTCATTGTTTCTATTCCAAAGATTGAAACTCCTCTTGGGGAAACTTGCTTCTGTCAACTTGCAACGTCT
TACACATCAGGAGAAACTTGCCTTTTGGATCAACGTCTATAACTCCTGCATGATAAATGCATTTCTACAACTTGGAATACCAGAGAGTCCTGAGATGGTTGCTGGTTTGA
TGCAGAAGGTACAGATTATTATTGATTATCCTTCTGCTCTACTCTATATAGCTTCGTTCTTTCTTCAACCATTGGTTACGTTCGCATTATCATGTGGAAGCTGGTCGTCC
CCAGCTGTGAGAGTGTACACAGCATCCCAGGTCGAGAACCAGCTAGAATTAGCAAAAAGAGACTATTTACAAGCTGCAGTAGGAATTGCATCAGAGAAGTTTGGAATCCC
AAAGCTGCTGGATTGGTATTTGCTGGACTTTGGTAAAGACTTGGACTCCTTGGTTGACTGGGTCTGCCTTCAGCTACCAAGTGAATTAGGAAAAGAAGCAATTAAGTTGA
TAGAGAGGAGACAAAACCAGCCTCTTTCTCAGTTTGTTAAAGTAATACCTTATGAATTTAGCTTTAGATACCTTCTCTGCACATAATAATTCATAATCCAGTTGATTTTT
CTGTGGATGTTTTCTTTTCTTTTTTTTTTTGGCAGCAATGGTTCAGATTTAGTAGTAGCAGTAGTACACTGTATGATAGGTATTATAAAGAATATAGACTTGTCCATTGT
TTGGTAACAGTGCAAGGTGAAAAATTTAAGCACCTCGTGGCTAATGTGAACAACAGATGATTCTTGATCTTGGGTGTACTGAGCATTAATGAAAGAACCAAATTAACTAT
GAAC
Protein sequence
MQSNGSDSQPQEPNQRHQQPQSWMSMQYPPPAMVMPHHMMTPPHYMAPPLPPPPYMHYHPQYQHHHLPIQHSQPLQGSGGENKTIWVGDLHHWMDESYLHNCFASIGEIA
SIKIIRNKQSGLSEGYGFVEFFAHATAEKALQNYAGVLMPSTEQPFRLNWATFSTGDKRSDNDPDLSIFVGDLAADVTDALLYDTFSSKFPSVKAAKVVIDANTGRSKGY
GFVRFGDDNERSQAMSEMNGVYCSSRPMRIGAATPRKSSGYQQQNSSQGIFKLYGYNVVVYSWLYTSLPLLDCSCICCPLFIIGGYSTNGPFSQGLQSEGDSANTTIFVG
GLDPNVTDEDLRQPFSQYGEIVSVKIPVGKGCGFVQFANRNDAEEALQKLNATFRDFGNQWNGAYYGGHIYDGYGYGLPPHDPSMYQAAYGAYTVYGSHQQQDRLLLKMD
LWVGWYPTLGFKIITGFVADLLNKPLFIAAAAAKDNDKGRKMSFEEMDRKGRTRVQSMRASANHEKVEFRIALKGSVDMPAANLPDAAKATTSGRVASRERKVALQQDVD
KLKKKLRHEENVGRALKRAFTRPLGALPRLPPFLPPNMLELLAEVAVLEEEVVRLEEQVVHFRQDLYQEAVNISSSKKNMELSLKNNSKQVQSEHSVQKTDNGLGKENES
RMNSTSNHKVSSLQKTHTIKTPVKKSPARYKSSAKPNSPKLNFENRETSPENAEARQLRAPDNKVSGDDSPNNISENILKCLSSILLRMSSIKNRGATESLHLFSMLTTM
QTEETDHQDPYTICSEFGMRDIGPYKNVRIVEASSINTKRTTNSLFLFQRLKLLLGKLASVNLQRLTHQEKLAFWINVYNSCMINAFLQLGIPESPEMVAGLMQKVQIII
DYPSALLYIASFFLQPLVTFALSCGSWSSPAVRVYTASQVENQLELAKRDYLQAAVGIASEKFGIPKLLDWYLLDFGKDLDSLVDWVCLQLPSELGKEAIKLIERRQNQP
LSQFVKVIPYEFSFRYLLCT