CuGenDBv2

Sgr030384 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr030384
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: nucleolin 1 isoform X1
Genome location: tig00153640:3087424..3097437
RNA-Seq Expression: Sgr030384
Synteny: Sgr030384
Gene Ontology terms: GO:0043488 - regulation of mRNA stability (biological process)
GO:1900364 - negative regulation of mRNA polyadenylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0008143 - poly(A) binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR000504 - RNA recognition motif domain
IPR002483 - PWI domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR035979 - RNA-binding domain superfamily
IPR040366 - Nuclear polyadenylated RNA-binding protein Nab2/ZC3H14
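
The genome location above uses a `contig:start..end` layout. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Contig name, a colon, then two integers separated by '..'.
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'tig00153640:3087424..3097437'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00153640:3087424..3097437")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 10014 bases on contig tig00153640.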


Homology
GenBank top hits | e value | % identity | Alignment
KAG7036580.1 Polyadenylate-binding protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.6e-292 | 80
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGS DR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRK+EA+NELNVFL DDSHSFVSWLWDHLASSM+LYVE P KSSA
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        +EVPR K                    + KSEKLSSRRRNREWKG+A+DETRV PRSEVSRVKHSSPEQVP HRKRSRADDHQG EREAAFQVSIAAPRR
Subjt:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVAT +PSN AKEPLSKRLRSVVSTSNSDTT+ PRRLQSVAKVPNPMATVIKAV+EAAEDVIRVKSSSVFDRLGRQS D+D  ESSGQ+AEY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
         V AVED KYGDM HTQD+PY    L  +                               I+ H+VFDDSWTAESGVRKGGNLRT PFRVVEN DD+ R+
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
        T+YKQK QPS VANSSRDIVNISVNVNTWKPPHYQDPGQIAE G QKFLQESEL+GTRSAV+V ENGEPVTIVNQQKK AA+ QKEFQKP  SANG  AA
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE+ADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSF+SRILKV RKNAS  EGASIV WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSP+PTPRF+RVPFPRGI  GFRPRPPIKLGARSMQWKRDSQ T+ DNGAS+ G S+PS+GARSLTYVRT+PKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

XP_008461790.1 PREDICTED: nucleolin 1 isoform X1 [Cucumis melo] | 4.8e-292 | 79.26
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGS DRVDDRTFKVDF+GEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRR++EA+NELNVFL DDSHSFVSWLWDHLASSMDLYVE P KSSA
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        NEVPRPK                    + K+EKLSSRRRNREW+G+A++ETRV P+SEVSRVKHSSPEQVP+HRKRSR DD QGTEREAAFQVSIAAPRR
Subjt:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVATTRPSNAAKEP SKRLRSVVS SNSDTTNRPRRLQSVAKVPNPMATVIKAVTEA+EDV+RVKSSSVFDRLGRQS DMDLTESSG+L EY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
         V +VE+++YGDM H++D+PY    L  +                               I  HRVFDDSWTAESGVRKG NLRTV FR V+N+DD+ R+
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
         QY QKDQPS  ANSSRDIVNISVNVNTWKPPHYQD GQI EL  QKFLQESELQGTRSAVQV ENGEPVT+VNQ+K  A+N QKEFQKPPLSANGQFA+
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE ADARTIFVSNVHFAATKDSLSRHFNKFGEV+KVIIVTDATTGQPKGSAYVEFMRKE+AENALSLDGTSFMSRILKVVRKNAS LEGAS V WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSPFPT RFTRVPF RG+  GFR RPP+KLGARSMQWKRD+Q  +ADNGAS+SGNSIPS GARSLTYVRTEPKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

XP_022152638.1 protein gar2 isoform X1 [Momordica charantia] | 2.4e-309 | 83.82
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGR KDEAKNELNVFL DDSHSFVSWLWDHLASSMDLYVE PTKSS 
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRP--------------------KKRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        NEVP P                    ++ KSEKLSSRRRNREWKG+A+DETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
Subjt:  NEVPRP--------------------KKRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVATTRPSN AKEP SKRLRSVVSTSN+DTTNRPRRLQSVAKVPNPMATVIKAVTEAAED IRVKSSSVFDRLGRQSHDMDLTE SGQLAEY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
        RV AVED KYGD+THTQDQPY    L  +                               +  HR FDDSWTAESGVRK GNLR+VPFRVVENAD+ TRI
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
        TQYKQKDQPS VANSSRDIVNISVNVNTWKPPHYQDPGQIAE GSQKFLQESELQG+RSAVQV ENG+ VTIVNQQK  AANPQKEFQKPP SANGQFAA
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE+ADARTIFVSNVHF ATKDSLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRK+AAENALSLDGTSFMSRILKV+RKNAS LEGASIV WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSP+P+PRF+R PF RGI  GFRPRPPIKLGARSMQWKRDSQATSADNGAS+SGNSI SSGARSLTYVRTEPKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

XP_023525351.1 nucleolin 1-like isoform X1 [Cucurbita pepo subsp. pepo] | 8.2e-292 | 79.88
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGS DR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRK+EA+NELNVFL DDSHSFVSWLWDHLASSM+LYVE P KSSA
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        +EVPR K                    + KSEKLSSRRRNREWKG+A+DETRV PRSEVSRVKHSSPEQVPSHRKR RADDHQG EREAAFQVSIAAPRR
Subjt:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVAT +PSN AKEPLSKRLRSVVSTSNSDT + PRRLQS+AKVPNPMATVIKAV+EAAEDVIRVKSSSVFDRLGRQS D+DL ESSGQ+AEY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
         V AVED KYGDM HTQD+PY    L  +                               I+ H+VFDDSWTAESGVRKGGNLRT PFRVVEN DD+ R+
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
        T+YKQK QPS VANSSRDIVNISVNVNTWKPPHYQDPGQIAE G QKFLQESEL+GTRSAVQV ENGEPVTIVNQQKK AA+ QKEFQKP  SANG  AA
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE+ADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSF+SRILKV RKNAS  EGASIV WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQ-ATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSP+PTPRF+RVPFPRG+  GFRPRPPIKLGARSMQWKRDSQ  T+ DNGAS+ G S+PS+GARSLTYVRTEPKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQ-ATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

XP_038904558.1 nucleolin 1 isoform X1 [Benincasa hispida] | 2.5e-296 | 80.59
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGS D VDDRTFKVDF+GEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRK+EA+NELNVFL DDSHSFVSWLWDHLASSMDLYVE PTKSSA
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        NEVPRPK                    + KSEKLSSRRRNREW+G+A+DETRV PRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
Subjt:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVATTRPS+AAKEP SKRLRSVVSTSNSDTTN PRRLQSVAKVPNPMATVIKAVTEAAEDV+RVKSSSVFDRLGRQS DMDLTESSGQLAEY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPP-----------ILREAIID--------------------HRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
         V +VE++KYGDM HT+D+PY               L EA+ +                    HR+F+DSWTAESGVR+GGNLRTVPFR VEN+DD+ R+
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPP-----------ILREAIID--------------------HRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
        TQYKQKDQPS VANSSRDIVNISVNVNTWKPPHYQDPGQI EL  QKFLQESELQ TRSAVQVMENGEPVT+VNQ+K+ A + QKEFQKPPLSANGQF  
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE ADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQP+GSAYVEFMRKE+AENALSLDGTSFMSRILKVVRKNAS +EGAS   WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSPFPTPRFTRVPF RG+  GFR R  +KLGARSMQWKRDSQ T+A+ GAS SGNS+PSSGARSLTYVRTE KPA+K
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
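
In the alignments above, the middle line of each block repeats a residue where query and subject match, shows `+` for a conservative substitution, and is blank elsewhere. As a rough sketch of how that middle line could be summarised, the Python below counts identities and positives for one block. The function name `midline_stats` is hypothetical, and the counts cover only the block passed in, so they will not necessarily reproduce the % identity column, which is computed over the whole alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarise one alignment block from its query line and midline.

    Both arguments are the sequence portions only, without the
    'Query:' label or the leading padding of the midline.
    """
    columns = len(query)
    # Scraped text may have lost trailing blanks; pad the midline back out.
    midline = midline.ljust(columns)[:columns]
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return {"columns": columns, "identities": identities, "positives": positives}


# Toy block: first nine columns of the query against a midline.
stats = midline_stats("MGSVDRVDD", "MGS DR+DD")
print(stats)
```

Gap columns (`-`) in the query line count toward `columns` here, which is a simplification; a fuller treatment would track gaps separately.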

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CFE4 nucleolin 1 isoform X1 | 2.3e-292 | 79.26
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGS DRVDDRTFKVDF+GEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRR++EA+NELNVFL DDSHSFVSWLWDHLASSMDLYVE P KSSA
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        NEVPRPK                    + K+EKLSSRRRNREW+G+A++ETRV P+SEVSRVKHSSPEQVP+HRKRSR DD QGTEREAAFQVSIAAPRR
Subjt:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVATTRPSNAAKEP SKRLRSVVS SNSDTTNRPRRLQSVAKVPNPMATVIKAVTEA+EDV+RVKSSSVFDRLGRQS DMDLTESSG+L EY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
         V +VE+++YGDM H++D+PY    L  +                               I  HRVFDDSWTAESGVRKG NLRTV FR V+N+DD+ R+
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
         QY QKDQPS  ANSSRDIVNISVNVNTWKPPHYQD GQI EL  QKFLQESELQGTRSAVQV ENGEPVT+VNQ+K  A+N QKEFQKPPLSANGQFA+
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE ADARTIFVSNVHFAATKDSLSRHFNKFGEV+KVIIVTDATTGQPKGSAYVEFMRKE+AENALSLDGTSFMSRILKVVRKNAS LEGAS V WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSPFPT RFTRVPF RG+  GFR RPP+KLGARSMQWKRD+Q  +ADNGAS+SGNSIPS GARSLTYVRTEPKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

A0A5A7U3W4 Nucleolin 1 isoform X1 | 2.3e-292 | 79.26
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGS DRVDDRTFKVDF+GEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRR++EA+NELNVFL DDSHSFVSWLWDHLASSMDLYVE P KSSA
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        NEVPRPK                    + K+EKLSSRRRNREW+G+A++ETRV P+SEVSRVKHSSPEQVP+HRKRSR DD QGTEREAAFQVSIAAPRR
Subjt:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVATTRPSNAAKEP SKRLRSVVS SNSDTTNRPRRLQSVAKVPNPMATVIKAVTEA+EDV+RVKSSSVFDRLGRQS DMDLTESSG+L EY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
         V +VE+++YGDM H++D+PY    L  +                               I  HRVFDDSWTAESGVRKG NLRTV FR V+N+DD+ R+
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
         QY QKDQPS  ANSSRDIVNISVNVNTWKPPHYQD GQI EL  QKFLQESELQGTRSAVQV ENGEPVT+VNQ+K  A+N QKEFQKPPLSANGQFA+
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE ADARTIFVSNVHFAATKDSLSRHFNKFGEV+KVIIVTDATTGQPKGSAYVEFMRKE+AENALSLDGTSFMSRILKVVRKNAS LEGAS V WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSPFPT RFTRVPF RG+  GFR RPP+KLGARSMQWKRD+Q  +ADNGAS+SGNSIPS GARSLTYVRTEPKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

A0A6J1DFE3 protein gar2 isoform X1 | 1.2e-309 | 83.82
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGR KDEAKNELNVFL DDSHSFVSWLWDHLASSMDLYVE PTKSS 
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRP--------------------KKRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        NEVP P                    ++ KSEKLSSRRRNREWKG+A+DETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
Subjt:  NEVPRP--------------------KKRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVATTRPSN AKEP SKRLRSVVSTSN+DTTNRPRRLQSVAKVPNPMATVIKAVTEAAED IRVKSSSVFDRLGRQSHDMDLTE SGQLAEY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
        RV AVED KYGD+THTQDQPY    L  +                               +  HR FDDSWTAESGVRK GNLR+VPFRVVENAD+ TRI
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
        TQYKQKDQPS VANSSRDIVNISVNVNTWKPPHYQDPGQIAE GSQKFLQESELQG+RSAVQV ENG+ VTIVNQQK  AANPQKEFQKPP SANGQFAA
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE+ADARTIFVSNVHF ATKDSLSRHFNKFGE+VKVIIVTDATTGQPKGSAYVEFMRK+AAENALSLDGTSFMSRILKV+RKNAS LEGASIV WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSP+P+PRF+R PF RGI  GFRPRPPIKLGARSMQWKRDSQATSADNGAS+SGNSI SSGARSLTYVRTEPKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

A0A6J1GC70 nucleolin 1-like isoform X1 | 4.0e-292 | 79.85
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGS DR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRK+EA+NELNVFL DDSHSFVSWLWDHLASSM+LYVE P KSSA
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        +EVP  K                    + KSEKLSSRRRNREWKG+A+DETRV PRSEVSRVKHSSPEQVPSHRKRSRADDHQG EREAAFQVSIAAPRR
Subjt:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVAT +PSN AKEPLSKRLRSVVSTS SDTT+ PRRLQSVAKVPNPMATVIKAV+EAAEDVIRVKSSSVFDRLGRQS D+D  ESSGQ+AEY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
         V AVED KYGDM HTQD+PY    L  +                               I+ H+VFDDSWTAESGVRKGGNLRT PFRVVEN DD+ R+
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
        T+YKQK QPS VANSSRDIVNISVNVNTWKPPHYQDPGQIAE G QKFLQESEL+GTRSAV+V ENGEPVTIVNQQKK AA+ QKEFQKP  SANG  AA
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE+ADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSF+SRILKV RKNAS  EGASIV WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSP+PTPRF+RVPFPRG+  GFRPRPPIKLGARSMQWKRDSQ T+ DNGAS+ G S+PS+GARSLTYVRTEPKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

A0A6J1KE94 uncharacterized protein LOC111493050 isoform X1 | 9.8e-291 | 79.41
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA
        MGS DR+DDRTFKVDFSGEGM KLRERIKLKMKEFMGDYTDDTLVEYV+VLLRNGRRK+EA+NELNVFL DDSHSFVSWLWDHLASSM+LYVE P KSSA
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSA

Query:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR
        +EVPR K                    + KSEKLSSRRRNREWKG+A++ETRV PRSEVSRVKHSSPEQVPSHRKRSRADDHQG EREA FQVSIAAPRR
Subjt:  NEVPRPK--------------------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRR

Query:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY
        LLQFAMRDAVAT +PSN AKEPLSKRLRSVVSTSNSDTT+ PRRLQSVAKVPNPMATVIKAV+EAAEDVIRVKSSSVFDRLGRQS D DL ESSGQ+AEY
Subjt:  LLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEY

Query:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI
         V  VED KYGDM HTQD+PY    L  +                               I+ H+VFDDSWTAESGVRKGGNLRT PFRVVEN DD+ R+
Subjt:  RVAAVEDEKYGDMTHTQDQPYQPPILREA-------------------------------IIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRI

Query:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA
         +YKQK QPS VANSSRDIVNISVNVNTWKPPHYQDPGQIAE G QKFLQ SEL+GTRSAV+V ENGEPVTIVNQQKK  A+ QKEFQKP  SANG  AA
Subjt:  TQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAA

Query:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR
         RPLE+ADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSF+SRILKV RKNAS  EGASIV WPR
Subjt:  ARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPR

Query:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK
        AVRGSP+PTPRF+RVPFPRG+  GFRPRPPIKLGARSMQWKRDSQ T+ DNGAS+ G S+PS+GARSLTYVRTEPKPADK
Subjt:  AVRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK

SwissProt top hits | e value | % identity | Alignment
Q6NVP7 Polyadenylate-binding protein 2 | 2.0e-14 | 32.34
Query:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG   EL   + +++ EL+  ++ V+ ME         ++ ++    Q     PP +A     +      ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRA---VRGSPFPT-PRFTRVPFPRGISSGFRPR
         V +V I+ D  TG PKG AY+EF  KE+   +L+LD + F  R +KVV K  +   G S     +PRA    R S + +  RF          SG+ PR
Subjt:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRA---VRGSPFPT-PRFTRVPFPRGISSGFRPR

Query:  P
        P
Subjt:  P

Q7ZXB8 Polyadenylate-binding protein 2-B | 1.2e-14 | 31.63
Query:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG   EL   + +++ EL+  ++ V+ ME         ++ ++    Q     PP +A     +      ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRA---VRGSPFPT-PRFTRVPFPRGISSGFRPR
         V +V I+ D  TG PKG AY+EF  KE+   +L+LD + F  R +KVV K  +   G S     +PRA    R S + +  RF          SG+ PR
Subjt:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRA---VRGSPFPT-PRFTRVPFPRGISSGFRPR

Query:  PPIKL---GARSMQW
        P  ++    AR+  W
Subjt:  PPIKL---GARSMQW

Q86U42 Polyadenylate-binding protein 2 | 3.4e-14 | 30.14
Query:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG +        +++ EL+  ++ V+ ME         ++ ++    Q     PP +A     +      ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRAVRGSPFPTPRFTRVPFPRGISSGFRPRPPIK
         V +V I+ D  +G PKG AY+EF  KE+   +L+LD + F  R +KV+ K  +   G S     +PRA   +       +R  F  G +S  RPR  + 
Subjt:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRAVRGSPFPTPRFTRVPFPRGISSGFRPRPPIK

Query:  LG-ARSMQW
         G AR+  W
Subjt:  LG-ARSMQW

Q8CCS6 Polyadenylate-binding protein 2 | 4.4e-14 | 30.14
Query:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG +        +++ EL+  ++ V+ ME         ++ ++    Q     PP +A     +      ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRAVRGSPFPTPRFTRVPFPRGISSGFRPRPPIK
         V +V I+ D  +G PKG AY+EF  KE+   +L+LD + F  R +KV+ K  +   G S     +PR+   +       +R  F  G +S  RPR  I 
Subjt:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRAVRGSPFPTPRFTRVPFPRGISSGFRPRPPIK

Query:  LG-ARSMQW
         G AR+  W
Subjt:  LG-ARSMQW

Q9DDY9 Polyadenylate-binding protein 2-A | 1.2e-14 | 31.63
Query:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG
        ++PG   EL   + +++ EL+  ++ V+ ME         ++ ++    Q     PP +A     +      ADAR+I+V NV + AT + L  HF+  G
Subjt:  QDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFG

Query:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRA---VRGSPFPT-PRFTRVPFPRGISSGFRPR
         V +V I+ D  TG PKG AY+EF  KE+   +L+LD + F  R +KVV K  +   G S     +PRA    R S + +  RF          SG+ PR
Subjt:  EVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV--AWPRA---VRGSPFPT-PRFTRVPFPRGISSGFRPR

Query:  PPIKL---GARSMQW
        P  ++    AR+  W
Subjt:  PPIKL---GARSMQW

Arabidopsis top hits | e value | % identity | Alignment
AT2G24350.1 RNA binding (RRM/RBD/RNP motifs) family protein | 1.2e-19 | 26.54
Query:  VDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSS-------
        VD  TF +    E  ++L+  I   +  F  DY+DD L EYV VL+ NG+ + +A  +L  FL + S  FV  LW+ L      Y  Q   +S       
Subjt:  VDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSS-------

Query:  --ANEVPRPKKRKSEKL---------SSRRRNREWKGLAS--DETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRRLLQFAM
          +N+    +   S K           +   N +   + +  D+   P   +V ++K S  E + S   R+R        R+     S    R++L+  +
Subjt:  --ANEVPRPKKRKSEKL---------SSRRRNREWKGLAS--DETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRRLLQFAM

Query:  RDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSS----SVFDRLGRQSHDMDLTESSGQLAEYRV
          A       N AK   S   RS         T + R      ++ +  A   +AV+    D    + +    SV+DRLGR S +  L   S  L+++ +
Subjt:  RDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSS----SVFDRLGRQSHDMDLTESSGQLAEYRV

Query:  AAVEDEKYGDMTHTQDQPYQPPILREAIIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRITQYKQKDQPSSVANSSRDIVNISVNVNTWK---
           E +      H Q  P  P    E    H           G R          RV +   D    ++     +P    N SR      V+ N+ +   
Subjt:  AAVEDEKYGDMTHTQDQPYQPPILREAIIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRITQYKQKDQPSSVANSSRDIVNISVNVNTWK---

Query:  PPHYQDPGQIAE----LGSQKFLQE--SELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKP-PLSANGQFAAARPLENADARTIFVSNVHFAATK
         P Y+   Q  E    L  Q   Q+  SE++  +  +Q +E    + I+  Q K     + E +KP P S   Q+      + +++R I V+NV++AA K
Subjt:  PPHYQDPGQIAE----LGSQKFLQE--SELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKP-PLSANGQFAAARPLENADARTIFVSNVHFAATK

Query:  DSLSRHF-NKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV-AWPRAVRGS
        +++S  F +K G V  VI+VTD  T  PKG+A+V F  KE+   A++L GT F SR +KV      H+  + +V A P+ V GS
Subjt:  DSLSRHF-NKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIV-AWPRAVRGS

AT3G12640.1 RNA binding (RRM/RBD/RNP motifs) family protein | 2.4e-116 | 44.61
Query:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLY----VEQPT
        MGS D VDDRTF  DFS EG+AKL+E +K K+KE+MGDYTDD LVEYVIVLLRNGRRK+EA NEL +FLGDDS SFV+WLWDHLA S+D Y    VE  T
Subjt:  MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLY----VEQPT

Query:  -KSSANEVPRPK----------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRRLLQFA
         KSS       K          K +S+K +  RR R+W+   ++ + +PP             +  S RKRSR DD +  +REA   VS    RRLLQFA
Subjt:  -KSSANEVPRPK----------KRKSEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRRLLQFA

Query:  MRDAVATTRPSNAAKEPLSKRLRSVVSTS--NSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSS-SVFDRLGRQSHDMDLTE----------
        +RDA+A +RP+N++ E   KRLRSVVSTS  NS   +  R+++SVA+V NPMATV+KAV EAAED  + KS  SVFDR+   +    L +          
Subjt:  MRDAVATTRPSNAAKEPLSKRLRSVVSTS--NSDTTNRPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSS-SVFDRLGRQSHDMDLTE----------

Query:  -------SSGQLA---EYRVAAVEDEKYGDMTHTQDQPYQPPILREAIIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRITQYKQKDQPSSVA
               S GQ A   +Y  +   +  Y +   T D    P    +        + S  +     +  N  ++  R+V   DD  R+     +++   VA
Subjt:  -------SSGQLA---EYRVAAVEDEKYGDMTHTQDQPYQPPILREAIIDHRVFDDSWTAESGVRKGGNLRTVPFRVVENADDDTRITQYKQKDQPSSVA

Query:  NSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMEN---GEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADAR
           +   + S N++T K    ++  +I ++G Q+++ E  L  + +  Q+      G+  TI N   K AA+  KE      S  G  +  RPLE+A +R
Subjt:  NSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMEN---GEPVTIVNQQKKHAANPQKEFQKPPLSANGQFAAARPLENADAR

Query:  TIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVR-KNASHLEGASIVAWPRAVRGSPFPT
        TIFV+NVHF ATKDSLSRHFNKFGEV+K  IVTD  TGQP GSAY+EF RKEAAENALSLDGTSFMSRILK+V+  N  + E AS ++W R         
Subjt:  TIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVR-KNASHLEGASIVAWPRAVRGSPFPT

Query:  PRFTRVP-FPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPK
         RFTR P + RG     R R  ++ G RSMQWKRD    SAD G   + N++  + ARSLTYVR E K
Subjt:  PRFTRVP-FPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPK

AT5G51120.2 polyadenylate-binding protein 1    e value: 2.1e-11    %identity: 37.86
Query:  EFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRK
        E++K  +++     +A   E  D+R+I+V NV +A T + + +HF   G V +V I+TD   GQPKG AYVEF+  EA +N+L L+ +    R +KV  K
Subjt:  EFQKPPLSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRK

Query:  NAS
          +
Subjt:  NAS

AT5G51120.3 polyadenylate-binding protein 1    e value: 9.5e-12    %identity: 40.43
Query:  NGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNAS
        +G   +A   E  D+R+I+V NV +A T + + +HF   G V +V I+TD   GQPKG AYVEF+  EA +N+L L+ +    R +KV  K  +
Subjt:  NGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNAS

AT5G65260.1 RNA-binding (RRM/RBD/RNP motifs) family protein    e value: 1.9e-12    %identity: 38.52
Query:  ENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPRAVRG
        E  DAR++FV NV +A T + + +HF   G V +V I+TD   GQPKG AYVEF+  EA + AL L+ +    R LKV++K  +++ G       R  R 
Subjt:  ENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPRAVRG

Query:  SPFPTPRFTRVPF-------PRGISSGFRPRPPIK
        +P+   RF R PF       P G     R R P++
Subjt:  SPFPTPRFTRVPF-------PRGISSGFRPRPPIK


Sequences
CDS sequence
ATGGGGAGTGTGGATCGTGTCGATGATCGGACGTTCAAGGTTGATTTCAGTGGCGAGGGAATGGCGAAGCTCAGAGAGAGGATAAAGCTGAAGATGAAGGAATTCATGGG
CGATTATACTGATGACACTCTTGTGGAATATGTGATTGTCTTATTGAGGAACGGCAGGCGCAAAGATGAAGCAAAGAATGAGCTAAATGTATTTTTGGGAGACGATAGTC
ATTCTTTTGTATCTTGGCTGTGGGACCATCTGGCCTCAAGTATGGATTTATATGTGGAGCAGCCTACAAAATCTTCTGCCAATGAGGTGCCCAGACCAAAAAAGAGGAAG
TCTGAAAAGTTAAGTAGCAGACGGCGTAATAGGGAATGGAAAGGGCTTGCTAGTGATGAAACCCGCGTTCCTCCAAGATCTGAAGTTAGTCGTGTTAAGCATTCTTCTCC
TGAACAAGTTCCTAGTCATAGGAAAAGGAGCCGAGCTGATGATCATCAGGGCACTGAGAGGGAGGCAGCCTTTCAGGTAAGCATTGCTGCCCCTCGGCGACTGCTCCAGT
TTGCAATGCGAGATGCTGTGGCCACCACCAGGCCGTCTAATGCAGCAAAGGAGCCCCTTTCGAAGCGTCTTCGTTCAGTAGTTTCTACATCCAATAGTGACACAACTAAT
CGTCCCAGGCGGCTTCAGTCTGTTGCAAAAGTGCCAAATCCTATGGCAACTGTTATTAAGGCTGTGACAGAGGCAGCTGAAGATGTGATAAGAGTTAAATCCTCCAGTGT
TTTTGATCGACTTGGTCGTCAATCTCATGATATGGATTTAACAGAGTCCAGTGGCCAACTTGCAGAATATAGAGTAGCTGCTGTAGAGGACGAAAAATATGGGGATATGA
CTCATACACAGGATCAACCATACCAGCCACCTATCTTGAGAGAAGCAATTATAGATCATAGAGTATTTGATGATTCTTGGACTGCAGAATCGGGAGTAAGAAAGGGAGGC
AATTTGCGGACTGTGCCATTCAGGGTAGTTGAGAATGCTGATGATGATACACGCATAACACAATATAAACAGAAGGATCAACCTTCCTCAGTTGCAAATTCCTCACGTGA
CATTGTAAATATCTCTGTGAATGTTAATACTTGGAAGCCACCTCACTATCAGGACCCAGGGCAGATTGCTGAACTTGGTAGTCAAAAGTTTTTGCAGGAGAGTGAGTTAC
AAGGTACCAGATCTGCTGTGCAAGTAATGGAGAATGGCGAGCCAGTCACTATTGTTAACCAACAGAAAAAGCATGCAGCAAACCCTCAAAAAGAGTTCCAGAAACCTCCA
TTATCTGCTAATGGCCAATTTGCTGCCGCTCGTCCTTTAGAGAATGCTGATGCGCGAACCATTTTTGTTAGCAATGTTCACTTTGCTGCTACAAAGGATAGCCTTTCTAG
GCATTTTAACAAGTTTGGAGAAGTTGTAAAAGTCATCATAGTAACTGATGCAACTACCGGGCAACCCAAAGGGTCAGCTTATGTGGAGTTCATGAGAAAAGAAGCTGCAG
AGAATGCGTTATCTCTGGATGGGACCTCGTTTATGTCTAGGATTCTGAAGGTCGTAAGGAAAAATGCTTCGCATTTAGAAGGTGCTTCAATTGTTGCGTGGCCTCGTGCT
GTGCGTGGCTCCCCATTTCCTACTCCTAGATTTACCAGAGTTCCTTTCCCGAGGGGCATTTCCAGTGGATTTAGACCTCGTCCCCCCATCAAACTTGGTGCCAGGAGCAT
GCAGTGGAAGCGGGATAGTCAGGCCACATCTGCTGACAATGGTGCGTCTATCTCTGGTAATTCTATACCCTCATCTGGTGCTCGTAGTCTCACCTATGTTCGAACAGAAC
CTAAGCCAGCTGACAAGTAG
mRNA sequence
ATGGGGAGTGTGGATCGTGTCGATGATCGGACGTTCAAGGTTGATTTCAGTGGCGAGGGAATGGCGAAGCTCAGAGAGAGGATAAAGCTGAAGATGAAGGAATTCATGGG
CGATTATACTGATGACACTCTTGTGGAATATGTGATTGTCTTATTGAGGAACGGCAGGCGCAAAGATGAAGCAAAGAATGAGCTAAATGTATTTTTGGGAGACGATAGTC
ATTCTTTTGTATCTTGGCTGTGGGACCATCTGGCCTCAAGTATGGATTTATATGTGGAGCAGCCTACAAAATCTTCTGCCAATGAGGTGCCCAGACCAAAAAAGAGGAAG
TCTGAAAAGTTAAGTAGCAGACGGCGTAATAGGGAATGGAAAGGGCTTGCTAGTGATGAAACCCGCGTTCCTCCAAGATCTGAAGTTAGTCGTGTTAAGCATTCTTCTCC
TGAACAAGTTCCTAGTCATAGGAAAAGGAGCCGAGCTGATGATCATCAGGGCACTGAGAGGGAGGCAGCCTTTCAGGTAAGCATTGCTGCCCCTCGGCGACTGCTCCAGT
TTGCAATGCGAGATGCTGTGGCCACCACCAGGCCGTCTAATGCAGCAAAGGAGCCCCTTTCGAAGCGTCTTCGTTCAGTAGTTTCTACATCCAATAGTGACACAACTAAT
CGTCCCAGGCGGCTTCAGTCTGTTGCAAAAGTGCCAAATCCTATGGCAACTGTTATTAAGGCTGTGACAGAGGCAGCTGAAGATGTGATAAGAGTTAAATCCTCCAGTGT
TTTTGATCGACTTGGTCGTCAATCTCATGATATGGATTTAACAGAGTCCAGTGGCCAACTTGCAGAATATAGAGTAGCTGCTGTAGAGGACGAAAAATATGGGGATATGA
CTCATACACAGGATCAACCATACCAGCCACCTATCTTGAGAGAAGCAATTATAGATCATAGAGTATTTGATGATTCTTGGACTGCAGAATCGGGAGTAAGAAAGGGAGGC
AATTTGCGGACTGTGCCATTCAGGGTAGTTGAGAATGCTGATGATGATACACGCATAACACAATATAAACAGAAGGATCAACCTTCCTCAGTTGCAAATTCCTCACGTGA
CATTGTAAATATCTCTGTGAATGTTAATACTTGGAAGCCACCTCACTATCAGGACCCAGGGCAGATTGCTGAACTTGGTAGTCAAAAGTTTTTGCAGGAGAGTGAGTTAC
AAGGTACCAGATCTGCTGTGCAAGTAATGGAGAATGGCGAGCCAGTCACTATTGTTAACCAACAGAAAAAGCATGCAGCAAACCCTCAAAAAGAGTTCCAGAAACCTCCA
TTATCTGCTAATGGCCAATTTGCTGCCGCTCGTCCTTTAGAGAATGCTGATGCGCGAACCATTTTTGTTAGCAATGTTCACTTTGCTGCTACAAAGGATAGCCTTTCTAG
GCATTTTAACAAGTTTGGAGAAGTTGTAAAAGTCATCATAGTAACTGATGCAACTACCGGGCAACCCAAAGGGTCAGCTTATGTGGAGTTCATGAGAAAAGAAGCTGCAG
AGAATGCGTTATCTCTGGATGGGACCTCGTTTATGTCTAGGATTCTGAAGGTCGTAAGGAAAAATGCTTCGCATTTAGAAGGTGCTTCAATTGTTGCGTGGCCTCGTGCT
GTGCGTGGCTCCCCATTTCCTACTCCTAGATTTACCAGAGTTCCTTTCCCGAGGGGCATTTCCAGTGGATTTAGACCTCGTCCCCCCATCAAACTTGGTGCCAGGAGCAT
GCAGTGGAAGCGGGATAGTCAGGCCACATCTGCTGACAATGGTGCGTCTATCTCTGGTAATTCTATACCCTCATCTGGTGCTCGTAGTCTCACCTATGTTCGAACAGAAC
CTAAGCCAGCTGACAAGTAG
Protein sequence
MGSVDRVDDRTFKVDFSGEGMAKLRERIKLKMKEFMGDYTDDTLVEYVIVLLRNGRRKDEAKNELNVFLGDDSHSFVSWLWDHLASSMDLYVEQPTKSSANEVPRPKKRK
SEKLSSRRRNREWKGLASDETRVPPRSEVSRVKHSSPEQVPSHRKRSRADDHQGTEREAAFQVSIAAPRRLLQFAMRDAVATTRPSNAAKEPLSKRLRSVVSTSNSDTTN
RPRRLQSVAKVPNPMATVIKAVTEAAEDVIRVKSSSVFDRLGRQSHDMDLTESSGQLAEYRVAAVEDEKYGDMTHTQDQPYQPPILREAIIDHRVFDDSWTAESGVRKGG
NLRTVPFRVVENADDDTRITQYKQKDQPSSVANSSRDIVNISVNVNTWKPPHYQDPGQIAELGSQKFLQESELQGTRSAVQVMENGEPVTIVNQQKKHAANPQKEFQKPP
LSANGQFAAARPLENADARTIFVSNVHFAATKDSLSRHFNKFGEVVKVIIVTDATTGQPKGSAYVEFMRKEAAENALSLDGTSFMSRILKVVRKNASHLEGASIVAWPRA
VRGSPFPTPRFTRVPFPRGISSGFRPRPPIKLGARSMQWKRDSQATSADNGASISGNSIPSSGARSLTYVRTEPKPADK