; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017351 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017351
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCCA-adding enzyme
Genome locationtig00153047:157527..171504
RNA-Seq ExpressionSgr017351
SyntenySgr017351
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0006633 - fatty acid biosynthetic process (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0031177 - phosphopantetheine binding (molecular function)
InterPro domainsIPR002646 - Poly A polymerase, head domain
IPR003231 - Acyl carrier protein (ACP)
IPR006162 - Phosphopantetheine attachment site
IPR009081 - Phosphopantetheine binding ACP domain
IPR020806 - Polyketide synthase, phosphopantetheine-binding domain
IPR032828 - tRNA nucleotidyltransferase/poly(A) polymerase, RNA and SrmB- binding domain
IPR036736 - ACP-like superfamily
IPR043519 - Nucleotidyltransferase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134981.1 uncharacterized protein LOC111007098 isoform X1 [Momordica charantia]3.2e-18365.51Show/hide
Query:  LGMAISGLGGLSCSCRSFLALQTPLFSF---VRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRK
        + ++IS LGG+  SCRS+L L TPLF F   +RKLR T I++G+ + QT    NVH+N  SL  P TE  DNDSKL  WKRFSS ELGIS FMI +PTRK
Subjt:  LGMAISGLGGLSCSCRSFLALQTPLFSF---VRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRK

Query:  VLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYV
        VLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVS+ FSW EIVGRRFPICHVHIDDTIVEVSSFST+SRP DRHLNCA+EKP NCEEEDYV
Subjt:  VLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYV

Query:  RWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYM
        RWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIR+AKV+T+VPA +SFQEDCARILRAIRVAARLGFHISKDTAR IK LSCLVSRL KGRLLMEMNYM
Subjt:  RWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYM

Query:  LSYGSAEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNM
        LSYGSAEASLRLLWKYGLLEILLPIQ +                                                        +VVAAFSLAVHNGGNM
Subjt:  LSYGSAEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNM

Query:  MDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPL
        ++AISIAR+I+R HN+                                  TDE+ VSLALEMYPQAPASDL                        VFIP 
Subjt:  MDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPL

Query:  AVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
         VYLKVCK+FECVV GAERGF PKRG INYESLALGNLLELRHVFAR+
Subjt:  AVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

XP_022134982.1 uncharacterized protein LOC111007098 isoform X2 [Momordica charantia]1.9e-16767.78Show/hide
Query:  EVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTI
        E  DNDSKL  WKRFSS ELGIS FMI +PTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVS+ FSW EIVGRRFPICHVHIDDTI
Subjt:  EVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTI

Query:  VEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLG
        VEVSSFST+SRP DRHLNCA+EKP NCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIR+AKV+T+VPA +SFQEDCARILRAIRVAARLG
Subjt:  VEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLG

Query:  FHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES---------------------------------------
        FHISKDTAR IK LSCLVSRL KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQ +                                       
Subjt:  FHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES---------------------------------------

Query:  -----------------IVVAAFSLAVHNGGNMMDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAP
                         +VVAAFSLAVHNGGNM++AISIAR+I+R HN+                                  TDE+ VSLALEMYPQAP
Subjt:  -----------------IVVAAFSLAVHNGGNMMDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAP

Query:  ASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
        ASDL                        VFIP  VYLKVCK+FECVV GAERGF PKRG INYESLALGNLLELRHVFAR+
Subjt:  ASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

XP_023516570.1 uncharacterized protein LOC111780413 isoform X4 [Cucurbita pepo subsp. pepo]1.4e-16762.06Show/hide
Query:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL
        M ISGL G+  SCR +L  +TPLF F+RK     +    P     P+L +   +        E DDNDSKL KWK FSSKELGI  FMI KPTR+VLNGL
Subjt:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL

Query:  KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNC
        KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVS+ FSW EIVGR+FPICHVHID +IVEVSSFSTSSRPFDRHLN AIEKPMNCEEEDYVRWKNC
Subjt:  KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNC

Query:  LQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGS
        LQRDFTINGLM+DPYNS VYDYL GMEDI+QAK+RT+VPA TSFQEDCARILRAIRVAARL F+ +KDTA  IK LSCLVS L+K RL MEMNY+LSYGS
Subjt:  LQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGS

Query:  AEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAIS
        AEASLRLLWKYGLLEILLPIQ +                                                        +VVAAFSLAVHNGGNMM+AIS
Subjt:  AEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAIS

Query:  IARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLK
        IA+SISRAHN+                                  TDEH VSL LEMYPQAPASDL                        VFIPL VYLK
Subjt:  IARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLK

Query:  VCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
        V KIFECVVEGAER FVPKRGK+NYE LALG+L ELRH FAR+
Subjt:  VCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

XP_038879020.1 poly(A) polymerase I-like isoform X1 [Benincasa hispida]8.1e-17964.38Show/hide
Query:  LGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGY
        LG L  S R +L L TPLF FVRKL P  I+VG+ V +T  + N H  SP +     E DDNDSKL  WKRFSSKELGI+  MI KPTRKVLNGLKK+GY
Subjt:  LGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGY

Query:  EVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCA-IEKPMNCEEEDYVRWKNCLQRD
        EVYLVGGCVRDLILNR PKDFDIITSAELKEVS+ FSW EIVGRRFPICHVHID T++EVSSFST+SRPFDRHLN A IEKPMNC+EEDYVRWKNCLQRD
Subjt:  EVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCA-IEKPMNCEEEDYVRWKNCLQRD

Query:  FTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEAS
        FTINGLM+DPYNS+VYDYLGGMEDIRQAKVRT++PA TSFQEDCARILRAIRVAARL FH +KDTA  IKNLSCLVS LDKGRLLMEMNY+LSYGS+EAS
Subjt:  FTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEAS

Query:  LRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAISIARS
        +RLLWKYGLLEILLPIQ +                                                        +VVAAFSLAVHNGGNMM+AISIA+S
Subjt:  LRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAISIARS

Query:  ISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKI
        ISRAHN+                                  TDEHFVSLALEMYPQAPASDL                        VFIPL VYLKV K 
Subjt:  ISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKI

Query:  FECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
        F CV EGAERGF+PKRGKINYE LALGNLLELRHVFAR+
Subjt:  FECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

XP_038879021.1 uncharacterized protein LOC120071069 isoform X2 [Benincasa hispida]2.5e-17263.08Show/hide
Query:  LGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGY
        LG L  S R +L L TPLF FVRKL P  I+VG+ V +T  + N H  SP +     E DDNDSKL  WKRFSSKELGI+  MI KPTRKVLNGLKK+  
Subjt:  LGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGY

Query:  EVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCA-IEKPMNCEEEDYVRWKNCLQRD
             GGCVRDLILNR PKDFDIITSAELKEVS+ FSW EIVGRRFPICHVHID T++EVSSFST+SRPFDRHLN A IEKPMNC+EEDYVRWKNCLQRD
Subjt:  EVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCA-IEKPMNCEEEDYVRWKNCLQRD

Query:  FTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEAS
        FTINGLM+DPYNS+VYDYLGGMEDIRQAKVRT++PA TSFQEDCARILRAIRVAARL FH +KDTA  IKNLSCLVS LDKGRLLMEMNY+LSYGS+EAS
Subjt:  FTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEAS

Query:  LRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAISIARS
        +RLLWKYGLLEILLPIQ +                                                        +VVAAFSLAVHNGGNMM+AISIA+S
Subjt:  LRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAISIARS

Query:  ISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKI
        ISRAHN+                                  TDEHFVSLALEMYPQAPASDL                        VFIPL VYLKV K 
Subjt:  ISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKI

Query:  FECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
        F CV EGAERGF+PKRGKINYE LALGNLLELRHVFAR+
Subjt:  FECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

TrEMBL top hitse value%identityAlignment
A0A6J1BZB1 uncharacterized protein LOC111007098 isoform X11.5e-18365.51Show/hide
Query:  LGMAISGLGGLSCSCRSFLALQTPLFSF---VRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRK
        + ++IS LGG+  SCRS+L L TPLF F   +RKLR T I++G+ + QT    NVH+N  SL  P TE  DNDSKL  WKRFSS ELGIS FMI +PTRK
Subjt:  LGMAISGLGGLSCSCRSFLALQTPLFSF---VRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRK

Query:  VLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYV
        VLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVS+ FSW EIVGRRFPICHVHIDDTIVEVSSFST+SRP DRHLNCA+EKP NCEEEDYV
Subjt:  VLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYV

Query:  RWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYM
        RWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIR+AKV+T+VPA +SFQEDCARILRAIRVAARLGFHISKDTAR IK LSCLVSRL KGRLLMEMNYM
Subjt:  RWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYM

Query:  LSYGSAEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNM
        LSYGSAEASLRLLWKYGLLEILLPIQ +                                                        +VVAAFSLAVHNGGNM
Subjt:  LSYGSAEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNM

Query:  MDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPL
        ++AISIAR+I+R HN+                                  TDE+ VSLALEMYPQAPASDL                        VFIP 
Subjt:  MDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPL

Query:  AVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
         VYLKVCK+FECVV GAERGF PKRG INYESLALGNLLELRHVFAR+
Subjt:  AVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

A0A6J1C1D0 uncharacterized protein LOC111007098 isoform X29.1e-16867.78Show/hide
Query:  EVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTI
        E  DNDSKL  WKRFSS ELGIS FMI +PTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVS+ FSW EIVGRRFPICHVHIDDTI
Subjt:  EVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTI

Query:  VEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLG
        VEVSSFST+SRP DRHLNCA+EKP NCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIR+AKV+T+VPA +SFQEDCARILRAIRVAARLG
Subjt:  VEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLG

Query:  FHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES---------------------------------------
        FHISKDTAR IK LSCLVSRL KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQ +                                       
Subjt:  FHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES---------------------------------------

Query:  -----------------IVVAAFSLAVHNGGNMMDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAP
                         +VVAAFSLAVHNGGNM++AISIAR+I+R HN+                                  TDE+ VSLALEMYPQAP
Subjt:  -----------------IVVAAFSLAVHNGGNMMDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAP

Query:  ASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
        ASDL                        VFIP  VYLKVCK+FECVV GAERGF PKRG INYESLALGNLLELRHVFAR+
Subjt:  ASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

A0A6J1E1V4 uncharacterized protein LOC111429822 isoform X43.2e-16561.33Show/hide
Query:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL
        M +SGL G+  SCR +L  +TPLF F+RK     +    P     P+L +   +        +  DNDSKL KWK FSSKELGI  FMI KPTRKVLNGL
Subjt:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL

Query:  KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNC
        KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVS+ FSW EIVGR+FPICHVHID +IVEVSSFSTSSRPFDRHLN AIEKPMNCEEEDYVRWKNC
Subjt:  KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNC

Query:  LQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGS
        +QRDFTINGLM+DPYNS VYDYL GMEDI+QAK+RT+VPA TSFQEDCARILRAIRVAARL F+ +KDTA  IK LSCLVS L+K RL MEMNY+LSYGS
Subjt:  LQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGS

Query:  AEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAIS
        AEASLRLLWKYGLLEILLPIQ +                                                        +VVAAFSLAVHNGGNMM+AIS
Subjt:  AEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAIS

Query:  IARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLK
        IA+SISRAHN+                                  TDEH VSL LEMYPQAPASDL                        VFIPL VYLK
Subjt:  IARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLK

Query:  VCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
        V KIFEC VEGAER FVPKRGK+NYE LALG+L ELRH FAR+
Subjt:  VCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

A0A6J1E6A2 uncharacterized protein LOC111429822 isoform X36.1e-16460.11Show/hide
Query:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL
        M +SGL G+  SCR +L  +TPLF F+RK     +    P     P+L +   +        +  DNDSKL KWK FSSKELGI  FMI KPTRKVLNGL
Subjt:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL

Query:  KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNC
        KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVS+ FSW EIVGR+FPICHVHID +IVEVSSFSTSSRPFDRHLN AIEKPMNCEEEDYVRWKNC
Subjt:  KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNC

Query:  LQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGS
        +QRDFTINGLM+DPYNS VYDYL GMEDI+QAK+RT+VPA TSFQEDCARILRAIRVAARL F+ +KDTA  IK LSCLVS L+K RL MEMNY+LSYGS
Subjt:  LQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGS

Query:  AEASLRLLWKYGLLEILLPIQES-------------------------------------------------------------------IVVAAFSLAV
        AEASLRLLWKYGLLEILLPIQ +                                                                   +VVAAFSLAV
Subjt:  AEASLRLLWKYGLLEILLPIQES-------------------------------------------------------------------IVVAAFSLAV

Query:  HNGGNMMDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDY
        HNGGNMM+AISIA+SISRAHN+                                  TDEH VSL LEMYPQAPASDL                       
Subjt:  HNGGNMMDAISIARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDY

Query:  KVFIPLAVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
         VFIPL VYLKV KIFEC VEGAER FVPKRGK+NYE LALG+L ELRH FAR+
Subjt:  KVFIPLAVYLKVCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

A0A6J1JFZ7 uncharacterized protein LOC111484780 isoform X42.1e-16461.14Show/hide
Query:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL
        M ISGL G+  SCR +L  +TPLF F+RK     +    P     P+L +   +        E  DNDSKL KWK FSSKELGI  FMI KPTRKVLNGL
Subjt:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL

Query:  KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNC
        KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVS+ FSW EIVG +FPICHVHID +IVEVSSFSTSSRPFDRHL+ AIEKPMNCEEEDYVRWKNC
Subjt:  KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNC

Query:  LQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGS
        LQRDFTINGLM+DPYNS VYDYL GMEDI+QAK+RT+VPA TSFQEDCARILRAIRVAARL F+ +KDTA  IK LSCLVS L+K RL MEMNY+LSYGS
Subjt:  LQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGS

Query:  AEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAIS
        AEASLRLLWKYGLLEILLPIQ +                                                        +VVAAFSLAVHNGGNMM+AIS
Subjt:  AEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDAIS

Query:  IARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLK
        IA+SI RAHN+                                  TDEH VSL LEMYPQAPAS                        Y VFIPL VYLK
Subjt:  IARSISRAHNI----------------------------------TDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLK

Query:  VCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL
        V KIFECVVEGAE  FVPKRGK+NY+ LALG+L ELRHVFAR+
Subjt:  VCKIFECVVEGAERGFVPKRGKINYESLALGNLLELRHVFARL

SwissProt top hitse value%identityAlignment
P0ABF1 Poly(A) polymerase I2.2e-3334.35Show/hide
Query:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF--STSSRPFDRHLNC
        IS   I++   KV+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++V K+F    +VGRRF + HV     I+EV++F         DR  + 
Subjt:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF--STSSRPFDRHLNC

Query:  AIEKPMNCEEEDYVRWKNCLQ-RDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLV
          +  M   +  +   +   Q RDFTIN L +   +  V DY+GGM+D++   +R +    T ++ED  R+LRA+R AA+LG  IS +TA  I  L+ L+
Subjt:  AIEKPMNCEEEDYVRWKNCLQ-RDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLV

Query:  SRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQESIVVAAFSLAVHNGGNMMDAI---SIARSISRAHN--ITDEHFVSLALEMYP
        + +   RL  E   +L  G    + +LL +Y L + L P     +   F+    NG + M+ I    +  + +R HN    +  F+  A+  YP
Subjt:  SRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQESIVVAAFSLAVHNGGNMMDAI---SIARSISRAHN--ITDEHFVSLALEMYP

P0ABF2 Poly(A) polymerase I2.2e-3334.35Show/hide
Query:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF--STSSRPFDRHLNC
        IS   I++   KV+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++V K+F    +VGRRF + HV     I+EV++F         DR  + 
Subjt:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF--STSSRPFDRHLNC

Query:  AIEKPMNCEEEDYVRWKNCLQ-RDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLV
          +  M   +  +   +   Q RDFTIN L +   +  V DY+GGM+D++   +R +    T ++ED  R+LRA+R AA+LG  IS +TA  I  L+ L+
Subjt:  AIEKPMNCEEEDYVRWKNCLQ-RDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLV

Query:  SRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQESIVVAAFSLAVHNGGNMMDAI---SIARSISRAHN--ITDEHFVSLALEMYP
        + +   RL  E   +L  G    + +LL +Y L + L P     +   F+    NG + M+ I    +  + +R HN    +  F+  A+  YP
Subjt:  SRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQESIVVAAFSLAVHNGGNMMDAI---SIARSISRAHN--ITDEHFVSLALEMYP

P0ABF3 Poly(A) polymerase I2.2e-3334.35Show/hide
Query:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF--STSSRPFDRHLNC
        IS   I++   KV+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++V K+F    +VGRRF + HV     I+EV++F         DR  + 
Subjt:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF--STSSRPFDRHLNC

Query:  AIEKPMNCEEEDYVRWKNCLQ-RDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLV
          +  M   +  +   +   Q RDFTIN L +   +  V DY+GGM+D++   +R +    T ++ED  R+LRA+R AA+LG  IS +TA  I  L+ L+
Subjt:  AIEKPMNCEEEDYVRWKNCLQ-RDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLV

Query:  SRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQESIVVAAFSLAVHNGGNMMDAI---SIARSISRAHN--ITDEHFVSLALEMYP
        + +   RL  E   +L  G    + +LL +Y L + L P     +   F+    NG + M+ I    +  + +R HN    +  F+  A+  YP
Subjt:  SRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQESIVVAAFSLAVHNGGNMMDAI---SIARSISRAHN--ITDEHFVSLALEMYP

Q8Z9C3 Poly(A) polymerase I2.2e-3337.24Show/hide
Query:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF--STSSRPFDRHLNC
        IS   I++   KVL  L K GYE YLVGG VRDL+L + PKDFD+ T+A   +V K+F    +VGRRF + HV     I+EV++F         DR  + 
Subjt:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF--STSSRPFDRHLNC

Query:  AIEKPMNCEEEDYVRWKNCLQ-RDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLV
          +  M   +  +   +   Q RDFTIN L +   +  V DY+GGM+D+++  +R +    T ++ED  R+LRA+R AA+L   IS +TA  I  L+ L+
Subjt:  AIEKPMNCEEEDYVRWKNCLQ-RDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLV

Query:  SRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLP
        + +   RL  E   +L  G+   + + L +Y L + L P
Subjt:  SRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLP

Q8ZRQ8 Poly(A) polymerase I3.3e-3437.96Show/hide
Query:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF------STSSRPFDR
        IS   I++   KVL  L K GYE YLVGG VRDL+L + PKDFD+ T+A   +V K+F    +VGRRF + HV     I+EV++F      S S R   +
Subjt:  ISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSF------STSSRPFDR

Query:  HLNCAI---EKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIK
             +   +      EED  R      RDFTIN L +   +  V DY+GGM+D+++  +R +    T ++ED  R+LRA+R AA+L  HIS +TA  I 
Subjt:  HLNCAI---EKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIK

Query:  NLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLP
         L+ L++ +   RL  E   +L  G+   + + L +Y L + L P
Subjt:  NLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLP

Arabidopsis top hitse value%identityAlignment
AT1G28090.1 Polynucleotide adenylyltransferase family protein2.3e-9944.31Show/hide
Query:  NDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVS
        N  K + WK+  + E GI   MI   TR VLN LKKKG++VYLVGGCVRDLIL+R PKDFD+IT+AELKEV K+F   +IVGRRFPICHV++DD I+EVS
Subjt:  NDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVS

Query:  SFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHIS
        SFSTS+R   +  N +  +P  C+E DY+RWKNCLQRDFT+NGLMFDP  ++VYDY+GG+ED+R +KVRT+  A  SF ED ARILRAIR+AARLGF ++
Subjt:  SFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHIS

Query:  KDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES-------------------------------------------
        KD A  +K LS  + RLD  R+ ME+NYML+YGSAEASLRLLW++GL+EILLPIQ S                                           
Subjt:  KDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES-------------------------------------------

Query:  -------------IVVAAFSLAVHNGGNMMDAISIARSISRAHN-------------------------------------ITDEHFVSLALEMYPQAPA
                      VVA+F LA+++  ++ +AI+IARS S+ HN                                     + +  +++ A+  YPQAP 
Subjt:  -------------IVVAAFSLAVHNGGNMMDAISIARSISRAHN-------------------------------------ITDEHFVSLALEMYPQAPA

Query:  SDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVE--GAERGFVPK-RGKINYESLALGNLLELRHVFARL
        SD+V    L         M E  E               K+F  V      ER  VP    +INY+SLALG+  E R VFAR+
Subjt:  SDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVE--GAERGFVPK-RGKINYESLALGNLLELRHVFARL

AT1G28090.2 Polynucleotide adenylyltransferase family protein2.3e-9944.31Show/hide
Query:  NDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVS
        N  K + WK+  + E GI   MI   TR VLN LKKKG++VYLVGGCVRDLIL+R PKDFD+IT+AELKEV K+F   +IVGRRFPICHV++DD I+EVS
Subjt:  NDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVS

Query:  SFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHIS
        SFSTS+R   +  N +  +P  C+E DY+RWKNCLQRDFT+NGLMFDP  ++VYDY+GG+ED+R +KVRT+  A  SF ED ARILRAIR+AARLGF ++
Subjt:  SFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHIS

Query:  KDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES-------------------------------------------
        KD A  +K LS  + RLD  R+ ME+NYML+YGSAEASLRLLW++GL+EILLPIQ S                                           
Subjt:  KDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES-------------------------------------------

Query:  -------------IVVAAFSLAVHNGGNMMDAISIARSISRAHN-------------------------------------ITDEHFVSLALEMYPQAPA
                      VVA+F LA+++  ++ +AI+IARS S+ HN                                     + +  +++ A+  YPQAP 
Subjt:  -------------IVVAAFSLAVHNGGNMMDAISIARSISRAHN-------------------------------------ITDEHFVSLALEMYPQAPA

Query:  SDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVE--GAERGFVPK-RGKINYESLALGNLLELRHVFARL
        SD+V    L         M E  E               K+F  V      ER  VP    +INY+SLALG+  E R VFAR+
Subjt:  SDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVE--GAERGFVPK-RGKINYESLALGNLLELRHVFARL

AT1G28090.3 Polynucleotide adenylyltransferase family protein2.3e-9944.31Show/hide
Query:  NDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVS
        N  K + WK+  + E GI   MI   TR VLN LKKKG++VYLVGGCVRDLIL+R PKDFD+IT+AELKEV K+F   +IVGRRFPICHV++DD I+EVS
Subjt:  NDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVS

Query:  SFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHIS
        SFSTS+R   +  N +  +P  C+E DY+RWKNCLQRDFT+NGLMFDP  ++VYDY+GG+ED+R +KVRT+  A  SF ED ARILRAIR+AARLGF ++
Subjt:  SFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHIS

Query:  KDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES-------------------------------------------
        KD A  +K LS  + RLD  R+ ME+NYML+YGSAEASLRLLW++GL+EILLPIQ S                                           
Subjt:  KDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQES-------------------------------------------

Query:  -------------IVVAAFSLAVHNGGNMMDAISIARSISRAHN-------------------------------------ITDEHFVSLALEMYPQAPA
                      VVA+F LA+++  ++ +AI+IARS S+ HN                                     + +  +++ A+  YPQAP 
Subjt:  -------------IVVAAFSLAVHNGGNMMDAISIARSISRAHN-------------------------------------ITDEHFVSLALEMYPQAPA

Query:  SDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVE--GAERGFVPK-RGKINYESLALGNLLELRHVFARL
        SD+V    L         M E  E               K+F  V      ER  VP    +INY+SLALG+  E R VFAR+
Subjt:  SDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVE--GAERGFVPK-RGKINYESLALGNLLELRHVFARL

AT3G48830.1 polynucleotide adenylyltransferase family protein / RNA recognition motif (RRM)-containing protein4.5e-11146.99Show/hide
Query:  VHMNSPSLSMPLTEVDDND----------SKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSK
        V +N+ + +M   +  D+D          SK  +WK+ +SK+LGI+T MI+KPTR VLNGLK KGY+VYLVGGCVRDLIL RTPKDFDI+TSAEL+EV +
Subjt:  VHMNSPSLSMPLTEVDDND----------SKLLKWKRFSSKELGISTFMIAKPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSK

Query:  IFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVP
         FS  EI+G++FPICHVHI + ++EVSSFSTS++   R+      K     +ED +R+ NCLQRDFTINGLMFDPY  ++YDYLGG+EDI++AKVRT+  
Subjt:  IFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVP

Query:  ARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQESI---------------
        A TSFQED ARILR  R+AARLGF ISK+TA F+KNLS LV RL +GR+L+EMNYML+YGSAEASLRLLWK+G+LEILLPIQ +                
Subjt:  ARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQESI---------------

Query:  -----------------------------------------VVAAFSLAVHNGGNMMDAISIARSISRAHN-----------------------------
                                                 VVAAFSLAVHNGG++++A+   R +++ HN                             
Subjt:  -----------------------------------------VVAAFSLAVHNGGNMMDAISIARSISRAHN-----------------------------

Query:  -----ITDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVEGAERGFVPKR
             +TD  F+S A+  YPQAP SD+                        VFIPL +YL   +IFECV E  ++GFVPK+
Subjt:  -----ITDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVEGAERGFVPKR

AT5G23690.1 Polynucleotide adenylyltransferase family protein1.9e-11746.35Show/hide
Query:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVG-KPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNG
        MAIS +G    +CRSF  ++T     + K+R  +++   + + ++D           +S        ++ +  +WK+ +SK+LG+S+ MIAK TRKVLNG
Subjt:  MAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVG-KPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNG

Query:  LKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCE-EEDYVRWK
        LK KG++VYLVGGCVRDLIL RTPKDFDI+TSAEL+EV + F   EIVGRRFPICHVHI D ++EVSSFSTS++   R+     ++    + +ED +R  
Subjt:  LKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCE-EEDYVRWK

Query:  NCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSY
        NCLQRDFTINGLMFDPY  +VYDYLGGMEDIR+AKVRT++ A TSF +DCARILRAIR+AARLGF +SK+TA FIKNLS LV RLDKGR+LMEMNYML+Y
Subjt:  NCLQRDFTINGLMFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSY

Query:  GSAEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDA
        GSAEASLRLLWK+G+LEILLPIQ +                                                        IVVAAFSLAVHN G++++A
Subjt:  GSAEASLRLLWKYGLLEILLPIQES--------------------------------------------------------IVVAAFSLAVHNGGNMMDA

Query:  ISIARSISRAHN-----------------------------------ITDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAV
        + I + I+R H+                                   +TD +F+S A+  YPQAP SDL                        VFIPL +
Subjt:  ISIARSISRAHN-----------------------------------ITDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAV

Query:  YLKVCKIFECVVEGAER-GFVPKRG-KINYESLALGNLLELRHVFARL
        YL+  +IF+CV     R GF  K+G KI Y SL  G   E+RHVFAR+
Subjt:  YLKVCKIFECVVEGAER-GFVPKRG-KINYESLALGNLLELRHVFARL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCTTGTCTCTAAAGGTGATCTGTGAAAACCGTGCAAGGAAAGTCACAGCAGATGTACCAGAGAAGCGTTCCCGTGCTGCTCCTGCTCCACACCGGCTGAGGCCTGA
TATTTGTTGCATGGTGGCATATATGCCACTTCTGCCTGTGACACCAAGCCATTCAATATCTGAAGGAAAAATGCGTAACCAATTTCTGAGGGAGGAGGCGGGGAGTTTTG
ATGTTATCCACGACAGTAGAGGAAGCAGCGCTGTGAGTTTCGAAGGCAGATCGCGTTCTTACTTCCTAGTAGTAATCGCTTTGCGCGGTTTTAACTTCACTTCACCTTCG
TCCTCTCCTCGATTTCGTTTCCATTTGGGCATGGCGATATCGGGTTTAGGTGGTCTGTCTTGCAGTTGCAGGTCCTTTCTCGCACTCCAAACTCCACTCTTCAGCTTCGT
CCGCAAGCTTCGCCCCACTTCGATTTCAGTGGGAAAGCCGGTTCATCAAACAGATCCCCTGCTCAATGTCCACATGAATTCTCCCTCTCTCTCAATGCCATTGACAGAAG
TCGATGATAATGATTCTAAGCTTCTCAAGTGGAAGAGGTTTTCTTCCAAGGAGCTTGGGATTAGTACTTTCATGATTGCCAAACCTACCAGAAAAGTTCTTAACGGACTC
AAGAAAAAAGGATATGAAGTTTACCTTGTAGGAGGTTGTGTTCGGGATCTGATCTTGAATAGAACGCCAAAGGATTTTGACATAATAACTTCAGCTGAGCTTAAGGAGGT
GTCAAAAATATTTTCATGGAGTGAAATAGTTGGGAGGAGGTTTCCTATATGCCACGTGCACATTGATGATACCATTGTAGAGGTGTCAAGTTTTAGCACGTCCAGTCGGC
CATTCGATAGACACTTGAACTGTGCTATTGAAAAGCCTATGAACTGTGAAGAGGAAGATTATGTCCGTTGGAAGAATTGCTTGCAACGTGACTTTACCATTAACGGGTTG
ATGTTTGACCCATACAACAGTATAGTGTACGACTACTTGGGAGGAATGGAGGATATAAGACAAGCTAAAGTACGTACTTTAGTTCCTGCCCGTACGTCCTTTCAAGAGGA
TTGTGCTCGAATCCTGCGAGCAATCAGAGTTGCAGCTCGTTTGGGATTCCACATTTCAAAGGATACTGCTCGGTTTATTAAAAATTTATCCTGCTTGGTGTCTAGACTCG
ATAAGGGAAGGCTTCTGATGGAAATGAACTATATGTTGTCTTATGGTTCTGCTGAGGCTTCTTTGAGGTTGTTATGGAAATATGGACTACTAGAAATACTTCTACCAATT
CAGGAGTCAATAGTGGTTGCAGCATTTAGCCTTGCAGTCCACAATGGTGGCAATATGATGGACGCAATCAGCATAGCTAGGAGCATCAGTAGAGCACATAATATAACTGA
TGAACATTTTGTGTCTCTAGCCCTGGAAATGTATCCTCAAGCACCAGCATCGGATCTAGTAATTTTTCTGCATTTGACCTCTTTTCTTTTTCAAAATCTTGAAATGTATG
AAACTGCAGAAGATTATAAGGTTTTTATCCCATTGGCGGTGTACTTGAAGGTATGCAAAATTTTTGAGTGCGTTGTAGAGGGTGCAGAGAGAGGATTTGTTCCAAAGCGA
GGAAAGATTAATTATGAGTCTTTGGCTCTTGGAAACCTGCTTGAACTTCGGCATGTCTTTGCAAGATTGTGTTTGACACTATCTACCCTCTTAATCTCAATCATACCCAC
GCATAAAAGGGCAGGTTCATTGGATCATCTATTGAATCTTCTAGAAGTTCAGATAATTACCTGGAAATCGTTAGTGCATAATCTAGAACTCGCTTTTCACCCAAAAGTTG
TTGATCCATTCAGGGCGCTTATCCTCTTCCAAGAGAAAAAAGAAGAAATAAGTGAGAAATTAGTGATGATTTCCATGAATGCAGTCTTCGAGGACGTGTACAAGGAGTTC
TTTGCCGATTGTTCCCGATCAGAGATCCATGGTAGGGAACCGAGTCTGTCATCATATTGCTTGAATTTAGAGAGGGTTCCCAAATATCAAAATGCAGAAAGAAAAGGGAA
TAATGCAATTGGCAACAATAGGTTGAAGCCTTCACTTGTTGGCCGCAGGAGCAGGGGTAGCGCCGCCAGTTGTCGGCCTACAAGATCAAGCAAATCACATATGTTAAGTA
ATGACAAGTTAAATGCAACCGTGGAGCTGGAGATGGAATGTAGGAAAGATGAAGCCAAAAAGACGTACAGTCCAACAAAAACTTTGAATGCAAATCTACAGGCATCTGAA
TTTTCGCATCATCAAACAAAACTTACATCCCCGGCGGTTGCTCCCTTGGCATTGTTGAGAAAGGAAACAAGATTATCAGCCGGGTGTGACACCACAGCACAGAGGACACC
CGCCACATATCCCCCGGCAAAGCTCACTCCGAGCTGCAGAGATTTGCTACACTGCTCTTTAGGCCTTGGGATCACATACTTGTACAGCATCTCAACAATTGTTTCGAAAG
ATGCAAATTTCATCATGAGAGCGACGTCCGCAATTACTTCAGCAGATGCTGAACCAGCAAGGTATATCAAAGTTTTGTACTTGGCTGCATTCTCTGGTCCAACCATGCGC
CTGAGCACTGTAACCAAGCAAGGTCGGCACCCAACCCCGGAAGAACCCTCTGACTCCCTGCTCCTTCAGCAGGATTCCAAAACCAGACGAGATATTCTTGTACTTTGCTG
GGTCAATCTAAAAGTTGGAATGGAAAAGAGAGCGCCCAAAAGAGTATATCGAAGGAACATGTTTACTATTAGCAGAGTTAGAAATATGAAACTATTGTTTTATCGCGTGA
AGAGATATGACATTAAACTTAAATATTCTAATGATGCAGGAAGGGGGGTGACAGTCATGTGGGTGAGTCCACAGCTCAATATTCCGCCTGCTGTACATGATGCCTCTTCG
AGGACGAGCGAAAACTGGATGGCGTCGATGCATGAGTGCTTGTCCTCATGGTGGAATCCATACAAAGCAGCGTGTGAGAAGACGATGAAGCAGAGTGGAAATGGAACTCA
CTCATATCATATGTGCATGAAACAGACGATTGGAATTAACCATGGACGCAATCTTAATTTGCAGTTGCAGGAGCTTCTTTGGAAGGATAGCAAAATCCCTAAACATCGCG
TTTGCAATAATAATGATAGAAAGATGGAGAAAGTAAGGTTGAGGTTAGGTGAGGAGCAAATGAGAGACTACTTGCCTCAGATTCCAGCTGCCATTGAACCCACCTACCAA
AATCAACCAGCTTCTGAGAGAAGTAGTTCAAGTTTAAAACTTTTCTCCGGTGGATTGAGTAGGAATGGTTTGCATGTTCTGAGAACATCCCACCTTCGCGTGTGTTGTGG
GGCAAAAGCAGAGACAGTTGATAAGGTGTGTTCAATTGTGAGGAAACACTTGGCTTTGGCTGCCGACTCAGAGCTCACCTCTGAATCCAAGTTCTCAGCTCTTGGTGCCG
ACTCCCTGGACACAGTGGAGATAATCATGACCTTGGAGGAAGAATTTGGCATCACTATCGAAGAGGACAATGCTCAGAACATAACGACAGTTCAAGAAGCTGCCGATTTG
ATTGAGGACCTTGTCAATAAGAAATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCTTGTCTCTAAAGGTGATCTGTGAAAACCGTGCAAGGAAAGTCACAGCAGATGTACCAGAGAAGCGTTCCCGTGCTGCTCCTGCTCCACACCGGCTGAGGCCTGA
TATTTGTTGCATGGTGGCATATATGCCACTTCTGCCTGTGACACCAAGCCATTCAATATCTGAAGGAAAAATGCGTAACCAATTTCTGAGGGAGGAGGCGGGGAGTTTTG
ATGTTATCCACGACAGTAGAGGAAGCAGCGCTGTGAGTTTCGAAGGCAGATCGCGTTCTTACTTCCTAGTAGTAATCGCTTTGCGCGGTTTTAACTTCACTTCACCTTCG
TCCTCTCCTCGATTTCGTTTCCATTTGGGCATGGCGATATCGGGTTTAGGTGGTCTGTCTTGCAGTTGCAGGTCCTTTCTCGCACTCCAAACTCCACTCTTCAGCTTCGT
CCGCAAGCTTCGCCCCACTTCGATTTCAGTGGGAAAGCCGGTTCATCAAACAGATCCCCTGCTCAATGTCCACATGAATTCTCCCTCTCTCTCAATGCCATTGACAGAAG
TCGATGATAATGATTCTAAGCTTCTCAAGTGGAAGAGGTTTTCTTCCAAGGAGCTTGGGATTAGTACTTTCATGATTGCCAAACCTACCAGAAAAGTTCTTAACGGACTC
AAGAAAAAAGGATATGAAGTTTACCTTGTAGGAGGTTGTGTTCGGGATCTGATCTTGAATAGAACGCCAAAGGATTTTGACATAATAACTTCAGCTGAGCTTAAGGAGGT
GTCAAAAATATTTTCATGGAGTGAAATAGTTGGGAGGAGGTTTCCTATATGCCACGTGCACATTGATGATACCATTGTAGAGGTGTCAAGTTTTAGCACGTCCAGTCGGC
CATTCGATAGACACTTGAACTGTGCTATTGAAAAGCCTATGAACTGTGAAGAGGAAGATTATGTCCGTTGGAAGAATTGCTTGCAACGTGACTTTACCATTAACGGGTTG
ATGTTTGACCCATACAACAGTATAGTGTACGACTACTTGGGAGGAATGGAGGATATAAGACAAGCTAAAGTACGTACTTTAGTTCCTGCCCGTACGTCCTTTCAAGAGGA
TTGTGCTCGAATCCTGCGAGCAATCAGAGTTGCAGCTCGTTTGGGATTCCACATTTCAAAGGATACTGCTCGGTTTATTAAAAATTTATCCTGCTTGGTGTCTAGACTCG
ATAAGGGAAGGCTTCTGATGGAAATGAACTATATGTTGTCTTATGGTTCTGCTGAGGCTTCTTTGAGGTTGTTATGGAAATATGGACTACTAGAAATACTTCTACCAATT
CAGGAGTCAATAGTGGTTGCAGCATTTAGCCTTGCAGTCCACAATGGTGGCAATATGATGGACGCAATCAGCATAGCTAGGAGCATCAGTAGAGCACATAATATAACTGA
TGAACATTTTGTGTCTCTAGCCCTGGAAATGTATCCTCAAGCACCAGCATCGGATCTAGTAATTTTTCTGCATTTGACCTCTTTTCTTTTTCAAAATCTTGAAATGTATG
AAACTGCAGAAGATTATAAGGTTTTTATCCCATTGGCGGTGTACTTGAAGGTATGCAAAATTTTTGAGTGCGTTGTAGAGGGTGCAGAGAGAGGATTTGTTCCAAAGCGA
GGAAAGATTAATTATGAGTCTTTGGCTCTTGGAAACCTGCTTGAACTTCGGCATGTCTTTGCAAGATTGTGTTTGACACTATCTACCCTCTTAATCTCAATCATACCCAC
GCATAAAAGGGCAGGTTCATTGGATCATCTATTGAATCTTCTAGAAGTTCAGATAATTACCTGGAAATCGTTAGTGCATAATCTAGAACTCGCTTTTCACCCAAAAGTTG
TTGATCCATTCAGGGCGCTTATCCTCTTCCAAGAGAAAAAAGAAGAAATAAGTGAGAAATTAGTGATGATTTCCATGAATGCAGTCTTCGAGGACGTGTACAAGGAGTTC
TTTGCCGATTGTTCCCGATCAGAGATCCATGGTAGGGAACCGAGTCTGTCATCATATTGCTTGAATTTAGAGAGGGTTCCCAAATATCAAAATGCAGAAAGAAAAGGGAA
TAATGCAATTGGCAACAATAGGTTGAAGCCTTCACTTGTTGGCCGCAGGAGCAGGGGTAGCGCCGCCAGTTGTCGGCCTACAAGATCAAGCAAATCACATATGTTAAGTA
ATGACAAGTTAAATGCAACCGTGGAGCTGGAGATGGAATGTAGGAAAGATGAAGCCAAAAAGACGTACAGTCCAACAAAAACTTTGAATGCAAATCTACAGGCATCTGAA
TTTTCGCATCATCAAACAAAACTTACATCCCCGGCGGTTGCTCCCTTGGCATTGTTGAGAAAGGAAACAAGATTATCAGCCGGGTGTGACACCACAGCACAGAGGACACC
CGCCACATATCCCCCGGCAAAGCTCACTCCGAGCTGCAGAGATTTGCTACACTGCTCTTTAGGCCTTGGGATCACATACTTGTACAGCATCTCAACAATTGTTTCGAAAG
ATGCAAATTTCATCATGAGAGCGACGTCCGCAATTACTTCAGCAGATGCTGAACCAGCAAGGTATATCAAAGTTTTGTACTTGGCTGCATTCTCTGGTCCAACCATGCGC
CTGAGCACTGTAACCAAGCAAGGTCGGCACCCAACCCCGGAAGAACCCTCTGACTCCCTGCTCCTTCAGCAGGATTCCAAAACCAGACGAGATATTCTTGTACTTTGCTG
GGTCAATCTAAAAGTTGGAATGGAAAAGAGAGCGCCCAAAAGAGTATATCGAAGGAACATGTTTACTATTAGCAGAGTTAGAAATATGAAACTATTGTTTTATCGCGTGA
AGAGATATGACATTAAACTTAAATATTCTAATGATGCAGGAAGGGGGGTGACAGTCATGTGGGTGAGTCCACAGCTCAATATTCCGCCTGCTGTACATGATGCCTCTTCG
AGGACGAGCGAAAACTGGATGGCGTCGATGCATGAGTGCTTGTCCTCATGGTGGAATCCATACAAAGCAGCGTGTGAGAAGACGATGAAGCAGAGTGGAAATGGAACTCA
CTCATATCATATGTGCATGAAACAGACGATTGGAATTAACCATGGACGCAATCTTAATTTGCAGTTGCAGGAGCTTCTTTGGAAGGATAGCAAAATCCCTAAACATCGCG
TTTGCAATAATAATGATAGAAAGATGGAGAAAGTAAGGTTGAGGTTAGGTGAGGAGCAAATGAGAGACTACTTGCCTCAGATTCCAGCTGCCATTGAACCCACCTACCAA
AATCAACCAGCTTCTGAGAGAAGTAGTTCAAGTTTAAAACTTTTCTCCGGTGGATTGAGTAGGAATGGTTTGCATGTTCTGAGAACATCCCACCTTCGCGTGTGTTGTGG
GGCAAAAGCAGAGACAGTTGATAAGGTGTGTTCAATTGTGAGGAAACACTTGGCTTTGGCTGCCGACTCAGAGCTCACCTCTGAATCCAAGTTCTCAGCTCTTGGTGCCG
ACTCCCTGGACACAGTGGAGATAATCATGACCTTGGAGGAAGAATTTGGCATCACTATCGAAGAGGACAATGCTCAGAACATAACGACAGTTCAAGAAGCTGCCGATTTG
ATTGAGGACCTTGTCAATAAGAAATCTTAA
Protein sequenceShow/hide protein sequence
MILSLKVICENRARKVTADVPEKRSRAAPAPHRLRPDICCMVAYMPLLPVTPSHSISEGKMRNQFLREEAGSFDVIHDSRGSSAVSFEGRSRSYFLVVIALRGFNFTSPS
SSPRFRFHLGMAISGLGGLSCSCRSFLALQTPLFSFVRKLRPTSISVGKPVHQTDPLLNVHMNSPSLSMPLTEVDDNDSKLLKWKRFSSKELGISTFMIAKPTRKVLNGL
KKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSKIFSWSEIVGRRFPICHVHIDDTIVEVSSFSTSSRPFDRHLNCAIEKPMNCEEEDYVRWKNCLQRDFTINGL
MFDPYNSIVYDYLGGMEDIRQAKVRTLVPARTSFQEDCARILRAIRVAARLGFHISKDTARFIKNLSCLVSRLDKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPI
QESIVVAAFSLAVHNGGNMMDAISIARSISRAHNITDEHFVSLALEMYPQAPASDLVIFLHLTSFLFQNLEMYETAEDYKVFIPLAVYLKVCKIFECVVEGAERGFVPKR
GKINYESLALGNLLELRHVFARLCLTLSTLLISIIPTHKRAGSLDHLLNLLEVQIITWKSLVHNLELAFHPKVVDPFRALILFQEKKEEISEKLVMISMNAVFEDVYKEF
FADCSRSEIHGREPSLSSYCLNLERVPKYQNAERKGNNAIGNNRLKPSLVGRRSRGSAASCRPTRSSKSHMLSNDKLNATVELEMECRKDEAKKTYSPTKTLNANLQASE
FSHHQTKLTSPAVAPLALLRKETRLSAGCDTTAQRTPATYPPAKLTPSCRDLLHCSLGLGITYLYSISTIVSKDANFIMRATSAITSADAEPARYIKVLYLAAFSGPTMR
LSTVTKQGRHPTPEEPSDSLLLQQDSKTRRDILVLCWVNLKVGMEKRAPKRVYRRNMFTISRVRNMKLLFYRVKRYDIKLKYSNDAGRGVTVMWVSPQLNIPPAVHDASS
RTSENWMASMHECLSSWWNPYKAACEKTMKQSGNGTHSYHMCMKQTIGINHGRNLNLQLQELLWKDSKIPKHRVCNNNDRKMEKVRLRLGEEQMRDYLPQIPAAIEPTYQ
NQPASERSSSSLKLFSGGLSRNGLHVLRTSHLRVCCGAKAETVDKVCSIVRKHLALAADSELTSESKFSALGADSLDTVEIIMTLEEEFGITIEEDNAQNITTVQEAADL
IEDLVNKKS