; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0018 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0018
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionCCA-adding enzyme
Genome locationMC06:138312..147984
RNA-Seq ExpressionMC06g0018
SyntenyMC06g0018
Gene Ontology termsGO:0001680 - tRNA 3'-terminal CCA addition (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR002646 - Poly A polymerase, head domain
IPR032828 - tRNA nucleotidyltransferase/poly(A) polymerase, RNA and SrmB- binding domain
IPR043519 - Nucleotidyltransferase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134981.1 uncharacterized protein LOC111007098 isoform X1 [Momordica charantia]0.099.81Show/hide
Query:  MAVSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVL
        MAVSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVL
Subjt:  MAVSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVL

Query:  NGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRW
        NGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRW
Subjt:  NGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRW

Query:  KNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLS
        KNCLQRDFTING LMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLS
Subjt:  KNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLS

Query:  YGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIE
        YGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIE
Subjt:  YGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIE

Query:  AISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRG
        AISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRG
Subjt:  AISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRG

Query:  NINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV
        NINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV
Subjt:  NINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV

XP_022134982.1 uncharacterized protein LOC111007098 isoform X2 [Momordica charantia]0.099.79Show/hide
Query:  ESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTI
        ESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTI
Subjt:  ESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTI

Query:  VEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARL
        VEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTING LMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARL
Subjt:  VEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARL

Query:  GFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLW
        GFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLW
Subjt:  GFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLW

Query:  VAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAP
        VAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAP
Subjt:  VAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAP

Query:  ASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV
        ASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV
Subjt:  ASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV

XP_023516570.1 uncharacterized protein LOC111780413 isoform X4 [Cucurbita pepo subsp. pepo]8.54e-29279.17Show/hide
Query:  LGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTP---FTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLK
        LG +SCR YLP  TPLFRF   +RK         Q L+Q  S     I    LR P   F E  DNDSKL  WK FSS ELGI  FMIP+PTR+VLNGLK
Subjt:  LGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTP---FTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLK

Query:  KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCL
        KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGR+FPICHVHID +IVEVSSFST+SRP DRHLN A+EKP NCEEEDYVRWKNCL
Subjt:  KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCL

Query:  QRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGS
        QRDFTING LM+DPYNS VYDYL GMEDI++AK++TVVPA +SFQEDCARILRAIRVAARL F+ +KDTA SIKKLSCLVS L K RL MEMNY+LSYGS
Subjt:  QRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGS

Query:  AEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAIS
        AEASLRLLWKYGLLEILLPIQAAYFIQ GFRRSDK SNMLLSLFSS+DKLLAPNRPCH S+WVAVLAFH ALSDQPRSPLVVAAFSLAVHNGGNM+EAIS
Subjt:  AEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAIS

Query:  IARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNIN
        IA++I+R HNV FHELLEP++L+VQ LIDEVMDLTTS+K+AL KMTDE+SVSL LEMYPQAPASDLVFIP  VYLKV K+FECVV GAER F PKRG +N
Subjt:  IARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNIN

Query:  YESLALGNLLELRHVFARIVFDTIYPLNLNHTQ
        YE LALG+L ELRH FARIVFDT+YPL+LNHT+
Subjt:  YESLALGNLLELRHVFARIVFDTIYPLNLNHTQ

XP_038879020.1 poly(A) polymerase I-like isoform X1 [Benincasa hispida]1.29e-30282.03Show/hide
Query:  GVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYE
        G+S R YLPLHTPLFRF   +RKL   LI +G+ + +THS  N H     + T F E  DNDSKL NWKRFSS ELGI+  MIP+PTRKVLNGLKK+GYE
Subjt:  GVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYE

Query:  VYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCA-LEKPENCEEEDYVRWKNCLQRDF
        VYLVGGCVRDLILNR PKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHID T++EVSSFST SRP DRHLN A +EKP NC+EEDYVRWKNCLQRDF
Subjt:  VYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCA-LEKPENCEEEDYVRWKNCLQRDF

Query:  TINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEAS
        TING LM+DPYNS+VYDYLGGMEDIR+AKV+TV+PA +SFQEDCARILRAIRVAARL FH +KDTA SIK LSCLVS L KGRLLMEMNY+LSYGS+EAS
Subjt:  TINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEAS

Query:  LRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARN
        +RLLWKYGLLEILLPIQAAYFIQ GFRRSDKRSNMLLSLFSSLDKLLAPNRPCH SLWVAVLAFHAALS QPRSPLVVAAFSLAVHNGGNM+EAISIA++
Subjt:  LRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARN

Query:  INREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESL
        I+R HNV FHELLEP++L+VQALIDEVMDLTTSVK AL KMTDE+ VSLALEMYPQAPASDLVFIP  VYLKV K F CV  GAERGF PKRG INYE L
Subjt:  INREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESL

Query:  ALGNLLELRHVFARIVFDTIYPL
        ALGNLLELRHVFARIVFDT+YPL
Subjt:  ALGNLLELRHVFARIVFDTIYPL

XP_038879021.1 uncharacterized protein LOC120071069 isoform X2 [Benincasa hispida]3.31e-29480.69Show/hide
Query:  GVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYE
        G+S R YLPLHTPLFRF   +RKL   LI +G+ + +THS  N H     + T F E  DNDSKL NWKRFSS ELGI+  MIP+PTRKVLNGLKK+G  
Subjt:  GVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYE

Query:  VYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCA-LEKPENCEEEDYVRWKNCLQRDF
             GCVRDLILNR PKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHID T++EVSSFST SRP DRHLN A +EKP NC+EEDYVRWKNCLQRDF
Subjt:  VYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCA-LEKPENCEEEDYVRWKNCLQRDF

Query:  TINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEAS
        TING LM+DPYNS+VYDYLGGMEDIR+AKV+TV+PA +SFQEDCARILRAIRVAARL FH +KDTA SIK LSCLVS L KGRLLMEMNY+LSYGS+EAS
Subjt:  TINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEAS

Query:  LRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARN
        +RLLWKYGLLEILLPIQAAYFIQ GFRRSDKRSNMLLSLFSSLDKLLAPNRPCH SLWVAVLAFHAALS QPRSPLVVAAFSLAVHNGGNM+EAISIA++
Subjt:  LRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARN

Query:  INREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESL
        I+R HNV FHELLEP++L+VQALIDEVMDLTTSVK AL KMTDE+ VSLALEMYPQAPASDLVFIP  VYLKV K F CV  GAERGF PKRG INYE L
Subjt:  INREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESL

Query:  ALGNLLELRHVFARIVFDTIYPL
        ALGNLLELRHVFARIVFDT+YPL
Subjt:  ALGNLLELRHVFARIVFDTIYPL

TrEMBL top hitse value%identityAlignment
A0A6J1BZB1 uncharacterized protein LOC111007098 isoform X10.099.81Show/hide
Query:  MAVSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVL
        MAVSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVL
Subjt:  MAVSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVL

Query:  NGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRW
        NGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRW
Subjt:  NGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRW

Query:  KNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLS
        KNCLQRDFTING LMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLS
Subjt:  KNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLS

Query:  YGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIE
        YGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIE
Subjt:  YGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIE

Query:  AISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRG
        AISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRG
Subjt:  AISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRG

Query:  NINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV
        NINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV
Subjt:  NINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV

A0A6J1C1D0 uncharacterized protein LOC111007098 isoform X20.099.79Show/hide
Query:  ESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTI
        ESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTI
Subjt:  ESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTI

Query:  VEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARL
        VEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTING LMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARL
Subjt:  VEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARL

Query:  GFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLW
        GFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLW
Subjt:  GFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLW

Query:  VAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAP
        VAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAP
Subjt:  VAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAP

Query:  ASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV
        ASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV
Subjt:  ASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV

A0A6J1E0Z5 uncharacterized protein LOC111429822 isoform X24.42e-28976.77Show/hide
Query:  LGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTP---FTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLK
        LG +SCR YLP  TPLFRF   +RK         Q L+Q  S     I    LR P   F E  DNDSKL  WK FSS ELGI  FMIP+PTRKVLNGLK
Subjt:  LGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTP---FTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLK

Query:  KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCL
        KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGR+FPICHVHID +IVEVSSFST+SRP DRHLN A+EKP NCEEEDYVRWKNC+
Subjt:  KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCL

Query:  QRDFTINGW------------------LMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSR
        QRDFTINGW                  LM+DPYNS VYDYL GMEDI++AK++TVVPA +SFQEDCARILRAIRVAARL F+ +KDTA SIKKLSCLVS 
Subjt:  QRDFTINGW------------------LMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSR

Query:  L-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVV
        L K RL MEMNY+LSYGSAEASLRLLWKYGLLEILLPIQAAYFIQ GFRRSDK SNMLLSLFSS+DKLLAPNRPCH SLWVAVLAFH ALSDQPRSPLVV
Subjt:  L-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVV

Query:  AAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFE
        AAFSLAVHNGGNM+EAISIA++I+R HNV FHELLEP++L+VQ LIDEVMDLTTS+K+AL KMTDE+SVSL LEMYPQAPASDLVFIP  VYLKV K+FE
Subjt:  AAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFE

Query:  CVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQ
        C V GAER F PKRG +NYE LALG+L ELRH FARIVFDT+YPL+LNHT+
Subjt:  CVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQ

A0A6J1E1V4 uncharacterized protein LOC111429822 isoform X42.67e-29079.17Show/hide
Query:  LGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTP---FTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLK
        LG +SCR YLP  TPLFRF   +RK         Q L+Q  S     I    LR P   F E  DNDSKL  WK FSS ELGI  FMIP+PTRKVLNGLK
Subjt:  LGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTP---FTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLK

Query:  KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCL
        KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGR+FPICHVHID +IVEVSSFST+SRP DRHLN A+EKP NCEEEDYVRWKNC+
Subjt:  KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCL

Query:  QRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGS
        QRDFTING LM+DPYNS VYDYL GMEDI++AK++TVVPA +SFQEDCARILRAIRVAARL F+ +KDTA SIKKLSCLVS L K RL MEMNY+LSYGS
Subjt:  QRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGS

Query:  AEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAIS
        AEASLRLLWKYGLLEILLPIQAAYFIQ GFRRSDK SNMLLSLFSS+DKLLAPNRPCH SLWVAVLAFH ALSDQPRSPLVVAAFSLAVHNGGNM+EAIS
Subjt:  AEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAIS

Query:  IARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNIN
        IA++I+R HNV FHELLEP++L+VQ LIDEVMDLTTS+K+AL KMTDE+SVSL LEMYPQAPASDLVFIP  VYLKV K+FEC V GAER F PKRG +N
Subjt:  IARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNIN

Query:  YESLALGNLLELRHVFARIVFDTIYPLNLNHTQ
        YE LALG+L ELRH FARIVFDT+YPL+LNHT+
Subjt:  YESLALGNLLELRHVFARIVFDTIYPLNLNHTQ

A0A6J1JFZ7 uncharacterized protein LOC111484780 isoform X46.49e-28978.8Show/hide
Query:  LGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTP---FTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLK
        LG +SCR YLP  TPLFRF   +RK         Q L+Q  S     I    LR P   F E  DNDSKL  WK FSS ELGI  FMIP+PTRKVLNGLK
Subjt:  LGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTP---FTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLK

Query:  KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCL
        KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVG +FPICHVHID +IVEVSSFST+SRP DRHL+ A+EKP NCEEEDYVRWKNCL
Subjt:  KKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCL

Query:  QRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGS
        QRDFTING LM+DPYNS VYDYL GMEDI++AK++TVVPA +SFQEDCARILRAIRVAARL F+ +KDTA SIKKLSCLVS L K RL MEMNY+LSYGS
Subjt:  QRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGS

Query:  AEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAIS
        AEASLRLLWKYGLLEILLPIQAAYFIQ GFRRSDK SNMLLSLFSS+DKLLAPNRPCH SLWVAVLAFH ALSDQPRSPLVVAAFSLAVHNGGNM+EAIS
Subjt:  AEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAIS

Query:  IARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNIN
        IA++I R HNV FHELLEP++L+VQ LIDEVMDLTTS+K+AL KMTDE+SVSL LEMYPQAPAS LVFIP  VYLKV K+FECVV GAE  F PKRG +N
Subjt:  IARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNIN

Query:  YESLALGNLLELRHVFARIVFDTIYPLNLNHTQ
        Y+ LALG+L ELRHVFARIVFDT+YPL+LNHT+
Subjt:  YESLALGNLLELRHVFARIVFDTIYPLNLNHTQ

SwissProt top hitse value%identityAlignment
P0ABF1 Poly(A) polymerase I1.1e-3235.94Show/hide
Query:  ISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSF------STASRPIDR
        IS   I     KV+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++V + F  C +VGRRF + HV     I+EV++F      + + R   +
Subjt:  ISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSF------STASRPIDR

Query:  HLNCALEKPEN---CEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSI
             +   +N     EED  R      RDFTIN  L +   +  V DY+GGM+D++   ++ +    + ++ED  R+LRA+R AA+LG  IS +TA  I
Subjt:  HLNCALEKPEN---CEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSI

Query:  KKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCG
         +L+ L++ +   RL  E   +L  G    + +LL +Y L + L P    YF + G
Subjt:  KKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCG

P0ABF2 Poly(A) polymerase I1.1e-3235.94Show/hide
Query:  ISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSF------STASRPIDR
        IS   I     KV+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++V + F  C +VGRRF + HV     I+EV++F      + + R   +
Subjt:  ISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSF------STASRPIDR

Query:  HLNCALEKPEN---CEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSI
             +   +N     EED  R      RDFTIN  L +   +  V DY+GGM+D++   ++ +    + ++ED  R+LRA+R AA+LG  IS +TA  I
Subjt:  HLNCALEKPEN---CEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSI

Query:  KKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCG
         +L+ L++ +   RL  E   +L  G    + +LL +Y L + L P    YF + G
Subjt:  KKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCG

P0ABF3 Poly(A) polymerase I1.1e-3235.94Show/hide
Query:  ISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSF------STASRPIDR
        IS   I     KV+  L K GYE +LVGG VRDL+L + PKDFD+ T+A  ++V + F  C +VGRRF + HV     I+EV++F      + + R   +
Subjt:  ISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSF------STASRPIDR

Query:  HLNCALEKPEN---CEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSI
             +   +N     EED  R      RDFTIN  L +   +  V DY+GGM+D++   ++ +    + ++ED  R+LRA+R AA+LG  IS +TA  I
Subjt:  HLNCALEKPEN---CEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSI

Query:  KKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCG
         +L+ L++ +   RL  E   +L  G    + +LL +Y L + L P    YF + G
Subjt:  KKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCG

P44439 Poly(A) polymerase I6.6e-3531.6Show/hide
Query:  KRFSSNELGISNFMI-----PRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFS-WCEIVGRRFPICHVHIDDTIVEVSSF
        +R+  N +  + F I      R    V+  L+++G+E Y+VGGC+RDL+L + PKDFD+ T+A  +++   F   C +VGRRF + H+     I+EV++F
Subjt:  KRFSSNELGISNFMI-----PRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFS-WCEIVGRRFPICHVHIDDTIVEVSSF

Query:  STASRPIDRHLNCALEKPENCEEEDYVRW---KNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFH
          A+    R+ N A +  E     D V     ++  +RDFT+N  L ++P ++ + DY  G++D++  K++ +    + +QED  R+LR+IR  A+L   
Subjt:  STASRPIDRHLNCALEKPENCEEEDYVRW---KNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFH

Query:  ISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLA
        + K + + I++L+ L+  +   RL  E   +L  G    + RLL +YGL E L P  +AYF +   +       M+++  +S D+ +A
Subjt:  ISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLA

Q8ZRQ8 Poly(A) polymerase I2.1e-3336.72Show/hide
Query:  ISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSF------STASRPIDR
        IS   I     KVL  L K GYE YLVGG VRDL+L + PKDFD+ T+A   +V + F  C +VGRRF + HV     I+EV++F      S + R   +
Subjt:  ISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSF------STASRPIDR

Query:  HLNCALEKPEN---CEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSI
             +   +N     EED  R      RDFTIN  L +   +  V DY+GGM+D++   ++ +    + ++ED  R+LRA+R AA+L  HIS +TA  I
Subjt:  HLNCALEKPEN---CEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSI

Query:  KKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCG
         +L+ L++ +   RL  E   +L  G+   + + L +Y L + L P    YF + G
Subjt:  KKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCG

Arabidopsis top hitse value%identityAlignment
AT1G28090.1 Polynucleotide adenylyltransferase family protein2.1e-14557.02Show/hide
Query:  ADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVE
        ++ D  +P WK+  +NE GI   MIP  TR VLN LKKKG++VYLVGGCVRDLIL+R PKDFD+IT+AELKEV + F  C+IVGRRFPICHV++DD I+E
Subjt:  ADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVE

Query:  VSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGF
        VSSFST++R   +  N +  +P  C+E DY+RWKNCLQRDFT+NG LMFDP  ++VYDY+GG+ED+R +KV+TV  A+ SF ED ARILRAIR+AARLGF
Subjt:  VSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGF

Query:  HISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWV
         ++KD A S+K+LS  + RL   R+ ME+NYML+YGSAEASLRLLW++GL+EILLPIQA+Y +  GFRR D RSNMLLSLF +LD+L+AP+RPC   LW+
Subjt:  HISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWV

Query:  AVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEP--DDLDVQALI-DEVMDLTTSVKDALLKMTDENSVSLALEMYPQ
         +LAFH AL DQPR P VVA+F LA+++  ++ EAI+IAR+ +++HN  F EL  P  D  D ++ I  +V+ L  S++ A  K+ + + ++ A+  YPQ
Subjt:  AVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEP--DDLDVQALI-DEVMDLTTSVKDALLKMTDENSVSLALEMYPQ

Query:  APASDLVFIPFTVYLKVCKVFECVVH--GAERGFAPK-RGNINYESLALGNLLELRHVFARIVFDTIYPL
        AP SD+VF+   +  +V K+F  V      ER   P     INY+SLALG+  E R VFARIVFDTIYPL
Subjt:  APASDLVFIPFTVYLKVCKVFECVVH--GAERGFAPK-RGNINYESLALGNLLELRHVFARIVFDTIYPL

AT1G28090.2 Polynucleotide adenylyltransferase family protein2.1e-14557.02Show/hide
Query:  ADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVE
        ++ D  +P WK+  +NE GI   MIP  TR VLN LKKKG++VYLVGGCVRDLIL+R PKDFD+IT+AELKEV + F  C+IVGRRFPICHV++DD I+E
Subjt:  ADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVE

Query:  VSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGF
        VSSFST++R   +  N +  +P  C+E DY+RWKNCLQRDFT+NG LMFDP  ++VYDY+GG+ED+R +KV+TV  A+ SF ED ARILRAIR+AARLGF
Subjt:  VSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGF

Query:  HISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWV
         ++KD A S+K+LS  + RL   R+ ME+NYML+YGSAEASLRLLW++GL+EILLPIQA+Y +  GFRR D RSNMLLSLF +LD+L+AP+RPC   LW+
Subjt:  HISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWV

Query:  AVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEP--DDLDVQALI-DEVMDLTTSVKDALLKMTDENSVSLALEMYPQ
         +LAFH AL DQPR P VVA+F LA+++  ++ EAI+IAR+ +++HN  F EL  P  D  D ++ I  +V+ L  S++ A  K+ + + ++ A+  YPQ
Subjt:  AVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEP--DDLDVQALI-DEVMDLTTSVKDALLKMTDENSVSLALEMYPQ

Query:  APASDLVFIPFTVYLKVCKVFECVVH--GAERGFAPK-RGNINYESLALGNLLELRHVFARIVFDTIYPL
        AP SD+VF+   +  +V K+F  V      ER   P     INY+SLALG+  E R VFARIVFDTIYPL
Subjt:  APASDLVFIPFTVYLKVCKVFECVVH--GAERGFAPK-RGNINYESLALGNLLELRHVFARIVFDTIYPL

AT1G28090.3 Polynucleotide adenylyltransferase family protein2.1e-14557.02Show/hide
Query:  ADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVE
        ++ D  +P WK+  +NE GI   MIP  TR VLN LKKKG++VYLVGGCVRDLIL+R PKDFD+IT+AELKEV + F  C+IVGRRFPICHV++DD I+E
Subjt:  ADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVE

Query:  VSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGF
        VSSFST++R   +  N +  +P  C+E DY+RWKNCLQRDFT+NG LMFDP  ++VYDY+GG+ED+R +KV+TV  A+ SF ED ARILRAIR+AARLGF
Subjt:  VSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGF

Query:  HISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWV
         ++KD A S+K+LS  + RL   R+ ME+NYML+YGSAEASLRLLW++GL+EILLPIQA+Y +  GFRR D RSNMLLSLF +LD+L+AP+RPC   LW+
Subjt:  HISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWV

Query:  AVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEP--DDLDVQALI-DEVMDLTTSVKDALLKMTDENSVSLALEMYPQ
         +LAFH AL DQPR P VVA+F LA+++  ++ EAI+IAR+ +++HN  F EL  P  D  D ++ I  +V+ L  S++ A  K+ + + ++ A+  YPQ
Subjt:  AVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEP--DDLDVQALI-DEVMDLTTSVKDALLKMTDENSVSLALEMYPQ

Query:  APASDLVFIPFTVYLKVCKVFECVVH--GAERGFAPK-RGNINYESLALGNLLELRHVFARIVFDTIYPL
        AP SD+VF+   +  +V K+F  V      ER   P     INY+SLALG+  E R VFARIVFDTIYPL
Subjt:  APASDLVFIPFTVYLKVCKVFECVVH--GAERGFAPK-RGNINYESLALGNLLELRHVFARIVFDTIYPL

AT3G48830.1 polynucleotide adenylyltransferase family protein / RNA recognition motif (RRM)-containing protein6.8e-16057.03Show/hide
Query:  VSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNG
        +++ S+G  +CRS+ P+  PLF       K++L  +A            + H           + ++  SK P WK+ +S +LGI+  MI +PTR VLNG
Subjt:  VSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNG

Query:  LKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKN
        LK KGY+VYLVGGCVRDLIL RTPKDFDI+TSAEL+EV R+FS CEI+G++FPICHVHI + ++EVSSFST+++   R+      K     +ED +R+ N
Subjt:  LKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKN

Query:  CLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSY
        CLQRDFTING LMFDPY  ++YDYLGG+EDI++AKV+TV  A +SFQED ARILR  R+AARLGF ISK+TA  +K LS LV RL +GR+L+EMNYML+Y
Subjt:  CLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLMEMNYMLSY

Query:  GSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEA
        GSAEASLRLLWK+G+LEILLPIQAAY ++ GF+R DKRSN+LLSLF +LDKLLAP++PCH SLW+ +LA H AL+DQPR P VVAAFSLAVHNGG+++EA
Subjt:  GSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEA

Query:  ISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKR
        +   R + + HN  F ELLEP+++D Q L+DEVMD  +S+K+AL +MTD   +S A+  YPQAP SD+VFIP  +YL   ++FECV    ++GF PK+
Subjt:  ISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKR

AT5G23690.1 Polynucleotide adenylyltransferase family protein6.5e-17158.86Show/hide
Query:  VSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPN------WKRFSSNELGISNFMIPRPT
        ++ISS+ G +CRS+ P+ T       +IR    T+ A  +T+ ++                F   +D   K+ +      WK+ +S +LG+S+ MI + T
Subjt:  VSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPN------WKRFSSNELGISNFMIPRPT

Query:  RKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCE-EE
        RKVLNGLK KG++VYLVGGCVRDLIL RTPKDFDI+TSAEL+EV RTF  CEIVGRRFPICHVHI D ++EVSSFST+++   R+     ++    + +E
Subjt:  RKVLNGLKKKGYEVYLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCE-EE

Query:  DYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLME
        D +R  NCLQRDFTING LMFDPY  +VYDYLGGMEDIR+AKV+TV+ A +SF +DCARILRAIR+AARLGF +SK+TA  IK LS LV RL KGR+LME
Subjt:  DYVRWKNCLQRDFTINGWLMFDPYNSIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRL-KGRLLME

Query:  MNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHN
        MNYML+YGSAEASLRLLWK+G+LEILLPIQAAY  + GFRR DKR+NMLLSLF++LDKLLAP+RPCH SLW+A+LAFH AL+D+PRSP+VVAAFSLAVHN
Subjt:  MNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQCGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHN

Query:  GGNMIEAISIARNINREHNVQFHELLEPDD-LDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAER
         G+++EA+ I + I R H+  F EL+EP++ LD Q L+DEVMDL  S++DAL +MTD   +S A+  YPQAP SDLVFIP  +YL+  ++F+CV +   R
Subjt:  GGNMIEAISIARNINREHNVQFHELLEPDD-LDVQALIDEVMDLTTSVKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAER

Query:  -GFAPKRGN-INYESLALGNLLELRHVFARIVFDTIYPLNLN
         GF  K+G+ I Y SL  G   E+RHVFAR+VFDT++PLNL+
Subjt:  -GFAPKRGN-INYESLALGNLLELRHVFARIVFDTIYPLNLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTATCGATATCGAGTTTAGGTGGTGTATCTTGCAGATCTTATCTCCCTCTCCACACACCACTCTTCCGCTTCCGCATTCGCATTCGCAAGCTTCGCCTCACTTT
GATTGCGCTGGGACAGACGCTTCGTCAAACACATTCCCCGCCCAATGTCCACATCAATTATTCCTCTCTCCGAACGCCTTTTACTGAATCCGCTGATAATGATTCTAAGC
TGCCCAACTGGAAGAGGTTTTCTTCCAACGAGCTTGGGATTAGTAATTTCATGATTCCCAGACCTACCAGAAAAGTTCTTAACGGGCTCAAGAAAAAAGGATATGAAGTT
TACCTTGTAGGAGGTTGTGTCCGGGATCTGATCTTGAACAGAACGCCGAAGGATTTTGACATAATCACTTCAGCTGAGCTTAAGGAGGTGTCAAGGACGTTCTCATGGTG
TGAAATAGTTGGGAGGAGGTTTCCTATATGCCATGTGCACATCGATGATACCATTGTAGAGGTGTCAAGTTTTAGCACAGCCAGTCGACCAATTGATAGACACTTGAACT
GTGCTCTTGAAAAGCCTGAGAACTGTGAAGAGGAAGATTATGTCCGTTGGAAGAATTGCTTGCAACGTGACTTTACCATTAACGGGTGGTTGATGTTTGACCCATACAAC
AGTATAGTGTACGACTACTTGGGAGGAATGGAGGATATAAGACGAGCTAAAGTACAAACCGTGGTTCCTGCCCATTCTTCCTTCCAAGAGGATTGTGCTCGAATACTGCG
AGCAATCAGAGTTGCTGCTCGTTTAGGGTTCCACATTTCAAAGGATACTGCTCGGTCTATTAAAAAATTATCCTGCTTGGTGTCTAGACTTAAGGGAAGGCTTCTAATGG
AGATGAACTATATGTTGTCTTATGGTTCTGCTGAGGCTTCTTTGAGATTGTTATGGAAATATGGATTACTAGAAATACTCCTACCAATTCAGGCAGCATACTTTATCCAA
TGTGGATTTCGGAGGTCTGACAAGAGGTCGAATATGCTTTTGTCCCTATTTTCAAGCTTGGATAAACTTCTGGCACCTAATAGGCCCTGTCACTGTAGTTTATGGGTTGC
AGTATTAGCGTTCCACGCTGCCCTGTCTGACCAGCCTAGGAGTCCTTTAGTGGTTGCAGCATTTAGCCTTGCAGTGCACAATGGTGGCAATATGATAGAAGCAATCAGCA
TAGCTAGGAACATCAATAGAGAACATAATGTGCAGTTTCATGAATTGTTAGAACCTGACGATCTGGATGTTCAAGCATTGATTGATGAGGTTATGGATCTTACTACGTCT
GTTAAAGATGCACTTCTTAAGATGACCGATGAGAATTCTGTGTCTCTGGCCCTGGAAATGTATCCTCAAGCACCAGCATCAGATCTGGTTTTTATCCCGTTTACAGTGTA
CTTGAAGGTATGCAAAGTTTTTGAGTGCGTTGTGCATGGTGCGGAGAGAGGGTTTGCTCCAAAGCGAGGAAACATTAATTATGAGTCTCTGGCTCTTGGAAACCTGCTAG
AACTTCGGCATGTCTTTGCAAGGATTGTGTTTGACACTATCTACCCTCTTAATCTCAATCATACCCAAGTATAA
mRNA sequenceShow/hide mRNA sequence
GTCGAAATGTCGGCCCAACGGGAGTTTTCACCGTGTAAGGCCTGAGGTGCGGCCCAGTAGAGAATCCGCTTGTGGGCCCAAGCCCATGTCCTGAATTTCAGCATTTTATT
TTTTGTCCTAAAACGGTAAAACCCTTGGCGCGGGTTTAAGACTTGAGTTTTAACTTCACTTCCAGTTCCTCCTCCTAGCCCTCTCCTCCAGCTCCAGTCCTCCGTTGTCA
TTTTGATTTGGACATGGCGGTATCGATATCGAGTTTAGGTGGTGTATCTTGCAGATCTTATCTCCCTCTCCACACACCACTCTTCCGCTTCCGCATTCGCATTCGCAAGC
TTCGCCTCACTTTGATTGCGCTGGGACAGACGCTTCGTCAAACACATTCCCCGCCCAATGTCCACATCAATTATTCCTCTCTCCGAACGCCTTTTACTGAATCCGCTGAT
AATGATTCTAAGCTGCCCAACTGGAAGAGGTTTTCTTCCAACGAGCTTGGGATTAGTAATTTCATGATTCCCAGACCTACCAGAAAAGTTCTTAACGGGCTCAAGAAAAA
AGGATATGAAGTTTACCTTGTAGGAGGTTGTGTCCGGGATCTGATCTTGAACAGAACGCCGAAGGATTTTGACATAATCACTTCAGCTGAGCTTAAGGAGGTGTCAAGGA
CGTTCTCATGGTGTGAAATAGTTGGGAGGAGGTTTCCTATATGCCATGTGCACATCGATGATACCATTGTAGAGGTGTCAAGTTTTAGCACAGCCAGTCGACCAATTGAT
AGACACTTGAACTGTGCTCTTGAAAAGCCTGAGAACTGTGAAGAGGAAGATTATGTCCGTTGGAAGAATTGCTTGCAACGTGACTTTACCATTAACGGGTGGTTGATGTT
TGACCCATACAACAGTATAGTGTACGACTACTTGGGAGGAATGGAGGATATAAGACGAGCTAAAGTACAAACCGTGGTTCCTGCCCATTCTTCCTTCCAAGAGGATTGTG
CTCGAATACTGCGAGCAATCAGAGTTGCTGCTCGTTTAGGGTTCCACATTTCAAAGGATACTGCTCGGTCTATTAAAAAATTATCCTGCTTGGTGTCTAGACTTAAGGGA
AGGCTTCTAATGGAGATGAACTATATGTTGTCTTATGGTTCTGCTGAGGCTTCTTTGAGATTGTTATGGAAATATGGATTACTAGAAATACTCCTACCAATTCAGGCAGC
ATACTTTATCCAATGTGGATTTCGGAGGTCTGACAAGAGGTCGAATATGCTTTTGTCCCTATTTTCAAGCTTGGATAAACTTCTGGCACCTAATAGGCCCTGTCACTGTA
GTTTATGGGTTGCAGTATTAGCGTTCCACGCTGCCCTGTCTGACCAGCCTAGGAGTCCTTTAGTGGTTGCAGCATTTAGCCTTGCAGTGCACAATGGTGGCAATATGATA
GAAGCAATCAGCATAGCTAGGAACATCAATAGAGAACATAATGTGCAGTTTCATGAATTGTTAGAACCTGACGATCTGGATGTTCAAGCATTGATTGATGAGGTTATGGA
TCTTACTACGTCTGTTAAAGATGCACTTCTTAAGATGACCGATGAGAATTCTGTGTCTCTGGCCCTGGAAATGTATCCTCAAGCACCAGCATCAGATCTGGTTTTTATCC
CGTTTACAGTGTACTTGAAGGTATGCAAAGTTTTTGAGTGCGTTGTGCATGGTGCGGAGAGAGGGTTTGCTCCAAAGCGAGGAAACATTAATTATGAGTCTCTGGCTCTT
GGAAACCTGCTAGAACTTCGGCATGTCTTTGCAAGGATTGTGTTTGACACTATCTACCCTCTTAATCTCAATCATACCCAAGTATAAATTTTTAAGATATGACGAAAGGG
CCAGGTGTCATTTGCACCATTTAGGTGGGAGACCGAATCTGTCATCAAATTGCTTGAATTTACAGAGGGTTCCCAAATATCAAAATCAGAGAGGAGAATTATGCAATTGG
CAACGGAATGTAGCAGTTCCTACAAAAAGTGCTTCCTTGGTGTCAAAACCCTCAATTTTACGGTCCTAACGATGATTAGAATCATTTCCAACTCTTGGAGCTACAAACAT
TTGCAGTTGTTGGCAATTAGTGGGGACAAATCGCGTTCACTTGTTTGCAGGAGTAACAGGGGTAGCGCCGCCAGTTGTTGGCCTACAAGATGAAACAAATCAAATTTAAT
AAAGTATTGAGAGGTTGAATGGAGCTGGAAACGGGCATGTACGAAGAAAGCCGTACAGTCCGACGAAAACTTTGAATGCGTCATATATTCCCCATTGAGACCCAGTAAGG
GTTCCAATCATAACAATCCGAAGGGGAAGCCCACGAGTAAAAAGACCCCATAGACCTAGTTGCCTGACTGCCTGCAAAACCACAGTACAGTAGACAATAAGAAATGTGGA
CGCACACGTTCATCTTAATTTACATCATCAAACTTACATCTCCGGCGGTTGCTCCCTTGGCATTGTTGAGAAAAGAGACAAGATTATCAGCCGGGTGTGAGACCACAGCA
CAGAGGACGCCCGCCACATCCCCCGGCAAAGCTCACTCCAAGCTGCAGAGATTTGCTACACTGCTCTTTAGGCCTTGGGATGGCATACTTGTACAGCATCTCTACAATTG
TCTCGAAAGATGCAAATTTCATCATGGTATCTGCAAAACCCCAACCCAGCTTTATCATCCCAAACCATTCTCTATTTAACGAGGGTCAGTTTTCACATTTGCAGAATAAG
CAGTGAATTGCTAAACACGGGGAAATTGATGGAGGACCAGGACTGTTGAGGAAAGGAAAGATTGCTTGTTTAAGGGATTTTGACGTTTTTTTTTAAAACTTACATGGTAT
CTGACGTCCCCAAAGAGGAACCAGACCTTTGTAGAGCCTGACATCCAGAGATTTTAAGTTATTAATTAGGAGGAACAAAGTGTTGAGGAGGAAGCATTCATTATTTAGTG
TCGTACCCAGAAACACCCTCAGATTTTACAAACTTTGCGAACCCATCTGCCAAACCCCTAGCAAAACCAGGTTGAGTTTGGACGCGAACTTTGACGGCCTCCATGGGGCA
GAGAGCAACGTCTGCAATTACTTCGGCAGATGCCGAACCAGCAAGGTATAACAAAGTTTTGTACTTGGCTGCATTCTCTGCTCCAACCTTGTCAGAATAGAACTTCTTGA
AGAACTCGTAAAATCCAAACTTGCAGGCACCCTGAGCACTGTAACCAAGCAAGGTCGGCACCCACCCCCGGAAGAACCCTCTCGCTCCCTGCTCCTTGAGCAGAATTCCA
AAACCCGACGAGATATTCTTGTACTTTGTGGGGTCGATCTAAAATTTGGAATAGAAATTAAAATAGTGGCCACTGGCCAGAACCAGAAGAGTATATTAAAAGAACATGTT
TTCTGTATATTGGTAGAGTTAGGAGTATGAAACTAACAATTGTTTAATGGCGTGAAGATAAGATCATGAGGGAGATTGAACTTAGATATAGTAATTATTGAGGACCTGCA
TATTGCACTTGACGAGATCGAGGGGGGTGACAGCCATGTGGGTGAGTCCACAGCTCAAAATTCCGCCTGCTGTGCATGCTGCGTAGAAGGCCGGTGAATGCATCTTAATC
TTCTCCTTTGGAGCTGGAATTGGGAAGCTCTTGCCATAACTGGATCGCGTCGATTCCGTACTACGCAGTGTACGGGAAGATGAGTATAGAAAGCTCGGGATGAGAGATTG
CTGGACATCTTTGGAGAGAGCCATTGGTGCGTTGCCTCTTAGTAATTGGTAATTGGTGTCCTCCTGACACTAATCTCTTTAAGGAAGAGAATATGGATTTGACTATAAAA
AGGGCGGGGGAATTGGAATATATGTGGTGATAATTATTGGCAGATAAAAATTTTATAATATCGATTGCACTGGAAAATGTTTGAGAATTATATTGTCAGAACACACCAAC
AAAAACGGTTGAGAATTTCTCGGGGCTTTCCCTGTTTATAAGCATTGGCTAGGCTTCTTGCCTGAGATTCCAGCTACCTCTAATCCCGCCCACCAAGATCAACATCAACC
AGGTTGTGAGAAGAATTTCAAGTGTGAAATTTGGATGGAGAAAGAATGGTTTGCATGTTCTGAGAACATCCCAGCTCCGTATCTCTTGTGCGGCAAAACCAGAGACAGTT
GATAAAGTGTGTTCAGTTGTGAGGAAACAATTGGCCTTGCCTGCCGAATCAGAGCTCACCCCTGAATCCAAGTTCTTAGCTCTGGGTGCCGACTCCCTCGACCTTGTGGA
GATAATTATGGCCTTAGAGGAAGAATTTGACATCAATATCGAGGAGGATAACGCTCAGAACATAACCACCGTTCAAGAAGCTGCAGATTTGATCGAGGACCTCGTTGTTA
ACCCCAATCTTAACCCCACGGTTGAGTAGTTTCCTCAAAAAACGATTTTTTCAACCAAACCAACATTCTACTTGAGGAAGTCAAAAACCAAATTGGACCTGTTCTAAGAG
ATGCAAGTAATTCATTTCAGCTTTGTTAGAACTTGAAGAGAACTCAATGTGAAGTTTGTCGTGTTGTATTGTTTAATTTTGTGCCTCTCGAGACGAGGGGCGGCTCGACC
AGTCAATTTCATGGATTGTGGGGGTGTGGTGGAGACCGGAGTTGACGCTACTGCCCTACTAACCAAAGCTTTTCAAAGCATAGAGTCTCTTTTAAGATAAAGCTCTCTAC
CTTTTTGCCTCGGAAAGAGAGAAGGCAAGGCGGAAACAAAGTGATTGTAGTTGTAGATATGAAAATCCAATTTCTAAAACTCAAATGATAAAAAAATGCATTATTGAGTG
TGTTGGGGCCGGGGGTTAAGCGCATGGTATGGAAAATGATATTGGTACCTGTGTGTCGCTTTGTATTTTGGTGGTTGGTGTTATTTGTAGAGCGCCACAATTTAAGCAAA
TCAGAGGAACCCCACTGGCACCTTTTCCAGAGTGGTCCAAAGGCCTACACTACACTACACTACACACTGTAAAGTCTCCATCTATAGCTATACAACAAATGAGAAAATGG
TTTCGTTTGGATCCGCTGCTTCTTGGTAGCTCACCTACTACACTAAAGCTACCAGTAGGCATGATATGAATTATGAAATATGAAGTGAGGGAGAGGCCCCATTACCATTA
TCATAAATTGGGGGGTTGGTTCAAGCATGTGGATCTTGATGATAACCAATAATTAGATTAAGGCATTCTTTCCCGCACCCTCACTTCCCTTCCAGATTATCTTATTTGAA
CCTGAAGGAGCATTTGAAAGTCGGGATACAACCTATTATTTGGCTGTAGAGCAGTGCAGTTATAGGCGAAGTACAAGTCGAACAAGAAAACCCAATTGTCTCCTCGAAAA
ACTAAAAAGAACCCCTGAAAATCCTGTCAAGATTCTTCAAAATCCCCCAAGAGTCTGTCAGATAATCATACCATACTTGAAATCTAATAGAAATTTGTGTTACAAATAAC
TGTACAACTGATTTCTGAGAGTGGAAAGTAG
Protein sequenceShow/hide protein sequence
MAVSISSLGGVSCRSYLPLHTPLFRFRIRIRKLRLTLIALGQTLRQTHSPPNVHINYSSLRTPFTESADNDSKLPNWKRFSSNELGISNFMIPRPTRKVLNGLKKKGYEV
YLVGGCVRDLILNRTPKDFDIITSAELKEVSRTFSWCEIVGRRFPICHVHIDDTIVEVSSFSTASRPIDRHLNCALEKPENCEEEDYVRWKNCLQRDFTINGWLMFDPYN
SIVYDYLGGMEDIRRAKVQTVVPAHSSFQEDCARILRAIRVAARLGFHISKDTARSIKKLSCLVSRLKGRLLMEMNYMLSYGSAEASLRLLWKYGLLEILLPIQAAYFIQ
CGFRRSDKRSNMLLSLFSSLDKLLAPNRPCHCSLWVAVLAFHAALSDQPRSPLVVAAFSLAVHNGGNMIEAISIARNINREHNVQFHELLEPDDLDVQALIDEVMDLTTS
VKDALLKMTDENSVSLALEMYPQAPASDLVFIPFTVYLKVCKVFECVVHGAERGFAPKRGNINYESLALGNLLELRHVFARIVFDTIYPLNLNHTQV