; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0023355 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0023355
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPoly(A) polymerase I
Genome locationchr06:30649155..30660670
RNA-Seq ExpressionPI0023355
SyntenyPI0023355
Gene Ontology termsGO:0001680 - tRNA 3'-terminal CCA addition (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR002646 - Poly A polymerase, head domain
IPR032828 - tRNA nucleotidyltransferase/poly(A) polymerase, RNA and SrmB- binding domain
IPR043519 - Nucleotidyltransferase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052923.1 poly(A) polymerase I [Cucumis melo var. makuwa]6.2e-29377.32Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL
        MASFSLRA+ N STSRLID+IKL RLRHAF+YPA DFHSVGRQDAA+DDGLS EIH +KV W LTN KKK A NGDMPKWNK+NGR FGITR MIPSS  
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL

Query:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ-----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKK
         VL+ LH EGFEAYLVGGCVRDL+L RVPKDFDVITTAGL Q                             VSSFDTTAKHSEG EITAHSQIPKKCDKK
Subjt:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ-----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKK

Query:  DLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMEL
        DLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFK+DCARILRGLRIAARLGLSISKETETAI KFSPSITSL K RLMMEL
Subjt:  DLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMEL

Query:  NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHG
        NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYL++Q I KSS SSVMLMKLF NLDKLVSCARPSN NIWVALLAFHLALVNNPQNSLIVLAFAATLYHG
Subjt:  NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHG

Query:  EWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYK
        EWNEGVNYAR KSLVEIN RPEITRSA+FKS  ELAEGVT FAL VQGCIAALTSADCLLEAMSTFPASSNSGLVFV  K ARDVANIFEVLV DVESYK
Subjt:  EWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYK

Query:  NKRQNFKIDYQLLGK-GTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKL
        +KR+NF++DY+ LG  G L ENRYVLG+VILETL+ AILQGN N+LD NQNL IDA TKETS+SPVADLVQEQLVK+NKKVRKLPSVSEVELKANKKYK 
Subjt:  NKRQNFKIDYQLLGK-GTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKL

Query:  VRNEGSISDKVVENGRCINMTEIHTVEINPRKMDKVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKN
        V+ EGSISDKV       NM E      NP  M             E ++++  K+  +K  +++   VEI PRKMDKVA  + K +KKEHH+VPQGK+N
Subjt:  VRNEGSISDKVVENGRCINMTEIHTVEINPRKMDKVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKN

Query:  IKKKHRDITNIEQRRGPLSSLFK
        IK KHRDITNI+Q +GPLSSLFK
Subjt:  IKKKHRDITNIEQRRGPLSSLFK

XP_011650261.1 uncharacterized protein LOC101212579 isoform X1 [Cucumis sativus]0.0e+0077.06Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL
        MASFSLRA+NNASTSRLIDLIKL RL HAF+YPA DFHSVGRQDAA DDGLS EIH +KVAW LTN KKK AANGDMPKWNKINGRAFG+TRSMIPSSS 
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL

Query:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKKD
         VL++LH EGFEAYLVGGCVRDLLL+RVPKDFDVITTAGL Q                            VSSFDT AKHSE N+ITAHSQIPKKCDKKD
Subjt:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKKD

Query:  LIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELN
        LIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFK+DCARILRGLRIAARLGLSISKETETAIHKFSPSITSL KSRLMMELN
Subjt:  LIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELN

Query:  YMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGE
        YMLSYGAAVPSLYLLQRF LL  LLPFHAAYLDKQ I+KSS SSVMLMKLF NLDKLVSCA PSN NIWVALLAFHLALVNNPQNSL+VLAFAATLYHGE
Subjt:  YMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGE

Query:  WNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKN
        WNEGVNYAREKSLVEIN RPEITRSA+FKS E+LAEGVT FALKVQGCIAALTS DCLLEAMSTFPASSNSGLVFV NKTARDVA IFEVL K V+SYK+
Subjt:  WNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKN

Query:  KRQNFKIDYQLLGKGT-LSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKLV
        ++++FKIDY+ LGKG  L ENRYVLG++ILETL+DAILQGN N+ DRNQNL IDA TKETSDSPVADLVQEQLVK NKKVRK PSVSEVELKANKKYKLV
Subjt:  KRQNFKIDYQLLGKGT-LSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKLV

Query:  RNEGSISDKVVENGRCINMTE-------------------------------------------------------------------IHTVEINPRKMD
        R EGSISDKVVENGRCINMTE                                                                   +H VEINPRKMD
Subjt:  RNEGSISDKVVENGRCINMTE-------------------------------------------------------------------IHTVEINPRKMD

Query:  KVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK
        KVAGQE KSEKKEH +V QGK+NIKKKRRDITDTVEINPRKMDKVAE + KLKK EHHQVPQG +NIK +H DITNI+Q +GPLSSLFK
Subjt:  KVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK

XP_011650262.1 uncharacterized protein LOC101212579 isoform X2 [Cucumis sativus]2.3e-28776.45Show/hide
Query:  LTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ---------------------
        LTN KKK AANGDMPKWNKINGRAFG+TRSMIPSSS  VL++LH EGFEAYLVGGCVRDLLL+RVPKDFDVITTAGL Q                     
Subjt:  LTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ---------------------

Query:  -------VSSFDTTAKHSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLR
               VSSFDT AKHSE N+ITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFK+DCARILRGLR
Subjt:  -------VSSFDTTAKHSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLR

Query:  IAARLGLSISKETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARP
        IAARLGLSISKETETAIHKFSPSITSL KSRLMMELNYMLSYGAAVPSLYLLQRF LL  LLPFHAAYLDKQ I+KSS SSVMLMKLF NLDKLVSCA P
Subjt:  IAARLGLSISKETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARP

Query:  SNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMS
        SN NIWVALLAFHLALVNNPQNSL+VLAFAATLYHGEWNEGVNYAREKSLVEIN RPEITRSA+FKS E+LAEGVT FALKVQGCIAALTS DCLLEAMS
Subjt:  SNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMS

Query:  TFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKNKRQNFKIDYQLLGKGT-LSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDS
        TFPASSNSGLVFV NKTARDVA IFEVL K V+SYK+++++FKIDY+ LGKG  L ENRYVLG++ILETL+DAILQGN N+ DRNQNL IDA TKETSDS
Subjt:  TFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKNKRQNFKIDYQLLGKGT-LSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDS

Query:  PVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKLVRNEGSISDKVVENGRCINMTE------------------------------------------
        PVADLVQEQLVK NKKVRK PSVSEVELKANKKYKLVR EGSISDKVVENGRCINMTE                                          
Subjt:  PVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKLVRNEGSISDKVVENGRCINMTE------------------------------------------

Query:  -------------------------IHTVEINPRKMDKVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQG
                                 +H VEINPRKMDKVAGQE KSEKKEH +V QGK+NIKKKRRDITDTVEINPRKMDKVAE + KLKK EHHQVPQG
Subjt:  -------------------------IHTVEINPRKMDKVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQG

Query:  KKNIKKKHRDITNIEQRRGPLSSLFK
         +NIK +H DITNI+Q +GPLSSLFK
Subjt:  KKNIKKKHRDITNIEQRRGPLSSLFK

XP_023540398.1 uncharacterized protein LOC111800784 [Cucurbita pepo subsp. pepo]2.6e-24664.73Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAF---------IYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGIT
        MA FSLR  NN   S ++DLIKLQRL HAF         IY A+DFHSVGRQ                           DA+N DMPKWNK++GRAFGI+
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAF---------IYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGIT

Query:  RSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQ
        RSMIPSSS  VLK+LH +GFEAYLVGGCVRDLLL R PKDFDVITTAGL+Q                            VSSF+T AKHS+G E    S 
Subjt:  RSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQ

Query:  IPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLA
         P+KCD+KDLIRWRNS+ RDFTINSLFFDPF N+IYDYAEG+ADLRSLKLRTLIPASLSFK DCARILRGLRIAARLGLS+SK+TETA+ K S SI SL 
Subjt:  IPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLA

Query:  KSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLA
        KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS +S MLMKLF NLDKLVSC RPS+ NIWVALLAFH+ALVNNPQNSLIVLA
Subjt:  KSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLA

Query:  FAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVL
        FAATLYHGEWNEGVNYARE SL++IN RPEITRSAQFKSAEELAE VTHFALKVQGCIAALTSADCLLEAMSTFPAS +S LVFV  K A+DVA I EVL
Subjt:  FAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVL

Query:  VKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQL-VKRNKKVRKLPSVSEVEL
        V DVESYKN R+NF+IDYQLL KG L+E+R+VLG+VILETLK+AI+QG+G +LD  QNLC+DATT+E   SP++D V++QL VKRNKKVRKL S SEV+ 
Subjt:  VKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQL-VKRNKKVRKLPSVSEVEL

Query:  KANKKYKLVRNEGSISDKVVENGRCINMTE-----IHTVEINPRKMDKVAGQEGKSEKKEHRQVLQG---KENIK------------------KKRRDIT
        + NKK KL   EGSI D+VVE+ RC+N+ E     +   ++    ++ +     +S K  H +V +    +EN++                  K  +++ 
Subjt:  KANKKYKLVRNEGSISDKVVENGRCINMTE-----IHTVEINPRKMDKVAGQEGKSEKKEHRQVLQG---KENIK------------------KKRRDIT

Query:  DTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK
          VE NPRKMDKV   E K +KKEH  +PQGK+NI+KK R +T+IEQ + PLSSLFK
Subjt:  DTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK

XP_038904059.1 uncharacterized protein LOC120090455 [Benincasa hispida]1.8e-26068.92Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL
        MASFSL ANN   TSRLIDLIKLQRLRHA IYPA+DFHS+G QDAAN+                            MPKWNKI+GRAFGITRSMIPSSS 
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL

Query:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKKD
        TVLK+L  +GFEAYLVGGCVRDLLL RVPKDFDVITTAGL+Q                            VSSF+T A+H++G E    S+ PKKCDKKD
Subjt:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKKD

Query:  LIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELN
        LIRWRNSM RDFTINSLFFDPF N IYDYAEG+ADLRSLKLRTLIPASLSFK DCARILRGLRIAARLGLS+SKETETA+ + SPSITSLAKSRLMMELN
Subjt:  LIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELN

Query:  YMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGE
        YMLSYGAAVPSLYLLQRFNLLE+LLPF AAYLDKQ IKKSS SS+MLMKLF NLDKL SC  PS+ NIWVALLAFH+ALVNNPQNSLIVLAFAATLYHG+
Subjt:  YMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGE

Query:  WNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKN
        WNEGVNYARE SL++IN RPEIT SAQF S EEL EGVTHFA KVQGCIAALTSAD LLEAMSTFP S  S LV V NKT++DVANIFE LV DVESYKN
Subjt:  WNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKN

Query:  KRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKLVR
        KRQ F IDYQLLGKGTLSE+RYVLG+VILETLKDAILQG+GN+L R QNLC+DATT+ET +SP ADLV+ QL+KRNKKV+KL S  EV+ +ANKKYKL R
Subjt:  KRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKLVR

Query:  NEGSISDKVVENGRCINM---------TEIHTVEINPRKMDK-----VAGQEGKSEK-----KEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHE
         +GSISD+VV+NGRCINM           +H++E +  +  K     V   E   +       E + ++  +E   K  +++  T +IN RKMDKVA  E
Subjt:  NEGSISDKVVENGRCINM---------TEIHTVEINPRKMDK-----VAGQEGKSEK-----KEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHE

Query:  EKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK
         K  KKEHHQVPQGK+NI+KKHRD T++EQ + PLSSLFK
Subjt:  EKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK

TrEMBL top hitse value%identityAlignment
A0A0A0L3K6 Uncharacterized protein0.0e+0077.06Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL
        MASFSLRA+NNASTSRLIDLIKL RL HAF+YPA DFHSVGRQDAA DDGLS EIH +KVAW LTN KKK AANGDMPKWNKINGRAFG+TRSMIPSSS 
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL

Query:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKKD
         VL++LH EGFEAYLVGGCVRDLLL+RVPKDFDVITTAGL Q                            VSSFDT AKHSE N+ITAHSQIPKKCDKKD
Subjt:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKKD

Query:  LIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELN
        LIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFK+DCARILRGLRIAARLGLSISKETETAIHKFSPSITSL KSRLMMELN
Subjt:  LIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELN

Query:  YMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGE
        YMLSYGAAVPSLYLLQRF LL  LLPFHAAYLDKQ I+KSS SSVMLMKLF NLDKLVSCA PSN NIWVALLAFHLALVNNPQNSL+VLAFAATLYHGE
Subjt:  YMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGE

Query:  WNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKN
        WNEGVNYAREKSLVEIN RPEITRSA+FKS E+LAEGVT FALKVQGCIAALTS DCLLEAMSTFPASSNSGLVFV NKTARDVA IFEVL K V+SYK+
Subjt:  WNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKN

Query:  KRQNFKIDYQLLGKGT-LSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKLV
        ++++FKIDY+ LGKG  L ENRYVLG++ILETL+DAILQGN N+ DRNQNL IDA TKETSDSPVADLVQEQLVK NKKVRK PSVSEVELKANKKYKLV
Subjt:  KRQNFKIDYQLLGKGT-LSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKLV

Query:  RNEGSISDKVVENGRCINMTE-------------------------------------------------------------------IHTVEINPRKMD
        R EGSISDKVVENGRCINMTE                                                                   +H VEINPRKMD
Subjt:  RNEGSISDKVVENGRCINMTE-------------------------------------------------------------------IHTVEINPRKMD

Query:  KVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK
        KVAGQE KSEKKEH +V QGK+NIKKKRRDITDTVEINPRKMDKVAE + KLKK EHHQVPQG +NIK +H DITNI+Q +GPLSSLFK
Subjt:  KVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK

A0A5A7UAH1 Poly(A) polymerase I3.0e-29377.32Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL
        MASFSLRA+ N STSRLID+IKL RLRHAF+YPA DFHSVGRQDAA+DDGLS EIH +KV W LTN KKK A NGDMPKWNK+NGR FGITR MIPSS  
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSL

Query:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ-----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKK
         VL+ LH EGFEAYLVGGCVRDL+L RVPKDFDVITTAGL Q                             VSSFDTTAKHSEG EITAHSQIPKKCDKK
Subjt:  TVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ-----------------------------VSSFDTTAKHSEGNEITAHSQIPKKCDKK

Query:  DLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMEL
        DLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFK+DCARILRGLRIAARLGLSISKETETAI KFSPSITSL K RLMMEL
Subjt:  DLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMEL

Query:  NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHG
        NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYL++Q I KSS SSVMLMKLF NLDKLVSCARPSN NIWVALLAFHLALVNNPQNSLIVLAFAATLYHG
Subjt:  NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHG

Query:  EWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYK
        EWNEGVNYAR KSLVEIN RPEITRSA+FKS  ELAEGVT FAL VQGCIAALTSADCLLEAMSTFPASSNSGLVFV  K ARDVANIFEVLV DVESYK
Subjt:  EWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYK

Query:  NKRQNFKIDYQLLGK-GTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKL
        +KR+NF++DY+ LG  G L ENRYVLG+VILETL+ AILQGN N+LD NQNL IDA TKETS+SPVADLVQEQLVK+NKKVRKLPSVSEVELKANKKYK 
Subjt:  NKRQNFKIDYQLLGK-GTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKVRKLPSVSEVELKANKKYKL

Query:  VRNEGSISDKVVENGRCINMTEIHTVEINPRKMDKVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKN
        V+ EGSISDKV       NM E      NP  M             E ++++  K+  +K  +++   VEI PRKMDKVA  + K +KKEHH+VPQGK+N
Subjt:  VRNEGSISDKVVENGRCINMTEIHTVEINPRKMDKVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKN

Query:  IKKKHRDITNIEQRRGPLSSLFK
        IK KHRDITNI+Q +GPLSSLFK
Subjt:  IKKKHRDITNIEQRRGPLSSLFK

A0A6J1CVT5 uncharacterized protein LOC111014843 isoform X41.5e-22859.61Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAF---------IYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGIT
        MA FSLR+N N    RL DLIKLQ LRH F         + P +DFHS GR+D                          +AAN  M KWNK++ RAFGI 
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAF---------IYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGIT

Query:  RSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQ
        RSMIP SS  VL++L  +GFE YLVGGCVRDL+L RVPKDFDVITTAGL+Q                            VSSF+T AKHSEG      SQ
Subjt:  RSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQ

Query:  IPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLA
        IP+KC K+DLIRWRNSM RDFTINSLFFDPF N+IYDYAEG+ DLRSLKLRTLIPASLSFK DCARILRGLRIAARLGLS+SK+TETAI K SPSI SL 
Subjt:  IPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLA

Query:  KSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLA
        K+R+MMELNYMLSYGAAVPSLYLLQRFNLL+ILLPFHAAYLDKQDIK+SS +S+MLMKLF NLDKLVSC RPS+ NIWV LLAFH+ALV NPQNSLIVLA
Subjt:  KSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLA

Query:  FAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVL
        FA TLYHG+WNEGVNYARE SLV+IN RPEITRSAQFKS EELAEGV+HFA KVQGCIAA T ADCL EA  T P S  S LVFV  KTA+DVA IFEVL
Subjt:  FAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVL

Query:  VKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQL-VKRNKKVRKLPSVSEVEL
        V DVES+KNKR+NF+IDYQLLGKG LSE+RYV+G++I ETL  AI+QG+ N+LD+ QNLC+D TTKE  +SPV+D+V++QL V +  KV+KLPS SEV L
Subjt:  VKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQL-VKRNKKVRKLPSVSEVEL

Query:  KANKKYKLVRNEGSISDKVVENGRCINMTEIHTVEINPRKMDKVAGQEGKSEKKEHR-----QVLQGKENIKKKR-------------------------
         ANKK KLV+ EG  ++   E+   + M            MD+V GQE KSEK+E +        +GK    KK                          
Subjt:  KANKKYKLVRNEGSISDKVVENGRCINMTEIHTVEINPRKMDKVAGQEGKSEKKEHR-----QVLQGKENIKKKR-------------------------

Query:  ------------RDITDTVEINPRKMD--KVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK
                    +++   V++NPR M+  +V   E K +KKE + + QGK+N  KKHR +T   Q +GPLSSLFK
Subjt:  ------------RDITDTVEINPRKMD--KVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK

A0A6J1G6Y5 uncharacterized protein LOC1114514261.4e-24564.95Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAF---------IYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGIT
        MA FSLR  NN   S ++DLIKLQRL HAF         IY A+DFHS+GRQDA+N                            DMPKWNK++GRAFGI+
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAF---------IYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGIT

Query:  RSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQ
        RSMIPSSS  VLK+LH +GFEAYLVGGCVRDLLL R PKDFDVITTAGL+Q                            VSSF+T AKHS+G E    S 
Subjt:  RSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQ

Query:  IPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLA
         P+KCD+KDLIRWRNS+ RDFTINSLFFDPF NVIYDYAEG+ADLRSLKLRTLIPASLSFK DCARILRGLRIAARLGLS+SK+TETA+ K S SI SL 
Subjt:  IPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLA

Query:  KSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLA
        KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS +S MLMKLF NLDKLVSC RPS+ NIWVALLAFH+ALVNNPQNSLIVLA
Subjt:  KSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLA

Query:  FAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVL
        FAATLYHGEWNEGVNYARE SL++IN RPEITRSAQFKSAEELAE VTHFALKVQGCIAALTSADCLLEAMSTFPAS +S LVFV  K A+DVA I EVL
Subjt:  FAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVL

Query:  VKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQL-VKRNKKVRKLPSVSEVEL
        V DVESYKN R+NF+IDYQLL KG L+E+R+VLG+VILETLK+AI+Q +G +LD  QNLC+DATT+E   SPV+D V++QL VKRNKKVRKL S SEVE 
Subjt:  VKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQL-VKRNKKVRKLPSVSEVEL

Query:  KANKKYKLVRNEGSISDKVVENGRCINMTE-----IHTVEINPRKMDKVAGQEGKSEKKEHRQVLQG---KENIK-----------------KKRRDITD
        + NKK KL   EGSI D+VVE+GRC+N+ E     +   ++    ++ +     +S K  H +V +    +EN++                 K  +++  
Subjt:  KANKKYKLVRNEGSISDKVVENGRCINMTE-----IHTVEINPRKMDKVAGQEGKSEKKEHRQVLQG---KENIK-----------------KKRRDITD

Query:  TVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK
         VE NP KMDKV   E K +KKEH   PQGK+NI+KK R +T+IEQ + PLSSLFK
Subjt:  TVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK

A0A6J1I3F4 uncharacterized protein LOC1114701846.2e-24664.55Show/hide
Query:  MASFSLRANNNASTSRLIDLIKLQRLRHAF---------IYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGIT
        MA FSLR  NN   S ++DLIKLQRL HAF         I+ A+DFHSVGRQDA+N                            DMPKWNK++GRAFGI+
Subjt:  MASFSLRANNNASTSRLIDLIKLQRLRHAF---------IYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGIT

Query:  RSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQ
        RSMIPSSS  VLK+LH +GFE YLVGGCVRDLLL R PKDFDVITTAGL+Q                            VSSF+T AKHS+G E    S 
Subjt:  RSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ----------------------------VSSFDTTAKHSEGNEITAHSQ

Query:  IPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLA
         P+KCD+KDLIRWRNS+ RDFTINSLFFDPF NVIYDYAEG+ADLRSLKLRTLIPASLSFK DCARILRGLRIAARLGLS+SK+TETA+HK S SI SL 
Subjt:  IPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLA

Query:  KSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLA
        KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYL+KQDIKKS  +S MLMKLF NLDKLVSC RPS+ NIWVALLAFH+ALVNNPQNSLIVLA
Subjt:  KSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSS-SSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLA

Query:  FAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVL
        FAATLYHGEWNEGVNYARE SL++IN RPEITRSAQFKSAEELAE VTHFALKVQGCIAALTSADCLLEAMSTFPAS  S LVFV  K A+DVA I EVL
Subjt:  FAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVL

Query:  VKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQL-VKRNKKVRKLPSVSEVEL
        V DVESYKN R+NF+IDYQLL KG L+E+R+VLG+VILETLK+AI+QG+G +LD  QNLC+DATT+E   SPV+D V++QL VKRNKKVRKL S SEV+ 
Subjt:  VKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQL-VKRNKKVRKLPSVSEVEL

Query:  KANKKYKLVRNEGSISDKVVENGRCINMTEIHTVEINPRKMD--------------------KVAGQEGKSEKKEH-----RQVLQGKENIKKKRRDITD
        + NKK KL   EG+I D+VVE+GRC+N+ E +   +   ++                     +V   E + E  E      +++   +E   K  +++  
Subjt:  KANKKYKLVRNEGSISDKVVENGRCINMTEIHTVEINPRKMD--------------------KVAGQEGKSEKKEH-----RQVLQGKENIKKKRRDITD

Query:  TVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK
         VE NPRKMDKV   E K +KKEH  +PQGK+NI+KK R +T+IEQ + PLSSLFK
Subjt:  TVEINPRKMDKVAEHEEKLKKKEHHQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK

SwissProt top hitse value%identityAlignment
P0ABF1 Poly(A) polymerase I1.7e-1930.23Show/hide
Query:  PKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF-------------------------DTTAKH
        P+   I      I+R  I  ++L V+  L+  G+EA+LVGG VRDLLL + PKDFDV T A  +QV                             T   H
Subjt:  PKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF-------------------------DTTAKH

Query:  SEGNEITAHSQIPKKCDKKDLIR-------WRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSIS
         EGN   +     ++     L+R         ++  RDFTINSL++      + DY  GM DL+   +R +      ++ D  R+LR +R AA+LG+ IS
Subjt:  SEGNEITAHSQIPKKCDKKDLIR-------WRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSIS

Query:  KETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAY
         ET   I + +  +  +  +RL  E   +L  G    +  LL  ++L + L P    Y
Subjt:  KETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAY

P0ABF2 Poly(A) polymerase I1.7e-1930.23Show/hide
Query:  PKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF-------------------------DTTAKH
        P+   I      I+R  I  ++L V+  L+  G+EA+LVGG VRDLLL + PKDFDV T A  +QV                             T   H
Subjt:  PKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF-------------------------DTTAKH

Query:  SEGNEITAHSQIPKKCDKKDLIR-------WRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSIS
         EGN   +     ++     L+R         ++  RDFTINSL++      + DY  GM DL+   +R +      ++ D  R+LR +R AA+LG+ IS
Subjt:  SEGNEITAHSQIPKKCDKKDLIR-------WRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSIS

Query:  KETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAY
         ET   I + +  +  +  +RL  E   +L  G    +  LL  ++L + L P    Y
Subjt:  KETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAY

P0ABF3 Poly(A) polymerase I1.7e-1930.23Show/hide
Query:  PKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF-------------------------DTTAKH
        P+   I      I+R  I  ++L V+  L+  G+EA+LVGG VRDLLL + PKDFDV T A  +QV                             T   H
Subjt:  PKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF-------------------------DTTAKH

Query:  SEGNEITAHSQIPKKCDKKDLIR-------WRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSIS
         EGN   +     ++     L+R         ++  RDFTINSL++      + DY  GM DL+   +R +      ++ D  R+LR +R AA+LG+ IS
Subjt:  SEGNEITAHSQIPKKCDKKDLIR-------WRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSIS

Query:  KETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAY
         ET   I + +  +  +  +RL  E   +L  G    +  LL  ++L + L P    Y
Subjt:  KETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAY

P44439 Poly(A) polymerase I5.5e-2628.93Show/hide
Query:  NKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF---------------------------DTTAKHS
        N I    F I+      ++L V++ L  +GFEAY+VGGC+RDLLL + PKDFDV T A  +Q+ +                               A HS
Subjt:  NKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF---------------------------DTTAKHS

Query:  EGNEITAHSQIPKKCDKKDLIRW---RNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETET
        +        Q  +    +D +     +++  RDFT+N+L+++P  N + DY EG+ DL++ KLR +      ++ D  R+LR +R  A+L + + K +E 
Subjt:  EGNEITAHSQIPKKCDKKDLIRW---RNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETET

Query:  AIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSSSVMLMKLFCNLDKLVS
         I + +P + ++  +RL  E   +L  G  V +  LL+++ L E L P  +AY  +++   S +  M++    + D+ V+
Subjt:  AIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSSSVMLMKLFCNLDKLVS

Q8Z9C3 Poly(A) polymerase I2.3e-1930.89Show/hide
Query:  ITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF-------------------------DTTAKHSEGNEITAHSQI
        I+R  I  ++L VL  L+  G+EAYLVGG VRDLLL + PKDFDV T A   QV                             T   H+EG+E  +    
Subjt:  ITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSF-------------------------DTTAKHSEGNEITAHSQI

Query:  PKKCDKKDLIR-------WRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSP
         ++     L+R         ++  RDFTINSL++      + DY  GM DL+   +R +      ++ D  R+LR +R AA+L + IS ET   I + + 
Subjt:  PKKCDKKDLIR-------WRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSP

Query:  SITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAY
         +  +  +RL  E   +L  G    +   L+ ++L + L P    Y
Subjt:  SITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAY

Arabidopsis top hitse value%identityAlignment
AT1G28090.1 Polynucleotide adenylyltransferase family protein3.1e-8038.43Show/hide
Query:  WNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLK----------------------------QVSSFDTTAK
        W K++   FGI RSMIP S+  VL  L  +GF+ YLVGGCVRDL+L R+PKDFDVITTA LK                            +VSSF T+A+
Subjt:  WNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLK----------------------------QVSSFDTTAK

Query:  HSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETA
           G       + P  CD++D IRW+N ++RDFT+N L FDP  NV+YDY  G+ DLR+ K+RT+  A+LSF  D ARILR +RIAARLG S++K+   +
Subjt:  HSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETA

Query:  IHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDI-KKSSSSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLAL
        + + S S+  L  SR+ ME+NYML+YG+A  SL LL RF L+EILLP  A+YL  Q   ++   S ML+ LF NLD+LV+  RP +  +W+ +LAFH AL
Subjt:  IHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDI-KKSSSSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLAL

Query:  VNNPQNSLIVLAFAATLY-HGEWNEGVNYAREKSLVEINFRPEITRSAQ--FKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFV
        V+ P++  +V +F   +Y     +E +  AR  S    +   E++   +    S  ++++ V   A  ++     L + D +  AMS +P +  S +VF+
Subjt:  VNNPQNSLIVLAFAATLY-HGEWNEGVNYAREKSLVEINFRPEITRSAQ--FKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFV

Query:  PNKTARDVANIFEVLVKDVESYKNKRQNF--KIDYQLLGKGTLSENRYVLGEVILETL
               V  +F  + +     ++   +   +I+Y+ L  G   E R V   ++ +T+
Subjt:  PNKTARDVANIFEVLVKDVESYKNKRQNF--KIDYQLLGKGTLSENRYVLGEVILETL

AT1G28090.2 Polynucleotide adenylyltransferase family protein3.1e-8038.43Show/hide
Query:  WNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLK----------------------------QVSSFDTTAK
        W K++   FGI RSMIP S+  VL  L  +GF+ YLVGGCVRDL+L R+PKDFDVITTA LK                            +VSSF T+A+
Subjt:  WNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLK----------------------------QVSSFDTTAK

Query:  HSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETA
           G       + P  CD++D IRW+N ++RDFT+N L FDP  NV+YDY  G+ DLR+ K+RT+  A+LSF  D ARILR +RIAARLG S++K+   +
Subjt:  HSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETA

Query:  IHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDI-KKSSSSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLAL
        + + S S+  L  SR+ ME+NYML+YG+A  SL LL RF L+EILLP  A+YL  Q   ++   S ML+ LF NLD+LV+  RP +  +W+ +LAFH AL
Subjt:  IHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDI-KKSSSSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLAL

Query:  VNNPQNSLIVLAFAATLY-HGEWNEGVNYAREKSLVEINFRPEITRSAQ--FKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFV
        V+ P++  +V +F   +Y     +E +  AR  S    +   E++   +    S  ++++ V   A  ++     L + D +  AMS +P +  S +VF+
Subjt:  VNNPQNSLIVLAFAATLY-HGEWNEGVNYAREKSLVEINFRPEITRSAQ--FKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFV

Query:  PNKTARDVANIFEVLVKDVESYKNKRQNF--KIDYQLLGKGTLSENRYVLGEVILETL
               V  +F  + +     ++   +   +I+Y+ L  G   E R V   ++ +T+
Subjt:  PNKTARDVANIFEVLVKDVESYKNKRQNF--KIDYQLLGKGTLSENRYVLGEVILETL

AT1G28090.3 Polynucleotide adenylyltransferase family protein3.1e-8038.43Show/hide
Query:  WNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLK----------------------------QVSSFDTTAK
        W K++   FGI RSMIP S+  VL  L  +GF+ YLVGGCVRDL+L R+PKDFDVITTA LK                            +VSSF T+A+
Subjt:  WNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLK----------------------------QVSSFDTTAK

Query:  HSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETA
           G       + P  CD++D IRW+N ++RDFT+N L FDP  NV+YDY  G+ DLR+ K+RT+  A+LSF  D ARILR +RIAARLG S++K+   +
Subjt:  HSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRGLRIAARLGLSISKETETA

Query:  IHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDI-KKSSSSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLAL
        + + S S+  L  SR+ ME+NYML+YG+A  SL LL RF L+EILLP  A+YL  Q   ++   S ML+ LF NLD+LV+  RP +  +W+ +LAFH AL
Subjt:  IHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDI-KKSSSSVMLMKLFCNLDKLVSCARPSNSNIWVALLAFHLAL

Query:  VNNPQNSLIVLAFAATLY-HGEWNEGVNYAREKSLVEINFRPEITRSAQ--FKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFV
        V+ P++  +V +F   +Y     +E +  AR  S    +   E++   +    S  ++++ V   A  ++     L + D +  AMS +P +  S +VF+
Subjt:  VNNPQNSLIVLAFAATLY-HGEWNEGVNYAREKSLVEINFRPEITRSAQ--FKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASSNSGLVFV

Query:  PNKTARDVANIFEVLVKDVESYKNKRQNF--KIDYQLLGKGTLSENRYVLGEVILETL
               V  +F  + +     ++   +   +I+Y+ L  G   E R V   ++ +T+
Subjt:  PNKTARDVANIFEVLVKDVESYKNKRQNF--KIDYQLLGKGTLSENRYVLGEVILETL

AT2G17580.1 Polynucleotide adenylyltransferase family protein9.1e-12546.01Show/hide
Query:  SLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ--------------------
        S+     +D  + D  KW K+     GI  SMIP SS+ VL+LL  +GF+AYLVGGCVRDL+L RVPKD+DVITTA LKQ                    
Subjt:  SLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQ--------------------

Query:  --------VSSFDTTA-------KHSEGNEITAHSQIPKK---------CDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIP
                VSSFDT A       K  E + ++  ++  K           D KD  RWRNS++RDFTINSLF++PF   IYDYA GM DL  LKLRTL+P
Subjt:  --------VSSFDTTA-------KHSEGNEITAHSQIPKK---------CDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIP

Query:  ASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSSSVML
        A LSFK DCARILRGLRIAARLGLS+SK+ +TAI +F  S+ +L + RL+ME+NYML+YGAA PS+ LL +F LL +LLPF AAYLD+      SSS+ML
Subjt:  ASLSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSSSVML

Query:  MKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQG
        ++LF N+DKLVSC +P++  +W+A+LAFH+ALV NPQ +++V AFAA LYHG W++ V +ARE     I + PE+++S++ +S E+LAE V+ F   ++ 
Subjt:  MKLFCNLDKLVSCARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQG

Query:  CIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAIL----------
            LT  + L EA+  +P    SGLVF+P K  RDVA  F + + DVESY+++++ F IDY LLGKG   E R+VLG++IL+T+ +  +          
Subjt:  CIAALTSADCLLEAMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAIL----------

Query:  QGNGN------MLDRNQNLCIDATTKE--TSDSPVADLVQEQLVKRNKKVRK
        Q   N       L++   L +  ++KE   + +PV D     ++K  K+ R+
Subjt:  QGNGN------MLDRNQNLCIDATTKE--TSDSPVADLVQEQLVKRNKKVRK

AT5G23690.1 Polynucleotide adenylyltransferase family protein1.3e-7837.34Show/hide
Query:  TNAKKKDAANGDMPK---WNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLK--------------------
        T++ ++D  + D P+   W ++N +  G++ SMI  S+  VL  L  +G + YLVGGCVRDL+L+R PKDFD++T+A L+                    
Subjt:  TNAKKKDAANGDMPK---WNKINGRAFGITRSMIPSSSLTVLKLLHGEGFEAYLVGGCVRDLLLQRVPKDFDVITTAGLK--------------------

Query:  --------QVSSFDTTAKHSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRG
                +VSSF T+A++S  N  T   +       +D IR  N ++RDFTIN L FDP++ V+YDY  GM D+R  K+RT+I A  SF  DCARILR 
Subjt:  --------QVSSFDTTAKHSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPASLSFKMDCARILRG

Query:  LRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSSSV-MLMKLFCNLDKLVSCA
        +RIAARLG  +SKET   I   S  +  L K R++ME+NYML+YG+A  SL LL +F +LEILLP  AAYL +   ++      ML+ LF NLDKL++  
Subjt:  LRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSSSV-MLMKLFCNLDKLVSCA

Query:  RPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYH-GEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLE
        RP +S++W+A+LAFH AL + P++ ++V AF+  +++ G+  E V   ++ +        E+    +    + L + V      ++  +  +T A  + +
Subjt:  RPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYH-GEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLE

Query:  AMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESY--KNKRQNFKIDYQLLGKGTLSENRYVLGEVILETL
        AMS +P +  S LVF+P +       IF+  VK+ E+      +Q  KI+Y  L  G   E R+V   V+ +T+
Subjt:  AMSTFPASSNSGLVFVPNKTARDVANIFEVLVKDVESY--KNKRQNFKIDYQLLGKGTLSENRYVLGEVILETL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCTTTTTCTCTCAGAGCGAACAACAACGCCTCCACTTCTCGCCTCATTGACCTAATCAAGCTTCAGAGGCTCAGACACGCTTTTATTTACCCAGCAGTGGATTT
TCACTCAGTGGGTCGTCAAGATGCAGCCAACGACGACGGGTTGTCCCGTGAAATTCACGAGCTTAAGGTGGCCTGGTCGCTCACGAATGCAAAAAAAAAAGATGCAGCCA
ACGGGGACATGCCCAAATGGAATAAAATTAATGGGCGGGCTTTTGGAATCACTCGTTCGATGATCCCATCTTCATCATTAACGGTCTTGAAACTACTTCATGGTGAAGGG
TTTGAAGCCTATTTGGTGGGTGGATGTGTGAGAGATTTACTTCTACAGAGAGTACCAAAAGACTTTGATGTGATTACCACAGCTGGACTTAAACAGGTTTCAAGTTTTGA
TACAACGGCAAAACATTCTGAGGGAAATGAAATTACAGCACATTCTCAAATTCCAAAAAAATGTGATAAAAAGGACTTAATTCGGTGGAGGAACTCTATGGAGCGGGATT
TCACAATTAACAGTTTATTCTTTGATCCCTTTTCGAACGTAATCTATGACTATGCTGAAGGGATGGCAGACTTAAGGTCGTTGAAGCTGCGGACACTAATTCCTGCATCA
TTGTCATTCAAGATGGACTGTGCTAGAATTCTGCGTGGCTTAAGAATTGCTGCTCGTTTGGGTTTGTCAATCTCCAAGGAAACTGAGACTGCGATCCATAAATTTTCTCC
TTCCATCACAAGCTTGGCTAAGTCCAGGTTAATGATGGAATTGAATTATATGCTATCTTATGGAGCAGCCGTTCCTTCACTCTATTTGCTTCAGAGGTTCAACCTGCTCG
AAATTCTGCTACCATTTCATGCTGCGTATCTTGATAAACAGGACATTAAGAAATCTTCGAGTTCTGTAATGTTGATGAAATTGTTCTGCAATTTGGATAAATTGGTTTCT
TGTGCTCGGCCTTCGAACTCCAATATATGGGTTGCATTGTTGGCATTTCACCTGGCGTTAGTTAACAATCCTCAGAATTCTCTTATAGTTCTGGCTTTTGCTGCCACCTT
GTACCATGGGGAGTGGAACGAAGGTGTGAATTATGCAAGAGAAAAGTCTCTTGTTGAGATCAATTTTAGGCCAGAGATTACAAGATCAGCTCAATTTAAATCTGCGGAGG
AGCTTGCTGAAGGGGTTACTCATTTTGCTTTGAAAGTACAAGGTTGCATAGCTGCTCTGACTTCAGCAGATTGTTTATTAGAAGCCATGTCAACATTTCCAGCCTCTTCA
AACTCTGGTTTGGTATTTGTGCCAAACAAAACTGCCAGAGATGTTGCCAACATTTTTGAAGTGCTGGTGAAGGATGTTGAATCCTACAAAAACAAGAGACAGAATTTCAA
AATTGATTATCAACTGCTTGGGAAGGGTACTTTGAGTGAGAATAGATATGTATTGGGAGAAGTTATTTTGGAAACCCTGAAAGATGCAATTCTGCAGGGAAATGGAAACA
TGCTTGACAGGAACCAAAATCTTTGTATTGATGCTACTACTAAGGAAACTTCTGATTCACCAGTTGCTGATCTGGTGCAGGAACAATTGGTTAAAAGAAATAAGAAAGTT
CGGAAACTGCCATCTGTTTCTGAAGTTGAGTTGAAAGCGAATAAGAAATATAAGCTTGTAAGGAATGAGGGAAGTATTTCTGACAAGGTGGTTGAGAATGGAAGGTGCAT
TAACATGACAGAGATTCACACTGTAGAGATCAACCCAAGGAAGATGGACAAAGTAGCAGGGCAAGAAGGAAAGTCCGAGAAGAAAGAGCATCGTCAGGTGCTGCAGGGAA
AGGAGAATATTAAAAAAAAACGTAGAGATATTACAGATACTGTAGAGATCAACCCAAGAAAGATGGACAAAGTAGCAGAGCATGAAGAAAAGCTCAAGAAGAAAGAGCAT
CATCAGGTGCCGCAGGGAAAGAAGAATATTAAAAAAAAACATAGAGATATAACAAATATCGAGCAGCGCAGAGGTCCACTGTCCAGCCTTTTTAAGTAA
mRNA sequenceShow/hide mRNA sequence
TTTCATTTCTTTCTTATTTTCTTTTGAAAGGACGAAACAGTGTTGAGGCGAAGGATACGGTGTGCGCATAGTTTGCATCCGAGCAGATAAACGAGTTGAAAACCCACCGC
CTCCACTCTACGATGCAATAAAAAAGCGCGCCATGCTTCAATTCTCCTCTTCTCATGGCGTCTTTTTCTCTCAGAGCGAACAACAACGCCTCCACTTCTCGCCTCATTGA
CCTAATCAAGCTTCAGAGGCTCAGACACGCTTTTATTTACCCAGCAGTGGATTTTCACTCAGTGGGTCGTCAAGATGCAGCCAACGACGACGGGTTGTCCCGTGAAATTC
ACGAGCTTAAGGTGGCCTGGTCGCTCACGAATGCAAAAAAAAAAGATGCAGCCAACGGGGACATGCCCAAATGGAATAAAATTAATGGGCGGGCTTTTGGAATCACTCGT
TCGATGATCCCATCTTCATCATTAACGGTCTTGAAACTACTTCATGGTGAAGGGTTTGAAGCCTATTTGGTGGGTGGATGTGTGAGAGATTTACTTCTACAGAGAGTACC
AAAAGACTTTGATGTGATTACCACAGCTGGACTTAAACAGGTTTCAAGTTTTGATACAACGGCAAAACATTCTGAGGGAAATGAAATTACAGCACATTCTCAAATTCCAA
AAAAATGTGATAAAAAGGACTTAATTCGGTGGAGGAACTCTATGGAGCGGGATTTCACAATTAACAGTTTATTCTTTGATCCCTTTTCGAACGTAATCTATGACTATGCT
GAAGGGATGGCAGACTTAAGGTCGTTGAAGCTGCGGACACTAATTCCTGCATCATTGTCATTCAAGATGGACTGTGCTAGAATTCTGCGTGGCTTAAGAATTGCTGCTCG
TTTGGGTTTGTCAATCTCCAAGGAAACTGAGACTGCGATCCATAAATTTTCTCCTTCCATCACAAGCTTGGCTAAGTCCAGGTTAATGATGGAATTGAATTATATGCTAT
CTTATGGAGCAGCCGTTCCTTCACTCTATTTGCTTCAGAGGTTCAACCTGCTCGAAATTCTGCTACCATTTCATGCTGCGTATCTTGATAAACAGGACATTAAGAAATCT
TCGAGTTCTGTAATGTTGATGAAATTGTTCTGCAATTTGGATAAATTGGTTTCTTGTGCTCGGCCTTCGAACTCCAATATATGGGTTGCATTGTTGGCATTTCACCTGGC
GTTAGTTAACAATCCTCAGAATTCTCTTATAGTTCTGGCTTTTGCTGCCACCTTGTACCATGGGGAGTGGAACGAAGGTGTGAATTATGCAAGAGAAAAGTCTCTTGTTG
AGATCAATTTTAGGCCAGAGATTACAAGATCAGCTCAATTTAAATCTGCGGAGGAGCTTGCTGAAGGGGTTACTCATTTTGCTTTGAAAGTACAAGGTTGCATAGCTGCT
CTGACTTCAGCAGATTGTTTATTAGAAGCCATGTCAACATTTCCAGCCTCTTCAAACTCTGGTTTGGTATTTGTGCCAAACAAAACTGCCAGAGATGTTGCCAACATTTT
TGAAGTGCTGGTGAAGGATGTTGAATCCTACAAAAACAAGAGACAGAATTTCAAAATTGATTATCAACTGCTTGGGAAGGGTACTTTGAGTGAGAATAGATATGTATTGG
GAGAAGTTATTTTGGAAACCCTGAAAGATGCAATTCTGCAGGGAAATGGAAACATGCTTGACAGGAACCAAAATCTTTGTATTGATGCTACTACTAAGGAAACTTCTGAT
TCACCAGTTGCTGATCTGGTGCAGGAACAATTGGTTAAAAGAAATAAGAAAGTTCGGAAACTGCCATCTGTTTCTGAAGTTGAGTTGAAAGCGAATAAGAAATATAAGCT
TGTAAGGAATGAGGGAAGTATTTCTGACAAGGTGGTTGAGAATGGAAGGTGCATTAACATGACAGAGATTCACACTGTAGAGATCAACCCAAGGAAGATGGACAAAGTAG
CAGGGCAAGAAGGAAAGTCCGAGAAGAAAGAGCATCGTCAGGTGCTGCAGGGAAAGGAGAATATTAAAAAAAAACGTAGAGATATTACAGATACTGTAGAGATCAACCCA
AGAAAGATGGACAAAGTAGCAGAGCATGAAGAAAAGCTCAAGAAGAAAGAGCATCATCAGGTGCCGCAGGGAAAGAAGAATATTAAAAAAAAACATAGAGATATAACAAA
TATCGAGCAGCGCAGAGGTCCACTGTCCAGCCTTTTTAAGTAAACACCTTTGGTGTGTAATTCAGAAACTGATCTTGGAATTTCTTCAAACTGGTCATAAAAAGTGCGCT
ATGATTCACCTGTCTGAAGGTGTATGATTCCATCACCATACATGTTTGGTGTTTCTACCCCAGGGTTTGATCACAACAGTGCTATGCCTCAACACAATCACCCTGTTTGT
TTCTGTTCAATAGATGGAGGGAAGAGTTTCGATTGAAACTTCCAGTGATCTAGAAATCCAACTGGAGTTGTTCCAAACTAAATGCTGAACATCCACATGACCGAGAAGAA
AAGAGCCAACGGAATTAAAATCAGCACATACAGAAAATGGTATTTCTGACTCGTTCCTGTGTCTTTGTTATCATTAGGAGTTGCAAATTCATGTTAGTAAAGGAGTTATT
TTCCCTGGTTCAAGACAAGTGGGTTAACTAATTCGTCTAGATTAGTGGATTCAATTCAGTCCGTTGATTTTCTTTTAACTTGTTTTTTCTTTCTAATTTTGGACTTATGG
TTATGGTCATCTCCTATCGTGTTTTAGTTGATATATAGCTAGATTAACAATTTGATTAACCTCGACCTTTTCCCTGCATTCTATTATTATTGTTATTATTTTTCTTCA
Protein sequenceShow/hide protein sequence
MASFSLRANNNASTSRLIDLIKLQRLRHAFIYPAVDFHSVGRQDAANDDGLSREIHELKVAWSLTNAKKKDAANGDMPKWNKINGRAFGITRSMIPSSSLTVLKLLHGEG
FEAYLVGGCVRDLLLQRVPKDFDVITTAGLKQVSSFDTTAKHSEGNEITAHSQIPKKCDKKDLIRWRNSMERDFTINSLFFDPFSNVIYDYAEGMADLRSLKLRTLIPAS
LSFKMDCARILRGLRIAARLGLSISKETETAIHKFSPSITSLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQDIKKSSSSVMLMKLFCNLDKLVS
CARPSNSNIWVALLAFHLALVNNPQNSLIVLAFAATLYHGEWNEGVNYAREKSLVEINFRPEITRSAQFKSAEELAEGVTHFALKVQGCIAALTSADCLLEAMSTFPASS
NSGLVFVPNKTARDVANIFEVLVKDVESYKNKRQNFKIDYQLLGKGTLSENRYVLGEVILETLKDAILQGNGNMLDRNQNLCIDATTKETSDSPVADLVQEQLVKRNKKV
RKLPSVSEVELKANKKYKLVRNEGSISDKVVENGRCINMTEIHTVEINPRKMDKVAGQEGKSEKKEHRQVLQGKENIKKKRRDITDTVEINPRKMDKVAEHEEKLKKKEH
HQVPQGKKNIKKKHRDITNIEQRRGPLSSLFK