; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10002884 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10002884
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPoly(A) polymerase I
Genome locationChr11:15058439..15071005
RNA-Seq ExpressionHG10002884
SyntenyHG10002884
Gene Ontology termsGO:0001680 - tRNA 3'-terminal CCA addition (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR032828 - tRNA nucleotidyltransferase/poly(A) polymerase, RNA and SrmB- binding domain
IPR043519 - Nucleotidyltransferase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596268.1 hypothetical protein SDJN03_09448, partial [Cucurbita argyrosperma subsp. sororia]2.0e-25672.21Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        A+N DMPKWNK++GRAFGI+RSMIPSSSW+VLK+LHNKG   + V  C      + A  + +V     +      QI KLFH AQI+GRRFPICMVNIKG
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        S+ EVSSFETVAK S G E VTSS  P+KCD+KDLIRWRNS+HRDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLSLSK+TETAMR LS SI  L KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQ IKKSSL+S MLMKLF NLDKLVSCDRPSDC
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWVALLAFHMALV NPQNSLIVLAFAATLYHGEWNEG+NYARENSL+Q+NLRPEITR AQFKS EELA+ VTHFALKVQGCI ALTSADCLLEAMSTFP
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD
         SP+SSLVFVSK+ AK VAKI EVLVNDVESYKN R+ FEIDYQLL KG L+ESR+VLGKVILETLK+AI+Q DG ILD  QN  VDA TEE YSSPV+D
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD

Query:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE
         VKDQL VKRNKKVRKL S SE++WE NKKNKL GKE SI D+VVE+GR +N+      GVEASQLP AGL SME+SLLES KCHHFEVR +EN Q+N E
Subjt:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE

Query:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
        T+ N+VKK+  SQE RDKVTKEL H VE NPRKMDKV GKEG   KKEH  +PQGKENI+KK R +TDI+Q KRPLSSLF
Subjt:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

XP_022947617.1 uncharacterized protein LOC111451426 [Cucurbita moschata]1.6e-25872.79Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        A+N DMPKWNK++GRAFGI+RSMIPSSSW+VLK+LHNKG   + V  C      + A  + +V     +      QI KLFH AQI+GRRFPICMVNIKG
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        S+ EVSSFETVAK S G E VTSS  P+KCD+KDLIRWRNS+HRDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLSLSK+TETAMR LS SI  L KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQ IKKSSL+S MLMKLF NLDKLVSCDRPSDC
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEG+NYARENSL+Q+NLRPEITR AQFKS EELA+ VTHFALKVQGCI ALTSADCLLEAMSTFP
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD
         SP+SSLVFVSK+ AK VAKI EVLVNDVESYKN R+NFEIDYQLL KG L+ESR+VLGKVILETLK+AI+Q DG ILD  QNL VDA TEE YSSPV+D
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD

Query:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE
         VKDQL VKRNKKVRKL S SE+EWE NKKNKL GKE SI D+VVE+GR +N+      GVEASQLP AGL SME+SLLES KCHHFEVR +E MQ+N E
Subjt:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE

Query:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
        TM N+VKK+  SQE RDKVTKEL H VE NP KMDKV GKEG   KKEH   PQGKENI+KK R +TDI+Q KRPLSSLF
Subjt:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

XP_022971476.1 uncharacterized protein LOC111470184 [Cucurbita maxima]1.2e-25872.5Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        A+N DMPKWNK++GRAFGI+RSMIPSSSW+VLK+LHNKG   + V  C      + A  + +V     +      QI KLFH AQI+GRRFPICMVNIKG
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        S+ EVSSFETVAK S G E VTSS  P+KCD+KDLIRWRNS+HRDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLSLSK+TETAM  LS SI  L KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYL+KQ IKKS L+S MLMKLF NLDKLVSCDRPSDC
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEG+NYARENSL+Q+NLRPEITR AQFKS EELA+ VTHFALKVQGCI ALTSADCLLEAMSTFP
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD
         SPYSSLVFVSK+ AK VAKI EVLVNDVESYKN R+NFEIDYQLL KG L+ESR+VLGKVILETLK+AI+QGDG ILD  QNL VDA TEE YSSPV+D
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD

Query:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE
         VKDQL VKRNKKVRKL S SE++WE NKKNKL GKE +I D+VVE+GR +N+      GVEASQLP AGL SME SLLES KCHHFEVR +EN Q+N E
Subjt:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE

Query:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
        TM N+VKK+  SQE RDKVTKEL H VE NPRKMDKV GKEG   KKEH  +PQGKENI+KK R +TDI+Q KRPLSSLF
Subjt:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

XP_023540398.1 uncharacterized protein LOC111800784 [Cucurbita pepo subsp. pepo]1.5e-25672.1Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        A+N DMPKWNK++GRAFGI+RSMIPSSSW+VLK+LHNKG   + V  C      + A  + +V     +      QI KLFH AQI+GRRFPICMVNIKG
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        S+ EVSSFETVAK S G E VTSS  P+KCD+KDLIRWRNS+HRDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLSLSK+TETAMR LS SI  L KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQ IKKSSL+S MLMKLF NLDKLVSCDRPSDC
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEG+NYARENSL+Q+NLRPEITR AQFKS EELA+ VTHFALKVQGCI ALTSADCLLEAMSTFP
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD
         SP+SSLVFVSK+ AK VAKI EVLVNDVESYKN R+NFEIDYQLL KG L+ESR+VLGKVILETLK+AI+QGDG ILD  QNL VDA TEE YSSP++D
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD

Query:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE
         VKDQL VKRNKKVRKL S SE++WE NKKNKL GKE SI D+VVE+ R +N+      GVEASQLP AGL SME+S LES KCHHFEVR +EN Q+N E
Subjt:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE

Query:  TMGNEVKKIIL-SQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
         M N+VKK+   S E RDKVTKEL H VE NPRKMDKV GKEG   KKEH  +PQGKENI+KK R +TDI+Q KRPLSSLF
Subjt:  TMGNEVKKIIL-SQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

XP_038904059.1 uncharacterized protein LOC120090455 [Benincasa hispida]1.4e-27576.67Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        AANN MPKWNKI+GRAFGITRSMIPSSSW VLK+L +KG   + V  C      +    + +V     ++     QI KLFHRA I+GRRFPIC VNIKG
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        SITEVSSFETVA+ +DG ERVTSS+ PKKCDKKDLIRWRNSMHRDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLSLSKETETAMR LSPSIT LAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLE+LLPF AAYLDKQGIKKSSLSSIMLMKLFFNLDKL SCD PSDC
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWVALLAFHMALVNNPQNSLIVLAFAATLYHG+WNEG+NYARENSL+Q+NLRPEIT  AQF SEEEL +GVTHFA KVQGCI ALTSAD LLEAMSTFP
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD
        DSP SSLV VS +T+K VA IFE LVNDVESYKNKRQ F+IDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNIL R QNL VDA TEET +SP AD
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD

Query:  LVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINMGVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPETMGNEVK
        LVK QL+KRNKKV+KL SG E++WEANKK KLG K+ SISD+VV+NGR INMGVEASQLPLAGL SMEES+LES KCH FEV A ENMQKNPETMGNEVK
Subjt:  LVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINMGVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPETMGNEVK

Query:  KIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
         II SQEARDKVTKEL HT +IN RKMDKVAG+EG S KKEH+QVPQGKENI+KKHRD TD++QCKRPLSSLF
Subjt:  KIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

TrEMBL top hitse value%identityAlignment
A0A0A0L3K6 Uncharacterized protein6.0e-23564.82Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        AAN DMPKWNKINGRAFG+TRSMIPSSSW VL+VLH +G   + V  C           + +V     + +    QI  LF R++I+GRRFPICMV+I+G
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        SITEVSSF+T AK S+ N+    SQIPKKCDKKDLIRWRNSM RDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLS+SKETETA+   SPSIT L KSRLMMELNYMLSYGAAVPSLYLLQRF LL  LLPFHAAYLDKQGI+KSSLSS+MLMKLFFNLDKLVSC  PS+C
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWVALLAFH+ALVNNPQNSL+VLAFAATLYHGEWNEG+NYARE SLV++NLRPEITR A+FKSEE+LA+GVT FALKVQGCI ALTS DCLLEAMSTFP
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGT-LSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVA
         S  S LVFVS +TA+ VA IFEVL   V+SYK+++++F+IDY+ LGKG  L E+RYVLGK+ILETL+DAILQG+ NI DRNQNLR+DA T+ET  SPVA
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGT-LSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVA

Query:  DLVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE
        DLV++QLVK NKKVRK  S SE+E +ANKK KL  KE SISDKVVENGR INM      GVE SQLPLA +    E +LES KCHH EVRATENM++NPE
Subjt:  DLVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE

Query:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITD--------------------------------
        +MGNEVKKII  ++A  KVTKEL H VEINPRKMDKVAG+E  S KKEH++V QGK+NIKKK RDITD                                
Subjt:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITD--------------------------------

Query:  ----------IKQCKRPLSSLF
                  IKQCK PLSSLF
Subjt:  ----------IKQCKRPLSSLF

A0A5A7UAH1 Poly(A) polymerase I1.8e-22365.33Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        A N DMPKWNK+NGR FGITR MIPSS   VL+ LH++G   + V  C           + +V     + +    Q+ KLF R++I+GRRFPICMV+I+G
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITE-VSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIA
        +ITE VSSF+T AK S+G E    SQIPKKCDKKDLIRWRNSM RDFT+N                      ++ +  L+   +S     ARILRGLRIA
Subjt:  SITE-VSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIA

Query:  ARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSD
        ARLGLS+SKETETA+R  SPSIT L K RLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYL++QGI KSSLSS+MLMKLFFNLDKLVSC RPS+
Subjt:  ARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSD

Query:  CNIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTF
        CNIWVALLAFH+ALVNNPQNSLIVLAFAATLYHGEWNEG+NYAR  SLV++NLRPEITR A+FKSE ELA+GVT FAL VQGCI ALTSADCLLEAMSTF
Subjt:  CNIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTF

Query:  PDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGK-GTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPV
        P S  S LVFVSK+ A+ VA IFEVLVNDVESYK+KR+NFE+DY+ LG  G L E+RYVLGKVILETL+ AILQG+ NILD NQNL +DA T+ET +SPV
Subjt:  PDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGK-GTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPV

Query:  ADLVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINMGVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPETMGNE
        ADLV++QLVK+NKKVRKL S SE+E +ANKK K   +E SISDK                                       VRATENM++NPETMGNE
Subjt:  ADLVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINMGVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPETMGNE

Query:  VKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
        VKKI+  ++A  KVTKEL H VEI PRKMDKVAG++G S KKEH++VPQGKENIK KHRDIT+IKQCK PLSSLF
Subjt:  VKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

A0A6J1CVT5 uncharacterized protein LOC111014843 isoform X44.5e-21462.3Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        AAN  M KWNK++ RAFGI RSMIP SSW+VL++L  KG   + V  C      +    + +V     +      QI KLFHRAQI+GRRFPICMVNIKG
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        S+ EVSSFETVAK S+G   V SSQIP+KC K+DLIRWRNSMHRDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLSLSK+TETA+R LSPSI  L K+R+MMELNYMLSYGAAVPSLYLLQRFNLL+ILLPFHAAYLDKQ IK+SSL+SIMLMKLF NLDKLVSCDRPSDC
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWV LLAFHMALV NPQNSLIVLAFA TLYHG+WNEG+NYARENSLVQ+NLRPEITR AQFKS+EELA+GV+HFA KVQGCI A T ADCL EA  T P
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD
         SP S+LVFVSK+TAK VAKIFEVLVNDVES+KNKR+NFEIDYQLLGKG LSESRYV+GK+I ETL  AI+QGD NILD+ QNL VD  T+E Y+SPV+D
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD

Query:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM---------GVEASQLPLAGLLSMEESLLESCKCHHFEVR--ATENM
        +VKDQL V +  KV+KL S SE+   ANKK KL  KE   ++   E+   + M           E  +  +       E  L + K H    +    EN 
Subjt:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM---------GVEASQLPLAGLLSMEESLLESCKCHHFEVR--ATENM

Query:  QKNPETMGNEVKKIILSQEARDKVTKELFHTVEINPRKMD--KVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
        Q + ET GN+VK +IL QEA DKVTKEL H V++NPR M+  +V G+EG S KKE   + QGKEN  KKHR +T   Q K PLSSLF
Subjt:  QKNPETMGNEVKKIILSQEARDKVTKELFHTVEINPRKMD--KVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

A0A6J1G6Y5 uncharacterized protein LOC1114514267.8e-25972.79Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        A+N DMPKWNK++GRAFGI+RSMIPSSSW+VLK+LHNKG   + V  C      + A  + +V     +      QI KLFH AQI+GRRFPICMVNIKG
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        S+ EVSSFETVAK S G E VTSS  P+KCD+KDLIRWRNS+HRDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLSLSK+TETAMR LS SI  L KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQ IKKSSL+S MLMKLF NLDKLVSCDRPSDC
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEG+NYARENSL+Q+NLRPEITR AQFKS EELA+ VTHFALKVQGCI ALTSADCLLEAMSTFP
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD
         SP+SSLVFVSK+ AK VAKI EVLVNDVESYKN R+NFEIDYQLL KG L+ESR+VLGKVILETLK+AI+Q DG ILD  QNL VDA TEE YSSPV+D
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD

Query:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE
         VKDQL VKRNKKVRKL S SE+EWE NKKNKL GKE SI D+VVE+GR +N+      GVEASQLP AGL SME+SLLES KCHHFEVR +E MQ+N E
Subjt:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE

Query:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
        TM N+VKK+  SQE RDKVTKEL H VE NP KMDKV GKEG   KKEH   PQGKENI+KK R +TDI+Q KRPLSSLF
Subjt:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

A0A6J1I3F4 uncharacterized protein LOC1114701846.0e-25972.5Show/hide
Query:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG
        A+N DMPKWNK++GRAFGI+RSMIPSSSW+VLK+LHNKG   + V  C      + A  + +V     +      QI KLFH AQI+GRRFPICMVNIKG
Subjt:  AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKG

Query:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA
        S+ EVSSFETVAK S G E VTSS  P+KCD+KDLIRWRNS+HRDFT+N                      ++ +  L+   +S     ARILRGLRIAA
Subjt:  SITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF--------------------TVHISALLAVVIS-----ARILRGLRIAA

Query:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC
        RLGLSLSK+TETAM  LS SI  L KSRLMME NYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYL+KQ IKKS L+S MLMKLF NLDKLVSCDRPSDC
Subjt:  RLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDC

Query:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP
        NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEG+NYARENSL+Q+NLRPEITR AQFKS EELA+ VTHFALKVQGCI ALTSADCLLEAMSTFP
Subjt:  NIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFP

Query:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD
         SPYSSLVFVSK+ AK VAKI EVLVNDVESYKN R+NFEIDYQLL KG L+ESR+VLGKVILETLK+AI+QGDG ILD  QNL VDA TEE YSSPV+D
Subjt:  DSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDAATEETYSSPVAD

Query:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE
         VKDQL VKRNKKVRKL S SE++WE NKKNKL GKE +I D+VVE+GR +N+      GVEASQLP AGL SME SLLES KCHHFEVR +EN Q+N E
Subjt:  LVKDQL-VKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINM------GVEASQLPLAGLLSMEESLLESCKCHHFEVRATENMQKNPE

Query:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF
        TM N+VKK+  SQE RDKVTKEL H VE NPRKMDKV GKEG   KKEH  +PQGKENI+KK R +TDI+Q KRPLSSLF
Subjt:  TMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLF

SwissProt top hitse value%identityAlignment
P0ABF1 Poly(A) polymerase I3.3e-0427Show/hide
Query:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL
        P Q+ KLF   +++GRRF +  V     I EV++F     + + ++R TS +       +D I      ++  RDFT+N        FTV      +  L
Subjt:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL

Query:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG
           VI               R+LR +R AA+LG+ +S ET   +  L+  +  +  +RL  E   +L  G    +  LL  ++L + L P    Y  + G
Subjt:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG

P0ABF2 Poly(A) polymerase I3.3e-0427Show/hide
Query:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL
        P Q+ KLF   +++GRRF +  V     I EV++F     + + ++R TS +       +D I      ++  RDFT+N        FTV      +  L
Subjt:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL

Query:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG
           VI               R+LR +R AA+LG+ +S ET   +  L+  +  +  +RL  E   +L  G    +  LL  ++L + L P    Y  + G
Subjt:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG

P0ABF3 Poly(A) polymerase I3.3e-0427Show/hide
Query:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL
        P Q+ KLF   +++GRRF +  V     I EV++F     + + ++R TS +       +D I      ++  RDFT+N        FTV      +  L
Subjt:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL

Query:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG
           VI               R+LR +R AA+LG+ +S ET   +  L+  +  +  +RL  E   +L  G    +  LL  ++L + L P    Y  + G
Subjt:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG

Q8Z9C3 Poly(A) polymerase I7.3e-0426.5Show/hide
Query:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL
        P Q+ KLF   +++GRRF +  V     I EV++F    + S+ ++R TS +       +D I      ++  RDFT+N        FTV      +  L
Subjt:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL

Query:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG
           VI               R+LR +R AA+L + +S ET   +  L+  +  +  +RL  E   +L  G    +   L+ ++L + L P    Y  + G
Subjt:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG

Q8ZRQ8 Poly(A) polymerase I7.3e-0426.5Show/hide
Query:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL
        P Q+ KLF   +++GRRF +  V     I EV++F    + S+ ++R TS +       +D I      ++  RDFT+N        FTV      +  L
Subjt:  PVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRW---RNSMHRDFTVNR-------FTVH-----ISAL

Query:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG
           VI               R+LR +R AA+L + +S ET   +  L+  +  +  +RL  E   +L  G    +   L+ ++L + L P    Y  + G
Subjt:  LAVVI-------------SARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQG

Arabidopsis top hitse value%identityAlignment
AT1G28090.1 Polynucleotide adenylyltransferase family protein3.4e-6535.61Show/hide
Query:  NNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSI
        +  MP W K++   FGI RSMIP S+ +VL  L  KG   + V  C           + +V     +      ++ K+F   QI+GRRFPIC V +   I
Subjt:  NNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSI

Query:  TEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF-------------------------TVHISALLAVVISARILRGLRIAARL
         EVSSF T A+   G     S + P  CD++D IRW+N + RDFTVN                           TV  + L  V  +ARILR +RIAARL
Subjt:  TEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF-------------------------TVHISALLAVVISARILRGLRIAARL

Query:  GLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNI
        G SL+K+   +++ LS S+ RL  SR+ ME+NYML+YG+A  SL LL RF L+EILLP  A+YL  QG ++    S ML+ LF NLD+LV+ DRP    +
Subjt:  GLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNI

Query:  WVALLAFHMALVNNPQNSLIVLAFAATLY-HGEWNEGMNYARENSLVQVNLRPEITRLAQ--FKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTF
        W+ +LAFH ALV+ P++  +V +F   +Y     +E +  AR NS    +   E++   +    SE +++  V   A  ++     L + D +  AMS +
Subjt:  WVALLAFHMALVNNPQNSLIVLAFAATLY-HGEWNEGMNYARENSLVQVNLRPEITRLAQ--FKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTF

Query:  PDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFE--IDYQLLGKGTLSESRYVLGKVILETL
        P +P S +VF+S+   + V K+F  +       ++   + E  I+Y+ L  G   E+R V  +++ +T+
Subjt:  PDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFE--IDYQLLGKGTLSESRYVLGKVILETL

AT1G28090.2 Polynucleotide adenylyltransferase family protein3.4e-6535.61Show/hide
Query:  NNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSI
        +  MP W K++   FGI RSMIP S+ +VL  L  KG   + V  C           + +V     +      ++ K+F   QI+GRRFPIC V +   I
Subjt:  NNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSI

Query:  TEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF-------------------------TVHISALLAVVISARILRGLRIAARL
         EVSSF T A+   G     S + P  CD++D IRW+N + RDFTVN                           TV  + L  V  +ARILR +RIAARL
Subjt:  TEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF-------------------------TVHISALLAVVISARILRGLRIAARL

Query:  GLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNI
        G SL+K+   +++ LS S+ RL  SR+ ME+NYML+YG+A  SL LL RF L+EILLP  A+YL  QG ++    S ML+ LF NLD+LV+ DRP    +
Subjt:  GLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNI

Query:  WVALLAFHMALVNNPQNSLIVLAFAATLY-HGEWNEGMNYARENSLVQVNLRPEITRLAQ--FKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTF
        W+ +LAFH ALV+ P++  +V +F   +Y     +E +  AR NS    +   E++   +    SE +++  V   A  ++     L + D +  AMS +
Subjt:  WVALLAFHMALVNNPQNSLIVLAFAATLY-HGEWNEGMNYARENSLVQVNLRPEITRLAQ--FKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTF

Query:  PDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFE--IDYQLLGKGTLSESRYVLGKVILETL
        P +P S +VF+S+   + V K+F  +       ++   + E  I+Y+ L  G   E+R V  +++ +T+
Subjt:  PDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFE--IDYQLLGKGTLSESRYVLGKVILETL

AT1G28090.3 Polynucleotide adenylyltransferase family protein3.4e-6535.61Show/hide
Query:  NNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSI
        +  MP W K++   FGI RSMIP S+ +VL  L  KG   + V  C           + +V     +      ++ K+F   QI+GRRFPIC V +   I
Subjt:  NNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSI

Query:  TEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF-------------------------TVHISALLAVVISARILRGLRIAARL
         EVSSF T A+   G     S + P  CD++D IRW+N + RDFTVN                           TV  + L  V  +ARILR +RIAARL
Subjt:  TEVSSFETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRF-------------------------TVHISALLAVVISARILRGLRIAARL

Query:  GLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNI
        G SL+K+   +++ LS S+ RL  SR+ ME+NYML+YG+A  SL LL RF L+EILLP  A+YL  QG ++    S ML+ LF NLD+LV+ DRP    +
Subjt:  GLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNI

Query:  WVALLAFHMALVNNPQNSLIVLAFAATLY-HGEWNEGMNYARENSLVQVNLRPEITRLAQ--FKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTF
        W+ +LAFH ALV+ P++  +V +F   +Y     +E +  AR NS    +   E++   +    SE +++  V   A  ++     L + D +  AMS +
Subjt:  WVALLAFHMALVNNPQNSLIVLAFAATLY-HGEWNEGMNYARENSLVQVNLRPEITRLAQ--FKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTF

Query:  PDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFE--IDYQLLGKGTLSESRYVLGKVILETL
        P +P S +VF+S+   + V K+F  +       ++   + E  I+Y+ L  G   E+R V  +++ +T+
Subjt:  PDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFE--IDYQLLGKGTLSESRYVLGKVILETL

AT2G17580.1 Polynucleotide adenylyltransferase family protein2.7e-9434.99Show/hide
Query:  DMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSITE
        D  KW K+     GI  SMIP SS  VL++L  +G   + V  C      +    + +V     +      QI +LFHRAQ+IG+RFPIC V + GSI E
Subjt:  DMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSITE

Query:  VSSFETVAK----------------DSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNR-------FTVH-------------ISALLAVVIS----
        VSSF+TVA                 D+  N+  +  ++    D KD  RWRNS+ RDFT+N        FT++             +  L+   +S    
Subjt:  VSSFETVAK----------------DSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNR-------FTVH-------------ISALLAVVIS----

Query:  -ARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNL
         ARILRGLRIAARLGLSLSK+ +TA+     S+  L + RL+ME+NYML+YGAA PS+ LL +F LL +LLPF AAYLD Q  K S  SS+ML++LF N+
Subjt:  -ARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNL

Query:  DKLVSCDRPSDCNIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTS
        DKLVSCD+P+D  +W+A+LAFH+ALV NPQ +++V AFAA LYHG W++ + +ARE+    +   PE+++ ++ +S+E+LA+ V+ F   ++     LT 
Subjt:  DKLVSCDRPSDCNIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTS

Query:  ADCLLEAMSTFPDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDA
         + L EA+  +PD  +S LVF+ K+  + VA+ F + ++DVESY+++++ F IDY LLGKG   E R+VLGK+IL+T+ +  +    N + + Q+ R   
Subjt:  ADCLLEAMSTFPDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILETLKDAILQGDGNILDRNQNLRVDA

Query:  ATEETYSSPVADLVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINMGVEASQLPLAGLLSMEESLLESCKCHHFEVRATENM
             +  P A L K   +  +K  ++  +      ++N  + L   + +  D    N +   +     + P     + ++S+++  K            
Subjt:  ATEETYSSPVADLVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINMGVEASQLPLAGLLSMEESLLESCKCHHFEVRATENM

Query:  QKNPETMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSG-----KKEHNQVPQGKENIKKKHRD--ITDIKQCKR
        +++ E   +E  K   S+ +R    +E   ++ + P K+   + K   +       K+       KE+ K KH D  + +IK+ K+
Subjt:  QKNPETMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSG-----KKEHNQVPQGKENIKKKHRD--ITDIKQCKR

AT5G23690.1 Polynucleotide adenylyltransferase family protein3.0e-6132.47Show/hide
Query:  KWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSITEVSS
        +W ++N +  G++ SMI  S+  VL  L +KG   + V  C           + ++  +  +      ++ + F R +I+GRRFPIC V+I   + EVSS
Subjt:  KWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSITEVSS

Query:  FETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVN--------------------------RFTVHISALLAVVISARILRGLRIAARLGLSL
        F T A++S  N R    +       +D IR  N + RDFT+N                          R  +H          ARILR +RIAARLG  +
Subjt:  FETVAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVN--------------------------RFTVHISALLAVVISARILRGLRIAARLGLSL

Query:  SKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNIWVAL
        SKET   ++NLS  + RL K R++ME+NYML+YG+A  SL LL +F +LEILLP  AAYL + G ++    + ML+ LF NLDKL++ DRP   ++W+A+
Subjt:  SKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPSLYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNIWVAL

Query:  LAFHMALVNNPQNSLIVLAFAATLYH-GEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFPDSPYS
        LAFH AL + P++ ++V AF+  +++ G+  E +   ++ +        E+    +    + L D V      ++  +  +T A  + +AMS +P +PYS
Subjt:  LAFHMALVNNPQNSLIVLAFAATLYH-GEWNEGMNYARENSLVQVNLRPEITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFPDSPYS

Query:  SLVFVSKETAKHVAKIFEVLVNDVESYK-NKRQNFEIDYQLLGKGTLSESRYVLGKVILETL
         LVF+  +      +IF+ + N+        +Q  +I+Y  L  G   E R+V  +V+ +T+
Subjt:  SLVFVSKETAKHVAKIFEVLVNDVESYK-NKRQNFEIDYQLLGKGTLSESRYVLGKVILETL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCAGCCAACAACGACATGCCCAAATGGAATAAAATTAATGGACGGGCTTTTGGGATCACTCGTTCAATGATCCCATCTTCATCATGGATAGTCTTGAAAGTTCTTCATAA
TAAAGGTATTTGTGGCCACCGTGTAATTTCTTGTGGTGTTGTTTTCTCTCCCTCTTCTGCTGAGAGTGAAATCAACGTTTTTAAGAATGGCTTAGTTGATTCCATGTGTC
CCGTGCAGATCTGCAAACTATTTCATCGGGCACAAATTATTGGGCGTCGTTTTCCTATTTGTATGGTCAATATTAAAGGCTCCATTACTGAGGTTTCTAGTTTTGAAACA
GTTGCAAAAGATTCTGATGGAAACGAAAGAGTAACATCTTCTCAAATTCCAAAAAAATGTGATAAAAAGGACTTAATCCGGTGGAGGAACTCTATGCACCGGGATTTCAC
AGTTAACAGGTTCACTGTTCACATTTCTGCCTTGTTAGCAGTCGTTATATCAGCAAGAATTCTGCGTGGCTTAAGAATTGCAGCTCGTTTGGGTTTGTCACTCTCAAAGG
AAACTGAGACTGCGATGCGTAATCTTTCTCCTTCCATCACAAGATTGGCTAAGTCCAGGTTAATGATGGAATTGAATTATATGCTATCTTATGGAGCAGCCGTTCCTTCT
CTCTATTTGCTTCAGAGGTTCAACCTGCTTGAAATTCTGCTACCATTTCATGCTGCTTATCTTGATAAACAGGGCATTAAGAAATCTTCGCTAAGTTCTATAATGTTGAT
GAAATTGTTCTTCAATTTGGATAAATTGGTTTCTTGTGATCGGCCTTCAGACTGCAATATATGGGTTGCATTGTTGGCATTTCACATGGCATTAGTTAACAACCCTCAGA
ATTCTCTTATAGTTCTGGCCTTTGCTGCCACCTTGTACCATGGGGAGTGGAATGAAGGTATGAATTATGCAAGAGAAAACTCTCTTGTGCAGGTCAATTTAAGGCCAGAG
ATTACGAGATTGGCTCAATTTAAATCCGAGGAGGAGCTTGCTGATGGGGTTACTCATTTTGCTTTGAAAGTACAAGGTTGCATAACTGCTCTGACTTCAGCAGATTGTCT
CTTAGAAGCCATGTCAACATTTCCAGACTCTCCATACTCTAGTTTGGTATTTGTGTCAAAGGAAACTGCCAAACATGTTGCTAAAATTTTTGAAGTGCTGGTGAATGATG
TTGAATCCTACAAAAACAAGAGACAGAATTTTGAAATTGACTATCAACTGCTTGGCAAGGGTACTTTGAGTGAGAGTAGATATGTCTTGGGAAAAGTTATCTTGGAAACC
CTGAAAGATGCAATTCTGCAGGGAGATGGAAACATTCTTGACAGGAACCAAAATCTTCGTGTTGATGCTGCTACTGAGGAAACTTATAGTTCACCAGTCGCTGATTTGGT
GAAGGACCAATTGGTTAAAAGAAATAAGAAAGTTCGAAAACTGGCATCTGGTTCTGAAATAGAGTGGGAGGCGAATAAGAAAAATAAGCTTGGAGGTAAGGAGGAAAGTA
TTTCTGACAAGGTGGTTGAGAATGGAAGGCGCATTAACATGGGAGTAGAAGCATCTCAATTACCCCTTGCTGGATTACTTTCAATGGAGGAGTCATTGTTGGAGTCGTGC
AAATGCCATCACTTTGAAGTGAGGGCTACTGAGAATATGCAAAAGAATCCTGAAACCATGGGGAATGAGGTTAAGAAGATTATTCTCTCCCAGGAAGCACGTGATAAGGT
TACCAAGGAGCTGTTTCACACTGTAGAGATCAACCCGAGGAAGATGGACAAAGTAGCAGGGAAAGAAGGAAACTCCGGGAAGAAAGAGCATAATCAGGTGCCACAGGGAA
AGGAGAATATTAAAAAGAAGCATAGGGATATAACAGATATCAAGCAGTGCAAACGTCCACTGTCCAGCCTTTTTAATTAA
mRNA sequenceShow/hide mRNA sequence
GCAGCCAACAACGACATGCCCAAATGGAATAAAATTAATGGACGGGCTTTTGGGATCACTCGTTCAATGATCCCATCTTCATCATGGATAGTCTTGAAAGTTCTTCATAA
TAAAGGTATTTGTGGCCACCGTGTAATTTCTTGTGGTGTTGTTTTCTCTCCCTCTTCTGCTGAGAGTGAAATCAACGTTTTTAAGAATGGCTTAGTTGATTCCATGTGTC
CCGTGCAGATCTGCAAACTATTTCATCGGGCACAAATTATTGGGCGTCGTTTTCCTATTTGTATGGTCAATATTAAAGGCTCCATTACTGAGGTTTCTAGTTTTGAAACA
GTTGCAAAAGATTCTGATGGAAACGAAAGAGTAACATCTTCTCAAATTCCAAAAAAATGTGATAAAAAGGACTTAATCCGGTGGAGGAACTCTATGCACCGGGATTTCAC
AGTTAACAGGTTCACTGTTCACATTTCTGCCTTGTTAGCAGTCGTTATATCAGCAAGAATTCTGCGTGGCTTAAGAATTGCAGCTCGTTTGGGTTTGTCACTCTCAAAGG
AAACTGAGACTGCGATGCGTAATCTTTCTCCTTCCATCACAAGATTGGCTAAGTCCAGGTTAATGATGGAATTGAATTATATGCTATCTTATGGAGCAGCCGTTCCTTCT
CTCTATTTGCTTCAGAGGTTCAACCTGCTTGAAATTCTGCTACCATTTCATGCTGCTTATCTTGATAAACAGGGCATTAAGAAATCTTCGCTAAGTTCTATAATGTTGAT
GAAATTGTTCTTCAATTTGGATAAATTGGTTTCTTGTGATCGGCCTTCAGACTGCAATATATGGGTTGCATTGTTGGCATTTCACATGGCATTAGTTAACAACCCTCAGA
ATTCTCTTATAGTTCTGGCCTTTGCTGCCACCTTGTACCATGGGGAGTGGAATGAAGGTATGAATTATGCAAGAGAAAACTCTCTTGTGCAGGTCAATTTAAGGCCAGAG
ATTACGAGATTGGCTCAATTTAAATCCGAGGAGGAGCTTGCTGATGGGGTTACTCATTTTGCTTTGAAAGTACAAGGTTGCATAACTGCTCTGACTTCAGCAGATTGTCT
CTTAGAAGCCATGTCAACATTTCCAGACTCTCCATACTCTAGTTTGGTATTTGTGTCAAAGGAAACTGCCAAACATGTTGCTAAAATTTTTGAAGTGCTGGTGAATGATG
TTGAATCCTACAAAAACAAGAGACAGAATTTTGAAATTGACTATCAACTGCTTGGCAAGGGTACTTTGAGTGAGAGTAGATATGTCTTGGGAAAAGTTATCTTGGAAACC
CTGAAAGATGCAATTCTGCAGGGAGATGGAAACATTCTTGACAGGAACCAAAATCTTCGTGTTGATGCTGCTACTGAGGAAACTTATAGTTCACCAGTCGCTGATTTGGT
GAAGGACCAATTGGTTAAAAGAAATAAGAAAGTTCGAAAACTGGCATCTGGTTCTGAAATAGAGTGGGAGGCGAATAAGAAAAATAAGCTTGGAGGTAAGGAGGAAAGTA
TTTCTGACAAGGTGGTTGAGAATGGAAGGCGCATTAACATGGGAGTAGAAGCATCTCAATTACCCCTTGCTGGATTACTTTCAATGGAGGAGTCATTGTTGGAGTCGTGC
AAATGCCATCACTTTGAAGTGAGGGCTACTGAGAATATGCAAAAGAATCCTGAAACCATGGGGAATGAGGTTAAGAAGATTATTCTCTCCCAGGAAGCACGTGATAAGGT
TACCAAGGAGCTGTTTCACACTGTAGAGATCAACCCGAGGAAGATGGACAAAGTAGCAGGGAAAGAAGGAAACTCCGGGAAGAAAGAGCATAATCAGGTGCCACAGGGAA
AGGAGAATATTAAAAAGAAGCATAGGGATATAACAGATATCAAGCAGTGCAAACGTCCACTGTCCAGCCTTTTTAATTAA
Protein sequenceShow/hide protein sequence
AANNDMPKWNKINGRAFGITRSMIPSSSWIVLKVLHNKGICGHRVISCGVVFSPSSAESEINVFKNGLVDSMCPVQICKLFHRAQIIGRRFPICMVNIKGSITEVSSFET
VAKDSDGNERVTSSQIPKKCDKKDLIRWRNSMHRDFTVNRFTVHISALLAVVISARILRGLRIAARLGLSLSKETETAMRNLSPSITRLAKSRLMMELNYMLSYGAAVPS
LYLLQRFNLLEILLPFHAAYLDKQGIKKSSLSSIMLMKLFFNLDKLVSCDRPSDCNIWVALLAFHMALVNNPQNSLIVLAFAATLYHGEWNEGMNYARENSLVQVNLRPE
ITRLAQFKSEEELADGVTHFALKVQGCITALTSADCLLEAMSTFPDSPYSSLVFVSKETAKHVAKIFEVLVNDVESYKNKRQNFEIDYQLLGKGTLSESRYVLGKVILET
LKDAILQGDGNILDRNQNLRVDAATEETYSSPVADLVKDQLVKRNKKVRKLASGSEIEWEANKKNKLGGKEESISDKVVENGRRINMGVEASQLPLAGLLSMEESLLESC
KCHHFEVRATENMQKNPETMGNEVKKIILSQEARDKVTKELFHTVEINPRKMDKVAGKEGNSGKKEHNQVPQGKENIKKKHRDITDIKQCKRPLSSLFN