; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0124 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0124
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionCleavage and polyadenylation specificity factor subunit 2
Genome locationMC10:882072..897470
RNA-Seq ExpressionMC10g0124
SyntenyMC10g0124
Gene Ontology termsGO:0006378 - mRNA polyadenylation (biological process)
GO:0006379 - mRNA cleavage (biological process)
GO:0006508 - proteolysis (biological process)
GO:0005847 - mRNA cleavage and polyadenylation specificity factor complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001279 - Metallo-beta-lactamase
IPR011108 - Zn-dependent metallo-hydrolase, RNA specificity domain
IPR022712 - Beta-Casp domain
IPR025069 - Cleavage and polyadenylation specificity factor 2, C-terminal
IPR035639 - CPSF2, metallo-hydrolase domain
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW41981.1 Cleavage and polyadenylation specificity factor subunit 2 [Vitis vinifera]0.059.64Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGL+APV+STEPVYRLGLLT+  
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QY------IARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRP
                +  +QVS+FDLFTLDDIDSAFQ VTRLTYSQN+HL GKGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVDFNHRKER LNGT+LESFVRP
Subjt:  QY------IARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRP

Query:  AVLITDAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTI
        AVLITDAYNALNNQP RRQ+D+EF       +D I KTLR +GNVLLPVDTAGRVLEL+ ILE YW +  LNYPI+FLTYV+SSTIDY+KSFLEWMSD+I
Subjt:  AVLITDAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTI

Query:  AKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGD
        AKSFEHTR+NAFLLKHVTLLI+KSEL+  PDGPK        LEAG+SHDIFVEWA DAKNLVLFSE+GQFATLARMLQADPPPKAVKVT+SKRVPL G+
Subjt:  AKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGD

Query:  ELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINP
        EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  AS DV   HV  +RDILIDGFVPPSTSVAPMFPFYEN+SEWDDFGEVINP
Subjt:  ELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINP

Query:  DDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSH
        +DYVIKDEDMDQA M                          + G D++GKLDE AA+LI D  PSKV+SNELTVQVKC L YMDFEGRSDGRSIKSILSH
Subjt:  DDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSH

Query:  VAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKS
        VAPLKLVLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE+AW+DAEVGKTE+G+LSLLPLS     H +
Subjt:  VAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKS

Query:  VLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAI--------------------------------------GVGIF--------
        V VGD+KMAD+KQFLASKGIQVEF+GGALRCGEYVT+RKV DA  +++I                                       V IF        
Subjt:  VLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAI--------------------------------------GVGIF--------

Query:  ----------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGD
                              MAS     SPSS QLR +  C+   + P+V VR  VRKLD +VR+       NG       G+ R G  +  S+S GD
Subjt:  ----------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGD

Query:  GFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE-------LLLDSDTGNDRL----------GENE
          SGWS SD  E+    ++K W GG+VG G+ G +LV+G++FAA+S+SKQN SR + QMEA++ Q E         L+S TG D +          GE+ 
Subjt:  GFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE-------LLLDSDTGNDRL----------GENE

Query:  KEDNSVNADDRTLAGKTG----------------NHEESS---SYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNND-----VNNVASFQEDFQSDSPS
         +D  V  D  +++   G                N+ ++S     +E+  L++N  GD + +   S +D ++++ N+D     + +  +   +F  +S S
Subjt:  KEDNSVNADDRTLAGKTG----------------NHEESS---SYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNND-----VNNVASFQEDFQSDSPS

Query:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAV---------GTEVLNSEPEM----NILKDGPDNSSNSNTNSLN------PKTDIQDETPD------T
        +       S    ++ D  V S  K  N+  ++         GT+   SE E      + KD P N S    + LN      P +D +   PD      T
Subjt:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAV---------GTEVLNSEPEM----NILKDGPDNSSNSNTNSLN------PKTDIQDETPD------T

Query:  SENY----DFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLN-------------EISDSSLHELSSVSGD----TAKESGFVDRETVTESSKQVL
        +++Y    D ++    + V +K  L D           YE    LN             +I + S  +L  ++      +A +  ++D +   E  K + 
Subjt:  SENY----DFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLN-------------EISDSSLHELSSVSGD----TAKESGFVDRETVTESSKQVL

Query:  NPTKTERLLSEATTSTLE-QQIERGLSEAAFV--------------SVTAYPL-----VDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLA-SAAIK
        N +  +     A    L+ +++E+ +   ++V              S   YP      V++Q K      N +  +  +    FSSAG+PAP A S ++K
Subjt:  NPTKTERLLSEATTSTLE-QQIERGLSEAAFV--------------SVTAYPL-----VDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLA-SAAIK

Query:  TLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLIS
         LPG+V+VPAVVDQVQGQAL+ALQVLKVIE +V+PSDLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFDDITPEDPDF+SIQGLAEAGLIS
Subjt:  TLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLIS

Query:  SKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKP
        SKLSR D+LS S +EDQ PFYFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PALVAD S GE GIIALAFGYTRLFQP+KP
Subjt:  SKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKP

Query:  VTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAI
        VTKAQAAIALATGE+SDIVSEELARIEAE+MAE AVA H ALV QVEK++NASFEK+LS+ER+K+DA+EK+AEEA+QELE+LR+ER+ +N++L+KE  AI
Subjt:  VTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAI

Query:  ESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVV
        ESEMEV SRLR+E+EEQLQ  MSNKVE+SYEKERI+KLRKEAE ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE+ GIKVVV
Subjt:  ESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVV

Query:  DSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTA
        D++LRE+ SA  TWLD++KQFSV  TV RAENL+DKL  M ++LRGKSK+++D I++KI  LIS LR+  S  G Q  +LK+ A+ +A  S  ELQQ+TA
Subjt:  DSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTA

Query:  ELRLALKEGAKRVVGDCREGVEKITQKFKT
        E  LA+KEG KRVVGDCR GVEK+TQKFKT
Subjt:  ELRLALKEGAKRVVGDCREGVEKITQKFKT

RVX01974.1 Cleavage and polyadenylation specificity factor subunit 2 [Vitis vinifera]0.058.95Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGL+APV+STEPVYRLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARK-------------------------------QVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDV
        QY++RK                               QVS+FDLFTLDDIDSAFQ VTRLTYSQN+HL GKGEGIVIAPHVAGHLLGGT+WKITKDGEDV
Subjt:  QYIARK-------------------------------QVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDV

Query:  IYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI
        IYAVDFNHRKER LNGT+LESFVRPAVLITDAYNALNNQP RRQ+D+EF       +D I KTLR +GNVLLPVDTAGRVLEL+ ILE YW +  LNYPI
Subjt:  IYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI

Query:  YFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA
        +FLTYV+SSTIDY+KSFLEWMSD+IAKSFEHTR+NAFLLKHVTLLI+KSEL+  PDGPK+VLASMASLEAG+SHDIFVEWA DAKNLVLFSE+GQFATLA
Subjt:  YFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA

Query:  RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPS
        RMLQADPPPKAVKVT+SKRVPL G+EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  AS DV   HV  +RDILIDGFVPPS
Subjt:  RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPS

Query:  TSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELTVQ
        TSVAPMFPFYEN+SEWDDFGEVINP+DYVIKDEDMDQA M                          + G D++GKLDE AA+LI D  PSKV+SNELTVQ
Subjt:  TSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELTVQ

Query:  VKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLD
        VKC L YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE+AW+D
Subjt:  VKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLD

Query:  AEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA------------------IVQSAIGVGIF---
        AEVGKTE+G+LSLLPLS     H +V VGD+KMAD   F  +K  +     G   CGEYVT+RKV DA                  ++     V IF   
Subjt:  AEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA------------------IVQSAIGVGIF---

Query:  ---------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARS
                                   MAS     SPSS QLR +  C+   + P+V VR  VRKLD +VR+       NG       G+ R G  +  S
Subjt:  ---------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARS

Query:  DSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE-------LLLDSDTGNDRL---------
        +S GD  SGWS SD  E+    ++K W GG+VG G+ G +LV+G++FAA+S+SKQN SR + QMEA++ Q E         L+S TG D +         
Subjt:  DSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE-------LLLDSDTGNDRL---------

Query:  -GENEKEDNSVNADDRTLAGKTG----------------NHEESS---SYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPS
         GE+  +D  V  D  +++   G                N+ ++S     +E+  L++N  GD + +   S +D ++++ N+D             D   
Subjt:  -GENEKEDNSVNADDRTLAGKTG----------------NHEESS---SYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPS

Query:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQ----DETPDTSENYDFSSEHKMNLVPEKLPL
         + S   G+ +S    +SS +S  +  ++ H V ++ +     +N  K  P+ S      + +P ++ +    DE      N  +   H +N    + P+
Subjt:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQ----DETPDTSENYDFSSEHKMNLVPEKLPL

Query:  YD------DSTSNHNSGNQYEPPGPLNE------ISDSS---LHELSSVSGDTAKESGFV--DRETVTESSKQVLNPTKTERLLSEATTSTLEQ------
         D      D      +   Y     LN       +SD     L EL  +   + ++   +  DRE +     +   P K+   L +   S ++       
Subjt:  YD------DSTSNHNSGNQYEPPGPLNE------ISDSS---LHELSSVSGDTAKESGFV--DRETVTESSKQVLNPTKTERLLSEATTSTLEQ------

Query:  -----QIERGLSEAAFVSV------TAYPLVDVQEKD---------HET----IMNSTAAKP---------ELQGIL---------------FSSAGVPA
             ++++ +   ++V V        Y  ++  EKD         H+       +ST+A P          LQ  +               FSSAG+PA
Subjt:  -----QIERGLSEAAFVSV------TAYPLVDVQEKD---------HET----IMNSTAAKP---------ELQGIL---------------FSSAGVPA

Query:  PLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQ
        P A S ++K LPG+V+VPAVVDQVQGQAL+ALQVLKVIE +V+PSDLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFDDITPEDPDF+SIQ
Subjt:  PLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQ

Query:  GLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGY
        GLAEAGLISSKLSR D+LS S +EDQ PFYFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PAL+AD S GE GIIALAFGY
Subjt:  GLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGY

Query:  TRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENL
        TRLFQP+KPVTKAQAAIALATGE+SDIVSEELARIEAE+MAE AVA H ALV QVEK++NASFEK+LS+ER+K+DA+EK+AEEA+QELE+LR+ER+ +N+
Subjt:  TRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENL

Query:  ALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRW
        +L+KE  AIESEMEV SRLR+E+EEQLQ  MSNKVE+SYEKERI+KLRKEAE ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRW
Subjt:  ALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRW

Query:  ERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRS
        E+ GIKVVVD++LRE+ SA  TWLD++KQFSV  TV RAENL+DKL  M ++LRGKSK+++D I++KI  LIS LR+  S  G Q  +LK+ A+ +A  S
Subjt:  ERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRS

Query:  TSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
          ELQQ+TAE  LA+KEG KRVVGDCR GVEK+TQKFKT
Subjt:  TSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT

RZC57052.1 hypothetical protein C5167_004361 [Papaver somniferum]0.059.28Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPL GVYNENPLSYLVS+DGFN LIDCGWND FD +LL+PLS VAST+DAVL+SH DTLHLGALPYA + LGL A V+ST+PV+RLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
         Y ++KQVS+FDLF+LDDID AFQ +  L YSQNHHL+GKGEGIVIAPHVAGHLLGGT+WKITKDGED+IYAVDFNHRKE HLNGT+LE+FVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEH
        AYNALN+Q  R + +K F       +  I+ TL   GN+LLPVDTAGRVLEL+ ++E YW +  L YPIYFLTYVSSSTI+Y KSFLEWMSDTIAKSFE 
Subjt:  AYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEH

Query:  TRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYE
        TR+NAFLLKHVTL+INKSELD  P+GPK+VLASMASLE G SHDIF EWA D KNLVLF+E+GQF TLARMLQADPPP+AVKVT+  RVPL G+EL+++E
Subjt:  TRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYE

Query:  EEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIK
        EEQNR KKEEALKAS+ KEE  KAS+G+D    DPM+IDAS+  AS +V   H  A+RDI IDGF+P STSVAPMFPFYEN+SEWDDFGE+INPDDYV+K
Subjt:  EEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIK

Query:  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY
        +EDMD  +MH GGD+DGKLDE AANLILD +PSK+VSNELTVQVKCSL YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLKNVCPHVY
Subjt:  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY

Query:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCG
        APQ+ ETIDVTSDLCAYKVQLSEKLMS VLFKKLGD+E+AW+D+ V +TE   LSL P+S A  PHKSV VGDLK+AD+KQFLASKG+QVEF+GGALRCG
Subjt:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCG

Query:  EYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGD
        EYVT+RKV DA  +   G                                                     ++  G   E +  ++  G  +  +  +G 
Subjt:  EYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGD

Query:  GFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSS--RQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADD-----
         F G   S++G+     R    +GG V  G+ G +L +G+ FA+  ++K+N+   +Q+ QME ++TQ ELLL SD   D++ +   +++ V  D+     
Subjt:  GFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSS--RQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADD-----

Query:  RTLAGKTGNHEESSSYTE-NEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSV---AAGSLSSLMETDSSVASGSKDGNDC
          L  KTG ++ SS+  E  +D+ ++   D    E      VE +S++ D N+ +S QED +S S     SV    + S  SL E   S    SK   D 
Subjt:  RTLAGKTGNHEESSSYTE-NEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSV---AAGSLSSLMETDSSVASGSKDGNDC

Query:  HAVGTEVLNSEPEMNILKDGPDNSSNS---NTNSLNPKTDIQDETPDTSE------NYDFSSEHKMNLVPEKLPL--------YDDSTSNHNSGNQYEPP
           G+  L +EP     K+ P N S+     +NS N +TD Q+ T D+ +        DFSS        + LPL          +  S+  S N  E  
Subjt:  HAVGTEVLNSEPEMNILKDGPDNSSNS---NTNSLNPKTDIQDETPDTSE------NYDFSSEHKMNLVPEKLPL--------YDDSTSNHNSGNQYEPP

Query:  GPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPEL
         P +   D +L E     G+   E   +++  + E+   + NPT                            S+ A P     +  +E   + + ++  L
Subjt:  GPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPEL

Query:  QGILFSSAGVPAP-LASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDD
            FSSAG+PAP L SAA++  PGKVLVPAVVDQVQGQAL+ALQVLKVIE +V+  DLCTRR+YARWLVS+S+ LSRN+ SKVYPAMYIENVT+LAFDD
Subjt:  QGILFSSAGVPAP-LASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDD

Query:  ITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVG
        I PEDPDFASIQGLAEAGLI+SKLSR D+L S D +Q PF FSPESPLSRQDLVSWKMAL+KRQLPE +RKM++Q+ GFID DKI+PDA PALVAD+S G
Subjt:  ITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVG

Query:  EHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELE
        + GIIALAFGYTRLFQPDKPVT AQAAIALATG+A+DIVSEELARIEAESMAE AVAAH ALVA+VEKD+NA+F K+L++E+EKVDA+EK AEEAK ELE
Subjt:  EHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELE

Query:  RLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQ
        +LR+ERE ++  L +   A+ESEMEV +RLR ELEEQLQ L+S++ E+++E+ERINKLR+EAE ENQ I +LQY+LEVERKALSMAR+WAE+EAKRARE 
Subjt:  RLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQ

Query:  AKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDL
         KALEEARDRW++ GI+VVVDSDL++ ESAG TW ++ K+ ++ ETV+RAE L+DKLK MA  ++GKSK ++++II KI  +I  L++  S A K+ ++ 
Subjt:  AKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDL

Query:  KEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
        + +A S+AS S  ELQ S      A+K+GA R+ G+C+EGVEKITQKFK++
Subjt:  KEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS

XP_022140920.1 uncharacterized protein LOC111011467 isoform X1 [Momordica charantia]0.099.8Show/hide
Query:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW
        MASTPATCSP SLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW
Subjt:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW

Query:  FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD
        FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD
Subjt:  FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD

Query:  GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN
        GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN
Subjt:  GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN

Query:  PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL
        PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL
Subjt:  PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL

Query:  SEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE
        SEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE
Subjt:  SEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE

Query:  AEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
        AEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
Subjt:  AEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ

Query:  DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM
        DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM
Subjt:  DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM

Query:  AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE
        AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEH AIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE
Subjt:  AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE

Query:  KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAE
        KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAE
Subjt:  KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAE

Query:  NLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
        NLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
Subjt:  NLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS

Query:  YGD
        YGD
Subjt:  YGD

XP_022140921.1 uncharacterized protein LOC111011467 isoform X2 [Momordica charantia]0.099.23Show/hide
Query:  RRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDK
        R   +  GLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDK
Subjt:  RRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDK

Query:  NTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSN
        NTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSN
Subjt:  NTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSN

Query:  TNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTK
        TNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTK
Subjt:  TNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTK

Query:  TERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQV
        TERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQV
Subjt:  TERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQV

Query:  LKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPES
        LKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPES
Subjt:  LKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPES

Query:  PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI
        PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI
Subjt:  PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI

Query:  EAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKV
        EAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEH AIESEMEVFSRLRNELEEQLQGLMSNKV
Subjt:  EAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKV

Query:  EVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKET
        EVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKET
Subjt:  EVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKET

Query:  VDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQ
        VDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQ
Subjt:  VDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQ

Query:  KFKTSYGD
        KFKTSYGD
Subjt:  KFKTSYGD

TrEMBL top hitse value%identityAlignment
A0A438E2N4 Cleavage and polyadenylation specificity factor subunit 20.059.64Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGL+APV+STEPVYRLGLLT+  
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QY------IARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRP
                +  +QVS+FDLFTLDDIDSAFQ VTRLTYSQN+HL GKGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVDFNHRKER LNGT+LESFVRP
Subjt:  QY------IARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRP

Query:  AVLITDAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTI
        AVLITDAYNALNNQP RRQ+D+EF       +D I KTLR +GNVLLPVDTAGRVLEL+ ILE YW +  LNYPI+FLTYV+SSTIDY+KSFLEWMSD+I
Subjt:  AVLITDAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTI

Query:  AKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGD
        AKSFEHTR+NAFLLKHVTLLI+KSEL+  PDGPK        LEAG+SHDIFVEWA DAKNLVLFSE+GQFATLARMLQADPPPKAVKVT+SKRVPL G+
Subjt:  AKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGD

Query:  ELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINP
        EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  AS DV   HV  +RDILIDGFVPPSTSVAPMFPFYEN+SEWDDFGEVINP
Subjt:  ELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINP

Query:  DDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSH
        +DYVIKDEDMDQA M                          + G D++GKLDE AA+LI D  PSKV+SNELTVQVKC L YMDFEGRSDGRSIKSILSH
Subjt:  DDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSH

Query:  VAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKS
        VAPLKLVLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE+AW+DAEVGKTE+G+LSLLPLS     H +
Subjt:  VAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKS

Query:  VLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAI--------------------------------------GVGIF--------
        V VGD+KMAD+KQFLASKGIQVEF+GGALRCGEYVT+RKV DA  +++I                                       V IF        
Subjt:  VLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAI--------------------------------------GVGIF--------

Query:  ----------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGD
                              MAS     SPSS QLR +  C+   + P+V VR  VRKLD +VR+       NG       G+ R G  +  S+S GD
Subjt:  ----------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGD

Query:  GFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE-------LLLDSDTGNDRL----------GENE
          SGWS SD  E+    ++K W GG+VG G+ G +LV+G++FAA+S+SKQN SR + QMEA++ Q E         L+S TG D +          GE+ 
Subjt:  GFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE-------LLLDSDTGNDRL----------GENE

Query:  KEDNSVNADDRTLAGKTG----------------NHEESS---SYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNND-----VNNVASFQEDFQSDSPS
         +D  V  D  +++   G                N+ ++S     +E+  L++N  GD + +   S +D ++++ N+D     + +  +   +F  +S S
Subjt:  KEDNSVNADDRTLAGKTG----------------NHEESS---SYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNND-----VNNVASFQEDFQSDSPS

Query:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAV---------GTEVLNSEPEM----NILKDGPDNSSNSNTNSLN------PKTDIQDETPD------T
        +       S    ++ D  V S  K  N+  ++         GT+   SE E      + KD P N S    + LN      P +D +   PD      T
Subjt:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAV---------GTEVLNSEPEM----NILKDGPDNSSNSNTNSLN------PKTDIQDETPD------T

Query:  SENY----DFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLN-------------EISDSSLHELSSVSGD----TAKESGFVDRETVTESSKQVL
        +++Y    D ++    + V +K  L D           YE    LN             +I + S  +L  ++      +A +  ++D +   E  K + 
Subjt:  SENY----DFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLN-------------EISDSSLHELSSVSGD----TAKESGFVDRETVTESSKQVL

Query:  NPTKTERLLSEATTSTLE-QQIERGLSEAAFV--------------SVTAYPL-----VDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLA-SAAIK
        N +  +     A    L+ +++E+ +   ++V              S   YP      V++Q K      N +  +  +    FSSAG+PAP A S ++K
Subjt:  NPTKTERLLSEATTSTLE-QQIERGLSEAAFV--------------SVTAYPL-----VDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLA-SAAIK

Query:  TLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLIS
         LPG+V+VPAVVDQVQGQAL+ALQVLKVIE +V+PSDLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFDDITPEDPDF+SIQGLAEAGLIS
Subjt:  TLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLIS

Query:  SKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKP
        SKLSR D+LS S +EDQ PFYFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PALVAD S GE GIIALAFGYTRLFQP+KP
Subjt:  SKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKP

Query:  VTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAI
        VTKAQAAIALATGE+SDIVSEELARIEAE+MAE AVA H ALV QVEK++NASFEK+LS+ER+K+DA+EK+AEEA+QELE+LR+ER+ +N++L+KE  AI
Subjt:  VTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAI

Query:  ESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVV
        ESEMEV SRLR+E+EEQLQ  MSNKVE+SYEKERI+KLRKEAE ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE+ GIKVVV
Subjt:  ESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVV

Query:  DSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTA
        D++LRE+ SA  TWLD++KQFSV  TV RAENL+DKL  M ++LRGKSK+++D I++KI  LIS LR+  S  G Q  +LK+ A+ +A  S  ELQQ+TA
Subjt:  DSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTA

Query:  ELRLALKEGAKRVVGDCREGVEKITQKFKT
        E  LA+KEG KRVVGDCR GVEK+TQKFKT
Subjt:  ELRLALKEGAKRVVGDCREGVEKITQKFKT

A0A438IZ50 Cleavage and polyadenylation specificity factor subunit 20.058.95Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGL+APV+STEPVYRLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARK-------------------------------QVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDV
        QY++RK                               QVS+FDLFTLDDIDSAFQ VTRLTYSQN+HL GKGEGIVIAPHVAGHLLGGT+WKITKDGEDV
Subjt:  QYIARK-------------------------------QVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDV

Query:  IYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI
        IYAVDFNHRKER LNGT+LESFVRPAVLITDAYNALNNQP RRQ+D+EF       +D I KTLR +GNVLLPVDTAGRVLEL+ ILE YW +  LNYPI
Subjt:  IYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI

Query:  YFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA
        +FLTYV+SSTIDY+KSFLEWMSD+IAKSFEHTR+NAFLLKHVTLLI+KSEL+  PDGPK+VLASMASLEAG+SHDIFVEWA DAKNLVLFSE+GQFATLA
Subjt:  YFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA

Query:  RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPS
        RMLQADPPPKAVKVT+SKRVPL G+EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  AS DV   HV  +RDILIDGFVPPS
Subjt:  RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPS

Query:  TSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELTVQ
        TSVAPMFPFYEN+SEWDDFGEVINP+DYVIKDEDMDQA M                          + G D++GKLDE AA+LI D  PSKV+SNELTVQ
Subjt:  TSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELTVQ

Query:  VKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLD
        VKC L YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE+AW+D
Subjt:  VKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLD

Query:  AEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA------------------IVQSAIGVGIF---
        AEVGKTE+G+LSLLPLS     H +V VGD+KMAD   F  +K  +     G   CGEYVT+RKV DA                  ++     V IF   
Subjt:  AEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA------------------IVQSAIGVGIF---

Query:  ---------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARS
                                   MAS     SPSS QLR +  C+   + P+V VR  VRKLD +VR+       NG       G+ R G  +  S
Subjt:  ---------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARS

Query:  DSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE-------LLLDSDTGNDRL---------
        +S GD  SGWS SD  E+    ++K W GG+VG G+ G +LV+G++FAA+S+SKQN SR + QMEA++ Q E         L+S TG D +         
Subjt:  DSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE-------LLLDSDTGNDRL---------

Query:  -GENEKEDNSVNADDRTLAGKTG----------------NHEESS---SYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPS
         GE+  +D  V  D  +++   G                N+ ++S     +E+  L++N  GD + +   S +D ++++ N+D             D   
Subjt:  -GENEKEDNSVNADDRTLAGKTG----------------NHEESS---SYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPS

Query:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQ----DETPDTSENYDFSSEHKMNLVPEKLPL
         + S   G+ +S    +SS +S  +  ++ H V ++ +     +N  K  P+ S      + +P ++ +    DE      N  +   H +N    + P+
Subjt:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQ----DETPDTSENYDFSSEHKMNLVPEKLPL

Query:  YD------DSTSNHNSGNQYEPPGPLNE------ISDSS---LHELSSVSGDTAKESGFV--DRETVTESSKQVLNPTKTERLLSEATTSTLEQ------
         D      D      +   Y     LN       +SD     L EL  +   + ++   +  DRE +     +   P K+   L +   S ++       
Subjt:  YD------DSTSNHNSGNQYEPPGPLNE------ISDSS---LHELSSVSGDTAKESGFV--DRETVTESSKQVLNPTKTERLLSEATTSTLEQ------

Query:  -----QIERGLSEAAFVSV------TAYPLVDVQEKD---------HET----IMNSTAAKP---------ELQGIL---------------FSSAGVPA
             ++++ +   ++V V        Y  ++  EKD         H+       +ST+A P          LQ  +               FSSAG+PA
Subjt:  -----QIERGLSEAAFVSV------TAYPLVDVQEKD---------HET----IMNSTAAKP---------ELQGIL---------------FSSAGVPA

Query:  PLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQ
        P A S ++K LPG+V+VPAVVDQVQGQAL+ALQVLKVIE +V+PSDLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFDDITPEDPDF+SIQ
Subjt:  PLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQ

Query:  GLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGY
        GLAEAGLISSKLSR D+LS S +EDQ PFYFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PAL+AD S GE GIIALAFGY
Subjt:  GLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGY

Query:  TRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENL
        TRLFQP+KPVTKAQAAIALATGE+SDIVSEELARIEAE+MAE AVA H ALV QVEK++NASFEK+LS+ER+K+DA+EK+AEEA+QELE+LR+ER+ +N+
Subjt:  TRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENL

Query:  ALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRW
        +L+KE  AIESEMEV SRLR+E+EEQLQ  MSNKVE+SYEKERI+KLRKEAE ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRW
Subjt:  ALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRW

Query:  ERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRS
        E+ GIKVVVD++LRE+ SA  TWLD++KQFSV  TV RAENL+DKL  M ++LRGKSK+++D I++KI  LIS LR+  S  G Q  +LK+ A+ +A  S
Subjt:  ERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRS

Query:  TSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
          ELQQ+TAE  LA+KEG KRVVGDCR GVEK+TQKFKT
Subjt:  TSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT

A0A4Y7JAF6 Beta-Casp domain-containing protein0.059.28Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPL GVYNENPLSYLVS+DGFN LIDCGWND FD +LL+PLS VAST+DAVL+SH DTLHLGALPYA + LGL A V+ST+PV+RLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
         Y ++KQVS+FDLF+LDDID AFQ +  L YSQNHHL+GKGEGIVIAPHVAGHLLGGT+WKITKDGED+IYAVDFNHRKE HLNGT+LE+FVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEH
        AYNALN+Q  R + +K F       +  I+ TL   GN+LLPVDTAGRVLEL+ ++E YW +  L YPIYFLTYVSSSTI+Y KSFLEWMSDTIAKSFE 
Subjt:  AYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEH

Query:  TRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYE
        TR+NAFLLKHVTL+INKSELD  P+GPK+VLASMASLE G SHDIF EWA D KNLVLF+E+GQF TLARMLQADPPP+AVKVT+  RVPL G+EL+++E
Subjt:  TRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYE

Query:  EEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIK
        EEQNR KKEEALKAS+ KEE  KAS+G+D    DPM+IDAS+  AS +V   H  A+RDI IDGF+P STSVAPMFPFYEN+SEWDDFGE+INPDDYV+K
Subjt:  EEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIK

Query:  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY
        +EDMD  +MH GGD+DGKLDE AANLILD +PSK+VSNELTVQVKCSL YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLKNVCPHVY
Subjt:  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY

Query:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCG
        APQ+ ETIDVTSDLCAYKVQLSEKLMS VLFKKLGD+E+AW+D+ V +TE   LSL P+S A  PHKSV VGDLK+AD+KQFLASKG+QVEF+GGALRCG
Subjt:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCG

Query:  EYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGD
        EYVT+RKV DA  +   G                                                     ++  G   E +  ++  G  +  +  +G 
Subjt:  EYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGD

Query:  GFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSS--RQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADD-----
         F G   S++G+     R    +GG V  G+ G +L +G+ FA+  ++K+N+   +Q+ QME ++TQ ELLL SD   D++ +   +++ V  D+     
Subjt:  GFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSS--RQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADD-----

Query:  RTLAGKTGNHEESSSYTE-NEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSV---AAGSLSSLMETDSSVASGSKDGNDC
          L  KTG ++ SS+  E  +D+ ++   D    E      VE +S++ D N+ +S QED +S S     SV    + S  SL E   S    SK   D 
Subjt:  RTLAGKTGNHEESSSYTE-NEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSV---AAGSLSSLMETDSSVASGSKDGNDC

Query:  HAVGTEVLNSEPEMNILKDGPDNSSNS---NTNSLNPKTDIQDETPDTSE------NYDFSSEHKMNLVPEKLPL--------YDDSTSNHNSGNQYEPP
           G+  L +EP     K+ P N S+     +NS N +TD Q+ T D+ +        DFSS        + LPL          +  S+  S N  E  
Subjt:  HAVGTEVLNSEPEMNILKDGPDNSSNS---NTNSLNPKTDIQDETPDTSE------NYDFSSEHKMNLVPEKLPL--------YDDSTSNHNSGNQYEPP

Query:  GPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPEL
         P +   D +L E     G+   E   +++  + E+   + NPT                            S+ A P     +  +E   + + ++  L
Subjt:  GPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPEL

Query:  QGILFSSAGVPAP-LASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDD
            FSSAG+PAP L SAA++  PGKVLVPAVVDQVQGQAL+ALQVLKVIE +V+  DLCTRR+YARWLVS+S+ LSRN+ SKVYPAMYIENVT+LAFDD
Subjt:  QGILFSSAGVPAP-LASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDD

Query:  ITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVG
        I PEDPDFASIQGLAEAGLI+SKLSR D+L S D +Q PF FSPESPLSRQDLVSWKMAL+KRQLPE +RKM++Q+ GFID DKI+PDA PALVAD+S G
Subjt:  ITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVG

Query:  EHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELE
        + GIIALAFGYTRLFQPDKPVT AQAAIALATG+A+DIVSEELARIEAESMAE AVAAH ALVA+VEKD+NA+F K+L++E+EKVDA+EK AEEAK ELE
Subjt:  EHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELE

Query:  RLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQ
        +LR+ERE ++  L +   A+ESEMEV +RLR ELEEQLQ L+S++ E+++E+ERINKLR+EAE ENQ I +LQY+LEVERKALSMAR+WAE+EAKRARE 
Subjt:  RLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQ

Query:  AKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDL
         KALEEARDRW++ GI+VVVDSDL++ ESAG TW ++ K+ ++ ETV+RAE L+DKLK MA  ++GKSK ++++II KI  +I  L++  S A K+ ++ 
Subjt:  AKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDL

Query:  KEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
        + +A S+AS S  ELQ S      A+K+GA R+ G+C+EGVEKITQKFK++
Subjt:  KEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS

A0A6J1CGI7 uncharacterized protein LOC111011467 isoform X10.099.8Show/hide
Query:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW
        MASTPATCSP SLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW
Subjt:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW

Query:  FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD
        FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD
Subjt:  FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD

Query:  GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN
        GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN
Subjt:  GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN

Query:  PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL
        PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL
Subjt:  PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL

Query:  SEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE
        SEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE
Subjt:  SEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE

Query:  AEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
        AEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
Subjt:  AEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ

Query:  DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM
        DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM
Subjt:  DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM

Query:  AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE
        AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEH AIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE
Subjt:  AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE

Query:  KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAE
        KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAE
Subjt:  KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAE

Query:  NLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
        NLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
Subjt:  NLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS

Query:  YGD
        YGD
Subjt:  YGD

A0A6J1CHG6 uncharacterized protein LOC111011467 isoform X20.099.23Show/hide
Query:  RRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDK
        R   +  GLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDK
Subjt:  RRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDK

Query:  NTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSN
        NTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSN
Subjt:  NTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSN

Query:  TNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTK
        TNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTK
Subjt:  TNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTK

Query:  TERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQV
        TERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQV
Subjt:  TERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQV

Query:  LKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPES
        LKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPES
Subjt:  LKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPES

Query:  PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI
        PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI
Subjt:  PLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARI

Query:  EAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKV
        EAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEH AIESEMEVFSRLRNELEEQLQGLMSNKV
Subjt:  EAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKV

Query:  EVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKET
        EVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKET
Subjt:  EVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKET

Query:  VDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQ
        VDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQ
Subjt:  VDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQ

Query:  KFKTSYGD
        KFKTSYGD
Subjt:  KFKTSYGD

SwissProt top hitse value%identityAlignment
Q10568 Cleavage and polyadenylation specificity factor subunit 22.4e-14237.44Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        M + +++T L GV  E+ L YL+ VD F FL+DCGW++HF   ++  L +    IDAVL+SHPD LHLGALPYA+ +LGL   +++T PVY++G + MYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT
         Y +R    +F LFTLDD+D+AF  + +L +SQ  +L GKG G+ I P  AGH++GGT+WKI KDG E+++YAVDFNH++E HLNG  LE   RP++LIT
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT

Query:  DAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY---FLTYVSSSTIDYIKSFLEWMSDTIAK
        D++NA   QP R+Q+D++    +L       +TLR +GNVL+ VDTAGRVLEL Q+L+  W  +     +Y    L  VS + +++ KS +EWMSD + +
Subjt:  DAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY---FLTYVSSSTIDYIKSFLEWMSDTIAK

Query:  SFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDEL
         FE  RNN F  +H++L    S+L   P  PKVVLAS   LE G+S D+F++W  D KN ++ + +    TLAR L  +P  K  ++ + KRV L G EL
Subjt:  SFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDEL

Query:  LAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDG-------FVPPSTSVAPMFPFYENTSEWDDFGEV
          Y E++  KKE A K      EQSK +   D D+ D    ++ +    D   +H + + D+++ G       F   +    PMFP  E   +WD++GE+
Subjt:  LAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDG-------FVPPSTSVAPMFPFYENTSEWDDFGEV

Query:  INPDDYVIKD---EDMDQAAMHAG-GDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHL
        I P+D+++ +    + +++ + +G  + D  +D+  +++     P+K +S   ++++K  + Y+D+EGRSDG SIK I++ + P +L++VHG  EA++ L
Subjt:  INPDDYVIKD---EDMDQAAMHAG-GDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHL

Query:  KQHCL----KNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW----LDAEVGKTENGTL-------------------------
         + C     K++   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW    LD  V K + G +                         
Subjt:  KQHCL----KNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW----LDAEVGKTENGTL-------------------------

Query:  --------------------------SLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRK
                                  +L PL    +P H+SV + + +++D+KQ L  +GIQ EF GG L C   V +R+
Subjt:  --------------------------SLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRK

Q652P4 Cleavage and polyadenylation specificity factor subunit 24.1e-30472.7Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPL G Y E PL YL++VDGF FL+DCGW D  DP+ LQPL++VA TIDAVL+SH DT+HLGALPYAMK LGL+APV++TEPV+RLG+LT+YD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
         +I+R+QVS+FDLFTLDDID+AFQ V RL YSQNH L+ KGEGIVIAPHVAGH LGGT+WKITKDGEDV+YAVDFNHRKERHLNGT L SFVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEH
        AYNALNN  Y+RQ+D++F       ID + K L   G+VLLP+DTAGRVLE++ ILE YW +  L YPIYFLT VS+ST+DY+KSFLEWM+D+I+KSFEH
Subjt:  AYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEH

Query:  TRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYE
        TR+NAFLLK VT +INK EL+   D PKVVLASMASLE G+SHDIFV+ AN+AKNLVLF+EKGQF TLARMLQ DPPPKAVKVT+SKR+PL GDEL AYE
Subjt:  TRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYE

Query:  EEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKD
        EEQ R KKEEALKASL KEE+ KAS G++    DPM+IDAS++  P   GS      DILIDGFVPPS+SVAPMFPF+ENTSEWDDFGEVINP+DY++K 
Subjt:  EEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKD

Query:  EDMDQAAMHAGGD-VDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY
        E+MD   M   GD +D  LDE +A L+LD  PSKV+SNE+TVQVKCSL YMDFEGRSDGRS+KS+++HVAPLKLVLVHG+AEATEHLK HC KN   HVY
Subjt:  EDMDQAAMHAGGD-VDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY

Query:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCG
        APQIEETIDVTSDLCAYKVQLSEKLMSNV+ KKLG++EIAW+DAEVGKT++  L+LLP S     HKSVLVGDLK+AD+KQFLA+KG+QVEFAGGALRCG
Subjt:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCG

Query:  EYVTIRKVADAIVQSAIG
        EY+T+RK+ DA  + + G
Subjt:  EYVTIRKVADAIVQSAIG

Q9LKF9 Cleavage and polyadenylation specificity factor subunit 20.0e+0078.28Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFLIDCGWND FD +LL+PLSRVASTIDAVL+SHPDTLH+GALPYAMKQLGL+APV++TEPV+RLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
        Q+++RKQVS+FDLFTLDDIDSAFQ V RLTYSQN+HLSGKGEGIVIAPHVAGH+LGG++W+ITKDGEDVIYAVD+NHRKERHLNGT+L+SFVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNAL-NNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFE
        AY+AL  NQ  R+Q+DKEF       +DTI K L   GNVLLPVDTAGRVLEL+ ILE +W +   ++PIYFLTYVSSSTIDY+KSFLEWMSD+I+KSFE
Subjt:  AYNAL-NNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFE

Query:  HTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAY
         +R+NAFLL+HVTLLINK++LDNAP GPKVVLASMASLEAG++ +IFVEWAND +NLVLF+E GQF TLARMLQ+ PPPK VKVT+SKRVPLAG+EL+AY
Subjt:  HTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAY

Query:  EEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIK
        EEEQNR K+EEAL+ASL+KEE++KASHG+D+++ +PMIID  +  + DV GSH  AY+DILIDGFVPPS+SVAPMFP+Y+NTSEWDDFGE+INPDDYVIK
Subjt:  EEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIK

Query:  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY
        DEDMD+ AMH GGDVDG+LDE  A+L+LD +PSKV+SNEL V V CSL  MD+EGRSDGRSIKS+++HV+PLKLVLVH  AEATEHLKQHCL N+CPHVY
Subjt:  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY

Query:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFA-GGALRC
        APQIEET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AW+D+EVGKTE    SLLP+  AA PHK VLVGDLK+AD+KQFL+SKG+QVEFA GGALRC
Subjt:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFA-GGALRC

Query:  GEYVTIRKV
        GEYVT+RKV
Subjt:  GEYVTIRKV

Q9P2I0 Cleavage and polyadenylation specificity factor subunit 22.4e-14237.56Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        M + +++T L GV  E+ L YL+ VD F FL+DCGW++HF   ++  L +    IDAVL+SHPD LHLGALPYA+ +LGL   +++T PVY++G + MYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT
         Y +R    +F LFTLDD+D+AF  + +L +SQ  +L GKG G+ I P  AGH++GGT+WKI KDG E+++YAVDFNH++E HLNG  LE   RP++LIT
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT

Query:  DAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY---FLTYVSSSTIDYIKSFLEWMSDTIAK
        D++NA   QP R+Q+D++    +L       +TLR +GNVL+ VDTAGRVLEL Q+L+  W  +     +Y    L  VS + +++ KS +EWMSD + +
Subjt:  DAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY---FLTYVSSSTIDYIKSFLEWMSDTIAK

Query:  SFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDEL
         FE  RNN F  +H++L    S+L   P  PKVVLAS   LE G+S D+F++W  D KN ++ + +    TLAR L  +P  K  ++ + KRV L G EL
Subjt:  SFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDEL

Query:  LAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDG-------FVPPSTSVAPMFPFYENTSEWDDFGEV
          Y E++  KKE A K      EQSK +   D D+ D   I+       D   +H + + D+++ G       F   +    PMFP  E   +WD++GE+
Subjt:  LAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDG-------FVPPSTSVAPMFPFYENTSEWDDFGEV

Query:  INPDDYVIKD---EDMDQAAMHAG-GDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHL
        I P+D+++ +    + +++ + +G  + D  +D+  +++     P+K +S   ++++K  + Y+D+EGRSDG SIK I++ + P +L++VHG  EA++ L
Subjt:  INPDDYVIKD---EDMDQAAMHAG-GDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHL

Query:  KQHCL----KNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW----LDAEVGKTENGTL-------------------------
         + C     K++   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW    LD  V K + G +                         
Subjt:  KQHCL----KNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW----LDAEVGKTENGTL-------------------------

Query:  --------------------------SLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRK
                                  +L PL    +P H+SV + + +++D+KQ L  +GIQ EF GG L C   V +R+
Subjt:  --------------------------SLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRK

Q9V3D6 Probable cleavage and polyadenylation specificity factor subunit 24.7e-14338.13Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        M + +++  + G  +E+P  Y++ +D    L+DCGW++ FD   ++ L R   T+DAVL+SHPD  HLGALPY + +LGL  P+++T PV+++G + MYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT
         Y++   + +FDLF+LDD+D+AF+ +T+L Y+Q   L  KG GI I P  AGH++GGT+WKI K G ED++YA DFNH+KERHL+G  L+   RP++LIT
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT

Query:  DAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEES---LNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAK
        DAYNA   Q  RR +D++    IL       +T+R NGNVL+ VDTAGRVLEL  +L+  W+ +    + Y +  L  VS + I++ KS +EWMSD + K
Subjt:  DAYNALNNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEES---LNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAK

Query:  SFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA-RMLQADPPPKAVKVTVSKRVPLAGDE
        +FE  RNN F  KH+ L  + +++   P GPKVVLAS   LE+G++ D+FV+WA++A N ++ + +    TLA  +++   P K +++ V +RV L G E
Subjt:  SFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA-RMLQADPPPKAVKVTVSKRVPLAGDE

Query:  LLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDY
        L    EE  R + E L   ++K +  + S     D  +  +I    +      G H S        GF   +     MFP++E   + D++GE+IN DDY
Subjt:  LLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDY

Query:  VIKDE----------------DMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTA
         I D                   ++  + A    +G + +    L+   KP+K++S   T++V   +  +DFEGRSDG S+  ILS + P +++++HGTA
Subjt:  VIKDE----------------DMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTA

Query:  EATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGK-------------------TENGTLSLLPLSKA
        E T+ + +HC +NV   V+ PQ  E IDVTS++  Y+V+L+E L+S + F+K  D E+AW+D  +G                     E  TL+L  L+  
Subjt:  EATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGK-------------------TENGTLSLLPLSKA

Query:  ALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA
         +P H SVL+ +LK++D+KQ L    I  EF+GG L C       +  DA
Subjt:  ALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA

Arabidopsis top hitse value%identityAlignment
AT3G25680.1 FUNCTIONS IN: molecular_function unknown9.4e-5433.04Show/hide
Query:  AFVSVTAYPLVDVQEKDH---ETIM---NSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRRE
        +F ++  Y  V++ + D+   ET+    N++ ++   + + + S  V    +++  KT   +V  P  VD  Q +A++ L+ LK+ E ++   +LCT+RE
Subjt:  AFVSVTAYPLVDVQEKDH---ETIM---NSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPSDLCTRRE

Query:  YARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQ
        YARWLV ++S L RN    + PA+ +   +  AFDDI   DPDF  IQ LAEAG+ SSKLS  D  +    D G   F+PES +SR DLV+WK  LE   
Subjt:  YARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQ

Query:  LPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVA
         PE   ++      +IDT  I+PD       D  +G+   I   FG  + FQP++PVTKAQAA+AL +G+    ++ EL+R+EAES+++ A         
Subjt:  LPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVA

Query:  QVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEI
          + +I   +++++  ER +   +E++      E+E  ++ +E+ +   +KE  AI+ + ++ + L  E++E  Q L+S+K     E  ++ ++  + + 
Subjt:  QVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEI

Query:  ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE
        + + +   +  LE E +AL + R+W EDE K ++ +AK LEEA  RW+
Subjt:  ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE

AT5G23880.1 cleavage and polyadenylation specificity factor 1000.0e+0078.28Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFLIDCGWND FD +LL+PLSRVASTIDAVL+SHPDTLH+GALPYAMKQLGL+APV++TEPV+RLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
        Q+++RKQVS+FDLFTLDDIDSAFQ V RLTYSQN+HLSGKGEGIVIAPHVAGH+LGG++W+ITKDGEDVIYAVD+NHRKERHLNGT+L+SFVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNAL-NNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFE
        AY+AL  NQ  R+Q+DKEF       +DTI K L   GNVLLPVDTAGRVLEL+ ILE +W +   ++PIYFLTYVSSSTIDY+KSFLEWMSD+I+KSFE
Subjt:  AYNAL-NNQPYRRQKDKEFGVTILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFE

Query:  HTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAY
         +R+NAFLL+HVTLLINK++LDNAP GPKVVLASMASLEAG++ +IFVEWAND +NLVLF+E GQF TLARMLQ+ PPPK VKVT+SKRVPLAG+EL+AY
Subjt:  HTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAY

Query:  EEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIK
        EEEQNR K+EEAL+ASL+KEE++KASHG+D+++ +PMIID  +  + DV GSH  AY+DILIDGFVPPS+SVAPMFP+Y+NTSEWDDFGE+INPDDYVIK
Subjt:  EEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIK

Query:  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY
        DEDMD+ AMH GGDVDG+LDE  A+L+LD +PSKV+SNEL V V CSL  MD+EGRSDGRSIKS+++HV+PLKLVLVH  AEATEHLKQHCL N+CPHVY
Subjt:  DEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVY

Query:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFA-GGALRC
        APQIEET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AW+D+EVGKTE    SLLP+  AA PHK VLVGDLK+AD+KQFL+SKG+QVEFA GGALRC
Subjt:  APQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFA-GGALRC

Query:  GEYVTIRKV
        GEYVT+RKV
Subjt:  GEYVTIRKV

AT5G23890.1 LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope1.3e-20147Show/hide
Query:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLD-LRRKT
        MAS  AT +P+SLQLRLAL+     K P+V +R          R + Y IV    + ++      +G     S S+ D  +GW +SD+ ++    +++K+
Subjt:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLD-LRRKT

Query:  WFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG
           G+VG G+ G IL  G+++AA S SK+    +K +M +L++QQE ++ S   +D +  +E               K  N EES+   E++ ++ N V 
Subjt:  WFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG

Query:  ----DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSN-S
            +G   +KL G   E+SS +  + + A   E    ++P A   V A       ETD   A   K  ++       +L+S  E  +L     N     
Subjt:  ----DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSN-S

Query:  NTNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKES--GFVDRETVTESSKQVLN
        NTNS +P++ +  E  + S+  +  +  K           +DS S+ +  + Y   G + E     L E+SS    T+K       D ET   +++++  
Subjt:  NTNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKES--GFVDRETVTESSKQVLN

Query:  PTKTERLLSEATTSTLE-----QQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQG
           T      +  S++      +++E   S     +  +   +++  +D E   N    +    G  FSSAG+PAP  S  +   PGK+LVP   DQ+Q 
Subjt:  PTKTERLLSEATTSTLE-----QQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQG

Query:  QALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQG
        QA +ALQVLKVIE + +PSDLCTRREYARWL+SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDF+SIQGLAEAGLI+SKLS  D+L   D+ +G
Subjt:  QALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQG

Query:  PFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDI
         F FSPES LSRQDL+SWKMALEKRQLPEAD+KML+++SGFID DKI+PDA P+++ADLS GE GI ALAFG TRLFQP KPVTK QAAIAL++GEASDI
Subjt:  PFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDI

Query:  VSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQL
        VSEELARIEAESMAE AV+AH ALVA+VEKD+NASFEK+LS+EREK++AVEKMAE AK ELE+LR +RE ENLAL+KE  A+ESEMEV SRLR + EE+L
Subjt:  VSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQL

Query:  QGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLRE---QESAGDTWL
        + LMSNK E+++EKER+  LRKEAE E+Q I++LQYELEVERKALSMAR+WAE+EAK+AREQ +ALEEAR RWE  G++VVVD DL+E   +E+     L
Subjt:  QGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLRE---QESAGDTWL

Query:  DSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVG
        +  ++ SV+ET  RA+ LMDKLK MA  + GKS+E++  ++EKI L I+ L+++  + GK+A ++++ AI RA  + ++++Q T +    + +  K++  
Subjt:  DSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVG

Query:  DCREGVEKITQKFKT
        +CR+GV KI+Q+FKT
Subjt:  DCREGVEKITQKFKT

AT5G52410.1 CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119)1.5e-16063.53Show/hide
Query:  SALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFY
        +ALQ LKVIE++  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS +++ SS   +     
Subjt:  SALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFY

Query:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE
        FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GEHGI AL+FG TRLFQP K VTKAQ A++LA G+A ++V E
Subjt:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE

Query:  ELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGL
        ELARIEAE+MAEN V AH  LVAQVEKDINASFEK+L  E+E VDAVEK+AEEAK EL RLR E+E E LAL +E  +IE+EME  +R+RNELEEQLQ L
Subjt:  ELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGL

Query:  MSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGD-TWLDSSKQ
         SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWE+ G+KV+VDSDL EQ +  + TWL++ KQ
Subjt:  MSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGD-TWLDSSKQ

Query:  FSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREG
          V+ T+ RA NL+ KLK MA ++  KS+E++  IIEKI+LLIS L+Q V     +A+DLK    S+A     +      E+R      AK  V + ++ 
Subjt:  FSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREG

Query:  VEKITQKFKT
        V K+ +KFK+
Subjt:  VEKITQKFKT

AT5G52410.2 INVOLVED IN: biological_process unknown5.9e-16557.03Show/hide
Query:  QVLNPTKTERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQ
        QVL+P +    +  A+ ST  ++      +A+  S   Y        + + +   T+  P  +       G+PAP     + +L  K + P VVD VQ Q
Subjt:  QVLNPTKTERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQ

Query:  ALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGP
          +ALQ LKVIE++  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS +++ SS   +   
Subjt:  ALSALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGP

Query:  FYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIV
          FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GEHGI AL+FG TRLFQP K VTKAQ A++LA G+A ++V
Subjt:  FYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIV

Query:  SEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQ
         EELARIEAE+MAEN V AH  LVAQVEKDINASFEK+L  E+E VDAVEK+AEEAK EL RLR E+E E LAL +E  +IE+EME  +R+RNELEEQLQ
Subjt:  SEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQ

Query:  GLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGD-TWLDSS
         L SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWE+ G+KV+VDSDL EQ +  + TWL++ 
Subjt:  GLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGD-TWLDSS

Query:  KQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCR
        KQ  V+ T+ RA NL+ KLK MA ++  KS+E++  IIEKI+LLIS L+Q V     +A+DLK    S+A     +      E+R      AK  V + +
Subjt:  KQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCR

Query:  EGVEKITQKFKT
        + V K+ +KFK+
Subjt:  EGVEKITQKFKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACCTCTGTTCAGGTGACGCCTCTTTGCGGGGTCTACAATGAAAACCCTCTATCATATTTGGTCTCCGTCGACGGCTTCAACTTCCTTATTGACTGTGGTTGGAA
CGACCACTTCGACCCTGCTCTTCTTCAACCTCTCTCCAGGGTGGCATCGACGATAGATGCAGTTTTGATATCACATCCTGATACACTTCACCTTGGTGCCCTTCCTTATG
CCATGAAACAGCTTGGACTTGCTGCTCCAGTTTTTTCCACTGAACCCGTGTATCGATTGGGCCTTCTTACAATGTATGATCAGTATATTGCTAGGAAGCAAGTATCGGAG
TTTGATCTATTTACATTGGATGATATCGATTCTGCTTTCCAAGTTGTAACCAGACTAACATACTCCCAGAATCATCATCTTTCAGGCAAAGGAGAGGGAATAGTTATTGC
ACCTCACGTGGCTGGGCATCTTTTGGGGGGAACCCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGACTTCAACCACCGCAAGGAAAGGCATCTGA
ACGGAACCATTTTAGAGTCATTTGTGCGACCTGCTGTATTGATAACTGATGCTTATAATGCTCTAAATAATCAACCTTACAGACGCCAAAAGGACAAAGAATTCGGAGTT
ACCATTTTGGTTAACATAGATACTATCCAGAAGACCTTAAGAGCTAATGGAAATGTGTTACTTCCTGTTGATACTGCTGGGCGAGTGTTGGAGCTTATTCAAATTTTGGA
ATGGTACTGGGAAGAAGAAAGTTTAAATTATCCCATCTACTTTTTAACTTATGTTTCATCAAGCACAATTGATTATATCAAGAGTTTCCTGGAGTGGATGAGTGATACAA
TAGCAAAGTCTTTTGAACACACACGGAACAATGCTTTTCTTCTCAAGCATGTGACCCTTCTAATCAATAAAAGTGAACTTGATAATGCTCCAGATGGTCCAAAGGTTGTT
CTAGCATCAATGGCTAGTTTGGAAGCTGGTTACTCACATGACATTTTTGTTGAGTGGGCAAATGATGCCAAAAATCTCGTGCTTTTTTCTGAGAAAGGCCAGTTTGCCAC
TTTGGCTCGTATGCTTCAAGCAGATCCACCTCCAAAAGCTGTTAAGGTAACTGTGTCTAAGAGAGTTCCTTTGGCTGGAGACGAGCTACTTGCTTATGAAGAAGAGCAAA
ACAGGAAAAAGGAAGAAGCTCTTAAGGCTAGTTTGCTCAAGGAGGAACAATCCAAAGCATCTCATGGAACTGATAATGATACTGGTGATCCAATGATCATTGATGCTAGC
AGTAATGCATCACCAGATGTAGGTGGTTCACATGTAAGTGCATATCGAGATATATTAATTGATGGCTTCGTTCCTCCTTCAACAAGCGTTGCTCCAATGTTTCCCTTTTA
TGAAAACACTTCCGAATGGGATGATTTTGGTGAAGTAATAAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAAGCAGCGATGCATGCTGGTGGGGACGTGG
ATGGAAAACTAGATGAAACTGCTGCTAACCTGATCCTGGATATGAAGCCTTCAAAAGTTGTATCTAACGAATTGACAGTCCAAGTGAAATGTTCATTGCATTACATGGAT
TTTGAAGGTCGTTCAGATGGGAGATCAATTAAATCAATACTCTCCCACGTTGCTCCCTTGAAGCTTGTTTTGGTGCATGGAACTGCTGAGGCCACTGAGCATCTTAAGCA
ACATTGCCTAAAAAACGTCTGTCCCCATGTCTATGCCCCCCAAATTGAAGAGACGATTGATGTGACTTCTGATCTGTGTGCATATAAGGTACAACTTTCAGAGAAACTGA
TGAGTAATGTGCTGTTCAAGAAGCTTGGAGATTATGAAATCGCTTGGCTTGATGCTGAAGTAGGGAAGACTGAGAATGGAACGTTGTCTTTACTTCCCCTCTCAAAGGCT
GCTTTGCCTCATAAATCTGTTCTTGTTGGTGACTTAAAGATGGCTGACTACAAACAATTTCTTGCCAGCAAGGGAATACAGGTGGAATTTGCTGGGGGTGCTTTGAGATG
CGGTGAGTATGTCACAATACGCAAGGTTGCAGATGCAATTGTTCAATCTGCTATTGGAGTAGGAATTTTCATGGCTTCCACTCCTGCCACGTGTTCACCGAGCTCTCTCC
AGCTTCGGCTAGCGCTGAATTGCAAGAATTGCGCGAAATTCCCTTCAGTTCTTGTTCGGGCGAGAGTGAGGAAGCTGGATCCTCGGGTCCGCATGACTTGTTATCCTATT
GTTTATAATGGCGCGATAATCGAAAGAGCAAATGGGCAGCGCCGTAGTGGAGTTTGTTTTGCTCGATCGGATTCGACAGGCGATGGATTCTCAGGGTGGTCGGAATCGGA
TTCCGGGGAGGAGGTTTTGGACTTGCGGAGAAAGACGTGGTTTGGAGGGTTGGTGGGAATTGGAATTACTGGATTCATTCTTGTTTCGGGAATCACCTTTGCAGCATGGT
CAATAAGCAAGCAGAATAGTTCCAGACAAAAGCCACAAATGGAGGCCTTAAGTACGCAGCAAGAATTATTATTGGACTCTGATACTGGAAATGATAGGCTGGGTGAAAAT
GAAAAAGAAGATAACAGTGTGAATGCAGATGATAGAACTCTCGCTGGTAAAACAGGTAATCATGAGGAATCTTCTTCATATACAGAAAATGAAGATCTCGACAAAAATAC
AGTTGGTGATGGTGTTGATGTCGAGAAGTTATCAGGAAATGATGTTGAATCTTCATCCAGCAATAATGATGTCAATAATGTTGCTTCCTTCCAAGAAGATTTCCAATCTG
ATTCCCCATCAGCTGTTACGTCAGTTGCTGCCGGAAGTTTGAGTTCACTTATGGAGACTGATTCTAGTGTAGCTTCCGGTTCTAAAGATGGAAACGACTGTCATGCTGTT
GGTACAGAAGTCTTAAATTCTGAACCTGAAATGAACATATTGAAAGATGGCCCAGATAACTCTTCTAATTCTAATACTAATTCATTAAACCCTAAAACTGATATTCAGGA
TGAAACACCTGACACTAGTGAAAATTATGATTTCAGCTCTGAGCATAAAATGAACCTAGTACCTGAGAAGTTACCACTCTATGATGATAGTACATCAAACCATAATTCTG
GCAACCAGTATGAACCACCTGGGCCTCTTAATGAAATTTCAGATTCTTCGTTGCATGAACTTTCTAGCGTGTCTGGCGACACAGCCAAGGAATCAGGATTTGTTGACAGA
GAGACTGTGACTGAATCATCTAAACAAGTTCTGAACCCCACCAAAACTGAAAGGCTCTTATCTGAGGCAACTACATCAACCTTAGAACAGCAAATAGAAAGAGGATTATC
TGAAGCAGCATTTGTCTCTGTCACAGCTTATCCATTGGTTGATGTTCAAGAGAAAGATCATGAAACTATCATGAATAGTACTGCTGCCAAACCGGAACTACAAGGGATTT
TATTTTCTTCTGCAGGTGTTCCTGCTCCTTTGGCTTCTGCAGCTATAAAAACACTTCCTGGAAAGGTCCTAGTTCCTGCAGTTGTGGATCAGGTTCAGGGGCAGGCATTG
TCAGCACTGCAAGTTTTAAAGGTGATAGAGGCTGAGGTTGAACCTAGTGATCTATGTACACGTCGGGAATATGCTCGTTGGCTGGTGTCTGCAAGCAGTGCTCTTTCAAG
GAACACAACATCTAAAGTATATCCAGCAATGTATATAGAGAATGTTACTGAGTTGGCATTTGATGATATTACTCCCGAAGATCCTGATTTTGCATCTATTCAAGGTTTGG
CAGAAGCTGGACTGATTTCAAGCAAACTTTCGAGACATGACATTCTTTCTTCCTTTGATGAAGACCAGGGTCCTTTTTATTTCTCTCCCGAAAGCCCCCTATCACGTCAA
GATCTTGTGAGTTGGAAGATGGCCCTCGAGAAAAGACAGTTGCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGATTTATAGATACTGATAAGATCCATCCCGA
TGCTTGTCCTGCCCTTGTTGCTGATCTCTCTGTAGGAGAGCATGGAATTATAGCTCTAGCATTTGGATATACAAGGCTTTTCCAGCCGGATAAGCCTGTAACAAAAGCAC
AAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCTGATATAGTAAGTGAGGAGCTTGCAAGGATTGAAGCAGAATCAATGGCAGAAAATGCAGTTGCTGCACATGGTGCT
CTAGTAGCTCAAGTTGAGAAAGATATTAATGCCAGCTTTGAGAAACAACTCTCAATTGAAAGGGAAAAGGTCGATGCCGTGGAGAAAATGGCAGAAGAGGCAAAGCAAGA
GCTGGAGAGATTAAGATCAGAAAGAGAGAGAGAAAATCTCGCCTTGATGAAGGAACATGTTGCCATTGAATCGGAAATGGAAGTTTTTTCAAGATTAAGGAATGAGTTGG
AGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAGGTATCTTATGAAAAGGAAAGAATCAATAAACTCAGGAAAGAAGCTGAAATCGAAAACCAGGAGATTGCCCGC
CTGCAATATGAGCTTGAGGTCGAGAGAAAGGCGTTGTCCATGGCCAGAGCTTGGGCAGAGGACGAAGCGAAAAGAGCAAGAGAACAAGCCAAAGCACTTGAAGAGGCTAG
AGATCGCTGGGAAAGGCGTGGAATCAAAGTTGTCGTTGACAGCGATCTCCGTGAACAGGAATCCGCTGGCGATACTTGGCTCGATTCTAGCAAACAGTTTTCAGTCAAAG
AAACTGTCGACAGGGCCGAGAACTTAATGGACAAGCTGAAAGTAATGGCTGCAGAACTAAGAGGGAAATCCAAAGAGATAGTCGACAAGATCATCGAGAAGATAGCTTTA
CTAATATCAAACTTGAGGCAATGGGTTTCCGACGCTGGAAAACAGGCTGAAGATCTAAAGGAAGTGGCCATTTCAAGGGCAAGTAGATCAACAAGTGAGCTGCAACAGAG
CACTGCGGAGTTGAGGTTGGCGCTGAAAGAGGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTACCCAAAAGTTCAAAACATCGTATGGTGATT
GA
mRNA sequenceShow/hide mRNA sequence
ATTCGAACTTCAGATTTGAGAAAACTATTGAAATCCCACATGAGAGTCAGAAGATTTTGAGAGCAAAAAATTTCAGTTGTTCTTCCTCCCAAATTTGAGAATGCAAAGGT
TAAAAGAGAGTACTATAGTTTCATTTAAAACAAGGGATATAATTGTCTAACTTCGGTCAATTTCTTAAACCAAAAACCTGTTGGGTCATTTAAAACAAGGGGTCAATTGT
CTAACGACGCCGTGTTGCTTGATTGGAACCAAAAAAAAAAAAAAAAAAAAATACCGCCCCTCCAATTTCCCTAAACCCTTCTTGCTCCCCCCAAGCGAGGAGGGAGTTGC
AGAGAGCTTGGAAGCTAGATAGCAATGGGAACCTCTGTTCAGGTGACGCCTCTTTGCGGGGTCTACAATGAAAACCCTCTATCATATTTGGTCTCCGTCGACGGCTTCAA
CTTCCTTATTGACTGTGGTTGGAACGACCACTTCGACCCTGCTCTTCTTCAACCTCTCTCCAGGGTGGCATCGACGATAGATGCAGTTTTGATATCACATCCTGATACAC
TTCACCTTGGTGCCCTTCCTTATGCCATGAAACAGCTTGGACTTGCTGCTCCAGTTTTTTCCACTGAACCCGTGTATCGATTGGGCCTTCTTACAATGTATGATCAGTAT
ATTGCTAGGAAGCAAGTATCGGAGTTTGATCTATTTACATTGGATGATATCGATTCTGCTTTCCAAGTTGTAACCAGACTAACATACTCCCAGAATCATCATCTTTCAGG
CAAAGGAGAGGGAATAGTTATTGCACCTCACGTGGCTGGGCATCTTTTGGGGGGAACCCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGACTTCA
ACCACCGCAAGGAAAGGCATCTGAACGGAACCATTTTAGAGTCATTTGTGCGACCTGCTGTATTGATAACTGATGCTTATAATGCTCTAAATAATCAACCTTACAGACGC
CAAAAGGACAAAGAATTCGGAGTTACCATTTTGGTTAACATAGATACTATCCAGAAGACCTTAAGAGCTAATGGAAATGTGTTACTTCCTGTTGATACTGCTGGGCGAGT
GTTGGAGCTTATTCAAATTTTGGAATGGTACTGGGAAGAAGAAAGTTTAAATTATCCCATCTACTTTTTAACTTATGTTTCATCAAGCACAATTGATTATATCAAGAGTT
TCCTGGAGTGGATGAGTGATACAATAGCAAAGTCTTTTGAACACACACGGAACAATGCTTTTCTTCTCAAGCATGTGACCCTTCTAATCAATAAAAGTGAACTTGATAAT
GCTCCAGATGGTCCAAAGGTTGTTCTAGCATCAATGGCTAGTTTGGAAGCTGGTTACTCACATGACATTTTTGTTGAGTGGGCAAATGATGCCAAAAATCTCGTGCTTTT
TTCTGAGAAAGGCCAGTTTGCCACTTTGGCTCGTATGCTTCAAGCAGATCCACCTCCAAAAGCTGTTAAGGTAACTGTGTCTAAGAGAGTTCCTTTGGCTGGAGACGAGC
TACTTGCTTATGAAGAAGAGCAAAACAGGAAAAAGGAAGAAGCTCTTAAGGCTAGTTTGCTCAAGGAGGAACAATCCAAAGCATCTCATGGAACTGATAATGATACTGGT
GATCCAATGATCATTGATGCTAGCAGTAATGCATCACCAGATGTAGGTGGTTCACATGTAAGTGCATATCGAGATATATTAATTGATGGCTTCGTTCCTCCTTCAACAAG
CGTTGCTCCAATGTTTCCCTTTTATGAAAACACTTCCGAATGGGATGATTTTGGTGAAGTAATAAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAAGCAG
CGATGCATGCTGGTGGGGACGTGGATGGAAAACTAGATGAAACTGCTGCTAACCTGATCCTGGATATGAAGCCTTCAAAAGTTGTATCTAACGAATTGACAGTCCAAGTG
AAATGTTCATTGCATTACATGGATTTTGAAGGTCGTTCAGATGGGAGATCAATTAAATCAATACTCTCCCACGTTGCTCCCTTGAAGCTTGTTTTGGTGCATGGAACTGC
TGAGGCCACTGAGCATCTTAAGCAACATTGCCTAAAAAACGTCTGTCCCCATGTCTATGCCCCCCAAATTGAAGAGACGATTGATGTGACTTCTGATCTGTGTGCATATA
AGGTACAACTTTCAGAGAAACTGATGAGTAATGTGCTGTTCAAGAAGCTTGGAGATTATGAAATCGCTTGGCTTGATGCTGAAGTAGGGAAGACTGAGAATGGAACGTTG
TCTTTACTTCCCCTCTCAAAGGCTGCTTTGCCTCATAAATCTGTTCTTGTTGGTGACTTAAAGATGGCTGACTACAAACAATTTCTTGCCAGCAAGGGAATACAGGTGGA
ATTTGCTGGGGGTGCTTTGAGATGCGGTGAGTATGTCACAATACGCAAGGTTGCAGATGCAATTGTTCAATCTGCTATTGGAGTAGGAATTTTCATGGCTTCCACTCCTG
CCACGTGTTCACCGAGCTCTCTCCAGCTTCGGCTAGCGCTGAATTGCAAGAATTGCGCGAAATTCCCTTCAGTTCTTGTTCGGGCGAGAGTGAGGAAGCTGGATCCTCGG
GTCCGCATGACTTGTTATCCTATTGTTTATAATGGCGCGATAATCGAAAGAGCAAATGGGCAGCGCCGTAGTGGAGTTTGTTTTGCTCGATCGGATTCGACAGGCGATGG
ATTCTCAGGGTGGTCGGAATCGGATTCCGGGGAGGAGGTTTTGGACTTGCGGAGAAAGACGTGGTTTGGAGGGTTGGTGGGAATTGGAATTACTGGATTCATTCTTGTTT
CGGGAATCACCTTTGCAGCATGGTCAATAAGCAAGCAGAATAGTTCCAGACAAAAGCCACAAATGGAGGCCTTAAGTACGCAGCAAGAATTATTATTGGACTCTGATACT
GGAAATGATAGGCTGGGTGAAAATGAAAAAGAAGATAACAGTGTGAATGCAGATGATAGAACTCTCGCTGGTAAAACAGGTAATCATGAGGAATCTTCTTCATATACAGA
AAATGAAGATCTCGACAAAAATACAGTTGGTGATGGTGTTGATGTCGAGAAGTTATCAGGAAATGATGTTGAATCTTCATCCAGCAATAATGATGTCAATAATGTTGCTT
CCTTCCAAGAAGATTTCCAATCTGATTCCCCATCAGCTGTTACGTCAGTTGCTGCCGGAAGTTTGAGTTCACTTATGGAGACTGATTCTAGTGTAGCTTCCGGTTCTAAA
GATGGAAACGACTGTCATGCTGTTGGTACAGAAGTCTTAAATTCTGAACCTGAAATGAACATATTGAAAGATGGCCCAGATAACTCTTCTAATTCTAATACTAATTCATT
AAACCCTAAAACTGATATTCAGGATGAAACACCTGACACTAGTGAAAATTATGATTTCAGCTCTGAGCATAAAATGAACCTAGTACCTGAGAAGTTACCACTCTATGATG
ATAGTACATCAAACCATAATTCTGGCAACCAGTATGAACCACCTGGGCCTCTTAATGAAATTTCAGATTCTTCGTTGCATGAACTTTCTAGCGTGTCTGGCGACACAGCC
AAGGAATCAGGATTTGTTGACAGAGAGACTGTGACTGAATCATCTAAACAAGTTCTGAACCCCACCAAAACTGAAAGGCTCTTATCTGAGGCAACTACATCAACCTTAGA
ACAGCAAATAGAAAGAGGATTATCTGAAGCAGCATTTGTCTCTGTCACAGCTTATCCATTGGTTGATGTTCAAGAGAAAGATCATGAAACTATCATGAATAGTACTGCTG
CCAAACCGGAACTACAAGGGATTTTATTTTCTTCTGCAGGTGTTCCTGCTCCTTTGGCTTCTGCAGCTATAAAAACACTTCCTGGAAAGGTCCTAGTTCCTGCAGTTGTG
GATCAGGTTCAGGGGCAGGCATTGTCAGCACTGCAAGTTTTAAAGGTGATAGAGGCTGAGGTTGAACCTAGTGATCTATGTACACGTCGGGAATATGCTCGTTGGCTGGT
GTCTGCAAGCAGTGCTCTTTCAAGGAACACAACATCTAAAGTATATCCAGCAATGTATATAGAGAATGTTACTGAGTTGGCATTTGATGATATTACTCCCGAAGATCCTG
ATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCAAGCAAACTTTCGAGACATGACATTCTTTCTTCCTTTGATGAAGACCAGGGTCCTTTTTATTTCTCT
CCCGAAAGCCCCCTATCACGTCAAGATCTTGTGAGTTGGAAGATGGCCCTCGAGAAAAGACAGTTGCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGATTTAT
AGATACTGATAAGATCCATCCCGATGCTTGTCCTGCCCTTGTTGCTGATCTCTCTGTAGGAGAGCATGGAATTATAGCTCTAGCATTTGGATATACAAGGCTTTTCCAGC
CGGATAAGCCTGTAACAAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCTGATATAGTAAGTGAGGAGCTTGCAAGGATTGAAGCAGAATCAATGGCAGAA
AATGCAGTTGCTGCACATGGTGCTCTAGTAGCTCAAGTTGAGAAAGATATTAATGCCAGCTTTGAGAAACAACTCTCAATTGAAAGGGAAAAGGTCGATGCCGTGGAGAA
AATGGCAGAAGAGGCAAAGCAAGAGCTGGAGAGATTAAGATCAGAAAGAGAGAGAGAAAATCTCGCCTTGATGAAGGAACATGTTGCCATTGAATCGGAAATGGAAGTTT
TTTCAAGATTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAGGTATCTTATGAAAAGGAAAGAATCAATAAACTCAGGAAAGAAGCTGAA
ATCGAAAACCAGGAGATTGCCCGCCTGCAATATGAGCTTGAGGTCGAGAGAAAGGCGTTGTCCATGGCCAGAGCTTGGGCAGAGGACGAAGCGAAAAGAGCAAGAGAACA
AGCCAAAGCACTTGAAGAGGCTAGAGATCGCTGGGAAAGGCGTGGAATCAAAGTTGTCGTTGACAGCGATCTCCGTGAACAGGAATCCGCTGGCGATACTTGGCTCGATT
CTAGCAAACAGTTTTCAGTCAAAGAAACTGTCGACAGGGCCGAGAACTTAATGGACAAGCTGAAAGTAATGGCTGCAGAACTAAGAGGGAAATCCAAAGAGATAGTCGAC
AAGATCATCGAGAAGATAGCTTTACTAATATCAAACTTGAGGCAATGGGTTTCCGACGCTGGAAAACAGGCTGAAGATCTAAAGGAAGTGGCCATTTCAAGGGCAAGTAG
ATCAACAAGTGAGCTGCAACAGAGCACTGCGGAGTTGAGGTTGGCGCTGAAAGAGGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTACCCAAA
AGTTCAAAACATCGTATGGTGATTGAGCAGCTGAGTTGAGTTTTCTATGTATTGTAATTCTGAAGAGCACAATAAAGTTGGCGTCTAGTCTATTTTGGAAAATTTTGTAA
GGCACTCTCCTATAATGGAAACTTCAGTCCAAATCTACCATAAATCCCAACACATTTAAATTCATGTGTAGCTATTTAATCAAATTGGATAAGCTGTCGATAATGAACTT
AACTCAGTAATAATTGAC
Protein sequenceShow/hide protein sequence
MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYDQYIARKQVSE
FDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGV
TILVNIDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVV
LASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDAS
SNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMD
FEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKA
ALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPI
VYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGEN
EKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAV
GTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDR
ETVTESSKQVLNPTKTERLLSEATTSTLEQQIERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQAL
SALQVLKVIEAEVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGA
LVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALMKEHVAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIAR
LQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVKETVDRAENLMDKLKVMAAELRGKSKEIVDKIIEKIAL
LISNLRQWVSDAGKQAEDLKEVAISRASRSTSELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTSYGD