; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS000413 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS000413
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionCleavage and polyadenylation specificity factor subunit 2
Genome locationscaffold44:1525026..1539779
RNA-Seq ExpressionMS000413
SyntenyMS000413
Gene Ontology termsGO:0006378 - mRNA polyadenylation (biological process)
GO:0006379 - mRNA cleavage (biological process)
GO:0006508 - proteolysis (biological process)
GO:0005847 - mRNA cleavage and polyadenylation specificity factor complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001279 - Metallo-beta-lactamase
IPR011108 - Zn-dependent metallo-hydrolase, RNA specificity domain
IPR022712 - Beta-Casp domain
IPR025069 - Cleavage and polyadenylation specificity factor 2, C-terminal
IPR035639 - CPSF2, metallo-hydrolase domain
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW41981.1 Cleavage and polyadenylation specificity factor subunit 2 [Vitis vinifera]0.0e+0058.81Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTM--
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGL+APV+STEPVYRLGLLT+  
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTM--

Query:  ----YDQYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRP
                +  +QVS+FDLFTLDDIDSAFQ VTRLTYSQN+HL GKGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVDFNHRKER LNGT+LESFVRP
Subjt:  ----YDQYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRP

Query:  AVLITDAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEES
        AVLITDAYNALNNQP RRQ+D+EF                                        I KTLR +GNVLLPVDTAGRVLEL+ ILE YW +  
Subjt:  AVLITDAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEES

Query:  LNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQ
        LNYPI+FLTYV+SSTIDY+KSFLEWMSD+IAKSFEHTR+NAFLLKHVTLLI+KSEL+  PDGPK        LEAG+SHDIFVEWA DAKNLVLFSE+GQ
Subjt:  LNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQ

Query:  FATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDG
        FATLARMLQADPPPKAVKVT+SKRVPL G+EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  AS DV   HV  +RDILIDG
Subjt:  FATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDG

Query:  FVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSN
        FVPPSTSVAPMFPFYEN+SEWDDFGEVINP+DYVIKDEDMDQA M                          + G D++GKLDE AA+LI D  PSKV+SN
Subjt:  FVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSN

Query:  ELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE
        ELTVQVKC L YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE
Subjt:  ELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE

Query:  IAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAI---------------------
        +AW+DAEVGKTE+G+LSLLPLS     H +V VGD+KMAD+KQFLASKGIQVEF+GGALRCGEYVT+RKV DA  +++I                     
Subjt:  IAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAI---------------------

Query:  -----------------GVGIF------------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCY
                          V IF                              MAS     SPSS QLR +  C+   + P+V VR  VRKLD +VR+   
Subjt:  -----------------GVGIF------------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCY

Query:  PIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE---
            NG       G+ R G  +  S+S GD  SGWS SD  E+    ++K W GG+VG G+ G +LV+G++FAA+S+SKQN SR + QMEA++ Q E   
Subjt:  PIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE---

Query:  ----LLLDSDTGNDRL----------GENEKEDNSVNADDR-----------------TLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSGNDVES
              L+S TG D +          GE+  +D  V  D                   TL+    +   +   +E+  L++N  GD + +   S +D ++
Subjt:  ----LLLDSDTGNDRL----------GENEKEDNSVNADDR-----------------TLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSGNDVES

Query:  SSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGS--------KDGNDCHAVGTEVLNSEPEM----NILKDGPDNSSNSNTNSLN--
        ++ N+D        +  ++ + S  +S  + S    ++ D  V S           + +   A GT+   SE E      + KD P N S    + LN  
Subjt:  SSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGS--------KDGNDCHAVGTEVLNSEPEM----NILKDGPDNSSNSNTNSLN--

Query:  ----PKTDIQDETPD------TSENY----DFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLN-------------EISDSSLHELSSVSGD---
            P +D +   PD      T+++Y    D ++    + V +K  L D           YE    LN             +I + S  +L  ++     
Subjt:  ----PKTDIQDETPD------TSENY----DFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLN-------------EISDSSLHELSSVSGD---

Query:  -TAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLE-QQLERGLSEAAFV--------------SVTAYPL-----VDVQEKDHETIMNSTAAKPE
         +A +  ++D   + E  K + N +  +     A    L+ ++LE+ +   ++V              S   YP      V++Q K      N +  +  
Subjt:  -TAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLE-QQLERGLSEAAFV--------------SVTAYPL-----VDVQEKDHETIMNSTAAKPE

Query:  LQGILFSSAGVPAPLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFD
        +    FSSAG+PAP A S ++K LPG+V+VPAVVDQVQGQAL+ALQVLKVIE +V+P DLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFD
Subjt:  LQGILFSSAGVPAPLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFD

Query:  DITPEDPDFASIQGLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLS
        DITPEDPDF+SIQGLAEAGLISSKLSR D+LS S +EDQ PFYFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PALVAD S
Subjt:  DITPEDPDFASIQGLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLS

Query:  VGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQE
         GE GIIALAFGYTRLFQP+KPVTKAQAAIALATGE+SDIVSEELARIEAE+MAE AVA H ALV QVEK++NASFEK+LS+ER+K+DA+EK+AEEA+QE
Subjt:  VGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQE

Query:  LERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAR
        LE+LR+ER+ +N++L KERAAIESEMEV SRLR+E+EEQLQ  MSNKVE+SYEKERI+KLRKEAE ENQEIARLQYELEVERKALSMARAWAEDEAKRAR
Subjt:  LERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAR

Query:  EQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAE
        EQAKALEEARDRWE+ GIKVVVD++LRE+ SA  TWLD++KQFSV+ TV RAENL+DKL  M +++RGKSK+++D I++KI  LIS LR+  S  G Q  
Subjt:  EQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAE

Query:  DLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
        +LK+ A+ +A  S  ELQQ+TAE  LA+KEG KRVVGDCR GVEK+TQKFKT
Subjt:  DLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT

RVX01974.1 Cleavage and polyadenylation specificity factor subunit 2 [Vitis vinifera]0.0e+0058.11Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGL+APV+STEPVYRLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARK-------------------------------QVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDV
        QY++RK                               QVS+FDLFTLDDIDSAFQ VTRLTYSQN+HL GKGEGIVIAPHVAGHLLGGT+WKITKDGEDV
Subjt:  QYIARK-------------------------------QVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDV

Query:  IYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNV
        IYAVDFNHRKER LNGT+LESFVRPAVLITDAYNALNNQP RRQ+D+EF                                        I KTLR +GNV
Subjt:  IYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNV

Query:  LLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEA
        LLPVDTAGRVLEL+ ILE YW +  LNYPI+FLTYV+SSTIDY+KSFLEWMSD+IAKSFEHTR+NAFLLKHVTLLI+KSEL+  PDGPK+VLASMASLEA
Subjt:  LLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEA

Query:  GYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIID
        G+SHDIFVEWA DAKNLVLFSE+GQFATLARMLQADPPPKAVKVT+SKRVPL G+EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID
Subjt:  GYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIID

Query:  ASS-NASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGG
         ++  AS DV   HV  +RDILIDGFVPPSTSVAPMFPFYEN+SEWDDFGEVINP+DYVIKDEDMDQA M                          + G 
Subjt:  ASS-NASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGG

Query:  DVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSD
        D++GKLDE AA+LI D  PSKV+SNELTVQVKC L YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSD
Subjt:  DVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSD

Query:  LCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA--
        LCAYKVQLSEKLMSNVLFKKLGDYE+AW+DAEVGKTE+G+LSLLPLS     H +V VGD+KMAD   F  +K  +     G   CGEYVT+RKV DA  
Subjt:  LCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA--

Query:  ----------------IVQSAIGVGIF------------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRV
                        ++     V IF                              MAS     SPSS QLR +  C+   + P+V VR  VRKLD +V
Subjt:  ----------------IVQSAIGVGIF------------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRV

Query:  RMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQ
        R+       NG       G+ R G  +  S+S GD  SGWS SD  E+    ++K W GG+VG G+ G +LV+G++FAA+S+SKQN SR + QMEA++ Q
Subjt:  RMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQ

Query:  QE-------LLLDSDTGNDRL----------GENEKEDNSVNADDR-----------------TLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSG
         E         L+S TG D +          GE+  +D  V  D                   TL+    +   +   +E+  L++N  GD + +   S 
Subjt:  QE-------LLLDSDTGNDRL----------GENEKEDNSVNADDR-----------------TLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSG

Query:  NDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQ---
        +D ++++ N+D             D    + S   G+ +S    +SS +S  +  ++ H V     +S+  +N  K  P+ S      + +P ++ +   
Subjt:  NDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQ---

Query:  -DETPDTSENYDFSSEHKMNLVPEKLPLYD------DSTSNHNSGNQYEPPGPLN------EISDS---SLHELSSVSGDTAKESGFV--DRETVTESSK
         DE      N  +   H +N    + P+ D      D      +   Y     LN       +SD     L EL  +   + ++   +  DRE +     
Subjt:  -DETPDTSENYDFSSEHKMNLVPEKLPLYD------DSTSNHNSGNQYEPPGPLN------EISDS---SLHELSSVSGDTAKESGFV--DRETVTESSK

Query:  QVLNPTKTERLLSEATTSTLE-----------QQLERGLSEAAFVSV------TAYPLVDVQEKD---------HET----IMNSTAAKP---------E
        +   P K+   L +   S ++            +L++ +   ++V V        Y  ++  EKD         H+       +ST+A P          
Subjt:  QVLNPTKTERLLSEATTSTLE-----------QQLERGLSEAAFVSV------TAYPLVDVQEKD---------HET----IMNSTAAKP---------E

Query:  LQGIL---------------FSSAGVPAPLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKV
        LQ  +               FSSAG+PAP A S ++K LPG+V+VPAVVDQVQGQAL+ALQVLKVIE +V+P DLCTRRE+ARWLVSASS LSRNT SKV
Subjt:  LQGIL---------------FSSAGVPAPLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKV

Query:  YPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTD
        YPAMYI N+TELAFDDITPEDPDF+SIQGLAEAGLISSKLSR D+LS S +EDQ PFYFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D
Subjt:  YPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTD

Query:  KIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIERE
         I+PDA PAL+AD S GE GIIALAFGYTRLFQP+KPVTKAQAAIALATGE+SDIVSEELARIEAE+MAE AVA H ALV QVEK++NASFEK+LS+ER+
Subjt:  KIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIERE

Query:  KVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKAL
        K+DA+EK+AEEA+QELE+LR+ER+ +N++L KERAAIESEMEV SRLR+E+EEQLQ  MSNKVE+SYEKERI+KLRKEAE ENQEIARLQYELEVERKAL
Subjt:  KVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKAL

Query:  SMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLI
        SMARAWAEDEAKRAREQAKALEEARDRWE+ GIKVVVD++LRE+ SA  TWLD++KQFSV+ TV RAENL+DKL  M +++RGKSK+++D I++KI  LI
Subjt:  SMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLI

Query:  SNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
        S LR+  S  G Q  +LK+ A+ +A  S  ELQQ+TAE  LA+KEG KRVVGDCR GVEK+TQKFKT
Subjt:  SNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT

RZC57052.1 hypothetical protein C5167_004361 [Papaver somniferum]0.0e+0058.39Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPL GVYNENPLSYLVS+DGFN LIDCGWND FD +LL+PLS VAST+DAVL+SH DTLHLGALPYA + LGL A V+ST+PV+RLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
         Y ++KQVS+FDLF+LDDID AFQ +  L YSQNHHL+GKGEGIVIAPHVAGHLLGGT+WKITKDGED+IYAVDFNHRKE HLNGT+LE+FVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY
        AYNALN+Q  R + +K F G                                      I+ TL   GN+LLPVDTAGRVLEL+ ++E YW +  L YPIY
Subjt:  AYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY

Query:  FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLAR
        FLTYVSSSTI+Y KSFLEWMSDTIAKSFE TR+NAFLLKHVTL+INKSELD  P+GPK+VLASMASLE G SHDIF EWA D KNLVLF+E+GQF TLAR
Subjt:  FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLAR

Query:  MLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDAS-SNASPDVGGSHVSAYRDILIDGFVPPST
        MLQADPPP+AVKVT+  RVPL G+EL+++EEEQNR KKEEALKAS+ KEE  KAS+G+D    DPM+IDAS + AS +V   H  A+RDI IDGF+P ST
Subjt:  MLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDAS-SNASPDVGGSHVSAYRDILIDGFVPPST

Query:  SVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA
        SVAPMFPFYEN+SEWDDFGE+INPDDYV+K+EDMD  +MH GGD+DGKLDE AANLILD +PSK+VSNELTVQVKCSL YMDFEGRSDGRSIKSILSHVA
Subjt:  SVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA

Query:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL
        PLKLVLVHG+AEATEHLKQHCLKNVCPHVYAPQ+ ETIDVTSDLCAYKVQLSEKLMS VLFKKLGD+E+AW+D+ V +TE   LSL P+S A  PHKSV 
Subjt:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL

Query:  VGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCY
        VGDLK+AD+KQFLASKG+QVEF+GGALRCGEYVT+RKV DA  +   G                                                    
Subjt:  VGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCY

Query:  PIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNS--SRQKPQMEALSTQQEL
         ++  G   E +  ++  G  +  +  +G  F G   S++G+     R    +GG V  G+ G +L +G+ FA+  ++K+N+   +Q+ QME ++TQ EL
Subjt:  PIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNS--SRQKPQMEALSTQQEL

Query:  LLDSDTGNDRLGENEKEDNSVNADD-----RTLAGKTGNHEESSSYTE-NEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDS---PS
        LL SD   D++ +   +++ V  D+       L  KTG ++ SS+  E  +D+ ++   D    E      VE +S++ D N+ +S QED +S S     
Subjt:  LLDSDTGNDRLGENEKEDNSVNADD-----RTLAGKTGNHEESSSYTE-NEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDS---PS

Query:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSN---SNTNSLNPKTDIQDETPDTSE------NYDFSSEHKMNLVP
        +V +  + S  SL E   S    SK   D    G+  L +EP     K+ P N S+     +NS N +TD Q+ T D+ +        DFSS        
Subjt:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSN---SNTNSLNPKTDIQDETPDTSE------NYDFSSEHKMNLVP

Query:  EKLPL--------YDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQLERGLSEAA
        + LPL          +  S+  S N  E   P +   D +L E     G+   E   +++  + E+   + NPT                          
Subjt:  EKLPL--------YDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQLERGLSEAA

Query:  FVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAP-LASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLV
          S+ A P     +  +E   + + ++  L    FSSAG+PAP L SAA++  PGKVLVPAVVDQVQGQAL+ALQVLKVIE +V+  DLCTRR+YARWLV
Subjt:  FVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAP-LASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLV

Query:  SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADR
        S+S+ LSRN+ SKVYPAMYIENVT+LAFDDI PEDPDFASIQGLAEAGLI+SKLSR D+L S D +Q PF FSPESPLSRQDLVSWKMAL+KRQLPE +R
Subjt:  SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADR

Query:  KMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDI
        KM++Q+ GFID DKI+PDA PALVAD+S G+ GIIALAFGYTRLFQPDKPVT AQAAIALATG+A+DIVSEELARIEAESMAE AVAAH ALVA+VEKD+
Subjt:  KMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDI

Query:  NASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIA
        NA+F K+L++E+EKVDA+EK AEEAK ELE+LR+ERE ++  L + RAA+ESEMEV +RLR ELEEQLQ L+S++ E+++E+ERINKLR+EAE ENQ I 
Subjt:  NASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIA

Query:  RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKE
        +LQY+LEVERKALSMAR+WAE+EAKRARE  KALEEARDRW++ GI+VVVDSDL++ ESAG TW ++ K+ +++ETV+RAE L+DKLK MA  ++GKSK 
Subjt:  RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKE

Query:  IVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
        ++++II KI  +I  L++  S A K+ ++ + +A S+AS S  ELQ S      A+K+GA R+ G+C+EGVEKITQKFK++
Subjt:  IVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS

XP_022140920.1 uncharacterized protein LOC111011467 isoform X1 [Momordica charantia]0.0e+0099.2Show/hide
Query:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW
        MASTPATCSP SLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW
Subjt:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW

Query:  FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD
        FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD
Subjt:  FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD

Query:  GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN
        GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN
Subjt:  GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN

Query:  PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL
        PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL
Subjt:  PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL

Query:  SEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE
        SEATTSTLEQQ+ERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE
Subjt:  SEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE

Query:  AEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
        AEVEP DLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
Subjt:  AEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ

Query:  DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM
        DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM
Subjt:  DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM

Query:  AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE
        AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLAL KE AAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE
Subjt:  AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE

Query:  KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAE
        KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSV+ETVDRAE
Subjt:  KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAE

Query:  NLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
        NLMDKLKVMAAE+RGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRS SELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
Subjt:  NLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS

Query:  YG
        YG
Subjt:  YG

XP_022140922.1 uncharacterized protein LOC111011467 isoform X3 [Momordica charantia]0.0e+0098.89Show/hide
Query:  WFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG
        +  GLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG
Subjt:  WFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG

Query:  DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSL
        DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSL
Subjt:  DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSL

Query:  NPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERL
        NPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERL
Subjt:  NPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERL

Query:  LSEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVI
        LSEATTSTLEQQ+ERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVI
Subjt:  LSEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVI

Query:  EAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSR
        EAEVEP DLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSR
Subjt:  EAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSR

Query:  QDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAES
        QDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAES
Subjt:  QDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAES

Query:  MAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSY
        MAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLAL KE AAIESEMEVFSRLRNELEEQLQGLMSNKVEVSY
Subjt:  MAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSY

Query:  EKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRA
        EKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSV+ETVDRA
Subjt:  EKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRA

Query:  ENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
        ENLMDKLKVMAAE+RGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRS SELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
Subjt:  ENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT

Query:  SYG
        SYG
Subjt:  SYG

TrEMBL top hitse value%identityAlignment
A0A438E2N4 Cleavage and polyadenylation specificity factor subunit 20.0e+0058.81Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTM--
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGL+APV+STEPVYRLGLLT+  
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTM--

Query:  ----YDQYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRP
                +  +QVS+FDLFTLDDIDSAFQ VTRLTYSQN+HL GKGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVDFNHRKER LNGT+LESFVRP
Subjt:  ----YDQYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRP

Query:  AVLITDAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEES
        AVLITDAYNALNNQP RRQ+D+EF                                        I KTLR +GNVLLPVDTAGRVLEL+ ILE YW +  
Subjt:  AVLITDAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEES

Query:  LNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQ
        LNYPI+FLTYV+SSTIDY+KSFLEWMSD+IAKSFEHTR+NAFLLKHVTLLI+KSEL+  PDGPK        LEAG+SHDIFVEWA DAKNLVLFSE+GQ
Subjt:  LNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQ

Query:  FATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDG
        FATLARMLQADPPPKAVKVT+SKRVPL G+EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  AS DV   HV  +RDILIDG
Subjt:  FATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSN-ASPDVGGSHVSAYRDILIDG

Query:  FVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSN
        FVPPSTSVAPMFPFYEN+SEWDDFGEVINP+DYVIKDEDMDQA M                          + G D++GKLDE AA+LI D  PSKV+SN
Subjt:  FVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSN

Query:  ELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE
        ELTVQVKC L YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE
Subjt:  ELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE

Query:  IAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAI---------------------
        +AW+DAEVGKTE+G+LSLLPLS     H +V VGD+KMAD+KQFLASKGIQVEF+GGALRCGEYVT+RKV DA  +++I                     
Subjt:  IAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAI---------------------

Query:  -----------------GVGIF------------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCY
                          V IF                              MAS     SPSS QLR +  C+   + P+V VR  VRKLD +VR+   
Subjt:  -----------------GVGIF------------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCY

Query:  PIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE---
            NG       G+ R G  +  S+S GD  SGWS SD  E+    ++K W GG+VG G+ G +LV+G++FAA+S+SKQN SR + QMEA++ Q E   
Subjt:  PIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQE---

Query:  ----LLLDSDTGNDRL----------GENEKEDNSVNADDR-----------------TLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSGNDVES
              L+S TG D +          GE+  +D  V  D                   TL+    +   +   +E+  L++N  GD + +   S +D ++
Subjt:  ----LLLDSDTGNDRL----------GENEKEDNSVNADDR-----------------TLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSGNDVES

Query:  SSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGS--------KDGNDCHAVGTEVLNSEPEM----NILKDGPDNSSNSNTNSLN--
        ++ N+D        +  ++ + S  +S  + S    ++ D  V S           + +   A GT+   SE E      + KD P N S    + LN  
Subjt:  SSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGS--------KDGNDCHAVGTEVLNSEPEM----NILKDGPDNSSNSNTNSLN--

Query:  ----PKTDIQDETPD------TSENY----DFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLN-------------EISDSSLHELSSVSGD---
            P +D +   PD      T+++Y    D ++    + V +K  L D           YE    LN             +I + S  +L  ++     
Subjt:  ----PKTDIQDETPD------TSENY----DFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLN-------------EISDSSLHELSSVSGD---

Query:  -TAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLE-QQLERGLSEAAFV--------------SVTAYPL-----VDVQEKDHETIMNSTAAKPE
         +A +  ++D   + E  K + N +  +     A    L+ ++LE+ +   ++V              S   YP      V++Q K      N +  +  
Subjt:  -TAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLE-QQLERGLSEAAFV--------------SVTAYPL-----VDVQEKDHETIMNSTAAKPE

Query:  LQGILFSSAGVPAPLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFD
        +    FSSAG+PAP A S ++K LPG+V+VPAVVDQVQGQAL+ALQVLKVIE +V+P DLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFD
Subjt:  LQGILFSSAGVPAPLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFD

Query:  DITPEDPDFASIQGLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLS
        DITPEDPDF+SIQGLAEAGLISSKLSR D+LS S +EDQ PFYFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PALVAD S
Subjt:  DITPEDPDFASIQGLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLS

Query:  VGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQE
         GE GIIALAFGYTRLFQP+KPVTKAQAAIALATGE+SDIVSEELARIEAE+MAE AVA H ALV QVEK++NASFEK+LS+ER+K+DA+EK+AEEA+QE
Subjt:  VGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQE

Query:  LERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAR
        LE+LR+ER+ +N++L KERAAIESEMEV SRLR+E+EEQLQ  MSNKVE+SYEKERI+KLRKEAE ENQEIARLQYELEVERKALSMARAWAEDEAKRAR
Subjt:  LERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAR

Query:  EQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAE
        EQAKALEEARDRWE+ GIKVVVD++LRE+ SA  TWLD++KQFSV+ TV RAENL+DKL  M +++RGKSK+++D I++KI  LIS LR+  S  G Q  
Subjt:  EQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAE

Query:  DLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
        +LK+ A+ +A  S  ELQQ+TAE  LA+KEG KRVVGDCR GVEK+TQKFKT
Subjt:  DLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT

A0A438IZ50 Cleavage and polyadenylation specificity factor subunit 20.0e+0058.11Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGL+APV+STEPVYRLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARK-------------------------------QVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDV
        QY++RK                               QVS+FDLFTLDDIDSAFQ VTRLTYSQN+HL GKGEGIVIAPHVAGHLLGGT+WKITKDGEDV
Subjt:  QYIARK-------------------------------QVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDV

Query:  IYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNV
        IYAVDFNHRKER LNGT+LESFVRPAVLITDAYNALNNQP RRQ+D+EF                                        I KTLR +GNV
Subjt:  IYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNV

Query:  LLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEA
        LLPVDTAGRVLEL+ ILE YW +  LNYPI+FLTYV+SSTIDY+KSFLEWMSD+IAKSFEHTR+NAFLLKHVTLLI+KSEL+  PDGPK+VLASMASLEA
Subjt:  LLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEA

Query:  GYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIID
        G+SHDIFVEWA DAKNLVLFSE+GQFATLARMLQADPPPKAVKVT+SKRVPL G+EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID
Subjt:  GYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIID

Query:  ASS-NASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGG
         ++  AS DV   HV  +RDILIDGFVPPSTSVAPMFPFYEN+SEWDDFGEVINP+DYVIKDEDMDQA M                          + G 
Subjt:  ASS-NASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGG

Query:  DVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSD
        D++GKLDE AA+LI D  PSKV+SNELTVQVKC L YMDFEGRSDGRSIKSILSHVAPLKLVLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSD
Subjt:  DVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSD

Query:  LCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA--
        LCAYKVQLSEKLMSNVLFKKLGDYE+AW+DAEVGKTE+G+LSLLPLS     H +V VGD+KMAD   F  +K  +     G   CGEYVT+RKV DA  
Subjt:  LCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA--

Query:  ----------------IVQSAIGVGIF------------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRV
                        ++     V IF                              MAS     SPSS QLR +  C+   + P+V VR  VRKLD +V
Subjt:  ----------------IVQSAIGVGIF------------------------------MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRV

Query:  RMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQ
        R+       NG       G+ R G  +  S+S GD  SGWS SD  E+    ++K W GG+VG G+ G +LV+G++FAA+S+SKQN SR + QMEA++ Q
Subjt:  RMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQ

Query:  QE-------LLLDSDTGNDRL----------GENEKEDNSVNADDR-----------------TLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSG
         E         L+S TG D +          GE+  +D  V  D                   TL+    +   +   +E+  L++N  GD + +   S 
Subjt:  QE-------LLLDSDTGNDRL----------GENEKEDNSVNADDR-----------------TLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSG

Query:  NDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQ---
        +D ++++ N+D             D    + S   G+ +S    +SS +S  +  ++ H V     +S+  +N  K  P+ S      + +P ++ +   
Subjt:  NDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQ---

Query:  -DETPDTSENYDFSSEHKMNLVPEKLPLYD------DSTSNHNSGNQYEPPGPLN------EISDS---SLHELSSVSGDTAKESGFV--DRETVTESSK
         DE      N  +   H +N    + P+ D      D      +   Y     LN       +SD     L EL  +   + ++   +  DRE +     
Subjt:  -DETPDTSENYDFSSEHKMNLVPEKLPLYD------DSTSNHNSGNQYEPPGPLN------EISDS---SLHELSSVSGDTAKESGFV--DRETVTESSK

Query:  QVLNPTKTERLLSEATTSTLE-----------QQLERGLSEAAFVSV------TAYPLVDVQEKD---------HET----IMNSTAAKP---------E
        +   P K+   L +   S ++            +L++ +   ++V V        Y  ++  EKD         H+       +ST+A P          
Subjt:  QVLNPTKTERLLSEATTSTLE-----------QQLERGLSEAAFVSV------TAYPLVDVQEKD---------HET----IMNSTAAKP---------E

Query:  LQGIL---------------FSSAGVPAPLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKV
        LQ  +               FSSAG+PAP A S ++K LPG+V+VPAVVDQVQGQAL+ALQVLKVIE +V+P DLCTRRE+ARWLVSASS LSRNT SKV
Subjt:  LQGIL---------------FSSAGVPAPLA-SAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKV

Query:  YPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTD
        YPAMYI N+TELAFDDITPEDPDF+SIQGLAEAGLISSKLSR D+LS S +EDQ PFYFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D
Subjt:  YPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILS-SFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTD

Query:  KIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIERE
         I+PDA PAL+AD S GE GIIALAFGYTRLFQP+KPVTKAQAAIALATGE+SDIVSEELARIEAE+MAE AVA H ALV QVEK++NASFEK+LS+ER+
Subjt:  KIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIERE

Query:  KVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKAL
        K+DA+EK+AEEA+QELE+LR+ER+ +N++L KERAAIESEMEV SRLR+E+EEQLQ  MSNKVE+SYEKERI+KLRKEAE ENQEIARLQYELEVERKAL
Subjt:  KVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKAL

Query:  SMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLI
        SMARAWAEDEAKRAREQAKALEEARDRWE+ GIKVVVD++LRE+ SA  TWLD++KQFSV+ TV RAENL+DKL  M +++RGKSK+++D I++KI  LI
Subjt:  SMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLI

Query:  SNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
        S LR+  S  G Q  +LK+ A+ +A  S  ELQQ+TAE  LA+KEG KRVVGDCR GVEK+TQKFKT
Subjt:  SNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT

A0A4Y7JAF6 Beta-Casp domain-containing protein0.0e+0058.39Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPL GVYNENPLSYLVS+DGFN LIDCGWND FD +LL+PLS VAST+DAVL+SH DTLHLGALPYA + LGL A V+ST+PV+RLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
         Y ++KQVS+FDLF+LDDID AFQ +  L YSQNHHL+GKGEGIVIAPHVAGHLLGGT+WKITKDGED+IYAVDFNHRKE HLNGT+LE+FVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY
        AYNALN+Q  R + +K F G                                      I+ TL   GN+LLPVDTAGRVLEL+ ++E YW +  L YPIY
Subjt:  AYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY

Query:  FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLAR
        FLTYVSSSTI+Y KSFLEWMSDTIAKSFE TR+NAFLLKHVTL+INKSELD  P+GPK+VLASMASLE G SHDIF EWA D KNLVLF+E+GQF TLAR
Subjt:  FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLAR

Query:  MLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDAS-SNASPDVGGSHVSAYRDILIDGFVPPST
        MLQADPPP+AVKVT+  RVPL G+EL+++EEEQNR KKEEALKAS+ KEE  KAS+G+D    DPM+IDAS + AS +V   H  A+RDI IDGF+P ST
Subjt:  MLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDAS-SNASPDVGGSHVSAYRDILIDGFVPPST

Query:  SVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA
        SVAPMFPFYEN+SEWDDFGE+INPDDYV+K+EDMD  +MH GGD+DGKLDE AANLILD +PSK+VSNELTVQVKCSL YMDFEGRSDGRSIKSILSHVA
Subjt:  SVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA

Query:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL
        PLKLVLVHG+AEATEHLKQHCLKNVCPHVYAPQ+ ETIDVTSDLCAYKVQLSEKLMS VLFKKLGD+E+AW+D+ V +TE   LSL P+S A  PHKSV 
Subjt:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL

Query:  VGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCY
        VGDLK+AD+KQFLASKG+QVEF+GGALRCGEYVT+RKV DA  +   G                                                    
Subjt:  VGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCY

Query:  PIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNS--SRQKPQMEALSTQQEL
         ++  G   E +  ++  G  +  +  +G  F G   S++G+     R    +GG V  G+ G +L +G+ FA+  ++K+N+   +Q+ QME ++TQ EL
Subjt:  PIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNS--SRQKPQMEALSTQQEL

Query:  LLDSDTGNDRLGENEKEDNSVNADD-----RTLAGKTGNHEESSSYTE-NEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDS---PS
        LL SD   D++ +   +++ V  D+       L  KTG ++ SS+  E  +D+ ++   D    E      VE +S++ D N+ +S QED +S S     
Subjt:  LLDSDTGNDRLGENEKEDNSVNADD-----RTLAGKTGNHEESSSYTE-NEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDS---PS

Query:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSN---SNTNSLNPKTDIQDETPDTSE------NYDFSSEHKMNLVP
        +V +  + S  SL E   S    SK   D    G+  L +EP     K+ P N S+     +NS N +TD Q+ T D+ +        DFSS        
Subjt:  AVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSN---SNTNSLNPKTDIQDETPDTSE------NYDFSSEHKMNLVP

Query:  EKLPL--------YDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQLERGLSEAA
        + LPL          +  S+  S N  E   P +   D +L E     G+   E   +++  + E+   + NPT                          
Subjt:  EKLPL--------YDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQLERGLSEAA

Query:  FVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAP-LASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLV
          S+ A P     +  +E   + + ++  L    FSSAG+PAP L SAA++  PGKVLVPAVVDQVQGQAL+ALQVLKVIE +V+  DLCTRR+YARWLV
Subjt:  FVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAP-LASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLV

Query:  SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADR
        S+S+ LSRN+ SKVYPAMYIENVT+LAFDDI PEDPDFASIQGLAEAGLI+SKLSR D+L S D +Q PF FSPESPLSRQDLVSWKMAL+KRQLPE +R
Subjt:  SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADR

Query:  KMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDI
        KM++Q+ GFID DKI+PDA PALVAD+S G+ GIIALAFGYTRLFQPDKPVT AQAAIALATG+A+DIVSEELARIEAESMAE AVAAH ALVA+VEKD+
Subjt:  KMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDI

Query:  NASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIA
        NA+F K+L++E+EKVDA+EK AEEAK ELE+LR+ERE ++  L + RAA+ESEMEV +RLR ELEEQLQ L+S++ E+++E+ERINKLR+EAE ENQ I 
Subjt:  NASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEIENQEIA

Query:  RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKE
        +LQY+LEVERKALSMAR+WAE+EAKRARE  KALEEARDRW++ GI+VVVDSDL++ ESAG TW ++ K+ +++ETV+RAE L+DKLK MA  ++GKSK 
Subjt:  RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKE

Query:  IVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
        ++++II KI  +I  L++  S A K+ ++ + +A S+AS S  ELQ S      A+K+GA R+ G+C+EGVEKITQKFK++
Subjt:  IVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS

A0A6J1CGI7 uncharacterized protein LOC111011467 isoform X10.0e+0099.2Show/hide
Query:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW
        MASTPATCSP SLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW
Subjt:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTW

Query:  FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD
        FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD
Subjt:  FGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGD

Query:  GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN
        GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN
Subjt:  GVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLN

Query:  PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL
        PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL
Subjt:  PKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLL

Query:  SEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE
        SEATTSTLEQQ+ERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE
Subjt:  SEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIE

Query:  AEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
        AEVEP DLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ
Subjt:  AEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQ

Query:  DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM
        DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM
Subjt:  DLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESM

Query:  AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE
        AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLAL KE AAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE
Subjt:  AENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYE

Query:  KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAE
        KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSV+ETVDRAE
Subjt:  KERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAE

Query:  NLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
        NLMDKLKVMAAE+RGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRS SELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS
Subjt:  NLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTS

Query:  YG
        YG
Subjt:  YG

A0A6J1CHG6 uncharacterized protein LOC111011467 isoform X20.0e+0098.89Show/hide
Query:  WFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG
        +  GLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG
Subjt:  WFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG

Query:  DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSL
        DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSL
Subjt:  DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSL

Query:  NPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERL
        NPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERL
Subjt:  NPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERL

Query:  LSEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVI
        LSEATTSTLEQQ+ERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVI
Subjt:  LSEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVI

Query:  EAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSR
        EAEVEP DLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSR
Subjt:  EAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSR

Query:  QDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAES
        QDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAES
Subjt:  QDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAES

Query:  MAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSY
        MAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLAL KE AAIESEMEVFSRLRNELEEQLQGLMSNKVEVSY
Subjt:  MAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSY

Query:  EKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRA
        EKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSV+ETVDRA
Subjt:  EKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRA

Query:  ENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
        ENLMDKLKVMAAE+RGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRS SELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT
Subjt:  ENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKT

Query:  SYG
        SYG
Subjt:  SYG

SwissProt top hitse value%identityAlignment
Q10568 Cleavage and polyadenylation specificity factor subunit 22.1e-13835.93Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        M + +++T L GV  E+ L YL+ VD F FL+DCGW++HF   ++  L +    IDAVL+SHPD LHLGALPYA+ +LGL   +++T PVY++G + MYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT
         Y +R    +F LFTLDD+D+AF  + +L +SQ  +L GKG G+ I P  AGH++GGT+WKI KDG E+++YAVDFNH++E HLNG  LE   RP++LIT
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT

Query:  DAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI
        D++NA   QP R+Q+D++                                         + +TLR +GNVL+ VDTAGRVLEL Q+L+  W  +     +
Subjt:  DAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI

Query:  Y---FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFA
        Y    L  VS + +++ KS +EWMSD + + FE  RNN F  +H++L    S+L   P  PKVVLAS   LE G+S D+F++W  D KN ++ + +    
Subjt:  Y---FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFA

Query:  TLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDG----
        TLAR L  +P  K  ++ + KRV L G EL  Y E++  KKE A K      EQSK +   D D+ D    ++ +    D   +H + + D+++ G    
Subjt:  TLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDG----

Query:  ---FVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKD---EDMDQAAMHAG-GDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRS
           F   +    PMFP  E   +WD++GE+I P+D+++ +    + +++ + +G  + D  +D+  +++     P+K +S   ++++K  + Y+D+EGRS
Subjt:  ---FVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKD---EDMDQAAMHAG-GDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRS

Query:  DGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCL----KNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW----LDAEVGKT
        DG SIK I++ + P +L++VHG  EA++ L + C     K++   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW    LD  V K 
Subjt:  DGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCL----KNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW----LDAEVGKT

Query:  ENGTL---------------------------------------------------SLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGAL
        + G +                                                   +L PL    +P H+SV + + +++D+KQ L  +GIQ EF GG L
Subjt:  ENGTL---------------------------------------------------SLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGAL

Query:  RCGEYVTIRK
         C   V +R+
Subjt:  RCGEYVTIRK

Q652P4 Cleavage and polyadenylation specificity factor subunit 22.4e-29969.65Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPL G Y E PL YL++VDGF FL+DCGW D  DP+ LQPL++VA TIDAVL+SH DT+HLGALPYAMK LGL+APV++TEPV+RLG+LT+YD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
         +I+R+QVS+FDLFTLDDID+AFQ V RL YSQNH L+ KGEGIVIAPHVAGH LGGT+WKITKDGEDV+YAVDFNHRKERHLNGT L SFVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY
        AYNALNN  Y+RQ+D++F                  D+++                    K L   G+VLLP+DTAGRVLE++ ILE YW +  L YPIY
Subjt:  AYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIY

Query:  FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLAR
        FLT VS+ST+DY+KSFLEWM+D+I+KSFEHTR+NAFLLK VT +INK EL+   D PKVVLASMASLE G+SHDIFV+ AN+AKNLVLF+EKGQF TLAR
Subjt:  FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLAR

Query:  MLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTS
        MLQ DPPPKAVKVT+SKR+PL GDEL AYEEEQ R KKEEALKASL KEE+ KAS G++    DPM+IDAS++  P   GS      DILIDGFVPPS+S
Subjt:  MLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTS

Query:  VAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGD-VDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA
        VAPMFPF+ENTSEWDDFGEVINP+DY++K E+MD   M   GD +D  LDE +A L+LD  PSKV+SNE+TVQVKCSL YMDFEGRSDGRS+KS+++HVA
Subjt:  VAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGD-VDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA

Query:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL
        PLKLVLVHG+AEATEHLK HC KN   HVYAPQIEETIDVTSDLCAYKVQLSEKLMSNV+ KKLG++EIAW+DAEVGKT++  L+LLP S     HKSVL
Subjt:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL

Query:  VGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAIG
        VGDLK+AD+KQFLA+KG+QVEFAGGALRCGEY+T+RK+ DA  + + G
Subjt:  VGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAIG

Q9LKF9 Cleavage and polyadenylation specificity factor subunit 20.0e+0074.97Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFLIDCGWND FD +LL+PLSRVASTIDAVL+SHPDTLH+GALPYAMKQLGL+APV++TEPV+RLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
        Q+++RKQVS+FDLFTLDDIDSAFQ V RLTYSQN+HLSGKGEGIVIAPHVAGH+LGG++W+ITKDGEDVIYAVD+NHRKERHLNGT+L+SFVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNAL-NNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI
        AY+AL  NQ  R+Q+DKEF                                       TI K L   GNVLLPVDTAGRVLEL+ ILE +W +   ++PI
Subjt:  AYNAL-NNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI

Query:  YFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA
        YFLTYVSSSTIDY+KSFLEWMSD+I+KSFE +R+NAFLL+HVTLLINK++LDNAP GPKVVLASMASLEAG++ +IFVEWAND +NLVLF+E GQF TLA
Subjt:  YFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA

Query:  RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPST
        RMLQ+ PPPK VKVT+SKRVPLAG+EL+AYEEEQNR K+EEAL+ASL+KEE++KASHG+D+++ +PMIID  +  + DV GSH  AY+DILIDGFVPPS+
Subjt:  RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPST

Query:  SVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA
        SVAPMFP+Y+NTSEWDDFGE+INPDDYVIKDEDMD+ AMH GGDVDG+LDE  A+L+LD +PSKV+SNEL V V CSL  MD+EGRSDGRSIKS+++HV+
Subjt:  SVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA

Query:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL
        PLKLVLVH  AEATEHLKQHCL N+CPHVYAPQIEET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AW+D+EVGKTE    SLLP+  AA PHK VL
Subjt:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL

Query:  VGDLKMADYKQFLASKGIQVEFA-GGALRCGEYVTIRKV
        VGDLK+AD+KQFL+SKG+QVEFA GGALRCGEYVT+RKV
Subjt:  VGDLKMADYKQFLASKGIQVEFA-GGALRCGEYVTIRKV

Q9P2I0 Cleavage and polyadenylation specificity factor subunit 22.7e-13836.05Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        M + +++T L GV  E+ L YL+ VD F FL+DCGW++HF   ++  L +    IDAVL+SHPD LHLGALPYA+ +LGL   +++T PVY++G + MYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT
         Y +R    +F LFTLDD+D+AF  + +L +SQ  +L GKG G+ I P  AGH++GGT+WKI KDG E+++YAVDFNH++E HLNG  LE   RP++LIT
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT

Query:  DAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI
        D++NA   QP R+Q+D++                                         + +TLR +GNVL+ VDTAGRVLEL Q+L+  W  +     +
Subjt:  DAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI

Query:  Y---FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFA
        Y    L  VS + +++ KS +EWMSD + + FE  RNN F  +H++L    S+L   P  PKVVLAS   LE G+S D+F++W  D KN ++ + +    
Subjt:  Y---FLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFA

Query:  TLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDG----
        TLAR L  +P  K  ++ + KRV L G EL  Y E++  KKE A K      EQSK +   D D+ D   I+       D   +H + + D+++ G    
Subjt:  TLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDG----

Query:  ---FVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKD---EDMDQAAMHAG-GDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRS
           F   +    PMFP  E   +WD++GE+I P+D+++ +    + +++ + +G  + D  +D+  +++     P+K +S   ++++K  + Y+D+EGRS
Subjt:  ---FVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKD---EDMDQAAMHAG-GDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRS

Query:  DGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCL----KNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW----LDAEVGKT
        DG SIK I++ + P +L++VHG  EA++ L + C     K++   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW    LD  V K 
Subjt:  DGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCL----KNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW----LDAEVGKT

Query:  ENGTL---------------------------------------------------SLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGAL
        + G +                                                   +L PL    +P H+SV + + +++D+KQ L  +GIQ EF GG L
Subjt:  ENGTL---------------------------------------------------SLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGAL

Query:  RCGEYVTIRK
         C   V +R+
Subjt:  RCGEYVTIRK

Q9V3D6 Probable cleavage and polyadenylation specificity factor subunit 24.2e-13936.54Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        M + +++  + G  +E+P  Y++ +D    L+DCGW++ FD   ++ L R   T+DAVL+SHPD  HLGALPY + +LGL  P+++T PV+++G + MYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT
         Y++   + +FDLF+LDD+D+AF+ +T+L Y+Q   L  KG GI I P  AGH++GGT+WKI K G ED++YA DFNH+KERHL+G  L+   RP++LIT
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDG-EDVIYAVDFNHRKERHLNGTILESFVRPAVLIT

Query:  DAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEES---LN
        DAYNA   Q  RR +D++                                         I +T+R NGNVL+ VDTAGRVLEL  +L+  W+ +    + 
Subjt:  DAYNALNNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEES---LN

Query:  YPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFA
        Y +  L  VS + I++ KS +EWMSD + K+FE  RNN F  KH+ L  + +++   P GPKVVLAS   LE+G++ D+FV+WA++A N ++ + +    
Subjt:  YPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFA

Query:  TLA-RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVP
        TLA  +++   P K +++ V +RV L G EL    EE  R + E L   ++K +  + S     D  +  +I    +      G H S        GF  
Subjt:  TLA-RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVP

Query:  PSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDE----------------DMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYM
         +     MFP++E   + D++GE+IN DDY I D                   ++  + A    +G + +    L+   KP+K++S   T++V   +  +
Subjt:  PSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDE----------------DMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYM

Query:  DFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGK---
        DFEGRSDG S+  ILS + P +++++HGTAE T+ + +HC +NV   V+ PQ  E IDVTS++  Y+V+L+E L+S + F+K  D E+AW+D  +G    
Subjt:  DFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGK---

Query:  ----------------TENGTLSLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA
                         E  TL+L  L+   +P H SVL+ +LK++D+KQ L    I  EF+GG L C       +  DA
Subjt:  ----------------TENGTLSLLPLSKAALP-HKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADA

Arabidopsis top hitse value%identityAlignment
AT3G25680.1 FUNCTIONS IN: molecular_function unknown4.3e-5433.26Show/hide
Query:  AFVSVTAYPLVDVQEKDH---ETIM---NSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRRE
        +F ++  Y  V++ + D+   ET+    N++ ++   + + + S  V    +++  KT   +V  P  VD  Q +A++ L+ LK+ E ++   +LCT+RE
Subjt:  AFVSVTAYPLVDVQEKDH---ETIM---NSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRRE

Query:  YARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQ
        YARWLV ++S L RN    + PA+ +   +  AFDDI   DPDF  IQ LAEAG+ SSKLS  D  +    D G   F+PES +SR DLV+WK  LE   
Subjt:  YARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQ

Query:  LPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVA
         PE   ++      +IDT  I+PD       D  +G+   I   FG  + FQP++PVTKAQAA+AL +G+    ++ EL+R+EAES+++ A         
Subjt:  LPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELARIEAESMAENAVAAHGALVA

Query:  QVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEI
          + +I   +++++  ER +   +E++      E+E  ++ +E+ +    KE+AAI+ + ++ + L  E++E  Q L+S+K     E  ++ ++  + + 
Subjt:  QVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGLMSNKVEVSYEKERINKLRKEAEI

Query:  ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE
        + + +   +  LE E +AL + R+W EDE K ++ +AK LEEA  RW+
Subjt:  ENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWE

AT5G23880.1 cleavage and polyadenylation specificity factor 1000.0e+0074.97Show/hide
Query:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD
        MGTSVQVTPLCGVYNENPLSYLVS+DGFNFLIDCGWND FD +LL+PLSRVASTIDAVL+SHPDTLH+GALPYAMKQLGL+APV++TEPV+RLGLLTMYD
Subjt:  MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYD

Query:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
        Q+++RKQVS+FDLFTLDDIDSAFQ V RLTYSQN+HLSGKGEGIVIAPHVAGH+LGG++W+ITKDGEDVIYAVD+NHRKERHLNGT+L+SFVRPAVLITD
Subjt:  QYIARKQVSEFDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNAL-NNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI
        AY+AL  NQ  R+Q+DKEF                                       TI K L   GNVLLPVDTAGRVLEL+ ILE +W +   ++PI
Subjt:  AYNAL-NNQPYRRQKDKEFGGTKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPI

Query:  YFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA
        YFLTYVSSSTIDY+KSFLEWMSD+I+KSFE +R+NAFLL+HVTLLINK++LDNAP GPKVVLASMASLEAG++ +IFVEWAND +NLVLF+E GQF TLA
Subjt:  YFLTYVSSSTIDYIKSFLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLA

Query:  RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPST
        RMLQ+ PPPK VKVT+SKRVPLAG+EL+AYEEEQNR K+EEAL+ASL+KEE++KASHG+D+++ +PMIID  +  + DV GSH  AY+DILIDGFVPPS+
Subjt:  RMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPST

Query:  SVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA
        SVAPMFP+Y+NTSEWDDFGE+INPDDYVIKDEDMD+ AMH GGDVDG+LDE  A+L+LD +PSKV+SNEL V V CSL  MD+EGRSDGRSIKS+++HV+
Subjt:  SVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVA

Query:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL
        PLKLVLVH  AEATEHLKQHCL N+CPHVYAPQIEET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AW+D+EVGKTE    SLLP+  AA PHK VL
Subjt:  PLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVL

Query:  VGDLKMADYKQFLASKGIQVEFA-GGALRCGEYVTIRKV
        VGDLK+AD+KQFL+SKG+QVEFA GGALRCGEYVT+RKV
Subjt:  VGDLKMADYKQFLASKGIQVEFA-GGALRCGEYVTIRKV

AT5G23890.1 LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope2.5e-20347.49Show/hide
Query:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLD-LRRKT
        MAS  AT +P+SLQLRLAL+     K P+V +R          R + Y IV    + ++      +G     S S+ D  +GW +SD+ ++    +++K+
Subjt:  MASTPATCSPSSLQLRLALNCKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLD-LRRKT

Query:  WFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG
           G+VG G+ G IL  G+++AA S SK+    +K +M +L++QQE ++ S   +D +  +E               K  N EES+   E++ ++ N V 
Subjt:  WFGGLVGIGITGFILVSGITFAAWSISKQNSSRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVG

Query:  ----DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSN-S
            +G   +KL G   E+SS +  + + A   E    ++P A   V A       ETD   A   K  ++       +L+S  E  +L     N     
Subjt:  ----DGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVTSVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSN-S

Query:  NTNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKES--GFVDRETVTESSKQVLN
        NTNS +P++ +  E  + S+  +  +  K           +DS S+ +  + Y   G + E     L E+SS    T+K       D ET   +++++  
Subjt:  NTNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPPGPLNEISDSSLHELSSVSGDTAKES--GFVDRETVTESSKQVLN

Query:  PTKTERLLSEATTSTLE-----QQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQG
           T      +  S++      ++LE   S     +  +   +++  +D E   N    +    G  FSSAG+PAP  S  +   PGK+LVP   DQ+Q 
Subjt:  PTKTERLLSEATTSTLE-----QQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQG

Query:  QALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQG
        QA +ALQVLKVIE + +P DLCTRREYARWL+SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDF+SIQGLAEAGLI+SKLS  D+L   D+ +G
Subjt:  QALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQG

Query:  PFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDI
         F FSPES LSRQDL+SWKMALEKRQLPEAD+KML+++SGFID DKI+PDA P+++ADLS GE GI ALAFG TRLFQP KPVTK QAAIAL++GEASDI
Subjt:  PFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDI

Query:  VSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQL
        VSEELARIEAESMAE AV+AH ALVA+VEKD+NASFEK+LS+EREK++AVEKMAE AK ELE+LR +RE ENLAL KERAA+ESEMEV SRLR + EE+L
Subjt:  VSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQL

Query:  QGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLRE---QESAGDTWL
        + LMSNK E+++EKER+  LRKEAE E+Q I++LQYELEVERKALSMAR+WAE+EAK+AREQ +ALEEAR RWE  G++VVVD DL+E   +E+     L
Subjt:  QGLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLRE---QESAGDTWL

Query:  DSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVG
        +  ++ SVEET  RA+ LMDKLK MA  V GKS+E++  ++EKI L I+ L+++  + GK+A ++++ AI RA  +A++++Q T +    + +  K++  
Subjt:  DSSKQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVG

Query:  DCREGVEKITQKFKT
        +CR+GV KI+Q+FKT
Subjt:  DCREGVEKITQKFKT

AT5G52410.1 CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119)1.4e-16164.12Show/hide
Query:  SALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFY
        +ALQ LKVIE++  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS +++ SS   +     
Subjt:  SALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGPFY

Query:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE
        FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GEHGI AL+FG TRLFQP K VTKAQ A++LA G+A ++V E
Subjt:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE

Query:  ELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGL
        ELARIEAE+MAEN V AH  LVAQVEKDINASFEK+L  E+E VDAVEK+AEEAK EL RLR E+E E LAL +ER +IE+EME  +R+RNELEEQLQ L
Subjt:  ELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGL

Query:  MSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGD-TWLDSSKQ
         SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWE+ G+KV+VDSDL EQ +  + TWL++ KQ
Subjt:  MSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGD-TWLDSSKQ

Query:  FSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREG
          VE T+ RA NL+ KLK MA +V  KS+E++  IIEKI+LLIS L+Q V     +A+DLK    S+A     +      E+R      AK  V + ++ 
Subjt:  FSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREG

Query:  VEKITQKFKT
        V K+ +KFK+
Subjt:  VEKITQKFKT

AT5G52410.2 INVOLVED IN: biological_process unknown9.3e-16657.68Show/hide
Query:  QVLNPTKTERLLSEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQ
        QVL+P + +  ++  +TST +   E    +A+  S   Y        + + +   T+  P  +       G+PAP     + +L  K + P VVD VQ Q
Subjt:  QVLNPTKTERLLSEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGVPAPLASAAIKTLPGKVLVPAVVDQVQGQ

Query:  ALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGP
          +ALQ LKVIE++  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS +++ SS   +   
Subjt:  ALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRHDILSSFDEDQGP

Query:  FYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIV
          FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GEHGI AL+FG TRLFQP K VTKAQ A++LA G+A ++V
Subjt:  FYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIV

Query:  SEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQ
         EELARIEAE+MAEN V AH  LVAQVEKDINASFEK+L  E+E VDAVEK+AEEAK EL RLR E+E E LAL +ER +IE+EME  +R+RNELEEQLQ
Subjt:  SEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQ

Query:  GLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGD-TWLDSS
         L SNK E+SYEKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWE+ G+KV+VDSDL EQ +  + TWL++ 
Subjt:  GLMSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGD-TWLDSS

Query:  KQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCR
        KQ  VE T+ RA NL+ KLK MA +V  KS+E++  IIEKI+LLIS L+Q V     +A+DLK    S+A     +      E+R      AK  V + +
Subjt:  KQFSVEETVDRAENLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCR

Query:  EGVEKITQKFKT
        + V K+ +KFK+
Subjt:  EGVEKITQKFKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACCTCTGTTCAGGTGACGCCTCTTTGCGGGGTCTACAATGAAAACCCTCTATCATATTTGGTCTCCGTCGACGGCTTCAACTTCCTTATTGACTGTGGTTGGAA
CGACCACTTCGACCCTGCTCTTCTTCAACCTCTCTCCAGGGTGGCATCGACGATAGATGCAGTTTTGATATCACATCCTGATACACTTCACCTTGGTGCCCTTCCTTATG
CCATGAAACAGCTTGGACTTGCTGCTCCAGTTTTTTCCACTGAACCCGTGTATCGATTGGGCCTTCTTACAATGTATGATCAGTATATTGCTAGGAAGCAAGTATCGGAG
TTTGATCTATTTACATTGGATGATATCGATTCTGCTTTCCAAGTTGTAACCAGACTAACATACTCCCAGAATCATCATCTTTCAGGCAAAGGAGAGGGAATAGTTATTGC
ACCTCACGTGGCTGGGCATCTTTTGGGGGGAACCCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGACTTCAACCACCGCAAGGAAAGGCATCTGA
ACGGAACCATTTTAGAGTCATTTGTGCGACCTGCTGTATTGATAACTGATGCTTATAATGCTCTAAATAATCAACCTTACAGACGCCAAAAGGACAAAGAATTCGGAGGT
ACTAAAATCTGTTTTAGGTTTTTAAAATTCTTCATTGCATCTGCAAGAGATTCCATCATTTGCCTCATTCTTCCCCTCAGCCTCCATGCCTGCTTGCGGGGAAAAAAGTA
TACTATCCAGAAGACCTTAAGAGCTAATGGAAATGTGTTACTTCCTGTTGATACTGCTGGGCGAGTGTTGGAGCTTATTCAAATTTTGGAATGGTACTGGGAAGAAGAAA
GTTTAAATTATCCCATCTACTTTTTAACTTATGTTTCATCAAGCACAATTGATTATATCAAGAGTTTCCTGGAGTGGATGAGTGATACAATAGCAAAGTCTTTTGAACAC
ACACGGAACAATGCTTTTCTTCTCAAGCATGTGACCCTTCTAATCAATAAAAGTGAACTTGATAATGCTCCAGATGGTCCAAAGGTTGTTCTAGCATCAATGGCTAGTTT
GGAAGCTGGTTACTCACATGACATTTTTGTTGAGTGGGCAAATGATGCCAAAAATCTCGTGCTTTTTTCTGAGAAAGGCCAGTTTGCCACTTTGGCTCGTATGCTTCAAG
CAGATCCACCTCCAAAAGCTGTTAAGGTAACTGTGTCTAAGAGAGTTCCTTTGGCTGGAGACGAGCTACTTGCTTATGAAGAAGAGCAAAACAGGAAAAAGGAAGAAGCT
CTTAAGGCTAGTTTGCTCAAGGAGGAACAATCCAAAGCATCTCATGGAACTGATAATGATACTGGTGATCCAATGATCATTGATGCTAGCAGTAATGCATCACCAGATGT
AGGTGGTTCACATGTAAGTGCATATCGAGATATATTAATTGATGGCTTCGTTCCTCCTTCAACAAGCGTTGCTCCAATGTTTCCCTTTTATGAAAACACTTCCGAATGGG
ATGATTTTGGTGAAGTAATAAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAAGCAGCGATGCATGCTGGTGGGGACGTGGATGGAAAACTAGATGAAACT
GCTGCTAACCTGATCCTGGATATGAAGCCTTCAAAAGTTGTATCTAACGAATTGACAGTCCAAGTGAAATGTTCATTGCATTACATGGATTTTGAAGGTCGTTCAGATGG
GAGATCAATTAAATCAATACTCTCCCACGTTGCTCCCTTGAAGCTTGTTTTGGTGCATGGAACTGCTGAGGCCACTGAGCATCTTAAGCAACATTGCCTAAAAAACGTCT
GTCCCCATGTCTATGCCCCCCAAATTGAAGAAACGATTGATGTGACTTCTGATCTGTGTGCATATAAGGTACAACTTTCAGAGAAACTGATGAGTAATGTGCTGTTCAAG
AAGCTTGGAGATTATGAAATCGCTTGGCTTGATGCTGAAGTAGGGAAGACTGAGAATGGAACGTTGTCTTTACTTCCCCTCTCAAAGGCTGCTTTGCCTCATAAATCTGT
TCTTGTTGGTGACTTAAAGATGGCTGACTACAAACAATTTCTTGCCAGCAAGGGAATACAGGTGGAATTTGCTGGGGGTGCTTTGAGATGCGGTGAGTATGTCACAATAC
GCAAGGTTGCAGATGCAATTGTTCAATCTGCTATTGGAGTAGGAATTTTCATGGCTTCCACTCCTGCCACGTGTTCACCGAGCTCTCTCCAGCTTCGGCTAGCGCTGAAT
TGCAAGAATTGCGCGAAATTCCCTTCAGTTCTTGTTCGGGCGAGAGTGAGGAAGCTGGATCCTCGGGTCCGCATGACTTGTTATCCTATTGTTTATAATGGCGCGATAAT
CGAAAGAGCAAATGGGCAGCGCCGTAGTGGAGTTTGTTTTGCTCGATCGGATTCGACAGGCGATGGATTCTCAGGGTGGTCGGAATCGGATTCCGGGGAGGAGGTTTTGG
ACTTGCGGAGAAAGACGTGGTTTGGAGGGTTGGTGGGAATTGGAATTACTGGATTCATTCTTGTTTCGGGAATCACCTTTGCAGCATGGTCAATAAGCAAGCAGAATAGT
TCCAGACAAAAGCCACAAATGGAGGCCTTAAGTACGCAGCAAGAATTATTATTGGACTCTGATACTGGAAATGATAGGCTGGGTGAAAATGAAAAAGAAGATAACAGTGT
GAATGCAGATGATAGAACTCTCGCTGGTAAAACAGGTAATCATGAGGAATCTTCTTCATATACAGAAAATGAAGATCTCGACAAAAATACAGTTGGTGATGGTGTTGATG
TCGAGAAGTTATCAGGAAATGATGTTGAATCTTCATCCAGCAATAATGATGTCAATAATGTTGCTTCCTTCCAAGAAGATTTCCAATCTGATTCCCCATCAGCTGTTACG
TCAGTTGCTGCCGGAAGTTTGAGTTCACTTATGGAGACTGATTCTAGTGTAGCTTCCGGTTCTAAAGATGGAAACGACTGTCATGCTGTTGGTACAGAAGTCTTAAATTC
TGAACCTGAAATGAACATATTGAAAGATGGCCCAGATAACTCTTCTAATTCTAATACTAATTCATTAAACCCTAAAACTGATATTCAGGATGAAACACCTGACACTAGTG
AAAATTATGATTTCAGCTCTGAGCATAAAATGAACCTAGTACCTGAGAAGTTACCACTCTATGATGATAGTACATCAAACCATAATTCTGGCAACCAGTATGAACCACCT
GGGCCTCTTAATGAAATTTCAGATTCTTCGTTGCATGAACTTTCTAGCGTGTCTGGCGACACAGCCAAGGAATCAGGATTTGTTGACAGAGAGACTGTGACTGAATCATC
TAAACAAGTTCTGAACCCCACCAAAACTGAAAGGCTCTTATCTGAGGCAACTACATCAACCTTAGAACAGCAACTAGAAAGAGGATTATCTGAAGCAGCATTTGTCTCTG
TCACAGCTTATCCATTGGTTGATGTTCAAGAGAAAGATCATGAAACTATCATGAATAGTACTGCTGCCAAACCGGAACTACAAGGGATTTTATTTTCTTCTGCAGGTGTT
CCTGCTCCTTTGGCTTCTGCAGCTATAAAAACACTTCCTGGAAAGGTCCTAGTTCCTGCAGTTGTGGATCAGGTTCAGGGGCAGGCATTGTCAGCACTGCAAGTTTTAAA
GGTGATAGAGGCTGAGGTTGAACCTGGTGATCTATGTACACGTCGGGAATATGCTCGTTGGCTGGTGTCTGCAAGCAGTGCTCTTTCAAGGAACACAACATCTAAAGTAT
ATCCAGCAATGTATATAGAGAATGTTACTGAGTTGGCATTTGATGATATTACTCCCGAAGATCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCA
AGCAAACTTTCGAGACATGACATTCTTTCTTCCTTTGATGAAGACCAGGGTCCTTTTTATTTCTCTCCCGAAAGCCCCCTATCACGTCAAGATCTTGTGAGTTGGAAGAT
GGCCCTCGAGAAAAGACAGTTGCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGATTTATAGATACTGATAAGATCCATCCCGATGCTTGTCCTGCCCTTGTTG
CTGATCTCTCTGTAGGAGAGCATGGAATTATAGCTCTAGCATTTGGATATACAAGGCTTTTCCAGCCGGATAAGCCTGTAACAAAAGCACAAGCTGCCATTGCTCTTGCA
ACTGGGGAGGCTTCTGATATAGTAAGTGAGGAGCTTGCAAGGATTGAAGCAGAATCAATGGCAGAAAATGCAGTTGCTGCACATGGTGCTCTAGTAGCTCAAGTTGAGAA
AGATATTAATGCCAGCTTTGAGAAACAACTCTCAATTGAAAGGGAAAAGGTCGATGCCGTGGAGAAAATGGCAGAAGAGGCAAAGCAAGAGCTGGAGAGATTAAGATCAG
AAAGAGAGAGAGAAAATCTCGCCTTGACGAAGGAACGTGCTGCCATTGAATCGGAAATGGAAGTTTTTTCAAGATTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTG
ATGAGTAATAAAGTAGAGGTATCTTATGAAAAGGAAAGAATCAATAAACTCAGGAAAGAAGCTGAAATCGAAAACCAGGAGATTGCCCGCCTGCAATATGAGCTTGAGGT
TGAGAGAAAGGCGTTGTCCATGGCCAGAGCTTGGGCAGAGGACGAAGCGAAAAGAGCAAGAGAACAAGCCAAAGCACTTGAAGAGGCTAGAGATCGCTGGGAAAGGCGTG
GAATCAAAGTTGTCGTTGACAGCGATCTCCGTGAACAGGAATCCGCTGGCGATACTTGGCTCGATTCTAGCAAACAGTTTTCAGTCGAAGAAACTGTCGACAGGGCCGAG
AACTTAATGGACAAGCTGAAAGTAATGGCTGCAGAAGTAAGAGGGAAATCCAAAGAGATAGTCGACAAGATCATCGAGAAGATAGCTTTACTAATATCAAACTTGAGGCA
ATGGGTTTCCGACGCTGGAAAACAGGCTGAAGATCTAAAGGAAGTGGCCATTTCAAGGGCAAGTAGATCAGCAAGTGAGCTGCAACAGAGCACTGCGGAGTTGAGGTTGG
CGCTGAAAGAGGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTACCCAAAAGTTCAAAACATCGTATGGT
mRNA sequenceShow/hide mRNA sequence
ATGGGAACCTCTGTTCAGGTGACGCCTCTTTGCGGGGTCTACAATGAAAACCCTCTATCATATTTGGTCTCCGTCGACGGCTTCAACTTCCTTATTGACTGTGGTTGGAA
CGACCACTTCGACCCTGCTCTTCTTCAACCTCTCTCCAGGGTGGCATCGACGATAGATGCAGTTTTGATATCACATCCTGATACACTTCACCTTGGTGCCCTTCCTTATG
CCATGAAACAGCTTGGACTTGCTGCTCCAGTTTTTTCCACTGAACCCGTGTATCGATTGGGCCTTCTTACAATGTATGATCAGTATATTGCTAGGAAGCAAGTATCGGAG
TTTGATCTATTTACATTGGATGATATCGATTCTGCTTTCCAAGTTGTAACCAGACTAACATACTCCCAGAATCATCATCTTTCAGGCAAAGGAGAGGGAATAGTTATTGC
ACCTCACGTGGCTGGGCATCTTTTGGGGGGAACCCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGACTTCAACCACCGCAAGGAAAGGCATCTGA
ACGGAACCATTTTAGAGTCATTTGTGCGACCTGCTGTATTGATAACTGATGCTTATAATGCTCTAAATAATCAACCTTACAGACGCCAAAAGGACAAAGAATTCGGAGGT
ACTAAAATCTGTTTTAGGTTTTTAAAATTCTTCATTGCATCTGCAAGAGATTCCATCATTTGCCTCATTCTTCCCCTCAGCCTCCATGCCTGCTTGCGGGGAAAAAAGTA
TACTATCCAGAAGACCTTAAGAGCTAATGGAAATGTGTTACTTCCTGTTGATACTGCTGGGCGAGTGTTGGAGCTTATTCAAATTTTGGAATGGTACTGGGAAGAAGAAA
GTTTAAATTATCCCATCTACTTTTTAACTTATGTTTCATCAAGCACAATTGATTATATCAAGAGTTTCCTGGAGTGGATGAGTGATACAATAGCAAAGTCTTTTGAACAC
ACACGGAACAATGCTTTTCTTCTCAAGCATGTGACCCTTCTAATCAATAAAAGTGAACTTGATAATGCTCCAGATGGTCCAAAGGTTGTTCTAGCATCAATGGCTAGTTT
GGAAGCTGGTTACTCACATGACATTTTTGTTGAGTGGGCAAATGATGCCAAAAATCTCGTGCTTTTTTCTGAGAAAGGCCAGTTTGCCACTTTGGCTCGTATGCTTCAAG
CAGATCCACCTCCAAAAGCTGTTAAGGTAACTGTGTCTAAGAGAGTTCCTTTGGCTGGAGACGAGCTACTTGCTTATGAAGAAGAGCAAAACAGGAAAAAGGAAGAAGCT
CTTAAGGCTAGTTTGCTCAAGGAGGAACAATCCAAAGCATCTCATGGAACTGATAATGATACTGGTGATCCAATGATCATTGATGCTAGCAGTAATGCATCACCAGATGT
AGGTGGTTCACATGTAAGTGCATATCGAGATATATTAATTGATGGCTTCGTTCCTCCTTCAACAAGCGTTGCTCCAATGTTTCCCTTTTATGAAAACACTTCCGAATGGG
ATGATTTTGGTGAAGTAATAAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAAGCAGCGATGCATGCTGGTGGGGACGTGGATGGAAAACTAGATGAAACT
GCTGCTAACCTGATCCTGGATATGAAGCCTTCAAAAGTTGTATCTAACGAATTGACAGTCCAAGTGAAATGTTCATTGCATTACATGGATTTTGAAGGTCGTTCAGATGG
GAGATCAATTAAATCAATACTCTCCCACGTTGCTCCCTTGAAGCTTGTTTTGGTGCATGGAACTGCTGAGGCCACTGAGCATCTTAAGCAACATTGCCTAAAAAACGTCT
GTCCCCATGTCTATGCCCCCCAAATTGAAGAAACGATTGATGTGACTTCTGATCTGTGTGCATATAAGGTACAACTTTCAGAGAAACTGATGAGTAATGTGCTGTTCAAG
AAGCTTGGAGATTATGAAATCGCTTGGCTTGATGCTGAAGTAGGGAAGACTGAGAATGGAACGTTGTCTTTACTTCCCCTCTCAAAGGCTGCTTTGCCTCATAAATCTGT
TCTTGTTGGTGACTTAAAGATGGCTGACTACAAACAATTTCTTGCCAGCAAGGGAATACAGGTGGAATTTGCTGGGGGTGCTTTGAGATGCGGTGAGTATGTCACAATAC
GCAAGGTTGCAGATGCAATTGTTCAATCTGCTATTGGAGTAGGAATTTTCATGGCTTCCACTCCTGCCACGTGTTCACCGAGCTCTCTCCAGCTTCGGCTAGCGCTGAAT
TGCAAGAATTGCGCGAAATTCCCTTCAGTTCTTGTTCGGGCGAGAGTGAGGAAGCTGGATCCTCGGGTCCGCATGACTTGTTATCCTATTGTTTATAATGGCGCGATAAT
CGAAAGAGCAAATGGGCAGCGCCGTAGTGGAGTTTGTTTTGCTCGATCGGATTCGACAGGCGATGGATTCTCAGGGTGGTCGGAATCGGATTCCGGGGAGGAGGTTTTGG
ACTTGCGGAGAAAGACGTGGTTTGGAGGGTTGGTGGGAATTGGAATTACTGGATTCATTCTTGTTTCGGGAATCACCTTTGCAGCATGGTCAATAAGCAAGCAGAATAGT
TCCAGACAAAAGCCACAAATGGAGGCCTTAAGTACGCAGCAAGAATTATTATTGGACTCTGATACTGGAAATGATAGGCTGGGTGAAAATGAAAAAGAAGATAACAGTGT
GAATGCAGATGATAGAACTCTCGCTGGTAAAACAGGTAATCATGAGGAATCTTCTTCATATACAGAAAATGAAGATCTCGACAAAAATACAGTTGGTGATGGTGTTGATG
TCGAGAAGTTATCAGGAAATGATGTTGAATCTTCATCCAGCAATAATGATGTCAATAATGTTGCTTCCTTCCAAGAAGATTTCCAATCTGATTCCCCATCAGCTGTTACG
TCAGTTGCTGCCGGAAGTTTGAGTTCACTTATGGAGACTGATTCTAGTGTAGCTTCCGGTTCTAAAGATGGAAACGACTGTCATGCTGTTGGTACAGAAGTCTTAAATTC
TGAACCTGAAATGAACATATTGAAAGATGGCCCAGATAACTCTTCTAATTCTAATACTAATTCATTAAACCCTAAAACTGATATTCAGGATGAAACACCTGACACTAGTG
AAAATTATGATTTCAGCTCTGAGCATAAAATGAACCTAGTACCTGAGAAGTTACCACTCTATGATGATAGTACATCAAACCATAATTCTGGCAACCAGTATGAACCACCT
GGGCCTCTTAATGAAATTTCAGATTCTTCGTTGCATGAACTTTCTAGCGTGTCTGGCGACACAGCCAAGGAATCAGGATTTGTTGACAGAGAGACTGTGACTGAATCATC
TAAACAAGTTCTGAACCCCACCAAAACTGAAAGGCTCTTATCTGAGGCAACTACATCAACCTTAGAACAGCAACTAGAAAGAGGATTATCTGAAGCAGCATTTGTCTCTG
TCACAGCTTATCCATTGGTTGATGTTCAAGAGAAAGATCATGAAACTATCATGAATAGTACTGCTGCCAAACCGGAACTACAAGGGATTTTATTTTCTTCTGCAGGTGTT
CCTGCTCCTTTGGCTTCTGCAGCTATAAAAACACTTCCTGGAAAGGTCCTAGTTCCTGCAGTTGTGGATCAGGTTCAGGGGCAGGCATTGTCAGCACTGCAAGTTTTAAA
GGTGATAGAGGCTGAGGTTGAACCTGGTGATCTATGTACACGTCGGGAATATGCTCGTTGGCTGGTGTCTGCAAGCAGTGCTCTTTCAAGGAACACAACATCTAAAGTAT
ATCCAGCAATGTATATAGAGAATGTTACTGAGTTGGCATTTGATGATATTACTCCCGAAGATCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCA
AGCAAACTTTCGAGACATGACATTCTTTCTTCCTTTGATGAAGACCAGGGTCCTTTTTATTTCTCTCCCGAAAGCCCCCTATCACGTCAAGATCTTGTGAGTTGGAAGAT
GGCCCTCGAGAAAAGACAGTTGCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGATTTATAGATACTGATAAGATCCATCCCGATGCTTGTCCTGCCCTTGTTG
CTGATCTCTCTGTAGGAGAGCATGGAATTATAGCTCTAGCATTTGGATATACAAGGCTTTTCCAGCCGGATAAGCCTGTAACAAAAGCACAAGCTGCCATTGCTCTTGCA
ACTGGGGAGGCTTCTGATATAGTAAGTGAGGAGCTTGCAAGGATTGAAGCAGAATCAATGGCAGAAAATGCAGTTGCTGCACATGGTGCTCTAGTAGCTCAAGTTGAGAA
AGATATTAATGCCAGCTTTGAGAAACAACTCTCAATTGAAAGGGAAAAGGTCGATGCCGTGGAGAAAATGGCAGAAGAGGCAAAGCAAGAGCTGGAGAGATTAAGATCAG
AAAGAGAGAGAGAAAATCTCGCCTTGACGAAGGAACGTGCTGCCATTGAATCGGAAATGGAAGTTTTTTCAAGATTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTG
ATGAGTAATAAAGTAGAGGTATCTTATGAAAAGGAAAGAATCAATAAACTCAGGAAAGAAGCTGAAATCGAAAACCAGGAGATTGCCCGCCTGCAATATGAGCTTGAGGT
TGAGAGAAAGGCGTTGTCCATGGCCAGAGCTTGGGCAGAGGACGAAGCGAAAAGAGCAAGAGAACAAGCCAAAGCACTTGAAGAGGCTAGAGATCGCTGGGAAAGGCGTG
GAATCAAAGTTGTCGTTGACAGCGATCTCCGTGAACAGGAATCCGCTGGCGATACTTGGCTCGATTCTAGCAAACAGTTTTCAGTCGAAGAAACTGTCGACAGGGCCGAG
AACTTAATGGACAAGCTGAAAGTAATGGCTGCAGAAGTAAGAGGGAAATCCAAAGAGATAGTCGACAAGATCATCGAGAAGATAGCTTTACTAATATCAAACTTGAGGCA
ATGGGTTTCCGACGCTGGAAAACAGGCTGAAGATCTAAAGGAAGTGGCCATTTCAAGGGCAAGTAGATCAGCAAGTGAGCTGCAACAGAGCACTGCGGAGTTGAGGTTGG
CGCTGAAAGAGGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAGTAGAGAAAATTACCCAAAAGTTCAAAACATCGTATGGT
Protein sequenceShow/hide protein sequence
MGTSVQVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLAAPVFSTEPVYRLGLLTMYDQYIARKQVSE
FDLFTLDDIDSAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGG
TKICFRFLKFFIASARDSIICLILPLSLHACLRGKKYTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIYFLTYVSSSTIDYIKSFLEWMSDTIAKSFEH
TRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVEWANDAKNLVLFSEKGQFATLARMLQADPPPKAVKVTVSKRVPLAGDELLAYEEEQNRKKEEA
LKASLLKEEQSKASHGTDNDTGDPMIIDASSNASPDVGGSHVSAYRDILIDGFVPPSTSVAPMFPFYENTSEWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDET
AANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLFK
KLGDYEIAWLDAEVGKTENGTLSLLPLSKAALPHKSVLVGDLKMADYKQFLASKGIQVEFAGGALRCGEYVTIRKVADAIVQSAIGVGIFMASTPATCSPSSLQLRLALN
CKNCAKFPSVLVRARVRKLDPRVRMTCYPIVYNGAIIERANGQRRSGVCFARSDSTGDGFSGWSESDSGEEVLDLRRKTWFGGLVGIGITGFILVSGITFAAWSISKQNS
SRQKPQMEALSTQQELLLDSDTGNDRLGENEKEDNSVNADDRTLAGKTGNHEESSSYTENEDLDKNTVGDGVDVEKLSGNDVESSSSNNDVNNVASFQEDFQSDSPSAVT
SVAAGSLSSLMETDSSVASGSKDGNDCHAVGTEVLNSEPEMNILKDGPDNSSNSNTNSLNPKTDIQDETPDTSENYDFSSEHKMNLVPEKLPLYDDSTSNHNSGNQYEPP
GPLNEISDSSLHELSSVSGDTAKESGFVDRETVTESSKQVLNPTKTERLLSEATTSTLEQQLERGLSEAAFVSVTAYPLVDVQEKDHETIMNSTAAKPELQGILFSSAGV
PAPLASAAIKTLPGKVLVPAVVDQVQGQALSALQVLKVIEAEVEPGDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLIS
SKLSRHDILSSFDEDQGPFYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEHGIIALAFGYTRLFQPDKPVTKAQAAIALA
TGEASDIVSEELARIEAESMAENAVAAHGALVAQVEKDINASFEKQLSIEREKVDAVEKMAEEAKQELERLRSERERENLALTKERAAIESEMEVFSRLRNELEEQLQGL
MSNKVEVSYEKERINKLRKEAEIENQEIARLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWERRGIKVVVDSDLREQESAGDTWLDSSKQFSVEETVDRAE
NLMDKLKVMAAEVRGKSKEIVDKIIEKIALLISNLRQWVSDAGKQAEDLKEVAISRASRSASELQQSTAELRLALKEGAKRVVGDCREGVEKITQKFKTSYG