; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G012160 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G012160
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCleavage and polyadenylation specificity factor subunit 2
Genome locationchr05:20026351..20069699
RNA-Seq ExpressionLsi05G012160
SyntenyLsi05G012160
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0006412 - translation (biological process)
GO:0006468 - protein phosphorylation (biological process)
GO:0016567 - protein ubiquitination (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005840 - ribosome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0004842 - ubiquitin-protein transferase activity (molecular function)
GO:0004672 - protein kinase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR000509 - Ribosomal protein L36e
IPR038097 - Ribosomal protein L36e domain superfamily
IPR036866 - Ribonuclease Z/Hydroxyacylglutathione hydrolase-like
IPR035639 - CPSF2, metallo-hydrolase domain
IPR025069 - Cleavage and polyadenylation specificity factor 2, C-terminal
IPR017441 - Protein kinase, ATP binding site
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR011009 - Protein kinase-like domain superfamily
IPR007369 - Peptidase A22B, signal peptide peptidase
IPR003613 - U box domain
IPR001279 - Metallo-beta-lactamase
IPR001245 - Serine-threonine/tyrosine-protein kinase, catalytic domain
IPR000719 - Protein kinase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW41981.1 Cleavage and polyadenylation specificity factor subunit 2 [Vitis vinifera]0.0e+0053.78Show/hide
Query:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK
        VTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLT+  Q +   
Subjt:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK

Query:  ELRYLPRHLWEIKEAN----DMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAV
            LP    ++ + +    D +      +T+   S       KGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVDFNHRKER LNGT+LESFVRPAV
Subjt:  ELRYLPRHLWEIKEAN----DMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAV

Query:  LITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE--WV---------VLASMAS--------------------------
        LITDAYNALNNQP RRQ+D+EF D I KTLR +GNVLLPVDTAGRVLEL+ ILE  W           L  +AS                          
Subjt:  LITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE--WV---------VLASMAS--------------------------

Query:  ------------------------------LEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-
                                      LEAG+SHDIFV+WA DAKNLVLFSERGQF TLARMLQADPPPKAVKVT+SKRVPL G+EL AYEEEQ R 
Subjt:  ------------------------------LEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-

Query:  KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQ
        KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  A  DV   H G +RDILIDGFVP STSVAPMFPFYEN+S WDDFGEVINP+DYVIKDEDMDQ
Subjt:  KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQ

Query:  AAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------------------------VLVHGT
        A M                          + G D++GKLDE AA+LI D  PSKV+SNELT                                 VLVHG+
Subjt:  AAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------------------------VLVHGT

Query:  AEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAA
        AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYK                 VQLSEKLMSNVLFKKLGDYE+AW+DAEVGKTE+G+LSLLPLS   
Subjt:  AEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAA

Query:  SPHKSVIVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQK---LRRLVEYGRDP-----------------------------------
          H +V VGD+KMADFKQFLASKGIQVEF+GGALRCGEYVTLRKV DASQK   +  ++E+ RD                                    
Subjt:  SPHKSVIVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQK---LRRLVEYGRDP-----------------------------------

Query:  -----------------AILRF----FMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGS
                         ++L F     MAS     SPSS QLR +  C+   + P+V VR  VRKLD ++RV+   I  +G     G G  R G  +  S
Subjt:  -----------------AILRF----FMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGS

Query:  KSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS-------MQQELLLDSDTGNDRLGEDE-KEDN
        +S  D  SGWS SD   +    ++K+W GG+VG G+ G +LV+G++FAA+S++KQN SR + QMEA++       +Q++  L+S TG D +     +ED 
Subjt:  KSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS-------MQQELLLDSDTGNDRLGEDE-KEDN

Query:  SVNA--DDETLAGRTGNQE----DSSSCTENEETLNKNRVGDSVDVG-----ELAEND-----VESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLS
        S  +  DD  +AG + +      +S    +  +TL+ N    S+ VG     +L EN      V SS  + D  N+ S  +     S  T  S      S
Subjt:  SVNA--DDETLAGRTGNQE----DSSSCTENEETLNKNRVGDSVDVG-----ELAEND-----VESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLS

Query:  SLISPESEFDSNVASCFKDVNNCHS----------GAEVSTSEPEMNILKDEPDNLPNSNTNSLN--LKTDIQDESPDTGENYDFSSKKLPVYDDSSSNY
        S   P +  + +     K +N   S          G +   SE E   L +   ++PN +   L+    + IQD   D GE  D    +  + + S    
Subjt:  SLISPESEFDSNVASCFKDVNNCHS----------GAEVSTSEPEMNILKDEPDNLPNSNTNSLN--LKTDIQDESPDTGENYDFSSKKLPVYDDSSSNY

Query:  NSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLF--DRETVAESSDGVLSPSKTEQLLSEDTASTIE------------------------------
           N      PV++     L+    I   + ++      DRE +         P+K+   L +  AS I+                              
Subjt:  NSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLF--DRETVAESSDGVLSPSKTEQLLSEDTASTIE------------------------------

Query:  --------QQVERGLSEAALVSITD--------------YPSADDQEKNHET-------------IKNS-----VPAP-LVSAAVKTLPGKVLVPAVVDQ
                +++E+ +   + V + D              YP   DQ+ N +               +NS     +PAP  VS ++K LPG+V+VPAVVDQ
Subjt:  --------QQVERGLSEAALVSITD--------------YPSADDQEKNHET-------------IKNS-----VPAP-LVSAAVKTLPGKVLVPAVVDQ

Query:  VHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRND-ISSSLD
        V GQALAALQVLKVIE DV+PSDLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFDDITPEDPDF+SIQGLAEAGLISSKLSR D +S S +
Subjt:  VHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRND-ISSSLD

Query:  EDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGE
        EDQ P YFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PALVAD S GEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGE
Subjt:  EDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGE

Query:  ASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNEL
        +SDIVSEEL RIEAE+MAE AVA HSALV QVEK++NASFEKELS+ER+K++A+EK+AEEA+QELE+LR+ER+ D+I+L++ERA+IESEMEV SRLR+E+
Subjt:  ASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNEL

Query:  EEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTW
        EEQLQ  MSNKVE+S+EKERI+KLRKEAE ENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVD++LRE+ SA  TW
Subjt:  EEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTW

Query:  LDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVV
        LD++KQF+V+ T  RAENL++KL  M +++RGKS+DVI+ I+QKI  L+S LR+  SK G +  +LK+ A+ KA  S  ELQQ+TAE SLA+KEG KRVV
Subjt:  LDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVV

Query:  GDCREG
        GDCR G
Subjt:  GDCREG

RVX01974.1 Cleavage and polyadenylation specificity factor subunit 2 [Vitis vinifera]0.0e+0053.19Show/hide
Query:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK
        VTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQ+++RK
Subjt:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK

Query:  E-------------------LRYLPRHLWEIKEAN------DMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDF
        +                   L  L   L   + ++      D +      +T+   S       KGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVDF
Subjt:  E-------------------LRYLPRHLWEIKEAN------DMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDF

Query:  NHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI---------------------------
        NHRKER LNGT+LESFVRPAVLITDAYNALNNQP RRQ+D+EF D I KTLR +GNVLLPVDTAGRVLEL+ I                           
Subjt:  NHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI---------------------------

Query:  ----LEW---------------------------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVK
            LEW                                       +VLASMASLEAG+SHDIFV+WA DAKNLVLFSERGQF TLARMLQADPPPKAVK
Subjt:  ----LEW---------------------------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVK

Query:  VTVSKRVPLTGDELIAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENN
        VT+SKRVPL G+EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  A  DV   H G +RDILIDGFVP STSVAPMFPFYEN+
Subjt:  VTVSKRVPLTGDELIAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENN

Query:  SAWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------
        S WDDFGEVINP+DYVIKDEDMDQA M                          + G D++GKLDE AA+LI D  PSKV+SNELT               
Subjt:  SAWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------

Query:  ------------------VLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEI
                          VLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYK                 VQLSEKLMSNVLFKKLGDYE+
Subjt:  ------------------VLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEI

Query:  AWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQKL------------RRLVEYGRDPAI
        AW+DAEVGKTE+G+LSLLPLS     H +V VGD+KMAD   F  +K  +     G   CGEYVTLRKV DASQK+            + ++  G++ AI
Subjt:  AWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQKL------------RRLVEYGRDPAI

Query:  LRF---------------------------FMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLC
          +                            MAS     SPSS QLR +  C+   + P+V VR  VRKLD ++RV+   I  +G     G G  R G  
Subjt:  LRF---------------------------FMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLC

Query:  FAGSKSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS-------MQQELLLDSDTGNDRLGEDE-
        +  S+S  D  SGWS SD   +    ++K+W GG+VG G+ G +LV+G++FAA+S++KQN SR + QMEA++       +Q++  L+S TG D +     
Subjt:  FAGSKSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS-------MQQELLLDSDTGNDRLGEDE-

Query:  KEDNSVNA--DDETLAGRTGNQE----DSSSCTENEETLNKNRVGDSVDVG-----ELAEND-----VESSSSNNDVNNVASLQEDFQSDSSLTVTSVAP
        +ED S  +  DD  +AG + +      +S    +  +TL+ N    S+ VG     +L EN      V SS  + D  N+ S  +     S  T  S   
Subjt:  KEDNSVNA--DDETLAGRTGNQE----DSSSCTENEETLNKNRVGDSVDVG-----ELAEND-----VESSSSNNDVNNVASLQEDFQSDSSLTVTSVAP

Query:  GSLSSLISPESEFDSNVASCFKDVNNCHS----------GAEVSTSEPEMNILKDEPDNLPNSNTNSLN--LKTDIQDESPDTGENYDFSSKKLPVYDDS
           SS   P +  + +     K +N   S          G +   SE E   L +   ++PN +   L+    + IQD   D GE  D    +  + + S
Subjt:  GSLSSLISPESEFDSNVASCFKDVNNCHS----------GAEVSTSEPEMNILKDEPDNLPNSNTNSLN--LKTDIQDESPDTGENYDFSSKKLPVYDDS

Query:  SSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLF--DRETVAESSDGVLSPSKTEQLLSEDTASTIE--------------------------
               N      PV++     L+    I   + ++      DRE +         P+K+   L +  AS I+                          
Subjt:  SSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLF--DRETVAESSDGVLSPSKTEQLLSEDTASTIE--------------------------

Query:  ------------QQVERGLSEAALVSITD--------------YPSADDQEKNHET-------------IKNS-----VPAP-LVSAAVKTLPGKVLVPA
                    +++E+ +   + V + D              YP   DQ+ N +               +NS     +PAP  VS ++K LPG+V+VPA
Subjt:  ------------QQVERGLSEAALVSITD--------------YPSADDQEKNHET-------------IKNS-----VPAP-LVSAAVKTLPGKVLVPA

Query:  VVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRND-IS
        VVDQV GQALAALQVLKVIE DV+PSDLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFDDITPEDPDF+SIQGLAEAGLISSKLSR D +S
Subjt:  VVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRND-IS

Query:  SSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIAL
         S +EDQ P YFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PAL+AD S GEQGIIALAFGYTRLFQP+KPVTKAQAAIAL
Subjt:  SSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIAL

Query:  ATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRL
        ATGE+SDIVSEEL RIEAE+MAE AVA HSALV QVEK++NASFEKELS+ER+K++A+EK+AEEA+QELE+LR+ER+ D+I+L++ERA+IESEMEV SRL
Subjt:  ATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRL

Query:  RNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESA
        R+E+EEQLQ  MSNKVE+S+EKERI+KLRKEAE ENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVD++LRE+ SA
Subjt:  RNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESA

Query:  GDTWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGA
          TWLD++KQF+V+ T  RAENL++KL  M +++RGKS+DVI+ I+QKI  L+S LR+  SK G +  +LK+ A+ KA  S  ELQQ+TAE SLA+KEG 
Subjt:  GDTWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGA

Query:  KRVVGDCREG
        KRVVGDCR G
Subjt:  KRVVGDCREG

XP_008439251.1 PREDICTED: uncharacterized protein LOC103484091 isoform X1 [Cucumis melo]0.0e+0089.24Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MAST PTCSPSSLQLRLALNC NCGKFPSV VRARVRKLDPRLR++C PIVHNGAK +RGNGLR +G+CFAGS+STADGFSGWSESDSQGEVLDLRRKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGGLVGIGITGFILVSGITFAAWSINKQNSSRQK QMEALS QQELLLDS+TG DRLGEDEKEDNSV+ADDET AG+ GNQEDSSSCTENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        D VDV ELAEN VESSSSNNDV+N ASLQEDFQSDSSLTVT+VAPGSLSS ISPESEFDSNVASC KDVNNCH G EVSTSEPEMN+LKDEPDN PNSN 
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI+DE PDTGEN+D SSKKLPVYD+SSSNY SGNQDETLG VNEITDSSLQGFS +S DT KES LFD  TVA+S +GVLSPSK EQ  SED 
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        A +IEQQ+E GLSEAALVSITDYP ADDQE NHETI N                 VPAPLVSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIE+DVE
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSR+DISSSLDEDQGPLYFSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER RDS+ LM ERAS+ESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEK GIKVVVDSDLREQES GDTWLDSSKQF VEETTDRAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG
        KLKRM+ EVRGKSRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKNVAIS+A+RSATELQQSTAELSLA+KEGAKRVVGDCREG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG

XP_011651163.1 uncharacterized protein LOC101215442 isoform X1 [Cucumis sativus]0.0e+0088.83Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MAST PTCSP+SLQLRLALNC NCGKFPS+LVRARVRKLDPRLRVICHPIVHNG K +RGNG R +G+CFAGS+ST DGFSGWSESDSQGE LDLRRKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGG VGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS QQELLLDS+TG DRLGEDEKED SV+ADDETLAG+ GNQEDSSS TENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        D VDV ELAEN VESSSSNNDV+NVASLQEDFQSDSSL VTSVAPGSLSSLISPESEFD+NVASC KDVNN H G EVSTSEPEMNILKDEPDNLPNSNT
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI+DE PDTGENYD  SKKLPVYDDSSSNY SGNQDETL PV+EITDSSLQGFS IS DT KESGLFD ETVA+SS+GV SPS+ EQ  SED 
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        A +IEQ +E  LSEAALVSI+DYP ADDQEKNHETI N                 VPAPLVSAAVKT PGKVL+PAVVDQV GQALAALQVLKVIE DVE
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSR+DISSSLDEDQGPLYFSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERER+ + LM ERASIESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEK GIKVVVDSDLREQESAGDTWLDSSKQF VEETT+RAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG
        KLKRM+ EVRG+SRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKN AIS+A RSA ELQQSTAELSLA+KEGAKRVVGDCREG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG

XP_038892464.1 uncharacterized protein LOC120081550 isoform X1 [Benincasa hispida]0.0e+0087.61Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MASTP TCSPSSLQLRLALNCKNCGKFPSVLVRAR+RKLDPRLRVICHPIV+NGA++ERGNGL  +G+CFAGS+STADGFSGWSESDSQ +VLDL RKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
         GG VGIGITGFIL+SGITFAAWSINKQNSSRQKPQMEALS QQELLL SDTGND+LGED KE+N +NADDET  G+TGNQEDSSSCTENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        DSVDV ELAENDVESSSSNNDVN+V SLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSN+ASCFKDVNN HSGAEVSTSE EMNILKDEPDNLPNSNT
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI DE PDTGENYDFSSKKLP+YDDSSSNYNSGNQD+TLGPVNEITDSSLQ  S                               EQ LSE T
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        ASTIEQ++ERGLSEAALVS+TDYPSADDQEKNHE++ N                 VPAP+VSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIEADV 
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSR+DISSSLDEDQGPL FSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLP ADRKMLHQVSGFID DKIHPDACPALVADLSVGEQGI+ALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VA HSALVAQVEKDINASFEKELSIEREKV+AVEKMAEEAKQELERLRS+RERDSITL+RERASIESEME+ SRLR+ELEEQL+GLMSNKVE+SFEKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEK GIKVVVDSDLREQESA DTWLDSSKQFAVEETTDRAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG
        KLKRM+TEVRGKSRDVIEKIIQKIALLVSNLRQWISK+GE+AEDLKNVAIS+A+RSA ELQQSTAELSLALKEGAKRVVGDCREG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG

TrEMBL top hitse value%identityAlignment
A0A0A0L9T9 Uncharacterized protein0.0e+0088.83Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MAST PTCSP+SLQLRLALNC NCGKFPS+LVRARVRKLDPRLRVICHPIVHNG K +RGNG R +G+CFAGS+ST DGFSGWSESDSQGE LDLRRKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGG VGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS QQELLLDS+TG DRLGEDEKED SV+ADDETLAG+ GNQEDSSS TENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        D VDV ELAEN VESSSSNNDV+NVASLQEDFQSDSSL VTSVAPGSLSSLISPESEFD+NVASC KDVNN H G EVSTSEPEMNILKDEPDNLPNSNT
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI+DE PDTGENYD  SKKLPVYDDSSSNY SGNQDETL PV+EITDSSLQGFS IS DT KESGLFD ETVA+SS+GV SPS+ EQ  SED 
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        A +IEQ +E  LSEAALVSI+DYP ADDQEKNHETI N                 VPAPLVSAAVKT PGKVL+PAVVDQV GQALAALQVLKVIE DVE
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITP+DPDFASIQGLAEAG+ISSKLSR+DISSSLDEDQGPLYFSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERER+ + LM ERASIESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAK+AREQAKALEEARDRWEK GIKVVVDSDLREQESAGDTWLDSSKQF VEETT+RAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG
        KLKRM+ EVRG+SRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKN AIS+A RSA ELQQSTAELSLA+KEGAKRVVGDCREG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG

A0A1S3AYZ8 uncharacterized protein LOC103484091 isoform X10.0e+0089.24Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW
        MAST PTCSPSSLQLRLALNC NCGKFPSV VRARVRKLDPRLR++C PIVHNGAK +RGNGLR +G+CFAGS+STADGFSGWSESDSQGEVLDLRRKKW
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKW

Query:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG
        FGGLVGIGITGFILVSGITFAAWSINKQNSSRQK QMEALS QQELLLDS+TG DRLGEDEKEDNSV+ADDET AG+ GNQEDSSSCTENEETLNKNRVG
Subjt:  FGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKNRVG

Query:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT
        D VDV ELAEN VESSSSNNDV+N ASLQEDFQSDSSLTVT+VAPGSLSS ISPESEFDSNVASC KDVNNCH G EVSTSEPEMN+LKDEPDN PNSN 
Subjt:  DSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPNSNT

Query:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT
        NSLNLKTDI+DE PDTGEN+D SSKKLPVYD+SSSNY SGNQDETLG VNEITDSSLQGFS +S DT KES LFD  TVA+S +GVLSPSK EQ  SED 
Subjt:  NSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVLSPSKTEQLLSEDT

Query:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE
        A +IEQQ+E GLSEAALVSITDYP ADDQE NHETI N                 VPAPLVSAAVKTLPGKVLVPAVVDQV GQALAALQVLKVIE+DVE
Subjt:  ASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKN----------------SVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVE

Query:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS
        PSDLCTRREYARWLVSASSALSRNTTSKVYPAMY+ENVTELAFDDITP+DPDFASIQGLAEAGLISSKLSR+DISSSLDEDQGPLYFSPES LSRQDLVS
Subjt:  PSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVS

Query:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA
        WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPA+VADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEEL RIEAESMAENA
Subjt:  WKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENA

Query:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI
        VAAHSALVAQVEKDINASFEKELSIEREKVEAVE+MAEEAKQELERLRSER RDS+ LM ERAS+ESEMEV SRLR+ELEEQLQGLMSNKVEVS+EKERI
Subjt:  VAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERI

Query:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME
        NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEK GIKVVVDSDLREQES GDTWLDSSKQF VEETTDRAENLME
Subjt:  NKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLME

Query:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG
        KLKRM+ EVRGKSRDVIEKIIQKIALLVSNLRQWISK+GE+AE+LKNVAIS+A+RSATELQQSTAELSLA+KEGAKRVVGDCREG
Subjt:  KLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG

A0A438E2N4 Cleavage and polyadenylation specificity factor subunit 20.0e+0053.78Show/hide
Query:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK
        VTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLT+  Q +   
Subjt:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK

Query:  ELRYLPRHLWEIKEAN----DMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAV
            LP    ++ + +    D +      +T+   S       KGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVDFNHRKER LNGT+LESFVRPAV
Subjt:  ELRYLPRHLWEIKEAN----DMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAV

Query:  LITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE--WV---------VLASMAS--------------------------
        LITDAYNALNNQP RRQ+D+EF D I KTLR +GNVLLPVDTAGRVLEL+ ILE  W           L  +AS                          
Subjt:  LITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILE--WV---------VLASMAS--------------------------

Query:  ------------------------------LEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-
                                      LEAG+SHDIFV+WA DAKNLVLFSERGQF TLARMLQADPPPKAVKVT+SKRVPL G+EL AYEEEQ R 
Subjt:  ------------------------------LEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-

Query:  KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQ
        KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  A  DV   H G +RDILIDGFVP STSVAPMFPFYEN+S WDDFGEVINP+DYVIKDEDMDQ
Subjt:  KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQ

Query:  AAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------------------------VLVHGT
        A M                          + G D++GKLDE AA+LI D  PSKV+SNELT                                 VLVHG+
Subjt:  AAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------------------------VLVHGT

Query:  AEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAA
        AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYK                 VQLSEKLMSNVLFKKLGDYE+AW+DAEVGKTE+G+LSLLPLS   
Subjt:  AEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAA

Query:  SPHKSVIVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQK---LRRLVEYGRDP-----------------------------------
          H +V VGD+KMADFKQFLASKGIQVEF+GGALRCGEYVTLRKV DASQK   +  ++E+ RD                                    
Subjt:  SPHKSVIVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQK---LRRLVEYGRDP-----------------------------------

Query:  -----------------AILRF----FMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGS
                         ++L F     MAS     SPSS QLR +  C+   + P+V VR  VRKLD ++RV+   I  +G     G G  R G  +  S
Subjt:  -----------------AILRF----FMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGS

Query:  KSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS-------MQQELLLDSDTGNDRLGEDE-KEDN
        +S  D  SGWS SD   +    ++K+W GG+VG G+ G +LV+G++FAA+S++KQN SR + QMEA++       +Q++  L+S TG D +     +ED 
Subjt:  KSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS-------MQQELLLDSDTGNDRLGEDE-KEDN

Query:  SVNA--DDETLAGRTGNQE----DSSSCTENEETLNKNRVGDSVDVG-----ELAEND-----VESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLS
        S  +  DD  +AG + +      +S    +  +TL+ N    S+ VG     +L EN      V SS  + D  N+ S  +     S  T  S      S
Subjt:  SVNA--DDETLAGRTGNQE----DSSSCTENEETLNKNRVGDSVDVG-----ELAEND-----VESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLS

Query:  SLISPESEFDSNVASCFKDVNNCHS----------GAEVSTSEPEMNILKDEPDNLPNSNTNSLN--LKTDIQDESPDTGENYDFSSKKLPVYDDSSSNY
        S   P +  + +     K +N   S          G +   SE E   L +   ++PN +   L+    + IQD   D GE  D    +  + + S    
Subjt:  SLISPESEFDSNVASCFKDVNNCHS----------GAEVSTSEPEMNILKDEPDNLPNSNTNSLN--LKTDIQDESPDTGENYDFSSKKLPVYDDSSSNY

Query:  NSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLF--DRETVAESSDGVLSPSKTEQLLSEDTASTIE------------------------------
           N      PV++     L+    I   + ++      DRE +         P+K+   L +  AS I+                              
Subjt:  NSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLF--DRETVAESSDGVLSPSKTEQLLSEDTASTIE------------------------------

Query:  --------QQVERGLSEAALVSITD--------------YPSADDQEKNHET-------------IKNS-----VPAP-LVSAAVKTLPGKVLVPAVVDQ
                +++E+ +   + V + D              YP   DQ+ N +               +NS     +PAP  VS ++K LPG+V+VPAVVDQ
Subjt:  --------QQVERGLSEAALVSITD--------------YPSADDQEKNHET-------------IKNS-----VPAP-LVSAAVKTLPGKVLVPAVVDQ

Query:  VHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRND-ISSSLD
        V GQALAALQVLKVIE DV+PSDLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFDDITPEDPDF+SIQGLAEAGLISSKLSR D +S S +
Subjt:  VHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRND-ISSSLD

Query:  EDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGE
        EDQ P YFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PALVAD S GEQGIIALAFGYTRLFQP+KPVTKAQAAIALATGE
Subjt:  EDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGE

Query:  ASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNEL
        +SDIVSEEL RIEAE+MAE AVA HSALV QVEK++NASFEKELS+ER+K++A+EK+AEEA+QELE+LR+ER+ D+I+L++ERA+IESEMEV SRLR+E+
Subjt:  ASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNEL

Query:  EEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTW
        EEQLQ  MSNKVE+S+EKERI+KLRKEAE ENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVD++LRE+ SA  TW
Subjt:  EEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTW

Query:  LDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVV
        LD++KQF+V+ T  RAENL++KL  M +++RGKS+DVI+ I+QKI  L+S LR+  SK G +  +LK+ A+ KA  S  ELQQ+TAE SLA+KEG KRVV
Subjt:  LDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVV

Query:  GDCREG
        GDCR G
Subjt:  GDCREG

A0A438IZ50 Cleavage and polyadenylation specificity factor subunit 20.0e+0053.19Show/hide
Query:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK
        VTPLCGVYNENPLSYLVS+DGFNFL+DCGWNDHFDP+ LQPL+RVASTIDAVL+SHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQ+++RK
Subjt:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK

Query:  E-------------------LRYLPRHLWEIKEAN------DMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDF
        +                   L  L   L   + ++      D +      +T+   S       KGEGIVIAPHVAGHLLGGT+WKITKDGEDVIYAVDF
Subjt:  E-------------------LRYLPRHLWEIKEAN------DMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDF

Query:  NHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI---------------------------
        NHRKER LNGT+LESFVRPAVLITDAYNALNNQP RRQ+D+EF D I KTLR +GNVLLPVDTAGRVLEL+ I                           
Subjt:  NHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI---------------------------

Query:  ----LEW---------------------------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVK
            LEW                                       +VLASMASLEAG+SHDIFV+WA DAKNLVLFSERGQF TLARMLQADPPPKAVK
Subjt:  ----LEW---------------------------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVK

Query:  VTVSKRVPLTGDELIAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENN
        VT+SKRVPL G+EL AYEEEQ R KKEEALKASL KE++ KAS G+DN  GDPM+ID ++  A  DV   H G +RDILIDGFVP STSVAPMFPFYEN+
Subjt:  VTVSKRVPLTGDELIAYEEEQNR-KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENN

Query:  SAWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------
        S WDDFGEVINP+DYVIKDEDMDQA M                          + G D++GKLDE AA+LI D  PSKV+SNELT               
Subjt:  SAWDDFGEVINPDDYVIKDEDMDQAAM--------------------------HAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------

Query:  ------------------VLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEI
                          VLVHG+AEATEHLKQHCLK+VCPHVYAPQI ETIDVTSDLCAYK                 VQLSEKLMSNVLFKKLGDYE+
Subjt:  ------------------VLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEI

Query:  AWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQKL------------RRLVEYGRDPAI
        AW+DAEVGKTE+G+LSLLPLS     H +V VGD+KMAD   F  +K  +     G   CGEYVTLRKV DASQK+            + ++  G++ AI
Subjt:  AWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQKL------------RRLVEYGRDPAI

Query:  LRF---------------------------FMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLC
          +                            MAS     SPSS QLR +  C+   + P+V VR  VRKLD ++RV+   I  +G     G G  R G  
Subjt:  LRF---------------------------FMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLC

Query:  FAGSKSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS-------MQQELLLDSDTGNDRLGEDE-
        +  S+S  D  SGWS SD   +    ++K+W GG+VG G+ G +LV+G++FAA+S++KQN SR + QMEA++       +Q++  L+S TG D +     
Subjt:  FAGSKSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALS-------MQQELLLDSDTGNDRLGEDE-

Query:  KEDNSVNA--DDETLAGRTGNQE----DSSSCTENEETLNKNRVGDSVDVG-----ELAEND-----VESSSSNNDVNNVASLQEDFQSDSSLTVTSVAP
        +ED S  +  DD  +AG + +      +S    +  +TL+ N    S+ VG     +L EN      V SS  + D  N+ S  +     S  T  S   
Subjt:  KEDNSVNA--DDETLAGRTGNQE----DSSSCTENEETLNKNRVGDSVDVG-----ELAEND-----VESSSSNNDVNNVASLQEDFQSDSSLTVTSVAP

Query:  GSLSSLISPESEFDSNVASCFKDVNNCHS----------GAEVSTSEPEMNILKDEPDNLPNSNTNSLN--LKTDIQDESPDTGENYDFSSKKLPVYDDS
           SS   P +  + +     K +N   S          G +   SE E   L +   ++PN +   L+    + IQD   D GE  D    +  + + S
Subjt:  GSLSSLISPESEFDSNVASCFKDVNNCHS----------GAEVSTSEPEMNILKDEPDNLPNSNTNSLN--LKTDIQDESPDTGENYDFSSKKLPVYDDS

Query:  SSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLF--DRETVAESSDGVLSPSKTEQLLSEDTASTIE--------------------------
               N      PV++     L+    I   + ++      DRE +         P+K+   L +  AS I+                          
Subjt:  SSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLF--DRETVAESSDGVLSPSKTEQLLSEDTASTIE--------------------------

Query:  ------------QQVERGLSEAALVSITD--------------YPSADDQEKNHET-------------IKNS-----VPAP-LVSAAVKTLPGKVLVPA
                    +++E+ +   + V + D              YP   DQ+ N +               +NS     +PAP  VS ++K LPG+V+VPA
Subjt:  ------------QQVERGLSEAALVSITD--------------YPSADDQEKNHET-------------IKNS-----VPAP-LVSAAVKTLPGKVLVPA

Query:  VVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRND-IS
        VVDQV GQALAALQVLKVIE DV+PSDLCTRRE+ARWLVSASS LSRNT SKVYPAMYI N+TELAFDDITPEDPDF+SIQGLAEAGLISSKLSR D +S
Subjt:  VVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRND-IS

Query:  SSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIAL
         S +EDQ P YFSP+SPLSRQDLVSWKMALEKRQLPE D+K+L+QVSGFID D I+PDA PAL+AD S GEQGIIALAFGYTRLFQP+KPVTKAQAAIAL
Subjt:  SSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIAL

Query:  ATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRL
        ATGE+SDIVSEEL RIEAE+MAE AVA HSALV QVEK++NASFEKELS+ER+K++A+EK+AEEA+QELE+LR+ER+ D+I+L++ERA+IESEMEV SRL
Subjt:  ATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRL

Query:  RNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESA
        R+E+EEQLQ  MSNKVE+S+EKERI+KLRKEAE ENQEI+RLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVD++LRE+ SA
Subjt:  RNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESA

Query:  GDTWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGA
          TWLD++KQF+V+ T  RAENL++KL  M +++RGKS+DVI+ I+QKI  L+S LR+  SK G +  +LK+ A+ KA  S  ELQQ+TAE SLA+KEG 
Subjt:  GDTWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGA

Query:  KRVVGDCREG
        KRVVGDCR G
Subjt:  KRVVGDCREG

A0A4Y7JAF6 Beta-Casp domain-containing protein0.0e+0052.56Show/hide
Query:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK
        VTPL GVYNENPLSYLVS+DGFN LIDCGWND FD +LL+PLS VAST+DAVL+SH DTLHLGALPYA + LGL+A VYST+PV+RLGLLTMYD + ++K
Subjt:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK

Query:  ELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
        ++       +++   +D+        T  Y  N      KGEGIVIAPHVAGHLLGGT+WKITKDGED+IYAVDFNHRKE HLNGT+LE+FVRPAVLITD
Subjt:  ELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI-------------------------------LEW------------------
        AYNALN+Q  R + +K F   I+ TL   GN+LLPVDTAGRVLEL+ +                               LEW                  
Subjt:  AYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI-------------------------------LEW------------------

Query:  ---------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-K
                             +VLASMASLE G SHDIF +WA D KNLVLF+ERGQFGTLARMLQADPPP+AVKVT+  RVPL G+EL+++EEEQNR K
Subjt:  ---------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-K

Query:  KEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQA
        KEEALKAS+ KEE  KAS+G+D    DPM+IDAS+  A  +V   HGGA+RDI IDGF+PSSTSVAPMFPFYEN+S WDDFGE+INPDDYV+K+EDMD  
Subjt:  KEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVA-PDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQA

Query:  AMHAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------------------------VLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET
        +MH GGD+DGKLDE AANLILD +PSK+VSNELT                                 VLVHG+AEATEHLKQHCLKNVCPHVYAPQ+ ET
Subjt:  AMHAGGDVDGKLDETAANLILDMKPSKVVSNELT---------------------------------VLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET

Query:  IDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQV
        IDVTSDLCAYK                 VQLSEKLMS VLFKKLGD+E+AW+D+ V +TE   LSL P+S A  PHKSV VGDLK+ADFKQFLASKG+QV
Subjt:  IDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQV

Query:  EFAGGALRCGEYVTLRKVTDASQKLRRLVEYGRDPAILRFFMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIER
        EF+GGALRCGEYVTLRKV DA+QK                                         G    V++                      +K E+
Subjt:  EFAGGALRCGEYVTLRKVTDASQKLRRLVEYGRDPAILRFFMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIER

Query:  GNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNS--SRQKPQMEALSMQQELLLDSDTGNDRL
          G        +  +++  G+ G  +S   G+  + +R   +GG V  G+ G +L +G+ FA+  +NK+N+   +Q+ QME ++ Q ELLL SD   D++
Subjt:  GNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNS--SRQKPQMEALSMQQELLLDSDTGNDRL

Query:  GEDEKEDNSVNADD-----ETLAGRTGNQEDSSSCTENEETLNKNRVGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLI
         +   +++ V  D+       L  +TG  + SS+  E  + +++++  DS          VE +S++ D N+ +S QED +S S     SV   +  S  
Subjt:  GEDEKEDNSVNADD-----ETLAGRTGNQEDSSSCTENEETLNKNRVGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLI

Query:  SPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPN---SNTNSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPV
        S  SE+  +        +   +G+    +EP     K+ P NL +     +NS NL+TD Q+    T ++    + +LP+  D SS ++     + L   
Subjt:  SPESEFDSNVASCFKDVNNCHSGAEVSTSEPEMNILKDEPDNLPN---SNTNSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPV

Query:  NEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGV-LSPSKTEQLLSEDTASTIE---QQVERGLSEAALVSITDYPSADDQEKNHETI---------
        + +   S+Q    +  D   ++ +   ETV+ +S G  L+ S+T Q   E+ +   +    + E  L+    +    +P   D     E           
Subjt:  NEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGV-LSPSKTEQLLSEDTASTIE---QQVERGLSEAALVSITDYPSADDQEKNHETI---------

Query:  KNS-----VPAP-LVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDI
        +NS     +PAP LVSAA++  PGKVLVPAVVDQV GQALAALQVLKVIE DV+  DLCTRR+YARWLVS+S+ LSRN+ SKVYPAMYIENVT+LAFDDI
Subjt:  KNS-----VPAP-LVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDI

Query:  TPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGE
         PEDPDFASIQGLAEAGLI+SKLSR D+  S D +Q P  FSPESPLSRQDLVSWKMAL+KRQLPE +RKM++Q+ GFID DKI+PDA PALVAD+S G+
Subjt:  TPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGE

Query:  QGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELER
        QGIIALAFGYTRLFQPDKPVT AQAAIALATG+A+DIVSEEL RIEAESMAE AVAAHSALVA+VEKD+NA+F KEL++E+EKV+A+EK AEEAK ELE+
Subjt:  QGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELER

Query:  LRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQA
        LR+ERE     L R RA++ESEMEV +RLR ELEEQLQ L+S++ E++FE+ERINKLR+EAE ENQ I++LQY+LEVERKALSMAR+WAE+EAKRARE  
Subjt:  LRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQA

Query:  KALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLK
        KALEEARDRW+KHGI+VVVDSDL++ ESAG TW ++ K+  ++ET +RAE L++KLK M+  ++GKS+ VIE+II KI  ++  L++  S + ++ ++ +
Subjt:  KALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLK

Query:  NVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG
        ++A SKAS S  ELQ S    S A+K+GA R+ G+C+EG
Subjt:  NVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG

SwissProt top hitse value%identityAlignment
Q5WA76 U-box domain-containing protein 702.4e-10946.04Show/hide
Query:  AEFKAKSLESSCMEEVKKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFAAVDLLVSFKEKRDKLQTENEGAM
        +E  A++LE   + +V++RKE EE L R ++E E++  + DE+  ELQ V EQ   L  + S+ R   E L                + D+L  E + A+
Subjt:  AEFKAKSLESSCMEEVKKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFAAVDLLVSFKEKRDKLQTENEGAM

Query:  DKLRKLKNIVKREPSHYRSAEMPTFSFMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTFQYEVEVLSRVRHPNLVSIIGACP
         ++ +L+    +  S   +A    FS  E+  AT NF  S KIGEG  G VYKG+LR++ VA+K+        QS F+ EV +LSRVRHP+LV+++GAC 
Subjt:  DKLRKLKNIVKREPSHYRSAEMPTFSFMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTFQYEVEVLSRVRHPNLVSIIGACP

Query:  ESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRIAADICSALIFLHYSDP-CIVHGDIKPSKILLDANFIAKLGGLGISRLIPQEEKAFNSASVCDML
        ES  +VYE L NGSLED L C +  + L WQ RIRI A+ICSALIFLH + P  +VHGD+KP+ ILL  N ++KL   GISRL+ Q     N+     M 
Subjt:  ESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRIAADICSALIFLHYSDP-CIVHGDIKPSKILLDANFIAKLGGLGISRLIPQEEKAFNSASVCDML

Query:  KENNP-YIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVRCALENDEIGVILDSSAGDWPHDLAEQLALVALRCCEKEKLDRPDLVSELWGV
            P Y+DPE+L TG  TP+SDVYS G+++LR+LTG+ P GI   V  A+E  ++  ++D+S G+WPH   EQLA +ALRC E  +  RPDL  E+W +
Subjt:  KENNP-YIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVRCALENDEIGVILDSSAGDWPHDLAEQLALVALRCCEKEKLDRPDLVSELWGV

Query:  LEPMRSIASASCSSLKK---HSRVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSGHNTSPMTNLKLEHCDLVPNYALLNAIQEWQHQ
        +E +R  A +S SS +     +  P++F CPI Q+IM DP IAADGFTYEA+AIR W  +GH+TSPMTNL LEH +L+PN AL +AIQEW  Q
Subjt:  LEPMRSIASASCSSLKK---HSRVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSGHNTSPMTNLKLEHCDLVPNYALLNAIQEWQHQ

Q652P4 Cleavage and polyadenylation specificity factor subunit 27.4e-22058.15Show/hide
Query:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK
        VTPL G Y E PL YL++VDGF FL+DCGW D  DP+ LQPL++VA TIDAVL+SH DT+HLGALPYAMK LGLSAPVY+TEPV+RLG+LT+YD FI+R+
Subjt:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK

Query:  ELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
        ++       +++   +D+     + +   Y  N   +  KGEGIVIAPHVAGH LGGT+WKITKDGEDV+YAVDFNHRKERHLNGT L SFVRPAVLITD
Subjt:  ELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI-------------------------------LEW------------------
        AYNALNN  Y+RQ+D++F D + K L   G+VLLP+DTAGRVLE++ I                               LEW                  
Subjt:  AYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI-------------------------------LEW------------------

Query:  ---------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-K
                             VVLASMASLE G+SHDIFVD A +AKNLVLF+E+GQFGTLARMLQ DPPPKAVKVT+SKR+PL GDEL AYEEEQ R K
Subjt:  ---------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-K

Query:  KEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVAPDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQAA
        KEEALKASL KEE+ KAS G++    DPM+IDAS++  P   GS  G   DILIDGFVP S+SVAPMFPF+EN S WDDFGEVINP+DY++K E+MD   
Subjt:  KEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVAPDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQAA

Query:  MHAGGD-VDGKLDETAANLILDMKPSKVVSNELT---------------------------------VLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET
        M   GD +D  LDE +A L+LD  PSKV+SNE+T                                 VLVHG+AEATEHLK HC KN   HVYAPQIEET
Subjt:  MHAGGD-VDGKLDETAANLILDMKPSKVVSNELT---------------------------------VLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET

Query:  IDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQV
        IDVTSDLCAYK                 VQLSEKLMSNV+ KKLG++EIAW+DAEVGKT++  L+LLP S   + HKSV+VGDLK+ADFKQFLA+KG+QV
Subjt:  IDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQV

Query:  EFAGGALRCGEYVTLRKVTDASQK
        EFAGGALRCGEY+TLRK+ DA QK
Subjt:  EFAGGALRCGEYVTLRKVTDASQK

Q8GUH1 U-box domain-containing protein 331.8e-11746.71Show/hide
Query:  EEDGAGAEFKAKSLESSCMEEVKKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFAAVDLLVSFKEKRDKLQT
        E++   A  +AK  ES+  EE+K+RK+ E  + +EK+    +  E + +++ELQ    QK+ LE + ++    +E+L +K+  AV LL   +++R++LQT
Subjt:  EEDGAGAEFKAKSLESSCMEEVKKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFAAVDLLVSFKEKRDKLQT

Query:  ENEGAMDKLRKLKNIVKREPSHYRSAEMPT-FSFMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTFQYEVEVLSRVRHPNLV
        E + A+ +  +L++    E S  +  +  T FSF EI EAT +FD + KIGEG +GS+Y GLLRH  VA+KM     S     +Q EV+VLS++RHPN++
Subjt:  ENEGAMDKLRKLKNIVKREPSHYRSAEMPT-FSFMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTFQYEVEVLSRVRHPNLV

Query:  SIIGACPESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRIAADICSALIFLHYSDP-CIVHGDIKPSKILLDANFIAKLGGLGISRLIPQEEKAFNS
        ++IGACPE   +VYE L  GSLED L CK+++ PL WQ R+RIA +IC+AL+FLH +    +VHGD+KP+ ILLD+N ++KL   G   L+        S
Subjt:  SIIGACPESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRIAADICSALIFLHYSDP-CIVHGDIKPSKILLDANFIAKLGGLGISRLIPQEEKAFNS

Query:  ASVCDMLKENNPYIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVRCALENDEIGVILDSSAGDWPHDLAEQLALVALRCCEKEKLDRPDLV
         SV   +     Y+DPE   +G  TP+SDVYS G+ILLR+LTGR    I  +V+ AL+N  +  +LD  AGDWP   AEQLA +ALRCCE    +RPDL 
Subjt:  ASVCDMLKENNPYIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVRCALENDEIGVILDSSAGDWPHDLAEQLALVALRCCEKEKLDRPDLV

Query:  SELWGVLEPMRSIASASCS---SLKKHSRVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSGHNTSPMTNLKLEHCDLVPNYALLNAIQEW-QHQ
        +E+W VLEPMR+ +  S S      +H   P +F CPIFQE+M+DP +AADGFTYEA+AIR W  S H+TSPMTN+KL H  L+ N+AL +AIQEW QH 
Subjt:  SELWGVLEPMRSIASASCS---SLKKHSRVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSGHNTSPMTNLKLEHCDLVPNYALLNAIQEW-QHQ

Query:  L
        L
Subjt:  L

Q8GUH1 U-box domain-containing protein 338.7e-1134.82Show/hide
Query:  VEDTIFVAVGTDVDDAKATLIWAVHNFAGKSFCLLHVHHLSPKKEEP------SSYTLKEHVAKAYEEPKSQKPFELLNQYVLILAKLGVQAQKVWTETN
        +++ IFVAV   V  +K+TLIWA+ N  GK  CL+HVH   P +  P          +KE   + + E + +K   +L+ Y+ I  + GV+A+K++ E  
Subjt:  VEDTIFVAVGTDVDDAKATLIWAVHNFAGKSFCLLHVHHLSPKKEEP------SSYTLKEHVAKAYEEPKSQKPFELLNQYVLILAKLGVQAQKVWTETN

Query:  NVERGIVEIIAQ
        ++E GIV++I++
Subjt:  NVERGIVEIIAQ

Q94A51 U-box domain-containing protein 322.1e-14237.87Show/hide
Query:  GQQLEFDVEDTIFVAVGTDVDDAKATLIWAVHNFAGKSFCLLHVRLGNPSRRESSVANSGTASSNTKQ--ISIVHFLQMGTAEEIGQQL-----EFDVED
        G++L  DV++TIFVAV  DV+ +K T++WA  NF+GK  CLL+V   + + R +S  +      + K+  + ++  ++    +E+         E +++ 
Subjt:  GQQLEFDVEDTIFVAVGTDVDDAKATLIWAVHNFAGKSFCLLHVRLGNPSRRESSVANSGTASSNTKQ--ISIVHFLQMGTAEEIGQQL-----EFDVED

Query:  TIFVAVGTDVDDAKATLI------WAVHNFAGKSFCLLHVHHLSPKK-----------------------------EEPSSYTLKEHVAKAYEEPKSQKP
              G ++++    LI      W V   A        +  L  KK                             ++ ++      + +   + +++K 
Subjt:  TIFVAVGTDVDDAKATLI------WAVHNFAGKSFCLLHVHHLSPKK-----------------------------EEPSSYTLKEHVAKAYEEPKSQKP

Query:  FELLNQYVLILAKL--GVQAQKVWTETNNVERGIVEIIAQYNAQGKENACDRGVKGGHPVSQSGLQEEASMNVKSIKD----------------------
         +L + Y+    +    +  Q    +T  +ER  VE  A        ++      G  PVS   L +  ++N  ++++                      
Subjt:  FELLNQYVLILAKL--GVQAQKVWTETNNVERGIVEIIAQYNAQGKENACDRGVKGGHPVSQSGLQEEASMNVKSIKD----------------------

Query:  -------FEGVKAWVEEDGAGAE--FKAKSLESSCMEEVKKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFA
                E  K W EED +  E   KAK+LE  C++E  +RK +EELLE+EK EV+ + ++++  +KELQ VQ +   LE +  + +   +E  +K   
Subjt:  -------FEGVKAWVEEDGAGAE--FKAKSLESSCMEEVKKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFA

Query:  AVDLLVSFKEKRDKLQTENEGAMDKLRKLKNIVKREPSHYRSAEMPTFSFMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTF
        A++LL SF++KRD+++ ++E A+ ++  L+ +VK E      +EM  +SFMEI EAT  FDPSWK+GEG++GSVYKG L+HL VA+KM  SYGS +   F
Subjt:  AVDLLVSFKEKRDKLQTENEGAMDKLRKLKNIVKREPSHYRSAEMPTFSFMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTF

Query:  QYEVEVLSRVRHPNLVSIIGACPESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRIAADICSALIFLHYSDPCIVHGDIKPSKILLDANFIAKLGGL
        +  VE+LSRVRHPNLV+++GACPESR ++Y+ + NGSLED  + +N+   L W+ RIRIA++ICSAL+FLH + PCI+HG++KPSKILLD+N + K+   
Subjt:  QYEVEVLSRVRHPNLVSIIGACPESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRIAADICSALIFLHYSDPCIVHGDIKPSKILLDANFIAKLGGL

Query:  GISRLIPQEEKAFNSASVCDMLKENNPYIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVRCALENDEIGVILDSSAGDWPHDLAEQLALVA
        GIS+LIP            D L +++P++DP Y  +   T ESD+Y+ G+ILL++LT R  SGI++DV+CALEND I  +LD+SAGDWP    ++LA VA
Subjt:  GISRLIPQEEKAFNSASVCDMLKENNPYIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVRCALENDEIGVILDSSAGDWPHDLAEQLALVA

Query:  LRCCEKEKLDRPDLVSELWGVLEPMRS--IASASCSSLKKHS---RVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSGHNTSPMTNLKLEHCDL
        +RCC+K  ++RPDL   L   ++ M++  + S+  SS    +   R P+H+ CPIFQE+MKDPLIAADGFTYEA+AIR W  +GH+TSPMTNLK+E C+L
Subjt:  LRCCEKEKLDRPDLVSELWGVLEPMRS--IASASCSSLKKHS---RVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSGHNTSPMTNLKLEHCDL

Query:  VPNYALLNAIQEWQHQ
        +PN+AL  AIQ+WQ+Q
Subjt:  VPNYALLNAIQEWQHQ

Q9LKF9 Cleavage and polyadenylation specificity factor subunit 21.4e-24262.48Show/hide
Query:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK
        VTPLCGVYNENPLSYLVS+DGFNFLIDCGWND FD +LL+PLSRVASTIDAVL+SHPDTLH+GALPYAMKQLGLSAPVY+TEPV+RLGLLTMYDQF++RK
Subjt:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK

Query:  ELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
        ++       +++   +D+     + I   Y  N      KGEGIVIAPHVAGH+LGG++W+ITKDGEDVIYAVD+NHRKERHLNGT+L+SFVRPAVLITD
Subjt:  ELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNAL-NNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI-------------------------------LEW-----------------
        AY+AL  NQ  R+Q+DKEF DTI K L   GNVLLPVDTAGRVLEL+ I                               LEW                 
Subjt:  AYNAL-NNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI-------------------------------LEW-----------------

Query:  ----------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-
                              VVLASMASLEAG++ +IFV+WA D +NLVLF+E GQFGTLARMLQ+ PPPK VKVT+SKRVPL G+ELIAYEEEQNR 
Subjt:  ----------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-

Query:  KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVAPDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQA
        K+EEAL+ASL+KEE++KASHG+D+++ +PMIID  +    DV GSHG AY+DILIDGFVP S+SVAPMFP+Y+N S WDDFGE+INPDDYVIKDEDMD+ 
Subjt:  KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVAPDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQA

Query:  AMHAGGDVDGKLDETAANLILDMKPSKVVSNEL---------------------------------TVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET
        AMH GGDVDG+LDE  A+L+LD +PSKV+SNEL                                  VLVH  AEATEHLKQHCL N+CPHVYAPQIEET
Subjt:  AMHAGGDVDGKLDETAANLILDMKPSKVVSNEL---------------------------------TVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET

Query:  IDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQV
        +DVTSDLCAYK                 VQLSEKLMSNV+FKKLGD E+AW+D+EVGKTE    SLLP+  AASPHK V+VGDLK+ADFKQFL+SKG+QV
Subjt:  IDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQV

Query:  EFA-GGALRCGEYVTLRKVTDASQK
        EFA GGALRCGEYVTLRKV    QK
Subjt:  EFA-GGALRCGEYVTLRKVTDASQK

Arabidopsis top hitse value%identityAlignment
AT3G49060.1 U-box domain-containing protein kinase family protein1.5e-14337.87Show/hide
Query:  GQQLEFDVEDTIFVAVGTDVDDAKATLIWAVHNFAGKSFCLLHVRLGNPSRRESSVANSGTASSNTKQ--ISIVHFLQMGTAEEIGQQL-----EFDVED
        G++L  DV++TIFVAV  DV+ +K T++WA  NF+GK  CLL+V   + + R +S  +      + K+  + ++  ++    +E+         E +++ 
Subjt:  GQQLEFDVEDTIFVAVGTDVDDAKATLIWAVHNFAGKSFCLLHVRLGNPSRRESSVANSGTASSNTKQ--ISIVHFLQMGTAEEIGQQL-----EFDVED

Query:  TIFVAVGTDVDDAKATLI------WAVHNFAGKSFCLLHVHHLSPKK-----------------------------EEPSSYTLKEHVAKAYEEPKSQKP
              G ++++    LI      W V   A        +  L  KK                             ++ ++      + +   + +++K 
Subjt:  TIFVAVGTDVDDAKATLI------WAVHNFAGKSFCLLHVHHLSPKK-----------------------------EEPSSYTLKEHVAKAYEEPKSQKP

Query:  FELLNQYVLILAKL--GVQAQKVWTETNNVERGIVEIIAQYNAQGKENACDRGVKGGHPVSQSGLQEEASMNVKSIKD----------------------
         +L + Y+    +    +  Q    +T  +ER  VE  A        ++      G  PVS   L +  ++N  ++++                      
Subjt:  FELLNQYVLILAKL--GVQAQKVWTETNNVERGIVEIIAQYNAQGKENACDRGVKGGHPVSQSGLQEEASMNVKSIKD----------------------

Query:  -------FEGVKAWVEEDGAGAE--FKAKSLESSCMEEVKKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFA
                E  K W EED +  E   KAK+LE  C++E  +RK +EELLE+EK EV+ + ++++  +KELQ VQ +   LE +  + +   +E  +K   
Subjt:  -------FEGVKAWVEEDGAGAE--FKAKSLESSCMEEVKKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFA

Query:  AVDLLVSFKEKRDKLQTENEGAMDKLRKLKNIVKREPSHYRSAEMPTFSFMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTF
        A++LL SF++KRD+++ ++E A+ ++  L+ +VK E      +EM  +SFMEI EAT  FDPSWK+GEG++GSVYKG L+HL VA+KM  SYGS +   F
Subjt:  AVDLLVSFKEKRDKLQTENEGAMDKLRKLKNIVKREPSHYRSAEMPTFSFMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTF

Query:  QYEVEVLSRVRHPNLVSIIGACPESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRIAADICSALIFLHYSDPCIVHGDIKPSKILLDANFIAKLGGL
        +  VE+LSRVRHPNLV+++GACPESR ++Y+ + NGSLED  + +N+   L W+ RIRIA++ICSAL+FLH + PCI+HG++KPSKILLD+N + K+   
Subjt:  QYEVEVLSRVRHPNLVSIIGACPESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRIAADICSALIFLHYSDPCIVHGDIKPSKILLDANFIAKLGGL

Query:  GISRLIPQEEKAFNSASVCDMLKENNPYIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVRCALENDEIGVILDSSAGDWPHDLAEQLALVA
        GIS+LIP            D L +++P++DP Y  +   T ESD+Y+ G+ILL++LT R  SGI++DV+CALEND I  +LD+SAGDWP    ++LA VA
Subjt:  GISRLIPQEEKAFNSASVCDMLKENNPYIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVRCALENDEIGVILDSSAGDWPHDLAEQLALVA

Query:  LRCCEKEKLDRPDLVSELWGVLEPMRS--IASASCSSLKKHS---RVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSGHNTSPMTNLKLEHCDL
        +RCC+K  ++RPDL   L   ++ M++  + S+  SS    +   R P+H+ CPIFQE+MKDPLIAADGFTYEA+AIR W  +GH+TSPMTNLK+E C+L
Subjt:  LRCCEKEKLDRPDLVSELWGVLEPMRS--IASASCSSLKKHS---RVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSGHNTSPMTNLKLEHCDL

Query:  VPNYALLNAIQEWQHQ
        +PN+AL  AIQ+WQ+Q
Subjt:  VPNYALLNAIQEWQHQ

AT5G23880.1 cleavage and polyadenylation specificity factor 1009.9e-24462.48Show/hide
Query:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK
        VTPLCGVYNENPLSYLVS+DGFNFLIDCGWND FD +LL+PLSRVASTIDAVL+SHPDTLH+GALPYAMKQLGLSAPVY+TEPV+RLGLLTMYDQF++RK
Subjt:  VTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARK

Query:  ELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD
        ++       +++   +D+     + I   Y  N      KGEGIVIAPHVAGH+LGG++W+ITKDGEDVIYAVD+NHRKERHLNGT+L+SFVRPAVLITD
Subjt:  ELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITD

Query:  AYNAL-NNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI-------------------------------LEW-----------------
        AY+AL  NQ  R+Q+DKEF DTI K L   GNVLLPVDTAGRVLEL+ I                               LEW                 
Subjt:  AYNAL-NNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQI-------------------------------LEW-----------------

Query:  ----------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-
                              VVLASMASLEAG++ +IFV+WA D +NLVLF+E GQFGTLARMLQ+ PPPK VKVT+SKRVPL G+ELIAYEEEQNR 
Subjt:  ----------------------VVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-

Query:  KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVAPDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQA
        K+EEAL+ASL+KEE++KASHG+D+++ +PMIID  +    DV GSHG AY+DILIDGFVP S+SVAPMFP+Y+N S WDDFGE+INPDDYVIKDEDMD+ 
Subjt:  KKEEALKASLLKEEQSKASHGTDNDTGDPMIIDASSNVAPDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQA

Query:  AMHAGGDVDGKLDETAANLILDMKPSKVVSNEL---------------------------------TVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET
        AMH GGDVDG+LDE  A+L+LD +PSKV+SNEL                                  VLVH  AEATEHLKQHCL N+CPHVYAPQIEET
Subjt:  AMHAGGDVDGKLDETAANLILDMKPSKVVSNEL---------------------------------TVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEET

Query:  IDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQV
        +DVTSDLCAYK                 VQLSEKLMSNV+FKKLGD E+AW+D+EVGKTE    SLLP+  AASPHK V+VGDLK+ADFKQFL+SKG+QV
Subjt:  IDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMADFKQFLASKGIQV

Query:  EFA-GGALRCGEYVTLRKVTDASQK
        EFA GGALRCGEYVTLRKV    QK
Subjt:  EFA-GGALRCGEYVTLRKVTDASQK

AT5G23890.1 LOCATED IN: mitochondrion, chloroplast thylakoid membrane, chloroplast, plastid, chloroplast envelope5.1e-19246.28Show/hide
Query:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLD-LRRKK
        MAS   T +P+SLQLRLAL+     K P+V +R           ++C   V    +++   G          SKS+AD  +GW +SD+  +    +++K 
Subjt:  MASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKIERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLD-LRRKK

Query:  WFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDS--DTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKN
           G+VG G+ G IL  G+++AA S +K+    +K +M +L+ QQE ++ S  +  +D +     E++++  +D+++      Q+      E ++ L K 
Subjt:  WFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDS--DTGNDRLGEDEKEDNSVNADDETLAGRTGNQEDSSSCTENEETLNKN

Query:  RVGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCH-SGAEVSTSEPEMNILKDEPDNLP
            S D     E D   S   N     A L  + ++D     +        SL+      DS+      D  + +  G E + SE   ++L  EP N+ 
Subjt:  RVGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCH-SGAEVSTSEPEMNILKDEPDNLP

Query:  NSNTNSLNLKTD---------------IQDESPDTGENYDFSSKK--LPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETV
        +   +  + K D                  E P+     D +SK   +P+ D  ++   +    E  G       S     S I  DTTKE     +  V
Subjt:  NSNTNSLNLKTD---------------IQDESPDTGENYDFSSKK--LPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETV

Query:  AESSDGVLSPSKTE-QLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKV
         ES+DG    SK E  + S+D        +E     +A  S                    +PAP +S  V   PGK+LVP   DQ+  QA AALQVLKV
Subjt:  AESSDGVLSPSKTE-QLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKV

Query:  IEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLS
        IE D +PSDLCTRREYARWL+SASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDF+SIQGLAEAGLI+SKLS  D+   LD+ +G   FSPES LS
Subjt:  IEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLS

Query:  RQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAE
        RQDL+SWKMALEKRQLPEAD+KML+++SGFID DKI+PDA P+++ADLS GEQGI ALAFG TRLFQP KPVTK QAAIAL++GEASDIVSEEL RIEAE
Subjt:  RQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAE

Query:  SMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVS
        SMAE AV+AH+ALVA+VEKD+NASFEKELS+EREK+EAVEKMAE AK ELE+LR +RE +++ L++ERA++ESEMEV SRLR + EE+L+ LMSNK E++
Subjt:  SMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVS

Query:  FEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLRE---QESAGDTWLDSSKQFAVEET
        FEKER+  LRKEAE E+Q IS+LQYELEVERKALSMAR+WAE+EAK+AREQ +ALEEAR RWE +G++VVVD DL+E   +E+     L+  ++ +VEET
Subjt:  FEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLRE---QESAGDTWLDSSKQFAVEET

Query:  TDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG
          RA+ LM+KLK M+  V GKSR+VI  +++KI L ++ L+++    G++A ++++ AI +A  +A +++Q T ++S    +  K++  +CR+G
Subjt:  TDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKASRSATELQQSTAELSLALKEGAKRVVGDCREG

AT5G52410.1 CONTAINS InterPro DOMAIN/s: S-layer homology domain (InterPro:IPR001119)5.7e-15967.74Show/hide
Query:  AALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLY
        AALQ LKVIE+D  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS N++ SS   +   + 
Subjt:  AALQVLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLY

Query:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE
        FSPESPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GE GI AL+FG TRLFQP K VTKAQ A++LA G+A ++V E
Subjt:  FSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSE

Query:  ELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGL
        EL RIEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E V+AVEK+AEEAK EL RLR E+E +++ L RER SIE+EME  +R+RNELEEQLQ L
Subjt:  ELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGL

Query:  MSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGD-TWLDSSKQ
         SNK E+S+EKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWEK+G+KV+VDSDL EQ +  + TWL++ KQ
Subjt:  MSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGD-TWLDSSKQ

Query:  FAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKA
          VE T  RA NL+ KLK+M+ +V  KSR+VI  II+KI+LL+S L+Q +     KA+DLK    SKA
Subjt:  FAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKA

AT5G52410.2 INVOLVED IN: biological_process unknown1.2e-16460.46Show/hide
Query:  DRETVAESSDGVLSPSKTEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQ
        D + ++     V S S + +  +E+         ER  S   L  +  + S    EK        +PAP     V +L  K + P VVD V  Q  AALQ
Subjt:  DRETVAESSDGVLSPSKTEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQ

Query:  VLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPE
         LKVIE+D  P DLCTRRE+ARW+VSAS+ LSRN+ SKVYPAMYIENVTELAFDDITPEDPDF  IQGLAEAGLISSKLS N++ SS   +   + FSPE
Subjt:  VLKVIEADVEPSDLCTRREYARWLVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPE

Query:  SPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTR
        SPL+RQDL+SWKMALE RQLPEAD K L+Q+SGF+D DKI+P+A PAL+ADLS GE GI AL+FG TRLFQP K VTKAQ A++LA G+A ++V EEL R
Subjt:  SPLSRQDLVSWKMALEKRQLPEADRKMLHQVSGFIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTR

Query:  IEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNK
        IEAE+MAEN V AH+ LVAQVEKDINASFEKEL  E+E V+AVEK+AEEAK EL RLR E+E +++ L RER SIE+EME  +R+RNELEEQLQ L SNK
Subjt:  IEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAVEKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNK

Query:  VEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGD-TWLDSSKQFAVE
         E+S+EKER ++L+K+ E ENQEI RLQ ELEVER ALS+AR WA+DEA+RAREQAK LEEAR RWEK+G+KV+VDSDL EQ +  + TWL++ KQ  VE
Subjt:  VEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAREQAKALEEARDRWEKHGIKVVVDSDLREQESAGD-TWLDSSKQFAVE

Query:  ETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKA
         T  RA NL+ KLK+M+ +V  KSR+VI  II+KI+LL+S L+Q +     KA+DLK    SKA
Subjt:  ETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTTCAGAGTGGCTAAGAGGGAAGGTCCTGAGTTCGAAGAGCTTCAAAGTCCAAGCCATGTACGTCTCCCTAATGTTAAAATATGTGCAATGCTCCTAGTTTGTCT
CTTTGTATATGATATATTCTGGGTCTTCTTCTCTGAGAGATTCTTTGGGGCAAATGTAATGGTATCTGTAGCAACTCAGCAAGCATCAAATCCTGTTCACACAGTTGCTA
ATAGTCTGAGTCTTCCTGGGCTGCAATTGATAACTAAGAAGCTGGAGTTACCTGTCAAGATAGTTTTTCCAAGGAACTTACTTGGTGGAGTCATTCCAGGAAAAAATGCC
ACTGATTTCATGATGCTTGGTCTTGGTGATATGGCAATTCCTGCCATGTTTCTAGCTCTAGTTCTTTGTTTTGACCATCGGAAAAGTAGGGATACAGTCAATCTCTTAGA
TATCCACACAAGGGGCCACAAGTACATTTGGTATGCCCTGCCTGGATATGCCTTCGGGCTGGTGACCGCTCTAACAGCTGGCGTTTTGACTCACTCACCTCAACCTGCTC
TGTTGTATCTGGTGACACCTCTTTGCGGGGTGTACAATGAAAATCCTTTATCCTATTTGGTCTCAGTTGACGGTTTCAACTTCCTTATTGACTGTGGTTGGAACGACCAC
TTCGATCCTGCTCTTCTTCAACCTTTATCCAGGGTGGCATCGACGATTGATGCAGTTTTGATATCACATCCTGATACACTTCACCTCGGTGCTCTTCCTTATGCCATGAA
ACAACTTGGACTTTCTGCTCCAGTTTATTCCACCGAACCCGTGTACCGATTGGGTCTTCTTACAATGTATGATCAGTTTATCGCGAGGAAGGAGTTAAGATATCTGCCAC
GCCACCTATGGGAAATTAAAGAAGCCAACGACATGCTTCCAAAGATGCTACATTTTATAACAAAAGCTTATAAAAGCAATGGAAAACCAGATAAAAGCAAAGGAGAGGGA
ATAGTTATTGCACCTCATGTGGCTGGGCATCTATTGGGGGGAACTCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGACTTCAACCACCGCAAGGA
AAGGCATCTGAATGGAACCATTCTAGAGTCATTCGTGCGACCTGCTGTATTGATAACTGATGCTTATAATGCTCTAAACAATCAGCCTTACAGACGCCAAAAGGACAAAG
AATTTGGAGATACTATTCAGAAGACCTTAAGAGCTAATGGAAATGTCTTACTTCCTGTTGATACTGCTGGGCGAGTGTTGGAGCTTATTCAAATTTTAGAATGGGTTGTT
CTAGCATCAATGGCTAGTTTGGAAGCTGGCTACTCACATGACATATTTGTTGATTGGGCAATGGATGCCAAAAACCTCGTCCTTTTTTCTGAAAGAGGCCAGTTTGGCAC
TTTGGCCCGCATGCTTCAAGCTGACCCACCTCCCAAAGCAGTTAAGGTAACTGTGTCTAAGAGAGTCCCTTTGACTGGAGATGAGCTTATTGCTTATGAAGAAGAGCAAA
ACAGGAAAAAGGAAGAAGCTCTTAAAGCTAGTTTGCTTAAGGAGGAACAATCTAAAGCATCTCATGGAACTGATAATGATACTGGCGATCCAATGATCATTGATGCTAGC
AGTAATGTAGCACCAGACGTAGGGGGTTCACATGGAGGTGCATACAGAGACATATTGATTGATGGTTTTGTTCCTTCTTCAACCAGCGTTGCTCCAATGTTTCCCTTTTA
TGAAAATAACTCCGCATGGGATGATTTTGGCGAAGTAATCAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAAGCAGCGATGCATGCTGGTGGGGACGTGG
ATGGGAAACTAGATGAAACTGCCGCTAACTTGATTCTGGATATGAAGCCTTCGAAAGTTGTATCTAATGAATTAACAGTTTTGGTGCATGGAACCGCAGAGGCCACTGAG
CATCTTAAGCAGCATTGCCTTAAAAATGTCTGTCCCCATGTCTATGCCCCCCAAATTGAAGAAACGATCGATGTTACTTCTGATTTATGTGCATATAAGGCTATCGTTTC
ATATACTGAGTCGAAGAAATCTCCAATTCTTGAATTTCAGGTTCAACTTTCAGAAAAGCTGATGAGTAATGTGCTGTTTAAGAAGCTAGGTGATTATGAAATCGCTTGGC
TTGATGCTGAAGTAGGGAAGACCGAGAATGGAACGCTGTCTTTACTTCCCCTCTCAAAGGCTGCTTCGCCTCATAAATCTGTTATTGTTGGTGATTTAAAAATGGCTGAC
TTCAAACAATTTCTTGCCAGCAAGGGAATACAGGTTGAATTTGCTGGGGGTGCTTTGAGATGTGGCGAGTATGTCACCCTACGCAAGGTTACAGATGCAAGTCAAAAGTT
GAGACGCCTCGTTGAATATGGAAGAGATCCAGCAATTTTAAGATTTTTCATGGCTTCCACTCCTCCCACGTGTTCACCCAGCTCTCTCCAGCTTCGCCTCGCTCTCAATT
GCAAGAATTGCGGCAAATTCCCTTCAGTTCTTGTTCGGGCGAGAGTGAGGAAATTGGATCCTCGACTCCGCGTAATTTGTCACCCTATTGTTCATAATGGCGCCAAAATC
GAGAGAGGAAATGGACTGCGTCGCAGTGGACTTTGTTTTGCTGGATCGAAGTCGACGGCTGATGGATTCTCTGGGTGGTCGGAATCGGATTCCCAGGGGGAGGTTTTGGA
CTTGCGGAGAAAGAAGTGGTTTGGAGGATTGGTGGGGATTGGAATTACTGGATTCATCCTTGTCTCCGGAATCACCTTTGCAGCATGGTCAATAAACAAGCAGAATAGTT
CCAGACAAAAGCCGCAAATGGAGGCCTTAAGTATGCAGCAAGAATTATTGTTGGACTCTGATACTGGAAATGATAGGCTTGGTGAAGATGAAAAGGAAGATAACAGTGTG
AATGCAGATGATGAGACTCTCGCTGGTAGAACAGGTAACCAAGAGGACTCTTCTTCATGTACAGAAAATGAAGAAACTCTCAACAAAAATAGGGTTGGTGACAGTGTTGA
TGTTGGGGAGTTAGCAGAAAATGATGTTGAATCTTCATCCAGTAATAACGATGTCAATAATGTTGCTTCCTTGCAAGAAGATTTCCAATCTGATTCCTCATTAACTGTTA
CATCAGTTGCTCCCGGAAGTTTGAGTTCACTTATCTCACCTGAATCTGAGTTTGATTCTAATGTAGCTTCTTGTTTTAAAGATGTAAACAACTGTCATTCTGGTGCAGAA
GTCTCAACTTCTGAACCTGAAATGAATATATTGAAAGATGAACCAGACAACTTGCCTAACTCTAATACTAATTCATTAAACCTTAAAACTGATATTCAGGATGAAAGTCC
TGACACTGGTGAAAATTATGATTTCAGCTCCAAGAAGTTACCAGTTTATGATGATAGTTCATCAAACTATAATTCTGGCAATCAGGATGAGACACTTGGTCCTGTAAATG
AAATTACAGATTCTTCATTGCAAGGATTTTCTAGAATATCTGGCGACACAACCAAGGAATCAGGATTATTTGACAGAGAGACTGTGGCTGAATCATCTGACGGAGTTCTG
AGCCCCAGCAAAACTGAACAGCTCTTATCTGAGGATACTGCATCAACCATAGAACAGCAAGTAGAAAGAGGATTATCTGAAGCAGCATTAGTCTCTATCACAGATTATCC
ATCGGCAGATGATCAAGAGAAAAATCATGAAACTATCAAGAATAGTGTTCCTGCTCCTTTGGTTTCTGCAGCTGTTAAAACACTTCCTGGCAAGGTCCTAGTTCCTGCAG
TAGTGGATCAGGTTCATGGGCAGGCATTGGCAGCACTGCAAGTTTTAAAGGTGATAGAGGCTGACGTTGAACCTAGTGATCTATGTACTCGTCGGGAATATGCTCGTTGG
CTGGTGTCTGCAAGCAGTGCTCTTTCAAGGAACACAACATCCAAAGTATACCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCTTTTGATGATATTACACCCGAAGA
CCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCAAGCAAGCTTTCGAGAAATGATATTTCTTCTTCATTGGACGAAGACCAGGGTCCTTTATATT
TCTCTCCCGAAAGCCCCCTATCACGTCAAGATCTCGTGAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTTCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGA
TTTATTGATACTGATAAGATCCATCCAGATGCTTGTCCTGCACTTGTTGCTGATCTTTCTGTAGGAGAACAAGGAATAATAGCTCTTGCATTTGGATATACAAGGCTTTT
CCAGCCAGATAAGCCTGTAACGAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCAGATATAGTAAGTGAGGAGCTTACAAGGATTGAAGCTGAATCAATGG
CAGAAAATGCTGTTGCTGCGCATAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATAAATGCTAGCTTTGAGAAAGAACTTTCCATCGAGAGAGAAAAGGTTGAGGCTGTG
GAGAAAATGGCAGAAGAGGCCAAGCAAGAATTGGAAAGATTAAGATCTGAAAGAGAGAGAGATAGTATCACCTTGATGAGGGAACGCGCTTCTATTGAATCGGAAATGGA
AGTTTTTTCAAGGTTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAGGTATCCTTTGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAG
CTGAAATTGAAAATCAGGAGATTTCCCGCCTGCAGTATGAGCTTGAAGTTGAGAGAAAGGCGCTTTCCATGGCCAGAGCTTGGGCCGAGGACGAAGCAAAAAGAGCAAGA
GAACAAGCAAAAGCACTCGAAGAGGCTAGGGATCGCTGGGAAAAGCATGGCATCAAAGTAGTCGTTGACAGCGATCTTCGTGAACAAGAATCTGCTGGCGATACCTGGCT
TGATTCTAGCAAACAGTTTGCAGTCGAAGAAACCACGGACAGGGCCGAGAACTTAATGGAGAAGCTGAAAAGAATGTCTACAGAAGTAAGAGGGAAATCGAGAGATGTAA
TTGAGAAGATCATCCAGAAGATAGCTTTGCTAGTATCAAACTTGAGACAATGGATTTCCAAAAGTGGAGAAAAGGCTGAAGATCTAAAGAATGTGGCGATTTCAAAGGCA
AGTAGATCAGCAACAGAGTTGCAACAGAGCACTGCAGAGTTGAGCTTGGCCCTGAAGGAGGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAAAAACTCCCGAAAC
CCTAGTCCTTCGCTTCCCTCGCCTCCGATCATTTGGTTCTGGTCCATACGAGAAGGCAATGGCTCCGAAGCAGCCGAATACTGGTCTCTTCGTGGGACTTAACAAAGGAC
ACGTAGTTACAAAGAAAGAGTTGGCTCCGCGCCCCTCAGATCGCAAAGGAAGTTGTTTTAGCTGTTGGTGGATGGAGAAAAGTAGCAAAAGAGTTCTCTTTGTAAGGAAT
TTGATCCGGGAAGTTGCTGGCTTTGCCCCATATGAGAAGAGAATCACTGAGCTTCTTAAAGTTGGAAAGGACAAGAGAGCACTGAAAGTGGCTAAGAGAAAGTTGGGAAC
TCACAAGAGAGCCAAGAAGAAGAGAGAAGAGATGTCCAGCGTTCTCCGCAAGATGAGCTTCCTGCAAATGGGCACTGCCGAGGAGATTGGGCAACAGCTCGAATTCGATG
TAGAAGACACTATATTTGTTGCGGTGGGCACCGATGTTGACGACGCCAAAGCTACTCTCATTTGGGCTGTCCACAACTTCGCTGGCAAGAGTTTCTGTCTGCTTCACGTG
CGACTTGGGAATCCGTCAAGACGAGAGTCGTCTGTAGCTAATTCCGGCACCGCCTCGTCCAATACCAAACAAATCTCAATCGTACACTTCCTGCAAATGGGCACTGCCGA
GGAGATTGGGCAACAGCTCGAATTCGATGTAGAAGACACTATATTTGTTGCGGTGGGCACCGATGTTGACGACGCCAAAGCTACTCTCATTTGGGCTGTCCACAACTTCG
CTGGCAAGAGTTTCTGTCTGCTTCACGTTCATCATCTTTCTCCCAAAAAAGAAGAGCCATCTTCCTATACACTAAAAGAACATGTAGCAAAGGCATACGAGGAACCCAAA
AGCCAAAAGCCTTTTGAACTTCTTAATCAATATGTTCTCATTCTTGCCAAACTGGGGGTACAAGCCCAAAAAGTATGGACTGAGACGAACAACGTTGAGAGAGGGATTGT
TGAAATTATTGCTCAGTACAATGCTCAAGGAAAGGAGAATGCGTGTGATAGGGGAGTAAAAGGTGGACATCCAGTTAGCCAGAGTGGTTTGCAAGAAGAAGCCTCGATGA
ATGTCAAGTCCATAAAAGACTTTGAGGGAGTGAAGGCTTGGGTAGAGGAAGATGGTGCGGGTGCTGAATTCAAGGCTAAATCATTAGAAAGCTCGTGTATGGAGGAGGTG
AAGAAAAGAAAAGAGATGGAAGAACTTTTAGAGAGAGAGAAAAAGGAAGTAGAACGAATGAACAAAGAGCATGATGAGCTTTTAAAAGAACTACAACATGTCCAAGAGCA
AAAATCATTCCTTGAGAGAAAAGCTTCAGAGTACCGGTGTGAGGTGGAGGAGTTGGAGAAGAAGATGTTTGCAGCTGTTGACCTCTTAGTAAGTTTCAAGGAAAAACGAG
ATAAGTTGCAGACAGAGAATGAAGGTGCAATGGACAAGCTTCGAAAGCTGAAGAATATTGTTAAAAGGGAACCTTCACACTATCGCAGTGCAGAAATGCCTACGTTCTCC
TTCATGGAAATAATTGAAGCAACAAGAAACTTTGACCCTTCCTGGAAGATTGGTGAGGGAAGACATGGTAGTGTTTATAAAGGCCTCCTTCGTCACTTGGATGTTGCTCT
AAAAATGTTTGCTTCGTATGGTTCTTATTCCCAATCAACGTTCCAATACGAGGTTGAGGTCTTGAGCAGGGTTAGGCATCCCAACCTGGTTTCAATTATTGGAGCATGTC
CAGAATCTAGGTTAATAGTGTATGAGAATTTGAAAAATGGCAGCTTGGAAGACAACCTTGCCTGCAAAAATCACAATCGCCCACTTCCATGGCAGATACGGATTCGCATT
GCTGCTGATATCTGCTCAGCCCTTATATTTCTACATTACAGCGACCCTTGCATTGTCCATGGAGATATAAAACCAAGCAAAATCCTACTTGATGCCAATTTTATTGCCAA
ACTAGGTGGCCTGGGCATCTCTCGTTTGATCCCGCAAGAAGAGAAGGCCTTTAATTCGGCATCAGTATGTGATATGTTGAAAGAAAATAATCCATATATAGATCCCGAAT
ATCTCGAGACTGGAAGGTTCACTCCAGAATCAGACGTCTACTCTCTTGGTGTTATTTTGCTGCGAATTTTAACTGGCAGAGCACCTTCGGGAATTGTAAAAGATGTAAGA
TGTGCTCTAGAAAATGACGAGATTGGTGTCATCTTGGACTCATCAGCTGGAGACTGGCCACATGATCTAGCAGAGCAGTTAGCTCTTGTGGCATTGAGGTGCTGTGAGAA
GGAAAAGTTGGATCGGCCTGACCTTGTCTCAGAATTATGGGGTGTTTTGGAGCCAATGAGATCCATTGCCTCAGCATCGTGTTCGAGTTTGAAGAAACACTCTCGAGTGC
CTGCCCATTTTACGTGTCCCATTTTTCAGGAAATTATGAAAGATCCACTAATTGCTGCAGATGGATTCACATACGAAGCTGATGCAATAAGAGGATGGTTCAAAAGTGGC
CACAACACTTCTCCGATGACAAATCTTAAACTTGAGCACTGTGATCTTGTGCCAAATTATGCTCTTCTGAATGCAATTCAAGAGTGGCAGCATCAGCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACTTCAGAGTGGCTAAGAGGGAAGGTCCTGAGTTCGAAGAGCTTCAAAGTCCAAGCCATGTACGTCTCCCTAATGTTAAAATATGTGCAATGCTCCTAGTTTGTCT
CTTTGTATATGATATATTCTGGGTCTTCTTCTCTGAGAGATTCTTTGGGGCAAATGTAATGGTATCTGTAGCAACTCAGCAAGCATCAAATCCTGTTCACACAGTTGCTA
ATAGTCTGAGTCTTCCTGGGCTGCAATTGATAACTAAGAAGCTGGAGTTACCTGTCAAGATAGTTTTTCCAAGGAACTTACTTGGTGGAGTCATTCCAGGAAAAAATGCC
ACTGATTTCATGATGCTTGGTCTTGGTGATATGGCAATTCCTGCCATGTTTCTAGCTCTAGTTCTTTGTTTTGACCATCGGAAAAGTAGGGATACAGTCAATCTCTTAGA
TATCCACACAAGGGGCCACAAGTACATTTGGTATGCCCTGCCTGGATATGCCTTCGGGCTGGTGACCGCTCTAACAGCTGGCGTTTTGACTCACTCACCTCAACCTGCTC
TGTTGTATCTGGTGACACCTCTTTGCGGGGTGTACAATGAAAATCCTTTATCCTATTTGGTCTCAGTTGACGGTTTCAACTTCCTTATTGACTGTGGTTGGAACGACCAC
TTCGATCCTGCTCTTCTTCAACCTTTATCCAGGGTGGCATCGACGATTGATGCAGTTTTGATATCACATCCTGATACACTTCACCTCGGTGCTCTTCCTTATGCCATGAA
ACAACTTGGACTTTCTGCTCCAGTTTATTCCACCGAACCCGTGTACCGATTGGGTCTTCTTACAATGTATGATCAGTTTATCGCGAGGAAGGAGTTAAGATATCTGCCAC
GCCACCTATGGGAAATTAAAGAAGCCAACGACATGCTTCCAAAGATGCTACATTTTATAACAAAAGCTTATAAAAGCAATGGAAAACCAGATAAAAGCAAAGGAGAGGGA
ATAGTTATTGCACCTCATGTGGCTGGGCATCTATTGGGGGGAACTCTATGGAAGATAACTAAGGATGGAGAAGATGTTATATATGCTGTTGACTTCAACCACCGCAAGGA
AAGGCATCTGAATGGAACCATTCTAGAGTCATTCGTGCGACCTGCTGTATTGATAACTGATGCTTATAATGCTCTAAACAATCAGCCTTACAGACGCCAAAAGGACAAAG
AATTTGGAGATACTATTCAGAAGACCTTAAGAGCTAATGGAAATGTCTTACTTCCTGTTGATACTGCTGGGCGAGTGTTGGAGCTTATTCAAATTTTAGAATGGGTTGTT
CTAGCATCAATGGCTAGTTTGGAAGCTGGCTACTCACATGACATATTTGTTGATTGGGCAATGGATGCCAAAAACCTCGTCCTTTTTTCTGAAAGAGGCCAGTTTGGCAC
TTTGGCCCGCATGCTTCAAGCTGACCCACCTCCCAAAGCAGTTAAGGTAACTGTGTCTAAGAGAGTCCCTTTGACTGGAGATGAGCTTATTGCTTATGAAGAAGAGCAAA
ACAGGAAAAAGGAAGAAGCTCTTAAAGCTAGTTTGCTTAAGGAGGAACAATCTAAAGCATCTCATGGAACTGATAATGATACTGGCGATCCAATGATCATTGATGCTAGC
AGTAATGTAGCACCAGACGTAGGGGGTTCACATGGAGGTGCATACAGAGACATATTGATTGATGGTTTTGTTCCTTCTTCAACCAGCGTTGCTCCAATGTTTCCCTTTTA
TGAAAATAACTCCGCATGGGATGATTTTGGCGAAGTAATCAATCCTGATGATTATGTAATTAAGGATGAAGACATGGACCAAGCAGCGATGCATGCTGGTGGGGACGTGG
ATGGGAAACTAGATGAAACTGCCGCTAACTTGATTCTGGATATGAAGCCTTCGAAAGTTGTATCTAATGAATTAACAGTTTTGGTGCATGGAACCGCAGAGGCCACTGAG
CATCTTAAGCAGCATTGCCTTAAAAATGTCTGTCCCCATGTCTATGCCCCCCAAATTGAAGAAACGATCGATGTTACTTCTGATTTATGTGCATATAAGGCTATCGTTTC
ATATACTGAGTCGAAGAAATCTCCAATTCTTGAATTTCAGGTTCAACTTTCAGAAAAGCTGATGAGTAATGTGCTGTTTAAGAAGCTAGGTGATTATGAAATCGCTTGGC
TTGATGCTGAAGTAGGGAAGACCGAGAATGGAACGCTGTCTTTACTTCCCCTCTCAAAGGCTGCTTCGCCTCATAAATCTGTTATTGTTGGTGATTTAAAAATGGCTGAC
TTCAAACAATTTCTTGCCAGCAAGGGAATACAGGTTGAATTTGCTGGGGGTGCTTTGAGATGTGGCGAGTATGTCACCCTACGCAAGGTTACAGATGCAAGTCAAAAGTT
GAGACGCCTCGTTGAATATGGAAGAGATCCAGCAATTTTAAGATTTTTCATGGCTTCCACTCCTCCCACGTGTTCACCCAGCTCTCTCCAGCTTCGCCTCGCTCTCAATT
GCAAGAATTGCGGCAAATTCCCTTCAGTTCTTGTTCGGGCGAGAGTGAGGAAATTGGATCCTCGACTCCGCGTAATTTGTCACCCTATTGTTCATAATGGCGCCAAAATC
GAGAGAGGAAATGGACTGCGTCGCAGTGGACTTTGTTTTGCTGGATCGAAGTCGACGGCTGATGGATTCTCTGGGTGGTCGGAATCGGATTCCCAGGGGGAGGTTTTGGA
CTTGCGGAGAAAGAAGTGGTTTGGAGGATTGGTGGGGATTGGAATTACTGGATTCATCCTTGTCTCCGGAATCACCTTTGCAGCATGGTCAATAAACAAGCAGAATAGTT
CCAGACAAAAGCCGCAAATGGAGGCCTTAAGTATGCAGCAAGAATTATTGTTGGACTCTGATACTGGAAATGATAGGCTTGGTGAAGATGAAAAGGAAGATAACAGTGTG
AATGCAGATGATGAGACTCTCGCTGGTAGAACAGGTAACCAAGAGGACTCTTCTTCATGTACAGAAAATGAAGAAACTCTCAACAAAAATAGGGTTGGTGACAGTGTTGA
TGTTGGGGAGTTAGCAGAAAATGATGTTGAATCTTCATCCAGTAATAACGATGTCAATAATGTTGCTTCCTTGCAAGAAGATTTCCAATCTGATTCCTCATTAACTGTTA
CATCAGTTGCTCCCGGAAGTTTGAGTTCACTTATCTCACCTGAATCTGAGTTTGATTCTAATGTAGCTTCTTGTTTTAAAGATGTAAACAACTGTCATTCTGGTGCAGAA
GTCTCAACTTCTGAACCTGAAATGAATATATTGAAAGATGAACCAGACAACTTGCCTAACTCTAATACTAATTCATTAAACCTTAAAACTGATATTCAGGATGAAAGTCC
TGACACTGGTGAAAATTATGATTTCAGCTCCAAGAAGTTACCAGTTTATGATGATAGTTCATCAAACTATAATTCTGGCAATCAGGATGAGACACTTGGTCCTGTAAATG
AAATTACAGATTCTTCATTGCAAGGATTTTCTAGAATATCTGGCGACACAACCAAGGAATCAGGATTATTTGACAGAGAGACTGTGGCTGAATCATCTGACGGAGTTCTG
AGCCCCAGCAAAACTGAACAGCTCTTATCTGAGGATACTGCATCAACCATAGAACAGCAAGTAGAAAGAGGATTATCTGAAGCAGCATTAGTCTCTATCACAGATTATCC
ATCGGCAGATGATCAAGAGAAAAATCATGAAACTATCAAGAATAGTGTTCCTGCTCCTTTGGTTTCTGCAGCTGTTAAAACACTTCCTGGCAAGGTCCTAGTTCCTGCAG
TAGTGGATCAGGTTCATGGGCAGGCATTGGCAGCACTGCAAGTTTTAAAGGTGATAGAGGCTGACGTTGAACCTAGTGATCTATGTACTCGTCGGGAATATGCTCGTTGG
CTGGTGTCTGCAAGCAGTGCTCTTTCAAGGAACACAACATCCAAAGTATACCCAGCAATGTATATAGAGAATGTTACCGAGCTTGCTTTTGATGATATTACACCCGAAGA
CCCTGATTTTGCATCTATTCAAGGTTTGGCAGAAGCTGGACTGATTTCAAGCAAGCTTTCGAGAAATGATATTTCTTCTTCATTGGACGAAGACCAGGGTCCTTTATATT
TCTCTCCCGAAAGCCCCCTATCACGTCAAGATCTCGTGAGTTGGAAGATGGCCCTTGAAAAAAGACAGCTTCCAGAGGCAGATAGAAAGATGCTCCACCAAGTTTCTGGA
TTTATTGATACTGATAAGATCCATCCAGATGCTTGTCCTGCACTTGTTGCTGATCTTTCTGTAGGAGAACAAGGAATAATAGCTCTTGCATTTGGATATACAAGGCTTTT
CCAGCCAGATAAGCCTGTAACGAAAGCACAAGCTGCCATTGCTCTTGCAACTGGGGAGGCTTCAGATATAGTAAGTGAGGAGCTTACAAGGATTGAAGCTGAATCAATGG
CAGAAAATGCTGTTGCTGCGCATAGTGCTTTGGTAGCTCAAGTTGAGAAAGATATAAATGCTAGCTTTGAGAAAGAACTTTCCATCGAGAGAGAAAAGGTTGAGGCTGTG
GAGAAAATGGCAGAAGAGGCCAAGCAAGAATTGGAAAGATTAAGATCTGAAAGAGAGAGAGATAGTATCACCTTGATGAGGGAACGCGCTTCTATTGAATCGGAAATGGA
AGTTTTTTCAAGGTTAAGGAATGAGTTGGAGGAGCAGTTGCAAGGCCTGATGAGTAATAAAGTAGAGGTATCCTTTGAAAAGGAAAGAATCAACAAACTCAGGAAAGAAG
CTGAAATTGAAAATCAGGAGATTTCCCGCCTGCAGTATGAGCTTGAAGTTGAGAGAAAGGCGCTTTCCATGGCCAGAGCTTGGGCCGAGGACGAAGCAAAAAGAGCAAGA
GAACAAGCAAAAGCACTCGAAGAGGCTAGGGATCGCTGGGAAAAGCATGGCATCAAAGTAGTCGTTGACAGCGATCTTCGTGAACAAGAATCTGCTGGCGATACCTGGCT
TGATTCTAGCAAACAGTTTGCAGTCGAAGAAACCACGGACAGGGCCGAGAACTTAATGGAGAAGCTGAAAAGAATGTCTACAGAAGTAAGAGGGAAATCGAGAGATGTAA
TTGAGAAGATCATCCAGAAGATAGCTTTGCTAGTATCAAACTTGAGACAATGGATTTCCAAAAGTGGAGAAAAGGCTGAAGATCTAAAGAATGTGGCGATTTCAAAGGCA
AGTAGATCAGCAACAGAGTTGCAACAGAGCACTGCAGAGTTGAGCTTGGCCCTGAAGGAGGGAGCAAAGCGAGTTGTGGGAGATTGTAGGGAAGGAAAAACTCCCGAAAC
CCTAGTCCTTCGCTTCCCTCGCCTCCGATCATTTGGTTCTGGTCCATACGAGAAGGCAATGGCTCCGAAGCAGCCGAATACTGGTCTCTTCGTGGGACTTAACAAAGGAC
ACGTAGTTACAAAGAAAGAGTTGGCTCCGCGCCCCTCAGATCGCAAAGGAAGTTGTTTTAGCTGTTGGTGGATGGAGAAAAGTAGCAAAAGAGTTCTCTTTGTAAGGAAT
TTGATCCGGGAAGTTGCTGGCTTTGCCCCATATGAGAAGAGAATCACTGAGCTTCTTAAAGTTGGAAAGGACAAGAGAGCACTGAAAGTGGCTAAGAGAAAGTTGGGAAC
TCACAAGAGAGCCAAGAAGAAGAGAGAAGAGATGTCCAGCGTTCTCCGCAAGATGAGCTTCCTGCAAATGGGCACTGCCGAGGAGATTGGGCAACAGCTCGAATTCGATG
TAGAAGACACTATATTTGTTGCGGTGGGCACCGATGTTGACGACGCCAAAGCTACTCTCATTTGGGCTGTCCACAACTTCGCTGGCAAGAGTTTCTGTCTGCTTCACGTG
CGACTTGGGAATCCGTCAAGACGAGAGTCGTCTGTAGCTAATTCCGGCACCGCCTCGTCCAATACCAAACAAATCTCAATCGTACACTTCCTGCAAATGGGCACTGCCGA
GGAGATTGGGCAACAGCTCGAATTCGATGTAGAAGACACTATATTTGTTGCGGTGGGCACCGATGTTGACGACGCCAAAGCTACTCTCATTTGGGCTGTCCACAACTTCG
CTGGCAAGAGTTTCTGTCTGCTTCACGTTCATCATCTTTCTCCCAAAAAAGAAGAGCCATCTTCCTATACACTAAAAGAACATGTAGCAAAGGCATACGAGGAACCCAAA
AGCCAAAAGCCTTTTGAACTTCTTAATCAATATGTTCTCATTCTTGCCAAACTGGGGGTACAAGCCCAAAAAGTATGGACTGAGACGAACAACGTTGAGAGAGGGATTGT
TGAAATTATTGCTCAGTACAATGCTCAAGGAAAGGAGAATGCGTGTGATAGGGGAGTAAAAGGTGGACATCCAGTTAGCCAGAGTGGTTTGCAAGAAGAAGCCTCGATGA
ATGTCAAGTCCATAAAAGACTTTGAGGGAGTGAAGGCTTGGGTAGAGGAAGATGGTGCGGGTGCTGAATTCAAGGCTAAATCATTAGAAAGCTCGTGTATGGAGGAGGTG
AAGAAAAGAAAAGAGATGGAAGAACTTTTAGAGAGAGAGAAAAAGGAAGTAGAACGAATGAACAAAGAGCATGATGAGCTTTTAAAAGAACTACAACATGTCCAAGAGCA
AAAATCATTCCTTGAGAGAAAAGCTTCAGAGTACCGGTGTGAGGTGGAGGAGTTGGAGAAGAAGATGTTTGCAGCTGTTGACCTCTTAGTAAGTTTCAAGGAAAAACGAG
ATAAGTTGCAGACAGAGAATGAAGGTGCAATGGACAAGCTTCGAAAGCTGAAGAATATTGTTAAAAGGGAACCTTCACACTATCGCAGTGCAGAAATGCCTACGTTCTCC
TTCATGGAAATAATTGAAGCAACAAGAAACTTTGACCCTTCCTGGAAGATTGGTGAGGGAAGACATGGTAGTGTTTATAAAGGCCTCCTTCGTCACTTGGATGTTGCTCT
AAAAATGTTTGCTTCGTATGGTTCTTATTCCCAATCAACGTTCCAATACGAGGTTGAGGTCTTGAGCAGGGTTAGGCATCCCAACCTGGTTTCAATTATTGGAGCATGTC
CAGAATCTAGGTTAATAGTGTATGAGAATTTGAAAAATGGCAGCTTGGAAGACAACCTTGCCTGCAAAAATCACAATCGCCCACTTCCATGGCAGATACGGATTCGCATT
GCTGCTGATATCTGCTCAGCCCTTATATTTCTACATTACAGCGACCCTTGCATTGTCCATGGAGATATAAAACCAAGCAAAATCCTACTTGATGCCAATTTTATTGCCAA
ACTAGGTGGCCTGGGCATCTCTCGTTTGATCCCGCAAGAAGAGAAGGCCTTTAATTCGGCATCAGTATGTGATATGTTGAAAGAAAATAATCCATATATAGATCCCGAAT
ATCTCGAGACTGGAAGGTTCACTCCAGAATCAGACGTCTACTCTCTTGGTGTTATTTTGCTGCGAATTTTAACTGGCAGAGCACCTTCGGGAATTGTAAAAGATGTAAGA
TGTGCTCTAGAAAATGACGAGATTGGTGTCATCTTGGACTCATCAGCTGGAGACTGGCCACATGATCTAGCAGAGCAGTTAGCTCTTGTGGCATTGAGGTGCTGTGAGAA
GGAAAAGTTGGATCGGCCTGACCTTGTCTCAGAATTATGGGGTGTTTTGGAGCCAATGAGATCCATTGCCTCAGCATCGTGTTCGAGTTTGAAGAAACACTCTCGAGTGC
CTGCCCATTTTACGTGTCCCATTTTTCAGGAAATTATGAAAGATCCACTAATTGCTGCAGATGGATTCACATACGAAGCTGATGCAATAAGAGGATGGTTCAAAAGTGGC
CACAACACTTCTCCGATGACAAATCTTAAACTTGAGCACTGTGATCTTGTGCCAAATTATGCTCTTCTGAATGCAATTCAAGAGTGGCAGCATCAGCTATGACAACTTTG
GTTTCGTTTCCTTCCACCTTCACCTGCCAGAAAAGACTACAAACTGGTAAGGTTTGGTATATTTTATTTACATACCTATATCCTTCGATGTACAGAACATTGTTATAGAC
CTAAATAAAGCATAGTCATACTGCTGATTTAAAGTTAGGCCTTCTGTTAAGTCTAATGAGTGATGTCTAATGTTGGTTATGGAACCGAATCCATTATAGAACATTTTTCT
TCTCCAAGTAAAATTTCCTACTCAGTCAAAATTCATTTCAGTTCCAACTTTTGCTTTG
Protein sequenceShow/hide protein sequence
MDFRVAKREGPEFEELQSPSHVRLPNVKICAMLLVCLFVYDIFWVFFSERFFGANVMVSVATQQASNPVHTVANSLSLPGLQLITKKLELPVKIVFPRNLLGGVIPGKNA
TDFMMLGLGDMAIPAMFLALVLCFDHRKSRDTVNLLDIHTRGHKYIWYALPGYAFGLVTALTAGVLTHSPQPALLYLVTPLCGVYNENPLSYLVSVDGFNFLIDCGWNDH
FDPALLQPLSRVASTIDAVLISHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQFIARKELRYLPRHLWEIKEANDMLPKMLHFITKAYKSNGKPDKSKGEG
IVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAGRVLELIQILEWVV
LASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGTLARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNRKKEEALKASLLKEEQSKASHGTDNDTGDPMIIDAS
SNVAPDVGGSHGGAYRDILIDGFVPSSTSVAPMFPFYENNSAWDDFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVLVHGTAEATE
HLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKAIVSYTESKKSPILEFQVQLSEKLMSNVLFKKLGDYEIAWLDAEVGKTENGTLSLLPLSKAASPHKSVIVGDLKMAD
FKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQKLRRLVEYGRDPAILRFFMASTPPTCSPSSLQLRLALNCKNCGKFPSVLVRARVRKLDPRLRVICHPIVHNGAKI
ERGNGLRRSGLCFAGSKSTADGFSGWSESDSQGEVLDLRRKKWFGGLVGIGITGFILVSGITFAAWSINKQNSSRQKPQMEALSMQQELLLDSDTGNDRLGEDEKEDNSV
NADDETLAGRTGNQEDSSSCTENEETLNKNRVGDSVDVGELAENDVESSSSNNDVNNVASLQEDFQSDSSLTVTSVAPGSLSSLISPESEFDSNVASCFKDVNNCHSGAE
VSTSEPEMNILKDEPDNLPNSNTNSLNLKTDIQDESPDTGENYDFSSKKLPVYDDSSSNYNSGNQDETLGPVNEITDSSLQGFSRISGDTTKESGLFDRETVAESSDGVL
SPSKTEQLLSEDTASTIEQQVERGLSEAALVSITDYPSADDQEKNHETIKNSVPAPLVSAAVKTLPGKVLVPAVVDQVHGQALAALQVLKVIEADVEPSDLCTRREYARW
LVSASSALSRNTTSKVYPAMYIENVTELAFDDITPEDPDFASIQGLAEAGLISSKLSRNDISSSLDEDQGPLYFSPESPLSRQDLVSWKMALEKRQLPEADRKMLHQVSG
FIDTDKIHPDACPALVADLSVGEQGIIALAFGYTRLFQPDKPVTKAQAAIALATGEASDIVSEELTRIEAESMAENAVAAHSALVAQVEKDINASFEKELSIEREKVEAV
EKMAEEAKQELERLRSERERDSITLMRERASIESEMEVFSRLRNELEEQLQGLMSNKVEVSFEKERINKLRKEAEIENQEISRLQYELEVERKALSMARAWAEDEAKRAR
EQAKALEEARDRWEKHGIKVVVDSDLREQESAGDTWLDSSKQFAVEETTDRAENLMEKLKRMSTEVRGKSRDVIEKIIQKIALLVSNLRQWISKSGEKAEDLKNVAISKA
SRSATELQQSTAELSLALKEGAKRVVGDCREGKTPETLVLRFPRLRSFGSGPYEKAMAPKQPNTGLFVGLNKGHVVTKKELAPRPSDRKGSCFSCWWMEKSSKRVLFVRN
LIREVAGFAPYEKRITELLKVGKDKRALKVAKRKLGTHKRAKKKREEMSSVLRKMSFLQMGTAEEIGQQLEFDVEDTIFVAVGTDVDDAKATLIWAVHNFAGKSFCLLHV
RLGNPSRRESSVANSGTASSNTKQISIVHFLQMGTAEEIGQQLEFDVEDTIFVAVGTDVDDAKATLIWAVHNFAGKSFCLLHVHHLSPKKEEPSSYTLKEHVAKAYEEPK
SQKPFELLNQYVLILAKLGVQAQKVWTETNNVERGIVEIIAQYNAQGKENACDRGVKGGHPVSQSGLQEEASMNVKSIKDFEGVKAWVEEDGAGAEFKAKSLESSCMEEV
KKRKEMEELLEREKKEVERMNKEHDELLKELQHVQEQKSFLERKASEYRCEVEELEKKMFAAVDLLVSFKEKRDKLQTENEGAMDKLRKLKNIVKREPSHYRSAEMPTFS
FMEIIEATRNFDPSWKIGEGRHGSVYKGLLRHLDVALKMFASYGSYSQSTFQYEVEVLSRVRHPNLVSIIGACPESRLIVYENLKNGSLEDNLACKNHNRPLPWQIRIRI
AADICSALIFLHYSDPCIVHGDIKPSKILLDANFIAKLGGLGISRLIPQEEKAFNSASVCDMLKENNPYIDPEYLETGRFTPESDVYSLGVILLRILTGRAPSGIVKDVR
CALENDEIGVILDSSAGDWPHDLAEQLALVALRCCEKEKLDRPDLVSELWGVLEPMRSIASASCSSLKKHSRVPAHFTCPIFQEIMKDPLIAADGFTYEADAIRGWFKSG
HNTSPMTNLKLEHCDLVPNYALLNAIQEWQHQL