; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G026830 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G026830
OrganismCucumis hystrix (Cucumber (hystrix) v1)
Descriptionaspartic proteinase-like
Genome locationchrH02:2603287..2619928
RNA-Seq ExpressionChy2G026830
SyntenyChy2G026830
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0006629 - lipid metabolic process (biological process)
GO:0016036 - cellular response to phosphate starvation (biological process)
GO:0045892 - negative regulation of transcription, DNA-templated (biological process)
GO:0080169 - cellular response to boron-containing substance deprivation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR036576 - WRKY domain superfamily
IPR033869 - Phytepsin
IPR033121 - Peptidase family A1 domain
IPR021109 - Aspartic peptidase domain superfamily
IPR011001 - Saposin-like
IPR008139 - Saposin B type domain
IPR008138 - Saposin B type, region 2
IPR007856 - Saposin-like type B, region 1
IPR003657 - WRKY domain
IPR001969 - Aspartic peptidase, active site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4372572.1 hypothetical protein F8388_027245 [Cannabis sativa]0.056.81Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTL---NSF---------QRMFQGLEFSG--------KLGHTDSTP--DDNNRLAVEVDFFSSKKRVVDD
        M+KGWGLTL DSD  S GFF NKQ P     T    N F           MF G+E S         +L  T   P  D+N   A  VDFFS K R V+D
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTL---NSF---------QRMFQGLEFSG--------KLGHTDSTP--DDNNRLAVEVDFFSSKKRVVDD

Query:  LQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTGL--HLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSY
             DSK  +  S+ K+       P P+     +NTGL  HL TANTGSD+STVDDG+ SD E+K  KN++ +LQVELQ MN EN KL++ML+ VSN+Y
Subjt:  LQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTGL--HLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSY

Query:  SSLHMHLLSLMQQKQQ---QQNHPSEPAH--QREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINN---NN
        S+L MH++S+MQQ+QQ   QQNH ++ A   Q       K  E+KHE       RQF+DLGPS       + +    +SSS+ERTRS +P N NN   + 
Subjt:  SSLHMHLLSLMQQKQQ---QQNHPSEPAH--QREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINN---NN

Query:  NNNNTETASKKRDHAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKY
          NN+ T  K      I   N DH ++     + + R++SPESES+GW  NN+K P+ N+   SKP++QS +ATMRKARVSVRARSEAPMI+DGCQWRKY
Subjt:  NNNNTETASKKRDHAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKY

Query:  GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMAT
        GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNH HPLPPAAMAMASTTTAAA MLLSGSMSSAD  LMNPNLLARAILPCSS+MAT
Subjt:  GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMAT

Query:  ISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGH----HQITQPATPAQPGGAS
        ISASAPFPT+TLDLT+  NPLQ  R     FQVP   GQ  + A       GQ LYN QSKFSGLQLS   GA  +H+ +     Q+    T  QP   S
Subjt:  ISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGH----HQITQPATPAQPGGAS

Query:  FADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV
         AD+++  T                IS+I+++            LLVS LL      S  +S SNEG LRIGLKK+K D N R  A L+SKK + L +S+
Subjt:  FADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV

Query:  GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFF
         K++    NL +S +ADIV LKNYLDAQYYGEIGIG+PPQKFT+                    AC+FHAKY+S +S+TY+KNG  A+IQYGSGA+SGFF
Subjt:  GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFF

Query:  SYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYV
        SYD+V+VGD +V++QE IE T+   +TF+ AKFDG+LGLGFQEI+ G +VPVWYNM+KQ LVKE VFSFWLNRN +E+EGGE+VFGGVDPKHFKG+HTYV
Subjt:  SYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYV

Query:  PVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDET
        PV+ KGYWQFD+GD+LIGG+ T YC  GCSAIADSGTSLLAGP++++  IN AIGA+ V   ECKA+V QY + I+DLLLA+   +K+CS+IG+CTFD T
Subjt:  PVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDET

Query:  HDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILK
          VS+ IE+V+ +K+G+ S G  +A CS CEMAV+W+Q++L QN+TQE I++ V+ELCDR  + + E+ VDCGR+S MP +SFTIG +VF+L  + YILK
Subjt:  HDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILK

Query:  VGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA
        VGEG+ AQCISGF   D+PPPRGPLWILGD+FMG YHTVFD+GK RVGFAEAA
Subjt:  VGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA

KAF4399586.1 hypothetical protein G4B88_022669 [Cannabis sativa]0.058.28Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTL---NSF---------QRMFQGLEFSG--------KLGHTDSTP--DDNNRLAVEVDFFSSKKRVVDD
        M+KGWGLTL DSD  S GFF N Q P     T    N F           MF G+E S         +L  T   P  D+N   A  VDFFS K R V+D
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTL---NSF---------QRMFQGLEFSG--------KLGHTDSTP--DDNNRLAVEVDFFSSKKRVVDD

Query:  LQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTGL--HLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSY
             DSK  +  S+ K+       P P+     +NTGL  HL TANTGSD+STVDDG+ SD E+K  KN++ +LQVELQ MN EN KL++ML+ VSN+Y
Subjt:  LQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTGL--HLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSY

Query:  SSLHMHLLSLMQQKQQ---QQNHPSEPAH--QREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINN---NN
        S+L MH++S+MQQ+QQ   QQNH ++ A   Q       K  E+KHE       RQF+DLGPS       + +    +SSS+ERTRS +P N NN   + 
Subjt:  SSLHMHLLSLMQQKQQ---QQNHPSEPAH--QREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINN---NN

Query:  NNNNTETASKKRDHAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKY
          NN+ T  K      I   N DH ++     + + R++SPESES+GW  NN+K P+ N+   SKP++QS +ATMRKARVSVRARSEAPMI+DGCQWRKY
Subjt:  NNNNTETASKKRDHAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKY

Query:  GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMAT
        GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNH HPLPPAAMAMASTTTAAA MLLSGSMSSAD  LMNPNLLARAILPCSS+MAT
Subjt:  GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMAT

Query:  ISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGH----HQITQPATPAQPGGAS
        ISASAPFPT+TLDLT+  NPLQ  R     FQVP   GQ  + A       GQ LYN QSKFSGLQLS   GA  +H+ +     Q+    T  QP   S
Subjt:  ISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGH----HQITQPATPAQPGGAS

Query:  FADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV
         AD+++  T                IS+I+++            LLVS LL      S  +S SNEG LRIGLKK+K D N R  A L+SKK + L +S+
Subjt:  FADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV

Query:  GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFF
         K++    NL +S +ADIV LKNYLDAQYYGEIGIG+PPQKFTVIFDTGSSNLWVPSAKC FS+AC+FHAKY+S +S+TY+KNG  A+IQYGSGA+SGFF
Subjt:  GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFF

Query:  SYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYV
        SYD+V+VGD +V++QE IE T+   +TF+ AKFDG+LGLGFQEI+ G +VPV YNM+KQ LVKE VFSFWLNRN +E+EGGE+VFGGVDPKHFKG+HTYV
Subjt:  SYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYV

Query:  PVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDET
        PV+ KGYWQFD+GD+LIGG+ T YC  GCSAIADSGTSLLAGP++++  IN AIGA+ V   ECKA+V QY + I+DLLLA+   +K+CS+IG+CTFD T
Subjt:  PVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDET

Query:  HDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILK
          VS+ IE+V+ +K+G+ S G  +A CS CEMAV+W+Q++L QN+TQE I++ V+ELCDR  + + E+ VDCGR+S MP +SFTIG +VF+L  + YILK
Subjt:  HDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILK

Query:  VGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA
        VGEG+ AQCISGF   D+PPPRGPLWILGD+FMG YHTVFD+GK RVGFAEAA
Subjt:  VGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA

KAF5742150.1 Saposin-like aspartyl protease family protein [Tripterygium wilfordii]0.059.7Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSKP--------TSTTS
        MDKGWGLTL   D +   FF+  Q       + +SF  +    +F   LG   +  D+N  L  EVDF + K R+VD    D+D           T   +
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSKP--------TSTTS

Query:  IIKDDKASTPPPPPTTSFNL-VNTGLHLLTANTGSDQSTV----DDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLM
        + K++  S          NL VNTGLHLLTANTGSDQS V    DDG+SSD ++KRAK ELAQLQ EL RMN EN +LRDML  V+ +Y +L MHL++LM
Subjt:  IIKDDKASTPPPPPTTSFNL-VNTGLHLLTANTGSDQSTV----DDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLM

Query:  -QQKQQQQNHPSEPAHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIM
         QQ +  Q+HP           E KS +   +V  + VPRQFMDL PS       E+ E + NSSSDERTRSGS  N        N E ASK      + 
Subjt:  -QQKQQQQNHPSEPAHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIM

Query:  PPNFDHENSKRSIPREDSPESESKGWGPNNHKTPRFNNSSNS-KPLDQST-EATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMA
              +N KR + RE+S ESE   W  +  K  + NNS+ S   +DQST +ATMRKARVSVRARSEA + +DGCQWRKYGQKMAKGNPCPRAYYRCTMA
Subjt:  PPNFDHENSKRSIPREDSPESESKGWGPNNHKTPRFNNSSNS-KPLDQST-EATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMA

Query:  VGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNP
        VGCPVRKQVQRCAED+TILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSAD  +MN NLLAR +LPCSS+MATISASAPFPT+TLDLTH+PNP
Subjt:  VGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNP

Query:  LQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAA
        LQFQRP A  FQ PFPG        QL Q +GQ +YNNQSKFSG + S +M  +SSHL          PAQ   + FAD +SAATAAITADPNFTAALAA
Subjt:  LQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAA

Query:  AISSII---------------VNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNN
        AI+SII               VN    +R I FS  L  L      + S  +SA+N+  +R+ LKK+K D N+   + LE+K G+ L SS+ K+   G  
Subjt:  AISSII---------------VNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNN

Query:  LEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGD
        L +SKN DIV LKNYLDAQY+GEIGIGTPPQKFTVIFDTGSSNLWVPSAKC FS+AC+FH+KY+S  SSTY KNG  AAIQYG+GAISGFFSYDNV+VGD
Subjt:  LEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGD

Query:  AIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQ
         +V+ Q  IEAT    +TFM AKFDGILGLGFQEI+ G A PVWYNM+KQ L+K+ VFSFWLNRN  E+EGGE+VFGGVDP H+KG+HTYVP+T KGYWQ
Subjt:  AIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQ

Query:  FDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIEN
        FD+GD+LI  + T YC  GCSAIADSGTSLLAGPS +V  IN AIGA+ V   ECK +VSQY + IMDLLLA+ QP+K+CS++G+CTFD TH +S+ IE+
Subjt:  FDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIEN

Query:  VMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQC
        V+ D+  + S G  +AMC+ACEM V+W+Q +LKQN+TQE I+D VN+LCD+  +   E+ VDC  I  MP VSFTIG +VF+L  ++YILKVGEGSAAQC
Subjt:  VMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQC

Query:  ISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA
        ISGF   DIPPPRGPLWILGD+FMG YHTVFD+GK RVGFAEAA
Subjt:  ISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA

KAF9683352.1 hypothetical protein SADUNF_Sadunf04G0004600 [Salix dunnii]0.056.3Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRL-----------------AVEVDFFSSKKRVVDDLQADQ
        MDKGWGLTL  SD  S+   +N       P  ++SF ++ +  +FS      DS   +N+                   A EVDFF  +   VD      
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRL-----------------AVEVDFFSSKKRVVDDLQADQ

Query:  DSKPTSTTSIIKDDKASTPPPPPTTSFNL-VNTGLHLLTANTGSDQSTVDDGISSDGENKRAKN-ELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHM
        DSK   TTS+I   + S P   P +S  L VNTGLHL TAN  SDQSTVDDG+S + ++KR+KN ELAQLQVELQ+MN EN +L+DMLS V+N+YS+L M
Subjt:  DSKPTSTTSIIKDDKASTPPPPPTTSFNL-VNTGLHLLTANTGSDQSTVDDGISSDGENKRAKN-ELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHM

Query:  HLLSLMQQKQQQQNHPSEPAHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRD
        H ++LMQQ+    NH  E   ++E    K S E K E    MVPRQFMDLGPS       E++E+  NSSS+ERTRS +P N        + E AS K +
Subjt:  HLLSLMQQKQQQQNHPSEPAHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRD

Query:  -HAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNS-KPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCP
           E +P  +D ENS     + I  ++SP+SES+GW PN  K  + N +S++ K ++QS EATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCP
Subjt:  -HAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNS-KPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCP

Query:  RAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILP-CSSSMATISASAPFPTI
        RAYYRCTMAVGCPVRKQVQRCAED+TILITTYEGNHNHPLPPAAM MASTTTAAATMLLSGSMSSAD  +MNPNLLARAILP CSSS+ATISASAPFPT+
Subjt:  RAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILP-CSSSMATISASAPFPTI

Query:  TLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPS----AAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATA
        TLDLT+  NPLQFQRP A  FQVPFPG QP +     A Q+PQ  GQALYN QSKFSGLQLS + G  SS LGH   +Q     Q    S  DTLSAATA
Subjt:  TLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPS----AAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATA

Query:  AITADPNFTAALAAAISSIIVN--------------------------CKVTMRKISFS-------------------------HL--------------
        AIT+DPNFTAALAAAISSII                             K+  RK+S                           H+              
Subjt:  AITADPNFTAALAAAISSIIVN--------------------------CKVTMRKISFS-------------------------HL--------------

Query:  ---------------------LVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNY
                             +V  L L     +  +SASN+G LRIGLKK+K D+NSR  A L+SK+   L +S+ K+N   +NL ES++ DIV LKNY
Subjt:  ---------------------LVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNY

Query:  LDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMS
        LDAQYYGEIG+GTPPQKFTVIFDTGSSNLWVPS+KC  S+AC+FH+KY SG+SSTYK+NG SA IQYGSG+ISGFFS D V+VGD +V++QE IEAT   
Subjt:  LDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMS

Query:  TMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQ---------FDIGDI
        ++TF+ AKFDGILGLGF+EI+ G AVPVW NM+K  L+ E VFSFWLNRNAE+++GGE+VFGG+DPKH+KG+HT+VPVT KGYWQ         F++GD+
Subjt:  TMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQ---------FDIGDI

Query:  LIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKD
        LIG + T YCA GC+AIADSGTSLLAGP+ I+  IN+AIGA+     +CK +VSQY  AIMDLLL +AQP++ICS+IG+CTFD T  +S+ I++V+ + +
Subjt:  LIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKD

Query:  GRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIP
         +SSG   +AMC ACEMA +W++ +LKQN+TQ+ I+D VN+LC+R  N   E+ VDCG +  MP VSFTIG + F+L  ++YILKVG+GSAAQCISGF  
Subjt:  GRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIP

Query:  FDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA
         D+PPPRGPLWILGD+FMG YHTVFD GK RVGFAEAA
Subjt:  FDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA

XP_004134775.1 probable WRKY transcription factor 31 [Cucumis sativus]0.097.29Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSKPTSTTSIIKDDKAS
        MDKGWGLTLRDS+HQSIGFFSNK PPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFS+KKRVVDDL+ADQDSKPTSTTSIIKDDKA 
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSKPTSTTSIIKDDKAS

Query:  TPPPPPTTSFNLVNTGLHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLMQQKQQQQNHPSEP
        TPPPPPTTSFNLVNTGLHLLTANTGSDQSTVDDGISSDGE+KRAKNELAQLQVELQRMNAENHKLRDMLSHVSN+YSSLHMHLLSLMQQKQQQQNHPSEP
Subjt:  TPPPPPTTSFNLVNTGLHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLMQQKQQQQNHPSEP

Query:  AHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIMPPNFDHENSKRSIP
        AHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGN+N IGESEELLCNSSSDERTRSGSPLNINNNNNN  TETASKKRDHAEIMPPN DHENSKRSIP
Subjt:  AHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIMPPNFDHENSKRSIP

Query:  REDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDR
        REDSPESES+GWGPN HKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDR
Subjt:  REDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDR

Query:  TILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFP
        TILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCS+SMATISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFP
Subjt:  TILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFP

Query:  GGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAAAISSII
        GGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPA+PAQPGGASFADTLSAATAAITADPNFTAALAAAISSII
Subjt:  GGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAAAISSII

TrEMBL top hitse value%identityAlignment
A0A1S3B045 probable WRKY transcription factor 310.0e+0095.42Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSKPTSTTSIIKDDKAS
        MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFS KLGHTDST DDNNRLAVEVDFFS+KKR+VDDL+ADQDSKPTSTTSIIKDDKA 
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSKPTSTTSIIKDDKAS

Query:  TPPPPPTTSFNLVNTGLHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLMQQKQQQQNHPSEP
        TPPPPPTTSFNLVNTGLHLLTANTGS QSTVDDGISSDGE+KRAKNELAQLQVELQRMNAENHKLRDMLSHVSN+YSSL MHLL+LMQQ+QQQQNH SEP
Subjt:  TPPPPPTTSFNLVNTGLHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLMQQKQQQQNHPSEP

Query:  AHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIMPPNFDHENSKRSIP
        A+QREI GEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNN+GESEELLCNSSSDERTRSGSPLNI    NNNNTETASKKRDHAEIMPPN DHENSKRSIP
Subjt:  AHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIMPPNFDHENSKRSIP

Query:  REDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDR
        REDSPESES+GWGP NHKTPRFNNSSNSKP+DQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDR
Subjt:  REDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDR

Query:  TILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFP
        TILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTH+PNPLQFQRPTAAPF VPFP
Subjt:  TILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFP

Query:  GGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAAAISSII
        GGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSS LGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAAAISSII
Subjt:  GGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAAAISSII

A0A498HG15 Uncharacterized protein0.0e+0055.02Show/hide
Query:  KQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNN---RLAVEVDFFSSKKRVVDDLQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTGLHL
        + PPPPPPPT   F  +         + H+  +P  +N   ++  E DFF+  K      + D +    ST    K D       P    FN VNTGL+L
Subjt:  KQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNN---RLAVEVDFFSSKKRVVDDLQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTGLHL

Query:  LTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLMQQKQQQQNHPSEPAHQREIGGEKKSTEIKHEV
        L  NT SDQS VDDGISS+ E+KRAK+ELA LQ EL+RMNAENH+L+ ML+ V+ +Y++L +HLL+LMQ ++  QN  +   H   +  +K     K   
Subjt:  LTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLMQQKQQQQNHPSEPAHQREIGGEKKSTEIKHEV

Query:  GKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIMPPNFDHENS--KRSIPREDSPESESKGWGPNNH
          ++VPRQFMDLG + NN +  E  +    SSSDER+R     ++ +N          K   H++     FDHE    +R I RE+SP+  S+ WGPN  
Subjt:  GKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIMPPNFDHENS--KRSIPREDSPESESKGWGPNNH

Query:  KTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPP
        K PR N   + K +DQ TEATMRKARVSVRARSEAPMI+DGCQWRKYGQKMAKGNPCPRAYYRCTMA GCPVRKQVQRCAEDRTILITTYEGNHNHPLPP
Subjt:  KTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPP

Query:  AAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQ---PPSAAAQLPQV
        AAMAMASTT++AA MLLSGSM SAD  LM+ N L R ILPCSSSMATISASAPFPT+TLDLT +PNPLQ  +     F +PFP          A+ LPQ+
Subjt:  AAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQ---PPSAAAQLPQV

Query:  LGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAAAISSIIVNCK--------------------
          QALY NQSKFSGLQ+S +    +   GH      + P Q  G S AD L+AATAAI ADPNFTAALAAAI+SII N                      
Subjt:  LGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAAAISSIIVNCK--------------------

Query:  ------------------------------VTMRKIS-------FSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLG
                                      V +RKIS        + L+ S L L+L  S    S  ++G +RIGLKK+K DQ       L  +     G
Subjt:  ------------------------------VTMRKIS-------FSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLG

Query:  SSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAIS
          + K++  G NL ES++ DIV LKNY+DAQY+GEIGIGTP QKFTVIFDTGSSNLWVPSAKC FS+AC+ H KY+S +SSTY KNG SAAIQYG+GAIS
Subjt:  SSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAIS

Query:  GFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQH
        GFFS D+++VGD +V++Q+ IEAT    +TF+AAKFDGILGLGFQEI+ G AVPVWYN+V Q LVKE VFSFWLNRN E +EGGE+VFGGVD  HFKG+H
Subjt:  GFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQH

Query:  TYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTF
        TYVPVT KGYWQFD+GD+LI GE++ +CA GCSAIADSGTSLLAGP+ +V  IN AIGA+ V   ECK +V QY + I+++L+AK+QP+K+CS+IG CTF
Subjt:  TYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTF

Query:  DETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDY
        D T  VS  IE+++  K  + S G ++A C+ACEMAV+W+Q  L++N+T+E I+D VN+LC+R      E++V C  +S +P+VSFTIG +VF+L  + Y
Subjt:  DETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDY

Query:  ILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA
        ILKVGEG AAQCISGFI  D+ PPRGPLWILGD+FMG YHTVFD+G   VGFAEAA
Subjt:  ILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA

A0A7J6FPJ6 Uncharacterized protein0.0e+0056.46Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLN------------SFQRMFQGLEFSG--------KLGHTDSTP--DDNNRLAVEVDFFSSKKRVVDD
        M+KGWGLTL   D  S GFF NKQ P     T +                MF G+E S         +L  T   P  D+N   A  VDFFS K R V+D
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLN------------SFQRMFQGLEFSG--------KLGHTDSTP--DDNNRLAVEVDFFSSKKRVVDD

Query:  LQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTG--LHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSY
             DSK  +  S+ K+       P P+     +NTG  LHL TANTGSD+STVDDG+ SD E+K  KN++ +LQVELQ MN EN KL++ML+ VSN+Y
Subjt:  LQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTG--LHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSY

Query:  SSLHMHLLSLMQQK---QQQQNHPSEPAH--QREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINN---NN
        S+L MH++S+MQQ+   QQQQNH ++ A   Q       K  E+KHE       RQF+DLGPS       + +    +SSS+ERTRS +P N NN   + 
Subjt:  SSLHMHLLSLMQQK---QQQQNHPSEPAH--QREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINN---NN

Query:  NNNNTETASKKRDHAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKY
          NN+ T  K      I   N DH ++     + + R++SPESES+GW  NN+K P+ N+   SKP++QS +ATMRKARVSVRARSEAPMI+DGCQWRKY
Subjt:  NNNNTETASKKRDHAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKY

Query:  GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMAT
        GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNH HPLPPAAMAMASTTTAAA MLLSGSMSSAD  LMNPNLLARAILPCSS+MAT
Subjt:  GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMAT

Query:  ISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGH----HQITQPATPAQPGGAS
        ISASAPFPT+TLDLT+  NPLQ  R     FQVP   GQ  + A       GQ LY NQSKFSGLQLS   GA  +H+ +     Q+    T  QP   S
Subjt:  ISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGH----HQITQPATPAQPGGAS

Query:  FADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV
         AD+++  T                IS+I+++            LLVS LL  L      +S SNEG LRIGLKK+K D N R  A L+SKK + L +S+
Subjt:  FADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV

Query:  GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFF
         K++    NL +S +ADIV LKNYLDAQYYGEIGIG+PPQKFT+                    AC+FHAKY+S +S+TY+KNG  A+IQYGSGA+SGFF
Subjt:  GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFF

Query:  SYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYV
        SYD+V+VGD +V++QE IE T+   +TF+ AKFDG+LGLGFQEI+ G +VPVWYNM+KQ LVKE VFSFWLNRN +E+EGGE+VFGGVDPKHFKG+HTYV
Subjt:  SYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYV

Query:  PVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDET
        PV+ KGYWQFD+GD+LIGG+ T YC  GCSAIADSGTSLLAGP++++  IN AIGA+ V   ECKA+V QY + I+DLLLA+   +K+CS+IG+CTFD T
Subjt:  PVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDET

Query:  HDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILK
          VS+ IE+V+ +K+G+ S G  +A CS CEMAV+W+Q++L QN+TQE I++ V+ELCDR  + + E+ VDCGR+S MP +SFTIG +VF+L  + YILK
Subjt:  HDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILK

Query:  VGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA
        VGEG+ AQCISGF   D+PPPRGPLWILGD+FMG YHTVFD+GK RVGFAEAA
Subjt:  VGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA

A0A7J6HWJ2 Uncharacterized protein0.0e+0057.94Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLN------------SFQRMFQGLEFSG--------KLGHTDSTP--DDNNRLAVEVDFFSSKKRVVDD
        M+KGWGLTL   D  S GFF N Q P     T +                MF G+E S         +L  T   P  D+N   A  VDFFS K R V+D
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLN------------SFQRMFQGLEFSG--------KLGHTDSTP--DDNNRLAVEVDFFSSKKRVVDD

Query:  LQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTG--LHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSY
             DSK  +  S+ K+       P P+     +NTG  LHL TANTGSD+STVDDG+ SD E+K  KN++ +LQVELQ MN EN KL++ML+ VSN+Y
Subjt:  LQADQDSKPTSTTSIIKDDKASTPPPPPTTSFNLVNTG--LHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSY

Query:  SSLHMHLLSLMQQK---QQQQNHPSEPAH--QREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINN---NN
        S+L MH++S+MQQ+   QQQQNH ++ A   Q       K  E+KHE       RQF+DLGPS       + +    +SSS+ERTRS +P N NN   + 
Subjt:  SSLHMHLLSLMQQK---QQQQNHPSEPAH--QREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINN---NN

Query:  NNNNTETASKKRDHAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKY
          NN+ T  K      I   N DH ++     + + R++SPESES+GW  NN+K P+ N+   SKP++QS +ATMRKARVSVRARSEAPMI+DGCQWRKY
Subjt:  NNNNTETASKKRDHAEIMPPNFDHENSK----RSIPREDSPESESKGWGPNNHKTPRFNNSSNSKPLDQSTEATMRKARVSVRARSEAPMISDGCQWRKY

Query:  GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMAT
        GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNH HPLPPAAMAMASTTTAAA MLLSGSMSSAD  LMNPNLLARAILPCSS+MAT
Subjt:  GQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMAT

Query:  ISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGH----HQITQPATPAQPGGAS
        ISASAPFPT+TLDLT+  NPLQ  R     FQVP   GQ  + A       GQ LY NQSKFSGLQLS   GA  +H+ +     Q+    T  QP   S
Subjt:  ISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGH----HQITQPATPAQPGGAS

Query:  FADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV
         AD+++  T                IS+I+++            LLVS LL  L      +S SNEG LRIGLKK+K D N R  A L+SKK + L +S+
Subjt:  FADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV

Query:  GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFF
         K++    NL +S +ADIV LKNYLDAQYYGEIGIG+PPQKFTVIFDTGSSNLWVPSAKC FS+AC+FHAKY+S +S+TY+KNG  A+IQYGSGA+SGFF
Subjt:  GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFF

Query:  SYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYV
        SYD+V+VGD +V++QE IE T+   +TF+ AKFDG+LGLGFQEI+ G +VPV YNM+KQ LVKE VFSFWLNRN +E+EGGE+VFGGVDPKHFKG+HTYV
Subjt:  SYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYV

Query:  PVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDET
        PV+ KGYWQFD+GD+LIGG+ T YC  GCSAIADSGTSLLAGP++++  IN AIGA+ V   ECKA+V QY + I+DLLLA+   +K+CS+IG+CTFD T
Subjt:  PVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDET

Query:  HDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILK
          VS+ IE+V+ +K+G+ S G  +A CS CEMAV+W+Q++L QN+TQE I++ V+ELCDR  + + E+ VDCGR+S MP +SFTIG +VF+L  + YILK
Subjt:  HDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILK

Query:  VGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA
        VGEG+ AQCISGF   D+PPPRGPLWILGD+FMG YHTVFD+GK RVGFAEAA
Subjt:  VGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA

A0A7J7D707 Saposin-like aspartyl protease family protein0.0e+0059.62Show/hide
Query:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSK--------PTSTTS
        MDKGWGLTL   D +   FF+  Q       + +SF  +    +F   LG   +  D+N  L  EVDF + K R+VD    D+D           T   +
Subjt:  MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSK--------PTSTTS

Query:  IIKDDKASTPPPPPTTSFNL-VNTGLHLLTANTGSDQSTV----DDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLM
        + K++  S          NL VNTGLHLLTANTGSDQS V    DDG+SSD ++KRAK ELAQLQ EL RMN EN +LRDML  V+ +Y +L MHL++LM
Subjt:  IIKDDKASTPPPPPTTSFNL-VNTGLHLLTANTGSDQSTV----DDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLM

Query:  -QQKQQQQNHPSEPAHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIM
         QQ +  Q+HP           E KS +   +V  + VPRQFMDL PS       E+ E + NSSSDERTRSGS        + NN E ASK        
Subjt:  -QQKQQQQNHPSEPAHQREIGGEKKSTEIKHEVGKVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIM

Query:  PPNFDHENSKRSIPREDSPESESKGWGPNNHKTPRFNNSSNS-KPLDQST-EATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMA
              +N KR + RE+S ESE   W  +  K  + NNS+ S   +DQST +ATMRKARVSVRARSEA + +DGCQWRKYGQKMAKGNPCPRAYYRCTMA
Subjt:  PPNFDHENSKRSIPREDSPESESKGWGPNNHKTPRFNNSSNS-KPLDQST-EATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMA

Query:  VGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNP
        VGCPVRKQVQRCAED+TILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSAD  +MN NLLAR +LPCSS+MATISASAPFPT+TLDLTH+PNP
Subjt:  VGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSSADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNP

Query:  LQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAA
        LQFQRP  A FQ PFPG        QL Q +GQ +YNNQSKFSG + S +M  +SSHL          PAQ   + FAD +SAATAAITADPNFTAALAA
Subjt:  LQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQPATPAQPGGASFADTLSAATAAITADPNFTAALAA

Query:  AISSII---------------VNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNN
        AI+SII               VN    +R I FS      L L   + S  +SA+N+  +R+ LKK+K D N+   + LE+K G+ L SS+ K+ ++   
Subjt:  AISSII---------------VNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNN

Query:  LEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGD
        L +SKN DIV LKNYLDAQY+GEIGIGTPPQKFTVIFDTGSSNLWVPSAKC FS+AC+FH+KY+S  SSTY KNG  AAIQYG+GAISGFFSYDNV+VGD
Subjt:  LEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGD

Query:  AIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQ
         +V+ Q  IEAT    +TFM AKFDGILGLGFQEI+ G A PVWYNM+KQ L+K+ VFSFWLNRN  E+EGGE+VFGGVDP H+KG+HTYVP+T KGYWQ
Subjt:  AIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQ

Query:  FDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIEN
        FD+GD+LI  + T YC  GCSAIADSGTSLLAGPS +V  IN AIGA+ V   ECK +VSQY + IMDLLLA+ QP+K+CS++G+CTFD TH +S+ IE+
Subjt:  FDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIEN

Query:  VMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQC
        V+ D+  + S G  +AMC+ACEM V+W+Q +LKQN+TQE I+D VN+LCD+  +   E+ VDC  I  MP VSFTIG +VF+L  ++YILKVGEGSAAQC
Subjt:  VMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQEDIIDNVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQC

Query:  ISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA
        ISGF   DIPPPRGPLWILGD+FMG YHTVFD+GK RVGFAEAA
Subjt:  ISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVGFAEAA

SwissProt top hitse value%identityAlignment
O04057 Aspartic proteinase9.1e-19665.74Show/hide
Query:  LLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDT
        L L +L   +  +SASN+G LR+GLKKIK D  +R  A +ESK  + L ++  K+N  G NL ES + DIV LKNYLDAQYYGEI IGTPPQKFTVIFDT
Subjt:  LLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDT

Query:  GSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGG
        GSSNLWV   +C+FS+AC FHA+Y+S RSS+YKKNGTSA+I+YG+GA+SGFFSYDNV+VGD +V+ Q  IEAT   ++TF+ AKFDG+LGLGFQEIA G 
Subjt:  GSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGG

Query:  AVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVV
        AVPVWYNMV+Q LVKE VFSFWLNRN EE+EGGE+VFGGVDPKH++G+HTYVPVT KGYWQFD+GD+LI GE T +C GGCSAIADSGTSLLAGP+ ++ 
Subjt:  AVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSNIVV

Query:  SINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQE
         IN AIGA  V   +CKA+V+QY + IMDLLL++A P+KICS+I +CTFD T  VS+ IE+V+ +  G+SS    + MCS CEM V+W+Q++L+QN+T+E
Subjt:  SINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNKTQE

Query:  DIIDNVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVG
         II+ +NELCDR  +   ++ VDCG++S MP VSFTIG ++F+L  ++YILKVGEG  AQCISGF  FDIPPPRGPLWILGDVFMG YHTVFDFGK RVG
Subjt:  DIIDNVNELCDRGLN-QDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKARVG

Query:  FAEAA
         AEAA
Subjt:  FAEAA

O65390 Aspartic proteinase A12.0e-18762.88Show/hide
Query:  KVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEI
        K+  R ++ S L+VS LL        A +  N+G  R+GLKK+K D  +R  A +ESK+ K L +           L +S +AD+V LKNYLDAQYYGEI
Subjt:  KVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEI

Query:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF
         IGTPPQKFTV+FDTGSSNLWVPS+KC FSLAC  H KY+S RSSTY+KNG +AAI YG+GAI+GFFS D V VGD +V++QE IEAT    +TF+ AKF
Subjt:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF

Query:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA
        DGILGLGFQEI+ G A PVWYNM+KQ L+KE VFSFWLNRNA+E+EGGELVFGGVDP HFKG+HTYVPVT KGYWQFD+GD+LIGG  T +C  GCSAIA
Subjt:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA

Query:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA
        DSGTSLLAGP+ I+  IN AIGAA V   +CK +V QY + I+DLLL++ QP+KICS+IG+CTFD T  VS+ IE+V+  ++ + S G  +A CSACEMA
Subjt:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA

Query:  VLWIQDELKQNKTQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM
        V+WIQ +L+QN TQE I++ VNELC+R      E+ VDC ++S MP VS TIG +VF+L  ++Y+LKVGEG  AQCISGFI  D+ PPRGPLWILGDVFM
Subjt:  VLWIQDELKQNKTQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM

Query:  GPYHTVFDFGKARVGFAEAA
        G YHTVFDFG  +VGFAEAA
Subjt:  GPYHTVFDFGKARVGFAEAA

P42210 Phytepsin2.6e-18261.45Show/hide
Query:  LLVSLLLL--ILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKF
        LL ++LLL  +L  +SEA     EG +RI LKK   D+NSR    L   + + L S         N L   +  DIV LKNY++AQY+GEIG+GTPPQKF
Subjt:  LLVSLLLL--ILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKF

Query:  TVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQ
        TVIFDTGSSNLWVPSAKC FS+AC+ H++Y++G SSTYKKNG  AAIQYG+G+I+G+FS D+V VGD +V++QE IEAT    +TF+ AKFDGILGLGF+
Subjt:  TVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQ

Query:  EIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAG
        EI+ G AVPVWY M++Q LV + VFSFWLNR+ +E EGGE++FGG+DPKH+ G+HTYVPVT KGYWQFD+GD+L+GG++T +CAGGC+AIADSGTSLLAG
Subjt:  EIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAG

Query:  PSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELK
        P+ I+  IN  IGAA V   ECK IVSQY + I+DLLLA+ QP+KICS++G+CTFD T  VS  I +V+ D+  +S+G  ++ MCSACEMAV+W+Q++L 
Subjt:  PSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELK

Query:  QNKTQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDF
        QNKTQ+ I+D VN+LC+R      E+ VDCG +  MP++ FTIG + F L  ++YILKVGEG+AAQCISGF   DIPPPRGPLWILGDVFMGPYHTVFD+
Subjt:  QNKTQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDF

Query:  GKARVGFAEAA
        GK R+GFA+AA
Subjt:  GKARVGFAEAA

Q42456 Aspartic proteinase oryzasin-13.0e-18362.2Show/hide
Query:  LVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVI
        LV L  ++L     A++A  EG +RI LKK   D+NSR  A L  ++G       G ++  G   E     DIV LKNY++AQY+GEIG+GTPPQKFTVI
Subjt:  LVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVI

Query:  FDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIA
        FDTGSSNLWVPSAKC FS+ACFFH++Y+SG+SSTY+KNG  AAIQYG+G+I+GFFS D+V VGD +V++QE IEAT    +TFM AKFDGILGLGFQEI+
Subjt:  FDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIA

Query:  TGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSN
         G AVPVWY MV+Q LV E VFSFW NR+++E EGGE+VFGG+DP H+KG HTYVPV+ KGYWQF++GD+LIGG+TT +CA GCSAIADSGTSLLAGP+ 
Subjt:  TGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAGPSN

Query:  IVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNK
        I+  IN  IGA  V   ECK +VSQY + I+DLLLA+ QP KICS++G+CTFD  H VS  I++V+ D+ G S+G  S  MC+ACEMAV+W+Q++L QNK
Subjt:  IVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELKQNK

Query:  TQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKA
        TQ+ I++ +N+LCD+      E+ VDCG ++ MP +SFTIG + F L  ++YILKVGEG+AAQCISGF   DIPPPRGPLWILGDVFMG YHTVFD+GK 
Subjt:  TQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDFGKA

Query:  RVGFAEAA
        RVGFA++A
Subjt:  RVGFAEAA

Q8VYL3 Aspartic proteinase A21.4e-19163.2Show/hide
Query:  VTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNL-EESKNADIVPLKNYLDAQYYGEI
        V  R ++FS + VS LL    YS       N+G  R+GLKK+K D N+R      SK+ + L SS+  +N   NNL  +S +ADIVPLKNYLDAQYYGEI
Subjt:  VTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNL-EESKNADIVPLKNYLDAQYYGEI

Query:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF
         IGTPPQKFTVIFDTGSSNLWVPS KC FSL+C+FHAKY+S RSSTYKK+G  AAI YGSG+ISGFFSYD V VGD +V++QE IE TS   +TF+ AKF
Subjt:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF

Query:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA
        DG+LGLGFQEIA G A PVWYNM+KQ L+K  VFSFWLNR+ + +EGGE+VFGGVDPKHF+G+HT+VPVT +GYWQFD+G++LI GE+T YC  GCSAIA
Subjt:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA

Query:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA
        DSGTSLLAGP+ +V  IN+AIGA+ V   +CK +V QY + I+DLLLA+ QP+KICS+IG+C +D TH VS+ IE+V+  ++ RSS G  +A C ACEMA
Subjt:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA

Query:  VLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM
        V+WIQ +L+QN TQE I++ +NE+C+R  + + E+ VDC ++S+MP VSFTIG +VF+L  ++Y+LK+GEG  AQCISGF   DIPPPRGPLWILGDVFM
Subjt:  VLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM

Query:  GPYHTVFDFGKARVGFAEA
        G YHTVFDFG  +VGFAEA
Subjt:  GPYHTVFDFGKARVGFAEA

Arabidopsis top hitse value%identityAlignment
AT1G11910.1 aspartic proteinase A11.4e-18862.88Show/hide
Query:  KVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEI
        K+  R ++ S L+VS LL        A +  N+G  R+GLKK+K D  +R  A +ESK+ K L +           L +S +AD+V LKNYLDAQYYGEI
Subjt:  KVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEI

Query:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF
         IGTPPQKFTV+FDTGSSNLWVPS+KC FSLAC  H KY+S RSSTY+KNG +AAI YG+GAI+GFFS D V VGD +V++QE IEAT    +TF+ AKF
Subjt:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF

Query:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA
        DGILGLGFQEI+ G A PVWYNM+KQ L+KE VFSFWLNRNA+E+EGGELVFGGVDP HFKG+HTYVPVT KGYWQFD+GD+LIGG  T +C  GCSAIA
Subjt:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA

Query:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA
        DSGTSLLAGP+ I+  IN AIGAA V   +CK +V QY + I+DLLL++ QP+KICS+IG+CTFD T  VS+ IE+V+  ++ + S G  +A CSACEMA
Subjt:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA

Query:  VLWIQDELKQNKTQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM
        V+WIQ +L+QN TQE I++ VNELC+R      E+ VDC ++S MP VS TIG +VF+L  ++Y+LKVGEG  AQCISGFI  D+ PPRGPLWILGDVFM
Subjt:  VLWIQDELKQNKTQEDIIDNVNELCDR-GLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM

Query:  GPYHTVFDFGKARVGFAEAA
        G YHTVFDFG  +VGFAEAA
Subjt:  GPYHTVFDFGKARVGFAEAA

AT1G62290.1 Saposin-like aspartyl protease family protein9.6e-19363.2Show/hide
Query:  VTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNL-EESKNADIVPLKNYLDAQYYGEI
        V  R ++FS + VS LL    YS       N+G  R+GLKK+K D N+R      SK+ + L SS+  +N   NNL  +S +ADIVPLKNYLDAQYYGEI
Subjt:  VTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNL-EESKNADIVPLKNYLDAQYYGEI

Query:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF
         IGTPPQKFTVIFDTGSSNLWVPS KC FSL+C+FHAKY+S RSSTYKK+G  AAI YGSG+ISGFFSYD V VGD +V++QE IE TS   +TF+ AKF
Subjt:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF

Query:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA
        DG+LGLGFQEIA G A PVWYNM+KQ L+K  VFSFWLNR+ + +EGGE+VFGGVDPKHF+G+HT+VPVT +GYWQFD+G++LI GE+T YC  GCSAIA
Subjt:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA

Query:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA
        DSGTSLLAGP+ +V  IN+AIGA+ V   +CK +V QY + I+DLLLA+ QP+KICS+IG+C +D TH VS+ IE+V+  ++ RSS G  +A C ACEMA
Subjt:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA

Query:  VLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM
        V+WIQ +L+QN TQE I++ +NE+C+R  + + E+ VDC ++S+MP VSFTIG +VF+L  ++Y+LK+GEG  AQCISGF   DIPPPRGPLWILGDVFM
Subjt:  VLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM

Query:  GPYHTVFDFGKARVGFAEA
        G YHTVFDFG  +VGFAEA
Subjt:  GPYHTVFDFGKARVGFAEA

AT1G62290.2 Saposin-like aspartyl protease family protein9.6e-19363.2Show/hide
Query:  VTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNL-EESKNADIVPLKNYLDAQYYGEI
        V  R ++FS + VS LL    YS       N+G  R+GLKK+K D N+R      SK+ + L SS+  +N   NNL  +S +ADIVPLKNYLDAQYYGEI
Subjt:  VTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNL-EESKNADIVPLKNYLDAQYYGEI

Query:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF
         IGTPPQKFTVIFDTGSSNLWVPS KC FSL+C+FHAKY+S RSSTYKK+G  AAI YGSG+ISGFFSYD V VGD +V++QE IE TS   +TF+ AKF
Subjt:  GIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKF

Query:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA
        DG+LGLGFQEIA G A PVWYNM+KQ L+K  VFSFWLNR+ + +EGGE+VFGGVDPKHF+G+HT+VPVT +GYWQFD+G++LI GE+T YC  GCSAIA
Subjt:  DGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIA

Query:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA
        DSGTSLLAGP+ +V  IN+AIGA+ V   +CK +V QY + I+DLLLA+ QP+KICS+IG+C +D TH VS+ IE+V+  ++ RSS G  +A C ACEMA
Subjt:  DSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMA

Query:  VLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM
        V+WIQ +L+QN TQE I++ +NE+C+R  + + E+ VDC ++S+MP VSFTIG +VF+L  ++Y+LK+GEG  AQCISGF   DIPPPRGPLWILGDVFM
Subjt:  VLWIQDELKQNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFM

Query:  GPYHTVFDFGKARVGFAEA
        G YHTVFDFG  +VGFAEA
Subjt:  GPYHTVFDFGKARVGFAEA

AT4G04460.1 Saposin-like aspartyl protease family protein1.8e-17859.49Show/hide
Query:  SHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKF
        S LLV LL  ++  S+ +   + +G +RIGLKK K D+++R  + L  K     GS     + +  N    +NAD+VPLKNYLDAQYYG+I IGTPPQKF
Subjt:  SHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKF

Query:  TVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQ
        TVIFDTGSSNLW+PS KC  S+AC+FH+KY++ +SS+Y+KNG  A+I+YG+GAISG+FS D+V+VGD +V+ QE IEATS   +TF+ AKFDGILGLGF+
Subjt:  TVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQ

Query:  EIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAG
        EI+ G + PVWYNMV++ LVKE +FSFWLNRN ++ EGGE+VFGGVDPKHFKG+HT+VPVT KGYWQFD+GD+ I G+ T YCA GCSAIADSGTSLL G
Subjt:  EIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAG

Query:  PSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELK
        PS ++  IN AIGA  +   ECKA+V QY + +++ LLA+  P+K+CS+IGVC +D T  VS+ I++V+ D    +SG  ++AMCSACEMA +W++ EL 
Subjt:  PSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELK

Query:  QNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDF
        QN+TQE I+    ELCD    Q+ ++ VDCGR+S MP V+F+IG R F+LT +DYI K+GEG  +QC SGF   DI PPRGPLWILGD+FMGPYHTVFD+
Subjt:  QNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDF

Query:  GKARVGFAEAA
        GK RVGFA+AA
Subjt:  GKARVGFAEAA

AT4G04460.2 Saposin-like aspartyl protease family protein6.3e-17659.3Show/hide
Query:  SHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKF
        S LLV LL  ++  S+ +   + +G +RIGLKK K D+++R  + L  K     GS     + +  N    +NAD+VPLKNYLDAQYYG+I IGTPPQKF
Subjt:  SHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSVGKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKF

Query:  TVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQ
        TVIFDTGSSNLW+PS KC  S+AC+FH+KY++ +SS+Y+KNG  A+I+YG+GAISG+FS D+V+VGD +V+ QE IEATS   +TF+ AKFDGILGLGF+
Subjt:  TVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDAIVRNQELIEATSMSTMTFMAAKFDGILGLGFQ

Query:  EIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAG
        EI+ G + PVWYNMV++ LVKE +FSFWLNRN ++ EGGE+VFGGVDPKHFKG+HT+VPVT KGYWQFD+GD+ I G+ T YCA GCSAIADSGTSLL G
Subjt:  EIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAIADSGTSLLAG

Query:  PSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELK
        PS ++  IN AIGA  +   ECKA+V QY + +++ LLA    +K+CS+IGVC +D T  VS+ I++V+ D    +SG  ++AMCSACEMA +W++ EL 
Subjt:  PSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSACEMAVLWIQDELK

Query:  QNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDF
        QN+TQE I+    ELCD    Q+ ++ VDCGR+S MP V+F+IG R F+LT +DYI K+GEG  +QC SGF   DI PPRGPLWILGD+FMGPYHTVFD+
Subjt:  QNKTQEDIIDNVNELCDRGLNQD-ETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFDF

Query:  GKARVGFAEAA
        GK RVGFA+AA
Subjt:  GKARVGFAEAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAAGGATGGGGTCTAACCCTTCGTGATTCTGATCATCAGTCCATTGGGTTCTTCTCAAACAAGCAACCACCACCCCCTCCCCCACCCACTCTCAATTCCTTTCA
AAGAATGTTTCAAGGTTTAGAATTCTCCGGAAAACTTGGCCACACCGACTCCACTCCCGACGATAATAATCGTTTGGCCGTTGAGGTCGATTTTTTCTCCTCTAAGAAAA
GAGTGGTTGATGATCTTCAGGCTGATCAAGACTCCAAACCTACTTCTACTACTTCTATCATTAAAGACGATAAAGCTTCGACCCCTCCGCCTCCCCCCACAACTTCCTTC
AATCTTGTTAACACTGGGTTGCATCTGTTAACTGCTAACACCGGAAGCGATCAATCAACCGTGGATGATGGAATTTCATCAGATGGTGAAAATAAACGAGCCAAAAACGA
GCTAGCGCAGCTACAAGTGGAGCTTCAGCGAATGAACGCGGAGAATCACAAGCTAAGGGACATGCTAAGCCATGTGAGCAACAGCTATAGTTCTTTACATATGCATCTCT
TGAGCTTAATGCAACAGAAGCAGCAACAACAAAATCACCCCTCCGAACCCGCTCATCAACGAGAGATTGGTGGGGAGAAGAAATCAACGGAAATAAAACATGAAGTTGGA
AAGGTAATGGTACCAAGACAATTTATGGATCTAGGACCAAGTGGAAATAACAACAACATAGGTGAAAGCGAAGAATTATTATGTAATTCCTCATCGGATGAAAGAACTCG
TTCTGGGTCTCCATTGAACATCAACAACAACAACAACAACAACAATACTGAAACCGCTTCCAAAAAAAGAGATCACGCGGAAATCATGCCTCCTAACTTCGATCACGAAA
ATTCCAAAAGATCCATCCCCAGAGAAGATAGCCCCGAATCAGAATCTAAGGGTTGGGGTCCCAACAACCACAAGACCCCCCGCTTCAACAATTCTTCTAATTCTAAACCA
CTTGACCAATCCACCGAAGCCACCATGCGTAAAGCCCGTGTCTCAGTCCGTGCTCGTTCCGAAGCTCCCATGATTTCCGATGGGTGTCAATGGCGAAAATATGGCCAAAA
AATGGCTAAAGGAAACCCTTGTCCACGAGCCTATTATCGCTGCACAATGGCTGTGGGTTGCCCTGTTCGAAAACAAGTTCAACGTTGTGCTGAAGATAGGACTATATTGA
TAACAACTTACGAAGGCAACCACAACCACCCACTTCCGCCGGCGGCGATGGCGATGGCGTCAACCACGACAGCGGCGGCTACTATGTTATTATCAGGGTCTATGTCAAGT
GCGGACCATAATTTAATGAACCCAAATTTATTAGCCCGAGCCATACTTCCATGTTCTTCAAGCATGGCTACAATTTCAGCTTCAGCTCCATTTCCAACCATCACATTAGA
CCTCACTCACACTCCAAACCCATTACAGTTTCAAAGACCCACCGCAGCGCCTTTCCAAGTGCCGTTCCCTGGCGGACAACCACCATCAGCCGCCGCCCAATTGCCTCAAG
TATTGGGACAAGCCTTGTATAACAATCAGTCAAAATTCTCCGGCCTACAGCTTTCTCATGAAATGGGAGCTAATTCCTCTCACTTGGGTCATCACCAAATTACACAACCC
GCGACTCCAGCTCAACCCGGTGGTGCTTCTTTTGCTGACACGTTGAGTGCCGCAACGGCTGCTATCACCGCTGATCCCAATTTCACAGCCGCTCTAGCCGCCGCTATCTC
CTCCATCATCGTAAACTGCAAAGTTACAATGAGAAAGATCAGCTTTAGTCACCTTTTGGTTTCTCTGTTACTTTTGATTCTACATTATTCCTCTGAAGCAACATCTGCTT
CCAATGAAGGGTTTTTAAGGATTGGATTGAAAAAGATTAAATCTGATCAAAACAGTCGGTTCAAAGCTTTGCTTGAGTCAAAGAAAGGAAAGTTTTTAGGATCTTCTGTT
GGAAAACATAATCAATGGGGTAATAATCTTGAAGAATCTAAAAATGCTGATATTGTACCATTAAAGAACTATTTGGATGCTCAATACTATGGAGAGATTGGCATTGGCAC
ACCACCTCAAAAGTTCACTGTAATTTTTGATACTGGAAGCTCTAATTTGTGGGTGCCATCTGCCAAATGTATTTTTTCGCTTGCTTGCTTTTTCCATGCTAAGTATCAAT
CAGGGCGGTCAAGCACATACAAAAAAAATGGAACATCTGCTGCTATTCAGTACGGTTCAGGAGCTATTTCTGGCTTCTTTAGTTACGACAACGTTCGAGTTGGTGATGCT
ATCGTTCGTAATCAGGAACTCATTGAGGCAACTAGCATGTCTACTATGACATTCATGGCCGCCAAATTTGATGGTATATTGGGACTTGGATTTCAAGAGATCGCGACTGG
TGGTGCTGTTCCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCAAGTTTTCTCGTTTTGGCTGAATCGTAATGCTGAAGAGAAAGAAGGAGGTGAACTTG
TGTTTGGAGGGGTCGATCCTAAGCACTTCAAAGGCCAACATACATATGTGCCTGTGACTGACAAAGGGTATTGGCAGTTCGACATTGGCGATATTCTTATTGGTGGTGAA
ACAACAAAATATTGTGCTGGCGGTTGCTCTGCAATTGCAGATTCTGGAACTTCTTTGTTGGCTGGTCCATCTAATATAGTGGTATCAATTAATAGAGCAATTGGAGCAGC
TGCAGTTGCACATCCAGAATGCAAGGCAATTGTTTCACAATATAGACGGGCTATTATGGATTTGCTTTTAGCCAAGGCACAACCAGAGAAGATATGCTCCAAAATTGGGG
TGTGTACCTTTGATGAAACTCATGATGTTAGTTTGAAAATTGAGAACGTGATGAGTGATAAAGATGGTAGATCGTCTGGTGGGTTCTCAGAAGCTATGTGCTCAGCTTGT
GAGATGGCGGTTTTGTGGATTCAAGATGAGCTGAAGCAGAACAAAACTCAAGAAGATATCATTGATAATGTCAATGAGCTATGCGACCGTGGGTTGAACCAAGATGAAAC
ATTGGTTGACTGTGGACGAATCTCTCAAATGCCTAACGTGTCCTTCACCATTGGCGATAGAGTTTTTGAGCTCACCTCAAAAGATTATATTCTGAAGGTGGGTGAGGGAT
CTGCAGCTCAATGCATCAGTGGATTCATACCTTTCGACATTCCTCCTCCTCGTGGACCCCTATGGATCTTGGGAGACGTTTTCATGGGACCTTATCATACAGTCTTTGAT
TTTGGCAAAGCGAGAGTCGGATTTGCCGAAGCTGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAAAGGATGGGGTCTAACCCTTCGTGATTCTGATCATCAGTCCATTGGGTTCTTCTCAAACAAGCAACCACCACCCCCTCCCCCACCCACTCTCAATTCCTTTCA
AAGAATGTTTCAAGGTTTAGAATTCTCCGGAAAACTTGGCCACACCGACTCCACTCCCGACGATAATAATCGTTTGGCCGTTGAGGTCGATTTTTTCTCCTCTAAGAAAA
GAGTGGTTGATGATCTTCAGGCTGATCAAGACTCCAAACCTACTTCTACTACTTCTATCATTAAAGACGATAAAGCTTCGACCCCTCCGCCTCCCCCCACAACTTCCTTC
AATCTTGTTAACACTGGGTTGCATCTGTTAACTGCTAACACCGGAAGCGATCAATCAACCGTGGATGATGGAATTTCATCAGATGGTGAAAATAAACGAGCCAAAAACGA
GCTAGCGCAGCTACAAGTGGAGCTTCAGCGAATGAACGCGGAGAATCACAAGCTAAGGGACATGCTAAGCCATGTGAGCAACAGCTATAGTTCTTTACATATGCATCTCT
TGAGCTTAATGCAACAGAAGCAGCAACAACAAAATCACCCCTCCGAACCCGCTCATCAACGAGAGATTGGTGGGGAGAAGAAATCAACGGAAATAAAACATGAAGTTGGA
AAGGTAATGGTACCAAGACAATTTATGGATCTAGGACCAAGTGGAAATAACAACAACATAGGTGAAAGCGAAGAATTATTATGTAATTCCTCATCGGATGAAAGAACTCG
TTCTGGGTCTCCATTGAACATCAACAACAACAACAACAACAACAATACTGAAACCGCTTCCAAAAAAAGAGATCACGCGGAAATCATGCCTCCTAACTTCGATCACGAAA
ATTCCAAAAGATCCATCCCCAGAGAAGATAGCCCCGAATCAGAATCTAAGGGTTGGGGTCCCAACAACCACAAGACCCCCCGCTTCAACAATTCTTCTAATTCTAAACCA
CTTGACCAATCCACCGAAGCCACCATGCGTAAAGCCCGTGTCTCAGTCCGTGCTCGTTCCGAAGCTCCCATGATTTCCGATGGGTGTCAATGGCGAAAATATGGCCAAAA
AATGGCTAAAGGAAACCCTTGTCCACGAGCCTATTATCGCTGCACAATGGCTGTGGGTTGCCCTGTTCGAAAACAAGTTCAACGTTGTGCTGAAGATAGGACTATATTGA
TAACAACTTACGAAGGCAACCACAACCACCCACTTCCGCCGGCGGCGATGGCGATGGCGTCAACCACGACAGCGGCGGCTACTATGTTATTATCAGGGTCTATGTCAAGT
GCGGACCATAATTTAATGAACCCAAATTTATTAGCCCGAGCCATACTTCCATGTTCTTCAAGCATGGCTACAATTTCAGCTTCAGCTCCATTTCCAACCATCACATTAGA
CCTCACTCACACTCCAAACCCATTACAGTTTCAAAGACCCACCGCAGCGCCTTTCCAAGTGCCGTTCCCTGGCGGACAACCACCATCAGCCGCCGCCCAATTGCCTCAAG
TATTGGGACAAGCCTTGTATAACAATCAGTCAAAATTCTCCGGCCTACAGCTTTCTCATGAAATGGGAGCTAATTCCTCTCACTTGGGTCATCACCAAATTACACAACCC
GCGACTCCAGCTCAACCCGGTGGTGCTTCTTTTGCTGACACGTTGAGTGCCGCAACGGCTGCTATCACCGCTGATCCCAATTTCACAGCCGCTCTAGCCGCCGCTATCTC
CTCCATCATCGTAAACTGCAAAGTTACAATGAGAAAGATCAGCTTTAGTCACCTTTTGGTTTCTCTGTTACTTTTGATTCTACATTATTCCTCTGAAGCAACATCTGCTT
CCAATGAAGGGTTTTTAAGGATTGGATTGAAAAAGATTAAATCTGATCAAAACAGTCGGTTCAAAGCTTTGCTTGAGTCAAAGAAAGGAAAGTTTTTAGGATCTTCTGTT
GGAAAACATAATCAATGGGGTAATAATCTTGAAGAATCTAAAAATGCTGATATTGTACCATTAAAGAACTATTTGGATGCTCAATACTATGGAGAGATTGGCATTGGCAC
ACCACCTCAAAAGTTCACTGTAATTTTTGATACTGGAAGCTCTAATTTGTGGGTGCCATCTGCCAAATGTATTTTTTCGCTTGCTTGCTTTTTCCATGCTAAGTATCAAT
CAGGGCGGTCAAGCACATACAAAAAAAATGGAACATCTGCTGCTATTCAGTACGGTTCAGGAGCTATTTCTGGCTTCTTTAGTTACGACAACGTTCGAGTTGGTGATGCT
ATCGTTCGTAATCAGGAACTCATTGAGGCAACTAGCATGTCTACTATGACATTCATGGCCGCCAAATTTGATGGTATATTGGGACTTGGATTTCAAGAGATCGCGACTGG
TGGTGCTGTTCCAGTGTGGTATAACATGGTTAAACAAAAACTTGTCAAGGAGCAAGTTTTCTCGTTTTGGCTGAATCGTAATGCTGAAGAGAAAGAAGGAGGTGAACTTG
TGTTTGGAGGGGTCGATCCTAAGCACTTCAAAGGCCAACATACATATGTGCCTGTGACTGACAAAGGGTATTGGCAGTTCGACATTGGCGATATTCTTATTGGTGGTGAA
ACAACAAAATATTGTGCTGGCGGTTGCTCTGCAATTGCAGATTCTGGAACTTCTTTGTTGGCTGGTCCATCTAATATAGTGGTATCAATTAATAGAGCAATTGGAGCAGC
TGCAGTTGCACATCCAGAATGCAAGGCAATTGTTTCACAATATAGACGGGCTATTATGGATTTGCTTTTAGCCAAGGCACAACCAGAGAAGATATGCTCCAAAATTGGGG
TGTGTACCTTTGATGAAACTCATGATGTTAGTTTGAAAATTGAGAACGTGATGAGTGATAAAGATGGTAGATCGTCTGGTGGGTTCTCAGAAGCTATGTGCTCAGCTTGT
GAGATGGCGGTTTTGTGGATTCAAGATGAGCTGAAGCAGAACAAAACTCAAGAAGATATCATTGATAATGTCAATGAGCTATGCGACCGTGGGTTGAACCAAGATGAAAC
ATTGGTTGACTGTGGACGAATCTCTCAAATGCCTAACGTGTCCTTCACCATTGGCGATAGAGTTTTTGAGCTCACCTCAAAAGATTATATTCTGAAGGTGGGTGAGGGAT
CTGCAGCTCAATGCATCAGTGGATTCATACCTTTCGACATTCCTCCTCCTCGTGGACCCCTATGGATCTTGGGAGACGTTTTCATGGGACCTTATCATACAGTCTTTGAT
TTTGGCAAAGCGAGAGTCGGATTTGCCGAAGCTGCTTGA
Protein sequenceShow/hide protein sequence
MDKGWGLTLRDSDHQSIGFFSNKQPPPPPPPTLNSFQRMFQGLEFSGKLGHTDSTPDDNNRLAVEVDFFSSKKRVVDDLQADQDSKPTSTTSIIKDDKASTPPPPPTTSF
NLVNTGLHLLTANTGSDQSTVDDGISSDGENKRAKNELAQLQVELQRMNAENHKLRDMLSHVSNSYSSLHMHLLSLMQQKQQQQNHPSEPAHQREIGGEKKSTEIKHEVG
KVMVPRQFMDLGPSGNNNNIGESEELLCNSSSDERTRSGSPLNINNNNNNNNTETASKKRDHAEIMPPNFDHENSKRSIPREDSPESESKGWGPNNHKTPRFNNSSNSKP
LDQSTEATMRKARVSVRARSEAPMISDGCQWRKYGQKMAKGNPCPRAYYRCTMAVGCPVRKQVQRCAEDRTILITTYEGNHNHPLPPAAMAMASTTTAAATMLLSGSMSS
ADHNLMNPNLLARAILPCSSSMATISASAPFPTITLDLTHTPNPLQFQRPTAAPFQVPFPGGQPPSAAAQLPQVLGQALYNNQSKFSGLQLSHEMGANSSHLGHHQITQP
ATPAQPGGASFADTLSAATAAITADPNFTAALAAAISSIIVNCKVTMRKISFSHLLVSLLLLILHYSSEATSASNEGFLRIGLKKIKSDQNSRFKALLESKKGKFLGSSV
GKHNQWGNNLEESKNADIVPLKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKCIFSLACFFHAKYQSGRSSTYKKNGTSAAIQYGSGAISGFFSYDNVRVGDA
IVRNQELIEATSMSTMTFMAAKFDGILGLGFQEIATGGAVPVWYNMVKQKLVKEQVFSFWLNRNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGE
TTKYCAGGCSAIADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKAIVSQYRRAIMDLLLAKAQPEKICSKIGVCTFDETHDVSLKIENVMSDKDGRSSGGFSEAMCSAC
EMAVLWIQDELKQNKTQEDIIDNVNELCDRGLNQDETLVDCGRISQMPNVSFTIGDRVFELTSKDYILKVGEGSAAQCISGFIPFDIPPPRGPLWILGDVFMGPYHTVFD
FGKARVGFAEAA