CuGenDBv2

CmoCh11G007450 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh11G007450
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: thiol protease aleurain-like
Genome location: Cmo_Chr11:3622306..3632302
RNA-Seq Expression: CmoCh11G007450
Synteny: CmoCh11G007450
Gene Ontology terms:
  GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
  GO:0005615 - extracellular space (cellular component)
  GO:0005764 - lysosome (cellular component)
  GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domains:
  IPR000169 - Cysteine peptidase, cysteine active site
  IPR000668 - Peptidase C1A, papain C-terminal
  IPR013201 - Cathepsin propeptide inhibitor domain (I29)
  IPR025660 - Cysteine peptidase, histidine active site
  IPR029399 - TMEM192 family
  IPR038765 - Papain-like cysteine peptidase superfamily
  IPR039417 - Papain-like cysteine endopeptidase


Homology
GenBank top hits (e-value, % identity, alignment)
KAF7149320.1 hypothetical protein RHSIM_Rhsim03G0227400 [Rhododendron simsii] (e-value 1.2e-228, identity 55.88%)
Query:  MAPRLLFVSSVLLVLSCAVAG-SVFDDSNPIR-MVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
        MA   L +S  LL+L+ AVAG S FD+ NPIR +VS+ LRE E+ VV VVG++R AL FARFAHRYGK YET EE+KLRF IF E+L+LIKS NR+GLSY
Subjt:  MAPRLLFVSSVLLVLSCAVAG-SVFDDSNPIR-MVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY

Query:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF
         + +N FADWTWE+F++HRLGAAQNCSAT KGNHKLT+ +LPE KDWR+ GIVSPVKDQGHCGSCWTFSTTGALEAAYRQA G+ +SLSEQQLVDCAGAF
Subjt:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF

Query:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP
        NNFGC+GGLPSQAFEYIK+NGGL TE AYPYTAK+GECK+ SEN             GAEDELKHAVAFVRPVSVAF+VV GFRLY  GVYTS+SCG++P
Subjt:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP

Query:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGP------------------KCCNLCFIPHCRLEFENGKASYRDIKVW----------------
                  DVNHAVLAVGYGVE+GV YWLIKNSWG                   + CNLC IPHC L    GK        W                
Subjt:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGP------------------KCCNLCFIPHCRLEFENGKASYRDIKVW----------------

Query:  ------VWWIT----STIQTWQRRGTPLRAQHHLKT-------------------------TRWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLML
              V  +T    S  Q  Q++   +  + H  T                              GYA LA  APWIFH I+ L+ PLLCSC V+LL++
Subjt:  ------VWWIT----STIQTWQRRGTPLRAQHHLKT-------------------------TRWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLML

Query:  TASLLFVIARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVM---------------------------VWEPQISALS---------IPLILRMIMLI
        T       +R  GYY FSQKLKHIVRLPFA TAYG    +L++                             +  +S+LS         I   LR+IMLI
Subjt:  TASLLFVIARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVM---------------------------VWEPQISALS---------IPLILRMIMLI

Query:  EAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFS
        E VCAG FM  YI Y+ +YNSL+SQPDVLKSLYSPLQ S+SLE LRYHD GRL+DQQMALLQYQRENLHFL+EE                          
Subjt:  EAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFS

Query:  LFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNK
                    ILRLQE LSKYERS+DG+TPQV    L   VDLAH+LAARDQELRT+SAE        MNQ+ SELRLARS+IAERD+EIQ +  TN 
Subjt:  LFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNK

Query:  QYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSS
        QYVEENERLRAILGEWSTRAAKLERALE ER+SN ELQ+++ T+ ++Q S
Subjt:  QYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSS

KAG5559118.1 hypothetical protein RHGRI_008890 [Rhododendron griersonianum] (e-value 1.6e-214, identity 51.51%)
Query:  MEMAPRLLFVSSVLLVLSCAVAG-SVFDDSNPIR-MVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGL
        M  +P  L +  +LL ++ AVAG S FD+ NPIR +VS+ LRE E+ VV VVG++R AL FARFAHRYGK YET EE+KLRF IF E+L+LIKS NR+GL
Subjt:  MEMAPRLLFVSSVLLVLSCAVAG-SVFDDSNPIR-MVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGL

Query:  SYKLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAG
        SY + +N FADWTWEEF + RLGAAQNCSAT KGNHKLT+ +LPE KDWR+ GIVSPVKDQGHCGSCWTFSTTGALEAAYRQA G+ +SLSEQQLVDCAG
Subjt:  SYKLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAG

Query:  AFNNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGS
        AFNNFGC+GGLPSQAFEYIK+NGGL TE AYPYTAK+GECK+ SEN             GAEDELKHAVAFVRPVSVAF+VV GFRLY  GVYTS+SCG+
Subjt:  AFNNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGS

Query:  SPQVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSW--------------GPKCCN---LCFIP--------HCRLEFENGKASYRDIKV--------
        +P          DVNHAVLAVGYGVE+GV YWL+KNSW              G   C    L  +P        H RL        +  + V        
Subjt:  SPQVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSW--------------GPKCCN---LCFIP--------HCRLEFENGKASYRDIKV--------

Query:  ----WVWWITSTI-----------------------------------------------------QTWQRRGTPLRAQHHLKTT---------------
              W+++S                                                       + WQ++ TPLR  H LK T               
Subjt:  ----WVWWITSTI-----------------------------------------------------QTWQRRGTPLRAQHHLKTT---------------

Query:  --------------RWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTASL-----------LFVIAR------------------LKGYYSFSQ
                       W+ GYA LA  APWIFH I+ L+ PLLCSC V+LL++T              ++ I R                  L+GYY FSQ
Subjt:  --------------RWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTASL-----------LFVIAR------------------LKGYYSFSQ

Query:  KLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSD
        KLKHIVRLPFA TAYGTAA+LLVMVW+P IS LSI ++LR+IMLIE VCAG FM  YI Y+ +YNSL+SQPDVLKSLYSPLQ S+SLE L          
Subjt:  KLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSD

Query:  QQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQE
          +A +                 ++ ++   W                         ILRLQE LSKYERS+DGSTPQ         VDLAH+LAARDQE
Subjt:  QQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQE

Query:  LRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSS
        LRTLSAE        MNQ+ SELRLARS+IAERD+EIQ++  TN QYVEENERLRAILGEWSTRAAKLERALE ER+SN ELQ+++ T+ ++Q S
Subjt:  LRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSS

KAG5559119.1 hypothetical protein RHGRI_008890 [Rhododendron griersonianum] (e-value 1.6e-217, identity 52.87%)
Query:  MEMAPRLLFVSSVLLVLSCAVAG-SVFDDSNPIR-MVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGL
        M  +P  L +  +LL ++ AVAG S FD+ NPIR +VS+ LRE E+ VV VVG++R AL FARFAHRYGK YET EE+KLRF IF E+L+LIKS NR+GL
Subjt:  MEMAPRLLFVSSVLLVLSCAVAG-SVFDDSNPIR-MVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGL

Query:  SYKLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAG
        SY + +N FADWTWEEF + RLGAAQNCSAT KGNHKLT+ +LPE KDWR+ GIVSPVKDQGHCGSCWTFSTTGALEAAYRQA G+ +SLSEQQLVDCAG
Subjt:  SYKLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAG

Query:  AFNNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGS
        AFNNFGC+GGLPSQAFEYIK+NGGL TE AYPYTAK+GECK+ SEN             GAEDELKHAVAFVRPVSVAF+VV GFRLY  GVYTS+SCG+
Subjt:  AFNNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGS

Query:  SPQVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSW--------------GPKCCN---LCFIP--------HCRLEFENGKASYRDIKV--------
        +P          DVNHAVLAVGYGVE+GV YWL+KNSW              G   C    L  +P        H RL        +  + V        
Subjt:  SPQVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSW--------------GPKCCN---LCFIP--------HCRLEFENGKASYRDIKV--------

Query:  ----WVWWITSTI-----------------------------------------------------QTWQRRGTPLRAQHHLKTT---------------
              W+++S                                                       + WQ++ TPLR  H LK T               
Subjt:  ----WVWWITSTI-----------------------------------------------------QTWQRRGTPLRAQHHLKTT---------------

Query:  --------------RWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTA----SLLFVI--ARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLV
                       W+ GYA LA  APWIFH I+ L+ PLLCSC V+LL++T      L++ +   RL+GYY FSQKLKHIVRLPFA TAYGTAA+LLV
Subjt:  --------------RWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTA----SLLFVI--ARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLV

Query:  MVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSN
        MVW+P IS LSI ++LR+IMLIE VCAG FM  YI Y+ +YNSL+SQPDVLKSLYSPLQ S+SLE L            +A +                 
Subjt:  MVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSN

Query:  LIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSEL
        ++ ++   W                         ILRLQE LSKYERS+DGSTPQ         VDLAH+LAARDQELRTLSAE        MNQ+ SEL
Subjt:  LIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSEL

Query:  RLARSVIAERDTEIQKLLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSS
        RLARS+IAERD+EIQ++  TN QYVEENERLRAILGEWSTRAAKLERALE ER+SN ELQ+++ T+ ++Q S
Subjt:  RLARSVIAERDTEIQKLLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSS

KAG5559120.1 hypothetical protein RHGRI_008890 [Rhododendron griersonianum] (e-value 4.1e-218, identity 54.45%)
Query:  MEMAPRLLFVSSVLLVLSCAVAG-SVFDDSNPIR-MVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGL
        M  +P  L +  +LL ++ AVAG S FD+ NPIR +VS+ LRE E+ VV VVG++R AL FARFAHRYGK YET EE+KLRF IF E+L+LIKS NR+GL
Subjt:  MEMAPRLLFVSSVLLVLSCAVAG-SVFDDSNPIR-MVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGL

Query:  SYKLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAG
        SY + +N FADWTWEEF + RLGAAQNCSAT KGNHKLT+ +LPE KDWR+ GIVSPVKDQGHCGSCWTFSTTGALEAAYRQA G+ +SLSEQQLVDCAG
Subjt:  SYKLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAG

Query:  AFNNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGS
        AFNNFGC+GGLPSQAFEYIK+NGGL TE AYPYTAK+GECK+ SEN             GAEDELKHAVAFVRPVSVAF+VV GFRLY  GVYTS+SCG+
Subjt:  AFNNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGS

Query:  SPQVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSW--------------GPKCCN---LCFIP--------HCRLEFENGKASYRDIKV--------
        +P          DVNHAVLAVGYGVE+GV YWL+KNSW              G   C    L  +P        H RL        +  + V        
Subjt:  SPQVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSW--------------GPKCCN---LCFIP--------HCRLEFENGKASYRDIKV--------

Query:  ----WVWWITSTI-----------------------------------------------------QTWQRRGTPLRAQHHLKTTRWKVGYAALAIGAPW
              W+++S                                                       + WQ++ TPLR  H LK T     YA LA  APW
Subjt:  ----WVWWITSTI-----------------------------------------------------QTWQRRGTPLRAQHHLKTTRWKVGYAALAIGAPW

Query:  IFHPIKHLVEPLLCSCDVVLLMLTA----SLLFVI--ARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLILRMIMLIEAVCAGS
        IFH I+ L+ PLLCSC V+LL++T      L++ +   RL+GYY FSQKLKHIVRLPFA TAYGTAA+LLVMVW+P IS LSI ++LR+IMLIE VCAG 
Subjt:  IFHPIKHLVEPLLCSCDVVLLMLTA----SLLFVI--ARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLILRMIMLIEAVCAGS

Query:  FMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFSLFLLERA
        FM  YI Y+ +YNSL+SQPDVLKSLYSPLQ S+SLE L            +A +                 ++ ++   W                    
Subjt:  FMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFSLFLLERA

Query:  HSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNKQYVEENE
             ILRLQE LSKYERS+DGSTPQ         VDLAH+LAARDQELRTLSAE        MNQ+ SELRLARS+IAERD+EIQ++  TN QYVEENE
Subjt:  HSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNKQYVEENE

Query:  RLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSS
        RLRAILGEWSTRAAKLERALE ER+SN ELQ+++ T+ ++Q S
Subjt:  RLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSS

KAG6428559.1 hypothetical protein SASPL_112811 [Salvia splendens] (e-value 1.3e-219, identity 56.63%)
Query:  LLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNH
        LLF+  ++   + A +GS   D NPIR V D L ELES +++ VG +R A+ FARFAHRYGK YE++EE++ RF +F E+L +I+S NR+GLSY +G+N 
Subjt:  LLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNH

Query:  FADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCN
        F D TW+EFKKHRLGAAQNCSAT  GNHKLT+ VLP   DWR  GIVSPVK+QG CGSCWTFS+TGALEAAY QA G+ ISLSEQQLVDCAGAFNNFGCN
Subjt:  FADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCN

Query:  GGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISY
        GGLPSQAFEYIKYNGGL TEAAYPYT K+G CKY SEN             GAEDELKHAVAFVRPVSVAF+VV GF+ Y+ GVYTS +CGS P      
Subjt:  GGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISY

Query:  FVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGPKCCNLCFIPHCRLEFENGKASYRDIKVWVWWITSTIQTWQRRGTPLRAQ-----------------HH
            DVNHAVLAVGYGVE+GVPYWLIKNSWG    +  +    ++E      +  D         S+  +  R G+   A                   H
Subjt:  FVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGPKCCNLCFIPHCRLEFENGKASYRDIKVWVWWITSTIQTWQRRGTPLRAQ-----------------HH

Query:  LKTTRWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTASLLFVIA-----------------RLKGYYSFSQKLKHIVRLPFAVTAYGTAALLL
         K TR    YA  A+G  WI   ++ L   LLCSC+++LL++T   L  +                   L+GYY FSQKLKHI+RLPFA  AYGTAA+LL
Subjt:  LKTTRWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTASLLFVIA-----------------RLKGYYSFSQKLKHIVRLPFAVTAYGTAALLL

Query:  VMVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYS
        VM W+  IS LSI ++LR+IML+EAVCAG FM +Y+ YV +YNSL+SQPD L SLYSPLQQ++ LE LRYHD GRLSDQQMALLQYQRENLHFL+EE   
Subjt:  VMVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYS

Query:  NLIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSE
                                           ILRLQE LSKYER++DGSTPQ         VDLAH+LA RDQELRTLSAE        MNQ+ SE
Subjt:  NLIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSE

Query:  LRLARSVIAERDTEIQKLLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSSRGVD
        LRLARS+IAERD+EIQ +  TN QY+EENERLRAILGEWS RAAKLERALE   +SN ELQ++IS+ K   ++  VD
Subjt:  LRLARSVIAERDTEIQKLLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSSRGVD

TrEMBL top hits (e-value, % identity, alignment)
A0A4D9A883 Uncharacterized protein (e-value 1.2e-207, identity 57.22%)
Query:  LLVLSCAVA--GSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWT
        LL+ S AVA  GS   D NPIR V D L ELES +++ VG +  A+ FARFAHRYGK YE++EE++ RF +F E+L +I+S NR+GLSY +G+N F D T
Subjt:  LLVLSCAVA--GSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWT

Query:  WEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPS
        W+EFKKHRLGAAQNCSAT  GNHKLT+ VLP   DWR  GIVSPVK+QG CGSCWTFS+TGALEAAY QA G+ ISLSEQQLVDCAGAFNNFGCNGGLPS
Subjt:  WEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPS

Query:  QAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSD
        QAFEYIKYNGGL TEAAYPYT K+G CKY SEN             GAEDELKHAVAFVRPVSVAF+VV GF+ Y+ GVYTS +CGS P          D
Subjt:  QAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSD

Query:  VNHAVLAVGYGVEDGVPYWLIKNSWGPKCCNLCFIPHCRLEFENGK---ASYRDIKVWVWWITSTIQTWQRRGTPLRAQHHLKTTRWKVGYAALAIGAPW
        VNHAVLAVGYGVE+GVPYWLIKNSWG    +  +      + E GK    S    K   + +T + Q +                          +GA +
Subjt:  VNHAVLAVGYGVEDGVPYWLIKNSWGPKCCNLCFIPHCRLEFENGK---ASYRDIKVWVWWITSTIQTWQRRGTPLRAQHHLKTTRWKVGYAALAIGAPW

Query:  IFHPIKHLVEPLLCSCDVVLLMLTASLLFVIARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYI
         F                               L+GYY FSQKLK I+RLPFA  AYGTAA+LLVM W+  IS LSI ++LR+IML+EAVCAG FM +Y+
Subjt:  IFHPIKHLVEPLLCSCDVVLLMLTASLLFVIARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYI

Query:  SYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKI
         YV +YNSL+SQPDVL SLYSPLQQ++ LE LRYHD GRLSDQQMALLQYQRENLHFL+EE                                      I
Subjt:  SYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKI

Query:  LRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNKQYVEENERLRAIL
        LRLQE LSKYER++DGSTPQ         VDLAH+LA RDQELRTLSAE        MNQ+ SELRLARS+IAERD+E+Q +  TN QY+EENERLRAIL
Subjt:  LRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNKQYVEENERLRAIL

Query:  GEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSSRGVD
        GEWS RAAKLERALE   +SN ELQ++IS+ K   ++  VD
Subjt:  GEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSSRGVD

A0A4D9BI93 Uncharacterized protein (e-value 1.4e-219, identity 57.24%)
Query:  LLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNH
        LLF+  ++   + A +GS   D NPIR V D L ELES +++ VG +R A+ FARFAHRYGK YE++EE++ RF +F E+L +I+S NR+GLSY +G+N 
Subjt:  LLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNH

Query:  FADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCN
        F D TW+EFKKHRLGAAQNCSAT  GNHKLT+ VLP   DWR  GIVSPVK+QG CGSCWTFS+TGALEAAY QA G+ ISLSEQQLVDCAGAFNNFGCN
Subjt:  FADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCN

Query:  GGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISY
        GGLPSQAFEYIKYNGGL TEAAYPYT K+G CKY SEN             GAEDELKHAVAFVRPVSVAF+VV GF+ Y+ GVYTS +CGS P      
Subjt:  GGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISY

Query:  FVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGPKCCNLCFIPHCRLEFENGKASYRDIKVWVWWITSTIQTWQRRGTPLRAQHHLKTTRWKVGYAALAIGA
            DVNHAVLAVGYGVE+GVPYWLIKNSWG                + G   Y  +++      S     +      R++       +  GYA  A+G 
Subjt:  FVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGPKCCNLCFIPHCRLEFENGKASYRDIKVWVWWITSTIQTWQRRGTPLRAQHHLKTTRWKVGYAALAIGA

Query:  PWIFHPIKHLVEPLLCSCDVVLLMLTASLLFVIA-----------------RLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLIL
         WI   ++ L   LLCSC+++LL++T   L  +                   L+GYY FSQKLKHI+RLPFA  AYGTAA+LLVM W+  IS LSI ++L
Subjt:  PWIFHPIKHLVEPLLCSCDVVLLMLTASLLFVIA-----------------RLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLIL

Query:  RMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLP
        R+IML+EAVCAG FM +Y+ YV +YNSL+SQPD L SLYSPLQQ++ LE LRYHD GRLSDQQMALLQYQRENLHFL+EE                    
Subjt:  RMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLEDLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLP

Query:  FVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQK
                          ILRLQE LSKYER++DGSTPQ         VDLAH+LA RDQELRTLSAE        MNQ+ SELRLARS+IAERD+EIQ 
Subjt:  FVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHMLAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQK

Query:  LLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSSRGVD
        +  TN QY+EENERLRAILGEWS RAAKLERALE   +SN ELQ++IS+ K   ++  VD
Subjt:  LLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSSRGVD

A0A5A7UIP1 Thiol protease aleurain-like (e-value 8.8e-158, identity 83.28%)
Query:  MAPRLLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKL
        MA RL FVSSVLL+LSCAVAGSVFDDSNPIRMVSDRLRELE EVVRV+G   HALRFARFAHRYGK+YETAEE+K RFGIFLESLELIKSTN+QGLSYKL
Subjt:  MAPRLLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKL

Query:  GLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNN
        G+N FADWTWEEFKKHRLGAAQNCSAT KG+HKLT+ V PESKDWR DGIVSPVKDQGHCGSCWTFSTTGALEAAY QAHG+G+SLSEQQLVDCAGAFNN
Subjt:  GLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNN

Query:  FGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQV
        FGCNGGLPSQAFEYIKYNGGL TEAAYPYT K+G+CK++SEN             GAEDELKHAVAFVRPVSVAF+VV GFRLYSKGVYTSNSCGS+P  
Subjt:  FGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQV

Query:  SISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG
                DVNHAVLAVGYGVEDG+PYWLIKNSWG
Subjt:  SISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG

A0A6J1EZ10 thiol protease aleurain-like (e-value 1.5e-178, identity 93.51%)
Query:  MEMAPRLLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
        MEMAPRLLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
Subjt:  MEMAPRLLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY

Query:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF
        KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF
Subjt:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF

Query:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP
        NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN             GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP
Subjt:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP

Query:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGPK
        Q         DVNHAVLAVGYGVEDGVPYWLIKNSWGPK
Subjt:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGPK

A0A6J1HSV8 thiol protease aleurain-like (e-value 5.5e-176, identity 92.33%)
Query:  MEMAPRLLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
        MEMA RLLFVSSVLL+LSCAVA SVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
Subjt:  MEMAPRLLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY

Query:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF
        KLGLNHFADW+WEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF
Subjt:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF

Query:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP
        NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN             GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP
Subjt:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP

Query:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGPK
        Q         DVNHAVLAVGYGVEDGVPYWLIKNSWGPK
Subjt:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWGPK

SwissProt top hits (e-value, % identity, alignment)
A0A072UTP9 Pro-cathepsin H (e-value 8.0e-132, identity 70.21%)
Query:  SVLLVLSC---AVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFA
        ++L+V  C   A AG  F DSNPIRMVSD    +E ++++V+G +RHA+ FARFA+RYGKRY+T +E+K RF IF E+L+LIKSTN++ L Y LG+NHFA
Subjt:  SVLLVLSC---AVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFA

Query:  DWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGG
        DWTWEEF+ HRLGAAQNCSAT KGNH++T+VVLP  KDWR +GIVS VKDQGHCGSCWTFSTTGALE+AY QA G+ ISLSEQQLVDCAGA+NNFGCNGG
Subjt:  DWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGG

Query:  LPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFV
        LPSQAFEYIKYNGGL TE AYPYT +NG CK+ SEN             GAEDELKHAVAF RPVSVAFQVV  FRLY KGVYTS +CGS+P        
Subjt:  LPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFV

Query:  GSDVNHAVLAVGYGVEDGVPYWLIKNSWG
          DVNHAVLAVGYG+EDGVPYWLIKNSWG
Subjt:  GSDVNHAVLAVGYGVEDGVPYWLIKNSWG

P25778 Oryzain gamma chain (e-value 8.3e-121, identity 66.37%)
Query:  LLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRE-LESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLN
        LL  ++V    + A A S FDDSNPIR V+D     LES V+  +G TR ALRFARFA R+GKRY  A E++ RF IF ESLEL++STNR+GL Y+LG+N
Subjt:  LLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRE-LESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLN

Query:  HFADWTWEEFKKHRLGAAQNCSATAKGNHKLTN-VVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFG
         FAD +WEEF+  RLGAAQNCSAT  GNH++ +   LPE+KDWR+DGIVSPVKDQGHCGSCWTFSTTG+LEAAY QA G+ +SLSEQQLVDCA A+NNFG
Subjt:  HFADWTWEEFKKHRLGAAQNCSATAKGNHKLTN-VVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFG

Query:  CNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSI
        C+GGLPSQAFEYIKYNGGL TE AYPYT  NG C Y  EN             GAEDELK+AV  VRPVSVAFQV+ GFR+Y  GVYTS+ CG+SP    
Subjt:  CNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSI

Query:  SYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG
              DVNHAVLAVGYGVE+GVPYWLIKNSWG
Subjt:  SYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG

Q10717 Cysteine proteinase 2 (e-value 4.4e-122, identity 66.96%)
Query:  MAPRLLFVSSVLLVL-SCAVAGSVFDDSNPIRMVSDRLRE-LESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
        M PR LFV +V+++  + AV  S F DSNPIR V+DR    LES V   +G TR ALRFARFA RYGK YE+A E+  RF IF ESL+L++STNR+GLSY
Subjt:  MAPRLLFVSSVLLVL-SCAVAGSVFDDSNPIRMVSDRLRE-LESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY

Query:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKL--TNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAG
        +LG+N FAD +WEEF+  RLGAAQNCSAT  GNH++    V LPE+KDWR+DGIVSPVK+QGHCGSCWTFSTTGALEAAY QA G+ ISLSEQQLVDC  
Subjt:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKL--TNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAG

Query:  AFNNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGS
        AFNNFGCNGGLPSQAFEYIKYNGGL TE +YPY   NG CK+ +EN             GAEDELK AV  VRPVSVAF+V+ GFRLY  GVYTS+ CG+
Subjt:  AFNNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGS

Query:  SPQVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG
        +P          DVNHAVLAVGYGVEDGVPYWLIKNSWG
Subjt:  SPQVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG

Q8H166 Thiol protease aleurain (e-value 2.2e-129, identity 68.92%)
Query:  VLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWTW
        V+LV + A A   FD+SNPIRMVSD LRE+E  V +++G +RH L FARF HRYGK+Y+  EE+KLRF IF E+L+LI+STN++GLSYKLG+N FAD TW
Subjt:  VLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWTW

Query:  EEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPSQ
        +EF++ +LGAAQNCSAT KG+HK+T   LPE+KDWR+DGIVSPVKDQG CGSCWTFSTTGALEAAY QA G+GISLSEQQLVDCAGAFNN+GCNGGLPSQ
Subjt:  EEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPSQ

Query:  AFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDV
        AFEYIK NGGL TE AYPYT K+  CK+ +EN             GAEDELKHAV  VRPVS+AF+V+  FRLY  GVYT + CGS+P          DV
Subjt:  AFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDV

Query:  NHAVLAVGYGVEDGVPYWLIKNSWG
        NHAVLAVGYGVEDGVPYWLIKNSWG
Subjt:  NHAVLAVGYGVEDGVPYWLIKNSWG

Q8RWQ9 Thiol protease aleurain-like (e-value 1.5e-130, identity 66.77%)
Query:  MAPRLLFVSSVLLVLSCAVAGSV--FDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
        M+ +L   SS+LL+L  A A     FD+SNPI+MVSD L ELE  VV+++G +RH L F+RF HRYGK+Y++ EE+KLRF +F E+L+LI+STN++GLSY
Subjt:  MAPRLLFVSSVLLVLSCAVAGSV--FDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY

Query:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF
        KL LN FAD TW+EF++++LGAAQNCSAT KG+HK+T   +P++KDWR+DGIVSPVK+QGHCGSCWTFSTTGALEAAY QA G+GISLSEQQLVDCAG F
Subjt:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF

Query:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP
        NNFGC+GGLPSQAFEYIKYNGGL TE AYPYT K+G CK+ ++N             GAEDELKHAV  VRPVSVAF+VV  FR Y KGV+TSN+CG++P
Subjt:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP

Query:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG
                  DVNHAVLAVGYGVED VPYWLIKNSWG
Subjt:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG

Arabidopsis top hits (e-value, % identity, alignment)
AT3G45310.1 Cysteine proteinases superfamily protein (e-value 1.1e-131, identity 66.77%)
Query:  MAPRLLFVSSVLLVLSCAVAGSV--FDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
        M+ +L   SS+LL+L  A A     FD+SNPI+MVSD L ELE  VV+++G +RH L F+RF HRYGK+Y++ EE+KLRF +F E+L+LI+STN++GLSY
Subjt:  MAPRLLFVSSVLLVLSCAVAGSV--FDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY

Query:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF
        KL LN FAD TW+EF++++LGAAQNCSAT KG+HK+T   +P++KDWR+DGIVSPVK+QGHCGSCWTFSTTGALEAAY QA G+GISLSEQQLVDCAG F
Subjt:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF

Query:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP
        NNFGC+GGLPSQAFEYIKYNGGL TE AYPYT K+G CK+ ++N             GAEDELKHAV  VRPVSVAF+VV  FR Y KGV+TSN+CG++P
Subjt:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP

Query:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG
                  DVNHAVLAVGYGVED VPYWLIKNSWG
Subjt:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG

AT3G45310.2 Cysteine proteinases superfamily protein (e-value 1.1e-131, identity 66.77%)
Query:  MAPRLLFVSSVLLVLSCAVAGSV--FDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY
        M+ +L   SS+LL+L  A A     FD+SNPI+MVSD L ELE  VV+++G +RH L F+RF HRYGK+Y++ EE+KLRF +F E+L+LI+STN++GLSY
Subjt:  MAPRLLFVSSVLLVLSCAVAGSV--FDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSY

Query:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF
        KL LN FAD TW+EF++++LGAAQNCSAT KG+HK+T   +P++KDWR+DGIVSPVK+QGHCGSCWTFSTTGALEAAY QA G+GISLSEQQLVDCAG F
Subjt:  KLGLNHFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAF

Query:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP
        NNFGC+GGLPSQAFEYIKYNGGL TE AYPYT K+G CK+ ++N             GAEDELKHAV  VRPVSVAF+VV  FR Y KGV+TSN+CG++P
Subjt:  NNFGCNGGLPSQAFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSP

Query:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG
                  DVNHAVLAVGYGVED VPYWLIKNSWG
Subjt:  QVSISYFVGSDVNHAVLAVGYGVEDGVPYWLIKNSWG

AT5G60360.1 aleurain-like protease    e value: 1.5e-130    %identity: 68.92
Query:  VLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWTW
        V+LV + A A   FD+SNPIRMVSD LRE+E  V +++G +RH L FARF HRYGK+Y+  EE+KLRF IF E+L+LI+STN++GLSYKLG+N FAD TW
Subjt:  VLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWTW

Query:  EEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPSQ
        +EF++ +LGAAQNCSAT KG+HK+T   LPE+KDWR+DGIVSPVKDQG CGSCWTFSTTGALEAAY QA G+GISLSEQQLVDCAGAFNN+GCNGGLPSQ
Subjt:  EEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPSQ

Query:  AFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDV
        AFEYIK NGGL TE AYPYT K+  CK+ +EN             GAEDELKHAV  VRPVS+AF+V+  FRLY  GVYT + CGS+P          DV
Subjt:  AFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDV

Query:  NHAVLAVGYGVEDGVPYWLIKNSWG
        NHAVLAVGYGVEDGVPYWLIKNSWG
Subjt:  NHAVLAVGYGVEDGVPYWLIKNSWG

AT5G60360.2 aleurain-like protease    e value: 1.5e-130    %identity: 68.92
Query:  VLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWTW
        V+LV + A A   FD+SNPIRMVSD LRE+E  V +++G +RH L FARF HRYGK+Y+  EE+KLRF IF E+L+LI+STN++GLSYKLG+N FAD TW
Subjt:  VLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWTW

Query:  EEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPSQ
        +EF++ +LGAAQNCSAT KG+HK+T   LPE+KDWR+DGIVSPVKDQG CGSCWTFSTTGALEAAY QA G+GISLSEQQLVDCAGAFNN+GCNGGLPSQ
Subjt:  EEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPSQ

Query:  AFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDV
        AFEYIK NGGL TE AYPYT K+  CK+ +EN             GAEDELKHAV  VRPVS+AF+V+  FRLY  GVYT + CGS+P          DV
Subjt:  AFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDV

Query:  NHAVLAVGYGVEDGVPYWLIKNSWG
        NHAVLAVGYGVEDGVPYWLIKNSWG
Subjt:  NHAVLAVGYGVEDGVPYWLIKNSWG

AT5G60360.3 aleurain-like protease    e value: 2.8e-132    %identity: 64.71
Query:  VLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWTW
        V+LV + A A   FD+SNPIRMVSD LRE+E  V +++G +RH L FARF HRYGK+Y+  EE+KLRF IF E+L+LI+STN++GLSYKLG+N FAD TW
Subjt:  VLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLNHFADWTW

Query:  EEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPSQ
        +EF++ +LGAAQNCSAT KG+HK+T   LPE+KDWR+DGIVSPVKDQG CGSCWTFSTTGALEAAY QA G+GISLSEQQLVDCAGAFNN+GCNGGLPSQ
Subjt:  EEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLPSQ

Query:  AFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDV
        AFEYIK NGGL TE AYPYT K+  CK+ +EN             GAEDELKHAV  VRPVS+AF+V+  FRLY  GVYT + CGS+P          DV
Subjt:  AFEYIKYNGGLATEAAYPYTAKNGECKYMSEN-------------GAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDV

Query:  NHAVLAVGYGVEDGVPYWLIKNSWGP------------------KCCNLCFIPHCRL
        NHAVLAVGYGVEDGVPYWLIKNSWG                   K C +C IP C L
Subjt:  NHAVLAVGYGVEDGVPYWLIKNSWGP------------------KCCNLCFIPHCRL


Sequences
CDS sequence
ATGGAAATGGCTCCCCGTTTGCTCTTCGTCTCCTCTGTTCTTCTGGTTCTTTCCTGCGCTGTTGCCGGATCCGTCTTCGATGATTCCAATCCCATTCGAATGGTC
TCTGATCGTCTCCGGGAGTTGGAGTCGGAGGTCGTTCGAGTCGTTGGTCACACTCGTCACGCTCTCCGATTCGCTCGATTCGCTCACAGGTATGGGAAGAGGTAT
GAGACGGCGGAGGAGTTGAAGCTTCGTTTCGGAATTTTCTTGGAGAGTTTGGAACTGATCAAATCGACTAATAGACAAGGCCTTTCTTACAAACTTGGTCTCAAT
CACTTTGCGGATTGGACGTGGGAAGAGTTCAAGAAACACAGGCTAGGAGCTGCTCAAAACTGCTCTGCTACCGCGAAAGGCAACCACAAACTGACTAATGTCGTT
CTTCCTGAATCGAAAGATTGGAGAGATGATGGCATTGTCAGCCCTGTTAAAGATCAAGGCCACTGTGGTTCTTGCTGGACATTCAGTACAACTGGAGCTCTTGAA
GCGGCCTATAGGCAAGCACATGGAGAGGGAATCTCCCTGTCCGAGCAGCAGCTGGTGGACTGTGCTGGAGCTTTTAACAACTTTGGCTGTAATGGTGGATTGCCT
TCCCAAGCTTTTGAATACATCAAGTACAATGGTGGCCTTGCCACTGAAGCAGCATATCCTTACACTGCAAAGAACGGCGAATGCAAATACATGTCTGAGAATGGC
GCTGAAGATGAATTGAAGCATGCAGTTGCTTTTGTTCGACCAGTAAGCGTAGCATTTCAAGTGGTGAAAGGTTTTCGCTTATATTCAAAAGGAGTTTACACCAGT
AACTCATGCGGCAGTTCTCCTCAGGTTAGCATTAGCTATTTTGTTGGCTCTGATGTAAACCACGCCGTGCTTGCAGTTGGTTATGGGGTTGAAGATGGTGTCCCA
TACTGGCTTATAAAGAACTCATGGGGACCAAAGTGTTGCAACTTGTGCTTCATACCCCATTGTCGCTTAGAGTTCGAGAACGGCAAAGCTTCTTACCGCGATATT
AAGGTATGGGTATGGTGGATCACATCTACAATCCAAACATGGCAGCGGAGAGGCACGCCTCTTCGCGCGCAACATCATCTCAAGACAACGCGATGGAAAGTTGGC
TATGCTGCTCTGGCTATTGGAGCACCATGGATTTTTCATCCTATAAAGCACTTGGTTGAACCGTTGCTCTGCAGTTGTGATGTTGTTCTGTTGATGCTCACAGCC
AGTTTACTTTTCGTCATTGCAAGGCTAAAAGGTTATTATAGCTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTCGCAGTTACTGCATATGGAACTGCT
GCCCTTTTACTCGTCATGGTATGGGAACCTCAAATCAGTGCACTTTCGATTCCCCTAATTCTGAGGATGATTATGTTAATTGAAGCAGTATGTGCTGGATCGTTT
ATGGTTATATATATCAGTTATGTACAAAAGTACAATTCATTAAATTCTCAGCCTGATGTTTTGAAGTCATTGTATTCCCCACTTCAGCAATCAACTTCTTTGGAA
GATCTAAGGTATCATGATGTCGGACGACTTTCTGATCAGCAAATGGCTCTGTTGCAATATCAGCGAGAGAACCTTCATTTTCTAAATGAGGAGGTATATAGTAAC
CTTATACACATACTTACCTCAAGATGGTTTGGCACCCAGCCGCTTCCTTTCGTCATGCTCTTCTCCCTGTTCCTATTGGAAAGAGCTCATTCAATTGGTAAAATT
CTTCGGTTGCAAGAGTGCTTAAGTAAATATGAACGGTCTAGCGATGGAAGCACTCCTCAGGTGATCATTGTTAATCTGATATTTGATGTTGACCTTGCCCATATG
CTAGCTGCTCGTGACCAGGAATTGAGGACACTTTCAGCTGAGGTATGTCTTGCATACAGTACTATGATGAATCAGGTAACATCAGAACTTAGGCTTGCTCGATCT
GTGATAGCTGAGAGGGATACCGAGATTCAGAAATTACTCATCACCAACAAGCAGTATGTAGAAGAAAATGAAAGACTGAGAGCTATTCTAGGAGAATGGAGTACA
CGAGCTGCTAAGCTTGAGAGAGCGCTTGAAGGCGAGCGTTTATCAAATAATGAATTGCAAAGGAGGATTTCAACACTAAAAAAGCATCAGAGCAGCAGGGGAGTT
GATGCCTAA
mRNA sequence
AAATTGCTTCCATCAGATTCCCCATTTTCCACTGCTAAATAGCGCATTGGGTTTCGTCTTCTTTCCCAACTCTGTGGAGGGAAGAAGACTGAGAGTGAGAAGCTG
CCCATGGAAATGGCTCCCCGTTTGCTCTTCGTCTCCTCTGTTCTTCTGGTTCTTTCCTGCGCTGTTGCCGGATCCGTCTTCGATGATTCCAATCCCATTCGAATG
GTCTCTGATCGTCTCCGGGAGTTGGAGTCGGAGGTCGTTCGAGTCGTTGGTCACACTCGTCACGCTCTCCGATTCGCTCGATTCGCTCACAGGTATGGGAAGAGG
TATGAGACGGCGGAGGAGTTGAAGCTTCGTTTCGGAATTTTCTTGGAGAGTTTGGAACTGATCAAATCGACTAATAGACAAGGCCTTTCTTACAAACTTGGTCTC
AATCACTTTGCGGATTGGACGTGGGAAGAGTTCAAGAAACACAGGCTAGGAGCTGCTCAAAACTGCTCTGCTACCGCGAAAGGCAACCACAAACTGACTAATGTC
GTTCTTCCTGAATCGAAAGATTGGAGAGATGATGGCATTGTCAGCCCTGTTAAAGATCAAGGCCACTGTGGTTCTTGCTGGACATTCAGTACAACTGGAGCTCTT
GAAGCGGCCTATAGGCAAGCACATGGAGAGGGAATCTCCCTGTCCGAGCAGCAGCTGGTGGACTGTGCTGGAGCTTTTAACAACTTTGGCTGTAATGGTGGATTG
CCTTCCCAAGCTTTTGAATACATCAAGTACAATGGTGGCCTTGCCACTGAAGCAGCATATCCTTACACTGCAAAGAACGGCGAATGCAAATACATGTCTGAGAAT
GGCGCTGAAGATGAATTGAAGCATGCAGTTGCTTTTGTTCGACCAGTAAGCGTAGCATTTCAAGTGGTGAAAGGTTTTCGCTTATATTCAAAAGGAGTTTACACC
AGTAACTCATGCGGCAGTTCTCCTCAGGTTAGCATTAGCTATTTTGTTGGCTCTGATGTAAACCACGCCGTGCTTGCAGTTGGTTATGGGGTTGAAGATGGTGTC
CCATACTGGCTTATAAAGAACTCATGGGGACCAAAGTGTTGCAACTTGTGCTTCATACCCCATTGTCGCTTAGAGTTCGAGAACGGCAAAGCTTCTTACCGCGAT
ATTAAGGTATGGGTATGGTGGATCACATCTACAATCCAAACATGGCAGCGGAGAGGCACGCCTCTTCGCGCGCAACATCATCTCAAGACAACGCGATGGAAAGTT
GGCTATGCTGCTCTGGCTATTGGAGCACCATGGATTTTTCATCCTATAAAGCACTTGGTTGAACCGTTGCTCTGCAGTTGTGATGTTGTTCTGTTGATGCTCACA
GCCAGTTTACTTTTCGTCATTGCAAGGCTAAAAGGTTATTATAGCTTTAGCCAGAAGTTAAAGCATATTGTTCGTCTACCTTTCGCAGTTACTGCATATGGAACT
GCTGCCCTTTTACTCGTCATGGTATGGGAACCTCAAATCAGTGCACTTTCGATTCCCCTAATTCTGAGGATGATTATGTTAATTGAAGCAGTATGTGCTGGATCG
TTTATGGTTATATATATCAGTTATGTACAAAAGTACAATTCATTAAATTCTCAGCCTGATGTTTTGAAGTCATTGTATTCCCCACTTCAGCAATCAACTTCTTTG
GAAGATCTAAGGTATCATGATGTCGGACGACTTTCTGATCAGCAAATGGCTCTGTTGCAATATCAGCGAGAGAACCTTCATTTTCTAAATGAGGAGGTATATAGT
AACCTTATACACATACTTACCTCAAGATGGTTTGGCACCCAGCCGCTTCCTTTCGTCATGCTCTTCTCCCTGTTCCTATTGGAAAGAGCTCATTCAATTGGTAAA
ATTCTTCGGTTGCAAGAGTGCTTAAGTAAATATGAACGGTCTAGCGATGGAAGCACTCCTCAGGTGATCATTGTTAATCTGATATTTGATGTTGACCTTGCCCAT
ATGCTAGCTGCTCGTGACCAGGAATTGAGGACACTTTCAGCTGAGGTATGTCTTGCATACAGTACTATGATGAATCAGGTAACATCAGAACTTAGGCTTGCTCGA
TCTGTGATAGCTGAGAGGGATACCGAGATTCAGAAATTACTCATCACCAACAAGCAGTATGTAGAAGAAAATGAAAGACTGAGAGCTATTCTAGGAGAATGGAGT
ACACGAGCTGCTAAGCTTGAGAGAGCGCTTGAAGGCGAGCGTTTATCAAATAATGAATTGCAAAGGAGGATTTCAACACTAAAAAAGCATCAGAGCAGCAGGGGA
GTTGATGCCTAACCGCCTTCCACAGGGTAAATTTATTCTCCATTGAATATAGATTGAGCAGCAGCAGATTAGACCGAGTTTTCTCCTGCGTTATGGTCTCCAACA
TTTGACCAAATGCAGGCAGCAGCAGCTGTAAAGGAATGAAGGTTGTTTTTTTCTTTCTTTCTTTCATTCAAGATCCTTTAGATTTATTTTATTTTATTTTATCCT
CCTCTCCTCTCTCTTTTGGACTCGCGAGTCATGACTAAGCATCAATTTCTTTGATTCCTCACGACACGAAATGACCAGAAATTTATTGTAAAACTAGATAAATCA
GGAATGAACTATTTCAATGTTATTGGATAATCGTGTTGCTTTTTCATAAGTGAGGTGATGAAATAGAATATGTGCTGATTTTAT
Protein sequence
MEMAPRLLFVSSVLLVLSCAVAGSVFDDSNPIRMVSDRLRELESEVVRVVGHTRHALRFARFAHRYGKRYETAEELKLRFGIFLESLELIKSTNRQGLSYKLGLN
HFADWTWEEFKKHRLGAAQNCSATAKGNHKLTNVVLPESKDWRDDGIVSPVKDQGHCGSCWTFSTTGALEAAYRQAHGEGISLSEQQLVDCAGAFNNFGCNGGLP
SQAFEYIKYNGGLATEAAYPYTAKNGECKYMSENGAEDELKHAVAFVRPVSVAFQVVKGFRLYSKGVYTSNSCGSSPQVSISYFVGSDVNHAVLAVGYGVEDGVP
YWLIKNSWGPKCCNLCFIPHCRLEFENGKASYRDIKVWVWWITSTIQTWQRRGTPLRAQHHLKTTRWKVGYAALAIGAPWIFHPIKHLVEPLLCSCDVVLLMLTA
SLLFVIARLKGYYSFSQKLKHIVRLPFAVTAYGTAALLLVMVWEPQISALSIPLILRMIMLIEAVCAGSFMVIYISYVQKYNSLNSQPDVLKSLYSPLQQSTSLE
DLRYHDVGRLSDQQMALLQYQRENLHFLNEEVYSNLIHILTSRWFGTQPLPFVMLFSLFLLERAHSIGKILRLQECLSKYERSSDGSTPQVIIVNLIFDVDLAHM
LAARDQELRTLSAEVCLAYSTMMNQVTSELRLARSVIAERDTEIQKLLITNKQYVEENERLRAILGEWSTRAAKLERALEGERLSNNELQRRISTLKKHQSSRGV
DA