; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg030856 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg030856
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptioncathepsin B-like
Genome locationscaffold11:25790557..25801155
RNA-Seq ExpressionSpg030856
SyntenySpg030856
Gene Ontology termsGO:0050790 - regulation of catalytic activity (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000668 - Peptidase C1A, papain C-terminal
IPR012599 - Peptidase C1A, propeptide
IPR025660 - Cysteine peptidase, histidine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008465335.1 PREDICTED: cathepsin B-like isoform X1 [Cucumis melo]1.5e-14973.37Show/hide
Query:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL
        +QV+AEEQVLKFK +ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++ 
Subjt:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL

Query:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD
         GH      F  A   +   F  + D +NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVD
Subjt:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD

Query:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL
        KNQIW+K+KHYGVNAY+I+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+       
Subjt:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL

Query:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                         DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_008465336.1 PREDICTED: cathepsin B-like isoform X2 [Cucumis melo]1.9e-14973.57Show/hide
Query:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLA
        QV+AEEQVLKFK +ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++  
Subjt:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLA

Query:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK
        GH      F  A   +   F  + D +NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVDK
Subjt:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK

Query:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC
        NQIW+K+KHYGVNAY+I+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+        
Subjt:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC

Query:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                        DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_011652326.1 cathepsin B-like protease 2 isoform X1 [Cucumis sativus]7.2e-14972.83Show/hide
Query:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL
        +QVYAEEQVLKFK +ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++ 
Subjt:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL

Query:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD
         GH      F  A   +   F  + D +NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVD
Subjt:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD

Query:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL
        KNQIW+K+KHYGV+AY+++ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGD       
Subjt:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL

Query:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                         DGYFKIRRGTNECGIEEDVVAGLPS +NIAREAAI
Subjt:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_038903448.1 cathepsin B-like protease 2 isoform X1 [Benincasa hispida]4.5e-15173.91Show/hide
Query:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL
        +QVYAEEQVL+FKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++ 
Subjt:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL

Query:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD
         GH      F  A   +   F  + D +NI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVD
Subjt:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD

Query:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL
        KNQIW+K+KHYGVNAY+I++DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGD       
Subjt:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL

Query:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                         DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

XP_038903449.1 cathepsin B-like protease 2 isoform X2 [Benincasa hispida]5.9e-15174.11Show/hide
Query:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLA
        QVYAEEQVL+FKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++  
Subjt:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLA

Query:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK
        GH      F  A   +   F  + D +NI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVDK
Subjt:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK

Query:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC
        NQIW+K+KHYGVNAY+I++DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGD        
Subjt:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC

Query:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                        DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

TrEMBL top hitse value%identityAlignment
A0A0A0LFN4 Pept_C1 domain-containing protein3.5e-14972.83Show/hide
Query:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL
        +QVYAEEQVLKFK +ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++ 
Subjt:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL

Query:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD
         GH      F  A   +   F  + D +NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVD
Subjt:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD

Query:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL
        KNQIW+K+KHYGV+AY+++ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGD       
Subjt:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL

Query:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                         DGYFKIRRGTNECGIEEDVVAGLPS +NIAREAAI
Subjt:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

A0A1S3CNJ5 cathepsin B-like isoform X17.1e-15073.37Show/hide
Query:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL
        +QV+AEEQVLKFK +ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++ 
Subjt:  KQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVL

Query:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD
         GH      F  A   +   F  + D +NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVD
Subjt:  AGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVD

Query:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL
        KNQIW+K+KHYGVNAY+I+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+       
Subjt:  KNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDIL

Query:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                         DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  CTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

A0A1S3CNM3 cathepsin B-like isoform X29.2e-15073.57Show/hide
Query:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLA
        QV+AEEQVLKFK +ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++  
Subjt:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLA

Query:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK
        GH      F  A   +   F  + D +NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVDK
Subjt:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK

Query:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC
        NQIW+K+KHYGVNAY+I+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+        
Subjt:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC

Query:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                        DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

A0A5A7U7U4 Cathepsin B-like isoform X29.2e-15073.57Show/hide
Query:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLA
        QV+AEEQVLKFK +ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFK+LLGVKQTPEKDLKSTPVLSHPKSL            P     GT++  
Subjt:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLA

Query:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK
        GH      F  A   +   F  + D +NITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE+CDPYFDTTGCSHPGCEPAYPTPRCVR CVDK
Subjt:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK

Query:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC
        NQIW+K+KHYGVNAY+I+ DP DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWG+        
Subjt:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC

Query:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
                                        DGYFKIRRGTNECGIEEDVVAGLPS RNIAREAAI
Subjt:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

A0A6J1F2K3 cathepsin B-like protease 22.8e-14668.54Show/hide
Query:  IISLVFFPFLAISGRHLHLSSSGKQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL
        ++ LV F FL     H H      QVYAEEQVLKFK NADILQESIVRHVNEHP AGWKA MNP FSNYSVSQFKH+LGVKQTPEKDLKSTPVLSHPKSL
Subjt:  IISLVFFPFLAISGRHLHLSSSGKQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSL

Query:  N-----------PLFHFRGTVVLAGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDT
                    P     GT++  GH      F    +    +  +Y   +NI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDT
Subjt:  N-----------PLFHFRGTVVLAGHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDT

Query:  TGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDG
         GCSHPGCEPAY TP+CVR CVDKNQIW+KSKHYGVNAY+I+ DPYDIMAEVYKNGPVEV FTVYEDFAHYKSGVYK+I GD +GGHAVKLIGWGT+DDG
Subjt:  TGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDG

Query:  EDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI
        EDYWLLANQWN GWGD                                        DGYFKIRRGTNECGIEEDVVAGLPSPRNIAREA+I
Subjt:  EDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAREAAI

SwissProt top hitse value%identityAlignment
F4HVZ1 Cathepsin B-like protease 12.0e-10952.2Show/hide
Query:  LHLSSSGKQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFRGTV------
        L  SS   Q  A E + K K  + ILQ  IV+ VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P++ H  SL     F          
Subjt:  LHLSSSGKQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFRGTV------

Query:  ----VLAGHL----VLLNHFQIAFAFIL-----TWFANYTDI----------LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYF
            +L G++    +L +   + F F+L      W     +           LN++LS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+ECDPYF
Subjt:  ----VLAGHL----VLLNHFQIAFAFIL-----TWFANYTDI----------LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYF

Query:  DTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTD
        D TGCSHPGCEP YPTP+C R+CV +NQ+W +SKHYGV AY+I  DP DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITG  +GGHAVKLIGWGT+D
Subjt:  DTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTD

Query:  DGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNI
        DGEDYWLLANQWNR WGD                                        DGYFKIRRGTNECGIE+ VVAGLPS +N+
Subjt:  DGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNI

P07858 Cathepsin B2.8e-5040.98Show/hide
Query:  LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EECDPYFDTTGCSH------PGCEPAYPTPRCVRQC-VDKNQIWKKSKHYGV
        +++ +S  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C     TP+C + C    +  +K+ KHYG 
Subjt:  LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EECDPYFDTTGCSH------PGCEPAYPTPRCVRQC-VDKNQIWKKSKHYGV

Query:  NAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLE
        N+Y + +   DIMAE+YKNGPVE AF+VY DF  YKSGVY+++TG++MGGHA++++GWG  ++G  YWL+AN WN  WGD                    
Subjt:  NAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLE

Query:  VLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLP
                            +G+FKI RG + CGIE +VVAG+P
Subjt:  VLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLP

Q4R5M2 Cathepsin B1.6e-5040.98Show/hide
Query:  LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EECDPYFDTTGCSH------PGCEPAYPTPRCVRQC-VDKNQIWKKSKHYGV
        +++ +S  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY     C H      P C     TP+C + C    +  +K+ KHYG 
Subjt:  LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EECDPYFDTTGCSH------PGCEPAYPTPRCVRQC-VDKNQIWKKSKHYGV

Query:  NAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLE
        N+Y + +   DIMAE+YKNGPVE AF+VY DF  YKSGVY+++TG++MGGHA++++GWG  ++G  YWL+AN WN  WGD                    
Subjt:  NAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLE

Query:  VLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLP
                            +G+FKI RG + CGIE +VVAG+P
Subjt:  VLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLP

Q93VC9 Cathepsin B-like protease 28.4e-11657.02Show/hide
Query:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFR-----------GTVVLA
        Q  A E + K K  + ILQ  IV+ VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P++SH  SL     F            G ++  
Subjt:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFR-----------GTVVLA

Query:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK
        GH      F    +    +   Y   +N++LSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTEECDPYFD TGCSHPGCEPAYPTP+C R+CV  
Subjt:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK

Query:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC
        NQ+W++SKHYGV+AYK+RS P DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+DDGEDYWLLANQWNR WGD        
Subjt:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC

Query:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
                                        DGYFKIRRGTNECGIE  VVAGLPS RN+ +
Subjt:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR

Q94K85 Cathepsin B-like protease 39.6e-11254.35Show/hide
Query:  EQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLAGHL--
        E + K K ++ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SL            P     G ++  GH   
Subjt:  EQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLAGHL--

Query:  --------VLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVR
                 L + F I F             +NI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTEECDPYFD TGCSHPGCEPAYPTP+C R
Subjt:  --------VLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVR

Query:  QCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSF
        +CV  N++W +SKHY V+ Y ++S+P DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGD   
Subjt:  QCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSF

Query:  HDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
                                             DGYF IRRGTNECGIE++ VAGLPS +N+ R
Subjt:  HDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR

Arabidopsis top hitse value%identityAlignment
AT1G02300.1 Cysteine proteinases superfamily protein1.4e-11052.2Show/hide
Query:  LHLSSSGKQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFRGTV------
        L  SS   Q  A E + K K  + ILQ  IV+ VNE+P AGWKA  N RF+N +V++FK LLGV QTP+      P++ H  SL     F          
Subjt:  LHLSSSGKQVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFRGTV------

Query:  ----VLAGHL----VLLNHFQIAFAFIL-----TWFANYTDI----------LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYF
            +L G++    +L +   + F F+L      W     +           LN++LS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+ECDPYF
Subjt:  ----VLAGHL----VLLNHFQIAFAFIL-----TWFANYTDI----------LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYF

Query:  DTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTD
        D TGCSHPGCEP YPTP+C R+CV +NQ+W +SKHYGV AY+I  DP DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITG  +GGHAVKLIGWGT+D
Subjt:  DTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTD

Query:  DGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNI
        DGEDYWLLANQWNR WGD                                        DGYFKIRRGTNECGIE+ VVAGLPS +N+
Subjt:  DGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNI

AT1G02305.1 Cysteine proteinases superfamily protein6.0e-11757.02Show/hide
Query:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFR-----------GTVVLA
        Q  A E + K K  + ILQ  IV+ VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +    P++SH  SL     F            G ++  
Subjt:  QVYAEEQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFR-----------GTVVLA

Query:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK
        GH      F    +    +   Y   +N++LSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTEECDPYFD TGCSHPGCEPAYPTP+C R+CV  
Subjt:  GHLVLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDK

Query:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC
        NQ+W++SKHYGV+AYK+RS P DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+DDGEDYWLLANQWNR WGD        
Subjt:  NQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILC

Query:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
                                        DGYFKIRRGTNECGIE  VVAGLPS RN+ +
Subjt:  TFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR

AT1G29080.1 Papain family cysteine protease8.9e-1229.52Show/hide
Query:  ITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRH-GVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAE
        I+LS   LL C      +GC GG  ++A+ Y ++H G+ +E   PY    G       PA      +R                     + S+    + E
Subjt:  ITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRH-GVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAE

Query:  VYKNGPVEVAFTVYE-DFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGD
             PV VA    E  F HY  GVY          HAV L+G+GT+ +G  YWL  N W + WG+
Subjt:  VYKNGPVEVAFTVYE-DFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGD

AT4G01610.1 Cysteine proteinases superfamily protein6.8e-11354.35Show/hide
Query:  EQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLAGHL--
        E + K K ++ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SL            P     G ++  GH   
Subjt:  EQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN-----------PLFHFRGTVVLAGHL--

Query:  --------VLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVR
                 L + F I F             +NI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTEECDPYFD TGCSHPGCEPAYPTP+C R
Subjt:  --------VLLNHFQIAFAFILTWFANYTDILNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVR

Query:  QCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSF
        +CV  N++W +SKHY V+ Y ++S+P DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGD   
Subjt:  QCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSF

Query:  HDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
                                             DGYF IRRGTNECGIE++ VAGLPS +N+ R
Subjt:  HDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR

AT4G01610.2 Cysteine proteinases superfamily protein5.2e-11356.18Show/hide
Query:  EQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN--PLFHFRGT---VVLAGHLVLLNHFQI
        E + K K ++ ILQ+ IV+ VNE+P AGWKA +N RFSN +V++FK LLGVK TP+K     P++SH  SL     F  R         G+++ L H   
Subjt:  EQVLKFKFNADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLN--PLFHFRGT---VVLAGHLVLLNHFQI

Query:  AFAF-ILTWFANYTDI---LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKS
         +AF  +   ++   I   +NI+LSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTEECDPYFD TGCSHPGCEPAYPTP+C R+CV  N++W +S
Subjt:  AFAF-ILTWFANYTDI---LNITLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKS

Query:  KHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILCTFRFCCS
        KHY V+ Y ++S+P DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGD               
Subjt:  KHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILCTFRFCCS

Query:  KLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR
                                 DGYF IRRGTNECGIE++ VAGLPS +N+ R
Subjt:  KLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGLPSPRNIAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCTTACTGGTTTTGCAGGCACGTTTTTCCCGTCTTCTACAAATTCACTACTGGTGTCACGTGAAGGTCAGGAGCGAGAAAATTGTCAAGCCCGAGAAAGGGATGA
TCTTCGAAAACGATTGGTCGCCAATGGCAGCAACCTGCGTACAATCATGGATGACGTTAACTCGGACGTTGAATCGAAGGGAAACTTGGGATTGCAAGGAGATGGCATCA
TCTCACTTGTATTTTTCCCTTTCCTTGCTATTTCTGGCCGCCATCTGCACCTTTCATCATCAGGTAAGCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAATTTAAC
GCTGATATTCTTCAGGAGTCTATCGTTCGCCACGTAAATGAACACCCACAGGCTGGCTGGAAAGCTACCATGAACCCACGTTTTTCGAACTATTCCGTTAGCCAATTCAA
GCACCTGCTTGGTGTCAAACAAACTCCTGAGAAGGATTTAAAAAGTACCCCTGTTTTATCCCATCCCAAGTCTTTAAACCCGCTTTTTCATTTTAGGGGCACTGTGGTTC
TTGCTGGGCATTTGGTGCTGTTGAATCACTTTCAGATCGCTTTTGCATTCATTTTGACATGGTTTGCCAACTACACTGATATTCTTAACATTACTCTGTCTGTTAATGAT
CTTTTGGCATGCTGTGGCTTCATGTGTGGTGACGGCTGTGATGGGGGTTACCCAATTTCTGCATGGCGATACTTTGTTCGTCATGGAGTTGTTACTGAAGAGTGTGATCC
ATATTTTGACACTACTGGTTGTTCCCACCCTGGTTGTGAACCTGCATATCCGACTCCTAGATGTGTCAGGCAGTGTGTAGATAAGAACCAGATTTGGAAAAAATCAAAGC
ACTATGGTGTTAATGCTTATAAGATTAGAAGCGATCCCTATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTTGAAGTTGCCTTCACGGTGTATGAGGATTTTGCT
CACTATAAATCTGGGGTTTACAAATATATTACTGGTGATGTAATGGGAGGGCATGCTGTGAAGCTTATTGGATGGGGAACAACGGATGATGGAGAGGATTATTGGCTTTT
GGCAAATCAGTGGAACAGAGGCTGGGGTGATGTAAGTTTTCATGACATTTTGTGTACCTTTAGATTTTGTTGCTCAAAGTTACATTTGGAAGTGCTTTTAACAGGGCTGA
AAACACACGTTTACCATGGTCATAAGCACCTTGATCCAAAAGATGGCTACTTCAAGATAAGAAGAGGAACGAATGAGTGTGGCATTGAGGAAGATGTTGTTGCTGGTTTG
CCCTCACCTAGAAATATTGCTAGGGAGGCTGCCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCTTACTGGTTTTGCAGGCACGTTTTTCCCGTCTTCTACAAATTCACTACTGGTGTCACGTGAAGGTCAGGAGCGAGAAAATTGTCAAGCCCGAGAAAGGGATGA
TCTTCGAAAACGATTGGTCGCCAATGGCAGCAACCTGCGTACAATCATGGATGACGTTAACTCGGACGTTGAATCGAAGGGAAACTTGGGATTGCAAGGAGATGGCATCA
TCTCACTTGTATTTTTCCCTTTCCTTGCTATTTCTGGCCGCCATCTGCACCTTTCATCATCAGGTAAGCAGGTCTATGCGGAGGAACAAGTTCTAAAGTTCAAATTTAAC
GCTGATATTCTTCAGGAGTCTATCGTTCGCCACGTAAATGAACACCCACAGGCTGGCTGGAAAGCTACCATGAACCCACGTTTTTCGAACTATTCCGTTAGCCAATTCAA
GCACCTGCTTGGTGTCAAACAAACTCCTGAGAAGGATTTAAAAAGTACCCCTGTTTTATCCCATCCCAAGTCTTTAAACCCGCTTTTTCATTTTAGGGGCACTGTGGTTC
TTGCTGGGCATTTGGTGCTGTTGAATCACTTTCAGATCGCTTTTGCATTCATTTTGACATGGTTTGCCAACTACACTGATATTCTTAACATTACTCTGTCTGTTAATGAT
CTTTTGGCATGCTGTGGCTTCATGTGTGGTGACGGCTGTGATGGGGGTTACCCAATTTCTGCATGGCGATACTTTGTTCGTCATGGAGTTGTTACTGAAGAGTGTGATCC
ATATTTTGACACTACTGGTTGTTCCCACCCTGGTTGTGAACCTGCATATCCGACTCCTAGATGTGTCAGGCAGTGTGTAGATAAGAACCAGATTTGGAAAAAATCAAAGC
ACTATGGTGTTAATGCTTATAAGATTAGAAGCGATCCCTATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTTGAAGTTGCCTTCACGGTGTATGAGGATTTTGCT
CACTATAAATCTGGGGTTTACAAATATATTACTGGTGATGTAATGGGAGGGCATGCTGTGAAGCTTATTGGATGGGGAACAACGGATGATGGAGAGGATTATTGGCTTTT
GGCAAATCAGTGGAACAGAGGCTGGGGTGATGTAAGTTTTCATGACATTTTGTGTACCTTTAGATTTTGTTGCTCAAAGTTACATTTGGAAGTGCTTTTAACAGGGCTGA
AAACACACGTTTACCATGGTCATAAGCACCTTGATCCAAAAGATGGCTACTTCAAGATAAGAAGAGGAACGAATGAGTGTGGCATTGAGGAAGATGTTGTTGCTGGTTTG
CCCTCACCTAGAAATATTGCTAGGGAGGCTGCCATATGA
Protein sequenceShow/hide protein sequence
MILTGFAGTFFPSSTNSLLVSREGQERENCQARERDDLRKRLVANGSNLRTIMDDVNSDVESKGNLGLQGDGIISLVFFPFLAISGRHLHLSSSGKQVYAEEQVLKFKFN
ADILQESIVRHVNEHPQAGWKATMNPRFSNYSVSQFKHLLGVKQTPEKDLKSTPVLSHPKSLNPLFHFRGTVVLAGHLVLLNHFQIAFAFILTWFANYTDILNITLSVND
LLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEECDPYFDTTGCSHPGCEPAYPTPRCVRQCVDKNQIWKKSKHYGVNAYKIRSDPYDIMAEVYKNGPVEVAFTVYEDFA
HYKSGVYKYITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDVSFHDILCTFRFCCSKLHLEVLLTGLKTHVYHGHKHLDPKDGYFKIRRGTNECGIEEDVVAGL
PSPRNIAREAAI