; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0961 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0961
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptioncathepsin B-like protease 2
Genome locationMC05:10354945..10360045
RNA-Seq ExpressionMC05g0961
SyntenyMC05g0961
Gene Ontology termsGO:0050790 - regulation of catalytic activity (biological process)
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005615 - extracellular space (cellular component)
GO:0005764 - lysosome (cellular component)
GO:0004197 - cysteine-type endopeptidase activity (molecular function)
InterPro domainsIPR000169 - Cysteine peptidase, cysteine active site
IPR000668 - Peptidase C1A, papain C-terminal
IPR012599 - Peptidase C1A, propeptide
IPR025660 - Cysteine peptidase, histidine active site
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141146.1 cathepsin B-like protease 3 isoform X2 [Cucumis sativus]2.53e-24690.23Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
        MASS FY SLSLLF AAV +FHHQVYAEEQVLKFKL+ADILQESIV+ VNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE+DL+ST V+SHPKSLK
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK

Query:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
        LP +FDAREAWPQC SIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC

Query:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEP+YPTPRCV+ CVDKNQ+W ++KHYGV+AYRV +D  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY

Query:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        WLLANQWNRGWGDDGYFKI+RGTNECGIEEDVVAGLPS +NIA EAAI
Subjt:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

XP_008465336.1 PREDICTED: cathepsin B-like isoform X2 [Cucumis melo]2.42e-24489.37Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
        MASS+ Y SLSLLF AAV +FHHQV+AEEQVLKFKL+ADILQESIV+ VNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE+DL+ST V+SHPKSL+
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK

Query:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
        LP +FDAREAWPQC SIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC

Query:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEP+YPTPRCV+ CVDKNQ+W ++KHYGVNAYR+ KD  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY

Query:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        WLLANQWNRGWG+DGYFKI+RGTNECGIEEDVVAGLPS RNIA EAAI
Subjt:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

XP_011652326.1 cathepsin B-like protease 2 isoform X1 [Cucumis sativus]1.77e-24489.97Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQ-VYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSL
        MASS FY SLSLLF AAV +FHHQ VYAEEQVLKFKL+ADILQESIV+ VNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE+DL+ST V+SHPKSL
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQ-VYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSL

Query:  KLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTG
        KLP +FDAREAWPQC SIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TG
Subjt:  KLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTG

Query:  CSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED
        CSHPGCEP+YPTPRCV+ CVDKNQ+W ++KHYGV+AYRV +D  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED
Subjt:  CSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED

Query:  YWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        YWLLANQWNRGWGDDGYFKI+RGTNECGIEEDVVAGLPS +NIA EAAI
Subjt:  YWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

XP_022159267.1 cathepsin B-like protease 2 [Momordica charantia]1.77e-271100Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
        MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK

Query:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
        LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
Subjt:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC

Query:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY

Query:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
Subjt:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

XP_038903449.1 cathepsin B-like protease 2 isoform X2 [Benincasa hispida]2.08e-24590.23Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
        MASS  Y SLSLLF AAV +FHHQVYAEEQVL+FK NADILQESIV+ VNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE+DL+ST V+SHPKSLK
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK

Query:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
        LP +FDAREAWPQC SIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC

Query:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEP+YPTPRCV+ CVDKNQ+W ++KHYGVNAYR+  D  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY

Query:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        WLLANQWNRGWGDDGYFKI+RGTNECGIEEDVVAGLPSARNIA EAAI
Subjt:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

TrEMBL top hitse value%identityAlignment
A0A0A0LFN4 Pept_C1 domain-containing protein8.58e-24589.97Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQ-VYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSL
        MASS FY SLSLLF AAV +FHHQ VYAEEQVLKFKL+ADILQESIV+ VNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE+DL+ST V+SHPKSL
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQ-VYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSL

Query:  KLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTG
        KLP +FDAREAWPQC SIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TG
Subjt:  KLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTG

Query:  CSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED
        CSHPGCEP+YPTPRCV+ CVDKNQ+W ++KHYGV+AYRV +D  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED
Subjt:  CSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED

Query:  YWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        YWLLANQWNRGWGDDGYFKI+RGTNECGIEEDVVAGLPS +NIA EAAI
Subjt:  YWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

A0A1S3CNJ5 cathepsin B-like isoform X18.22e-24389.11Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQ-VYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSL
        MASS+ Y SLSLLF AAV +FHHQ V+AEEQVLKFKL+ADILQESIV+ VNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE+DL+ST V+SHPKSL
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQ-VYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSL

Query:  KLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTG
        +LP +FDAREAWPQC SIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TG
Subjt:  KLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTG

Query:  CSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED
        CSHPGCEP+YPTPRCV+ CVDKNQ+W ++KHYGVNAYR+ KD  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGED
Subjt:  CSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGED

Query:  YWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        YWLLANQWNRGWG+DGYFKI+RGTNECGIEEDVVAGLPS RNIA EAAI
Subjt:  YWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

A0A1S3CNM3 cathepsin B-like isoform X21.17e-24489.37Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
        MASS+ Y SLSLLF AAV +FHHQV+AEEQVLKFKL+ADILQESIV+ VNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE+DL+ST V+SHPKSL+
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK

Query:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
        LP +FDAREAWPQC SIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC

Query:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEP+YPTPRCV+ CVDKNQ+W ++KHYGVNAYR+ KD  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY

Query:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        WLLANQWNRGWG+DGYFKI+RGTNECGIEEDVVAGLPS RNIA EAAI
Subjt:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

A0A5A7U7U4 Cathepsin B-like isoform X21.17e-24489.37Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
        MASS+ Y SLSLLF AAV +FHHQV+AEEQVLKFKL+ADILQESIV+ VNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPE+DL+ST V+SHPKSL+
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK

Query:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
        LP +FDAREAWPQC SIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNI+LSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFD TGC
Subjt:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC

Query:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEP+YPTPRCV+ CVDKNQ+W ++KHYGVNAYR+ KD  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYK+ITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY

Query:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        WLLANQWNRGWG+DGYFKI+RGTNECGIEEDVVAGLPS RNIA EAAI
Subjt:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

A0A6J1DZC8 cathepsin B-like protease 28.55e-272100Show/hide
Query:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
        MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK
Subjt:  MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLK

Query:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
        LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC
Subjt:  LPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGC

Query:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
        SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY
Subjt:  SHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDY

Query:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
        WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI
Subjt:  WLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNIAWEAAI

SwissProt top hitse value%identityAlignment
F4HVZ1 Cathepsin B-like protease 12.2e-14065.72Show/hide
Query:  LSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDARE
        L+ +F    SSF+ Q  A E + K KL + ILQ  IVK+VNE+P AGWKA  N RF+N +V++FK LLGV QTP+       +V H  SLKLP  FDAR 
Subjt:  LSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDARE

Query:  AWPQCSSI-----GTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE
        AW  C+SI     G IL+                GHCGSCWAFGAVESLSDRFCI +++N+SLS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+
Subjt:  AWPQCSSI-----GTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE

Query:  QCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI
        +CDPYFDNTGCSHPGCEP+YPTP+C +KCV +NQLW  SKHYGV AYR+  D  DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK+ITG  +GGHAVKLI
Subjt:  QCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI

Query:  GWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNI
        GWGT+DDGEDYWLLANQWNR WGDDGYFKI+RGTNECGIE+ VVAGLPS +N+
Subjt:  GWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNI

Q4R5M2 Cathepsin B8.0e-8248.73Show/hide
Query:  LQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVE
        L + +V  VN+     W+A  N  F N  VS  K L    LG  + P+        V   + LKLP +FDARE WPQC +I  I DQG CGSCWAFGAVE
Subjt:  LQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVE

Query:  SLSDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYF-----DNTGCSHPGCEPSYPTPRCVKKC-VDK
        ++SDR CIH + ++S+ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY       +   S P C     TP+C K C    
Subjt:  SLSDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYF-----DNTGCSHPGCEPSYPTPRCVKKC-VDK

Query:  NQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKR
        +  + + KHYG N+Y V+    DIMAE+YKNGPVE AF+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YWL+AN WN  WGD+G+FKI R
Subjt:  NQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKR

Query:  GTNECGIEEDVVAGLP
        G + CGIE +VVAG+P
Subjt:  GTNECGIEEDVVAGLP

Q5R6D1 Cathepsin B1.4e-8148.73Show/hide
Query:  LQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVE
        L + +V  VN+     W+A  N  F N  VS  K L    LG  + P+        V   + LKLP +FDARE WPQC +I  I DQG CGSCWAFGAVE
Subjt:  LQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYL----LGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVE

Query:  SLSDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYF-----DNTGCSHPGCEPSYPTPRCVKKC-VDK
        ++SDR CIH + ++S+ V+  DLL CCG MCGDGC+GGYP  AW ++ R G+V+         C PY       +   S P C     TP+C K C    
Subjt:  SLSDRFCIHFDMNISLSVN--DLLACCGFMCGDGCDGGYPISAWRYFVRHGVVT-------EQCDPYF-----DNTGCSHPGCEPSYPTPRCVKKC-VDK

Query:  NQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKR
        +  + + KHYG N+Y V+    DIMAE+YKNGPVE AF+VY DF  YKSGVY+H+TG++MGGHA++++GWG  ++G  YWL+AN WN  WGD+G+FKI R
Subjt:  NQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKR

Query:  GTNECGIEEDVVAGLP
        G + CGIE +VVAG+P
Subjt:  GTNECGIEEDVVAGLP

Q93VC9 Cathepsin B-like protease 23.6e-15173.02Show/hide
Query:  SSRFYFSLSLLFFAAVSSFH-HQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKL
        S+  +F L LL    +SSF+  Q  A E + K KL + ILQ  IVK+VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +     +VSH  SLKL
Subjt:  SSRFYFSLSLLFFAAVSSFH-HQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKL

Query:  PTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCS
        P  FDAR AW QC+SIG ILDQGHCGSCWAFGAVESLSDRFCI ++MN+SLSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTE+CDPYFDNTGCS
Subjt:  PTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCS

Query:  HPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYW
        HPGCEP+YPTP+C +KCV  NQLW  SKHYGV+AY+V     DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+DDGEDYW
Subjt:  HPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYW

Query:  LLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNI
        LLANQWNR WGDDGYFKI+RGTNECGIE  VVAGLPS RN+
Subjt:  LLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNI

Q94K85 Cathepsin B-like protease 35.8e-14974.84Show/hide
Query:  EQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGS
        E + K KL++ ILQ+ IVK+VNE+P AGWKA +N RFSN +V++FK LLGVK TP++      +VSH  SLKLP  FDAR AWPQC+SIG ILDQGHCGS
Subjt:  EQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGS

Query:  CWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRS
        CWAFGAVESLSDRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFDNTGCSHPGCEP+YPTP+C +KCV  N+LWS S
Subjt:  CWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRS

Query:  KHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGI
        KHY V+ Y V  +  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGDDGYF I+RGTNECGI
Subjt:  KHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGI

Query:  EEDVVAGLPSARNI
        E++ VAGLPS++N+
Subjt:  EEDVVAGLPSARNI

Arabidopsis top hitse value%identityAlignment
AT1G02300.1 Cysteine proteinases superfamily protein1.6e-14165.72Show/hide
Query:  LSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDARE
        L+ +F    SSF+ Q  A E + K KL + ILQ  IVK+VNE+P AGWKA  N RF+N +V++FK LLGV QTP+       +V H  SLKLP  FDAR 
Subjt:  LSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDARE

Query:  AWPQCSSI-----GTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE
        AW  C+SI     G IL+                GHCGSCWAFGAVESLSDRFCI +++N+SLS ND++ACCG +CG GC+GG+P+ AW YF  HGVVT+
Subjt:  AWPQCSSI-----GTILDQ---------------GHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTE

Query:  QCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI
        +CDPYFDNTGCSHPGCEP+YPTP+C +KCV +NQLW  SKHYGV AYR+  D  DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYK+ITG  +GGHAVKLI
Subjt:  QCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLI

Query:  GWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNI
        GWGT+DDGEDYWLLANQWNR WGDDGYFKI+RGTNECGIE+ VVAGLPS +N+
Subjt:  GWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNI

AT1G02305.1 Cysteine proteinases superfamily protein2.6e-15273.02Show/hide
Query:  SSRFYFSLSLLFFAAVSSFH-HQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKL
        S+  +F L LL    +SSF+  Q  A E + K KL + ILQ  IVK+VNE+P AGWKA+ N RF+N +V++FK LLGVK TP+ +     +VSH  SLKL
Subjt:  SSRFYFSLSLLFFAAVSSFH-HQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKL

Query:  PTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCS
        P  FDAR AW QC+SIG ILDQGHCGSCWAFGAVESLSDRFCI ++MN+SLSVNDLLACCGF+CG GC+GGYPI+AWRYF  HGVVTE+CDPYFDNTGCS
Subjt:  PTNFDAREAWPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCS

Query:  HPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYW
        HPGCEP+YPTP+C +KCV  NQLW  SKHYGV+AY+V     DIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+DDGEDYW
Subjt:  HPGCEPSYPTPRCVKKCVDKNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYW

Query:  LLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNI
        LLANQWNR WGDDGYFKI+RGTNECGIE  VVAGLPS RN+
Subjt:  LLANQWNRGWGDDGYFKIKRGTNECGIEEDVVAGLPSARNI

AT3G45310.1 Cysteine proteinases superfamily protein7.3e-3030.62Show/hide
Query:  FKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQF-KYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGSCWAF
        FK N D+++ +  K ++      +K ++N +F++ +  +F +Y LG  Q     L+ +H ++      +P   D    W +   +  + +QGHCGSCW F
Subjt:  FKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQF-KYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGSCWAF

Query:  GAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHY
            +L   +   F   ISLS   L+ C G     GC GG P  A+ Y     G+ TE+  PY    G    GC+ S                   +K+ 
Subjt:  GAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYF-VRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRSKHY

Query:  GV---NAYRVTKDTYD-IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMG------GHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKR
        GV   ++  +T    D +   V    PV VAF V  +F  YK GV+   T +  G       HAV  +G+G  DD   YWL+ N W   WGD+GYFK++ 
Subjt:  GV---NAYRVTKDTYD-IMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMG------GHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKR

Query:  GTNECGI
        G N CG+
Subjt:  GTNECGI

AT4G01610.1 Cysteine proteinases superfamily protein4.1e-15074.84Show/hide
Query:  EQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGS
        E + K KL++ ILQ+ IVK+VNE+P AGWKA +N RFSN +V++FK LLGVK TP++      +VSH  SLKLP  FDAR AWPQC+SIG ILDQGHCGS
Subjt:  EQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGS

Query:  CWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRS
        CWAFGAVESLSDRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFDNTGCSHPGCEP+YPTP+C +KCV  N+LWS S
Subjt:  CWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRS

Query:  KHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGI
        KHY V+ Y V  +  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGDDGYF I+RGTNECGI
Subjt:  KHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGI

Query:  EEDVVAGLPSARNI
        E++ VAGLPS++N+
Subjt:  EEDVVAGLPSARNI

AT4G01610.2 Cysteine proteinases superfamily protein1.7e-14874.2Show/hide
Query:  EQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGS
        E + K KL++ ILQ+ IVK+VNE+P AGWKA +N RFSN +V++FK LLGVK TP++      +VSH  SLKLP  FDAR AWPQC+SIG IL  GHCGS
Subjt:  EQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREAWPQCSSIGTILDQGHCGS

Query:  CWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRS
        CWAFGAVESLSDRFCI F MNISLSVNDLLACCGF CGDGCDGGYPI+AW+YF   GVVTE+CDPYFDNTGCSHPGCEP+YPTP+C +KCV  N+LWS S
Subjt:  CWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVDKNQLWSRS

Query:  KHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGI
        KHY V+ Y V  +  DIMAEVYKNGPVEV+FTVYEDFAHYKSGVYKHITG  +GGHAVKLIGWGT+ +GEDYWL+ANQWNRGWGDDGYF I+RGTNECGI
Subjt:  KHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGI

Query:  EEDVVAGLPSARNI
        E++ VAGLPS++N+
Subjt:  EEDVVAGLPSARNI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCATCTCGCTTCTATTTTTCCCTCTCCTTGCTATTTTTCGCTGCCGTCTCCTCCTTCCATCATCAGGTTTATGCAGAGGAACAAGTTCTAAAGTTCAAACTCAA
TGCTGATATTCTTCAGGAGTCAATCGTTAAGCAGGTCAACGAACACCCACAGGCTGGTTGGAAAGCTACCATGAATCCACGTTTTTCGAACTATTCTGTTAGCCAATTCA
AGTACCTGCTTGGTGTCAAACAAACTCCTGAAGAGGATTTAAGAAGTACTCATGTTGTATCCCATCCCAAGTCGTTAAAGTTGCCAACAAACTTTGATGCAAGAGAAGCT
TGGCCTCAGTGTAGCTCCATTGGAACAATTCTAGATCAGGGGCACTGTGGCTCTTGCTGGGCATTTGGTGCTGTTGAATCACTTTCAGATCGCTTTTGCATTCATTTTGA
CATGAACATTTCTCTGTCTGTTAATGATCTTTTGGCATGCTGCGGCTTCATGTGTGGTGACGGCTGTGATGGTGGTTACCCAATTTCTGCATGGAGATACTTTGTTCGCC
ATGGAGTTGTTACTGAGCAGTGTGATCCATATTTTGACAATACTGGTTGTTCCCACCCTGGTTGTGAACCTTCATATCCTACTCCCAGATGTGTCAAGAAGTGTGTAGAT
AAGAACCAGCTTTGGAGTAGGTCAAAGCACTATGGTGTCAATGCTTACAGGGTGACAAAGGATACCTATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTCGAAGT
TGCCTTCACAGTGTACGAGGATTTTGCTCACTATAAATCTGGGGTATACAAACATATCACTGGTGATGTAATGGGAGGGCATGCTGTGAAGCTTATTGGATGGGGAACAA
CGGATGATGGAGAGGATTATTGGCTTTTGGCAAATCAGTGGAACAGAGGCTGGGGCGATGATGGCTACTTCAAGATAAAAAGAGGAACAAATGAGTGTGGAATTGAGGAA
GACGTTGTTGCTGGTTTGCCCTCAGCCAGAAATATTGCATGGGAGGCTGCCATCTGA
mRNA sequenceShow/hide mRNA sequence
CGACGATAAAAAAAAATCATAAGTTTTCAAAATAGAAAGTAATAAACAAAAAAATAAAATAGCTATTAAACAAATGGCTAAAATAGTCAAAACGATATTTCTCAAAGAAA
AAATAAAATAAAAAAGAAAAAGTTAATATCTTTATGGACATGTTTTTCATAAAGAGCAGAGCTAGACAAAATTGATTAGCCCGAGAAAGGGCTTATCTCGAAAGACGATT
GGTGACCAATGGCAGCATTGCGTATGATCATGGAAGATGATGTCAAGTTGGACCTTGAAGCAAAGCCGAAGTTTGGATTCCTCCCAATTTCCAAATCTGAACTCTCCATT
TTTTTCTCGGAACTGATTTCTTCCCATTTTTCCACATCCCCTCTCGCTTCGCCTTTCTCTCTGCTTCTGCCGTCCAAACCAGCAAGGAAATGGCGTCATCTCGCTTCTAT
TTTTCCCTCTCCTTGCTATTTTTCGCTGCCGTCTCCTCCTTCCATCATCAGGTTTATGCAGAGGAACAAGTTCTAAAGTTCAAACTCAATGCTGATATTCTTCAGGAGTC
AATCGTTAAGCAGGTCAACGAACACCCACAGGCTGGTTGGAAAGCTACCATGAATCCACGTTTTTCGAACTATTCTGTTAGCCAATTCAAGTACCTGCTTGGTGTCAAAC
AAACTCCTGAAGAGGATTTAAGAAGTACTCATGTTGTATCCCATCCCAAGTCGTTAAAGTTGCCAACAAACTTTGATGCAAGAGAAGCTTGGCCTCAGTGTAGCTCCATT
GGAACAATTCTAGATCAGGGGCACTGTGGCTCTTGCTGGGCATTTGGTGCTGTTGAATCACTTTCAGATCGCTTTTGCATTCATTTTGACATGAACATTTCTCTGTCTGT
TAATGATCTTTTGGCATGCTGCGGCTTCATGTGTGGTGACGGCTGTGATGGTGGTTACCCAATTTCTGCATGGAGATACTTTGTTCGCCATGGAGTTGTTACTGAGCAGT
GTGATCCATATTTTGACAATACTGGTTGTTCCCACCCTGGTTGTGAACCTTCATATCCTACTCCCAGATGTGTCAAGAAGTGTGTAGATAAGAACCAGCTTTGGAGTAGG
TCAAAGCACTATGGTGTCAATGCTTACAGGGTGACAAAGGATACCTATGATATCATGGCAGAAGTTTATAAGAATGGACCAGTCGAAGTTGCCTTCACAGTGTACGAGGA
TTTTGCTCACTATAAATCTGGGGTATACAAACATATCACTGGTGATGTAATGGGAGGGCATGCTGTGAAGCTTATTGGATGGGGAACAACGGATGATGGAGAGGATTATT
GGCTTTTGGCAAATCAGTGGAACAGAGGCTGGGGCGATGATGGCTACTTCAAGATAAAAAGAGGAACAAATGAGTGTGGAATTGAGGAAGACGTTGTTGCTGGTTTGCCC
TCAGCCAGAAATATTGCATGGGAGGCTGCCATCTGAGCCAGATTGTTGCTGTTTCACAAGCAAGTATTTGCCAAACCAAAGATATAAAATAGTGAGTATTTGTGTTACTT
TTGGCTTTGGATCAGAACTATTTATGCATATGAAGTTGATAAATATGGCTAAATGCTTTGCTGGATTTGTTAAGTATTTGTATTGCATCTTCAACCAGTAGTGGAAACAT
GATGTTTGGAAAGAGGTATTAATCAAATAATGTGTTAGTATTCTTGCAGAATAAAAGCTTTTTTATTTTTTATTTTTTTATTATTATTATTATTTTTTTAGAAAAG
Protein sequenceShow/hide protein sequence
MASSRFYFSLSLLFFAAVSSFHHQVYAEEQVLKFKLNADILQESIVKQVNEHPQAGWKATMNPRFSNYSVSQFKYLLGVKQTPEEDLRSTHVVSHPKSLKLPTNFDAREA
WPQCSSIGTILDQGHCGSCWAFGAVESLSDRFCIHFDMNISLSVNDLLACCGFMCGDGCDGGYPISAWRYFVRHGVVTEQCDPYFDNTGCSHPGCEPSYPTPRCVKKCVD
KNQLWSRSKHYGVNAYRVTKDTYDIMAEVYKNGPVEVAFTVYEDFAHYKSGVYKHITGDVMGGHAVKLIGWGTTDDGEDYWLLANQWNRGWGDDGYFKIKRGTNECGIEE
DVVAGLPSARNIAWEAAI